Translating Unknown Words by Analogical Learning, by Philippe Langlais

  1. Translating Unknown Words by Analogical Learning
     Sections: Motivations, Analogical Learning, Experiments, Discussion
     Philippe Langlais and Alexandre Patry
     Dept. I.R.O., Université de Montréal, Québec, Canada
     {felipe,patryale}@iro.umontreal.ca
     EMNLP-CoNLL 2007, Prague, June 30th, 2007

  2. The Problem We Address: Translating Unknown Words
     L, the seed lexicon:
       lucide    ↔ clear-thinking, blind, refocussed, lucid, far-sighted
       lucidité  ↔ well-informed, meticulously, moderation, penetrating, lucidity, clear-headed, clear-sighted
       futilité  ↔ ?
       futile    ↔ meaningless, futile
       faillites ↔ bankruptcies, bankruptcy
       faillite  ↔ collapse, bust, insolvency, ruin, complaining, wall, business, bankruptcy, bankrupt

  3. Existing Solutions to Translating Unknown Entities
     • Translating proper names [Chen et al., 1998] and named entities [Al-Onaizan and Knight, 2001]
       → analog is complementary to those approaches
     • Paraphrasing unknown entities by making use of massive homogeneous bitexts [Callison-Burch et al., 2006; Cohn and Lapata, 2007]
       → analog requires only a seed lexicon (+ target-language lexicon)
     • Identifying translations in comparable corpora [Fung and Yee, 1998; Rapp, 1999; Tanaka, 1999]
       → analog requires only a seed lexicon (+ target-language lexicon)

  4. Other (More) Popular Solutions to Translating Unknown Words
     • Removing unknown (source) words before translating
     • Leaving unknown words untranslated
     Our approach: translating unknown words by analogy

  5. Overview
     Motivations
     Analogical Learning
       Introduction
       Principle
       Bilingual Lexicon Enrichment
       Practical Issues
     Experiments
       Protocol
       Results
     Discussion

  6. Proportional Analogy
     [ A : B = C : D ], read "A is to B as C is to D"
     (See for instance [Chiu et al., 2007])

  7. Formal Analogies (examples taken from (Lepage, 1998))
     • Formal analogical proportions:
         honor : honōrem = ōrātor : ōrātōrem
         reader : unreadable = doer : undoable
         répression : répressionnaire = réaction : réactionnaire
         tinggal : ketinggalan = duduk : kedudukan
     • Formal analogical equation:
         [ lang : längste = scharf : ? ]

  8. Formal Analogies (examples taken from (Lepage, 1998))
     • Formal analogical proportions:
         honor : honōrem = ōrātor : ōrātōrem
         reader : unreadable = doer : undoable
         répression : répressionnaire = réaction : réactionnaire
         tinggal : ketinggalan = duduk : kedudukan
     • Formal analogical equation, solved:
         [ lang : längste = scharf : schärfste ]
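The slides do not say how an analogical equation such as [ lang : längste = scharf : ? ] is solved. The sketch below assumes the shuffle-and-complement characterisation of formal analogies commonly attributed to Yvon and Stroppa: a form D solves [ A : B = C : ? ] when some interleaving of B and C yields D once the symbols of A have been removed in order. The name `solve_analogy` is hypothetical, and the function enumerates every solution, which can be far more than the few forms displayed on the slides.

```python
from functools import lru_cache


def solve_analogy(a, b, c):
    """Return the set of all d such that [a : b = c : d] under the
    shuffle-and-complement reading of formal analogy (an assumption)."""

    @lru_cache(maxsize=None)
    def rec(i, j, k):
        # i, j, k are the positions already read in a, b and c.
        if j == len(b) and k == len(c):
            # b and c are exhausted: valid only if a was fully cancelled.
            return frozenset([""]) if i == len(a) else frozenset()
        out = set()
        # Read the next symbol of the interleaving from b, then from c.
        for src, pos, nxt in ((b, j, (j + 1, k)), (c, k, (j, k + 1))):
            if pos == len(src):
                continue
            ch = src[pos]
            # Option 1: the symbol cancels the next symbol of a.
            if i < len(a) and a[i] == ch:
                out |= rec(i + 1, *nxt)
            # Option 2: the symbol is emitted into the solution.
            out |= {ch + rest for rest in rec(i, *nxt)}
        return frozenset(out)

    return set(rec(0, 0, 0))


solutions = solve_analogy("lang", "längste", "scharf")
print("schärfste" in solutions)
```

Every solution has length |B| + |C| - |A|, so a form such as "futility" (8 symbols) can never solve [ lucid : lucidity = futile : ? ], whose solutions all have 9 symbols.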

  9. Principle (Stroppa & Yvon, 2005)
     $L$: training corpus
     $I$: input space (defined by a set of features over $L$)
     $O$: output space (defined by a set of features over $L$)
     $X$: incomplete observation (the input features are known)
     1. Build $E_I(X) = \{ (A, B, C) \in L^3 \mid [ I(A) : I(B) = I(C) : I(X) ] \}$
     2. Build $E_O(X) = \{ Y \mid [ O(A) : O(B) = O(C) : Y ],\ \forall (A, B, C) \in E_I(X) \}$
     3. Choose $O(X)$ among $E_O(X)$,
     where $I(X)$ and $O(X)$ stand for the projection of $X$ into the input and output space respectively.

  10. Bilingual Lexicon Enrichment, Step 1: computing $E_I$(futilité)
      futilité ?

  11. Bilingual Lexicon Enrichment, Step 1: computing $E_I$(futilité)
      Words in the figure: faillites, faillite, futilités, futilité ?
      [ faillites : faillite = futilités : futilité ]

  12. Bilingual Lexicon Enrichment, Step 1: computing $E_I$(futilité)
      Words in the figure: faillites, futile, faillite, futilité ?, futilités, lucide, lucidité
      [ lucide : lucidité = futile : futilité ]

  13. Bilingual Lexicon Enrichment, Step 1: computing $E_I$(futilité)
      Words in the figure: faillites, futile, faillite, hostilités, futilité ?, hostiles, futilités, lucide, lucidité
      [ hostiles : hostilités = futile : futilité ]

  14. Bilingual Lexicon Enrichment, Step 1: computing $E_I$(futilité)
      Words in the figure: faillites, futile, réalités, réalité, faillite, hostilités, futilités, futilité ?, hostiles, lucide, lucidité
      [ réalités : réalité = futilités : futilité ]

  15. Bilingual Lexicon Enrichment, Step 1: computing $E_I$(futilité)
      Words in the figure: faillites, brutale, futile, réalités, brutalité, réalité, mutilé, faillite, bestialité, hostilités, futilités, futilité ?, natalité, facile, facilités, hostiles, bestiale, timide, natale, lucide, maille, timidité, lucidité
      2944 analogical equations, 84 valid stems

  16. Bilingual Lexicon Enrichment, Step 1: computing $E_I$(futilité)
      ⟨activités, activité, futilités⟩   ⟨fatale, futile, fatalité⟩   ⟨timide, timidité, futile⟩
      ⟨faciles, facilités, futile⟩   ⟨cupide, cupidité, futile⟩   ⟨utilisés, futilités, utilisé⟩
      ⟨humide, humidité, futile⟩   ⟨brutalités, brutalité, futilités⟩   ⟨multipliés, multiplié, futilités⟩
      ⟨qualités, qualité, futilités⟩   ⟨totale, totalité, futile⟩   ⟨utilisés, utilisé, futilités⟩
      ⟨mutilés, mutilé, futilités⟩   ⟨félicités, futilités, félicité⟩   ⟨active, activité, futile⟩
      ⟨mature, maturité, futile⟩   ⟨unités, futilités, unité⟩   ⟨habilités, habilité, futilités⟩
      ⟨fragile, fragilité, futile⟩   ⟨subtile, subtilité, futile⟩   ⟨mutilés, futilités, mutilé⟩
      ⟨autorités, autorité, futilités⟩   ⟨vitale, vitalité, futile⟩   ⟨autorisés, autorisé, futilités⟩
      ⟨maille, faillite, mutilé⟩   ⟨mute, mutilé, futile⟩   ⟨facultés, faculté, futilités⟩
      ⟨félicités, félicité, futilités⟩   ⟨utile, futile, utilité⟩   ⟨facilités, facilité, futilités⟩
      ⟨rurale, ruralité, futile⟩   ⟨brutalités, futilités, brutalité⟩   ⟨finales, finalités, futile⟩
      ⟨subtiles, subtilités, futile⟩   ⟨spatiale, spatialité, futile⟩   ⟨visités, visité, futilités⟩
      ⟨réalités, réalité, futilités⟩   ⟨pénale, pénalité, futile⟩   ⟨brutales, brutalités, futile⟩
      ⟨habilités, futilités, habilité⟩   ⟨hostilités, futilités, hostilité⟩   (+ 42 others ...)
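Step 1 has to look for triples (A, B, C) of lexicon words that stand in analogy with the unknown word. One cheap way to prune that search, sketched below, relies on a known necessary condition for a formal analogy [ A : B = C : D ]: every symbol must occur as many times in A and D together as in B and C together. The slides do not state whether the authors use this filter, and the names `passes_count_check`, `candidate_triples` and `SOURCE_WORDS` are hypothetical. Triples that survive still need to be verified with a full formal-analogy test.

```python
from collections import Counter
from itertools import permutations


def passes_count_check(a, b, c, d):
    """Necessary (not sufficient) condition for [a : b = c : d]:
    symbol counts of a + d must equal those of b + c."""
    return Counter(a) + Counter(d) == Counter(b) + Counter(c)


def candidate_triples(source_words, unknown):
    """Ordered triples of distinct lexicon words that survive the count check."""
    return [
        (a, b, c)
        for a, b, c in permutations(source_words, 3)
        if passes_count_check(a, b, c, unknown)
    ]


# Toy source side of the seed lexicon, using words shown on the slides.
SOURCE_WORDS = [
    "faillites", "faillite", "futilités", "lucide",
    "lucidité", "futile", "hostiles", "hostilités",
]

triples = candidate_triples(SOURCE_WORDS, "futilité")
for triple in triples:
    print(triple)
```

Because the condition is symmetric in B and C, a triple and its variant with B and C swapped pass together, which matches the paired entries visible on slide 16.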

  17. Bilingual Lexicon Enrichment, Step 2: computing $E_O$(futilité)
      [ faillites : faillite = futilités : futilité ]
        faillites ↔ bankruptcies, bankruptcy
        faillite  ↔ collapse, bust, insolvency, ruin, complaining, bankrupt, business, bankruptcy, wall
        futilités ↔ trivialities
      [ bankruptcies : bankruptcy = trivialities : ? ]  →  { triviality }

  18. Bilingual Lexicon Enrichment, Step 2: computing $E_O$(futilité)
      [ lucide : lucidité = futile : futilité ]
        lucide   ↔ clear-thinking, blind, refocussed, lucid, far-sighted
        lucidité ↔ well-informed, meticulously, moderation, penetrating, lucidity, clear-headed, clear-sighted
        futile   ↔ meaningless, futile
      [ lucid : lucidity = meaningless : ? ]  →  { meaninglessity }
      [ lucid : lucidity = futile : ? ]  →  { futiityle, futileity, futilitye }
      [ far-sighted : clear-sighted = futile : ? ]  →  { fucleile, cleutile, clefuile, cluteile, culetile, cluetile, cfuleile, clfueile, cutleile }
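The projection shown on slides 17 and 18 can be sketched as follows: each word of a source triple is replaced by each of its translations in turn, and every resulting target equation is solved. To keep the example short, the solver here is a deliberately restricted stand-in that only handles suffix alternations (A = p+s1, B = p+s2, C = q+s1 gives D = q+s2); it is not the solver used in the slides, which also produces forms such as "futiityle". The names `suffix_solver`, `project` and `SEED` are hypothetical.

```python
from itertools import product


def suffix_solver(a, b, c):
    """Restricted stand-in solver: strip the common prefix of a and b,
    then swap the remaining suffix of a for that of b at the end of c."""
    n = 0
    while n < min(len(a), len(b)) and a[n] == b[n]:
        n += 1
    s1, s2 = a[n:], b[n:]
    if not c.endswith(s1):
        return set()
    return {c[:len(c) - len(s1)] + s2}


def project(triple, lexicon, solve=suffix_solver):
    """Solve one target equation per combination of translations of (a, b, c)."""
    a, b, c = triple
    generated = []
    combos = product(lexicon.get(a, ()), lexicon.get(b, ()), lexicon.get(c, ()))
    for ta, tb, tc in combos:
        generated.extend(solve(ta, tb, tc))
    return generated


# Seed lexicon entries copied from the slides.
SEED = {
    "faillites": ["bankruptcies", "bankruptcy"],
    "faillite": ["collapse", "bust", "insolvency", "ruin", "complaining",
                 "bankrupt", "business", "bankruptcy", "wall"],
    "futilités": ["trivialities"],
    "lucide": ["clear-thinking", "blind", "refocussed", "lucid", "far-sighted"],
    "lucidité": ["well-informed", "meticulously", "moderation", "penetrating",
                 "lucidity", "clear-headed", "clear-sighted"],
    "futile": ["meaningless", "futile"],
}

print(project(("faillites", "faillite", "futilités"), SEED))
print(project(("lucide", "lucidité", "futile"), SEED))
```

Because every translation pair is tried, noisy lexicon entries produce noisy candidates; the trivial equation [ bankruptcy : bankruptcy = trivialities : ? ] even returns "trivialities" itself.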

  19. Bilingual Lexicon Enrichment, Step 3: choosing from $E_O$(futilité)
      1124 distinct forms generated, 120 seen at least twice:
        (trivialities,27) (triviality,14) (futile,9) (futilitye,9) (meaningless,9) (meaninglityess,8)
        (trivialitie,6) (futility,4) (meaninglityss,4) (futilities,3) (meaninglesit,3)
        (meaninglitiesss,3) (meaningiltyss,2) (futiilty,2) (applicatitrivialitis,2) (futiitiles,2) (futiitly,2)
        (futiityl,2) (futileity,2) (futilessne,2) (high-triviality,2) (ltbbyirivialitig,2)
        (meaniitnglyss,2) (superfluous,2) (meaningitilesss,2) (meaningitlyss,2) (meaninglesity,2) (meaninglessity,2)
        (meaninglityes,2) (meaninigltyss,2) (meaninitglyss,2) (meaninitiglesss,2) (mfaninglecilits,2) ...
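The slide lists generated forms with the number of times each was produced, but it does not say how the final translation is chosen. A minimal sketch of one plausible selection rule follows: rank forms by frequency and, optionally, keep only those found in a target-language vocabulary, which slide 3 mentions as an extra resource. This rule is an assumption, not the authors' stated method, and `rank_candidates`, `SLIDE_COUNTS` and `TARGET_VOCAB` are hypothetical names; the vocabulary contents are invented for illustration.

```python
from collections import Counter


def rank_candidates(forms, target_vocab=None, min_count=1):
    """Rank generated forms by frequency, optionally filtering on a vocabulary."""
    counts = Counter(forms)
    return [
        (form, n)
        for form, n in counts.most_common()
        if n >= min_count and (target_vocab is None or form in target_vocab)
    ]


# Frequencies copied from a few entries on the slide.
SLIDE_COUNTS = {
    "trivialities": 27, "triviality": 14, "futile": 9,
    "futilitye": 9, "meaningless": 9, "futility": 4,
}
# Expand the counts back into a flat list of generated forms.
generated = list(Counter(SLIDE_COUNTS).elements())

# Hypothetical target-language vocabulary, for illustration only.
TARGET_VOCAB = {"trivialities", "triviality", "futile", "meaningless", "futility"}

ranked = rank_candidates(generated, target_vocab=TARGET_VOCAB, min_count=2)
for form, n in ranked:
    print(form, n)
```

With these counts, frequency alone ranks "trivialities" and "triviality" above "futility", so a frequency ranking is at best a first filter.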
