compositions of tree to tree statistical machine
play

Compositions of Tree-to-Tree Statistical Machine Translation Models - PowerPoint PPT Presentation

Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti Institute for Natural Language Processing University of Stuttgart, Germany Montral July 28, 2016 Compositions of Tree-to-Tree Statistical Machine


  1. Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti Institute for Natural Language Processing University of Stuttgart, Germany Montréal — July 28, 2016 Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 1

  2. Statistical machine translation Required resource Parallel corpus (containing sentences in both languages) Additional resources Language model data (text in the target language) Word alignments Parse trees Various (morphological analyzers, parsers) Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 2

  3. Statistical machine translation Assume translation direction → (English to Catalan) but no parallel corpus for this pair Pivot approach but parallel corpus for → → (parallel corpus for English-Spanish and Spanish-Catalan) (similar motivation for non-deterministic pre- or post-processing steps) Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 3

  4. Statistical machine translation Consequences 2 translation models (operating in sequence) Inefficiencies in sequential operation (due to sequential pruning) Theoretical guarantees missing for many operations (e.g., tuning) Remedies Partial Evaluation [MayKniVog10] Composition Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 4

  5. Statistical machine translation Vauquois triangle: target source semantics syntax phrase Translation model: Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 5

  6. Statistical machine translation Vauquois triangle: target source semantics syntax phrase Translation model: tree-to-tree Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 5

  7. Overview Background 1 Theoretical Model 2 Unweighted Compositions 3 Weighted Compositions 4 Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 6

  8. Extended tree transducers Definition Transducer M = ( Q , Σ , ( q 1 , q 2 ) , R , wt ) with finite set Q of states finite set Σ of symbols source initial state q 1 and target initial state q 2 finite set R of rules (see below) weight assignment wt : R → A (into a semiring) SENT S � � � q ′ � NP VN NP VP 2 q , — q ′ , q 1 q 2 PRP q ′ CI 1 I J’ Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 7

  9. Extended tree transducers Restrictions Transducer M = ( Q , Σ , ( q 1 , q 2 ) , R , wt ) is STSG if Q = Σ and state = root label SCFG if STSG and all rules are shallow ε -free if no lef-hand side is in Q strict if no right-hand side is in Q simple if ε -free and strict S SENT � � � � NP VP NP VN NP � � � � S SENT S , — SENT , S , — SENT , PRP VBP NP CI V I VBP NP J’ V NP I J’ Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 8

  10. Extended tree transducers Rules: SENT S � � � � q ′ NP NP NP VN NP VP 2 � � � � — q ′ , q , q 2 , NNS — q ′ D NC q 1 q 2 2 , PRP q ′ CI 1 flowers les fleurs I J’ Use in a derivation step: S SENT SENT S NP VP q ′ NP VN NP NP VN NP VP 2 — q 1 — ⇒ PRP NP q 1 q 2 q ′ CI D NC PRP q ′ CI 1 1 I NNS I J’ les fleurs J’ flowers Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 9

  11. Extended tree transducers Initial sentential form: — q 1 q 2 Final sentential form: No more linked states Weight of a derivation: product of the weights of the used rules of a tree pair: sum of all derivations for the pair Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 10

  12. Extended tree transducers Composition Given two weighted relations τ : T Σ × T ∆ → A and τ ′ : T ∆ × T Γ → A � ( τ ; τ ′ )( t , s ) = τ ( t , u ) · τ ′ ( u , s ) u ∈ T ∆ Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 11

  13. Overview Background 1 Theoretical Model 2 Unweighted Compositions 3 Weighted Compositions 4 Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 12

  14. Known results SCFG � STSG � TT [Eis03] and folklore Model Composition closure Reference top-down transducer 1 [Eng75] simple transducer 2 [ArnDau82] other transducer [EngFulMal16] ∞ (top-down tree transducer = all lef-hand sides shallow) Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 13

  15. First result Theorem SCFG are closed under composition Matching original rules Newly constructed rule NP NP NP NP NP — — — a une a ADJ NN NC A ein ADJA NN ADJ NN ein ADJA NN Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 14

  16. Second result Difficulty for STSGs: NP NP ... — — ... DT NP DT NP ADJ NP’ Extending the rules does not help: NP NP DT NP ... — — ... DT NP ADJ NP’ ADJ NP’ CC ADJ NP Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 15

  17. Second result Theorem Compositions of n ≥ 2 simple STSGs are as expressive as compositions of n simple transducers Proof idea: Encode finite states in the intermediate tree(s) SENT S ( SENT , q , q ′ ) S � � � q ′ � NP VN NP VP 2 ( NP , q 2 , q ′ 2 ) VP NP VN q ′ , NP q , — — q 1 q 2 PRP q ′ CI 1 PRP VBP NP ( V , q 1 , q ′ 1 ) CI I J’ I J’ Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 16

  18. Second result Corollary The composition closure for simple STSGs is obtained at level 2 Proof idea: s-STSG � s-TT � s-TT 2 = s-TT 3 = s-STSG 3 = s-STSG 2 Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 17

  19. Third result Theorem The composition hierarchy for the remaining STSGs is infinite Proof idea: Inspect the corresponding proof for transducers and realize that the counterexamples can be generated by STSGs Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 18

  20. Overview Background 1 Theoretical Model 2 Unweighted Compositions 3 Weighted Compositions 4 Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 19

  21. Known results and main lemma s-wTT ; wREL ⊆ s-wTT wREL ; s-wTT ⊆ s-wTT follows from [FulMalVog11] and [Kui99] Lemma s-wTT ⊆ su-TT inj ; wREL Proof idea: Simply annotate rule identifier of applied rule to output and apply weights using the relabeling (su-TT inj = injective relations computed by simple unamb. transducers) Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 20

  22. First weighted result Theorem The composition closure of weighted simple STSGs is obtained at level 2 Proof idea: s-wTT ; s-wTT 2 ⊆ su-TT inj ; wREL ; s-wTT 2 ⊆ su-TT inj ; s-wTT 2 ⊆ su-TT 2 inj ; wREL ; s-wTT ⊆ su-TT 2 inj ; s-wTT ⊆ su-TT 3 inj ; wREL ⊆ s-TT 2 ⊆ su-TT 2 inj ; wREL ⊆ s-wTT 2 ; wREL � �� � injective Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 21

  23. Second weighted result Theorem The composition hierarchy for the remaining weighted STSGs is infinite Proof idea: Utilize linking technique of [Mal15] to lif the corresponding unweighted result Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 22

  24. Summary Model Composition closure (weighted) SCFGs 1 (weighted) simple STSGs 2 (weighted) other STSGs ∞ Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 23

  25. Literature May, Knight, Vogler: Efficient inference through cascades of weighted tree transducers. ACL 2010 Eisner: Learning non-isomorphic tree mappings for machine translation. ACL 2003 Engelfriet: Bottom-up and top-down tree transformations: a comparison. Math. Syst. Theor., 1975 Arnold, Dauchet: Morphismes et bimorphismes d’arbres. Theor. Comput. Sci. 1982 Engelfriet, Fülöp, M.: Composition closure of linear extended top-down tree transducers. Theor. Comput. Syst. 2016 Fülöp, M., Vogler: Weighted extended tree transducers. Fundam. Informaticae 2011 Kuich: Full abstract families of tree series I. In: Jewels are Forever, Springer 1999 M.: The power of weighted regularity-preserving multi bottom-up tree transducers. Int. J. Found. Comput. Sci. 2015 Compositions of Tree-to-Tree Statistical Machine Translation Models Andreas Maletti · 24

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend