preference grammars and soft syntactic constraints for
play

Preference Grammars and Soft Syntactic Constraints for GHKM - PowerPoint PPT Presentation

Preference Grammars and Soft Syntactic Constraints for GHKM Syntax-based SMT Matthias Huck, Hieu Hoang, Philipp Koehn University of Edinburgh 25 October 2014 Introduction Feature-based integration of syntactic information into GHKM


  1. Preference Grammars and Soft Syntactic Constraints for GHKM Syntax-based SMT Matthias Huck, Hieu Hoang, Philipp Koehn University of Edinburgh 25 October 2014

  2. Introduction Feature-based integration of syntactic information into GHKM string-to-tree translation • Preference grammars: soft target-side syntax • Target syntax as a feature rather than via labeled non-terminals in the SCFG • Soft syntactic constraints: non-restrictive source-syntactic enhancement • No hard source-syntactic constraints (as in standard tree-to-tree translation) imposed in extraction or decoding Our empirical evaluation: English → German WMT task

  3. Related Work • GHKM string-to-tree translation: Galley et al. (2004) • Open-source Moses implementation for GHKM translation • GHKM rule extraction: Williams and Koehn (2012) • Decoding with CYK + parsing and cube pruning: Hoang et al. (2009) • Competitive results for European language pairs: Nadejde et al. (2013); Williams et al. (2014) • Preference grammars: beneficial as a syntactic extension of hierarchical systems (Venugopal et al., 2009; Stein et al., 2010) • Soft syntactic constraints: related source-syntactic techniques improved hierarchical (Marton and Resnik, 2008; Vilar et al., 2008; Hoang and Koehn, 2010) and other syntax-based systems (Zhang et al., 2011; Huang et al., 2013) on Chinese → English and Arabic → English tasks

  4. Preference Grammars • Target-side non-terminals not decorated with syntactic labels, but with a single generic non-terminal symbol baseline preference grammar system  X , ADJD → � present , anwesend �   X , ADV → � present , anwesend �  X , X → � present , anwesend � X , AP - PD → � present , anwesend �   . . .  • Distribution of implicit target label vectors stored as additional information with each translation rule X , X → � present , anwesend � # ( ADJD ) 0 . 98 ( ADV ) 0 . 001 ( AP - PD ) 0 . 01 ... • Computation of a tree-wellformedness feature during decoding

  5. Soft Source Syntactic Constraints • Parse source-side data as well • GHKM extractor stores an additional rule property: source syntactic label vectors • Provide parsed input data to the decoder • During decoding, score matches and mismatches of source label vectors and input labels • Soft features, no hard constraints rule source label vectors X , VP - OC → � present , zu präsentieren � ( VB ) , ( NN ) X , ADJD → � present , anwesend � ( ADJP ) , ( ADVP ) ,. . . X , TOP → � X ∼ 0 is now X ∼ 1 . , jetzt ist NP - SB ∼ 0 VP - OC ∼ 1 . � ( TOP , NP , ADJP ) ,. . . X , ˆ S - TOP → � is X ∼ 0 X ∼ 1 , ist ADV ∼ 0 ADJD ∼ 1 � ( VP , ADVP , ADJP ) ,. . .

  6. the army is now about to present flowers .

  7. die Armee ist jetzt dabei , Blumen zu präsentieren . the army is now about to present flowers .

  8. TOP ˆS-TOP ˆS-TOP NP-OA VVINF die Armee ist jetzt dabei , Blumen zu präsentieren . the army is now about to present flowers .

  9. TOP ˆS-TOP ˆS-TOP NP-OA VVINF die Armee ist jetzt dabei , Blumen zu präsentieren . the army is now about to present flowers . the army is now present .

  10. TOP ˆS-TOP ˆS-TOP NP-OA VVINF die Armee ist jetzt dabei , Blumen zu präsentieren . jetzt ist die Armee zu präsentieren . ✿✿✿✿✿✿✿✿✿✿✿✿✿✿✿ the army is now about to present flowers . the army is now present .

  11. TOP TOP NP-SB VP-OC ˆS-TOP ˆS-TOP NP-OA VVINF die Armee ist jetzt dabei , Blumen zu präsentieren . jetzt ist die Armee zu präsentieren . ✿✿✿✿✿✿✿✿✿✿✿✿✿✿✿ the army is now about to present flowers . the army is now present .

  12. TOP TOP NP-SB VP-OC ˆS-TOP ˆS-TOP NP-OA VVINF die Armee ist jetzt dabei , Blumen zu präsentieren . jetzt ist die Armee zu präsentieren . ✿✿✿✿✿✿✿✿✿✿✿✿✿✿✿✿ the army is now about to present flowers . the army is now present .

  13. TOP TOP NP-SB VP-OC ˆS-TOP ˆS-TOP NP-OA VVINF die Armee ist jetzt dabei , Blumen zu präsentieren . jetzt ist die Armee zu präsentieren . ✿✿✿✿✿✿✿✿✿✿✿✿✿✿✿✿ the army is now about to present flowers . the army is now present . NNS VB NP TO VP VP RB IN S DT NN VBZ ADVP ADJP NP VP . S TOP

  14. TOP TOP NP-SB VP-OC ˆS-TOP ˆS-TOP NP-OA VVINF die Armee ist jetzt dabei , Blumen zu präsentieren . jetzt ist die Armee zu präsentieren . ✿✿✿✿✿✿✿✿✿✿✿✿✿✿✿✿ the army is now about to present flowers . the army is now present . NNS VB NP TO VP VP RB IN S DT NN VBZ ADVP ADJP NP VP . S TOP

  15. TOP TOP NP-SB VP-OC ˆS-TOP ˆS-TOP NP-OA VVINF die Armee ist jetzt dabei , Blumen zu präsentieren . jetzt ist die Armee zu präsentieren . ✿✿✿✿✿✿✿✿✿✿✿✿✿✿✿✿ the army is now about to present flowers . the army is now present . NNS VB NP TO VP VP RB IN S RB JJ DT NN VBZ ADVP ADJP DT NN VBZ ADVP ADJP NP VP . NP VP . S S TOP TOP

  16. TOP TOP NP-SB VP-OC ˆS-TOP ˆS-TOP ✿✿✿✿✿✿✿ NP-OA VVINF die Armee ist jetzt dabei , Blumen zu präsentieren . jetzt ist die Armee zu präsentieren . ✿✿✿✿✿✿✿✿✿✿✿✿✿✿✿✿ the army is now about to present flowers . the army is now present . NNS VB NP TO VP VP RB IN S RB JJ DT NN VBZ ADVP ADJP DT NN VBZ ADVP ADJP NP VP . NP VP . S S TOP TOP

  17. TOP ˆS-TOP ˆS-TOP NP-OA VVINF die Armee ist jetzt dabei , Blumen zu präsentieren . die Armee ist heute anwesend . the army is now about to present flowers . the army is now present . NNS VB NP TO VP VP RB IN S RB JJ DT NN VBZ ADVP ADJP DT NN VBZ ADVP ADJP NP VP . NP VP . S S TOP TOP

  18. TOP TOP ˆS-TOP ˆS-TOP ˆS-TOP ADV ADJD NP-OA VVINF die Armee ist jetzt dabei , Blumen zu präsentieren . die Armee ist heute anwesend . the army is now about to present flowers . the army is now present . NNS VB NP TO VP VP RB IN S RB JJ DT NN VBZ ADVP ADJP DT NN VBZ ADVP ADJP NP VP . NP VP . S S TOP TOP

  19. TOP TOP ˆS-TOP ˆS-TOP ˆS-TOP ADV ADJD NP-OA VVINF die Armee ist jetzt dabei , Blumen zu präsentieren . die Armee ist heute anwesend . the army is now about to present flowers . the army is now present . NNS VB NP TO VP VP RB IN S RB JJ DT NN VBZ ADVP ADJP DT NN VBZ ADVP ADJP NP VP . NP VP . S S TOP TOP

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend