towards probabilistic acceptors and transducers for
play

Towards Probabilistic Acceptors and Transducers for Feature - PowerPoint PPT Presentation

Towards Probabilistic Acceptors and Transducers for Feature Structures Daniel Quernheim Institute for Natural Language Processing, University of Stuttgart daniel@ims.uni-stuttgart.de Kevin Knight Information Sciences Institute, University of


  1. Towards Probabilistic Acceptors and Transducers for Feature Structures Daniel Quernheim Institute for Natural Language Processing, University of Stuttgart daniel@ims.uni-stuttgart.de Kevin Knight Information Sciences Institute, University of Southern California knight@isi.edu Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-6) July 12, 2012 Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 1 / 21

  2. Linguistic structures Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 2 / 21

  3. Linguistic structures Strings surface forms, phonology, morphology Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 2 / 21

  4. Linguistic structures Strings surface forms, phonology, morphology Trees syntax Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 2 / 21

  5. Linguistic structures Strings surface forms, phonology, morphology Trees syntax Feature structures (= directed acyclic graphs) deep syntax (LFG etc.) semantics (abstract meaning representations) Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 2 / 21

  6. Feature structures   charge INSTANCE � �   person INSTANCE   THEME 1   “Pascale”  NAME          and INSTANCE           resist INSTANCE             AGENT 1         OP 1     � �    arrest   INSTANCE        THEME        THEME 1  PRED                 intoxicate  INSTANCE            THEME  1  OP 2            � �      public LOCATION INSTANCE     Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 3 / 21

  7. Directed acyclic graphs CHARGE CHARGE �→ charge ( theme , pred ) AND �→ and ( op1 , op2 ) AND RESIST �→ resist ( agent , theme ) ARREST �→ arrest ( theme ) RESIST INTOXICATE INTOXICATE �→ intoxicate ( theme , location ) ARREST PUBLIC �→ public () PERSON �→ person ( name ) PERSON PUBLIC PASCALE �→ "Pascale" PASCALE Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 4 / 21

  8. Translation pipelines Syntax-based MT pipeline fstring → translate → etree → language model → estring Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 5 / 21

  9. Translation pipelines Syntax-based MT pipeline fstring → translate → etree → language model → estring ◮ The individual components are efficiently represented as weighted tree acceptors and transducers . Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 5 / 21

  10. Translation pipelines Syntax-based MT pipeline nl-XTOPs − 1 FSA FSA RTG FSA fstring → translate → etree → language model → estring ◮ The individual components are efficiently represented as weighted tree acceptors and transducers . estring = B EST P ATH ( I NTERSECT ( language model , Y IELD ( B ACKWARDS ( translate , fstring )))) . Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 5 / 21

  11. Translation pipelines (2) Semantics-based MT pipeline fstring → understand → AMR → rank → AMR → generate → etree → rank → estring Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 6 / 21

  12. Translation pipelines (2) Semantics-based MT pipeline fstring → understand → AMR → rank → AMR → generate → etree → rank → estring ◮ No suitable automaton framework is known! Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 6 / 21

  13. Algorithms and automata string automata tree automata graph automata k -best paths through a trees in a weighted WFSA forest EM training Forward-backward Tree transducer EM EM training Determinization of weighted string ac- of weighted tree ac- ceptors ceptors Transducer WFST composition Many transducers not composition closed under compo- sition General tools AT&T FSM, Carmel, Tiburon OpenFST Table: General-purpose algorithms for strings, trees and feature structures. Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 7 / 21

  14. Algorithms and automata string automata tree automata graph automata ? k -best paths through a trees in a weighted WFSA forest EM training Forward-backward Tree transducer EM ? EM training Determinization of weighted string ac- of weighted tree ac- ? ceptors ceptors Transducer WFST composition Many transducers not ? composition closed under compo- sition General tools AT&T FSM, Carmel, Tiburon ? OpenFST Table: General-purpose algorithms for strings, trees and feature structures. Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 7 / 21

  15. Algorithms and automata (2) Our goal ◮ Find an adequate automaton model for the pipeline parts ◮ Investigate algorithms and fill all the blanks! Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 8 / 21

  16. Algorithms and automata (2) Our goal ◮ Find an adequate automaton model for the pipeline parts ◮ Investigate algorithms and fill all the blanks! Candidates ◮ Treating everything as a tree (too weak?) ◮ Unification grammars (HPSG, LFG) (too powerful?) ◮ Hyperedge replacement grammar (too powerful?) Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 8 / 21

  17. Algorithms and automata (2) Our goal ◮ Find an adequate automaton model for the pipeline parts ◮ Investigate algorithms and fill all the blanks! Candidates ◮ Treating everything as a tree (too weak?) ◮ Unification grammars (HPSG, LFG) (too powerful?) ◮ Hyperedge replacement grammar (too powerful?) Some straightforward extension of string/tree automata? ◮ Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 8 / 21

  18. Dag automata finite string automaton: (FSA) one input state, one input symbol, one output state . . . . . . p q σ Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 9 / 21

  19. Dag automata finite string automaton: (FSA) one input state, one input symbol, one output state . . . . . . p q σ finite tree automaton: (FTA) one input state, one input symbol, many output states q 1 . . . . . . p q 2 σ q 3 Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 9 / 21

  20. Dag automata finite string automaton: (FSA) one input state, one input symbol, one output state . . . . . . p q σ finite tree automaton: (FTA) one input state, one input symbol, many output states q 1 . . . . . . p q 2 σ q 3 finite dag automaton: (FDA?) many input states, one input symbol, many output states p 1 q 1 . . . . . . q 2 σ p 2 q 3 Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 9 / 21

  21. Dag automata (2) K AMIMURA and S LUTZKI (1981, 1982) ◮ Dag acceptors and dag-to-tree transducers ◮ They proved a couple of technical properties, no algorithms Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 10 / 21

  22. Dag automata (2) K AMIMURA and S LUTZKI (1981, 1982) ◮ Dag acceptors and dag-to-tree transducers ◮ They proved a couple of technical properties, no algorithms ◮ We investigate their model with some adjustments: ◮ not only adjacent leaves can be connected Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 10 / 21

  23. Dag automata (2) K AMIMURA and S LUTZKI (1981, 1982) ◮ Dag acceptors and dag-to-tree transducers ◮ They proved a couple of technical properties, no algorithms ◮ We investigate their model with some adjustments: WANT ◮ not only adjacent leaves can be connected BELIEVE BOY GIRL Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 10 / 21

  24. Dag automata (2) K AMIMURA and S LUTZKI (1981, 1982) ◮ Dag acceptors and dag-to-tree transducers ◮ They proved a couple of technical properties, no algorithms ◮ We investigate their model with some adjustments: WANT ◮ not only adjacent leaves can be connected BELIEVE ◮ top-down transducers instead of bottom-up ◮ we introduce weights (probabilities) BOY GIRL Quernheim and Knight Towards Probabilistic Acceptors and Transducers for Feature Structures July 12, 2012 10 / 21

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend