a uniform architecture for parsing and generation of
play

A Uniform Architecture for Parsing and Generation of Natural - PDF document

Uniform grammatical processing A Uniform Architecture for Parsing and Generation of Natural Language G unter Neumann DFKI GmbH 66123 Saarbr ucken neumann@dfki.de G unter Neumann DFKI Uniform grammatical processing Overview Work


  1. Uniform grammatical processing A Uniform Architecture for Parsing and Generation of Natural Language G¨ unter Neumann DFKI GmbH 66123 Saarbr¨ ucken neumann@dfki.de G¨ unter Neumann DFKI

  2. Uniform grammatical processing Overview Work based on Neumann:94 (Ph.D. thesis), Neumann:98 (AIJ) 1. Parsing and Generation 2. Results 3. Motivation 4. State of the Art 5. A New Uniform Architecture 6. Parsing and Generation: UTA can do both 7. Interleaving of Parsing and Generation 8. Conclusion and Future Direction G¨ unter Neumann DFKI

  3. Uniform grammatical processing Uniform grammatical processing • Parsing: given a string, compute all possible logical forms (wrt. the given grammar) • Generation: given a logical form, compute all pos- sible strings • Uniformity – use of one and the same grammar for perform- ing both tasks = ⇒ reversible grammar – use of the same algorithm = ⇒ uniform algorithm G¨ unter Neumann DFKI

  4. Uniform grammatical processing Results • Uniform Tabular Algorithm (UTA): – constraint-based grammars – generalized Early deduction – flexible agenda mechanism – On-line – Input as essential feature ∗ dynamic selection function ∗ uniform chart mechanism = ⇒ uniform and task-oriented processing • Performanz model on the basis of a uniform ar- chitecture: – item-sharing between parsing and generation – incremental self-monitoring/revison strategies – generation of un-ambiguous strings – generation of paraphrases – any-time mode − → interleaved parsing and generation • Implementation in Common Lisp and CLOS G¨ unter Neumann DFKI

  5. Uniform grammatical processing Why uniform grammatical processing ? • Theoretical: – Occam’s razor – psycho-linguistic motivations • Practical: – reduced redundancies – simpler consistency tests – knowledge acquistion – compact and modular systems • Application: – grammar development – interactive grammar-/style-checker – incremental text processing – monitoring and revision – generation of paraphrases – processing of elliptic expressions – combination of learning-/ preference-based methods – . . . G¨ unter Neumann DFKI

  6. Uniform grammatical processing Reversible grammars • Language as a relation R: wellformed strings × logical forms ( R ⊆ S × LF ) • Parsing: s , compute { lf i | < s, lf i > ∈ R } • Generation: lf , compute { s i | < s i , lf > ∈ R } • Reversible grammar: define R with one grammar • Ambiguity and paraphrases lf ′ lf L¨ osche das Verzeichnis mit den Systemtools! G¨ unter Neumann DFKI

  7. Uniform grammatical processing Current state of the art Type A Type B Source Source Grammar Grammar Parsing Generation Sem. Expr. Sem. Expr. Grammar Grammar Sem. Expr. Sem. Expr. Parser Generator Generator Parser String String String String Type C Type D Source Grammar Source Grammar Parsing Generation Sem. Expr. Grammar Grammar Sem. Expr. Uniform Algorithm Uniform Algorithm String String G¨ unter Neumann DFKI

  8. Uniform grammatical processing Disadvantages of current models • Types A, B, C – approaches: Block (A), Strzalkowski (C), Dymetman et al. (C) – high degree of redundancies(A,C) – testing of source grammar not possible (A,C) – interleaved parsing and generation not meaningful • Type D – approaches: Shieber, van Noord, Gerdeman – interleaved approach possible – poor dynamic of the models – parsing-oriented chart – restricted view on uniformity G¨ unter Neumann DFKI

  9. Uniform grammatical processing A New Uniform Model Conceptual System Text Text Interpretation Planning Logical Form Reversible Monitoring Grammar Revision (incl. Lexicon) Paraphrasing Uniform Algorithm Item-sharing String G¨ unter Neumann DFKI

  10. Uniform grammatical processing Constraint-based grammars • e.g., LFG, HPSG, CUG • Reversibility – uniform representation ( phon , syn , sem ) – word, phrase, clause level – structure sharing – declarative Example:   sentence cat  � peter, cries �  phon     . . . syntax         verb cat     noun cat   cries   phon   peter   phon                        per 3        � �   per 3 syntax  agr         ,  dtrs syntax  agr num sg           num sg                       � �    rel cry         rel the-peter’  semantics Arg semantics        arg Arg     Sem       semantics Sem G¨ unter Neumann DFKI

  11. Uniform grammatical processing Constraint Logic programming CLP • Generalization of conventional logic program- ming to arbitrary constraint-languages (Hoe- feld&Smolka:88) • Representation of grammar as definite clauses – rule: q ← p 1 , . . . , p n , φ – lexical element: q ← φ • Goal-reduction rule: goal: p 1 , . . . , p ( � x ) , . . . p n , φ clause: p ( � x ) ← q 1 , . . . , q m , ψ = ⇒ new goal: p 1 , . . . , q 1 , . . . , q m , . . . , p n , φ, ψ • Constraint-solver: unify ( φ, ψ ) • Parsing and generation: queries of form ← q, φ G¨ unter Neumann DFKI

  12. Uniform grammatical processing UTA: A Uniform Algorithm for Parsing and Generation • Goal: uniform and task-oriented Processing • Uniform control logic: generalized Earley deduc- tion (based on Pereira&Warren:83) – grammar (rule, lexicon), item sets – item: lemma with selected element ( sel ) – active item (AI): � h ← b 0 . . . b n ; i ; idx � – passive item(PI): � h ← ǫ ; ǫ ; idx � – blocking-test: subsumption G¨ unter Neumann DFKI

  13. Uniform grammatical processing UTA: A Uniform Algorithm for Parsing and Generation • Inference rules: – prediction: abstr(sel (AI)) unify head(rule) – completion: AI minus sel ∗ scanning: sel (AI) unify lexical element ∗ active completion: sel (AI) unify PI ∗ passive completion: PI unify sel (AI) • New clauses (Items): determine sel using dy- namic selection function sf Prediction: � Φ[ Rule ]; sf (Φ[ Rule ] , EF ); Idx � G¨ unter Neumann DFKI

  14. Uniform grammatical processing Parametrization of UTA • Relevant parameter: Essential Feature EF = ⇒ the feature, that carries the input (e.g., phon or sem ) – parametrized selection function ∗ EF guides ordering of processing of rhs(rule) – paramertized item set ∗ EF used for defining equivalence classes • Parsing and generation with UTA = ⇒ main difference is the different input structure G¨ unter Neumann DFKI

  15. Uniform grammatical processing Parametrizable selection function • Choose that element, whose Essential Feature is instantiated, else take the left-most one • Implications: – data-driven selection, e.g., ∗ left-to-right (e.g., for parsing) ∗ functor-first (e.g., for generation) ∗ or both ∗ integration of preferences – grammar itself has influence on control Beispiel:     vp cat:     vp cat:   Tail   sc:         � Arg | Tail � sc:         � � � � Sem sem:         P 0 -P 1 phon: sign ← − sign , sign Sem sem:         no   lex:   Arg           V   v2:         V v2:         P 1 -P phon:     P 0 -P phon: G¨ unter Neumann DFKI

  16. Uniform grammatical processing Structured item set • Idea: divide item set into equivalence classes • Determination of equivalence classes by means of Essential Feature = ⇒ item set is structured according to input struc- ture, e.g., – as sequence in case of parsing – as funktor/argument tree in case of generation – set in case of MRS • Advantage: – application of inference rules on subsets – blocking-test only on subsets – on-the-fly creation • Details: – item set: � AI, PI, Idx � – ∀ items: EF compatible − → Idx – PI: EF of Head, AI: EF of SEL G¨ unter Neumann DFKI

  17. Uniform grammatical processing Flexible agenda mechanism • Guides order of processing of new items • Sorts items according to preference • Activation of clauses and insertion into item set according to preference • Advantage: – depth-first, breadth-first, best-first, random – blocking-test only on “activated” clauses – interleaved parsing and generation: different preference rules G¨ unter Neumann DFKI

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend