Maximum Entropy Models for Realization Ranking

Erik Velldal <erik.velldal@iln.uio.no>, Department of Linguistics and Scandinavian Studies
Stephan Oepen <oe@csli.stanford.edu>, Center for the Study of Language and Information
Realization Ranking
The problem: ambiguity in generation – many ways to formulate a given meaning.
A solution: use statistics for modeling preferences and soft constraints (grammaticality already guaranteed).
Trained and tested three types of models:
1) n-gram language models (surface oriented)
2) a maximum entropy model (structural features)
3) a combination of 1) and 2)
Overview
Generation in the LOGON MT-system and the problem of realization ranking.
Reference experiments: random choice and n-gram language models.
The relation to parse selection.
Treebank data and maximum entropy models (MaxEnt).
A combined model: MaxEnt + language model.
Results, future work, and discussion.
Generation in the LOGON MT-system
LOGON
– Aims at high-precision Norwegian–English MT of texts in the tourism domain.
– Symbolic, rule-based system, centered on semantic transfer using Minimal Recursion Semantics (MRS; Copestake, Flickinger, Malouf, Riehemann, & Sag, 1995).
– Includes stochastic methods for ambiguity management.
The LKB Chart Generator (Carroll, Copestake, Flickinger, & Poznanski, 1999; Carroll & Oepen, 2005)
– Lexically-driven, bottom-up chart generation from MRSs.
– Generation based on the LinGO English Resource Grammar (ERG; Flickinger, 2002); a general-purpose, wide-coverage grammar, designed using HPSG and MRS.
Generator Ambiguity
Caused by, e.g., the optionality of complementizers and relative pronouns, permutation of (intersective) modifiers, different possible topicalizations, as well as lexical and orthographic alternations.
Average number of realizations in the current data set is 73 (max = 5712). All realizations of a given MRS are guaranteed to be semantically (truth-conditionally) equivalent. Grammaticality is ensured with respect to the underlying grammar (LinGO ERG).
Remember that dogs must be on a leash.
Remember dogs must be on a leash.
On a leash remember that dogs must be.
On a leash remember dogs must be.
A leash remember that dogs must be on.
A leash remember dogs must be on.
Dogs remember must be on a leash.
A Language Model Ranker
The most common approach to the problem of generator ambiguity is to use n-gram statistics (Langkilde & Knight, 1998; White, 2004; Callison-Burch & Flournoy, 2001).
Score and rank strings using a language model:
$p_n(w_1, \ldots, w_k) = \prod_{i=1}^{k} p(w_i \mid w_{i-n}, \ldots, w_{i-1})$
Trained a 4-gram model on the BNC (100 million words).
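As a minimal illustration (not from the talk), this is how such a ranker might score and sort candidate strings; `logprob(context, word)` is a hypothetical lookup assumed to return smoothed log-probabilities from the trained model:

def score(sentence, logprob, n=4):
    """Log-probability of a sentence under an n-gram model.
    `logprob(context, word)` is assumed to return log p(word | context),
    with smoothing/backoff handled inside the trained model."""
    words = ["<s>"] * (n - 1) + sentence.split() + ["</s>"]
    total = 0.0
    for i in range(n - 1, len(words)):
        context = tuple(words[i - n + 1:i])
        total += logprob(context, words[i])
    return total

def rank(candidates, logprob, n=4):
    """Sort competing realizations of one MRS from best to worst."""
    return sorted(candidates, key=lambda s: score(s, logprob, n), reverse=True)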
A Language Model Ranker (Cont’d)
Results on the LOGON data set ‘Rondane’ (864 test items, to be detailed later in the talk):
– Exact match accuracy: 48.46% (random choice baseline: 18.03%)
– BLEU score: 0.8776 (random choice baseline: 0.727)
Limitations: cannot model dependencies between non-contiguous words; no linguistic information; does not condition the output (string) on the input (MRS).
The Relation to Parse Selection
The problem of selecting the best realization can be seen to be “inversely similar” to the problem of selecting the best parse:
– parse selection estimates p(analysis | utterance), while realization ranking estimates p(utterance | analysis).
MaxEnt model for parse disambiguation using the Redwoods HPSG treebank (Toutanova, Manning, Shieber, Flickinger, & Oepen, 2002).
Features defined over derivation trees, with non-terminals representing the construction types and lexical types of the grammar.
We train a realization ranker in much the same way; it requires a different type of treebank for training.
Treebanks for Parse Selection
Training data for parse selection models is typically given by (1) a treebank of utterances paired with their optimal analyses, together with (2) all their competing (suboptimal) analyses.
[Figure: utterances on one side mapped to their analyses on the other]
Symmetric Treebanks
To produce a symmetric treebank, exhaustively generate all paraphrases of the treebanked analyses, and assume the optimality relation to be bidirectional (Velldal, Oepen, & Flickinger, 2004).
[Figure: bidirectional mapping between analyses and utterances]
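A rough sketch of this construction (my paraphrase of the cited method, with `generate` standing in for the exhaustive chart generator): for every treebanked pair, enumerate all realizations of the preferred analysis and mark the original utterance as the optimal candidate.

def build_symmetric_treebank(treebank, generate):
    """Turn a parse-selection treebank into realization-ranking data.
    `treebank` holds (utterance, optimal_analysis) pairs; `generate`
    is assumed to exhaustively enumerate the realizations of an
    analysis. Treating optimality as bidirectional, the original
    utterance becomes the preferred realization of its own analysis."""
    data = []
    for utterance, analysis in treebank:
        candidates = generate(analysis)
        labeled = [(cand, cand == utterance) for cand in candidates]
        data.append((analysis, labeled))
    return data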
Treebanks for Realization Ranking
We now have the training data for a realization ranking model, given by (1) a treebank of analyses paired with their optimal utterances, together with (2) all competing (suboptimal) candidates.
[Figure: analyses mapped to their candidate utterances]
The Rondane Treebank
Aggregate              items   words   ambiguity   baseline (%)
100 ≤ readings            87    20.5       580.8        0.42
50 ≤ readings < 100       61    17.3        73.0        1.44
10 ≤ readings < 50       269    15.1        22.5        5.61
5 ≤ readings < 10        172    11.1         6.9       15.66
1 < readings < 5         275     8.8         2.8       40.9
Total                    864    13.0        72.9       18.03

The treebank data binned with respect to generator ambiguity, for each group showing the total number of items, average string length, average number of paraphrases, and a random choice baseline for accuracy.
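The random choice baseline is, presumably, the expected exact-match accuracy of picking uniformly among each item's realizations, i.e. the mean of 1/readings over items; that is why the total is 18.03% rather than 1/72.9 ≈ 1.4%. A one-line sketch of that reading:

def random_baseline(readings_per_item):
    """Mean over items of 1/(number of distinct realizations):
    the expected accuracy of one uniform random pick per item."""
    return sum(1.0 / n for n in readings_per_item) / len(readings_per_item)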
Maximum Entropy Models
Given by a set of features $\{f_1, \ldots, f_m\}$ and a set of associated weights $\{\lambda_1, \ldots, \lambda_m\}$.
The lambda weights determine the contribution or importance of each feature.
Probability of a realization $r$ for an input semantics $s$, where $Y(s)$ is the set of candidate realizations:
$p_\lambda(r \mid s) = \dfrac{\exp \sum_i \lambda_i f_i(r)}{\sum_{r' \in Y(s)} \exp \sum_i \lambda_i f_i(r')}$
The weights are estimated by maximizing the (conditional) likelihood of the training corpus.
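A sketch of the corresponding scoring step, assuming each candidate realization is represented as a dict of feature counts and `weights` maps feature names to their lambda values:

import math

def maxent_probs(candidates, weights):
    """Conditional MaxEnt distribution over the competing
    realizations of a single input semantics."""
    scores = [sum(weights.get(f, 0.0) * v for f, v in feats.items())
              for feats in candidates]
    m = max(scores)  # log-sum-exp trick for numerical stability
    z = m + math.log(sum(math.exp(s - m) for s in scores))
    return [math.exp(s - z) for s in scores]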
MaxEnt Features
subjh
├─ hspec
│  ├─ det_the_le (“the”)
│  └─ sing_noun
│     └─ n_intr_le (“dog”)
└─ third_sg_fin_verb
   └─ v_unerg_le (“barks”)

Sample HPSG derivation tree for “the dog barks”. Features record local derivation sub-trees with different degrees of lexicalization, levels of grandparenting, etc. Additional features record n-grams over lexical types.
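To make the feature templates concrete, here is a hedged sketch (not the actual LOGON extraction code): it records one feature per local sub-tree, optionally decorated with ancestor labels for grandparenting, over a derivation tree encoded as nested (label, children) tuples.

def subtree_features(node, grandparents=0, ancestors=()):
    """Record one feature per local sub-tree: the node label, its
    daughter labels, and up to `grandparents` ancestor labels."""
    label, children = node
    feats = []
    if children:
        daughters = tuple(child[0] for child in children)
        context = ancestors[-grandparents:] if grandparents else ()
        feats.append((context, label, daughters))
        for child in children:
            feats.extend(subtree_features(child, grandparents,
                                          ancestors + (label,)))
    return feats

# The derivation tree above, with one level of grandparenting:
tree = ("subjh",
        [("hspec",
          [("det_the_le", [("the", [])]),
           ("sing_noun", [("n_intr_le", [("dog", [])])])]),
         ("third_sg_fin_verb",
          [("v_unerg_le", [("barks", [])])])])
for feature in subtree_features(tree, grandparents=1):
    print(feature)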
The MaxEnt Ranker
Exact match accuracy: 61.58%; BLEU: 0.903.
When training and testing by 10-fold cross-validation on the small ‘Rondane’ data set, we get results competitive with a language model trained on the entire BNC.
– Structural features are a good thing.
– Having training data attuned to the domain is a good thing.
A Combined Ranker
Many non-overlapping errors are made by the two models, leaving more to be gained by combining them.
We can throw in the n-gram probabilities as a separate feature in the MaxEnt model to get a combined model.
Exact match accuracy: 65.63%; BLEU: 0.920.
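In other words, the LM score becomes one more real-valued feature whose weight is fit alongside the structural ones. A sketch, with `lm_score` standing in for a sentence-level log-probability function such as the `score` helper sketched earlier:

def add_lm_feature(feature_dicts, strings, lm_score):
    """Extend each candidate's structural features with its n-gram
    log-probability, so the MaxEnt trainer weights the LM score
    against the derivation-tree features."""
    for feats, sent in zip(feature_dicts, strings):
        feats["lm_logprob"] = lm_score(sent)  # e.g. score() from above
    return feature_dicts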
Exact Match Accuracy
[Figure: grouped bar chart, y-axis 10–80 (accuracy %), bins 1–5, 5–10, 10–50, 50–100, 100–5712; series: Language Model, MaxEnt, Combined]
Exact match accuracy scores for the different models. Data items are binned with respect to number of distinct realizations.
BLEU
[Figure: grouped bar chart, y-axis 0.7–1.0 (BLEU), bins 1–5, 5–10, 10–50, 50–100, 100–5712; series: Language Model, MaxEnt, Combined]
Averaged sentence-level BLEU scores for the different models. Data items are binned with respect to number of distinct realizations.
Summary
Successful combination of linguistic grammar and stochastic disambiguation for target language generation in a hybrid MT system (LOGON).
The ranking module benefits from combining statistics from different sources: surface-oriented n-grams in addition to structural features of derivation trees.
Ongoing work: generation from packed MRSs and selective unpacking (Carroll & Oepen, 2005).
References
Callison-Burch, C., & Flournoy, R. S. (2001). A program for automatically selecting the best output from multiple machine translation engines. In Proceedings of the MT Summit. Santiago, Spain.

Carroll, J., Copestake, A., Flickinger, D., & Poznanski, V. (1999). An efficient chart generator for (semi-)lexicalist grammars. In Proceedings of the 7th European Workshop on Natural Language Generation (pp. 86–95). Toulouse, France.

Carroll, J., & Oepen, S. (2005). High efficiency realization for a wide-coverage unification grammar. In Proceedings of the 2nd International Joint Conference on Natural Language Processing. Jeju, Republic of Korea.

Copestake, A., Flickinger, D., Malouf, R., Riehemann, S., & Sag, I. (1995). Translation using minimal recursion semantics. In Proceedings of the Sixth International Conference on Theoretical and Methodological Issues in Machine Translation. Leuven, Belgium.

Flickinger, D. (2002). On building a more efficient grammar by exploiting types. In S. Oepen, D. Flickinger, J. Tsujii, & H. Uszkoreit (Eds.), Collaborative language engineering: A case study in efficient grammar-based processing (pp. 1–17). CSLI Press.

Langkilde, I., & Knight, K. (1998). The practical value of n-grams in generation. In Proceedings of the International Natural Language Generation Workshop.
Toutanova, K., Manning, C. D., Shieber, S. M., Flickinger, D., & Oepen, S. (2002). Parse disambiguation for a rich HPSG grammar. In Proceedings of the First Workshop on Treebanks and Linguistic Theories. Sozopol, Bulgaria.

Velldal, E., Oepen, S., & Flickinger, D. (2004). Paraphrasing treebanks for stochastic realization ranking. In Proceedings of the 3rd Workshop on Treebanks and Linguistic Theories. Tübingen, Germany.

White, M. (2004). Reining in CCG chart realization. In Proceedings of the 3rd International Conference on Natural Language Generation. Hampshire, UK.