SLIDE 1

Good Question!

Statistical Ranking for Question Generation


  • Michael Heilman and Noah A. Smith
SLIDE 2

The Goal

Input: educational text
Output: quiz


SLIDE 3

The Goal

Input: educational text
Output: ranked list of candidate questions to present to a teacher

Text-to-text generation

  • Knight & Marcu, 00; Clarke, 06 (Compression); Barzilay & McKeown, 05 (Sentence Fusion); Callison-Burch, 07 (Paraphrase Generation); inter alia

SLIDE 4

Our Approach

  • Sentence-level factual questions
  • Acceptable (e.g., grammatical) questions
  • QG as a series of sentence-structure transformations

SLIDE 5

Outline

  • Challenges in Question Generation (QG)
  • Implementation Details
  • Step-by-Step Example
  • Rating Questions
  • Ranking Model
  • Experiments

SLIDE 6

Constraints on WH movement

Darwin studied how species evolve.
Who studied how species evolve?
*What did Darwin study how _ evolve?

WH movement is well studied. We encode this linguistic knowledge with rules.

  • Ross, 67; Chomsky, 77; inter alia

SLIDE 7

Complex Input Sentences

Lincoln, who was born in Kentucky, moved to Illinois in 1831.
Intermediate Form: Lincoln was born in Kentucky.
Where was Lincoln born?

  • Step 1: Extraction of Simple Factual Statements (rule-based)
  • Step 2: Transformation into Questions (rule-based)
SLIDE 8

Vague and Awkward Questions, etc.

Lincoln, who was born in Kentucky… Where was Lincoln born?
Lincoln, who faced many challenges… What did Lincoln face?

Weak predictors: # proper nouns, WH word, transformations, etc.

  • Step 1: Extraction of Simple Factual Statements (rule-based)
  • Step 2: Transformation into Questions (rule-based)
  • Step 3: Statistical Ranking (learned from labeled output from steps 1 & 2)
SLIDE 9

Connections to Prior Work on QG

Most prior work:

  • Sentence-level factual questions
  • Syntactic rules for transformation or extraction
  • Generation in a single step
  • Mitkov & Ha, 03; Kunichika et al., 04; Gates, 08; inter alia

Contributions:

  • Multi-step framework
  • Ranking model learned from labeled output
  • QG evaluation methodology with broad-domain corpora

  • Overgeneration and Ranking for NLG: Langkilde & Knight, 98; Walker et al., 01

SLIDE 10

Outline

  • Challenges in QG
  • Implementation Details
  • Step-by-Step Example
  • Rating Questions
  • Ranking Model
  • Experiments

SLIDE 11

Implementation Details

We use BBN IdentiFinder to find entity labels, and map these to WH words.

  • PERSON -> Who
  • LOCATION -> Where
  • etc.

  • Bikel et al., 99

We use phrase structure parses from the Stanford Parser.

We encode transformations in the Tregex tree-searching language.

  • Klein & Manning, 03; Levy & Andrew, 06
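The entity-label to WH-word mapping can be sketched as a simple lookup table. Only PERSON and LOCATION appear on the slide; the other entries and the "What" default are my own assumed extensions, not the system's actual table.

```python
# Slide shows PERSON and LOCATION; DATE/ORGANIZATION and the default
# are assumed extensions for illustration, not the real system's mapping.
ENTITY_TO_WH = {
    "PERSON": "Who",
    "LOCATION": "Where",
    "DATE": "When",          # assumed, not shown on the slide
    "ORGANIZATION": "What",  # assumed, not shown on the slide
}

def wh_word(entity_label):
    """Map a named-entity label to a WH word, defaulting to 'What'."""
    return ENTITY_TO_WH.get(entity_label, "What")
```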

SLIDE 12

Example Tregex Rule

Constraint: Phrases dominated by a clause with a WH-complementizer cannot undergo movement.

SBAR < /^WH.*P$/ << NP|ADJP|VP|ADVP|PP=unmv

  • “<” denotes immediate dominance; “<<” denotes dominance

Darwin studied how species evolve.
*What did Darwin study how _ evolve?

[Parse-tree diagram omitted: SBAR, WHADVP, WRB, S, NP, VP]

More details on rules in the technical report: M. Heilman and N. A. Smith. 2009. Question Generation via Overgenerating Transformations and Ranking.
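What the Tregex pattern above matches can be sketched in pure Python over a tuple-encoded parse tree. This stands in for Tregex itself, which is not reimplemented here; the tree encoding and helper names are my own.

```python
import re

WH_CHILD = re.compile(r"^WH.*P$")
UNMOVABLE = {"NP", "ADJP", "VP", "ADVP", "PP"}

def subtrees(tree):
    """Yield every non-terminal node of a (label, *children) tuple tree."""
    if isinstance(tree, str):
        return
    yield tree
    for child in tree[1:]:
        yield from subtrees(child)

def unmovable_phrases(tree):
    """Find phrases dominated by an SBAR that immediately dominates a
    WH complementizer; these cannot undergo WH movement."""
    found = []
    for node in subtrees(tree):
        if node[0] != "SBAR":
            continue
        has_wh = any(not isinstance(c, str) and WH_CHILD.match(c[0])
                     for c in node[1:])
        if not has_wh:
            continue
        for desc in subtrees(node):
            if desc is not node and desc[0] in UNMOVABLE:
                found.append(desc)
    return found
```

On a parse of "Darwin studied how species evolve." this marks "species" (NP) and "evolve" (VP) inside the "how" clause as unmovable, which is exactly why "*What did Darwin study how _ evolve?" is blocked.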

SLIDE 13

Outline

  • Challenges in QG
  • Implementation Details
  • Step-by-Step Example
  • Rating Questions
  • Ranking Model
  • Experiments

SLIDE 14

Source sentence:

During the Gold Rush years in northern California, Los Angeles became known as the "Queen of the Cow Counties" for its role in supplying beef and other foodstuffs to hungry miners in the north.

Preprocessing, then Extraction of Simplified Factual Statements:

Los Angeles became known as the "Queen of the Cow Counties" for its role in supplying beef and other foodstuffs to hungry miners in the north.

… (other candidates)

SLIDE 15

Answer Phrase Selection:

Los Angeles became known as the "Queen of the Cow Counties" for (Answer Phrase: its role in…)

Main Verb Decomposition:

Los Angeles did become known as the "Queen of the Cow Counties" for (Answer Phrase: its role in…)

Subject-Auxiliary Inversion:

Did Los Angeles become known as the "Queen of the Cow Counties" for (Answer Phrase: its role in…)
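Main verb decomposition followed by subject-auxiliary inversion can be sketched on a whitespace-tokenized clause. The tiny past-tense table and the assumption that everything before the verb is the subject are simplifications for illustration; the real system performs these steps on parse trees.

```python
# Toy lexicon; the real system would use a morphological analyzer.
PAST_TO_BASE = {"became": "become", "studied": "study", "moved": "move"}

def decompose_main_verb(tokens):
    """Rewrite '<subject> <V-past> ...' as '<subject> did <V-base> ...'."""
    for i, tok in enumerate(tokens):
        if tok in PAST_TO_BASE:
            return tokens[:i] + ["did", PAST_TO_BASE[tok]] + tokens[i + 1:]
    return tokens

def invert_subject_aux(tokens):
    """Front the auxiliary 'did' (subject-auxiliary inversion)."""
    i = tokens.index("did")
    return ["Did"] + tokens[:i] + tokens[i + 1:]
```

Applied to "Los Angeles became known as the Queen of the Cow Counties", decomposition gives "Los Angeles did become known …" and inversion gives "Did Los Angeles become known …", matching the two steps above.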

SLIDE 16

Movement and Insertion of Question Phrase:

Did Los Angeles become known as the "Queen of the Cow Counties" for (Answer Phrase: its role in…)
What did Los Angeles become known as the "Queen of the Cow Counties" for?

Question Ranking:

  • 1. What became known as…?
  • 2. What did Los Angeles become known as the "Queen of the Cow Counties" for?
  • 3. Whose role in supplying beef…?
  • 4. …

SLIDE 17

Outline

  • Challenges in QG
  • Implementation Details
  • Step-by-Step Example
  • Rating Questions
  • Ranking Model
  • Experiments

SLIDE 18

Rating Questions

We use rated questions to…

  • Learn a ranking model
  • Evaluate our system


SLIDE 19

Sources of Data

Existing datasets of questions?

  • Not focused on sentence-level facts
  • Lack negative examples
  • Noisy (e.g., Yahoo questions)
  • Relatively small

Potential future work.

Tailored data set: annotators rated output from the overgeneration steps 1 & 2.

SLIDE 20

Rating Scheme

8 possible deficiencies

  • ungrammatical, vague, wrong WH word, …

Binary rating for each

No deficiencies: acceptable. Any deficiencies: unacceptable.

“Moderate” agreement (κ = .42)
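The inter-annotator agreement reported above is Cohen's kappa; a minimal sketch of the computation for two annotators' binary ratings follows. The data in the usage example is illustrative, not the paper's.

```python
from collections import Counter

def cohens_kappa(ratings_a, ratings_b):
    """kappa = (p_o - p_e) / (1 - p_e) for two equal-length label lists:
    observed agreement corrected for chance agreement."""
    n = len(ratings_a)
    p_o = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    freq_a, freq_b = Counter(ratings_a), Counter(ratings_b)
    labels = set(ratings_a) | set(ratings_b)
    p_e = sum((freq_a[l] / n) * (freq_b[l] / n) for l in labels)
    return (p_o - p_e) / (1 - p_e)
```

Perfect agreement gives κ = 1; agreement at exactly the chance rate gives κ = 0, so the .42 above sits in the "moderate" band of the usual Landis-Koch scale.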

SLIDE 21

Corpora

                                   Texts   Questions
English Wikipedia                    14      1,448
Simple English Wikipedia             18      1,313
Wall Street Journal (PTB Sec. 23)    10        474
Total                                42      3,235

Training: 2,807 questions (36 texts)
Testing: 428 questions (6 texts)

SLIDE 22

Outline

  • Challenges in QG
  • Implementation Details
  • Step-by-Step Example
  • Rating Questions
  • Ranking Model
  • Experiments

SLIDE 23

Ranking Model

Logistic Regression

  • y ∈ {acceptable, unacceptable}
  • Parameters are estimated by optimizing L2-regularized conditional log-likelihood.
  • We use a variant of Newton’s method.

To rank, sort by P(acceptable | question).

  • le Cessie & Houwelingen, 97
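Ranking by a logistic-regression score can be sketched as follows: each candidate question is a feature vector, and candidates are sorted by the model's probability of acceptability. The weights and feature names here are made up for illustration; the real model's weights are learned from the rated data.

```python
import math

def sigmoid(z):
    """Logistic function mapping a real score to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def p_acceptable(features, weights, bias=0.0):
    """P(acceptable | question) under a logistic-regression model."""
    return sigmoid(sum(w * x for w, x in zip(weights, features)) + bias)

def rank(candidates, weights):
    """Sort (question, feature-vector) pairs by descending acceptability."""
    return sorted(candidates,
                  key=lambda qf: p_acceptable(qf[1], weights),
                  reverse=True)
```

For example, with a hypothetical weight vector [1.0, -2.0] (a length bonus and a vagueness penalty), a candidate with vagueness feature 0 outranks an otherwise identical candidate with vagueness feature 1.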

SLIDE 24

Surface Features

  • WH words in question
  • Negation words in question
  • Language model probabilities
  • Sentence lengths

Separate features for question, source sentence, answer phrase.

SLIDE 25

Features based on Syntactic Analysis

Grammatical categories

  • Numbers of POS tags, NPs, VPs, etc.

Transformations

  • E.g., extracted from relative clause

“Vague NP”

  • Counts of NPs headed by common nouns and with no modifiers
  • 1.0 for “the president”
  • 0.0 for “Abraham Lincoln” or “the U.S. president during the Civil War”
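The "vague NP" count can be sketched on the same tuple-tree encoding: an NP counts as vague when its head is a bare common noun with nothing beyond an optional determiner. The exact modifier test is a simplification of the paper's criterion.

```python
COMMON_NOUN = {"NN", "NNS"}

def subtrees(tree):
    """Yield every non-terminal node of a (label, *children) tuple tree."""
    if isinstance(tree, str):
        return
    yield tree
    for child in tree[1:]:
        yield from subtrees(child)

def is_vague_np(node):
    """NP whose children are at most a determiner plus one common noun,
    e.g. (NP (DT the) (NN president)), but not (NP (NNP Abraham) (NNP Lincoln))."""
    if isinstance(node, str) or node[0] != "NP":
        return False
    kids = [c for c in node[1:] if not isinstance(c, str)]
    non_det = [c for c in kids if c[0] != "DT"]
    return len(non_det) == 1 and non_det[0][0] in COMMON_NOUN

def count_vague_nps(tree):
    """Feature value: number of vague NPs in the parse."""
    return sum(is_vague_np(node) for node in subtrees(tree))
```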

SLIDE 26

Outline

  • Challenges in QG
  • Implementation Details
  • Step-by-Step Example
  • Rating Questions
  • Ranking Model
  • Experiments

SLIDE 27

Evaluation Metric

Percentage of top-ranked test set questions that were rated acceptable
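The metric is a simple acceptability-at-N computation; a minimal sketch, taking the ranked list's human acceptability labels (best-ranked first):

```python
def pct_acceptable_at(ranked_labels, n):
    """Percentage of the top-n ranked questions rated acceptable.
    ranked_labels: booleans in rank order, best first."""
    top = ranked_labels[:n]
    return 100.0 * sum(top) / len(top)
```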

SLIDE 28

Rankers & Baselines

  • Ranker with all features
  • Ranker with surface features only: sentence lengths, WH words, negation, language model log probabilities
  • Expected random (i.e., no ranking)
  • Oracle

SLIDE 29

Ranking Results

[Line chart omitted: Pct. Rated Acceptable (y-axis, 20%–70%) vs. Number of Top-Ranked Questions (x-axis, 100–400) on the test set, with curves for Oracle, All Features, Surface Features, and Expected Random.]

Noisy at top ranks.

  • All Features performed significantly better than Surface Features (p < .05).

SLIDE 30

Ablation Experiments

Feature Set          % Acceptable in Top-Ranked Fifth
All Features                52.3
All – Length                52.3
All – Negation              51.7
All – Lang. Model           51.2
All – WH                    50.6
All – Vagueness             48.3
All – Transforms            46.5
All – Grammatical           43.2

SLIDE 31

Conclusions

Overgeneration and ranking for QG.

  • Rules encode linguistic knowledge
  • Statistical ranker captures trends not easily encoded with rules

Statistical ranking improved top-ranked output.
SLIDE 32

Questions?

Generated from our paper’s abstract:

  • Which challenge do we address?
  • Who use manually written rules to perform a sequence of general purpose syntactic transformations to turn declarative sentences into questions?
  • Is our approach to overgenerate questions, then rank them?
  • What kind of regression model are these questions then ranked by?
  • What do experimental results show that ranking nearly doubles?
  • What kind of results show that ranking nearly doubles the percentage of questions rated as acceptable by annotators ranked 20 % of questions?