Math Problems Takuya Matsuzaki Noriko H. Arai (Nagoya University) - PowerPoint PPT Presentation

Solving Natural Language Math Problems Takuya Matsuzaki Noriko H. Arai (Nagoya University) (National Institute of Informatics)

Solving NL Math – why? • It is the first and the last goal of symbolic approach to language understanding (LU) • Formalization of the domain is the prerequisite for LU • Problem solving is the only way to compare different LU systems • Only the input and output are observable • No ground-truth for a mid- layer’s output

System Overview Let 𝑚 be the trajectory of 𝑢 + 2, 𝑢 + 2, 𝑢 f or 𝑢 ranging over ℝ. Problem 𝑃 0, 0, 0 , 𝐵 2, 1, 0 , and 𝐶 1, 2, 0 are on a sphere, 𝑇, centered at 𝐷 𝑏, 𝑐, 𝑑 . Determine the condition on 𝑏, 𝑐, 𝑑 for which 𝑇 intersects with 𝑚. Language Understanding Logical Form in a HOL Formula Rewriting Logical Form in Local Theories CA & ATP Answer 3

Today’s Topics • Parsing Math Problem Text with Combinatory Categorial Grammar • Benchmarking a CAS-based solver with formalized pre-university math problems

Combinatory Categorial Grammar • Word ⇔ (syntactic category, λ -expression) Word type Example “John” ⇔ ( 𝑂𝑄 , john ) Proper noun “cat” ⇔ ( 𝑂 , λx.cat(x)) Common noun “runs” ⇔ ( S ∖ 𝑂𝑄 , λx.run (x)) Intransitive verb “loves” ⇔ ( S ∖ 𝑂𝑄/𝑂𝑄 , λy.λx.love (x,y)) Transitive verb “a” ⇔ ( S/(S ∖ 𝑂𝑄)/𝑂 , λN.λP . ∃ x(Nx ∧ Px)) Indefinite article “every” ⇔ ( S/(S ∖ 𝑂𝑄)/𝑂 , λN.λP . ∀ x(Nx  Px)) Quantifier

Combinatory rules Forward application Backward application Forward composition Y : y X ∖ Y : f X / Y : f Y / Z : g X / Y : f Y : y etc. > < > B X / Z : λz.f (gz) X : f y X : f y a cat S ∖ 𝑂𝑄 ∖ (S ∖ 𝑂𝑄/𝑂𝑄)/𝑂 : 𝑂 : loves λN.λP.λy . ∃ x(Nx ∧ Pxy) λx.cat(x) > S ∖ 𝑂𝑄/𝑂𝑄 : S ∖ 𝑂𝑄 ∖ (S ∖ 𝑂𝑄/𝑂𝑄) : λP.λy . ∃ x(cat(x) ∧ Pxy) λx.λy.love (y,x) John < S ∖ 𝑂𝑄 : λy . ∃ x(cat(x) ∧ love(y,x)) 𝑂𝑄 : john < S : ∃ x(cat(x) ∧ love(john,x))

Combinatory rules Forward application Backward application Forward composition Y : y X ∖ Y : f X / Y : f Y / Z : g X / Y : f Y : y etc. > < > B X / Z : λz.f (gz) X : f y X : f y a cat S ∖ 𝑂𝑄 ∖ (S ∖ 𝑂𝑄/𝑂𝑄)/𝑂 : 𝑂 : λN.λP.λy . ∃ x(Nx ∧ Pxy) loves λx.cat(x) > S ∖ 𝑂𝑄/𝑂𝑄 : S ∖ 𝑂𝑄 ∖ (S ∖ 𝑂𝑄/𝑂𝑄) : λP.λy . ∃ x(cat(x) ∧ Pxy) λx.λy.love (y,x) John < S ∖ 𝑂𝑄 : λy . ∃ x(cat(x) ∧ love(y,x)) 𝑂𝑄 : john < S : ∃ x(cat(x) ∧ love(john,x))

Syntactic Category = Semantic Type + Syntactic Constraints Example “distance” (as in “distance between P and Q ”) • Syntactic cat.: NP Real /PP between , (Point,Point) • Semantic function : λp.dist (p) • Semantic type: (Point, Point)  Real P and Q 𝑂𝑄 𝛽,𝛾 ∖ 𝑂𝑄 𝛽 /𝑂𝑄 𝛾 : 𝑂𝑄 𝑄𝑜𝑢 : 𝑂𝑄 𝑄𝑜𝑢 : P Q λy.λx .(x,y) between 𝑂𝑄 (𝑄𝑜𝑢,𝑄𝑜𝑢) : 𝑄𝑄 𝑐𝑢𝑥𝑜,(𝛽,𝛾) /𝑂𝑄 (𝛽,𝛾) : distance (P,Q) id 𝑂𝑄 𝑆𝑓𝑏𝑚 /𝑄𝑄 𝑐𝑢𝑥𝑜,(𝑄𝑜𝑢,𝑄𝑜𝑢) : 𝑄𝑄 𝑐𝑢𝑥𝑜,(𝑄𝑜𝑢,𝑄𝑜𝑢) : λ p.dist(p) (P,Q) 𝑂𝑄 𝑆𝑓𝑏𝑚 : dist(P,Q)

Comparison with compilers • Compilers : source code  machine code • NL parsing : math problem  logical form • NL parsing = type check + syntax check + denotational semantics • Besides, the grammar is only partially known and ambiguous

Grammar and lexicon: current status • Size • 31 combinatory rules • 6,652 different word forms • 42,154 triples of <word, category, λ -term> • What’s not in textbook (toy) grammars: • Imperatives, pluralities, relation/attribute nouns, context dependent semantics, action verbs, etc. • Coverage: • 70%~80% of university math exam sentences can be parsed (either correctly or wrongly)

Remaining issues • Lexicon / grammar coverage • Hypothesis explosion due to local ambiguity • “y = ax 2 ” : equality or λx.ax 2 or { (x,y) | y = ax 2 } • “if A then B and C” : (A  B) & C or A  (B & C) • Inter-sentential logical structure analysis. E.g., • Sentence 1: If A then B. • Sentence 2: If C then D. • (A  B) & (C  D) • A  (B & (C  D)) • (A  B) & (A  (B & C)  D)

Benchmarking CA-based Problem Solver on Formalized Pre-univ. Math Problems

Motivation • Development of the AR layer of the solver in parallel with the NLU layer • Evaluation on problems with varying difficulty • Estimation of the computational cost of the reasoning on NLU output

Benchmark Problems: Sources • Ex : 288 problems from exercise book series • 200 problems on geometry • 100 problems on integer arithmetic • Univ : 245 problems from the entrance exams of seven national universities • Geometry, real arithmetic, pre-calculus etc. expressible in the theory of RCF • IMO : 212 problems from the International Mathematics Olymipiads (1959-2014) • All geometry and real arithmetic problems • Some of number theory, combinatorics etc. • 2/3 of the all past problems till 2014

Encoding process • Six students (majored in math/CS) and two full-time researchers encoded the problems in a higher-order language • Literal translation • Word-by-word, sentence-by-sentence • No inference • No paraphrase

Example Let D be a point inside acute triangle ABC such that ∠ ADB = ∠ ACB + π/2 and AC ・ BD = AD ・ BC Calculate the ratio ( AB ・ CD )/( AC ・ BD ). (IMO 1993 Problem 2) (Find (x) (exists (A B C D) (&& (is-acute-triangle A B C) (point-inside-of D (triangle A B C)) (= (rad-of-angle (angle A D B)) (+ (rad-of-angle (angle A C B)) (/ (Pi) 2))) (= (* (distance A C) (distance B D)) (* (distance A D) (distance B D))) (= x (/ (* (distance A B) (distance C D)) (* (distance A C) (distance B D))))))))

CAS-based solver

Syntactic Profile (per problem; medians) Pre-univ math benchmark TPTP-THF Problem scale is Ex Univ IMO All at similar level # Formulas 2 2 1 1 10 # Atoms 65 95 65 72 88 Avg atoms/Fml 38 54 56 48 6 # Symbols 16 19 #of λ -abstractions 12 15 9 Different types quantifications # Variables 9 13 8 9 19 λ 3 3 1 2 2 ∀ 0 0 4 0 9 ∃ 4 6 1 4 2 # Connectives 55 78 58 61 52

Overall results Ex • Difficulty of RCF problmes: Ex < Univ < IMO • Difficulty of PA problems: Ex << IMO

Results on RCF problems in Ex • # of Stars = difficulty level assessed by the editors of the practice book series

Results on IMO problems by years • Human Efficiency: IMO participants’ avg. score • Machine Efficiency: system’s score • IMO problems get harder by year both for human and machines

Summary • Natural Language Math Solving System combining • Grammar-driven semantic analysis • Inference by QE • Benchmark result on the inference part • Excercise & entrance exam: ~60% • Mathematical Olympiads: 5~15%

Math Problems Takuya Matsuzaki Noriko H. Arai (Nagoya University) - PowerPoint PPT Presentation

Solving Natural Language Math Problems Takuya Matsuzaki Noriko H. Arai (Nagoya University) (National Institute of Informatics) Solving NL Math why? It is the first and the last goal of symbolic approach to language understanding (LU)

GUST e-Foundry MATH FONTS Latin Modern Math, ver. 1.959 T EX Gyre Bonum Math, ver. 1.005 T EX

Math 211 Math 211 Lecture #1 August 29, 2000 2 Welcome to Math 211 Welcome to Math 211 Math

GRADUATION REQUIREMENTS English 4 Credits -I, II, III, IV Math 4 Credits Math I, Math II,

Math 211 Math 211 Lecture #1 Introduction August 26, 2002 2 Welcome to Math 211 Welcome to

Math Fun For Everyone! 1 Mini Math Attitude Inventory 1. I liked Math... A. A Lot B. A

Math 211 Math 211 Lecture #1 Introduction August 27, 2001 2 Welcome to Math 211 Welcome to

Math 211 Math 211 Lecture #1 Introduction August 26, 2002 2 Welcome to Math 211 Welcome to

Solving Percent Problems Word Problems Find a Pattern Estimation Problems Fraction Problems

7th Grade Math Placement & Math Pathways Outcomes Review math placement test logistics

MATH Placement Which Math is Appropriate for your Major? Thomas Harriot College of Arts &

Making Math 20 year math teacher Differentiate between a concept and a skills

Elementary Math Sitton Elementary School May 2, 2017 Mindset What is YOUR experience with math?

7th Grade Math Placement & Math Pathways Math Program Manager slemke@fremont.k12.ca.us

Math Department Teachers Ms. A. Elliott Department Head B.Math, Waterloo Mr. J. Gmach B.Math,

Just Math Using Math to Learn Jus.ce Vs. Using Math to

the Math Classroom -Dr. JP Anderson, Academic Math -Kristen Foxley, College Prep Math

NAVIGATING THE ROAD TO COLLEGE HIGH SCHOOL COUNSELING STAFF JOAN ROSENBERG Principal LOIS

A Frame ork for A Framework for K-12 Science Education: Practices Crosscutting Concepts

Michigan Summer Symposia In the early 1920s UM had strong experimental programs with

IEEE Educational Activities 2018 Sonya Dillard Region 3 Meeting SoutheastCon 2018 Educational

MA111: Contemporary mathematics Entrance Slip (due 5 min past the hour): Five friends are trying

J W Gerth P.S., WRDSB hb_gauthier@ hotmail.com OUR LEARNING GOALS FOR TODAY 1 . Define

Automatic diagnosis and feedback for lexical stress errors in non-native speech: Towards a CAPT

Katrin Lichterfeld Balancing accent, identity and culture 1. Linguistic diversity 2. Accent and

Math Problems Takuya Matsuzaki Noriko H. Arai (Nagoya University) - PowerPoint PPT Presentation

Solving Natural Language Math Problems Takuya Matsuzaki Noriko H. Arai (Nagoya University) (National Institute of Informatics) Solving NL Math why? It is the first and the last goal of symbolic approach to language understanding (LU)

GUST e-Foundry MATH FONTS Latin Modern Math, ver. 1.959 T EX Gyre Bonum Math, ver. 1.005 T EX

Math 211 Math 211 Lecture #1 August 29, 2000 2 Welcome to Math 211 Welcome to Math 211 Math

GRADUATION REQUIREMENTS English 4 Credits -I, II, III, IV Math 4 Credits Math I, Math II,

Math 211 Math 211 Lecture #1 Introduction August 26, 2002 2 Welcome to Math 211 Welcome to

Math Fun For Everyone! 1 Mini Math Attitude Inventory 1. I liked Math... A. A Lot B. A

Math 211 Math 211 Lecture #1 Introduction August 27, 2001 2 Welcome to Math 211 Welcome to

Math 211 Math 211 Lecture #1 Introduction August 26, 2002 2 Welcome to Math 211 Welcome to

Solving Percent Problems Word Problems Find a Pattern Estimation Problems Fraction Problems

7th Grade Math Placement &amp; Math Pathways Outcomes Review math placement test logistics

MATH Placement Which Math is Appropriate for your Major? Thomas Harriot College of Arts &amp;

Making Math 20 year math teacher Differentiate between a concept and a skills

Elementary Math Sitton Elementary School May 2, 2017 Mindset What is YOUR experience with math?

7th Grade Math Placement &amp; Math Pathways Math Program Manager slemke@fremont.k12.ca.us

Math Department Teachers Ms. A. Elliott Department Head B.Math, Waterloo Mr. J. Gmach B.Math,

Just Math Using Math to Learn Jus.ce Vs. Using Math to

the Math Classroom -Dr. JP Anderson, Academic Math -Kristen Foxley, College Prep Math

NAVIGATING THE ROAD TO COLLEGE HIGH SCHOOL COUNSELING STAFF JOAN ROSENBERG Principal LOIS

A Frame ork for A Framework for K-12 Science Education: Practices Crosscutting Concepts

Michigan Summer Symposia In the early 1920s UM had strong experimental programs with

IEEE Educational Activities 2018 Sonya Dillard Region 3 Meeting SoutheastCon 2018 Educational

MA111: Contemporary mathematics Entrance Slip (due 5 min past the hour): Five friends are trying

J W Gerth P.S., WRDSB hb_gauthier@ hotmail.com OUR LEARNING GOALS FOR TODAY 1 . Define

Automatic diagnosis and feedback for lexical stress errors in non-native speech: Towards a CAPT

Katrin Lichterfeld Balancing accent, identity and culture 1. Linguistic diversity 2. Accent and

7th Grade Math Placement & Math Pathways Outcomes Review math placement test logistics

MATH Placement Which Math is Appropriate for your Major? Thomas Harriot College of Arts &

7th Grade Math Placement & Math Pathways Math Program Manager slemke@fremont.k12.ca.us