Synthesis of Geometry Proof Problems Chris Alvin, Louisiana State - - PowerPoint PPT Presentation

synthesis of geometry proof problems
SMART_READER_LITE
LIVE PREVIEW

Synthesis of Geometry Proof Problems Chris Alvin, Louisiana State - - PowerPoint PPT Presentation

Synthesis of Geometry Proof Problems Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Rupak Majumdar, Max Planck Institute for Software Systems AAAI 14


slide-1
SLIDE 1

Synthesis of Geometry Proof Problems

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Rupak Majumdar, Max Planck Institute for Software Systems AAAI ’14 Thursday, July 31, 2014

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 1 / 21

slide-2
SLIDE 2

Motivation

Motivation for Automatic Problem Generation

Difficulties students face with their mathematics education: Limited textbook problems, Overcoming absences and reteaching, Variance of time to mastery (slow or fast), and Acquiring problems using personally-designed criteria. Difficulties teachers face educating students: Efficiently develop supplementary materials, Write multiple versions of exams, and Differentiate instruction effectively.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 2 / 21

slide-3
SLIDE 3

Motivation

Why Geometry Domain?

Problem synthesis techniques have been restricted to mostly algebraic

  • domains. [Wolfram-Alpha, Gulwani et al. AAAI ’12, etc.]

Reasoning about diagrams is non-trivial. Automatic theorem proving is well-studied, but basic geometry solution and problem synthesis have yet to be explored.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 3 / 21

slide-4
SLIDE 4

Main Question

Question Answered

Can we synthesize (other) geometry proof problems (and their solutions) from a figure together with a set of properties true of that figure?

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 4 / 21

slide-5
SLIDE 5

Contributions

Our Contributions

We formalize the notion of a geometry proof problem. We present a technique for generating proof problems over a geometric figure in a system we call GeoTutor. Our semi-automated approach takes a figure, analyzes, and generates problems within a few seconds. Supports queryable problem properties.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 5 / 21

slide-6
SLIDE 6

Example

Textbook Problem

If ∆ABE ∼ = ∆ACD, show that ∆ADE ∼ ∆ABC.

B A C D E X

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 6 / 21

slide-7
SLIDE 7

Example

Demonstration

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 7 / 21

slide-8
SLIDE 8

Example

Solution

△ADE ∼ △ABC SAS Similarity ∠EAD ∼ = ∠EAD EA AC = DA AB Reflexive AB ∼ = AC EA ∼ = DA ∼ = segments are proportional △ABE ∼ = △ACD Given C P C T C CPCTC B A C D E X Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 8 / 21

slide-9
SLIDE 9

Definitions

Internal Representation: Hypergraph

Geometric deductions can be written as logical propositions such as P1, P2, . . . , Pk ⊢ Pn as evidenced below with the SAS congruence axiom.

∼ = sides ∼ = ∠’s ∼ = sides ∆2 ∆1 ∼ = ∆’s

We will use a directed hypergraph with edges being many-to-one.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 9 / 21

slide-10
SLIDE 10

Definitions

Definition: Geometry Proof Problem

Definition Let Fig be a figure and let Axioms be a set of geometry axioms. A geometry proof problem over (Fig, Axioms) is a pair (assumptions, goals), where the assumptions and goals are sets of explicit facts about Fig such that an assumption is not a goal, the implicit facts of Fig, assumptions, and Axioms imply each goal in the set of goals using first-order reasoning. Observe that a problem (and solution) is then a path in the hypergraph.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 10 / 21

slide-11
SLIDE 11

Definitions

Problem Synthesis Example

Consider the following statement; we call it the Midpoint Theorem. If segment AB has midpoint M, then 2AM = AB and 2MB = AB.

A B M

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 11 / 21

slide-12
SLIDE 12

Definitions

Problem Synthesis: Midpoint Theorem

2AM = AB

Subst + Simp

AM + MB = AB AM = MB

Seg Addition Ax

Between(M, AB) AM = MB

  • Def. of Midpoint

Midpoint(M, AB)

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB:

2MB = AB

Subst + Simp Substitution Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-13
SLIDE 13

Definitions

Problem Synthesis: Midpoint Theorem

Subst + Simp

AM + MB = AB AM = MB

Seg Addition Ax

Between(M, AB) AM = MB

  • Def. of Midpoint

Midpoint(M, AB)

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB:

2AM = AB 2MB = AB

Subst + Simp Substitution Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-14
SLIDE 14

Definitions

Problem Synthesis: Midpoint Theorem

AM + MB = AB AM = MB

Seg Addition Ax

Between(M, AB) AM = MB

  • Def. of Midpoint

Midpoint(M, AB)

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB:

2AM = AB

Subst + Simp Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-15
SLIDE 15

Definitions

Problem Synthesis: Midpoint Theorem

Seg Addition Ax

Between(M, AB) AM = MB

  • Def. of Midpoint

Midpoint(M, AB)

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB:

2AM = AB

Subst + Simp

AM + MB = AB

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-16
SLIDE 16

Definitions

Problem Synthesis: Midpoint Theorem

Between(M, AB) AM = MB

  • Def. of Midpoint

Midpoint(M, AB)

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB:

2AM = AB

Subst + Simp

AM + MB = AB

Seg Addition Ax Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-17
SLIDE 17

Definitions

Problem Synthesis: Midpoint Theorem

AM = MB

  • Def. of Midpoint

Midpoint(M, AB)

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB:

2AM = AB

Subst + Simp

AM + MB = AB

Seg Addition Ax

Between(M, AB)

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-18
SLIDE 18

Definitions

Problem Synthesis: Midpoint Theorem

AM = MB

  • Def. of Midpoint

Midpoint(M, AB)

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB: (S): Between(M, AB) (G) AM + MB = AB

2AM = AB

Subst + Simp

AM + MB = AB

Seg Addition Ax

Between(M, AB)

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-19
SLIDE 19

Definitions

Problem Synthesis: Midpoint Theorem

  • Def. of Midpoint

Midpoint(M, AB)

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB: (S): Between(M, AB) (G) AM + MB = AB

2AM = AB

Subst + Simp

AM + MB = AB

Seg Addition Ax

Between(M, AB) AM = MB

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-20
SLIDE 20

Definitions

Problem Synthesis: Midpoint Theorem

Midpoint(M, AB)

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB: (S): Between(M, AB) (G) AM + MB = AB

2AM = AB

Subst + Simp

AM + MB = AB

Seg Addition Ax

Between(M, AB) AM = MB

  • Def. of Midpoint

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-21
SLIDE 21

Definitions

Problem Synthesis: Midpoint Theorem

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB: (S): Between(M, AB) (G) AM + MB = AB

2AM = AB

Subst + Simp

AM + MB = AB

Seg Addition Ax

Between(M, AB) AM = MB

  • Def. of Midpoint

Midpoint(M, AB)

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-22
SLIDE 22

Definitions

Problem Synthesis: Midpoint Theorem

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB: (S): Between(M, AB) (G) AM + MB = AB (S): Midpoint(M, AB) (G) AM = MB

2AM = AB

Subst + Simp

AM + MB = AB

Seg Addition Ax

Between(M, AB) AM = MB

  • Def. of Midpoint

Midpoint(M, AB)

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-23
SLIDE 23

Definitions

Problem Synthesis: Midpoint Theorem

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB: (S): Between(M, AB) (G) AM + MB = AB (S): Midpoint(M, AB) (G) AM = MB Seg Addition Ax

Between(M, AB)

  • Def. of Midpoint

Midpoint(M, AB)

Subst + Simp

AM + MB = AB AM = MB 2AM = AB

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-24
SLIDE 24

Definitions

Problem Synthesis: Midpoint Theorem

Generated Problems (Hashed by Goal) Between(M, AB): Midpoint(M, AB): AM + MB = AB: AM = MB: 2AM = AB: 2MB = AB: (S): Between(M, AB) (G) AM + MB = AB (S): Midpoint(M, AB) (G) AM = MB (S): Between(M, AB), Midpoint(M, AB) (P): AM + MB = AB AM = MB (G) 2AM = AB Seg Addition Ax

Between(M, AB)

  • Def. of Midpoint

Midpoint(M, AB)

Subst + Simp

AM + MB = AB AM = MB 2AM = AB

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 12 / 21

slide-25
SLIDE 25

Definitions

Interesting Geometry Proof Problem

Definition A geometry problem (assumptions, goals) over (Fig, Axioms) is interesting if the set of assumptions is minimal.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 13 / 21

slide-26
SLIDE 26

Definitions

Interesting vs. Uninteresting

Assume: AD = BC, AB CD, and AC BD. Goal: Prove ABCD is a rectangle. A B C D ∧ ∧ ∧ ∧ > > Solution

ABCD is a Rectangle ∼ = diagonals of Parallelogram ⇒ Rectangle ABCD is Par- allelogram AD = BC Definition of Parallelogram AC BD AB CD Given Given Given

The assumptions are minimal to prove the goal; this problem is interesting.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 14 / 21

slide-27
SLIDE 27

Definitions

Interesting vs. Uninteresting

Assume: AD = BC, AB CD, AC BD, and m∠ACD = 90o Goal: Prove ABCD is a rectangle. A B C D ∧ ∧ ∧ ∧ > > Solution

ABCD is a Rectangle ∼ = diagonals of Parallelogram ⇒ Rectangle ABCD is Par- allelogram AD = BC Definition of Parallelogram AC BD AB CD Given Given Given

Adding assumption m∠ACD = 90o results in an uninteresting problem.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 14 / 21

slide-28
SLIDE 28

Definitions

Strict Geometry Proof Problem

Definition An interesting problem is strict if the set of goals is minimal.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 15 / 21

slide-29
SLIDE 29

Definitions

Strict vs. Non-Strict

Assume: AD = BC, AB CD, and AC BD. Goal: Prove ABCD is a rectangle. A B C D ∧ ∧ ∧ ∧ > > Solution

ABCD is a Rectangle ∼ = diagonals of Parallelogram ⇒ Rectangle ABCD is Par- allelogram AD = BC Definition of Parallelogram AC BD AB CD Given Given Given

Since there is a single goal, the problem is vacuously strict.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 16 / 21

slide-30
SLIDE 30

Definitions

Strict vs. Non-Strict

Assume: AD = BC, AB CD, and AC BD. Goals: Prove ∠ABD is supplementary to ∠BDC and ABCD is a rectangle. A B C D ∧ ∧ ∧ ∧ > > Solution

ABCD is a Rectangle ∼ = diagonals of Parallelogram ⇒ Rectangle AD = BC ABCD is Par- allelogram ∠ABD supp. ∠BDC Definition of Parallelogram AC BD AB CD Given Given Given

Adding a goal not in the solution path results in a non-strict problem.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 16 / 21

slide-31
SLIDE 31

Definitions

Complete Geometry Proof Problem

Definition An interesting geometry problem (assumptions, goals) over (Fig, Axioms) is complete if the implicit facts of Fig, assumptions, and Axioms defines all explicit facts of the figure.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 17 / 21

slide-32
SLIDE 32

Definitions

Complete vs. Interesting

Assume AD = BC, AB CD, and AC BD. Goal: Prove quadrilateral ABCD is a rectangle. A B C D ∧ ∧ ∧ ∧ > > Complete A B C D ∧ ∧ ∧ ∧ > > Interesting, Not Complete The quadrilateral on the right is a square, alas, we cannot strengthen beyond a rectangle since no information is provided about congruent sides.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 18 / 21

slide-33
SLIDE 33

Evaluation

Evaluation Methodology

The corpus contained 110 figures and 155 textbook problems from textbooks in India and the United States. Each textbook problem is defined as a triple: T =< FT, AT, GT > where: FT denotes the set of intrinsic properties of the figure, AT denotes the assumptions as stated in the textbook, and GT the set of goals as stated in the textbook. Our synthesis is sound if the respective set of generated interesting (or complete) problems contains the original problems stated in the textbook.

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 19 / 21

slide-34
SLIDE 34

Evaluation

Results

Figures 110 Strictly Complete Textbook Problems 45 Strictly Interesting Textbook Problems 65

  • Ave. Generated 1-Goal Problems

37

  • Ave. Generated 2-Goal Problems

443 Time (secs / figure) 4.7

Table: Cumulative Results of Synthesis

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 20 / 21

slide-35
SLIDE 35

Thank you for your attention. Any questions?

Chris Alvin, Louisiana State University In Collaboration with Supratik Mukhopadhyay, LSU Sumit Gulwani, Microsoft Research, Redmond Synthesis of Geometry Proof Problems Thursday, July 31, 2014 21 / 21