Reporter: Weibo Gao Anhui Province Key Laboratory of Big Data - - PowerPoint PPT Presentation

reporter weibo gao
SMART_READER_LITE
LIVE PREVIEW

Reporter: Weibo Gao Anhui Province Key Laboratory of Big Data - - PowerPoint PPT Presentation

Reporter: Weibo Gao Anhui Province Key Laboratory of Big Data Analysis and Application 1 Outline Background 1 Problem Definition 2 Framework 3 Experiment 4 Conclusion & Future work 5 Anhui Province Key Laboratory of Big Data


slide-1
SLIDE 1

Anhui Province Key Laboratory of Big Data Analysis and Application

1

Reporter: Weibo Gao

slide-2
SLIDE 2

Anhui Province Key Laboratory of Big Data Analysis and Application

2

Outline

Background 1 2 Problem Definition Framework 3 Experiment 4 Conclusion & Future work 5

slide-3
SLIDE 3

Anhui Province Key Laboratory of Big Data Analysis and Application

3

Background

ØAutomatically answering math problems

Ø A crucial and challenging task in AI Ø Requirements Ø Linguistic understanding ability Ø Semantic understanding Ø Operator extraction Ø Mathematical comprehension ability Ø Understand formulas with free-text format

slide-4
SLIDE 4

Anhui Province Key Laboratory of Big Data Analysis and Application

4

Related work

ØMath word problem

Ø Elementary problem (primary school level) Ø Translate questions text into expression forms for answers Ø Existing methods Ø Rules-schemes-matching methods Ø Statistical learning Ø E.g., template-based, tree-based Ø Seq2seq deep learning

Gwen was organizing her book case making sure each of the shelves had exactly 9 books on

  • it. She has 2 types of books - mystery books

and picture books. If she had 3 shelves of mystery books and 5 shelves of picture books. How many books did she have total?

Math word problem expression

(3 5) 9 72 + ´ =

expression tree Just consist of natural language content

slide-5
SLIDE 5

Anhui Province Key Laboratory of Big Data Analysis and Application

5

Background

ØMath word Problem

Ø Elementary problem (primary school level) Ø Linguistic learning for natural language content Ø Operator extraction (+) Ø Semantic understanding

ØMathematical problem

Ø Complex problem (high school level) Ø Language content Ø Specific but informative formulas Ø Requirement Ø Linguistic understanding Ø Mathematical comprehension

Math word problem Mathematical problem

slide-6
SLIDE 6

Anhui Province Key Laboratory of Big Data Analysis and Application

6

Background

ØChallenges: How to represent formula-enriched problem?

Ø How to to understand formulas with their free-text format? Ø How to design a unified architecture to incorporate linguistic and mathematical information?

! sin % 2 sin, √, x, /, 2 s, i, n, √, x, /, 2

word-level character-level Mathematical information Linguistic information

\sin, \sqrt, \frac

slide-7
SLIDE 7

Anhui Province Key Laboratory of Big Data Analysis and Application

7

Outline

Background 1 2 Problem Definition Framework 3 Experiment 4 Conclusion & Future work 5

slide-8
SLIDE 8

Anhui Province Key Laboratory of Big Data Analysis and Application

8

Problem Definition

ØGiven

Ø Mathematical problem: Ø Token: is a word token or formula token (e.g., quantities, symbols)

ØGoal

Ø Read tokens from Ø Gnerate answer sequence:

{ }

1 2

, , ,

L

P p p p = !

i

p P

{ }

1 2

, , ,

T

Y y y y = ! Answer: 30

3

= { }

Y

1

Y

2

Y Answer sequence Y Mathematical problem P

Problem: Let 3 + x = 13 . Solve x .

Let 3 + x ... Solve x . Let 3 + x

= { }

P

1 w

P

2 f

P

3 f

P

4 f

P

9 w

P

10 f

P

11 w

P

...

Fomulas

: word token : fomula token

w i

P

f i

P

slide-9
SLIDE 9

Anhui Province Key Laboratory of Big Data Analysis and Application

9

Outline

Background 1 3 Framework Problem Definition 2 Experiment 4 Conclusion & Future work 5

slide-10
SLIDE 10

Anhui Province Key Laboratory of Big Data Analysis and Application

10

NMS Framework

ØNMS framework

Ø Formula Graph Construction Ø Develop an assistant tool to construct formula dependency graph Ø Neural Solver Ø FGN: Formula graph network Ø Sequence model: Encoder-Decoder architecture

Semantic space Mathematical space

slide-11
SLIDE 11

Anhui Province Key Laboratory of Big Data Analysis and Application

11

NMS Framework

ØFormula graph construction

Ø Goal: present formulas in a structural way Ø Develop a TeX-based formula-dependent graph tool Ø Nodes Ø Variables: Ø Numbers: 2 Ø Operators: \tan Ø Edges (four relasions) Ø Brother, father, child Ø Relative Ø Features Ø Attribute, content

Ø Reduce redundant Ø Keep structure information Ø Enhance semantic information

Advantages q

slide-12
SLIDE 12

Anhui Province Key Laboratory of Big Data Analysis and Application

12

NMS Framework

ØNeural solver

Ø FGN: capture fomula structure information Ø Sequence model: incorporate semantic and structural information

Neural solver

Formula Graph Network

slide-13
SLIDE 13

Anhui Province Key Laboratory of Big Data Analysis and Application

13

Outline

Background 1 4 Experiment Problem Definition 2 Framework 3 Conclusion & Future work 5

slide-14
SLIDE 14

Anhui Province Key Laboratory of Big Data Analysis and Application

14

Experiment

ØDataset

Ø MATH dataset (high school level)

ØData analysis

Ø Formula tokens take large portions Ø 69% on average Ø Larger portions in shorter problems

ØBaseline methods (seq2seq)

Ø GRU Ø BiGRU Ø RMC Ø Attention Ø Transformer

ØEvaluation metrics

Ø ACC, BLEU, ROUGE

slide-15
SLIDE 15

Anhui Province Key Laboratory of Big Data Analysis and Application

15

Experiment

ØExperiment

Ø Task: solving mathematical problems Ø Observations Ø NMS performs the best Ø Capture mathematical relations effectively Ø Transformer and Seq2Seq-BiGRU perform better than other baselines Ø Design sophisticated encoders Ø RMC performs not very well Ø Probably because it requires many parameters

slide-16
SLIDE 16

Anhui Province Key Laboratory of Big Data Analysis and Application

16

Experiment

ØVisualization

Ø Task: project problems embeddings into 2D space by t-SNE Ø Observations Ø Problems with same concepts learned are easier to be grouped Ø They are closer in the hidden space Ø Problems with simple formula structures cluster nearly Ø E.g., “Set” problems Ø Many types of formulas cause different patterns Ø E.g., “Function” problems More reasonable

slide-17
SLIDE 17

Anhui Province Key Laboratory of Big Data Analysis and Application

17

Outline

Background 1 5 Conclusion & Future work Problem Definition 2 Framework 3 Experiment 4

slide-18
SLIDE 18

Anhui Province Key Laboratory of Big Data Analysis and Application

18

Conclusion & Future work

ØOverall results

Ø Develop a TeX-based formula-dependent graph tool to maintain the structural information of each problem. Ø Design FGN to capture mathematical relations. Ø Design a neural solver to incorporate semantic infomation and structural infomation.

ØFuture work

Ø Seek ways to predict quantities effectively Ø ½ vs. ⁄

""" ###

Ø Design different graph networks for learning formula structure Ø Reasoning on different problem types Ø Consider more specific structures of more complex problems Ø “geometry” problem: containing figures

slide-19
SLIDE 19

Anhui Province Key Laboratory of Big Data Analysis and Application

19

Thanks for your listening!

huangzhy@mail.ustc.edu.cn