reporter weibo gao
play

Reporter: Weibo Gao Anhui Province Key Laboratory of Big Data - PowerPoint PPT Presentation

Reporter: Weibo Gao Anhui Province Key Laboratory of Big Data Analysis and Application 1 Outline Background 1 Problem Definition 2 Framework 3 Experiment 4 Conclusion & Future work 5 Anhui Province Key Laboratory of Big Data


  1. Reporter: Weibo Gao Anhui Province Key Laboratory of Big Data Analysis and Application 1

  2. Outline Background 1 Problem Definition 2 Framework 3 Experiment 4 Conclusion & Future work 5 Anhui Province Key Laboratory of Big Data Analysis and Application 2

  3. Background Ø Automatically answering math problems Ø A crucial and challenging task in AI Ø Requirements Ø Linguistic understanding ability Ø Semantic understanding Ø Operator extraction Ø Mathematical comprehension ability Ø Understand formulas with free-text format Anhui Province Key Laboratory of Big Data Analysis and Application 3

  4. Related work Ø Math word problem Ø Elementary problem (primary school level) Ø Translate questions text into expression forms for answers Ø Existing methods Ø Rules-schemes-matching methods Ø Statistical learning Ø E.g., template-based, tree-based Ø Seq2seq deep learning Just consist of natural language content expression tree Math word problem expression Gwen was organizing her book case making sure each of the shelves had exactly 9 books on + ´ = (3 5) 9 72 it. She has 2 types of books - mystery books and picture books. If she had 3 shelves of mystery books and 5 shelves of picture books. How many books did she have total? Anhui Province Key Laboratory of Big Data Analysis and Application 4

  5. Background Ø Math word Problem Ø Elementary problem (primary school level) Ø Linguistic learning for natural language content Ø Operator extraction (+) Ø Semantic understanding Ø Mathematical problem Ø Complex problem (high school level) Ø Language content Ø Specific but informative formulas Ø Requirement Ø Linguistic understanding Ø Mathematical comprehension Math word problem Mathematical problem Anhui Province Key Laboratory of Big Data Analysis and Application 5

  6. Background Ø Challenges: How to represent formula-enriched problem? Ø How to to understand formulas with their free-text format? sin, √ , x, /, 2 word-level sin % ! 2 s, i, n, √ , x, /, 2 character-level \sin, \sqrt, \frac Ø How to design a unified architecture to incorporate linguistic and mathematical information? Linguistic Mathematical information information Anhui Province Key Laboratory of Big Data Analysis and Application 6

  7. Outline Background 1 Problem Definition 2 Framework 3 Experiment 4 Conclusion & Future work 5 Anhui Province Key Laboratory of Big Data Analysis and Application 7

  8. Problem Definition Ø Given { } = Ø Mathematical problem: P p p , , ! , p 1 2 L Ø Token: is a word token or formula token (e.g., quantities, symbols) p i Ø Goal Ø Read tokens from P { } = Ø Gnerate answer sequence: Y y y , , ! , y 1 2 T Fomulas Answer: 30 Problem: Let 3 + x = 13 . Solve x . P Y = { } = { } Let 3 + x 3 0 Let 3 + x ... Solve x . f w f f w f w Y P P P P P P P Y ... 4 1 2 1 2 3 9 10 11 Answer sequence Y w f P P : word token : fomula token i i Mathematical problem P Anhui Province Key Laboratory of Big Data Analysis and Application 8

  9. Outline Background 1 Problem Definition 2 Framework 3 Experiment 4 Conclusion & Future work 5 Anhui Province Key Laboratory of Big Data Analysis and Application 9

  10. NMS Framework Ø NMS framework Ø Formula Graph Construction Ø Develop an assistant tool to construct formula dependency graph Ø Neural Solver Ø FGN: Formula graph network Ø Sequence model: Encoder-Decoder architecture Semantic space Mathematical space Anhui Province Key Laboratory of Big Data Analysis and Application 10

  11. NMS Framework Ø Formula graph construction Ø Goal : present formulas in a structural way Ø Develop a TeX-based formula-dependent graph tool Ø Nodes Ø Variables: q Ø Numbers: 2 Ø Operators: \tan Ø Edges (four relasions) Ø Brother, father, child Ø Relative Ø Features Ø Attribute, content Advantages Ø Reduce redundant Ø Keep structure information Ø Enhance semantic information Anhui Province Key Laboratory of Big Data Analysis and Application 11

  12. NMS Framework Ø Neural solver Ø FGN: capture fomula structure information Ø Sequence model: incorporate semantic and structural information Formula Graph Network Neural solver Anhui Province Key Laboratory of Big Data Analysis and Application 12

  13. Outline Background 1 Problem Definition 2 Framework 3 Experiment 4 Conclusion & Future work 5 Anhui Province Key Laboratory of Big Data Analysis and Application 13

  14. Experiment Ø Dataset Ø MATH dataset (high school level) Ø Data analysis Ø Formula tokens take large portions Ø 69% on average Ø Larger portions in shorter problems Ø Baseline methods (seq2seq) Ø GRU Ø BiGRU Ø RMC Ø Attention Ø Transformer Ø Evaluation metrics Ø ACC, BLEU, ROUGE Anhui Province Key Laboratory of Big Data Analysis and Application 14

  15. Experiment Ø Experiment Ø Task: solving mathematical problems Ø Observations Ø NMS performs the best Ø Capture mathematical relations effectively Ø Transformer and Seq2Seq-BiGRU perform better than other baselines Ø Design sophisticated encoders Ø RMC performs not very well Ø Probably because it requires many parameters Anhui Province Key Laboratory of Big Data Analysis and Application 15

  16. Experiment Ø Visualization Ø Task: project problems embeddings into 2D space by t-SNE Ø Observations Ø Problems with same concepts learned are easier to be grouped Ø They are closer in the hidden space Ø Problems with simple formula structures cluster nearly Ø E.g., “Set” problems Ø Many types of formulas cause different patterns Ø E.g., “Function” problems More reasonable Anhui Province Key Laboratory of Big Data Analysis and Application 16

  17. Outline Background 1 Problem Definition 2 Framework 3 Experiment 4 Conclusion & Future work 5 Anhui Province Key Laboratory of Big Data Analysis and Application 17

  18. Conclusion & Future work Ø Overall results Ø Develop a TeX-based formula-dependent graph tool to maintain the structural information of each problem. Ø Design FGN to capture mathematical relations. Ø Design a neural solver to incorporate semantic infomation and structural infomation. Ø Future work Ø Seek ways to predict quantities effectively """ ### ⁄ Ø ½ vs. Ø Design different graph networks for learning formula structure Ø Reasoning on different problem types Ø Consider more specific structures of more complex problems Ø “geometry” problem: containing figures Anhui Province Key Laboratory of Big Data Analysis and Application 18

  19. Thanks for your listening! huangzhy@mail.ustc.edu.cn Anhui Province Key Laboratory of Big Data Analysis and Application 19

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend