a unified model for extractive and abstractive
play

A Unified Model for Extractive and Abstractive Summarization using - PowerPoint PPT Presentation

A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss Project page Wan-Ting Hsu Chieh-Kai Lin National Tsing Hua University National Tsing Hua University 1 Outline Motivation Our Method Training


  1. A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss Project page Wan-Ting Hsu Chieh-Kai Lin National Tsing Hua University National Tsing Hua University 1

  2. Outline • Motivation • Our Method • Training Procedures • Experiments and Results • Conclusion 2

  3. Outline • Motivation • Our Method • Training Procedures • Experiments and Results • Conclusion 3

  4. Overview Textual Media People spend 12 hours everyday consuming media in 2018. – eMarketer https://www.emarketer.com/topics/topic/time-spent-with-media 4

  5. Overview Textual Media People spend 12 hours everyday consuming media in 2018. – eMarketer https://www.emarketer.com/topics/topic/time-spent-with-media 5

  6. Overview Textual Media People spend 12 hours everyday consuming media in 2018. – eMarketer https://www.emarketer.com/topics/topic/time-spent-with-media 6

  7. Overview Textual Media People spend 12 hours everyday consuming media in 2018. – eMarketer https://www.emarketer.com/topics/topic/time-spent-with-media 7

  8. Overview Text Summarization • To condense a piece of text to a shorter version while maintaining the important points 8

  9. Overview Examples of Text Summarization • Article headlines • Meeting minutes • Movie/book reviews • Bulletins (weather forecasts/stock market reports) 9

  10. Overview Examples of Text Summarization • Article headlines • Meeting minutes • Movie/book reviews • Bulletins (weather forecasts/stock market reports) 10

  11. Overview Examples of Text Summarization • Article headlines • Meeting minutes • Movie/book reviews • Bulletins (weather forecasts/stock market reports) 11

  12. Overview Examples of Text Summarization • Article headlines • Meeting minutes • Movie/book reviews • Bulletins (weather forecasts/stock market reports) 12

  13. Overview Examples of Text Summarization • Article headlines • Meeting minutes • Movie/book reviews • Bulletins (weather forecasts/stock market reports) 13

  14. Overview Automatic Text Summarization • To condense a piece of text to a shorter version while maintaining the important points Extractive Summarization Abstractive Summarization select text from the article generate the summary word-by-word 14

  15. Overview Extractive Summarization • Select phrases or sentences from the source document Representation sentence 1 1 3 9 2 sentence 2 5 6 5 7 sentence 3 8 1 1 4 … - Shen, D.; Sun, J.-T.; Li, H.; Yang, Q.; and Chen, Z. 2007. Document summarization using conditional random fields. IJCAI 2007. - Kågebäck, M., Mogren, O., Tahmasebi, N., & Dubhashi, D. Extractive Summarization using Continuous Vector Space Models. EACL 2014. - Cheng, J., and Lapata, M. Neural summarization by extracting sentences and words. ACL 2016. - Ramesh Nallapati, Feifei Zhai, and Bowen Zhou. Summarunner: A recurrent neural network based sequence model for extractive summarization of documents. AAAI 2017 15

  16. Overview Abstractive Summarization • Select phrases or sentences from the source document Article Encoder Decoder Representations - Alexander M Rush, Sumit Chopra, and Jason Weston. A neural attention model for abstractive sentence summarization. EMNLP 2015. - Ramesh Nallapati, Bowen Zhou, Cicero dos Santos, Caglar Gulcehre, and Bing Xiang. Abstractive text summarization using sequence- tosequence rnns and beyond. CoNLL 2016. - Abigail See, Peter J Liu, and Christopher D Manning. Get to the point: Summarization with pointergenerator networks. ACL 2017. - Romain Paulus, Caiming Xiong, and Richard Socher. A deep reinforced model for abstractive summarization. ICLR 2018. - Fan, Angela, David Grangier, and Michael Auli. Controllable abstractive summarization. arXiv preprint arXiv:1711.05217 (2017). 16

  17. Overview Motivation not concise • Extractive summary Italian artist Johannes Stoetter has painted two naked women (select sentences): to look like a chameleon. • important, correct • incoherent or not concise The 37-year-old has previously transformed his models into frogs and parrots but this may be his most intricate and impressive artwork to date. • Abstractive summary (generate word-by-word): • readable, concise • may lose or mistake some facts • Unified summary: • important, correct • readable, concise 17

  18. Overview Motivation not concise • Extractive summary Italian artist Johannes Stoetter has painted two naked women (select sentences): to look like a chameleon. • important, correct • incoherent or not concise The 37-year-old has previously transformed his models into frogs and parrots but this may be his most intricate and impressive artwork to date. • Abstractive summary (generate word-by-word): • readable, concise concise • may lose or mistake some facts Johannes Stoetter has previously transformed his models into frogs and parrots but this chameleon may be his most • Unified summary: impressive artwork to date. • important, correct • readable, concise 18

  19. Overview Motivation not concise • Extractive summary Italian artist Johannes Stoetter has painted two naked women (select sentences): to look like a chameleon. • important, correct • incoherent or not concise The 37-year-old has previously transformed his models into frogs and parrots but this may be his most intricate and impressive artwork to date. • Abstractive summary (generate word-by-word): • readable, concise concise • may lose or mistake some facts Justin Bieber Johannes Stoetter has previously transformed his models into frogs and parrots but this chameleon may be his most • Unified summary: impressive artwork to date. • important, correct • readable, concise 19

  20. Overview Motivation not concise • Extractive summary Italian artist Johannes Stoetter has painted two naked women (select sentences): to look like a chameleon. • important, correct • incoherent or not concise The 37-year-old has previously transformed his models into frogs and parrots but this may be his most intricate and impressive artwork to date. • Abstractive summary (generate word-by-word): • readable, concise concise • may lose or mistake some facts Justin Bieber Johannes Stoetter has previously transformed his models into frogs and parrots but this chameleon may be his most • Unified summary: impressive artwork to date. • important, correct • readable, concise 20

  21. Outline • Motivation • Our Method • Training Procedures • Experiments and Results • Conclusion 21

  22. Method Models Extractor Ramesh Nallapati, Feifei Zhai, and Bowen Zhou. Summarunner: A recurrent neural network based sequence model for extractive summarization of documents. AAAI 2017 22

  23. Method Models Extractor static sentence attention Ramesh Nallapati, Feifei Zhai, and Bowen Zhou. Summarunner: A recurrent neural network based sequence model for extractive summarization of documents. AAAI 2017 23

  24. Method Models Extractor Abstracter static sentence attention Ramesh Nallapati, Feifei Zhai, and Bowen Zhou. Summarunner: A recurrent neural Abigail See, Peter J Liu, and Christopher D Manning. Get to the point: Summarization network based sequence model for extractive summarization of documents. AAAI 2017 with pointer-generator networks. ACL 2017 24

  25. Method Models Extractor Abstracter static sentence dynamic word attention attention Ramesh Nallapati, Feifei Zhai, and Bowen Zhou. Summarunner: A recurrent neural Abigail See, Peter J Liu, and Christopher D Manning. Get to the point: Summarization network based sequence model for extractive summarization of documents. AAAI 2017 with pointer-generator networks. ACL 2017 25

  26. Method Combined Attention Extractor Abstracter static sentence dynamic word 𝛾 attention attention 𝛽 𝑛 : word index 𝑜 : sentence index 𝑢 : generated word index 26

  27. Method Combined Attention Extractor Abstracter static sentence dynamic word attention attention 𝛾 1 𝛽 1 𝛽 2 𝛽 3 Cindy is lucky. She won $1000. She is going to … 𝑛 : word index 𝑜 : sentence index 𝑢 : generated word index 27

  28. Method Combined Attention Extractor Abstracter static sentence dynamic word attention attention 𝛾 1 𝛾 2 𝛽 1 𝛽 2 𝛽 3 𝛽 4 𝛽 5 𝛽 6 Cindy is lucky. She won $1000. She is going to … 𝑛 : word index 𝑜 : sentence index 𝑢 : generated word index 28

  29. Method Combined Attention Extractor Abstracter static sentence dynamic word attention attention 𝛾 1 𝛾 3 𝛾 2 𝛽 1 𝛽 2 𝛽 3 𝛽 4 𝛽 5 𝛽 6 𝛽 7 𝛽 8 𝛽 9 … Cindy is lucky. She won $1000. She is going to … 𝑛 : word index 𝑜 : sentence index 𝑢 : generated word index 29

  30. Method Combined Attention • Our unified model combines sentence-level and word-level attentions to take advantage of both extractive and abstractive summarization approaches. 30

  31. Method Combined Attention • Updated word attention is used for calculating the context vector and final word distribution 31

  32. Method Encourage Consistency • We propose a novel inconsistency loss function to ensure our unified model to be mutually beneficial to both extractive and abstractive summarization. multiplied attention of top K attended words maximize 32

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend