ne neural t text ge generation f from s struct ctured da
play

Ne Neural T Text Ge Generation f from S Struct ctured Da Data - PowerPoint PPT Presentation

Ne Neural T Text Ge Generation f from S Struct ctured Da Data wi with h Appl Application n to the he Biogr graph phy Domain Rmi Lebret, David Grangier, Michael Auli Fr From Str truc uctur tured ed Data a to Sen entenc


  1. Ne Neural T Text Ge Generation f from S Struct ctured Da Data wi with h Appl Application n to the he Biogr graph phy Domain Rémi Lebret, David Grangier, Michael Auli

  2. Fr From Str truc uctur tured ed Data a to Sen entenc ences es • Why? Machines like to read structured data, people don’t User-friendly access to structured data: Ø Question answering Ø Virtual assistant Ø Profile summary

  3. Co Concept-to to-Te Text Generation • Weather forecast: Cloudy, with temperatures between 10 and 20 degrees. South wind around 20 mph.

  4. Co Concept-to to-Te Text Generation • Flight query: Give me the flights leaving Denver August ninth coming back to Boston before 4pm.

  5. Mo Motivations for r Going La Large Sc Scale • Template-based approaches: PROS CONS Natural language Repetitive No training Scale poorly Small datasets with limited vocabularies • Generating natural language from Wikipedia infoboxes Ø 700K biographies Ø 400K words vocabulary

  6. Ge Generatin ting Bio iograp aphy from Wi Wiki kipe pedi dia In Infobo box Z Copy actions Conditioning on tables (fields + values)

  7. Pr Proposed Approach

  8. Fr From Str truc uctur tured ed Data a to Sen entenc ences es • How? Neural language model for constrained sentence generation Success in: Ø Caption generation (Vinyals et al, 2015) Ø Machine translation (D. Bahdanau et al, 2014) Ø Modeling conversations and dialogues (Shang et al, 2015)

  9. g 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) La Langu guage e Model el with Co Conditioning

  10. g 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) La Langu guage e Model el with Co Conditioning

  11. g 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) La Langu guage e Model el with Co Conditioning copy actions Table descriptors: Ø Name of the field Ø Position from the start Ø Position from the end

  12. g 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) La Langu guage e Model el with Co Conditioning copy actions Table descriptors: Ø Name of the field Ø Position from the start Ø Position from the end

  13. g 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) La Langu guage e Model el with Co Conditioning copy actions Table descriptors: Ø Name of the field Ø Position from the start Ø Position from the end

  14. g 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) La Langu guage e Model el with Co Conditioning Local conditioning à already generated fields copy actions Table descriptors: Ø Name of the field Ø Position from the start Ø Position from the end

  15. g 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) La Langu guage e Model el with Co Conditioning Local conditioning à already generated fields Global conditioning à fields and values copy actions Table descriptors: Ø Name of the field Ø Position from the start Ø Position from the end

  16. Ne Neur ural al Lang Languag uage Mode del l with ith Conditio nditioning ning 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) Embeddings-based model john doe ( 18 april 1352 ) is a

  17. Ne Neur ural al Lang Languag uage Mode del l with ith Conditio nditioning ning 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) Aggregating embeddings –> component-wise max john doe ( 18 april 1352 ) is a

  18. Ne Neur ural al Lang Languag uage Mode del l with ith Conditio nditioning ning 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) Input 𝑦 = 𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - : 𝜔(𝑑 $ ) 𝜔 3 𝑦 = 𝜔(𝑑 $ ); 𝜔(𝑨 ) * ); 𝜔(𝑕 , ); 𝜔(𝑕 - ) john doe ( 18 april 1352 ) is a 𝜔(𝑨 ) * ) 𝜔(𝑕 , ) 𝜔(𝑕 - )

  19. Ne Neur ural al Lang Languag uage Mode del l with ith Conditio nditioning ning 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) Input: Final score: 𝒭 𝑦, 𝑥 𝒳 𝑦, 𝑥 + 𝜚 9 𝜔 3 𝑦 = 𝜔(𝑑 $ ); 𝜔(𝑨 ) * ); 𝜔(𝑕 , ); 𝜔(𝑕 - ) 𝜚 7 𝑦, 𝑥 = 𝜚 3 Non-linear transformation ℎ(𝑦)

  20. � Ne Neur ural al Lang Languag uage Mode del l with ith Conditio nditioning ning 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) Input: Final score: 𝒭 𝑦, 𝑥 𝒳 𝑦, 𝑥 + 𝜚 9 𝜔 3 𝑦 = 𝜔(𝑑 $ ); 𝜔(𝑨 ) * ); 𝜔(𝑕 , ); 𝜔(𝑕 - ) 𝜚 7 𝑦, 𝑥 = 𝜚 3 Softmax function: log 𝑄(𝑥|𝑦) = 𝜚 7 𝑦, 𝑥 − log ? exp 𝜚 7 (𝑦, 𝑥′) -E∈𝒳∪𝒭 Training: Maximize Likelihood of Training Text J 𝑀 7 𝑡 = ? log 𝑄(𝑥 $ |𝑑 $ , 𝑨 ) * , 𝑕 , , 𝑕 - ) $KL

  21. Ex Experi riments

  22. Wi Wiki kiBio da dataset 728,321 Wikipedia biographies (80% - 10% - 10%) • Ø Infobox Ø Introduction section (only 1st sentence for the generation) Available at https://rlebret.github.io/wikipedia-biography-dataset/

  23. Qu Quantitative Results without copy actions with copy actions KN = Kneser-Ney language model (5-gram) • NLM = Neural Language model (11-gram) •

  24. At Attention Mechanism • Adding a bias 𝜚 𝒭 to 𝜚 𝒳 Continuing an incomplete field Handling transitions between fields

  25. Be Beam m Si Size Imp mpact 45 Template KN Table NLM beam size ● 40 • Much faster than Kneser-Ney 345 67 810 35 ● thanks to GPU ● ● ● 15 ●● ●● ● 20 25 1 ● ● ● ● 200 ms BLEU 30 25 • Best BLEU with 𝐿 = 5 5 6 20 4 ● 8 10 15 2025 ● ● ● ● ● ● ● ● ● 3 2 15 ● ● 1 ● 100 200 500 1000 2000 time in ms

  26. Qualitative Results Qu MODEL GENERATED SENTENCE Template KN frederick parker-rhodes ( born november 21 , 1914 – march 2 , 1987 ) was an english cricketer . Table NLM frederick parker-rhodes ( 21 november 1914 – 2 march +Local (field, start) 1987 ) was an australian rules footballer who played with carlton in the victorian football league ( vfl ) during the XXXXs and XXXXs . + Global (field) frederick parker-rhodes ( 21 november 1914 – 2 march 1987 ) was an english mycology and plant pathology , mathematics at the university of uk . + Global (field, word) frederick parker-rhodes ( 21 november 1914 – 2 march 1987 ) was a british computer scientist , best known for his contributions to computational linguistics .

  27. Co Conclusi sion • Generating sentences with: Ø copying facts from the table. Ø understanding type of fields. Ø understanding relation between record tokens and table tokens. Ø network with low capacity → fast generation. • WikiBio dataset available to download Ø https://rlebret.github.io/wikipedia-biography-dataset/

  28. Futur Future e Work • Generating multiple sentences • Loss / evaluation that assess factual accuracy ( ≠ BLEU) Thank you!

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend