the extended sparky restaurant corpus
play

The Extended SPaRKy Restaurant Corpus designing a corpus with - PowerPoint PPT Presentation

The Extended SPaRKy Restaurant Corpus designing a corpus with variable information density David M. Howcroft Dietrich Klakow Vera Demberg Department of Language Science and Technology Saarland Informatics Campus, Saarland University, Germany


  1. The Extended SPaRKy Restaurant Corpus designing a corpus with variable information density David M. Howcroft Dietrich Klakow Vera Demberg Department of Language Science and Technology Saarland Informatics Campus, Saarland University, Germany Interspeech 2017 @_dmh Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 1 / 11

  2. Spoken Dialogue Systems Automatic Natural Speech Language Recognition Understanding Dialogue User Management Natural Speech Synthesis Language Generation Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 2 / 11

  3. Spoken Dialogue Systems Automatic Natural Speech Language Recognition Understanding Dialogue User Management Natural Speech Synthesis Language Generation Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 2 / 11

  4. Spoken Dialogue Systems Automatic Natural Speech Language Recognition Understanding Dialogue User Management Natural Speech Synthesis Language Generation name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 2 / 11

  5. Spoken Dialogue Systems Automatic Natural Speech Language Recognition Understanding Dialogue User Management Natural Speech Synthesis Language Generation name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Due Fratelli serves average-priced Italian food, while Andalucia is a Spanish, seafood restaurant with moderately high prices. Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 2 / 11

  6. Spoken Dialogue Systems Automatic Natural Speech Language Recognition Understanding Dialogue User Management Natural Speech Synthesis Language Generation Adapting linguistic complexity (specifically, information density) Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 2 / 11

  7. Spoken Dialogue Systems Automatic Natural Speech Language Recognition Understanding Dialogue User Management Natural Speech Synthesis Language Generation Adapting linguistic complexity (specifically, information density) Due Fratelli serves average-priced Italian food, while Andalucia is a Spanish, seafood restaurant with moderately high prices. Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 2 / 11

  8. Spoken Dialogue Systems Automatic Natural Speech Language Recognition Understanding Dialogue User Management Natural Speech Synthesis Language Generation Adapting linguistic complexity (specifically, information density) Due Fratelli serves average-priced Italian food, while Andalucia is a Spanish, seafood restaurant with moderately high prices. Due Fratelli is an Italian restaurant. Its price is average. On the other hand, Andalucia is somewhat expensive. They serve Spanish, seafood there. Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 2 / 11

  9. Spoken Dialogue Systems Automatic Natural Speech Language Recognition Understanding Dialogue User Management Natural Speech Synthesis Language Generation Adapting linguistic complexity (specifically, information density) Due Fratelli serves average-priced Italian food, while Andalucia is a Spanish, seafood restaurant with moderately high prices. Due Fratelli is an Italian restaurant. Its price is average. On the other hand, Andalucia is somewhat expensive. They serve Spanish, seafood there. . . . Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 2 / 11

  10. Traditional & end-to-end approaches to NLG name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 3 / 11

  11. Traditional & end-to-end approaches to NLG name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Traditionally: we start writing rules... Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 3 / 11

  12. Traditional & end-to-end approaches to NLG name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Traditionally: we start writing rules... assert_cuisine(NAME, CUISINE) Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 3 / 11

  13. Traditional & end-to-end approaches to NLG name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Traditionally: we start writing rules... assert_cuisine(NAME, CUISINE) → “NAME is a CUISINE restaurant” Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 3 / 11

  14. Traditional & end-to-end approaches to NLG name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Traditionally: we start writing rules... assert_cuisine(NAME, CUISINE) → “NAME is a CUISINE restaurant” → “NAME serves CUISINE food” Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 3 / 11

  15. Traditional & end-to-end approaches to NLG name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Traditionally: we start writing rules... assert_cuisine(NAME, CUISINE) → “NAME is a CUISINE restaurant” → “NAME serves CUISINE food” Machine learning on meaning representations paired with output texts Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 3 / 11

  16. Traditional & end-to-end approaches to NLG name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Traditionally: we start writing rules... assert_cuisine(NAME, CUISINE) → “NAME is a CUISINE restaurant” → “NAME serves CUISINE food” Machine learning on meaning representations paired with output texts ◮ Semantic Parsing (Zettlemoyer & Collins 2005, inter alia) Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 3 / 11

  17. Traditional & end-to-end approaches to NLG name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Traditionally: we start writing rules... assert_cuisine(NAME, CUISINE) → “NAME is a CUISINE restaurant” → “NAME serves CUISINE food” Machine learning on meaning representations paired with output texts ◮ Semantic Parsing (Zettlemoyer & Collins 2005, inter alia) ◮ End-to-end Generation (Mairesse et al. 2010, Angeli et al. 2010, Konstas & Lapata 2013, Wen et al. 2015, i.a.) Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 3 / 11

  18. Traditional & end-to-end approaches to NLG name price cuisine Due Fratelli $$ Italian Andalucia $$$ Spanish, Seafood Traditionally: we start writing rules... assert_cuisine(NAME, CUISINE) → “NAME is a CUISINE restaurant” → “NAME serves CUISINE food” Machine learning on meaning representations paired with output texts ◮ Semantic Parsing (Zettlemoyer & Collins 2005, inter alia) ◮ End-to-end Generation (Mairesse et al. 2010, Angeli et al. 2010, Konstas & Lapata 2013, Wen et al. 2015, i.a.) Either way, we need a corpus with meaning representations! Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 3 / 11

  19. Discourse-level meaning representations Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 4 / 11

  20. Discourse-level meaning representations Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 4 / 11

  21. Discourse-level meaning representations Due Fratelli is an Italian restaurant, while Andalucia is a Spanish seafood restaurant. However, Due Fratelli’s price is average, while Andalucia’s price is more expensive. Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 4 / 11

  22. Discourse-level meaning representations Due Fratelli is an Italian restaurant, while Andalucia is a Spanish seafood restaurant. However, Due Fratelli’s price is average, while Andalucia’s price is more expensive. ◮ The SPaRKy Restaurant Corpus (Walker et al. 2007) Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 4 / 11

  23. Discourse-level meaning representations Due Fratelli is an Italian restaurant, while Andalucia is a Spanish seafood restaurant. However, Due Fratelli’s price is average, while Andalucia’s price is more expensive. ◮ The SPaRKy Restaurant Corpus (Walker et al. 2007) ◮ 1800 texts from an NLG system Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 4 / 11

  24. Discourse-level meaning representations Due Fratelli is an Italian restaurant, while Andalucia is a Spanish seafood restaurant. However, Due Fratelli’s price is average, while Andalucia’s price is more expensive. ◮ The SPaRKy Restaurant Corpus (Walker et al. 2007) ◮ 1800 texts from an NLG system ◮ discourse semantics, but limited variation Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 4 / 11

  25. Crowdsourced Corpora Howcroft, Klakow, Demberg (UdS) Extended SPaRKy Restaurant Corpus Interspeech 2017 5 / 11

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend