a network based end to end trainable task oriented
play

A Network-based End-to-End Trainable Task-oriented Dialogue System - PowerPoint PPT Presentation

A Network-based End-to-End Trainable Task-oriented Dialogue System Authors: Tsung-Hsien Wen, David Vandyke, Nikola Mrki, Milica Gai, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, and Steve Young Presented by: Qihao Shao Overview


  1. A Network-based End-to-End Trainable Task-oriented Dialogue System Authors: Tsung-Hsien Wen, David Vandyke, Nikola Mrkšić, Milica Gašić, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, and Steve Young Presented by: Qihao Shao

  2. Overview • Introduction • Model • Wizard-of-Oz Data Collection • Empirical Experiments • Conclusions

  3. Overview • Introduction • Model • Wizard-of-Oz Data Collection • Empirical Experiments • Conclusions

  4. Introduction • Treat as a POMDP and use RL to train dialogue policies • Build end-to-end trainable, non-task-oriented conversational systems using seq2seq model • The authors propose a model by balancing the strengths and the weaknesses of these two

  5. Overview • Introduction • Model • Wizard-of-Oz Data Collection • Empirical Experiments • Conclusions

  6. Model • Intent Network • Belief Trackers • Database Operator • Policy Network • Generation Network

  7. Model

  8. Intent Network • Encoder in the sequence-to-sequence framework • Typically, an LSTM network is used • Alternatively, a CNN can be used

  9. Intent Network

  10. Belief Trackers • Core component of the model • Every slot has its belief tracker • Each tracker is a Jordan type RNN with a CNN feature extractor

  11. Belief Trackers

  12. Belief Trackers

  13. Belief Trackers

  14. Database Operator • The DB query qt is formed by • Then query is applied to the DB to create a binary truth value vector xt over DB entities • The entity referenced by the entity pointer is used to form the final system response

  15. Database Operator

  16. Policy Network • Can be viewed as the glue binding other modules together

  17. Policy Network

  18. Generation Network • Once the output token sequence has been generated, the generic tokens are replaced by their actual values

  19. Generation Network • Attentive Generation Network

  20. Generation Network

  21. Overview • Introduction • Model • Wizard-of-Oz Data Collection • Empirical Experiments • Conclusions

  22. Wizard-of-Oz Data Collection • This paper proposed a novel crowdsourcing version of the Wizard-of-Oz paradigm • Designed two webpages on Amazon Mechanical Turk, one for wizards and the other for users

  23. Wizard-of-Oz Data Collection

  24. Wizard-of-Oz Data Collection

  25. Wizard-of-Oz Data Collection • 99 restaurants in the DB • 3000 HITs (Human Intelligence Tasks) in total • 680 dialogues after data cleaning • Cost ~ 400 USD

  26. Overview • Introduction • Model • Wizard-of-Oz Data Collection • Empirical Experiments • Conclusions

  27. Empirical Experiments

  28. Empirical Experiments

  29. Empirical Experiments

  30. Overview • Introduction • Model • Wizard-of-Oz Data Collection • Empirical Experiments • Conclusions

  31. Conclusions • Combines modularly connected model and end-to- end trainable model • First end-to-end NN-based model that can conduct meaningful dialogues in a task-oriented application

  32. Thank you

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend