A Network-based End-to-End Trainable Task-oriented Dialogue System - - PowerPoint PPT Presentation

a network based end to end trainable task oriented
SMART_READER_LITE
LIVE PREVIEW

A Network-based End-to-End Trainable Task-oriented Dialogue System - - PowerPoint PPT Presentation

A Network-based End-to-End Trainable Task-oriented Dialogue System Authors: Tsung-Hsien Wen, David Vandyke, Nikola Mrki, Milica Gai, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, and Steve Young Presented by: Qihao Shao Overview


slide-1
SLIDE 1

A Network-based End-to-End Trainable Task-oriented Dialogue System

Authors: Tsung-Hsien Wen, David Vandyke, Nikola Mrkšić, Milica Gašić, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, and Steve Young Presented by: Qihao Shao

slide-2
SLIDE 2

Overview

  • Introduction
  • Model
  • Wizard-of-Oz Data Collection
  • Empirical Experiments
  • Conclusions
slide-3
SLIDE 3

Overview

  • Introduction
  • Model
  • Wizard-of-Oz Data Collection
  • Empirical Experiments
  • Conclusions
slide-4
SLIDE 4

Introduction

  • Treat as a POMDP and use RL to train dialogue policies
  • Build end-to-end trainable, non-task-oriented

conversational systems using seq2seq model

  • The authors propose a model by balancing the

strengths and the weaknesses of these two

slide-5
SLIDE 5

Overview

  • Introduction
  • Model
  • Wizard-of-Oz Data Collection
  • Empirical Experiments
  • Conclusions
slide-6
SLIDE 6

Model

  • Intent Network
  • Belief Trackers
  • Database Operator
  • Policy Network
  • Generation Network
slide-7
SLIDE 7

Model

slide-8
SLIDE 8
  • Encoder in the sequence-to-sequence framework
  • Typically, an LSTM network is used
  • Alternatively, a CNN can be used

Intent Network

slide-9
SLIDE 9

Intent Network

slide-10
SLIDE 10

Belief Trackers

  • Core component of the model
  • Every slot has its belief tracker
  • Each tracker is a Jordan type RNN with a CNN feature

extractor

slide-11
SLIDE 11

Belief Trackers

slide-12
SLIDE 12

Belief Trackers

slide-13
SLIDE 13

Belief Trackers

slide-14
SLIDE 14
  • The DB query qt is formed by
  • Then query is applied to the DB to create a binary

truth value vector xt over DB entities

  • The entity referenced by the entity pointer is used to

form the final system response

Database Operator

slide-15
SLIDE 15

Database Operator

slide-16
SLIDE 16
  • Can be viewed as the glue binding other modules

together

Policy Network

slide-17
SLIDE 17

Policy Network

slide-18
SLIDE 18

Generation Network

  • Once the output token sequence has been generated,

the generic tokens are replaced by their actual values

slide-19
SLIDE 19

Generation Network

  • Attentive Generation Network
slide-20
SLIDE 20

Generation Network

slide-21
SLIDE 21

Overview

  • Introduction
  • Model
  • Wizard-of-Oz Data Collection
  • Empirical Experiments
  • Conclusions
slide-22
SLIDE 22

Wizard-of-Oz Data Collection

  • This paper proposed a novel crowdsourcing version of

the Wizard-of-Oz paradigm

  • Designed two webpages on Amazon Mechanical Turk,
  • ne for wizards and the other for users
slide-23
SLIDE 23

Wizard-of-Oz Data Collection

slide-24
SLIDE 24

Wizard-of-Oz Data Collection

slide-25
SLIDE 25

Wizard-of-Oz Data Collection

  • 99 restaurants in the DB
  • 3000 HITs (Human Intelligence Tasks) in total
  • 680 dialogues after data cleaning
  • Cost ~ 400 USD
slide-26
SLIDE 26

Overview

  • Introduction
  • Model
  • Wizard-of-Oz Data Collection
  • Empirical Experiments
  • Conclusions
slide-27
SLIDE 27

Empirical Experiments

slide-28
SLIDE 28

Empirical Experiments

slide-29
SLIDE 29

Empirical Experiments

slide-30
SLIDE 30

Overview

  • Introduction
  • Model
  • Wizard-of-Oz Data Collection
  • Empirical Experiments
  • Conclusions
slide-31
SLIDE 31

Conclusions

  • Combines modularly connected model and end-to-

end trainable model

  • First end-to-end NN-based model that can conduct

meaningful dialogues in a task-oriented application

slide-32
SLIDE 32

Thank you