A Network-based End-to-End Trainable Task-oriented Dialogue System - - PowerPoint PPT Presentation

▶

May 29, 2023 777 likes •1.12k views

A Network-based End-to-End Trainable Task-oriented Dialogue System Authors: Tsung-Hsien Wen, David Vandyke, Nikola Mrki, Milica Gai, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, and Steve Young Presented by: Qihao Shao Overview

SLIDE 1

A Network-based End-to-End Trainable Task-oriented Dialogue System

Authors: Tsung-Hsien Wen, David Vandyke, Nikola Mrkšić, Milica Gašić, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, and Steve Young Presented by: Qihao Shao

SLIDE 2

Overview

Introduction
Model
Wizard-of-Oz Data Collection
Empirical Experiments
Conclusions

SLIDE 3

Overview

Introduction
Model
Wizard-of-Oz Data Collection
Empirical Experiments
Conclusions

SLIDE 4

Introduction

Treat as a POMDP and use RL to train dialogue policies
Build end-to-end trainable, non-task-oriented

conversational systems using seq2seq model

The authors propose a model by balancing the

strengths and the weaknesses of these two

SLIDE 5

Overview

Introduction
Model
Wizard-of-Oz Data Collection
Empirical Experiments
Conclusions

SLIDE 6

Model

Intent Network
Belief Trackers
Database Operator
Policy Network
Generation Network

SLIDE 7

Model

SLIDE 8

Encoder in the sequence-to-sequence framework
Typically, an LSTM network is used
Alternatively, a CNN can be used

Intent Network

SLIDE 9

Intent Network

SLIDE 10

Belief Trackers

Core component of the model
Every slot has its belief tracker
Each tracker is a Jordan type RNN with a CNN feature

extractor

SLIDE 11

Belief Trackers

SLIDE 12

Belief Trackers

SLIDE 13

Belief Trackers

SLIDE 14

The DB query qt is formed by
Then query is applied to the DB to create a binary

truth value vector xt over DB entities

The entity referenced by the entity pointer is used to

form the final system response

Database Operator

SLIDE 15

Database Operator

SLIDE 16

Can be viewed as the glue binding other modules

together

Policy Network

SLIDE 17

Policy Network

SLIDE 18

Generation Network

Once the output token sequence has been generated,

the generic tokens are replaced by their actual values

SLIDE 19

Generation Network

Attentive Generation Network

SLIDE 20

Generation Network

SLIDE 21

Overview

Introduction
Model
Wizard-of-Oz Data Collection
Empirical Experiments
Conclusions

SLIDE 22

Wizard-of-Oz Data Collection

This paper proposed a novel crowdsourcing version of

the Wizard-of-Oz paradigm

Designed two webpages on Amazon Mechanical Turk,
ne for wizards and the other for users

SLIDE 23

Wizard-of-Oz Data Collection

SLIDE 24

Wizard-of-Oz Data Collection

SLIDE 25

Wizard-of-Oz Data Collection

99 restaurants in the DB
3000 HITs (Human Intelligence Tasks) in total
680 dialogues after data cleaning
Cost ~ 400 USD

SLIDE 26

Overview

Introduction
Model
Wizard-of-Oz Data Collection
Empirical Experiments
Conclusions

SLIDE 27

Empirical Experiments

SLIDE 28

Empirical Experiments

SLIDE 29

Empirical Experiments

SLIDE 30

Overview

Introduction
Model
Wizard-of-Oz Data Collection
Empirical Experiments
Conclusions

SLIDE 31

Conclusions

Combines modularly connected model and end-to-

end trainable model

First end-to-end NN-based model that can conduct

meaningful dialogues in a task-oriented application

SLIDE 32

A Network-based End-to-End Trainable Task-oriented Dialogue System

Overview

Overview

Introduction

conversational systems using seq2seq model

strengths and the weaknesses of these two

Overview

Model

Model

Intent Network

Intent Network

Belief Trackers

extractor

Belief Trackers

Belief Trackers

Belief Trackers

truth value vector xt over DB entities

form the final system response

Database Operator

Database Operator

together

Policy Network

Policy Network

Generation Network

the generic tokens are replaced by their actual values

Generation Network

Generation Network

Overview

Wizard-of-Oz Data Collection

the Wizard-of-Oz paradigm

Wizard-of-Oz Data Collection

Wizard-of-Oz Data Collection

Wizard-of-Oz Data Collection

Overview

Empirical Experiments

Empirical Experiments

Empirical Experiments

Overview

Conclusions

end trainable model

meaningful dialogues in a task-oriented application

Thank you