Exemplar Encoder Decoder for Neural Conversation Generation By - PowerPoint PPT Presentation

Exemplar Encoder Decoder for Neural Conversation Generation By Gaurav Pandey, Danish Contractor, Vineet Kumar and Sachindra Joshi IBM Research AI

Generative Models for Conversations Context Context Context Response Decoder Embedding Encoder • Context encoder: (1) RNN (2) hierarchical RNN • Decoder: RNN • Objective: log probability of GT response given context. • Can generate novel responses for novel contexts!!

Retrieval Models for Conversations • Retrieve a response from a nearest neighbor index constructed from the training data. • Can be used for closed domain problems. • Advantages: • Answers are grounded in the domain. • Easy to prune answers according to requirements. • Disadvantage: • Can not generate novel responses. Can we use generative models to fix this?

Exemplar Encoder Decoder • Build an index from all context-response pairs offline. • For each context c: • Retrieve a set of exemplar contexts and corresponding responses. 𝑑 (1) , 𝑠 (1) 𝑑 (2) , 𝑠 (2) 𝑑 𝑑 ( 𝐿 ) , 𝑠 ( 𝐿 ) Input Context Index Exemplar conversations • Match the exemplar contexts with c and get the similarities. • Use these similarities to weigh the exemplar responses.

Matching Exemplar Contexts Exemplar contexts Customer: hi . today i have received the wst non- 𝑑 (1) compliance. Agent : i see that you have Encoder an issue with wst non complaints. Input Context Customer : its regarding the tem 𝑡 (1) Customer : i am getting Customer : regarding wst wst non-complaint for tem non-compliant report . i am install c 𝑑 (2) unable to install tivoli Encoder Agent : okay . . let me Encoder endpoint manager ( tem 𝑡 (2) create a ticket to l2 Agent : what is error report support team you get ? Customer : ok . Customer : this one. 𝑡 (3) Customer : i received an email action required : it security noncompliance reported by wst. 𝑑 (3) Encoder Agent : is this showing as wst non complaint ? Customer : yes ... seems . Normalized may i show you the mail that i similarities received ? The normalized similarities are used to weigh the exemplar responses.

𝐿 𝑑 𝑓 𝑠 (1) 𝑠 (1) ∑ 𝑡 ( 𝑙 ) 𝑞 ( 𝑠 | 𝑓 ( 𝑙 ) ) 𝑚𝑚 = log 𝑓 𝒇 ( 𝟐 ) RESPONSE ENCODER DECODER 𝑙 =1 𝑠 (2) 𝑑 𝑓 𝑠 (2) 𝑓 𝒇 ( 𝟑 ) 𝑠 ( 𝐿 ) 𝑑 𝑓 𝑠 ( 𝐿 ) 𝑓 𝒇 ( 𝑳 ) Likelihood r ENCODER CONTEXT Computation c 𝑡 (1) 𝑑 (1) ENCODER CONTEXT 𝑑 (2) 𝑡 ( 𝐿 ) 𝑑 ( 𝐿 ) Exemplar Decoder Exemplar Encoder

Analyzing the Objective c r ( 𝑑 ′ � , 𝑠 ′ � ) Think of exemplar contexts and responses as latent variables log 𝑞 ( 𝑠 𝑑 ) = log ∑ 𝑞 ( 𝑠 𝑑 , 𝑠 ′ � ) 𝑞 ( 𝑑 ′ � | 𝑑 ) ( 𝑑 ′ � , 𝑠 ′ � ) ≤ log ∑ 𝑞 ( 𝑠 𝑑 , 𝑠 𝑙 ) 𝑞 ( 𝑑 𝑙 | 𝑑 ) 1 ≤ 𝑙 ≤ 𝐿 = log ∑ 𝑞 ( 𝑠 𝑓 ( 𝑙 ) ) 𝑡 ( 𝑙 ) 1 ≤ 𝑙 ≤ 𝐿

Evaluation • Exemplar Encoder Decoder • Hierarchical Recurrent Encoder • TF-IDF for retrieving exemplar conversations • Datasets used: • Ubuntu Dialogue Corpus • IBM Tech Support Dataset • Comparison Metrics • Activity and Entity metrics • Embedding metrics

Activity and Entity metrics These metrics compare the precision, recall and F1 score of specific nouns and verb present in the generated response as compared to the groundtruth response. Ubuntu Dialogue Corpus For comparison, the retrieval only model has an activity F1 score of 4.23 and entity F1 score of 2.72 respectively.

Embedding metrics • These metrics compare the word embeddings of the generated response with the words of the groundtruth response. • These metrics do not correlate with human judgements for Ubuntu Corpus 1 . 1 How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation

Generated and retrieved responses

Discussion • A generative model that utilizes similar conversations for response generation. • Can generate novel responses while ensuring that the responses are grounded in the domain. • Incorporating retrieved conversations during generation improves performance as evident from several metrics. • The proposed idea is general and can be used for image captioning and neural machine translation.

Exemplar Encoder Decoder for Neural Conversation Generation By - PowerPoint PPT Presentation

Exemplar Encoder Decoder for Neural Conversation Generation By Gaurav Pandey, Danish Contractor, Vineet Kumar and Sachindra Joshi IBM Research AI Generative Models for Conversations Context Context Context Response Decoder Embedding

Exercise 2: Encoder / Decoder Framework Goals : Implement basic framework for encoder and decoder

UN13750 Programmable Encoder/Decoder Single chip contains both Encoder and Decoder. Schmitt

The Attention Mechanism & Encoder-Decoder Variants CMSC 470 Marine Carpuat Introduction to

Adaptive Multi-pass Decoder for Neural Machine Translation EMNLP 2018

Image and Video Coding: Introduction bitstream encoder decoder Motivation Image and Video

Image and Video Coding: Representation, Acquisition, Display ... 10011 ... encoder decoder

A Hierarchical Encoder-Decoder for Paragraph Summarization Farzaneh Mahdisoltani Department of

Contents PRO-Decoder Function Methods Results Abstract Experiment Computer RBS-Decoder

7 Neural MT 1: Neural Encoder-Decoder Models From Section 3 to Section 6, we focused on the

Attention Graham Neubig Site https://phontron.com/class/nn4nlp2017/ Encoder-decoder Models

Attention Graham Neubig Site https://phontron.com/class/nn4nlp2020/ Encoder-decoder Models

RCIA Fall Retreat: Jesus Exemplar, Aspects of Prayer, Types of Prayer Fall Retreat:

Digital Design Disc: RTL Combinatorial Components 2-to-4 Decoder 4-to-16 Decoder 8-bit Shifter

Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder Huadong Chen ! ,

Attention-based Encoder-Decoder Networks NLP challenges Methods for Spelling and Grammatical

Hybrid Sequence Encoder Of Collaborative Experts For Video Retrieval Kaixu Cui, Hui Liu, Cheng

Travis Perkins plc Travis Perkins plc Financial Results Financial Results Year ended 31

Bright Horizons Delivering on the Plan FY17: Interim Results Presentation 21 February 2017

The Spoofax Language Workbench Lennart Kats Eelco Visser Software Engineering implement

Variables in Imperative Languages A variable in an imperative programming language can be

S8822 OPTIMIZING NMT WITH TENSORRT Micah Villmow Senior TensorRT Software Engineer 2 100

Deep Learning for Language Understanding (at Google Scale) Anjuli Kannan Software Engineer,

N EU G EN Text Generation from Meaning Representations Yannis Konstas Joint work

F i n a n c i a l R e s u l t s P r e s e n t a t i o n Aug 3, 2015

Exemplar Encoder Decoder for Neural Conversation Generation By - PowerPoint PPT Presentation

Exemplar Encoder Decoder for Neural Conversation Generation By Gaurav Pandey, Danish Contractor, Vineet Kumar and Sachindra Joshi IBM Research AI Generative Models for Conversations Context Context Context Response Decoder Embedding

Exercise 2: Encoder / Decoder Framework Goals : Implement basic framework for encoder and decoder

UN13750 Programmable Encoder/Decoder Single chip contains both Encoder and Decoder. Schmitt

The Attention Mechanism &amp; Encoder-Decoder Variants CMSC 470 Marine Carpuat Introduction to

Adaptive Multi-pass Decoder for Neural Machine Translation EMNLP 2018

Image and Video Coding: Introduction bitstream encoder decoder Motivation Image and Video

Image and Video Coding: Representation, Acquisition, Display ... 10011 ... encoder decoder

A Hierarchical Encoder-Decoder for Paragraph Summarization Farzaneh Mahdisoltani Department of

Contents PRO-Decoder Function Methods Results Abstract Experiment Computer RBS-Decoder

7 Neural MT 1: Neural Encoder-Decoder Models From Section 3 to Section 6, we focused on the

Attention Graham Neubig Site https://phontron.com/class/nn4nlp2017/ Encoder-decoder Models

Attention Graham Neubig Site https://phontron.com/class/nn4nlp2020/ Encoder-decoder Models

RCIA Fall Retreat: Jesus Exemplar, Aspects of Prayer, Types of Prayer Fall Retreat:

Digital Design Disc: RTL Combinatorial Components 2-to-4 Decoder 4-to-16 Decoder 8-bit Shifter

Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder Huadong Chen ! ,

Attention-based Encoder-Decoder Networks NLP challenges Methods for Spelling and Grammatical

Hybrid Sequence Encoder Of Collaborative Experts For Video Retrieval Kaixu Cui, Hui Liu, Cheng

Travis Perkins plc Travis Perkins plc Financial Results Financial Results Year ended 31

Bright Horizons Delivering on the Plan FY17: Interim Results Presentation 21 February 2017

The Spoofax Language Workbench Lennart Kats Eelco Visser Software Engineering implement

Variables in Imperative Languages A variable in an imperative programming language can be

S8822 OPTIMIZING NMT WITH TENSORRT Micah Villmow Senior TensorRT Software Engineer 2 100

Deep Learning for Language Understanding (at Google Scale) Anjuli Kannan Software Engineer,

N EU G EN Text Generation from Meaning Representations Yannis Konstas Joint work

F i n a n c i a l R e s u l t s P r e s e n t a t i o n Aug 3, 2015

The Attention Mechanism & Encoder-Decoder Variants CMSC 470 Marine Carpuat Introduction to