Grounding Neural Conversation Models into the Real World Michel - - PowerPoint PPT Presentation

grounding neural conversation models into the real world
SMART_READER_LITE
LIVE PREVIEW

Grounding Neural Conversation Models into the Real World Michel - - PowerPoint PPT Presentation

Grounding Neural Conversation Models into the Real World Michel Galley SCAI October 1 st , 2017 Inform ormation ation Retri trieval eval Conver nversati sation onal l AI AI Natu Na tura ral l La Languag guage Dialogu ogue Pr


slide-1
SLIDE 1

Grounding Neural Conversation Models into the Real World

Michel Galley

SCAI October 1st, 2017

slide-2
SLIDE 2

Inform

  • rmation

ation Retri trieval eval Na Natu tura ral l La Languag guage Pr Processin cessing (N (NLP) LP) Dialogu

  • gue

Conver nversati sation

  • nal

l AI AI

slide-3
SLIDE 3

Natural Language Processing: language in, language out

Twitter doubled its character limit Twitter verdubbelde zijn karakterlimiet

slide-4
SLIDE 4

Traditional NLP pipeline

Twitter doubled its character limit Twitter verdubbelde zijn karakterlimiet

slide-5
SLIDE 5

But the technical landscape has shifted

End-to-End Modeling: Language as emergent behavior

Twitter doubled its character limit Twitter verdubbelde zijn karakterlimiet

slide-6
SLIDE 6

Deep learning: recent state of the art results

Task Test set Metric Best non- neural Best neural Source Machin ine Translat lation ion EN-DE newstest16 BLEU 31.4 34.8 http://matrix.statmt.org DE-EN newstest16 BLEU 35.9 39.9 http://matrix.statmt.org Sentiment Analysis ysis Stanford sentiment bank 5-class Accuracy 71.0 80.7 Socher et al 2013 Question

  • n Answerin

ring WebQuestions test set F1 39.9 52.5 Yih et al 2015 Entity y Linking ng Bing Query Entity Linking set AUC 72.3 78.2 Gao et al 2015 Image Caption

  • nin

ing COCO 2015 challenge Turing test pass% 25.5 32.2 Fang et al 2015 Sentence compres essio sion Google 10K dataset F1 0.75 0.82 Fillipova et al, 2015

Neural systems beat previous state of the art by wide margins across an array of applications

slide-7
SLIDE 7

Conversational AI?

slide-8
SLIDE 8

Fully data-driven conversational AI Twitter:

304M monthly active users 500M tweets per day (6M conversations per day)*

Other sources:

Reddit, movie subtitles, technical data (Ubuntu), etc.

*: statistics as of 2015

slide-9
SLIDE 9

Response Generation as Statistical Machine Translation

[Ritter et al., EMNLP 2011]

Yeah ah , You’re I’m

  • n my way

now now going ing now? w? Good

  • d luck!

k!

Exploit high-frequency word- and phrase-based mappings

“I’m”  “You’re” “sick”  “get better” “lovely!”  “thanks!” “Going to the airport”  “Have a safe flight!”

slide-10
SLIDE 10

Neural Conversation Models

[Sordoni et al., 2015; Vinyals and Le, 2015; Shang et al., 2015; Serban et al., 2016; etc.]

Source: conversation history Target: response

Trained models using up to ~150M conversations.

slide-11
SLIDE 11

Language as emergent behavior: examples

slide-12
SLIDE 12

Language as emergent behavior: examples

slide-13
SLIDE 13

Pronominal gender, number, case

slide-14
SLIDE 14

Pronominal gender, number, case (2)

slide-15
SLIDE 15

It also learns what might be called “inferences”

slide-16
SLIDE 16

Inferences (2)

slide-17
SLIDE 17

Inferences (3)

slide-18
SLIDE 18

NOT grounded

Dialog Systems: Two paradigms

Understanding (NLU) State tracker Generation (NLG) Dialog policy

input x

  • utput y

Traditional

calendar

Grounded

input x

  • utput y

Fully data-driven

Environment

slide-19
SLIDE 19

Fully Data-Driven AND AND Grounded Models

Personalizat sonalization ion data ta (ID, social graph, ...) De Devic ice e se sens nsor

  • rs

(GPS, vision, ...)

[Li et al., 2016] [Ghazvininejad et al., 2017] [Luan et al., 2017] [Mostafazadeh et al., 2017]

slide-20
SLIDE 20

A Knowledge-Grounded Neural Conversation Model

ht

Going to Kusakabe tonight

CONVERSATION HISTORY

Try omakase, the best in town

RESPONSE ht

DECODER DIALOG ENCODER

. . .

WORLD “FACTS”

A

. . .

CONTEXTUALLY-RELEVANT “FACTS” Consistently the best omakase Amazing sushi tasting […] They were out of kaisui […]

FACTS ENCODER

[Ghazvininejad et al., 2017]

slide-21
SLIDE 21

“Infusing” non-conversational knowledge into conversations

You know any good Japanese restaurant in Seattle? Try Kisaku, one of the best sushi restaurants in the city. You know any good A restaurant in B? Try C, one of the best D in the city.

slide-22
SLIDE 22

Knowledge-grounded generated responses

Obsessed with [jewelry company] :-*

  • h my gosh obsessed with their bracelets and the meaning behind them!

I would give ALMOST anything for some [Mexican restaurant] right now. Me too. Creamy jalapeno sauce is really good. Visiting the celebs at Los Angeles airport - [...] w/ 70 others Nice airport terminal. Have a safe flight.

slide-23
SLIDE 23

EOS where do you live

in

in england

england

.

. EOS

Rob Rob Rob Rob

Word embeddings (50k)

england london u.s. great good stay live

  • kay

monday tuesday

Speaker embeddings (70k) Rob_712 skinnyoflynny2 Tomcoatez Kush_322 D_Gomes25 Dreamswalls kierongillen5 TheCharlieZ The_Football_Bar This_Is_Artful DigitalDan285 Jinnmeow3 Bob_Kelly2

[Li et al. A Persona-Based Neural Conversation Model, ACL 2016]

slide-24
SLIDE 24

Personalized Response Generation

Source LSTM Target LSTM

DIALOG HISTORY

query

RESPONSE

response

Target LSTM Source LSTM Personalized data personalized data

Autoencoder Seq2Seq

What’s your job? I’m sales assistant I work in a nursery Software engineer I’m a code ninja I’m a code ninja

[Luan et al., 2017]

slide-25
SLIDE 25

Personalization: generated responses

I am getting a loop back to login page. Ah, ok. Thanks for the info. Have you tried clearing your cache and cookies?

baseline persona

I reset it twice! It still doesn’t work. Let me know if there’s anything I can help you with. I’m sorry to hear that. Are you receiving any error message?

baseline persona

slide-26
SLIDE 26

Image-Grounded Conversations

Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation

  • N. Mostafazadeh, C. Brockett, B. Dolan, M. Galley, J. Gao, G. Spithourakis, L. Vanderwende, IJCNLP 2017

I forgot to take a pic before I took a bite. Is that an ice cream? The weather was amazing at the game. Who is winning?

slide-27
SLIDE 27

Data-driven conversation: toward more informational and “useful” dialogs

Traditional dialogue systems (grounded) chitchat informational, task-completion dialogue Fully data-driven (previously ungrounded)

[Ritter et al., 2011, Sordoni et al., 2015; Vinyals and Le, 2015; Shang et al., 2015; Li et al., 2016; …] [Ghazvi hazvinine nineja jad d et al., 2017; 7; etc.] .]

GROUND NDED! ED!

slide-28
SLIDE 28

Conclusions

ba back ckbo bone sh shell Produce more informational and “use sefu ful” dialogues

slide-29
SLIDE 29

Collaborators

Jiwei Li Stanford Nasrin Mostafazadeh

  • U. Rochester

Marjan Ghazvininejad USC/ISI Alan Ritter Ohio State U. Yi Luan

  • U. Washington

Alessandro Sordoni Microsoft Bill Dolan Jianfeng Gao Chris Quirk Chris Brockett Scott Yih Ming-Wei Chang

slide-30
SLIDE 30

Thank you

  • Marjan Ghazvininejad, Chris Brockett, Ming-Wei Chang, Bill Dolan, Jianfeng Gao, Wen-tau

Yih, Michel Galley. A Knowledge-Grounded Neural Conversation Model.

  • Yi Luan, Chris Brockett, Bill Dolan, Jianfeng Gao and Michel Galley. Multi-T

ask Learning for Speaker-Role Adaptation in Neural Conversation Models. IJCNLP 2017.

  • Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan. A Personalized Neural

Conversation Model. In preparation for ACL 2016.

  • Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan, A Diversity-Promoting

Objective Function for Neural Conversation Models, NAACL 2016.

  • Alessandro Sordoni, Michel Galley, Michael Auli, Chris Brockett, Yangfeng Ji, Meg Mitchell,

Jian-Yun Nie, Jianfeng Gao, and Bill Dolan, A Neural Network Approach to Context-Sensitive Generation of Conversational Responses, NAACL 2015.

  • Alan Ritter, Colin Cherry, Bill Dolan. Data-Driven Response Generation in Social Media,

EMNLP 2011.

mgalley@microsoft.com