Grounding Neural Conversation Models into the Real World
Michel Galley
SCAI October 1st, 2017
Grounding Neural Conversation Models into the Real World Michel - - PowerPoint PPT Presentation
Grounding Neural Conversation Models into the Real World Michel Galley SCAI October 1 st , 2017 Inform ormation ation Retri trieval eval Conver nversati sation onal l AI AI Natu Na tura ral l La Languag guage Dialogu ogue Pr
Michel Galley
SCAI October 1st, 2017
Inform
ation Retri trieval eval Na Natu tura ral l La Languag guage Pr Processin cessing (N (NLP) LP) Dialogu
Conver nversati sation
l AI AI
Natural Language Processing: language in, language out
Twitter doubled its character limit Twitter verdubbelde zijn karakterlimiet
Twitter doubled its character limit Twitter verdubbelde zijn karakterlimiet
Twitter doubled its character limit Twitter verdubbelde zijn karakterlimiet
Task Test set Metric Best non- neural Best neural Source Machin ine Translat lation ion EN-DE newstest16 BLEU 31.4 34.8 http://matrix.statmt.org DE-EN newstest16 BLEU 35.9 39.9 http://matrix.statmt.org Sentiment Analysis ysis Stanford sentiment bank 5-class Accuracy 71.0 80.7 Socher et al 2013 Question
ring WebQuestions test set F1 39.9 52.5 Yih et al 2015 Entity y Linking ng Bing Query Entity Linking set AUC 72.3 78.2 Gao et al 2015 Image Caption
ing COCO 2015 challenge Turing test pass% 25.5 32.2 Fang et al 2015 Sentence compres essio sion Google 10K dataset F1 0.75 0.82 Fillipova et al, 2015
Neural systems beat previous state of the art by wide margins across an array of applications
304M monthly active users 500M tweets per day (6M conversations per day)*
Reddit, movie subtitles, technical data (Ubuntu), etc.
*: statistics as of 2015
[Ritter et al., EMNLP 2011]
Yeah ah , You’re I’m
now now going ing now? w? Good
k!
Exploit high-frequency word- and phrase-based mappings
“I’m” “You’re” “sick” “get better” “lovely!” “thanks!” “Going to the airport” “Have a safe flight!”
[Sordoni et al., 2015; Vinyals and Le, 2015; Shang et al., 2015; Serban et al., 2016; etc.]
Source: conversation history Target: response
Trained models using up to ~150M conversations.
NOT grounded
Understanding (NLU) State tracker Generation (NLG) Dialog policy
input x
Traditional
calendar
Grounded
input x
Fully data-driven
Environment
Personalizat sonalization ion data ta (ID, social graph, ...) De Devic ice e se sens nsor
(GPS, vision, ...)
[Li et al., 2016] [Ghazvininejad et al., 2017] [Luan et al., 2017] [Mostafazadeh et al., 2017]
ht
Going to Kusakabe tonight
CONVERSATION HISTORY
Try omakase, the best in town
RESPONSE ht
DECODER DIALOG ENCODER
WORLD “FACTS”
A
CONTEXTUALLY-RELEVANT “FACTS” Consistently the best omakase Amazing sushi tasting […] They were out of kaisui […]
FACTS ENCODER
[Ghazvininejad et al., 2017]
You know any good Japanese restaurant in Seattle? Try Kisaku, one of the best sushi restaurants in the city. You know any good A restaurant in B? Try C, one of the best D in the city.
Obsessed with [jewelry company] :-*
I would give ALMOST anything for some [Mexican restaurant] right now. Me too. Creamy jalapeno sauce is really good. Visiting the celebs at Los Angeles airport - [...] w/ 70 others Nice airport terminal. Have a safe flight.
EOS where do you live
in
in england
england
.
. EOS
Rob Rob Rob Rob
Word embeddings (50k)
england london u.s. great good stay live
monday tuesday
Speaker embeddings (70k) Rob_712 skinnyoflynny2 Tomcoatez Kush_322 D_Gomes25 Dreamswalls kierongillen5 TheCharlieZ The_Football_Bar This_Is_Artful DigitalDan285 Jinnmeow3 Bob_Kelly2
[Li et al. A Persona-Based Neural Conversation Model, ACL 2016]
Source LSTM Target LSTM
DIALOG HISTORY
query
RESPONSE
response
Target LSTM Source LSTM Personalized data personalized data
Autoencoder Seq2Seq
What’s your job? I’m sales assistant I work in a nursery Software engineer I’m a code ninja I’m a code ninja
[Luan et al., 2017]
I am getting a loop back to login page. Ah, ok. Thanks for the info. Have you tried clearing your cache and cookies?
baseline persona
I reset it twice! It still doesn’t work. Let me know if there’s anything I can help you with. I’m sorry to hear that. Are you receiving any error message?
baseline persona
Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation
I forgot to take a pic before I took a bite. Is that an ice cream? The weather was amazing at the game. Who is winning?
Traditional dialogue systems (grounded) chitchat informational, task-completion dialogue Fully data-driven (previously ungrounded)
[Ritter et al., 2011, Sordoni et al., 2015; Vinyals and Le, 2015; Shang et al., 2015; Li et al., 2016; …] [Ghazvi hazvinine nineja jad d et al., 2017; 7; etc.] .]
GROUND NDED! ED!
ba back ckbo bone sh shell Produce more informational and “use sefu ful” dialogues
Jiwei Li Stanford Nasrin Mostafazadeh
Marjan Ghazvininejad USC/ISI Alan Ritter Ohio State U. Yi Luan
Alessandro Sordoni Microsoft Bill Dolan Jianfeng Gao Chris Quirk Chris Brockett Scott Yih Ming-Wei Chang
Yih, Michel Galley. A Knowledge-Grounded Neural Conversation Model.
ask Learning for Speaker-Role Adaptation in Neural Conversation Models. IJCNLP 2017.
Conversation Model. In preparation for ACL 2016.
Objective Function for Neural Conversation Models, NAACL 2016.
Jian-Yun Nie, Jianfeng Gao, and Bill Dolan, A Neural Network Approach to Context-Sensitive Generation of Conversational Responses, NAACL 2015.
EMNLP 2011.
mgalley@microsoft.com