serp based conversations
play

SERP-Based Conversations Maarten de Rijke University of Amsterdam - PowerPoint PPT Presentation

SERP-Based Conversations Maarten de Rijke University of Amsterdam derijke@uva.nl Work in progress Joint work in progress with Pengjie Ren Maarten de Rijke Svitlana Vakulenko Nikos Voskarides Yangjun Zhang U. Amsterdam U. Amsterdam TU


  1. SERP-Based Conversations Maarten de Rijke University of Amsterdam derijke@uva.nl

  2. Work in progress

  3. Joint work in progress with Pengjie Ren Maarten de Rijke Svitlana Vakulenko Nikos Voskarides Yangjun Zhang U. Amsterdam U. Amsterdam TU Wien U. Amsterdam U. Amsterdam � 3

  4. Information retrieval • Connecting people to information • Search • Recommendation • Conversations � 4

  5. Conversational search • Idea of search as conversation has been around since early 1980s (Belkin, CJIS 1980) • Making information retrieval interfaces feel more natural and convenient for their users (Radlinski & Craswell, CHIIR 2017) • Ongoing research and development e ff orts heavily skewed towards question answering tasks � 5

  6. But there’s more …

  7. 
 
 
 Conversational browsing • Using conversations to browse large collections of information objects (Vakulenko et al, ISWC 2018) • User model maintains knowledge state, information goal, navigation strategy 
 • � 7

  8. 
 
 Conversations for exploratory search • Exploratory search important • Educational purposes • Serendipitous discoveries of cultural artifacts – users often look for inspiration, surprises, novel ideas • E-commerce • (Vakulenko et al., SCAI 2017) 
 � 8

  9. Conversational SERPs • Conversational search engine result pages = ? + � 9

  10. Conversational SERPs • Heterogeneous SERPs • Dealing with multiple intents • Multiple answers • Text vs. image/video vs. knowledge cards vs. … • Structured vs. unstructured information • Static blogs/articles vs. live news/ reports • Closed world vs. open world • … • But let’s start a bit simpler … � 10

  11. money made by <movie> � 11

  12. what did you think of the title ? the title pretty much describes the level of the humor in this ben stiller movie . haha , i agree ! do you know if it made any money ? yeah , it made $ 279,167,575 . pretty good . � 12

  13. <movie> plot � 13

  14. that was a good scene what did you like about the movie ? i liked his friend , jack wade . i loved the part where bond arrives in st . petersburg and meets his cia contact , jack wade ( joe don baker ) .

  15. Search as conversation Conversational search engine SERP grounded conversational agent

  16. SERP grounded conversational agents SERPs (Knowledge) Conversation management (CM) Conversational context understanding (CCU) Conversational topic tracking (CTT) Conversational topic shifting (CTS) Response generation (RG) Locating knowledge (KL)

  17. Make it more interactive.

  18. Who shot the first cat video?

  19. Thomas Edison, 1894

  20. End of interaction.

  21. Outline • Introduction • Recent advances • Available datasets • Challenges and ambitions � 22

  22. SERP grounded conversational agents SERPs (Knowledge) Conversation management (CM) Conversational context understanding (CCU) Conversational topic tracking (CTT) Conversational topic shifting (CTS) Response generation (RG) Locating knowledge (KL)

  23. CCU: Conversational context encoding • Non-hierarchical context modeling • Concatenate previous utterances into one sequence � 24

  24. 
 
 
 
 CCU: Conversational context encoding • Hierarchical context modeling • HRED (Serban et al., AAAI 2016) • VHRED (Serban et al., AAAI 2017) 
 � 25

  25. CCU: Conversational context encoding • Incremental context modeling • Incremental Transformer Encoder (ITE) • (Li et al., ACL 2019) � 26

  26. CCU: Conversational context encoding • Knowledge enhanced context modeling • Commonsense conversational model (CCM) • (Li et al. ACL 2019) � 27

  27. CTT: Conversational topic tracking • Use a dialogue graph w1 mdg : gksudo gedit /etc/apt/source.list w2 w3 (type from command line) crunchbang666 : the text editor has opened the file representation source.list but there is no content i typed source instead of sources ... ok so i have it open dbr:Gedit w1 • Capture relation within dialogue u1 c1 w2 genre wikiPageWikiLink corpus using semantic relations u2 c* c2 dbr:GNOME from background knowledge dbr:Text editor w3 wikiPageWikiLink p1 p2 • Semantic coherence measuring as a w4 c3 dbr:Deb(file format) binary classification task wikiPageWikiLink u3 c4 • Top- k shortest path induced dbr:Ubuntu(OS) u4 w5 subgraphs w5 w4 mdg : see the line # deb http://gb.archive. ubuntu w4 all you have to do is delete the ""#"" character crunchbang666 : just the deb or the deb-src line too? • (Vakulenko et al., ISWC 2018) � 28

  28. 
 
 
 
 
 
 
 
 KL: Knowledge selection • Span level selection • Reading comprehension models, e.g., BiDAF • (Seo et al., ICLR 2017) 
 Conversational Background context what do you think about the characters in this movie ? Start pointer End pointer Sunday Ruth � 29

  29. 
 
 
 
 
 
 
 KL: Knowledge selection • Sentence level selection • Sentence ranking à response generation • (Dinan et al., ICLR 2019) • (Lian et al., arXiv 2019) 
 Sentence 1 what do you think about the characters in this movie ? Sentence 2 Conversational Attention Background Context … Sentence n My favorite character is Sunday Ruth � 30

  30. KL: Knowledge selecting à reasoning • Multi-hop walking on knowledge graph • (Liu et al., arXiv 2019) • (Moon et al., ACL 2019) � 31

  31. 
 
 
 
 
 
 RG: Knowledge enhanced response generation • Attentive generation 
 Background Context Attention is XXX My favorite character � 32

  32. 
 
 
 
 RG: Knowledge enhanced response generation • Token copying generation • Pointer network (See et al., ACL 2017) • CaKe (Zhang et al., SCAI 2019) 
 Copy token Background My favorite character is XXX Context � 33

  33. 
 
 
 
 
 RG: Knowledge enhanced response generation • Span copying generation • RefNet (Meng et al., arXiv 2019) 
 Background Copy span … My favorite character is XXX Context XXX � 34

  34. Outline • Introduction • Recent advances • Available datasets • Challenges and ambitions � 35

  35. Dataset • User reviews as knowledge • (Ghazvininejad et al., AAAI 2018) � 36

  36. Dataset • Common sense as knowledge • (Zhou et al., IJCAI 2018) � 37

  37. Dataset • User persona as knowledge • (Zhang et al., ACL 2018) � 38

  38. Dataset • Wikipedia movie articles as knowledge • (Zhou et al., EMNLP 2018) � 39

  39. Dataset • Wikipedia movie articles and IMDb movie reviews as knowledge • (Moghe et al., EMNLP 2018) � 40

  40. Dataset • Wikipedia articles as knowledge • (Dinan et al., ICLR 2019) � 41

  41. Dataset • Wikipedia articles as knowledge grounded to Reddit conversations • (Qin et al., ACL 2019) � 42

  42. Dataset • Knowledge graph as knowledge • (Moon et al., ACL 2019) � 43

  43. Dataset http://www.treccast.ai � 44

  44. Make it more interactive.

  45. What is CatVideoFest?

  46. End of interaction.

  47. Outline • Introduction • Recent advances • Available datasets • Challenges and ambitions � 49

  48. Challenges and ambitions • Conversational topic shifting • What to talk about next? • Di ff erent from web search, user inputs are conversational, support exploration, serendipity • No single correct answer • Di ff erent from machine reading comprehension, user inputs are not always questions with definitive answers. New challenges for modeling as well as evaluation � 50

  49. 
 Challenges and ambitions • Heterogeneous SERPs • Dealing with multiple intents • Text vs. image/video vs. knowledge cards vs. … • Structured vs. unstructured • Static blogs/articles vs. live news/reports 
 New challenges for knowledge locating (selecting and reasoning) across heterogeneous resources � 51

  50. Challenges and ambitions • Interpretable conversations • Explainable for developers • Failure analysis • Identifying influential (online) training instances • Reasoning path on knowledge graph as explanations (Moon et al., ACL 2019, Liu et al., arXiv 2019) • Explainable for users • Response/answer explanation 
 Human: which is your favorite character in this ? Bot: my favorite character was obviously the main character because through his perseverance he was able to escape a dangerous situation . � 52

  51. Wrap-up • SERP-grounded conversations • General idea, recent advances, challenges and ambitions • Work in progress � 53

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend