Learning to Ask Good Questions: Ranking Clarification Questions - - PowerPoint PPT Presentation

learning to ask good questions ranking clarification
SMART_READER_LITE
LIVE PREVIEW

Learning to Ask Good Questions: Ranking Clarification Questions - - PowerPoint PPT Presentation

Learning to Ask Good Questions: Ranking Clarification Questions using Neural Expected Value of Perfect Information Sudha Rao 1 and Hal Daum III 1,2 1 University of Maryland College Park 2 Microsoft Research New York CODE + DATA :


slide-1
SLIDE 1

Learning to Ask Good Questions: Ranking Clarification Questions using Neural Expected Value of Perfect Information

Sudha Rao1 and Hal Daumé III1,2

1University of Maryland

College Park

2Microsoft Research

New York

CODE + DATA: https://github.com/raosudha89/ranking_clarification_questions

slide-2
SLIDE 2

2

Natural Language Understanding

slide-3
SLIDE 3

3

Natural Language Understanding

How long does it take to get a PhD?

slide-4
SLIDE 4

4

Natural Language Understanding

How long does it take to get a PhD? Give me a recipe for lasagna

slide-5
SLIDE 5

5

Please bring me my coffee mug from the kitchen

Natural Language Understanding

How long does it take to get a PhD? Give me a recipe for lasagna

slide-6
SLIDE 6

6

Please bring me my coffee mug from the kitchen

Natural Language Understanding

How long does it take to get a PhD? Give me a recipe for lasagna

slide-7
SLIDE 7

7

Human Interactions

slide-8
SLIDE 8

8

Human Interactions

Please bring me my coffee mug from the kitchen

slide-9
SLIDE 9

9

Human Interactions

Please bring me my coffee mug from the kitchen

slide-10
SLIDE 10

10

What color is your coffee mug?

Human Interactions

Please bring me my coffee mug from the kitchen

slide-11
SLIDE 11

11

Teach Machines to Ask Clarification Questions

slide-12
SLIDE 12

Teach Machines to Ask Clarification Questions

12

How long does it take to get a PhD ? In which field?

slide-13
SLIDE 13

Teach Machines to Ask Clarification Questions

13

Give me a recipe for lasagna How long does it take to get a PhD ? In which field? Any dietary restrictions?

slide-14
SLIDE 14

Teach Machines to Ask Clarification Questions

14

Please bring me my coffee mug from the kitchen What color is your coffee mug? How long does it take to get a PhD ? In which field? Any dietary restrictions? Give me a recipe for lasagna

slide-15
SLIDE 15

Teach Machines to Ask Clarification Questions

15

Please bring me my coffee mug from the kitchen What color is your coffee mug? Context-aware questions about missing information Any dietary restrictions? How long does it take to get a PhD ? In which field? Give me a recipe for lasagna

slide-16
SLIDE 16
  • Heilman. “Automatic factual question generation from text” Ph.D. thesis 2011
  • Vasile, et al. "The first question generation shared task evaluation challenge.” NLG 2010
  • Olney, Graesser, and Person. "Question generation from concept maps." Dialogue & Discourse 2012
  • Chali and Hasan. "Towards Topic-to-Question Generation." ACL 2015
  • Serban, et al. "Generating Factoid Questions With Recurrent Neural Networks” ACL 2016
  • Du, Shao & Cardie "Learning to ask: Neural question generation for reading comprehension" ACL 2017
  • Tang et al. "Learning to Collaborate for Question Answering and Asking." NAACL 2018
  • Mrinmaya and Xing. "Self-Training for Jointly Learning to Ask and Answer Questions." NAACL 2018

16

Reading Comprehension Question Generation

My class is going to the movies on a field trip next week. We have to get permission slips signed before we go. We are going to see a movie that tells the story from a book we read. Q: What do the students need to do before going to the movies? Goal: Assess someone’s understanding of the text

slide-17
SLIDE 17
  • Goddeau, et al. "A form-based dialogue manager for spoken language applications.” 1996
  • Bobrow., et al. "GUS, a frame-driven dialog system." Artificial intelligence 1977
  • Lemon, et al. "An ISU dialogue system exhibiting reinforcement learning of dialogue policies: generic

slot-filling in the TALK in-car system.” EACL 2006

  • Williams, et al. The Dialog State Tracking Challenge” SIGDIAL 2013
  • Young, et al. “Pomdp-based statistical spoken dialog systems: A review.” IEEE 2013
  • Dhingra, et al. "Towards End-to-End Reinforcement Learning of Dialogue Agents for Information

Access.” ACL 2017

  • Bordes, et al. "Learning end-to-end goal-oriented dialog.” ICLR 2017

17

Question Generation for Slot Filling

USER: I want to go to Melbourne on July 14 SYSTEM: What time do you want to leave? USER: I must be in Melbourne by 11 am SYSTEM: Would you like a Delta flight that arrives at 10.15 am? USER: Sure SYSTEM: In what name should I make the reservation? <origin city> <departure city> <origin time> <departure time> <airline>

SLOTS

slide-18
SLIDE 18
  • Liu, et al. “Automatic question generation for literature review writing support."

International Conference on Intelligent Tutoring Systems. 2010

  • Penas and Hovy, “Filling knowledge gaps in text for machine reading” International

Conference on Computational Linguistics: Posters ACL 2010

  • Artzi & Zettlemoyer, “Bootstrapping semantic parsers from conversations” EMNLP 2011
  • Labutov, et al.“Deep questions without deep understanding” ACL 2015
  • Mostafazadeh et al. "Generating natural questions about an image." ACL 2016
  • Mostafazadeh et al. "Multimodal Context for Natural Question and Response

Generation.” IJCNLP 2017.

  • Rothe, Lake and Gureckis. “Question asking as program generation” NIPS 2017.

18

Other types of Question Generation

slide-19
SLIDE 19
  • Clarification Questions Dataset
  • Problem Formulation: Question Ranking
  • Expected Value of Perfect Information (EVPI) inspired model
  • Evaluation
  • Conclusion

19

Talk Outline

slide-20
SLIDE 20
  • Clarification Questions Dataset
  • Problem Formulation: Question Ranking
  • Expected Value of Perfect Information (EVPI) inspired model
  • Evaluation
  • Conclusion

20

Talk Outline

slide-21
SLIDE 21

21

StackExchange Question-Answer Forum

Clarification Questions Dataset

slide-22
SLIDE 22

22

How to configure path or set environment variables for installation?

I'm aiming to install ape, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance!

Clarification Questions Dataset

slide-23
SLIDE 23

23

How to configure path or set environment variables for installation?

I'm aiming to install ape, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance!

Clarification Questions Dataset

Asaduzzaman, Muhammad, et al. "Answering questions about unanswered questions of stack overflow.” Working Conference on Mining Software Repositories. IEEE Press, 2013.

Finding: Questions go unanswered for a long time if they are not clear enough

slide-24
SLIDE 24

Question comment

24

How to configure path or set environment variables for installation?

I'm aiming to install ape, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance! What version of ubuntu do you have?

Clarification Questions Dataset

slide-25
SLIDE 25

Question comment

25

How to configure path or set environment variables for installation?

I'm aiming to install ape, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance!

Initial Post

What version of ubuntu do you have? I'm aiming to install ape in Ubuntu 14.04 LTS, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance!

Updated Post

Clarification Questions Dataset

slide-26
SLIDE 26

Edit as an answer to the question Question comment

26

How to configure path or set environment variables for installation?

I'm aiming to install ape, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance!

Initial Post

What version of ubuntu do you have? I'm aiming to install ape in Ubuntu 14.04 LTS, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance!

Updated Post

Clarification Questions Dataset

slide-27
SLIDE 27

Edit as an answer to the question Question comment

27

How to configure path or set environment variables for installation?

I'm aiming to install ape, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance! What version of ubuntu do you have? I'm aiming to install ape in Ubuntu 14.04 LTS, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance!

Updated Post

Clarification Questions Dataset

Initial Post

slide-28
SLIDE 28

( , , ) triples

Original post Clarification question posted in comments Edit made to the post in response to the question OR author’s reply to the question comment Dataset Size: ~77 K triples Domains: Askubuntu, Unix, Superuser answer question post

Dataset Creation

28

Clarification Questions Dataset

Note: We identify a question using the question mark (?) token. We match the edit to the answer using timestamp & word embedding similarity based heuristics.

post question answer

slide-29
SLIDE 29
  • Clarification Questions Dataset
  • Problem Formulation: Question Ranking
  • Expected Value of Perfect Information (EVPI) inspired model
  • Evaluation
  • Conclusion

29

Talk Outline

slide-30
SLIDE 30

30

Problem Formulation: Question Ranking

Post

How to configure path

  • r set environment

variables? …

slide-31
SLIDE 31

31

Problem Formulation: Question Ranking

Post

How to configure path

  • r set environment

variables? …

Generate Question Candidates

What version of Ubuntu do you have? What is the make

  • f your wifi card?

What OS are you using?

slide-32
SLIDE 32

32

Problem Formulation: Question Ranking

Post

How to configure path

  • r set environment

variables? …

Generate Question Candidates Rank the question candidates

What version of Ubuntu do you have? What is the make

  • f your wifi card?

What OS are you using? What version of Ubuntu do you have? What OS are you using? What is the make

  • f your wifi card?
slide-33
SLIDE 33

33

Problem Formulation: Question Ranking

How to configure path or set environment variables for installation? I'm aiming to install ape, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance!

What version of Ubuntu do you have? How are you installing ape? Shortlist of useful questions Do you have GSL installed?

slide-34
SLIDE 34
  • Clarification Questions Dataset
  • Problem Formulation: Question Ranking
  • Expected Value of Perfect Information (EVPI) inspired model
  • Evaluation
  • Conclusion

34

Talk Outline

slide-35
SLIDE 35

(a) What version of Ubuntu do you have? à Just right

35

Expected Value of Perfect Information (EVPI) inspired model

Possible questions

Key Idea

How to configure path or set environment variables for installation? I'm aiming to install ape, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance!

slide-36
SLIDE 36

(a) What version of Ubuntu do you have? à Just right (b) What is the make of your wifi card? à Not useful

36

Expected Value of Perfect Information (EVPI) inspired model

Possible questions

Key Idea

How to configure path or set environment variables for installation? I'm aiming to install ape, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance!

slide-37
SLIDE 37

(a) What version of Ubuntu do you have? à Just right (b) What is the make of your wifi card? à Not useful (c) Are you running Ubuntu 14.10 kernel 4.4.0-59- generic on an x86 64 architecture? à Unlikely to add value

37

Expected Value of Perfect Information (EVPI) inspired model

Possible questions

Key Idea

How to configure path or set environment variables for installation? I'm aiming to install ape, a simple code for pseudopotential generation. I'm having this error message while running ./configure <error message> So I have the library but the program installation isn't finding it. Any help? Thanks in advance!

slide-38
SLIDE 38
  • Use EVPI to identify questions that add the most value to the given post

38

Expected Value of Perfect Information (EVPI) inspired model

Avriel, Mordecai, and A. C. Williams. "The value of information and stochastic programming." Operations Research 18.5 (1970)

slide-39
SLIDE 39
  • Use EVPI to identify questions that add the most value to the given post
  • Definition: Value of Perfect Information VPI (x)

How much value does x add to a given information content c?

39

Expected Value of Perfect Information (EVPI) inspired model

Avriel, Mordecai, and A. C. Williams. "The value of information and stochastic programming." Operations Research 18.5 (1970)

slide-40
SLIDE 40
  • Use EVPI to identify questions that add the most value to the given post
  • Definition: Value of Perfect Information VPI (x)

How much value does x add to a given information content c?

  • Since we have not acquired x, we define its value in expectation

40

Expected Value of Perfect Information (EVPI) inspired model

Avriel, Mordecai, and A. C. Williams. "The value of information and stochastic programming." Operations Research 18.5 (1970)

slide-41
SLIDE 41
  • Use EVPI to identify questions that add the most value to the given post
  • Definition: Value of Perfect Information VPI (x)

How much value does x add to a given information content c?

  • Since we have not acquired x, we define its value in expectation

41

Expected Value of Perfect Information (EVPI) inspired model Likelihood of x given c EVPI (x|c) = P (x|c) Utility(x, c) Value of updating c with x x X

Avriel, Mordecai, and A. C. Williams. "The value of information and stochastic programming." Operations Research 18.5 (1970)

slide-42
SLIDE 42

42

EVPI formulation for our problem

Expected Value of Perfect Information (EVPI) inspired model

slide-43
SLIDE 43

: question from set of question candidates Q EVPI ( | )

43

EVPI formulation for our problem

Expected Value of Perfect Information (EVPI) inspired model

qi p qi p

: given post

slide-44
SLIDE 44

: question from set of question candidates Q EVPI ( | ) = P ( | , )

44

EVPI formulation for our problem

Expected Value of Perfect Information (EVPI) inspired model

qi p aj p qi qi p

: given post

Likelihood of aj being the answer to qi on post p

slide-45
SLIDE 45

: question from set of question candidates Q EVPI ( | ) = P ( | , ) U( + )

45

EVPI formulation for our problem

Expected Value of Perfect Information (EVPI) inspired model

qi p aj p qi p aj qi p

: given post

Utility of updating the post p with answer aj Likelihood of aj being the answer to qi on post p

slide-46
SLIDE 46

: answer from set of answer candidates A : question from set of question candidates Q EVPI ( | ) = P ( | , ) U( + )

46

EVPI formulation for our problem

Expected Value of Perfect Information (EVPI) inspired model

qi

A

p aj p qi p aj aj qi p aj

: given post

Likelihood of aj being the answer to qi on post p Utility of updating the post p with answer aj

slide-47
SLIDE 47

: answer from set of answer candidates A : question from set of question candidates Q EVPI ( | ) = P ( | , ) U( + )

47

EVPI formulation for our problem

Expected Value of Perfect Information (EVPI) inspired model We rank questions based on their EVPI value

qi

A

p aj p qi p aj aj qi p aj

: given post

Likelihood of aj being the answer to qi on post p Utility of updating the post p with answer aj

slide-48
SLIDE 48

1

: answer from set of answer candidates A : question from set of question candidates Q EVPI ( | ) = P ( | , ) U( + )

48

EVPI formulation for our problem

Expected Value of Perfect Information (EVPI) inspired model

qi

A

p aj p qi p aj aj qi p aj

: given post

Likelihood of aj being the answer to qi on post p Utility of updating the post p with answer aj

slide-49
SLIDE 49

1

: answer from set of answer candidates A : question from set of question candidates Q EVPI ( | ) = P ( | , ) U( + )

49

EVPI formulation for our problem

Expected Value of Perfect Information (EVPI) inspired model

qi

A

p aj p qi p aj aj qi p aj

: given post

2

Likelihood of aj being the answer to qi on post p Utility of updating the post p with answer aj

slide-50
SLIDE 50

: answer from set of answer candidates A : question from set of question candidates Q EVPI ( | ) = P ( | , ) U( + )

50

EVPI formulation for our problem

Expected Value of Perfect Information (EVPI) inspired model

qi

A

p aj p qi p aj aj qi p aj

: given post

2

Answer Modeling

1

Utility of updating the post p with answer aj

slide-51
SLIDE 51

Utility of updating the post p with answer aj

: answer from set of answer candidates A : question from set of question candidates Q EVPI ( | ) = P ( | , ) U( + )

51

EVPI formulation for our problem

Expected Value of Perfect Information (EVPI) inspired model

qi

A

p aj p qi p aj aj qi p aj

: given post

2 1 3

Answer Modeling

slide-52
SLIDE 52

: answer from set of answer candidates A : question from set of question candidates Q EVPI ( | ) = P ( | , ) U( + )

52

EVPI formulation for our problem

Expected Value of Perfect Information (EVPI) inspired model

qi

A

p aj p qi p aj aj qi p aj

: given post

2 3

Utility Calculator

1

Answer Modeling

slide-53
SLIDE 53

Question & Answer Candidate Generator

53

1

Expected Value of Perfect Information (EVPI) inspired model

slide-54
SLIDE 54

Question & Answer Candidate Generator

54

1

Expected Value of Perfect Information (EVPI) inspired model

Post p as query Dataset of (post, question, answer) Post as Documents Lucene Search Engine

slide-55
SLIDE 55

Question & Answer Candidate Generator

55

1

Expected Value of Perfect Information (EVPI) inspired model

Post p as query Dataset of (post, question, answer) Post as Documents Lucene Search Engine p1 p2 pj p10 Ten posts similar to given post p

slide-56
SLIDE 56

Question & Answer Candidate Generator

56

1

Expected Value of Perfect Information (EVPI) inspired model

Post p as query Dataset of (post, question, answer) Post as Documents Lucene Search Engine p1 p2 pj p10 q1 q2 qj q10 Ten posts similar to given post p Questions paired with those posts

slide-57
SLIDE 57

Question & Answer Candidate Generator

57

1

Expected Value of Perfect Information (EVPI) inspired model

Post p as query Dataset of (post, question, answer) Post as Documents Lucene Search Engine p1 p2 pj p10 q1 q2 qj q10 a1 a2 aj a10 Ten posts similar to given post p Questions paired with those posts Answers paired with those posts

slide-58
SLIDE 58

Question & Answer Candidate Generator

58

1

Expected Value of Perfect Information (EVPI) inspired model

Post p as query Dataset of (post, question, answer) Post as Documents Lucene Search Engine p1 p2 pj p10 q1 q2 qj q10 a1 a2 aj a10 Ten posts similar to given post p Questions paired with those posts Answers paired with those posts

slide-59
SLIDE 59

59

2

Answer Modeling

Expected Value of Perfect Information (EVPI) inspired model EVPI ( | ) = P ( | , ) U( + )

qi

A

p aj p qi p aj aj

slide-60
SLIDE 60

60

2

Answer Modeling

Expected Value of Perfect Information (EVPI) inspired model

P ( | , )

aj p qi

slide-61
SLIDE 61

61

2

Answer Modeling

Expected Value of Perfect Information (EVPI) inspired model

Post LSTM Question LSTM Word embedding module p qi p qi

P ( | , )

aj p qi

slide-62
SLIDE 62

62

2

Answer Modeling

Expected Value of Perfect Information (EVPI) inspired model

p qi Post LSTM Question LSTM Word embedding module Feedforward p qi p qi

P ( | , ) Embans( , )

aj p qi

slide-63
SLIDE 63

P ( | , ) ≈ cosine_sim(Embans( , ), )

p qi

63

2

Answer Modeling

Post LSTM

Expected Value of Perfect Information (EVPI) inspired model

Question LSTM Word embedding module aj p qi Feedforward p qi p qi aj aj Average

slide-64
SLIDE 64

p qi

64

2

Answer Modeling

Post LSTM

Expected Value of Perfect Information (EVPI) inspired model

Question LSTM Word embedding module Feedforward p qi p qi aj aj Average

Training

p qi

P ( | , ) ≈ cosine_sim(Embans( , ), )

aj p qi

Embans( , )

slide-65
SLIDE 65

p qi

65

2

Answer Modeling

Post LSTM

Expected Value of Perfect Information (EVPI) inspired model

Question LSTM Word embedding module Feedforward p qi p qi aj aj qi : Which version of Ubuntu do you have? Average p qi ai : Ubuntu 14.04 LTS

Training

Close to true ai paired with p

P ( | , ) ≈ cosine_sim(Embans( , ), )

aj p qi

Embans( , )

slide-66
SLIDE 66

p qi

66

2

Answer Modeling

Post LSTM

Expected Value of Perfect Information (EVPI) inspired model

Question LSTM Word embedding module Feedforward p qi p qi aj aj qi : Which version of Ubuntu do you have? qK : What OS are you using? Average p qi

Embans( , )

ak : Ubuntu 11.10 ai : Ubuntu 14.04 LTS

Training

Close to ak paired with qk similar to true qi Close to true ai paired with p

P ( | , ) ≈ cosine_sim(Embans( , ), )

aj p qi

slide-67
SLIDE 67

67

3

Utility Calculator

Expected Value of Perfect Information (EVPI) inspired model EVPI ( | ) = P ( | , ) U( + )

qi

A

p aj p qi p aj aj

slide-68
SLIDE 68

68

3

Utility Calculator

Expected Value of Perfect Information (EVPI) inspired model U( + )

p aj

slide-69
SLIDE 69

p qi aj

69

3

Utility Calculator

Expected Value of Perfect Information (EVPI) inspired model U( + )

p aj

Post LSTM Question LSTM Answer LSTM Word embedding module

Feedforward p qi aj

Value between 0 and 1

slide-70
SLIDE 70

p qi aj

70

3

Utility Calculator

Expected Value of Perfect Information (EVPI) inspired model U( + )

p aj

Post LSTM Question LSTM Answer LSTM Word embedding module

Feedforward p qi aj

Training Value between 0 and 1

slide-71
SLIDE 71

p qi aj

71

3

Utility Calculator

Expected Value of Perfect Information (EVPI) inspired model U( + )

p aj

Post LSTM Question LSTM Answer LSTM Word embedding module

Feedforward p qi aj

y = 1

qi : Which version of Ubuntu do you have? ai : Ubuntu 14.04 LTS

Training Value between 0 and 1 Label

slide-72
SLIDE 72

p qi aj

72

3

Utility Calculator

Expected Value of Perfect Information (EVPI) inspired model U( + )

p aj

Post LSTM Question LSTM Answer LSTM Word embedding module

Feedforward p qi aj

y = 0 y = 1 y = 0

qi : Which version of Ubuntu do you have? ai : Ubuntu 14.04 LTS qj : What OS are you using? aj : Ubuntu 11.10 qk : What is the make of your wifi card? ak : TP-Link TL-WDN4800

Training Value between 0 and 1 Label

slide-73
SLIDE 73

p qi aj

73

3

Utility Calculator

Expected Value of Perfect Information (EVPI) inspired model U( + )

p aj

Post LSTM Question LSTM Answer LSTM Word embedding module

Feedforward p qi aj

y = 0 y = 1 y = 0

qi : Which version of Ubuntu do you have? ai : Ubuntu 14.04 LTS qj : What OS are you using? aj : Ubuntu 11.10 qk : What is the make of your wifi card? ak : TP-Link TL-WDN4800

Training (Minimize binary cross-entropy) Value between 0 and 1 Label

slide-74
SLIDE 74

74

Expected Value of Perfect Information (EVPI) inspired model

Question & Answer Candidate Generator Answer Modeling Utility Calculator

3 2 1

slide-75
SLIDE 75

Train time behavior: For each (p, q, a) in our train set

  • 1. Generate question candidates (Q) and answer candidates (A)
  • 2. Train Answer Model and Utility Calculator

using joint loss function : lossans (p, q, a, Q) + lossutil (y, p, q, a)

75

Expected Value of Perfect Information (EVPI) inspired model

Question & Answer Candidate Generator Answer Modeling Utility Calculator

3 2 1

slide-76
SLIDE 76

Train time behavior: For each (p, q, a) in our train set

  • 1. Generate question candidates (Q) and answer candidates (A)
  • 2. Train Answer Model and Utility Calculator

using joint loss function : lossans (p, q, a, Q) + lossutil (y, p, q, a) Test time behavior: Given a post from our test set

  • 1. Generate question candidates (Q) and answer candidates (A)
  • 2. Calculate P(aj |p, qi) for each q Q using Answer Model
  • 3. Calculate U(p + aj) for each a A using Utility Calculator
  • 4. Rank questions by EVPI (qi | p) = P(aj | p, qi) U(p + aj)

76

Expected Value of Perfect Information (EVPI) inspired model

Question & Answer Candidate Generator Answer Modeling Utility Calculator

3 2 1

aj A

slide-77
SLIDE 77
  • Clarification Questions Dataset
  • Problem Formulation: Question Ranking
  • Expected Value of Perfect Information (EVPI) inspired model
  • Evaluation
  • Conclusion

77

Talk Outline

slide-78
SLIDE 78

78

Evaluation

Too much disk read/write when launching an application I have Xubuntu 13.04 on an old Dell Inspiron. When I launch an application it takes a pretty long time to be launched and I see a lot of disk read/write. If the system was short on memory, this would be understandable as the system would use swap. But that's not the case in my situation (i.e. I have this problem even when the RAM is almost empty). 1. How much ram do you have installed ? and what size it the swap disk partition ? 2. If you do not have any problem with getting a little techy then may i suggest a method ? 3. How is it slow exactly ? boot time ? hdd read/write ? cpu time ? graphics rendering ? 4. What is the longest time you have let it run ? 5. This may be a silly question but ... did you make your usb stick bootable ? 6. Do your system were recently updated ? 7. Why not have two ssds in raid 1 for redundancy ? 8. Is that a `parted -- list` on the synology device ? 9. Can you tell us a little about your configuration ?

  • 10. Did you turn hardware virtualization on in bios/efi ?

Question Candidates Post

slide-79
SLIDE 79

79

Evaluation

Too much disk read/write when launching an application I have Xubuntu 13.04 on an old Dell Inspiron. When I launch an application it takes a pretty long time to be launched and I see a lot of disk read/write. If the system was short on memory, this would be understandable as the system would use swap. But that's not the case in my situation (i.e. I have this problem even when the RAM is almost empty). 1. How much ram do you have installed ? and what size it the swap disk partition ? 2. If you do not have any problem with getting a little techy then may i suggest a method ? 3. How is it slow exactly ? boot time ? hdd read/write ? cpu time ? graphics rendering ? 4. What is the longest time you have let it run ? 5. This may be a silly question but ... did you make your usb stick bootable ? 6. Do your system were recently updated ? 7. Why not have two ssds in raid 1 for redundancy ? 8. Is that a `parted -- list` on the synology device ? 9. Can you tell us a little about your configuration ?

  • 10. Did you turn hardware virtualization on in bios/efi ?

Question Candidates Post Contains more than one good question!

slide-80
SLIDE 80
  • We recruit 10 Unix admin experts using UpWork
  • Given a post and the set of ten question candidates

§ Mark the one best question § Mark any other valid questions

80

Evaluation Evaluation set creation process

slide-81
SLIDE 81
  • We recruit 10 Unix admin experts using UpWork
  • Given a post and the set of ten question candidates

§ Mark the one best question § Mark any other valid questions

  • We annotate a total of 500 posts from our test set
  • Each post is annotated by two experts

81

Evaluation Evaluation set creation process

slide-82
SLIDE 82
  • We recruit 10 Unix admin experts using UpWork
  • Given a post and the set of ten question candidates

§ Mark the one best question § Mark any other valid questions

  • We annotate a total of 500 posts from our test set
  • Each post is annotated by two experts
  • Union of Bests: Questions marked as best by either of the annotators
  • Intersection of Valids: Questions marked as valid by both annotators

82

Evaluation Evaluation set creation process Best Valid A1 A2 A1 A2

Q1 Q2 Q3 Q4 Q5

Union of Bests: { Q2, Q3 } Intersection of Valids: { Q1, Q3, Q5 }

slide-83
SLIDE 83

83

Evaluation

Baseline Models

  • 1. Random: Randomly permute the 10 candidate questions
slide-84
SLIDE 84

84

Evaluation

Baseline Models

  • 1. Random: Randomly permute the 10 candidate questions
  • 2. Bag-of-ngrams: Train linear classifier using bag-of-ngrams of p, q and a
slide-85
SLIDE 85

85

Evaluation

Baseline Models

  • 1. Random: Randomly permute the 10 candidate questions
  • 2. Bag-of-ngrams: Train linear classifier using bag-of-ngrams of p, q and a
  • 3. Community QA (Nakov et al., 2017) :
  • SemEval Task: Rank comments by relevance to post on Qatar Living
  • Winning model: Logistic regression trained with string similarity & word

embedding based features (Nandi et al., 2017)

  • Our baseline: We retrain this model on our dataset
slide-86
SLIDE 86

86

Evaluation

Baseline Models

  • 1. Random: Randomly permute the 10 candidate questions
  • 2. Bag-of-ngrams: Train linear classifier using bag-of-ngrams of p, q and a
  • 3. Community QA (Nakov et al., 2017) :
  • SemEval Task: Rank comments by relevance to post on Qatar Living
  • Winning model: Logistic regression trained with string similarity & word

embedding based features (Nandi et al., 2017)

  • Our baseline: We retrain this model on our dataset
  • 4. Neural (p, q)

Post LSTM

p qi qi Word embedding module

Feedforward

p

Ques LSTM

Value between 0 and 1

slide-87
SLIDE 87

87

Evaluation

Baseline Models

  • 1. Random: Randomly permute the 10 candidate questions
  • 2. Bag-of-ngrams: Train linear classifier using bag-of-ngrams of p, q and a
  • 3. Community QA (Nakov et al., 2017) :
  • SemEval Task: Rank comments by relevance to post on Qatar Living
  • Winning model: Logistic regression trained with string similarity & word

embedding based features (Nandi et al., 2017)

  • Our baseline: We retrain this model on our dataset
  • 4. Neural (p, q)
  • 5. Neural (p, q, a)

Post LSTM

pi qi qi ai ai

Word embedding module Ques LSTM Ans LSTM Feedforward

pi Value between 0 and 1

slide-88
SLIDE 88

88

Evaluation

Baseline Models

  • 1. Random: Randomly permute the 10 candidate questions
  • 2. Bag-of-ngrams: Train linear classifier using bag-of-ngrams of p, q and a
  • 3. Community QA (Nakov et al., 2017) :
  • SemEval Task: Rank comments by relevance to post on Qatar Living
  • Winning model: Logistic regression trained with string similarity & word

embedding based features (Nandi et al., 2017)

  • Our baseline: We retrain this model on our dataset
  • 4. Neural (p, q)
  • 5. Neural (p, q, a)

Both Neural (p, q, a) and EVPI (q | p, a) have similar no. of parameters

Post LSTM

pi qi qi ai ai

Word embedding module Ques LSTM Ans LSTM Feedforward

pi Value between 0 and 1

slide-89
SLIDE 89

89

Evaluation Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, q, a) EVPI

Precision @1

Union of Best

RESULTS

slide-90
SLIDE 90

90

Evaluation 17.5 19.4 5 10 15 20 25 30 35 40 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, q, a) EVPI

Precision @1

Union of Best

RESULTS

slide-91
SLIDE 91

91

Evaluation 17.5 19.4 25.2 5 10 15 20 25 30 35 40 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, q, a) EVPI

Precision @1

Union of Best

RESULTS

Non-linear vs linear

slide-92
SLIDE 92

92

Evaluation 17.5 19.4 21.9 25.2 5 10 15 20 25 30 35 40 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, q, a) EVPI

Precision @1

Union of Best

RESULTS

Explicitly modeling “answer” is useful

slide-93
SLIDE 93

17.5 19.4 23.1 21.9 25.2 5 10 15 20 25 30 35 40 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, q, a) EVPI

Precision @1

Union of Best

93

Evaluation

RESULTS

Both use only (p, q)

slide-94
SLIDE 94

94

Evaluation 17.5 19.4 23.1 21.9 25.2 27.7 5 10 15 20 25 30 35 40 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, q, a) EVPI

Precision @1

Union of Best

RESULTS

Note: Difference between EVPI and all baselines is statistically significant with p < 0.05

slide-95
SLIDE 95

95

Evaluation 17.5 19.4 23.1 21.9 25.2 27.7 5 10 15 20 25 30 35 40 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, q, a) EVPI

Precision @1

Union of Best

RESULTS

Note: Difference between EVPI and all baselines is statistically significant with p < 0.05

Mainly differ in their loss function

slide-96
SLIDE 96

96

Evaluation

17.5 19.4 23.1 21.9 25.2 27.7 26.4 25.6 33.6 31.6 34.4 36.1

5 10 15 20 25 30 35 40 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, q, a) EVPI

Precision @1

Intersection of Valids Union of Best

RESULTS

Note: Difference between EVPI and all baselines is statistically significant with p < 0.05

slide-97
SLIDE 97

97

Evaluation

17.5 19.4 21.2 20.9 22.7 23.4 26.4 27.6 30.8 30 31.8 32.2

5 10 15 20 25 30 35 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, q, a) EVPI Precision @3 Intersection of Valids Union of Best

RESULTS

Not statistically significant

slide-98
SLIDE 98
  • Clarification Questions Dataset
  • Problem Formulation: Question Ranking
  • Expected Value of Perfect Information (EVPI) inspired model
  • Evaluation
  • Conclusion

98

Talk Outline

slide-99
SLIDE 99

ü Key Contributions § Create a dataset of ~77K clarification questions (and answers) with context § Introduce novel model that integrates deep learning with classic notion of expected value of perfect information § Create an evaluation set of size 500 with expert human annotations

99

CONCLUSION

slide-100
SLIDE 100

ü Key Contributions § Create a dataset of ~77K clarification questions (and answers) with context § Introduce novel model that integrates deep learning with classic notion of expected value of perfect information § Create an evaluation set of size 500 with expert human annotations ü Key findings § A context can have multiple good clarification question § Explicitly modeling the answer helps in identifying good questions § EVPI formalism provides leverage over similarly expressive feedforward network

100

CONCLUSION

slide-101
SLIDE 101

ü Key Contributions § Create a dataset of ~77K clarification questions (and answers) with context § Introduce novel model that integrates deep learning with classic notion of expected value of perfect information § Create an evaluation set of size 500 with expert human annotations ü Key findings § A context can have multiple good clarification question § Explicitly modeling the answer helps in identifying good questions § EVPI formalism provides leverage over similarly expressive feedforward network ü Future work § Sequence-to-sequence based question generation model § Multi-turn question generation § How to automatically evaluate performance?

101

CONCLUSION

slide-102
SLIDE 102

ü Key Contributions § Create a dataset of ~77K clarification questions (and answers) with context § Introduce novel model that integrates deep learning with classic notion of expected value of perfect information § Create an evaluation set of size 500 with expert human annotations ü Key findings § A context can have multiple good clarification question § Explicitly modeling the answer helps in identifying good questions § EVPI formalism provides leverage over similarly expressive feedforward network ü Future work § Sequence-to-sequence based question generation model § Multi-turn question generation § How to automatically evaluate performance?

102

CONCLUSION CODE + DATA: https://github.com/raosudha89/ranking_clarification_questions

slide-103
SLIDE 103

Backup Slides

103

slide-104
SLIDE 104

104

Evaluation

17.5 19.4 23.1 21.9 24.1 25.2 27.7 26.4 25.6 33.6 31.6 32.3 34.4 36.1

5 10 15 20 25 30 35 40 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, a) Neural (p, q, a) EVPI

Precision @ 1

Intersection of Valids Union of Best

RESULTS

slide-105
SLIDE 105

105

Evaluation

17.5 19.4 21.2 20.9 23.5 22.7 23.4 26.4 27.6 30.8 30 31.5 31.8 32.2

5 10 15 20 25 30 35 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, a) Neural (p, q, a) EVPI

Precision @3

Intersection of Valids Union of Best

RESULTS

slide-106
SLIDE 106

106

Evaluation

35.2 34.4 40.2 39.2 41.4 42.5 43.6 42.1 42.7 47 45.5 46.5 47.7 49.2

10 20 30 40 50 60 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, a) Neural (p, q, a) EVPI

Mean Average Precision

Intersection of Valids Union of Best

RESULTS

slide-107
SLIDE 107

107

Evaluation 10 10.7 18.5 15.4 20.5 21.4 5 10 15 20 25 30 35 40 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, q, a) EVPI

Precision

True Question

RESULTS

Note: Difference between EVPI and all baselines is statistically significant with p < 0.05

slide-108
SLIDE 108

108

Evaluation

17.4 16.3 22.6 20.6 22.6 22.2 23.7

5 10 15 20 25 30 Random Bag-of-ngrams Community QA Neural (p, q) Neural (p, a) Neural (p, q, a) EVPI Precision @1 Union of Best (with true removed)

Not statistically significant

RESULTS

slide-109
SLIDE 109

109

SAMPLE OUTPUT

Too much disk read/write when launching an application I have Xubuntu 13.04 on an old Dell Inspiron. When I launch an application it takes a pretty long time to be launched and I see a lot of disk read/write. If the system was short on memory, this would be understandable as the system would use swap. But that's not the case in my situation (i.e. I have this problem even when the RAM is almost empty). 0.21 How much ram do you have installed? and what size is the swap disk partition 0.18 Can you tell us a little about your configuration ? 0.17 What is the longest time you have let it run ? 0.11 How is it slow exactly ? boot time ? hdd read/write ? cpu time ? 0.00 If you do not have any problem with getting a little techy may i suggest a method ? 0.00 This may be a silly question but ... did you make your usb stick bootable ? 0.00 Do your system were recently updated ? 0.00 Why not have two ssds in raid 1 for redundancy ? 0.00 Is that a `parted -- list` on the synology device ? 0.00 Did you turn hardware virtualization on in bios/efi ?

Post

Best Valid EVPI value Ranking of Question Candidates

slide-110
SLIDE 110

110

0.24 I doubt it, shutdown and reboot are exactly identical! are you really rebooting? 0.13 Be clear about the problem. Is Ubuntu not showing them even though they are present? 0.11 What is 4g wifi connection? 0.09 Can you type `iwconfig` in terminal and paste what it returns here? 0.09 What does this tell us? 0.08 If I post it as an answer, would you kindle mark as such? 0.06 Which Ubuntu 15? 0.06 What exactly do you mean by make fails? 0.05 Welcome to ask Ubuntu! ; - ) Is the wireless lan disabled in the bios? 0.00 Is Ubuntu detecting your wireless card ? **iwconfig** does list your card? Best Valid EVPI value Ranking of Question Candidates No wifi after restart in Ubuntu 16.04 After upgrading to 16.04, there is no wifi whenever I restart the system. My wireless interface of Ubuntu is RT3290 Wireless 802.11n 1T/1R PCIe On iwconfig I got the following eth0 no wireless extensions…. Currently to start wifi again I have to shutdown, then boot the system again. How to fix the problem?

SAMPLE OUTPUT

slide-111
SLIDE 111

111

LONG SHORT TERM MEMORY (LSTM)

Sepp Hochreiter and J¨urgen Schmidhuber. 1997. Long short-term memory. Neural computation , 9(8):1735–1780. Jeffrey Pennington, Richard Socher, and Christopher D Manning. 2014. “Glove: Global vectors for word representation” In Empirical Methods on Natural Language Processing.

Expected Value of Perfect Information (EVPI) inspired model