CRQA: Crowd-powered Real-time Automated Question Answering System
Denis Savenkov
Emory University
dsavenk@emory.edu HCOMP, Austin, TX October 31, 2016
Eugene Agichtein
Emory University
eugene@mathcs.emory.edu
CRQA: Crowd-powered Real-time Automated Question Answering System - - PowerPoint PPT Presentation
CRQA: Crowd-powered Real-time Automated Question Answering System Denis Savenkov Eugene Agichtein Emory University Emory University dsavenk@emory.edu eugene@mathcs.emory.edu HCOMP, Austin, TX October 31, 2016 Volume of question search
dsavenk@emory.edu HCOMP, Austin, TX October 31, 2016
eugene@mathcs.emory.edu
[1] “Questions vs. Queries in Informational Search Tasks”, Ryen W. White et al, WWW 2015
2
3
4
(AP Photo/Jeopardy Productions, Inc.)
5
6
7
8
9
https://sites.google.com/site/trecliveqa2016/
10
11
12
a. CQA archives i. Yahoo! Answers ii. Answers.com iii. WikiHow b. Web search API
a. Answers to retrieved questions b. Content blocks from regular web pages
13
14
○ Offline crowdsourcing of answers for long-tail search queries
○ Using crowd to perform complex operations in SQL queries
○ Answering queries using social media
○ Real-time crowdsourcing as a backup plan for dialog
○ Real-time chatbot powered by crowdsourcing
15
16
17
18
19
20
21
22
Answer candidate Answer candidate Answer candidate Answer candidate > sort answers -k crowd_rating if top candidate rating > 2.5
no crowd generated candidates return top candidate True False return longest crowd generated candidate
23
Answer candidate Answer candidate Answer candidate Answer candidate > sort answers -k crowd_rating if top candidate rating > 2.5
no crowd generated candidates return top candidate True False return longest crowd generated candidate
24
Answer candidate Answer candidate Answer candidate Answer candidate
final answer
get ground-truth labels
community response, crawled 2 days after challenge
10-fold cross validation
25
26
27
Method avg-score avg-prec s@2+ s@3+ s@4+ p@2+ p@3+ p@4+
28
Method avg-score avg-prec s@2+ s@3+ s@4+ p@2+ p@3+ p@4+
29
Method avg-score avg-prec s@2+ s@3+ s@4+ p@2+ p@3+ p@4+
30
Method avg-score avg-prec s@2+ s@3+ s@4+ p@2+ p@3+ p@4+
31
Method avg-score avg-prec s@2+ s@3+ s@4+ p@2+ p@3+ p@4+
32
Method avg-score avg-prec s@2+ s@3+ s@4+ p@2+ p@3+ p@4+
no worker answers
no worker ratings
33
34
Less un-answered question thanks to worker answers Ratings help with “bad” answers
35
Many questions on Yahoo! Answers are unanswered Community experts provide an “excellent” answer more often than CRQA
36
Is it bad not wanting to visit your family? It’s nt bad. Just be honest with them. They may be upset but they should understand Chamomile tea should help
37
Less helpful More helpful Arts & Humanities Pets Home & Garden Travel One of the hardest for automatic systems Health ...
38
39
40
41
42
43
44
45
46