Baseline QA system Deliverable 2 C.J Hsu Ryan Bielby O utline - PowerPoint PPT Presentation

Baseline QA system Deliverable 2 C.J Hsu Ryan Bielby

O utline • System Architecture  Frontend System (Answer generator)  Backend System (IR Engine) • Evaluations • Discussions • References

System Architecture • The frontend module (n-gram answer generator) tries to mine the correct answers from the web. • The backend module (IR Engine) tries to find the supporting documents for answers mining from frontend.

The Architecture of Frontend System • Generate two types of query “baseline” and “inexact” queries for each question. • Send these queries to the Teoma (Ask.com) and fetch 100 snippets for each query. • Generate the n-grams (unigram to tetragram) from snippets by“Voting”, ”Filtering”, ”Combining”, ”Scoring” and “Reranking” (Lin 2007).

The Architecture of Frontend System Filtering • Three kinds of filters mentioned in Lin 2007, but we only implement “type-neutral” and incomplete “type-specific” filters. • “type-neutral” filter removes n-grams starting and ending in stop words. • “type-specific” filter removes the n-grams which doesn’t meet the question type. We deal with “Person”, “Time/Date”, “Number” and “Location”

The Architecture of Frontend System Reranking • According to (Lin 2007), every n-gram must be contained in at least 2 snippets, What does “contain” mean here ? A case study will be proposed in Discussion section. • Since our filtering mechanism is not strong enough, we set this threshold to 5 to avoid including some ridiculous n-grams.

The Architecture of Backend System Indexing • We implement a plug-in to help Lucene index TREC document format. • The AQUAINT corpus is indexed by Snowball Analyzer (stop words removal, stemming…). • The indexing process is an independent process from our main pipeline.

The Architecture of Backend System Document Retrieval/Answer Projection • Formulate the boolean query by the stemmed terms from original question and n-gram answer candidate. • The retrieved document must contains the terms from n-gram but may not contain the terms from original question. • The retrieved documents are discarded if their scores are lower than a threshold. • Eg: Q: Where is Mozart born ? Ans: Salzburg. The Lucene Query is “mozart” O R “born” AND “salzburg”

The Architecture of Backend System Passage Retrieval/Answer Projection • We Implement three different passage retrieval algorithms. MITRE V1, MITRE V2, IBM. Please refer to (Tellex et al. 2003). • Intuitively, the term from n-gram should be more important than the term from original question, how do we integrate this concept into the scoring function ? • The document containing the highest score passage is the supporting document.

Evaluations • If the quality of the n-gram generation is terrible, The whole system is doomed to fail even if you develop a brilliant document/passage retrieval algorithm. Vise versa.

Evaluations Frontend System Frontend System Frontend System (type neutral and specific filters) (type neutral filter) strict 0.2361 0.1761 lenient 0.2368 0.1770

Evaluations Whole System with Different Answer Projection Algorithms frontend only whole system Whole system Whole system MITRE V.1 MITRE V.2 IBM strict 0.2361 0.1031 0.0989 0.0981 lenient 0.2368 0.2383 0.2389 0.2383

Discussion Reranking Mechanism of Frontend • O ur system fail on a easy question. “152.1: Where is Mozart born?”. The correct answer is “Salzburg.*Austria”. O ur top one answer is “Salzburg”. • Since only one snippet contain the bigram “Salzburg Austria”. So it got filtered out. However, a lot of snippets contain “Salzburg , Austria”. Exact match is the best solution ?

Discussion Select the best Parameters • There are a lot of parameters in our system, like the threshold to filter out documents, the weight for term when doing passage retrieval and the reranking threshold. • For now, we decide to give Condor a break. We will not perform parameter selection until the we complete all the remaining functions.

References • An Exploration of the Principles Underlying Redundancy-Based Factoid Question. Lin 2007. • Query Formulation for Answer Projection. Gilad Mishne and Marrten de Rijke 2005. • Quantitative Evaluation of Passage Retrieval Algorithms for Question Answering. Stefaine Tellex et al. 2003.

Baseline QA system Deliverable 2 C.J Hsu Ryan Bielby O utline - PowerPoint PPT Presentation

Baseline QA system Deliverable 2 C.J Hsu Ryan Bielby O utline System Architecture Frontend System (Answer generator) Backend System (IR Engine) Evaluations Discussions References System Architecture The frontend

White Paper Runs Name Description baseline2018a Baseline Project-official baseline (official

Technical Baseline Management Technical Baseline Management September 30, 2003 Pat Hascall LAT

Deliverable N: 6.14 Name Deliverable: Project Presentation Covering period:

Deliverable 6.1 Mid-term dissemination and annual presentation and report Document type Deliverable

Deliverable Factsheet Date: 30 September 2014 Deliverable No. D8.4 Working Package WP8 Partner

Regional Educational Laboratories in Appalachia: Putting Research into Action Appalachian Higher

Hillside Marine Baseline Overview AUSTRALIAS NEXT GREAT COPPER PROJECT HILLSIDE: SOUTH

BNL Neutrino Long Baseline Neutrino Initiative N. Simos, BNL NWG Homestake Baseline = 2540 Km

Meeting Staff Baseline Testing: How to Prepare for Workforce Disruptions May 20, 2020 Preparing

The Baseline The Baseline Personal Process Personal Process AU INSY 560, Singapore 1997, Dan

Automatic Summarization Project - Deliverable 3 - Anca Burducea Joe Mulvey Nate Perkins May

D:A-3.1 Project presentation and web portal Deliverable Number: D13.1 Work Package: WP 13 Version:

DELIVERABLE REPORT Grant Agreement number: 688303 Project acronym: LUCA Project title: Laser and

WP3 EX-POST Case studies Comparative Analysis Report Deliverable no.: 3.2 Comparative Analysis

DELIVERABLE GROUP 1 House Legislative Oversight Review of S ecretary of S tates Office 1

DELIVERABLE GROUP 3 House Legislative Oversight Review of S ecretary of S tates Office 1

INF 111 / CSE 121: Software Tools and Methods Lecture Notes for Fall Quarter, 2007 Michele

Software Architecture CRISP - Inter-university Research Centre on public Services Ettore Colombo

CSE543 - Computer and Network Security Module: Web Security Professor Trent Jaeger 1 CSE543 -

About Cardea Our Mission: Improve organizations' abilities to deliver Revenue Cycle Management

FD DAQ General and Backend & Computing Requirements Kurt Biery, Giles Barr, Josh Klein FD

Be Fast, Cheap and in Control with SwitchKV Xiaozhou Li Raghav Sethi Michael Kaminsky David G.

SPaT Challenge Webinar Series Webinar #1: Initial SPaT Challenge Activities 2:00 3:30 PM

Define Once, Evaluate Anywhere Building Repeatable and Correct Features at Stripe Kelley Rivoire

Baseline QA system Deliverable 2 C.J Hsu Ryan Bielby O utline - PowerPoint PPT Presentation

Baseline QA system Deliverable 2 C.J Hsu Ryan Bielby O utline System Architecture Frontend System (Answer generator) Backend System (IR Engine) Evaluations Discussions References System Architecture The frontend

White Paper Runs Name Description baseline2018a Baseline Project-official baseline (official

Technical Baseline Management Technical Baseline Management September 30, 2003 Pat Hascall LAT

Deliverable N: 6.14 Name Deliverable: Project Presentation Covering period:

Deliverable 6.1 Mid-term dissemination and annual presentation and report Document type Deliverable

Deliverable Factsheet Date: 30 September 2014 Deliverable No. D8.4 Working Package WP8 Partner

Regional Educational Laboratories in Appalachia: Putting Research into Action Appalachian Higher

Hillside Marine Baseline Overview AUSTRALIAS NEXT GREAT COPPER PROJECT HILLSIDE: SOUTH

BNL Neutrino Long Baseline Neutrino Initiative N. Simos, BNL NWG Homestake Baseline = 2540 Km

Meeting Staff Baseline Testing: How to Prepare for Workforce Disruptions May 20, 2020 Preparing

The Baseline The Baseline Personal Process Personal Process AU INSY 560, Singapore 1997, Dan

Automatic Summarization Project - Deliverable 3 - Anca Burducea Joe Mulvey Nate Perkins May

D:A-3.1 Project presentation and web portal Deliverable Number: D13.1 Work Package: WP 13 Version:

DELIVERABLE REPORT Grant Agreement number: 688303 Project acronym: LUCA Project title: Laser and

WP3 EX-POST Case studies Comparative Analysis Report Deliverable no.: 3.2 Comparative Analysis

DELIVERABLE GROUP 1 House Legislative Oversight Review of S ecretary of S tates Office 1

DELIVERABLE GROUP 3 House Legislative Oversight Review of S ecretary of S tates Office 1

INF 111 / CSE 121: Software Tools and Methods Lecture Notes for Fall Quarter, 2007 Michele

Software Architecture CRISP - Inter-university Research Centre on public Services Ettore Colombo

CSE543 - Computer and Network Security Module: Web Security Professor Trent Jaeger 1 CSE543 -

About Cardea Our Mission: Improve organizations' abilities to deliver Revenue Cycle Management

FD DAQ General and Backend &amp; Computing Requirements Kurt Biery, Giles Barr, Josh Klein FD

Be Fast, Cheap and in Control with SwitchKV Xiaozhou Li Raghav Sethi Michael Kaminsky David G.

SPaT Challenge Webinar Series Webinar #1: Initial SPaT Challenge Activities 2:00 3:30 PM

Define Once, Evaluate Anywhere Building Repeatable and Correct Features at Stripe Kelley Rivoire

FD DAQ General and Backend & Computing Requirements Kurt Biery, Giles Barr, Josh Klein FD