NiCT/ATR in NTCIR-7 CCLQA Track Youzheng WU, Wenliang CHEN, Hideki - PowerPoint PPT Presentation

NiCT/ATR in NTCIR-7 CCLQA Track Youzheng WU, Wenliang CHEN, Hideki KASHIOKA NiCT/ATR, Japan

NTCIR CCLQA � Complex Cross-Lingual Question Answering. List/Event questions List major events in formation of European Union. R l i Relationship questions hi i Does Iraq possess uranium, and if so, where did it come from? Biography questions: Who is Howard Dean? Wh i H d D ? Definition questions: What are stem cells? Questions in English English and getting answers from Chinese ( Simplified ( Simplified , Traditional) or Japanese corpus p , ) J p p

Related studies � Pattern-matching-based [Xu, et al. 2005] [Harabagiu, et al. 2004] [Cui, et al. 2004] et al. 2004] [Cui, et al. 2004] � Basic syntactic/semantic structures like appositives, copulas; predicates and relations. � Centroid-vector-based [Xu, et al. 2003] [Chen, et al. 2006] [Kor, et al. 2007] � Build a target profile for each question, and then compute the similarities between candidates and the target profile. � Others [Biadsy, et al. 2008] O h [Bi d l 2008] � Unsupervised classification model to Biography production using Wikipedia Wikipedia.

Centroid-vector-based Wikipedia Biography.com W WordNet dN Google Definition NewsLibrary.com y Google

Centroid-vector-based cont. � Easy to implement, and fast in speed � In essence, a type of question-side expansion. I f i id i � Hard to obtain sometimes � Wikipedia, WordNet, and Biography.com contain only 82.0%, 40.4%, and 24.6% of TREC05 questions, respectively. i l � Not always contribute positively � Wikipedia negatively impacts Biography questions [Kor, et al. 2007].

Our solution SVM-based model Centroid-vector-based model Regarding complex QA as a SVM- Regarding complex QA as a SVM as a retrieval process as a retrieval process based classification Applying sentence-side pp y g question-side expansion q p expansion Requiring no specific resources , A number of external resources except a general search engine (Google) such as Wikipedia, Biography.com, WordNet, etc. Incorporating multiple features Incorporating multiple features TF-IDF-similarity score TF IDF similarity score

SVM-based Same to centroid-vector model

Learning Evidences by Sentence-side Learning Evidences by Sentence side Expansion For each s i in S Extract 2 or 3 nouns nearest to question target from candidate Extract 2 or 3 nouns nearest to question target from candidate 1 1. s i as topic terms of s i , labeled as R . Combine topic terms R and question target to compose a web 2. query and submit it to Google. Download the top 100 Google snippets. 3. Retain those snippets { e i,i1 i,i1 , …e , …e i,ik i,ik } that contain words in 4. question target and R as Web evidences for candidate s i . end

An Example Who is Anwar Sadat Anwar Sadat Question target Question target Question target Question target ... c255 = before zoweil, late egyptian president anwar sadat , gyp p anwar sadat won the nobel prize for peace after making peace with israel in 1979 ... ... c274 = in 1970 , anwar sadat anwar sadat was elected president of egypt , succeeding the late gamal abdel nasser ... Topic terms Topic terms

An Example Bridge lexical Bridge lexical gap between candidates and profile

SVM-based model

Train-Classifier Features Description Bfull If question target occurs in the exact form. Bbegin If question target occurs at the beginning of an instance or not Bpattern If one of predefined patterns occurs or not Btime Btime If time expression occurs or not Candidates with time expression If time expression occurs or not. Candidates with time expression tend to capture important events involving target Unigram-overlap Overlap of unigrams between an instance and target profile Bigram-overlap Overlap of Bigrams between an instance and target profile TF-IDF similarity TF-IDF-based similarity between an instance and target profile Freq The number of relevant pages returned by Google SVM Classifier

Rules for the Train-Classifier � Manually generate from the abstract of Wikipedia. � Useful to Biography and Definition questions. g p y q

Select-Answer assuming that all Ideal A the features fire Answer Which topic is the nearest topic to ideal <t p1 , p 1 > p1 p 1 er er nding orde nding orde answer <t p2 , p 2 > ... j=15 if the t pj > average probability (1/n) <t pj , p j > t pj , p j j=10 else; j=10 else; descen descen ... <t pn , p n >

Experiments � Three runs for the EN-CS and CS-CS tasks � RUN-3: The Centroid-vector model ( 5 external resources � RUN-3: The Centroid-vector model ( 5 external resources 5 external resources ) 5 external resources ). � Wikipedia (0.2 million Chinese entries); � Baidu Baike (1 million Chinese entries, http://baike.baidu.com); � Google Definition (e.g., define: Nobel prize); � Google news (1000 news sources updated continuously); � Google � Google. � RUN RUN- -1: The SVM 1: The SVM- -based model based model (Google). (Google). � RUN � RUN RUN-2: The SVM RUN 2: The SVM 2: The SVM-based model 2: The SVM based model based model (5 external resources) based model (5 external resources). (5 external resources) (5 external resources). � To compare with the RUN To compare with the RUN- -1 1

Official Result � Three findings: 1. the precision is low; 2. the ranking of difficulties of answering questions; 3. Cross-lingual vs. g q ; g monolingual ( only 10% include at most 1 error ). The best The best (the second (the second is 19.30%) is 19.30%) The second The second (the best is (the best is 43.29%) 43.29%)

Comparison of the Three Runs EN-CS task The conclusion: The conclusion: RUN-1 RUN-2 RUN-3 Event E t 14 54 14.54 14 08 14.08 8 09 8.09 • The proposed SVM-based Definition 22.16 23.37 12.57 models are much better than the baseline by Biography Biography 31 58 31.58 30 27 30.27 20 77 20.77 comparing the RUN-1 and Relation 23.35 22.80 12.10 RUN-2 with RUN-3. all all 22.11 22.11 21.79 21.79 12.73 12.73 • Target profile does not play CS-CS task an important role in the proposed SVM-based Event 14.30 14.07 10.86 model by comparing the d l b h Definition 24.15 25.65 16.18 RUN-1 with RUN-2. Biography 33.76 32.53 18.06 Relation 24.29 23.76 16.50 all 23.16 22.98 15.06

Automatic Scores The best The best (the (the ( second is second is 22.90%) 22.90%) The second The second The second The second (the best is (the best is 37.75%) 37.75%)

IR4QA + CCLQA � Three retrieval results are CMUJAV1-EN-CS- 01 01 -T- limit50, CMUJAV1-EN-CS- 02 limit50, CMUJAV1 EN CS 02 02 -T-limit50, and MITEL-EN- 02 T limit50, and MITEL EN CS- 01 01 -T-limit50. > > >

IR4QA + CCLQA Table 8. The Mean-AP scores of the CJAV1 and the Table 8 The Mean AP scores of the CJAV1 and the MITEL over types of questions CJAV1 MITEL The conclusion: The conclusion: Event 19.53 < 26.57 • The impacts of the IR4QA Definition 48.65 > 37.10 system on the CCLQA Biography 46.65 > 45.15 system are roughly consistent. Relation 32.00 < 41.37 T bl 9 Th F Table 9. The F-scores of the RUN-1 based on the CJAV1 f h RUN 1 b d h CJAV1 • However, the extent of the and MITEL over types of questions impacts are not the same. CJAV1 MITEL Event 17.74 < 18.92 Definition 23.08 > 17.23 Biography 38.37 > 37.87 Relation 33.40 < 36.01

Discussion � Hard to directly evaluate the quality of web evidences learned by sentence-side expansion learned by sentence side expansion � Has the same underlying logic as that of the ROUGE metric [Lin, et al. 2003] and the nugget-pyramid metric [Lin, et al. 2006]: use use unigram overlap to match semantically unigram overlap to match semantically � Speed problem � Have to train an SVM-classifier for each question.

Summary � Propose an SVM-based classification model for NTCIR complex QA system: complex QA system: � Each candidate represents a topic � Learn training data for each topic by sentence-side expansion earn training data for each topic by sentence side expansion � Assume an ideal answer, and classify this ideal answer into topics to find real answers � The SVM-based model achieves competitive performances, and relies on no specific external resources other than Google.

Thanks!

NiCT/ATR in NTCIR-7 CCLQA Track Youzheng WU, Wenliang CHEN, Hideki - PowerPoint PPT Presentation

NiCT/ATR in NTCIR-7 CCLQA Track Youzheng WU, Wenliang CHEN, Hideki KASHIOKA NiCT/ATR, Japan NTCIR CCLQA Complex Cross-Lingual Question Answering. List/Event questions List major events in formation of European Union. R l i Relationship

NTCIR-9 Kick-Off Event ff 2010.10.05 : 13:30- English Session: 15:30-

Neuchatel at NTCIR-4 From CLEF to NTCIR Jacques Savoy University of Neuchatel, Switzerland

I t Introduction to NTCIR-7 d ti t NTCIR 7 N Noriko Kando k K d National Institute of

Overview of 2010 ATR Grant Criteria and Target Populations ATR Ohana Project Substance Abuse and

NEW PRODUCT LAUNCH: ATR-500 ADJUSTABLE TRAY ROLLER Part Number: ATR-500 FASTER Installs faster

Overview of the Sixth NTCIR Workshop Noriko Kando National Institute of Informatics

NTCIR 2014 Slides - TUW-IMP at the NTCIR-11 Math-2 Presentation February 2015 CITATIONS READS

KSU Teams QA System for World History Exams at the NTCIR-13 QA Lab-3 Task Tasuku Kimura, Ryo

Kyoto-U: Syntactical EBMT System for NTCIR 7 Patent System for NTCIR-7 Patent Translation Task

Status on positron fraction Multi-track event CC fitted Multi-track event 1 track Multi-Track

EMOTION IN SPEECH Nick Campbell ATR Human Information Science Labs Keihanna Science City, Kyoto,

Rachel ORourke, MA, ATR Shannon non Anasta stasia siadi dis s PNP, PMH MHS Kare ren n

Kobe University, NICT, and University of Siegen at TRECVID 2016 AVS Task Yasuyuki Matsumoto,

NICT Use Cases and Requirements for New Models of Human Language to Support Mobile Conversational

On Cross-Domain Data Access for Cyber-Physical-Social S ystems Koji Zettsu zettsu@nict.go.jp

Constructing English Reading Courseware Masao Utiyama (NICT) Midori Tanimura (Kinki Univ.)

October 16 - Quest for Civic Pride in a Peoples Park San Francisco Chronicle - July 20, 1895

COMPLEX ORGANIZATIONS Prepared for: UW Foster School of Business Executive MBA Program October

Welcome Lakehurst Small Business Roundtable (LKE-SBR) July General Meeting Topic: Winning

What is a biography? Biographies A biography gives facts about a persons life. It is not

1 Nex s t s Previou #TALK Presentation Discussion 45min 15min 2 Subject: CTF

Mtrologie et expertise de la performance sportive (slides in English) Conference Paper May

Motion capture A technology of recording the movement of real things -- usually humans

Showcasing Distinguished Texas A&M researchers in Material Science Faculty Research Areas

NiCT/ATR in NTCIR-7 CCLQA Track Youzheng WU, Wenliang CHEN, Hideki - PowerPoint PPT Presentation

NiCT/ATR in NTCIR-7 CCLQA Track Youzheng WU, Wenliang CHEN, Hideki KASHIOKA NiCT/ATR, Japan NTCIR CCLQA Complex Cross-Lingual Question Answering. List/Event questions List major events in formation of European Union. R l i Relationship

NTCIR-9 Kick-Off Event ff 2010.10.05 : 13:30- English Session: 15:30-

Neuchatel at NTCIR-4 From CLEF to NTCIR Jacques Savoy University of Neuchatel, Switzerland

I t Introduction to NTCIR-7 d ti t NTCIR 7 N Noriko Kando k K d National Institute of

Overview of 2010 ATR Grant Criteria and Target Populations ATR Ohana Project Substance Abuse and

NEW PRODUCT LAUNCH: ATR-500 ADJUSTABLE TRAY ROLLER Part Number: ATR-500 FASTER Installs faster

Overview of the Sixth NTCIR Workshop Noriko Kando National Institute of Informatics

NTCIR 2014 Slides - TUW-IMP at the NTCIR-11 Math-2 Presentation February 2015 CITATIONS READS

KSU Teams QA System for World History Exams at the NTCIR-13 QA Lab-3 Task Tasuku Kimura, Ryo

Kyoto-U: Syntactical EBMT System for NTCIR 7 Patent System for NTCIR-7 Patent Translation Task

Status on positron fraction Multi-track event CC fitted Multi-track event 1 track Multi-Track

EMOTION IN SPEECH Nick Campbell ATR Human Information Science Labs Keihanna Science City, Kyoto,

Rachel ORourke, MA, ATR Shannon non Anasta stasia siadi dis s PNP, PMH MHS Kare ren n

Kobe University, NICT, and University of Siegen at TRECVID 2016 AVS Task Yasuyuki Matsumoto,

NICT Use Cases and Requirements for New Models of Human Language to Support Mobile Conversational

On Cross-Domain Data Access for Cyber-Physical-Social S ystems Koji Zettsu zettsu@nict.go.jp

Constructing English Reading Courseware Masao Utiyama (NICT) Midori Tanimura (Kinki Univ.)

October 16 - Quest for Civic Pride in a Peoples Park San Francisco Chronicle - July 20, 1895

COMPLEX ORGANIZATIONS Prepared for: UW Foster School of Business Executive MBA Program October

Welcome Lakehurst Small Business Roundtable (LKE-SBR) July General Meeting Topic: Winning

What is a biography? Biographies A biography gives facts about a persons life. It is not

1 Nex s t s Previou #TALK Presentation Discussion 45min 15min 2 Subject: CTF

Mtrologie et expertise de la performance sportive (slides in English) Conference Paper May

Motion capture A technology of recording the movement of real things -- usually humans

Showcasing Distinguished Texas A&amp;M researchers in Material Science Faculty Research Areas

Showcasing Distinguished Texas A&M researchers in Material Science Faculty Research Areas