
NTT’s Question Answering System for NTCIR-6 QAC-4

Ryuichiro Higashinaka and Hideki Isozaki
NTT Communication Science Laboratories, NTT Corporation
2-4, Hikaridai, Seika-cho, Kyoto 619-0237, Japan
{rh,isozaki}@cslab.kecl.ntt.co.jp

Abstract

The NTCIR-6 QAC-4 organizers announced that there would be no restriction (such as factoid) on QAC-4 questions, but that they planned to include many ‘definition’ questions and ‘why’ questions. We therefore focused on these two question types. For ‘definition’ questions, we used a simple pattern-based approach. For ‘why’ questions, hand-crafted rules were used in previous work for answer candidate extraction [5]. However, such rules depend greatly on developers’ intuition and are costly to make. We adopt a supervised machine learning approach instead: we collected causal expressions from the EDR corpus and trained a causal expression classifier that integrates lexical, syntactic, and semantic features. The experimental results show that our system is effective for ‘why’ and ‘definition’ questions.

1 Introduction

Our QAC-4 system NCQAW (NTT CS Labs’ Question Answering System for Why Questions) is based on SAIQA-QAC2, our factoid question answering system [2]. Although SAIQA-QAC2 can answer some ‘definition’ questions and ‘why’ questions by using ad hoc rules, its performance for these question types has been poor. We modified the answer extraction module and the answer evaluation module for these question types to improve the performance.

In Sections 2 and 3, we describe the answer extraction and evaluation modules for ‘definition’ and ‘why’ questions in NCQAW. After briefly describing how we deal with ‘how’ questions in Section 4, Section 5 presents the results of our system for the QAC-4 formal run. Section 6 analyzes errors made by our system, and Section 7 summarizes and mentions future work.

2 ‘Definition’ questions

2.1 Answer Candidate Extraction

We use a simple pattern-based approach. Given a phrase X, the system generates typical definition patterns for X in a manner similar to Joho et al. [3]. For instance, ‘Y such as X’ and ‘Y (X)’ are such patterns. When one of these patterns matches a sentence, Y becomes a candidate definition of X. Although SAIQA-QAC2 used some of these patterns, it simply considered noun phrases as Y. Therefore, the extracted Y was sometimes too short to be informative as a definition. To solve this problem, we focus on the dependency structure of the patterns and extend them to match modifiers of all words expressed in the pattern. For example, when X is ‘cats’, the pattern ‘Y such as cats’ matches ‘pet animals such as cats’ with Y = ‘pet animals’.

To allow this matching, we first fill X of the patterns with the definition target, e.g., ‘cats’. Then, we create dependency trees for them using CaboCha.1 Finally, we search for these tree patterns through documents by using a tree-based search program, tgrep2,2 to obtain the matching trees. Since modifiers are allowed to be included in the matched results, Y can be long, overcoming the shortcoming of SAIQA-QAC2. The current system has 13 patterns, including one that simply regards any modifier of X as Y, which principally looks for rentai (adnominal modification) or renyou (adverbial modification) clauses of X.
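As an illustration only, the surface form of this pattern matching can be sketched as follows. This is a deliberately simplified sketch that applies two of the patterns as regular expressions over English strings; the actual system matches CaboCha dependency trees with tgrep2, and the function name and pattern set here are assumptions, not the paper’s implementation.

```python
import re

def definition_candidates(target, sentences):
    """Return candidate definitions Y for the target phrase X (simplified)."""
    patterns = [
        # 'Y such as X': the phrase before 'such as <target>' becomes Y
        re.compile(r'(?P<Y>[\w\s]+?)\s+such as\s+' + re.escape(target)),
        # 'Y (X)': the phrase before the parenthesized target becomes Y
        re.compile(r'(?P<Y>[\w\s]+?)\s*\(' + re.escape(target) + r'\)'),
    ]
    candidates = []
    for sent in sentences:
        for pat in patterns:
            m = pat.search(sent)
            if m:
                candidates.append(m.group('Y').strip())
    return candidates

# e.g. definition_candidates('cats', ['Pet animals such as cats are popular.'])
# yields ['Pet animals']
```

Because the captured group greedily extends leftward, modifiers of Y are kept, mirroring the paper’s point that matching whole modifier structures yields more informative definitions than bare noun phrases.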

2.2 Answer Evaluation

We evaluate each candidate C by the sum of the scores of the content words in C. That is,

candscore_def(C) = Σ_{w ∈ CW(C)} wordscore_def(w)

where CW(C) is the set of content words (verbs, nouns, and adjectives) in C. These candidates share many words that are useful to define the specified phrase X. It is reasonable to expect that a content word shared by many candidates indicates a better definition than another word shared by only a few candidates. Therefore, we define the

1 http://chasen.org/~taku/software/cabocha/
2 http://tedlab.mit.edu/~dr/TGrep2/index.html


word score by the log of the count (term frequency without any normalization) of the word w in the set of all candidates {C_i} found by tgrep2:

wordscore_def(w) = log(tf(w; {C_i}))
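Putting the two formulas together, the candidate scoring can be sketched as follows. The content words CW(C) of each candidate are assumed to be precomputed (in the actual system they come from morphological analysis); the function name is illustrative.

```python
import math
from collections import Counter

def candidate_scores(candidates_cw):
    """Score definition candidates by summed log term frequency.

    candidates_cw: list of lists, each the content words CW(C)
    (verbs, nouns, adjectives) of one candidate.
    """
    # tf(w; {C_i}): count of w over the whole candidate set
    tf = Counter(w for cw in candidates_cw for w in cw)
    # candscore_def(C) = sum over w in CW(C) of wordscore_def(w) = log(tf)
    return [sum(math.log(tf[w]) for w in cw) for cw in candidates_cw]
```

Words shared by many candidates contribute more, so candidates built from widely repeated defining vocabulary are ranked above ones using words that appear only once.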

3 ‘Why’ questions

3.1 Answer Candidate Extraction

Since we thought it would be difficult to find exact answers for ‘why’ questions, we used sentences as the unit of answer candidates (C), making the task a sentence extraction problem. We regard sentences having at least one of the query words as answer candidates, since evaluating all sentences in the top-ranked documents would be computationally expensive in the answer evaluation stage, as we explain later.
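The filtering step above can be sketched as follows. Whitespace tokenization is a simplifying assumption here; the actual system operates on Japanese text segmented by a morphological analyzer.

```python
def why_candidates(query_words, sentences):
    """Keep only sentences containing at least one query word."""
    qs = {w.lower() for w in query_words}
    # A sentence survives if its token set intersects the query words
    return [s for s in sentences
            if qs & {t.strip('.,?!').lower() for t in s.split()}]
```

Sentences with no query words are dropped before the (more expensive) causal-expression classification described next.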

3.2 Answer Evaluation

We evaluate each candidate by using the following two scores: (a) the certainty of the existence of a causal expression in the candidate, and (b) the similarity of the question and the candidate. The final score is determined based on these scores. Suppose the system extracted three answer candidates for a question.

Q. Why did John steal the cake?
C1: John was hungry.
C2: John did it because he was hungry.
C3: John stole the cake because he was hungry.

Then, C2 is preferred to C1 because C2 has a causal expression ‘because he was hungry’ whereas C1 does not. C3 is preferred to C2 because C3 shares more words with the question than C2.

Causal expression  For (a), although hand-crafted rules were used in previous work for answer candidate extraction [5], such rules depend greatly on developers’ intuition and are costly to make. Therefore, we adopt a supervised machine learning approach. First, we build a classifier that determines whether a sentence contains a causal expression. For this, we use the EDR corpus3 to obtain the training samples. Sentences in the corpus have annotations for causal expressions by ‘cause’ tags. Sentences with causal expressions are considered positive examples, while those without causal expressions are considered negative examples. We first analyzed each sentence in the EDR corpus by CaboCha for word segmentation and dependency

3 http://www2.nict.go.jp/r/r312/EDR/index.html

and added word sense tags by using Nihongo Goi-Taikei [1]. Then, we built for each sentence a tree that integrates these lexical, semantic, and syntactic features. We employed BACT, a tree-based boosting algorithm [4], to train the classifier. Some ‘why’ questions might request a purpose (e.g., why do you want to ...?). Although the EDR corpus has ‘purpose’ tags, we did not use them because not all purposes can be answers to ‘why’ questions. We use BACT’s output score (causal_why(C)) as the certainty of the existence of a causal expression in the sentence.
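The labeling of training samples described above can be sketched as follows. The `(sentence, tags)` pair format is an assumed simplified stand-in for the EDR corpus format; the actual features fed to BACT are trees built from CaboCha parses enriched with Goi-Taikei word senses, which this sketch does not reproduce.

```python
def make_training_samples(annotated_sentences):
    """Label sentences for the causal-expression classifier.

    annotated_sentences: (sentence, tags) pairs, where tags is the set
    of annotation tags on the sentence. Sentences carrying a 'cause'
    tag become positive examples (1); all others negative (0).
    """
    return [(s, 1 if 'cause' in tags else 0)
            for s, tags in annotated_sentences]
```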

Similarity  For (b), we used a simple idf (inverse document frequency) score, given by the log of the inverse ratio of the number of documents that contain the specified query word w. That is,

sim_why(S) = Σ_{w ∈ Q(S)} idf(w)

where Q(S) is the set of query words in the sentence S. Since the number of sentences is generally large, sentence classification by BACT can be computationally expensive; hence the removal of answer candidates without any query words during answer candidate extraction. Another justification for the removal is that sentences with no query words have similarity scores of zero, meaning they are completely irrelevant to the question. We normalize the above sentence score by a sigmoid function:

sim′_why(C) = 1 / (1 + exp(−sim_why(C)))

The final answer ranking is determined by a heuristic function combining the two scores:

candscore_why(C) = causal_why(C) + sim′_why(C)
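The combination of the two scores can be sketched as follows. Here `causal_score` stands in for BACT’s output causal_why(C) (training the actual classifier is outside this sketch), and `idf` is an assumed precomputed word-to-idf table; the function name is illustrative.

```python
import math

def why_score(causal_score, sent_words, query_words, idf):
    """candscore_why(C) = causal_why(C) + sim'_why(C)."""
    # sim_why(C): sum of idf over query words that occur in the sentence
    sim = sum(idf.get(w, 0.0) for w in set(sent_words) & set(query_words))
    # sigmoid normalization: sim'_why(C) = 1 / (1 + exp(-sim_why(C)))
    sim_norm = 1.0 / (1.0 + math.exp(-sim))
    return causal_score + sim_norm
```

Note that a candidate sharing no query words would still get sim′ = 0.5 from the sigmoid, which is another reason such sentences are filtered out before evaluation rather than merely down-ranked.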

4 ‘How’ questions

We also applied the supervised machine learning method to ‘how’ (procedural) questions. Although it was not clear which tag in the EDR corpus corresponds to procedures, we used ‘condition’ tags because we found, through mining the corpus, that some procedural expressions are likely to appear just after conditional expressions. For example, in the sentence ‘If the Olympic flame goes out, it gets re-ignited.’, ‘If the Olympic flame goes out’ indicates a condition and ‘it gets re-ignited’ a procedure. The answer candidate extraction and evaluation processes are exactly the same as those for ‘why’ questions, except for the change of the tag used in obtaining training samples for BACT.


5 Results

The QAC-4 question set consists of 100 questions of varying question types. To answer the questions, the system first identified their question types by hand-crafted rules. Then, the 20 most related documents from the Mainichi Newspaper (1998–2001) were retrieved for each question using Decayed IDF (DIDF) [2]. The documents were sent to the answer extraction and answer evaluation modules depending on the question type. For ‘definition’, ‘why’, and ‘how’ questions, the modules described in the previous sections were used. NTCQAW1 in Table 1 shows the organizers’ judgments for our system’s outputs.

As a baseline, we submitted the output of another system, NTCQAW2, that uses SAIQA’s ad hoc rules for ‘why’ and ‘how’ questions. We did not change the processing of ‘definition’ and ‘other’ question types. The ad hoc rules extract answers by using general key phrases such as “riyuu (reason)”, “mokuteki (purpose)”, and “niyori (because of)”. We also used relative quantitative expressions such as “kaizen (improvement)”, “koutou (soar)”, and “geraku (fall)” because they sometimes indicate reasons. For NTCQAW2, we applied SAIQA’s distance-based candidate scoring to the extracted answer candidates.

Since we submitted the top five answers for each question, we received five scores for each question. Table 1 summarizes the results using only the best score among the five, where A–D stand for the following:

  • A: The system output is a correct answer.
  • B: The system output has a correct answer but also contains other statements.
  • C: The system output has a part of a correct answer.
  • D: The system output is an incorrect answer.

It is difficult to decide yet how good the performance was, because the evaluation of non-factoid question answering systems is still an open question. One noticeable difficulty we encountered was the classification of question types, which resulted in the large number of questions left unanswered. Note that the question types in the table are derived by our manual classification.

We focused mainly on ‘definition’ and ‘why’ questions, and we believe NTCQAW1’s results for these question types are relatively good. For ‘definition’ questions, the NTCQAW1 system returned correct answers for 9 out of 24 questions (38%). For ‘why’ questions, the NTCQAW1 system returned correct answers for 17 out of 38 questions (45%). On the other hand, NTCQAW2 answered only 9 ‘why’ questions.

Table 1. Performance of NCQAW

NTCQAW1
question type   all    A    B    C    D   no output
definition       24    9    0    0   11    4
other            12    1    0    0    2    9
why              38   11    6    0   18    3
how              26    1    1    1    7   16
total           100   22    7    1   38   32

NTCQAW2
why              38    5    3    1   25    4
how              26    1    0    1    8   16
total           100   16    3    2   46   33

6 Error Analysis

In this section, we analyze errors of the NTCQAW1 system.

6.1 Errors of the question type analysis

The system extracts answer candidates for each question type. If the question analyzer does not produce any question type, the system does not output any candidates. Because of the low coverage of our rule-based question analyzer, the system failed to assign any question type to 30 questions. Most of the failures were caused by expressions such as donna, douiu, and donoyouna, corresponding to ‘what kind of’ in English. As a result, the system could answer at most 70 questions. There is clearly a need to improve our question analyzer.

6.2 Errors of ‘definition’ questions

The following two cases show typical erroneous candidates for ‘definition’ questions.

QAC4-00036-00: What is the wiretapping law?
Typical erroneous candidate: laws of ..., national flag and national anthem law, wiretapping law

QAC4-00054-00: What does the Court of Arbitration for Sport do?
Typical erroneous candidate: The headquarters of the Court of Arbitration for Sport is located in Lausanne.

In the first case, an extraction pattern for ‘definition’ matched a list expression. The system should accurately distinguish adnominal modifications from lists. In the second case, the candidate is indeed a definition statement, but it does not answer the question; i.e., the candidate describes the location of the Court but does not state its role. This leads us to believe that ‘definition’ is not a single question type but a set of question types.


6.3 Errors of ‘why’ questions

QAC4-00061-00: What is the reason for the success of the prediction of the Mt. Usu eruption?
Typical erroneous candidate: Since the activity of Mt. Usu is expected to continue for five or ten years, we should take care of refugees.

In this case, the candidate has a causal expression and query words, but the candidate does not answer the question. Instead of our similarity function based on simple bag-of-words statistics, there is a need to take richer information, such as semantic roles, into account in order to achieve higher accuracy in answer candidate evaluation.

6.4 Errors of ‘how’ questions

For ‘how’ questions, our results were not satisfactory. There are two reasons for this. First, as we have already mentioned in Section 6.1, we did not prepare enough patterns for ‘how’ questions for question analysis. Therefore, the system frequently failed to determine the question type. In fact, the system’s question type was correct for only 4 of the 26 ‘how’ questions. Second, we could not determine appropriate EDR tags for ‘how’ questions. We are currently investigating how other tags of the EDR corpus may be useful for ‘how’ questions and are also considering the possibility of exploiting other resources.

7 Summary

We presented an overview of NTT’s question answering system for QAC-4. We implemented a beyond-factoid question answering system called NCQAW by improving SAIQA-QAC2’s answer extraction module and answer evaluation module for ‘definition’, ‘why’, and ‘how’ questions. We used a simple pattern-based approach for ‘definition’ questions, with an extension to include modifiers. For ‘why’ and ‘how’ questions, we adopted a supervised machine learning approach using the EDR corpus to obtain training examples. Consequently, we achieved good performance for ‘definition’ and ‘why’ questions.

References

[1] S. Ikehara, M. Miyazaki, S. Shirai, A. Yokoo, H. Nakaiwa, K. Ogura, Y. Ooyama, and Y. Hayashi. Goi-Taikei – A Japanese Lexicon. Iwanami Shoten, 1997.
[2] H. Isozaki. An analysis of a high-performance Japanese question answering system. ACM Transactions on Asian Language Information Processing (TALIP), 4(3):263–279, 2005.
[3] H. Joho, Y. K. Liu, and M. Sanderson. Large scale testing of a descriptive phrase finder. In HLT-2001, pages 645–649, 2001.
[4] T. Kudo and Y. Matsumoto. A boosting algorithm for classification of semi-structured text. In Proc. EMNLP, pages 301–308, 2004.
[5] K. Morooka and J. Fukumoto. Answer extraction method for why-type question answering system. In IEICE Technical Report, volume 105, pages 7–12, 2006. (in Japanese).