(Some) Research Trends in Question Answering (Q (QA)
Manoj Kumar Chinnakotla Senior Applied Scientist Artificial Intelligence and Research (AI&R) Microsoft, India Advanced Topics in AI (Spring 2017) IIT Delhi
(Some) Research Trends in Question Answering (Q (QA) Advanced - - PowerPoint PPT Presentation
(Some) Research Trends in Question Answering (Q (QA) Advanced Topics in AI (Spring 2017) IIT Delhi Manoj Kumar Chinnakotla Senior Applied Scientist Artificial Intelligence and Research (AI&R) Microsoft, India Lecture Obje jective and
Manoj Kumar Chinnakotla Senior Applied Scientist Artificial Intelligence and Research (AI&R) Microsoft, India Advanced Topics in AI (Spring 2017) IIT Delhi
above stream
Holy Roman Empire
Train Test History Questions: 3,761 Sentences: 14,217 Questions: 699 Sentences: 2,768 Literature Questions: 4,777 Sentences: 17,972 Questions: 908 Sentences: 3,577 Baselines
their model.
"most answers are themselves words (features) in other questions (e.g., a question on World War II might mention the Battle of the Bulge and vice versa). Thus, word vectors associated with such answers can be trained in the same vector space as question text enabling us to model relationships between answers instead of assuming incorrectly that all answers are independent.“
"We tried transforming Wikipedia sentences into quiz bowl sentences by replacing answer mentions with appropriate descriptors (e.g., \Joseph Heller" with \this author"), but the resulting sentences suffered from a variety of grammatical issues and did not help the final result."
powerful meaning representations. In real world, how will we get those?
to learn such rich representations. This is an unrealistic assumption.
pairs that *fit* their needs (no messy data)
"451 history answers and 595 literature answers that occur on average twelve times in the corpus".
Scoring function over all KB triples
automatically using the following 16 rules
questions
relation can generate “where did e r?” pattern question
Triple pairs: 16 X 14M
shared embedding space
triples based on it
1 𝑙)
1 such that ti 1 ≠ ti
1
Ranking Loss Ranking Loss
800K Vocabulary 3.5M Entities (2 embeddings per entity) 250M Examples (Rules + Paraphrases)
64 dimensions 64 dimensions 64 dimensions 64 dimensions
many correct answers still not ranked at the top
the query-triple similarity metric
human-judged
them
any other dataset
the term equivalences
well as Q, A pairs was interesting
in boosting the accuracy which was introduced towards the end as a rescue step!
performance of the initial model
stead of the original idea itself!
framework?
answers corresponding to their quality.
hand-crafted feature engineering or b) only deep learning based systems
Que Question
Des escription: Can I obtain Driving License my QID is written Employee, I saw list in gulf times but there isn't mentioned EMPLOYEE QID Profession. Ans nswer1: the word employee is a general term that refers to all the staff in your company either the manager, secretary up to the lowest position or whatever positions they have. you are all considered employees of your
(Pot
Ans nswer2: your QID should specify what is the actual profession you have. I think for me, your chances to have a drivers license is low. (Go (Good
Ans Answer3: dear Richard, his asking if he can obtain. means he have the drivers license. (B (Bad ad) Ans Answer4: Slim chance...... (Go (Good
a fully connected network (DFFN-CNN)
crafted features using a fully connected network (DFFN-BLNA)
various resources
2015 and Sem Eval 2016 tasks
Sem SemEval 2015 2015 Sem SemEval 2016 2016 Mod
F1 F1 Ac Accu. Mod
MAP AP F1 F1 Ac Accu. DFFN DFFN-BLNA 62.01* 75.20* DFFN-BLNA 83.91* 66.70* 77.65* DFFN DFFN-CNN 60.86 74.54 DFFN DFFN 81.77 65.75 76.42 JAI AIST 57.29 72.67 Kelp 79.19 64.36 75.11 HITSZ-ICRC 56.44 69.43 ConvKN 78.71 63.55 74.95 DFFN DFFN w/o HCF CF 56.04 69.73 DFFN DFFN w/o HCF CF 74.36 60.22 72.88 DFFN DFFN w/o CNN 51.65 67.12 DFFN DFFN w/o CNN 70.21 56.77 68.65