TREC, TAC, takeoffs, tacks, tasks, and titillations for 2009
Ian Soboroff, NIST ian.soboroff@nist.gov
Agenda
TREC 2008
(some) reflections on TREC
TAC, a new evaluation conference for NLP
TREC 2009 preview
TREC Goals
research ideas to increase communication among academia, industry, and government
labs and commercial products
measures for information retrieval
different aspects of information retrieval
Ellen Voorhees (chair), David Lewis, James Allan, John Prager, Chris Buckley, Steve Robertson, Gord Cormack, Mark Sanderson, Sue Dumais, Ian Soboroff, Donna Harman, Richard Tong, Bill Hersh, Ross Wilkinson
Beijing Univ. of Posts & Telecommunications; Korea University; University of Avignon; Brown University; Max-Planck-Institut Informatik; University of Glasgow; Carnegie Mellon University; Nat’l Univ. of Ireland, Galway; Chinese Acad. of Sciences; Northeastern University; Clearwell Systems, Inc.; Open Text Corporation; University of Iowa (2); CNIPA ICT Lab; Pohang Univ Science & Tech; University of Lugano; Dalian U. of Technology; RMIT University; Dublin City University; Sabir Research; University of Massachusetts; Fondazione Ugo Bordoni; SEBIR; Fudan University; University of Neuchatel; H5; SUNY Buffalo; University of Pittsburgh; Heilongjiang Inst. of Tech.; TNO ICT; University of Texas at Dallas; Hong Kong Polytechnic U.; Tsinghua University; University of Twente; IBM Research Lab; Universidade do Porto; University of Waterloo (2); Indian Inst Tech, Kharagpur; University College, London; Ursinus College; Indiana University; Wuhan University; INRIA; University of Amsterdam (2); York University; Kobe University
blog: Craig Macdonald, Iadh Ounis, Ian Soboroff
enterprise: Peter Bailey, Nick Craswell, Arjen de Vries, Ian Soboroff, Paul Thomas
legal: Jason Baron, Bruce Hedin, Doug Oard, Stephen Tomlinson
million query: James Allan, Jay Aslam
relevance feedback: Chris Buckley, Stephen Robertson
pooling for test collection building
computations
(with negotiation), and more...
possible for any of three topics
domain expert lawyer
clicks
two sampling strategies
(Carterette et al, SIGIR 2006)
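The "pooling for test collection building" mentioned above can be illustrated with a minimal sketch. It assumes plain depth-k pooling: for each topic, the top k documents from every submitted run are merged into one set that assessors then judge. The function name, the run layout, and the toy document IDs are all hypothetical, and the slides do not say which pooling variant or depth was used.

```python
from collections import defaultdict


def build_pools(runs, depth=100):
    """Depth-k pooling sketch (hypothetical helper, not a TREC tool).

    runs: dict mapping run name -> dict mapping topic id -> ranked list of doc ids.
    Returns a dict mapping topic id -> set of doc ids to be judged.
    """
    pools = defaultdict(set)
    for rankings_by_topic in runs.values():
        for topic, ranked_docs in rankings_by_topic.items():
            # Only the top `depth` documents of each run enter the pool.
            pools[topic].update(ranked_docs[:depth])
    return dict(pools)


# Toy data: two runs, two topics (all identifiers are made up).
runs = {
    "runA": {"t1": ["d1", "d2", "d3"], "t2": ["d7", "d8"]},
    "runB": {"t1": ["d2", "d4", "d1"], "t2": ["d8", "d9"]},
}
pools = build_pools(runs, depth=2)
```

Documents outside the pool are never judged and are usually treated as non-relevant, which is why the pool depth and the diversity of the contributing runs matter for how reusable the collection is.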
[Figure: timeline of TREC tracks, 1992-2008. Themes: static text; streamed text; human-in-the-loop; beyond just English; beyond text; Web searching, size; answers, not docs; retrieval in a domain; personal documents. Tracks shown: Ad Hoc, Robust, Million query, Routing, Filtering, Interactive, HARD, feedback, X→{X,Y,Z}, Chinese, Spanish, Video, Speech, OCR, VLC, Web, Terabyte, Enterprise, Q&A, Novelty, Legal, Genome, Blog, Spam.]
SemEval
TREC DUC RTE
end-user tasks (e.g., summarization, QA)
technical support)
(organizational infrastructure, data, assessing, tools)
RTE: systems recognize when one piece of text entails or contradicts another
QA: systems return a precise answer in response to a question, focusing on opinion questions asked over blogs
Summarization: systems return a fluent summary of documents focused by a narrative or set of questions
articles for a user who has already read an earlier set
answers to opinion question(s) -- joint with QA
another
contexts.
terms returned by QA systems searching the Web
Baldwin is Antigua's Prime Minister.
systems
The opposition Antigua Labour Party (ALP) has blasted that country's prime minister, Baldwin Spencer, for publicly advocating that Cuba's Fidel Castro be awarded the Order of the Community (OCC) - the Community's highest honour.
clusters of news articles, A and B, where A documents precede B documents
summaries that contribute to satisfying the information need expressed in the topic statement:
B, assuming reader has read cluster A
Why don’t people like Trader Joe’s?
loved it!; service could have been better; yummy snacks; unhelpful clerk; parking nightmare; innovative; Yuk!; filthy
Trader Joe’s is filthy, has poor service, and is a parking nightmare.
TARGET: "MythBusters"
1018.1 RIGID LIST Who likes Mythbuster’s?
    BLOG06-3334 CAPS_CHAMP
    BLOG06-8580 Jon
    BLOG06-3982 Zonk
1018.2 SQUISHY LIST Why do people like Mythbuster’s?
    BLOG06-6706 The Mythbusters chicas are purdy.
    BLOG06-5962 It's geek, period. And a lot of fun. I like that they have women on their team who are also into mechanical stuff and applied science.
1018.3 RIGID LIST Who do people like on Mythbuster’s?
    BLOG06-3187 Kari Byron
    BLOG06-4849 scottie
    BLOG06-6570 Jamie Hyneman
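The question series above has a regular shape: one target, several numbered questions, each typed as a rigid or squishy list, and each answer paired with the ID of a supporting blog document. A minimal sketch of that shape as a data structure follows. The class and field names are assumptions made for illustration and do not reflect the official TAC file format; the long squishy answer is shortened to its first sentences.

```python
from dataclasses import dataclass, field


@dataclass
class Question:
    qid: str    # e.g. "1018.1"
    qtype: str  # "RIGID LIST" or "SQUISHY LIST", as labeled on the slide
    text: str
    answers: list = field(default_factory=list)  # (doc_id, answer string) pairs


@dataclass
class Series:
    target: str
    questions: list


series = Series(
    target="MythBusters",
    questions=[
        Question("1018.1", "RIGID LIST", "Who likes Mythbuster's?",
                 [("BLOG06-3334", "CAPS_CHAMP"),
                  ("BLOG06-8580", "Jon"),
                  ("BLOG06-3982", "Zonk")]),
        Question("1018.2", "SQUISHY LIST", "Why do people like Mythbuster's?",
                 [("BLOG06-6706", "The Mythbusters chicas are purdy."),
                  ("BLOG06-5962", "It's geek, period. And a lot of fun.")]),
        Question("1018.3", "RIGID LIST", "Who do people like on Mythbuster's?",
                 [("BLOG06-3187", "Kari Byron"),
                  ("BLOG06-4849", "scottie"),
                  ("BLOG06-6570", "Jamie Hyneman")]),
    ],
)


def supporting_docs(s):
    """Return the sorted, de-duplicated document IDs cited by any answer."""
    return sorted({doc_id for q in s.questions for doc_id, _ in q.answers})


rigid_ids = [q.qid for q in series.questions if q.qtype == "RIGID LIST"]
```

Keeping the document ID next to each answer string mirrors the slide, where every answer is tied to the blog post that supports it.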
Population
Huang (York U), Mihai Lupu (IRF)
Chemistry
references
domain experts
Glasgow), Ian Soboroff (NIST)
engines)
from one or more clusters
collection
Clarke (U Waterloo)
products, organizations...
(CSIRO), Arjen de Vries (CWI), Thijs Westerveld (Teezir)
Social Media has a data challenge
Shameless Plug!