WebCLEF 2007: The Overview
Valentin Jijkoun, Maarten de Rijke

Overview
- A bit of history
- Task description
- Assessment
- Evaluation measures
- Runs
- Results
- Conclusion

WebCLEF — A bit of history
Launched as a known-item search task in 2005, repeated in 2006
- Resources created were used for a number of purposes
But there are information needs out there besides navigational ones, even on the web
WiQA
- Pilot that ran at QA@CLEF 2006
- Question answering using Wikipedia
- Undirected informational queries: “Tell me about X”

Task description
Wishes
- Task close to a Real-World™ information need
- Clear definition of a user
- Multilinguality should come naturally
- Collections should be a natural source
- Collections, topics, and assessors’ judgments should be re-usable
- Challenging
Our hypothetical user
- “A knowledgeable person, writing a survey or overview with a clear goal and audience in mind.”
- Locates items of information to be included in the article to be written, and uses an automatic system to support this
- Uses online resources only

Task description (2)
User formulates her information need (“topic”); a structured sketch follows the example below
- A short topic title (e.g., the title of the survey article)
- A free-text description of the goals and intended audience
- A list of languages in which the user is willing to accept results
- Optional list of known sources (URLs of docs the user considers relevant)
- Optional list of Google retrieval queries
Example
- title: Significance testing
- description: I want to write a survey (about 10 screens) for undergraduate students on statistical significance testing, with an overview of the ideas, common misconceptions, and critiques. I will assume some basic knowledge of statistics.
- language(s): English
- known sources: http://en.wikipedia.org/wiki/Statistical_hypothesis_testing ...
- retrieval queries: significance testing ; site:mathworld.wolfram.com ; ...

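To make the topic format concrete, here is a minimal sketch of the example topic as a Python structure; the field names are illustrative assumptions, not an official WebCLEF topic schema.

```python
# Minimal sketch of the example topic as a Python structure; field names
# are illustrative assumptions, not an official WebCLEF topic schema.
topic = {
    "title": "Significance testing",
    "description": (
        "I want to write a survey (about 10 screens) for undergraduate "
        "students on statistical significance testing ..."
    ),
    "languages": ["English"],
    # Optional: URLs of documents the user already considers relevant.
    "known_sources": [
        "http://en.wikipedia.org/wiki/Statistical_hypothesis_testing",
    ],
    # Optional: Google retrieval queries, as given in the example topic.
    "retrieval_queries": [
        "significance testing",
        "site:mathworld.wolfram.com",
    ],
}
```
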
Task description (3)
Data
- Close to the Real-World™ scenario, but tractable
- Define a collection per topic: a “mashup” of
- All “known” sources specified
- Top 1000 results per retrieval query
- Per result: the query that retrieved it, its rank, and a conversion (of HTML, PDF, PS) to plain text
System’s response
- Ranked list of plain-text snippets extracted from the sub-collection of the topic (modelled in the sketch below)
- Each snippet indicates its origin

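The slides fix which pieces of information each record carries, but not a concrete format; the following is a minimal sketch of the per-topic sub-collection and a system response under assumed names.

```python
# Minimal sketch of the per-topic data and a system response; all field
# names are assumptions, only the fields themselves come from the slides.
from dataclasses import dataclass
from typing import Optional

@dataclass
class CollectionDoc:
    url: str              # origin of the document
    query: Optional[str]  # retrieval query that found it (None for known sources)
    rank: Optional[int]   # rank within the top-1000 results for that query
    text: str             # plain-text conversion of the HTML/PDF/PS original

@dataclass
class Snippet:
    text: str             # plain-text snippet extracted from the sub-collection
    source_url: str       # each snippet must indicate its origin

# A system's response is a ranked list of snippets:
response: list[Snippet] = [
    Snippet(text="A statistical hypothesis test is a method of ...",
            source_url="http://en.wikipedia.org/wiki/Statistical_hypothesis_testing"),
]
```
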
Assessment
Manual assessment by the topic creators
- Somewhat similar to the “Other” questions at TREC 2006
- Blind
- Pool the responses of all systems into an anonymized sequence of text segments
- For each response, only include the first 7,000 characters
The assessor was asked …
- To create a list of nuggets (“atomic facts”) that should be included in the article for the topic
- To link character spans from a response to nuggets (as in the data-model sketch below)
- Different spans within a single snippet may be linked to multiple nuggets
- To mark a span as “known” if it expresses a fact present in a known source

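One way to picture the assessment output is as span-to-nugget links over each pooled response; a minimal sketch, with all names assumed.

```python
# Minimal sketch of the assessment output: character spans of a pooled
# response linked to nuggets, with a "known" flag. All names are assumed.
from dataclasses import dataclass

@dataclass
class SpanLink:
    start: int      # character offsets within the (truncated) response text
    end: int
    nugget_id: str  # the atomic fact this span expresses
    known: bool     # True if the fact is already present in a known source

# Different spans within one snippet may be linked to different nuggets:
links = [
    SpanLink(start=0, end=120, nugget_id="definition_of_significance", known=True),
    SpanLink(start=130, end=250, nugget_id="critique_of_p_values", known=False),
]
```
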
Assessment (3)
Similar to INEX and some TREC tasks, assessment was carried out by the topic creators

Evaluation measures
Based on standard precision and recall
For a given response R (a ranked list of snippets) of a system S for topic T, define (see the sketch below):
- recall: the sum of the character lengths of all spans in R linked to nuggets, divided by the total length of the linked spans pooled from the responses of all systems for T
- precision: the number of characters of R that belong to at least one span linked to a nugget, divided by the total character length of R

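Under the span-link data model sketched above, these measures could be computed as follows; a minimal sketch, assuming spans are character intervals and the pooled total for the topic is given. Names are illustrative, not the official evaluation code.

```python
# Minimal sketch of the character-based measures; assumes each linked
# span is a (start, end) character interval within the concatenated
# response R. Names are illustrative, not the official evaluation code.

def precision(response_length: int, linked_spans: list[tuple[int, int]]) -> float:
    """Fraction of R's characters covered by at least one linked span."""
    covered: set[int] = set()
    for start, end in linked_spans:
        covered.update(range(start, end))  # each character counted once
    return len(covered) / response_length if response_length else 0.0

def recall(linked_spans: list[tuple[int, int]], total_linked_chars: int) -> float:
    """Length of R's linked spans, relative to the linked spans pooled
    from the responses of all systems for the topic."""
    found = sum(end - start for start, end in linked_spans)
    return found / total_linked_chars if total_linked_chars else 0.0

# Example: a 1,000-character response with two (overlapping) linked spans.
spans = [(0, 120), (100, 250)]
print(precision(1000, spans))  # 0.25  -> 250 distinct covered characters
print(recall(spans, 2000))     # 0.135 -> 270 span characters out of 2,000
```

Note that the recall numerator sums span lengths as given (overlaps counted twice), while precision counts each character of R at most once, matching the two definitions above.
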
Runs
What did people try?
- DCU
- Sentence-based snippets. Multiple ways of re-ranking snippets: (1) word overlap with topic, description, known sources; (2) word overlap plus thresholding; (3) comparing parses of known sources with parses of snippets.
- UIndonesia
- …
- USAL
- Fixed-size text windows (1,500 bytes); a windowing sketch follows this list. Focus on segmentation (“snippet generation”); ranking based on structured queries (topic, description, anchor text, the vocabulary from the “known sources”).
- UvA
- Sentence-based and paragraph-based snippets. Centrality scores plus penalties for overlap with known sources (see next talk).

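As a concrete illustration of fixed-size window segmentation in the spirit of the USAL runs: only the 1,500-byte window size comes from the slides; the rest (byte-aligned, non-overlapping windows) is an assumption.

```python
# Minimal sketch of fixed-size window segmentation in the spirit of the
# USAL runs; only the 1,500-byte window size comes from the slides, the
# rest (byte-aligned, non-overlapping windows) is an assumption.

def window_snippets(text: str, size: int = 1500) -> list[str]:
    """Split a plain-text document into consecutive fixed-size windows."""
    data = text.encode("utf-8")
    windows = [data[i:i + size] for i in range(0, len(data), size)]
    # Decode leniently: a window boundary may split a multi-byte character.
    return [w.decode("utf-8", errors="ignore") for w in windows]

snippets = window_snippets("some long plain-text document " * 200)
print(len(snippets), len(snippets[0].encode("utf-8")))  # 4 1500
```
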
Runs (2)
Four groups submitted 12 runs
Baseline: Google
- Ranked list of at most 1,000 snippets
Disclaimer
- Google’s web search engine was not designed for WebCLEF 2007 (but for web page finding)
- Don’t interpret the baseline as an assessment of or comment on Google as a web search engine

Results
Baseline plus P/R values at three cut-off points

Results (2)
UVA par vs and UVA par wo show the best performance across all cut-off points
USAL reina0.25 and USAL reina1 have comparable performance
Note: precision grows as the cut-off point increases
- Systems manage to find relevant snippets, but the ranking is far from optimal

Conclusions
WebCLEF 2007
- New task
- Aimed at undirected informational search goals (“Tell me about X.”)
- Most submitted runs outperformed the Google-based baseline
Work left to be done
- Detailed error analysis
- Refine assessor guidelines
- Refine the evaluation interface
- Not enough topics?
Thanks
- Participants and assessors

WebCLEF — The Future?
Why does the most interesting task at CLEF have so few participants?
The track will only continue if the number of participants is sufficiently large
To help you prepare for next year
- 2007 topics, documents, qrels
- Code of the best-performing 2007 system freely available (as a baseline)
More at the breakout session
- (12:00–13:00, this room)

Tips
Look at summarization
- Evaluation measures
- Especially the last two years
Passages vs. paragraphs