The Importance of Evaluation for Multilingual Information Retrieval
Carol Peters ISTI-CNR, Pisa, Italy
FIRE 2011 IIT Bombay, 2-4 December 2011
The Importance of Evaluation for Multilingual Information Retrieval - - PowerPoint PPT Presentation
The Importance of Evaluation for Multilingual Information Retrieval Carol Peters ISTI-CNR, Pisa, Italy FIRE 2011 IIT Bombay, 2-4 December 2011 From FIRE 2008 to FIRE 2010 FIRE 2008 CLEF: Objectives and First Results FIRE 2010 10
Carol Peters ISTI-CNR, Pisa, Italy
FIRE 2011 IIT Bombay, 2-4 December 2011
FIRE 2008
FIRE 2010
FIRE 2011
FIRE 2011 IIT Bombay, 2-4 December 2011
In IR the role of an evaluation campaign is to:
assessed
US and Japan: TREC and NTCIR
FIRE 2011 IIT Bombay, 2-4 December 2011
communities
data
successful system development
FIRE 2011 IIT Bombay, 2-4 December 2011
FIRE 2011 IIT Bombay, 2-4 December 2011
dissemination and acquisition
commercial, leisure, educational, entertainment etc. uses
non-English content is growing rapidly
should have equal opportunities
FIRE 2011 IIT Bombay, 2-4 December 2011
http://www.internetworldstats.com/stats.htm
FIRE 2011 IIT Bombay, 2-4 December 2011
Country Population Internet Users 2000 Internet Users 2011 Penetration % of Pop. China
1,336,718,015 22,5000.000 485,000,000 36.3%
United States
313,232,044 95,354,000 245,000,000 78.2%
India
1,189,172,906 5,000,000 100,000,000 8.4%
Japan
126,475,664 47,080,000 99,182,000 78.4%
Brazil
203,429,773 5,000,000 75,982,000 37.4%
Germany
81,471,834 24,000,000 65,125,000 79.9%
Russia
138,739,892 3,100,000 59,700,000 43.0%
UK
62,698,362 15,400,000 51,442,100 82.0%
France
65,102,719 8,500,000 45,262,000 69.5%
Nigeria
155,215,573 20,000 43,982,200 28.3%
http://www.internetworldstats.com/top20.htm
FIRE 2011 IIT Bombay, 2-4 December 2011
Country Population Internet Users 2000 Internet Users 2011 Penetration % of Pop. China
1,336,718,015 22,5000.000 485,000,000 36.3%
United States
313,232,044 95,354,000 245,000,000 78.2%
India
1,189,172,906 5,000,000 100,000,000 8.4%
Japan
126,475,664 47,080,000 99,182,000 78.4%
Brazil
203,429,773 5,000,000 75,982,000 37.4%
Germany
81,471,834 24,000,000 65,125,000 79.9%
Russia
138,739,892 3,100,000 59,700,000 43.0%
UK
62,698,362 15,400,000 51,442,100 82.0%
France
65,102,719 8,500,000 45,262,000 69.5%
Nigeria
155,215,573 20,000 43,982,200 28.3%
http://www.internetworldstats.com/top20.htm
FIRE 2011 IIT Bombay, 2-4 December 2011
presentation of digital information in any of the world's languages.
internationalisation, localisation)
display
ranking, selecting, presenting results)
FIRE 2011 IIT Bombay, 2-4 December 2011
FIRE 2011 IIT Bombay, 2-4 December 2011
from collections in any language (covering basic enabling techniques and including MLIR and CLIR)
(includes CLIR)
any language
multimedia collection containing documents in any language and form
to the user
Oard & Hull , AAAI Spring Symposium, Stanford 1997
FIRE 2011 IIT Bombay, 2-4 December 2011
MLIR/CLIR System development involves integrating IR techniques with Language Processing tools and Language Transfer mechanisms
multilingual / how to handle updates /linguistic and cultural dependent issues
identification / indexing issues (stop words, stemmers, morphological analysers, named entity recognition, ..)
FIRE 2011 IIT Bombay, 2-4 December 2011
(language & culture)
FIRE 2011 IIT Bombay, 2-4 December 2011
(language & culture)
FIRE 2011 IIT Bombay, 2-4 December 2011
(language & culture)
FIRE 2011 IIT Bombay, 2-4 December 2011
generally using language dependent techniques (tokenisation, stopwords, stemming, morphological analysis, decompounding, etc.)
present results
FIRE 2011 IIT Bombay, 2-4 December 2011
FIRE 2011 IIT Bombay, 2-4 December 2011
collections (e.g. semantic analysis, classification, information extraction)
results: merging/ summarization / translation
FIRE 2011 IIT Bombay, 2-4 December 2011
problem
from different kinds of indexing errors:
FIRE 2011 IIT Bombay, 2-4 December 2011
Clough October 2011
FIRE 2011 IIT Bombay, 2-4 December 2011
FIRE 2011 IIT Bombay, 2-4 December 2011
Objectives of CLEF
Promote research and stimulate development of multilingual IR systems, through
component testing
The Vision
Step-by-step promote the development of truly multilingual, multimodal systems
FIRE 2011 IIT Bombay, 2-4 December 2011
CLEF 2000 TEXT
scientific data (Domain-Specific) CLEF 2001 USER NEEDS
CLEF 2002 SPEECH
CLEF 2003 IMAGE & QA
CLEF 2005 WEB & GIR
CLEF 2008 VIDEO
CLEF 2009 USERS & APPLICATIONS
FIRE 2011 IIT Bombay, 2-4 December 2011
CLEF Tracks: 2000 - 2009
AdHoc Track: Promotes development of mono and cross-language text retrieval systems
increasingly complex & diverse tasks
catalog archives
Advancing State-of-Art through Evaluation: AdHoc
FIRE 2011 IIT Bombay, 2-4 December 2011
all languages to be queried, eg morphology, syntax, segmentation, special features
European language types
FIRE 2011 IIT Bombay, 2-4 December 2011
2 4 6 8 10 12 14 2000 2001 2002 2003 2004 2005 2006 2007 2008 Czech Hungarian Bulgarian Portuguese Russian Swedish Finnish Dutch Spanish Italian German French
Donna Harman: CLEF2009 Workshop
Ad Hoc Track 2000-2007: Bilingual & Multilingual Tasks
(EN,ES,FR,DE)
FR -> NL
FI -> DE
FIRE 2011 IIT Bombay, 2-4 December 2011
AdHoc
Monolingual Bilingual Multilingual CLEF2000
DE;FR;IT X→EN X→DE;EN;FR;IT
CLEF2001
DE;ES;FR;IT;NL X→EN, X→NL X→DE;EN;ES;FR;IT
CLEF2002
DE;ES;FI;FR IT;NL;SV X→DE;ES;FI;FR;IT;NL;SV X→EN(newcomer) X→DE;EN;ES;FR;IT
CLEF2003
DE;ES;FI;FR IT;NL;RU;SV IT→ES;DE→IT FR→NL;FI→DE X→RU;X→EN X→DE;EN;ES;FR X→DE;EN;ES;FI FR;IT;NL;SV
CLEF2004
FI;FR;RU;PT ES/FR/IT/RU→FI DE/FI/NL/SV→FR X→RU;X→EN X→FI;FR;RU;PT
CLEF2005
BG;FR;HU;PT X→ BG;FR;HU;PT EX →EN Multi8 2yrson Multi8 merge
CLEF2006
BG;FR;HU;PT X→ BG;FR;HU;PT X →EN ROBUST:X→DE;EN;ES; FR;NL
CLEF2007
BG, CZ, HU ROBUST: EN;FR;PT X→ BG;CZ;HU; AM/ID/OR/ZH→ EN BN/HI/MR/TA/TE→ EN ROBUST: X→EN;FR;PT
CLEF2008 & CLEF2009
FA TEL: DE; EN; FR ROBUST: WSD EN EN→FA TEL: x→DE;EN;FR ROBUST: WSD Es →EN
& DE retrieval
best mono retrieval
FIRE 2011 IIT Bombay, 2-4 December 2011
models and algorithms for different tasks and languages
FIRE 2011 IIT Bombay, 2-4 December 2011
15 languages: BG,CZ,DE,EN,ES,EU,FI,FR,HU,IT,NL,RU,SV,PT and Persian
collection; Cambridge Sociological Abstracts
CLEF Achievements at 10 yr milestone: Test Collections
FIRE 2011 IIT Bombay, 2-4 December 2011
Development of methodology, tools, resources:
parallel & aligned corpora
taggers, decompounders
based, conceptual networks
language models
relevance feedback
results presentation in multilingual context
FIRE 2011 IIT Bombay, 2-4 December 2011
(Exploiting CLEF experience, results, knowhow to provide guidelines for usrs and developers)
FIRE 2011 IIT Bombay, 2-4 December 2011
Best Practices White Papers produced, exploiting CLEF experience, results, knowhow to provide guidelines for users and developers:
Nicolas Moreau, ELDA
Martin Braschler, Zurich
Madrid
Evaluation Methodologies Mark Sanderson, Univ. Sheffield &
Martin Braschler, Zurich Univ. Applied Technology
Designed for practitioners rather than for academics
FIRE 2011 IIT Bombay, 2-4 December 2011
Objectives
resources they need when implementing MLIR/CLR system functionality
campaigns and projects are over
FIRE 2011 IIT Bombay, 2-4 December 2011
FIRE 2011 IIT Bombay, 2-4 December 2011
Result Index Indexing Query Indexing Matching Document representation Query representation Wirtschaft Wirtschaft Result Query representation Translation Documents Documents Translation Document representation Translation Result
Recommendations in three parts:
FIRE 2011 IIT Bombay, 2-4 December 2011
Use weighted & ranked retrieval copes with translation error Use Unicode/XML covers different scripts Use minimal stopword elimination keep maximum information Remove diacritics, special characters tolerant towards inconsistent spelling Use stemming covers different word forms Use decompounding for languages with poductive compound formation tolerant towards different phrasings Use character n-grams when stemming resources not available helps with languages with scarce language resources
FIRE 2011 IIT Bombay, 2-4 December 2011
Maximize coverage of translation resources reduces retrieval failure due to missing translations Use document translation to solve merging problems if combined results in multiple languages are needed Combine different types of translation resources minimizes mistranslations inherent to the individual resources Use an interlingua when direct translation resources unavailable covers language pairs with no direct translation resources Use high-performing weighting schemes, eg Okapi/BM25, LM, DFR or lnu.ltn weighting schemes with robust performance over different types of text Use pseudo-relevance feedback boosts recall (coverage of results)
FIRE 2011 IIT Bombay, 2-4 December 2011
Blueprint for Effective System
many languages as possible
information
FIRE 2011 IIT Bombay, 2-4 December 2011
Recommendations in three parts:
Document Selection
Refinement
FIRE 2011 IIT Bombay, 2-4 December 2011
Allow user to specify language skills and translation preferences in user profile
Recommendations
language inspection is preferable
KWIC
translation facilities must fit together
FIRE 2011 IIT Bombay, 2-4 December 2011
(language & culture)
FIRE 2011 IIT Bombay, 2-4 December 2011
(includes culture and language)
Peters, Braschler, Clough, 2011
FIRE 2011 IIT Bombay, 2-4 December 2011
is what makes us human and what gives each of us a sense of identity (EC 2005)
same knowledge of certain things and would think and act similarly in certain situations
flags, and language
cultural boundaries
much more
Clough October 2011
FIRE 2011 IIT Bombay, 2-4 December 2011
(language & culture)
FIRE 2011 IIT Bombay, 2-4 December 2011
Peters, Braschler, Clough, 2011
FIRE 2011 IIT Bombay, 2-4 December 2011
typically several times larger than in monolingual search)
to customers’ requirements
maintenance must be low
to acquire and easy to optimise to meet client’s needs
Peters, Braschler, Clough, 2011
FIRE 2011 IIT Bombay, 2-4 December 2011
compare approaches in order to improve system performance
experimental data
examining the same issues and comparing ideas and experiences
FIRE 2011 IIT Bombay, 2-4 December 2011
must also consider multilingual interface design
problem; study impact of users’ cultural background and language skills
seeking activities
larger application
search in multiple languages well
FIRE 2011 IIT Bombay, 2-4 December 2011
FIRE 2011 IIT Bombay, 2-4 December 2011