Cross-Language Explicit Semantic Analysis
Nedim Lipka Maik Anderka Benno Stein Bauhaus University Weimar
www.webis.de
1 Lipka@CLEF [∧] 01.10.09
Outline
❑ Retrieval Models
❑ The CL-ESA Retrieval Model
❑ CL-ESA at TEL@CLEF 2009
❑ Formalization of CL-ESA
[Diagram: retrieval model R]
❑ Information need → human query formulation → query q ∈ Q → query representation
❑ Real-world document d ∈ D → computer-based document generation → document representation d ∈ D
❑ Computer-based relevance judgment between query representation and document representation
❑ Underlying theories: conceptual document models, linguistics, computer linguistics
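The components named on this slide (query representation, document representation, computer-based relevance judgment) can be made concrete with a very small sketch. It assumes a plain term-frequency representation and cosine similarity as the relevance function; neither choice is stated on the slide, and the function names `represent`, `relevance`, and `rank` are hypothetical.

```python
import math
from collections import Counter


def represent(text):
    """Map a real-world text to a term-frequency representation (assumed model)."""
    return Counter(text.lower().split())


def relevance(q_rep, d_rep):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(weight * d_rep[term] for term, weight in q_rep.items())
    norm_q = math.sqrt(sum(w * w for w in q_rep.values()))
    norm_d = math.sqrt(sum(w * w for w in d_rep.values()))
    if norm_q == 0 or norm_d == 0:
        return 0.0
    return dot / (norm_q * norm_d)


def rank(query, documents):
    """Score every document against the query and sort by decreasing relevance."""
    q_rep = represent(query)
    scored = [(relevance(q_rep, represent(doc)), doc) for doc in documents]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


ranking = rank("semantic analysis", ["explicit semantic analysis", "weather report"])
```

Swapping `represent` for a concept-based representation leaves `relevance` and `rank` unchanged, which is the sense in which the representation and the relevance judgment are separate parts of a retrieval model.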
[Figure, built up over several slides: vectors of example weights (0.4, 0.2, ...), (0.5, 0.2, 0.1, ...), (0.6, 0.9, 0.3, ...), (0.6, 0.3, ...) and weight matrices with rows (... 0.1 0.0), (... 0.2 0.1), (... 0.3 0.7), (... 0.4 0.1). The diagram layout was lost in extraction.]
❑ Wikipedia snapshot March 2009
❑ 169,000 articles per language
❑ 3 index collections
❑ Query representation: title + description
❑ Document representation: title + subject + alternative
❑ Selecting the correct index collection requires language detection.
❑ The correct index collection is not always available.
❑ The fields title, subject, and alternative do not always share the same language.
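The three issues listed here can be illustrated with a per-field selection step. The sketch below is only an assumption about how such a step might look: the stopword lists, the language codes `en`, `de`, `fr`, and the function names are hypothetical, and a real system would use a proper language detector rather than stopword counting.

```python
# Hypothetical stopword lists, used only as a toy language detector.
STOPWORDS = {
    "en": {"the", "of", "and", "in", "to"},
    "de": {"der", "die", "und", "das", "von"},
    "fr": {"le", "la", "et", "les", "des"},
}


def detect_language(text):
    """Return the language whose stopwords occur most often, or None if none occur."""
    tokens = text.lower().split()
    scores = {
        lang: sum(token in words for token in tokens)
        for lang, words in STOPWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def select_index_collections(record, available=("en", "de", "fr")):
    """Choose an index collection per field, since fields may differ in language.

    A field maps to None when its language is undetected or when no index
    collection exists for the detected language.
    """
    choice = {}
    for field in ("title", "subject", "alternative"):
        text = record.get(field, "")
        lang = detect_language(text) if text else None
        choice[field] = lang if lang in available else None
    return choice


record = {"title": "Die Geschichte der Stadt", "subject": "history of the city"}
selection = select_index_collections(record)
```

Deciding per field rather than per record covers the mixed-language case, and the `None` result makes the "index collection not available" case explicit so the caller can fall back or skip the field.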
[Figure, shown in two build steps: example weight vectors (0.4, 0.2, ...) and (0.5, 0.2, 0.1, ...) with a weight matrix with rows (... 0.1 0.0) and (... 0.2 0.1). The diagram layout was lost in extraction.]
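Matrix dimensions (two of the matrices carry a transpose superscript $T$; their symbols were lost in extraction): $|D_I| \times |D|$, $|D_I| \times |V|$, $|V| \times |D|$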
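The three dimension labels are consistent with a product of a |DI| × |V| matrix and a |V| × |D| matrix that yields a |DI| × |D| matrix. The sketch below checks that shape arithmetic on toy data. It assumes the |DI| × |V| factor is the transposed term-document matrix of the index collection and the |V| × |D| factor is the term-document matrix of the collection to be represented; the names `A_DI` and `A_D` and all values are illustrative.

```python
def transpose(matrix):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(column) for column in zip(*matrix)]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    b_columns = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in b_columns] for row in a]


# Toy term-document matrix of the index collection: |V| = 4 terms, |DI| = 2 documents.
A_DI = [
    [1, 0],
    [0, 1],
    [1, 1],
    [0, 0],
]

# Toy term-document matrix of the collection to represent: |V| = 4 terms, |D| = 3 documents.
A_D = [
    [1, 0, 0],
    [0, 1, 0],
    [0, 1, 1],
    [1, 0, 0],
]

# (|DI| x |V|) times (|V| x |D|) gives |DI| x |D|: one concept vector per column.
esa = matmul(transpose(A_DI), A_D)
shape = (len(esa), len(esa[0]))
```

Each column of `esa` is the concept-space representation of one document, with one entry per index document.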
(Leading symbols reconstructed from the surviving fragments; $\varphi$ denotes the similarity function.)
$\varphi(A_{D_{I1}}^T \cdot q,\; A_{D_{I2}}^T \cdot d)$
$(A_{D_{I1}}^T \cdot q)^T \cdot A_{D_{I2}}^T \cdot d$
$q^T \cdot \underbrace{A_{D_{I1}} \cdot A_{D_{I2}}^T}_{\text{translation}} \cdot d$
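The fragments on these slides can be read as a similarity between the query projected through index collection DI1 and the document projected through index collection DI2. That reading is an assumption, since the leading symbols did not survive extraction. The sketch below uses cosine similarity as the similarity function (also an assumption) and checks on toy data that the inner product of the two projections equals the query multiplied by a term-by-term matrix built from both index collections. The names `A1`, `A2`, and `T` are illustrative.

```python
import math


def transpose(matrix):
    return [list(column) for column in zip(*matrix)]


def matmul(a, b):
    b_columns = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in b_columns] for row in a]


def matvec(matrix, vector):
    return [sum(x * y for x, y in zip(row, vector)) for row in matrix]


def dot(u, v):
    return sum(x * y for x, y in zip(u, v))


def cosine(u, v):
    norm = math.sqrt(dot(u, u)) * math.sqrt(dot(v, v))
    return dot(u, v) / norm if norm else 0.0


# Aligned index collections: column j of A1 and column j of A2 describe the
# same concept in language 1 and language 2 (toy values).
A1 = [[1, 0], [0, 1], [1, 1]]  # |V1| x |DI|, vocabulary of language 1
A2 = [[0, 1], [1, 0]]          # |V2| x |DI|, vocabulary of language 2

q = [1, 0, 1]  # query as a term vector over V1
d = [1, 0]     # document as a term vector over V2

q_concepts = matvec(transpose(A1), q)  # projection of q into the concept space
d_concepts = matvec(transpose(A2), d)  # projection of d into the concept space
score = cosine(q_concepts, d_concepts)

# The same inner product, written with a |V1| x |V2| term-by-term matrix.
T = matmul(A1, transpose(A2))
inner_via_T = dot(q, matvec(T, d))
```

Because both projections land in the same concept space, no dictionary is needed; the matrix `T` plays the role that a translation table would play in a dictionary-based approach. The cosine score differs from the plain inner product only by the normalization of the two concept vectors.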