Cross-Language Retrieval LBSC 796/INFM 718R Douglas W. Oard - - PowerPoint PPT Presentation

cross language retrieval
SMART_READER_LITE
LIVE PREVIEW

Cross-Language Retrieval LBSC 796/INFM 718R Douglas W. Oard - - PowerPoint PPT Presentation

Cross-Language Retrieval LBSC 796/INFM 718R Douglas W. Oard Session 12: April 27, 2011 Agenda Questions Overview Cross-Language Search User Interaction User Needs Assessment Who are the potential users? What goals do we


slide-1
SLIDE 1

Cross-Language Retrieval

LBSC 796/INFM 718R Douglas W. Oard Session 12: April 27, 2011

slide-2
SLIDE 2

Agenda

  • Questions
  • Overview
  • Cross-Language Search
  • User Interaction
slide-3
SLIDE 3

User Needs Assessment

  • Who are the potential users?
  • What goals do we seek to support?
  • What language skills must we accommodate?
slide-4
SLIDE 4

Who needs Cross-Language Search?

  • When users can read several languages

– Eliminate multiple queries – Query in most fluent language

  • Monolingual users can also benefit

– If translations can be provided – If it suffices to know that a document exists – If text captions are used to search for images

slide-5
SLIDE 5

Most Widely-Spoken Languages

100 200 300 400 500 600 700 800 900 1000 Chinese English Spanish Russian French Portuguese Arabic Bengali Hindi/Urdu Japanese German Number of Speakers (millions) Secondary Primary Source: Ethnologue (SIL), 1999

slide-6
SLIDE 6

64% 5% 4% 6% 2% 8% 2% 4% 5% 0% 33% 28% 9% 6% 5% 5% 4% 4% 4% 2% English Chinese Spanish Japanese Portuguese German Arabic French Russian Korean

Global Internet Users

slide-7
SLIDE 7

Global Trade

200 400 600 800 1000

USA Germany Japan China France UK Canada Italy Netherlands Belgium Korea Mexico Taiwan Singapore Spain

Exports Imports

Billions of US Dollars (1999)

Source: World Trade Organization 2000 Annual Report

slide-8
SLIDE 8

The Problem Space

  • Retrospective search

– Web search – Specialized services (medicine, law, patents) – Help desks

  • Real-time filtering

– Email spam – Web parental control – News personalization

  • Real-time interaction

– Instant messaging – Chat rooms – Teleconferences

Key Capabilities

Map across languages

– For human understanding – For automated processing

slide-9
SLIDE 9

A Little (Confusing) Vocabulary

  • Multilingual document

– Document containing more than one language

  • Multilingual collection

– Collection of documents in different languages

  • Multilingual system

– Can retrieve from a multilingual collection

  • Cross-language system

– Query in one language finds document in another

  • Translingual system

– Queries can find documents in any language

slide-10
SLIDE 10

The Information Retrieval Cycle

Source Selection Search Query Selection Ranked List Examination Documents Delivery Documents Query Formulation Resource

source reselection System discovery Vocabulary discovery Concept discovery Document discovery

How do you formulate a query?

If you can’t understand the documents…

How do you know something is worth looking at? How can you understand the retrieved documents?

slide-11
SLIDE 11

Translation Translingual Browsing Translingual Search Query Document Select Examine

Information Access Information Use

slide-12
SLIDE 12

Early Work

  • 1964 International Road Research

– Multilingual thesauri

  • 1970 SMART

– Dictionary-based free-text cross-language retrieval

  • 1978 ISO Standard 5964 (revised 1985)

– Guidelines for developing multilingual thesauri

  • 1990 Latent Semantic Indexing

– Corpus-based free-text translingual retrieval

slide-13
SLIDE 13

Multilingual Thesauri

  • Build a cross-cultural knowledge structure

– Cultural differences influence indexing choices

  • Use language-independent descriptors

– Matched to language-specific lead-in vocabulary

  • Three construction techniques

– Build it from scratch – Translate an existing thesaurus – Merge monolingual thesauri

slide-14
SLIDE 14

Cross-Language Retrieval Indexing Languages Machine-Assisted Indexing Information Retrieval Multilingual Metadata Digital Libraries International Information Flow Diffusion of Innovation Information Use Automatic Abstracting

Information Science

Machine Translation Information Extraction Text Summarization Natural Language Processing Multilingual Ontologies Ontological Engineering Textual Data Mining Knowledge Discovery Machine Learning

Artificial Intelligence

Localization Information Visualization Human-Computer Interaction Web Internationalization World-Wide Web Topic Detection and Tracking Speech Processing Multilingual OCR Document Image Understanding

Other Fields

Multilingual Information Access

slide-15
SLIDE 15

Free Text CLIR

  • What to translate?

– Queries or documents

  • Where to get translation knowledge?

– Dictionary or corpus

  • How to use it?
slide-16
SLIDE 16

The Search Process

Choose Document-Language Terms Query-Document Matching Infer Concepts Select Document-Language Terms

Document

Author

Query

Choose Document-Language Terms

Monolingual Searcher

Choose Query-Language Terms

Cross-Language Searcher

slide-17
SLIDE 17

Translingual Retrieval Architecture

Language Identification English Term Selection Chinese Term Selection Cross- Language Retrieval Monolingual Chinese Retrieval 3: 0.91 4: 0.57 5: 0.36 1: 0.72 2: 0.48 Chinese Query Chinese Term Selection

slide-18
SLIDE 18

Evidence for Language Identification

  • Metadata

– Included in HTTP and HTML

  • Word-scale features

– Which dictionary gets the most hits?

  • Subword features

– Character n-gram statistics

slide-19
SLIDE 19

Query-Language IR

English queries Chinese Document Collection Retrieval Engine Translation System English Document Collection Results select examine

slide-20
SLIDE 20

Example: Modular use of MT

  • Select a single query language
  • Translate every document into that language
  • Perform monolingual retrieval
slide-21
SLIDE 21

TDT-3 Mandarin Broadcast News

Systran Balanced 2-best translation

Is Machine Translation Enough?

slide-22
SLIDE 22

Document-Language IR

Retrieval Engine Translation System Chinese queries Chinese documents Results English queries select examine Chinese Document Collection

slide-23
SLIDE 23

Query vs. Document Translation

  • Query translation

– Efficient for short queries (not relevance feedback) – Limited context for ambiguous query terms

  • Document translation

– Rapid support for interactive selection – Need only be done once (if query language is same)

  • Merged query and document translation

– Can produce better effectiveness than either alone

slide-24
SLIDE 24

Interlingual Retrieval

Interlingual Retrieval 3: 0.91 4: 0.57 5: 0.36 Query Translation Chinese Query Terms English Document Terms Document Translation

slide-25
SLIDE 25

Learning From Document Pairs

E1 E2 E3 E4 E5 S1 S2 S3 S4 Doc 1 Doc 2 Doc 3 Doc 4 Doc 5 4 2 2 1 8 4 4 2 2 2 2 1 2 1 2 1 4 1 2 1 English Terms Spanish Terms

slide-26
SLIDE 26

Generalized Vector Space Model

  • “Term space” of each language is different

– Document links define a common “document space”

  • Describe documents based on the corpus

– Vector of similarities to each corpus document

  • Compute cosine similarity in document space
  • Very effective in a within-domain evaluation
slide-27
SLIDE 27

Latent Semantic Indexing

  • Cosine similarity captures noise with signal

– Term choice variation and word sense ambiguity

  • Signal-preserving dimensionality reduction

– Conflates terms with similar usage patterns

  • Reduces term choice effect, even across languages
  • Computationally expensive
slide-28
SLIDE 28
  • il

petroleum probe survey take samples Which translation? No translation! restrain

  • il

petroleum probe survey take samples cymbidium goeringii Wrong segmentation

slide-29
SLIDE 29

What’s a “Term?”

  • Granularity of a “term” depends on the task

– Long for translation, more fine-grained for retrieval

  • Phrases improve translation two ways

– Less ambiguous than single words – Idiomatic expressions translate as a single concept

  • Three ways to identify phrases

– Semantic (e.g., appears in a dictionary) – Syntactic (e.g., parse as a noun phrase) – Co-occurrence (appear together unexpectedly often)

slide-30
SLIDE 30

Learning to Translate

  • Lexicons

– Phrase books, bilingual dictionaries, …

  • Large text collections

– Translations (“parallel”) – Similar topics (“comparable”)

  • Similarity

– Similar pronunciation

  • People
slide-31
SLIDE 31

Types of Lexical Resources

  • Ontology

– Organization of knowledge

  • Thesaurus

– Ontology specialized to support search

  • Dictionary

– Rich word list, designed for use by people

  • Lexicon

– Rich word list, designed for use by a machine

  • Bilingual term list

– Pairs of translation-equivalent terms

slide-32
SLIDE 32

Original query: El Nino and infectious diseases Term selection: “El Nino” infectious diseases Term translation:

(Dictionary coverage: “El Nino” is not found)

Translation selection: Query formulation:

Structure:

Dictionary-Based Query Translation

slide-33
SLIDE 33

Four-Stage Backoff

  • Tralex might contain stems, surface forms,
  • r some combination of the two.

mangez mangez mangez mange mange mangez mange mange mangez mange mangent mange

  • eat
  • eats

eat

  • eat
  • eat

Document Translation Lexicon

surface form surface form stem surface form surface form stem stem stem

French stemmer: Oard, Levow, and Cabezas (2001); English: Inquiry’s kstem

slide-34
SLIDE 34

Exploiting Part-of-Speech (POS)

  • Constrain translations by part-of-speech

– Requires POS tagger and POS-tagged lexicon

  • Works well when queries are full sentences

– Short queries provide little basis for tagging

  • Constrained matching can hurt monolingual IR

– Nouns in queries often match verbs in documents

slide-35
SLIDE 35

BM-25

] ) ( 7 ) ( * 8 )) , ( ) ( * 9 . 3 . ( )) , ( * 2 . 2 ( ][ ) 5 . ) ( ( ) 5 . ) ( ( [log e qtf e qtf d e tf avdl d dl d e tf e df e df N

Q e k k k

     

document frequency term frequency document length

] ) ( 7 ) ( * 8 )) , ( ) ( * 9 . 3 . ( )) , ( * 2 . 2 ( ][ ) 5 . ) ( ( ) 5 . ) ( ( [log e qtf e qtf d e tf avdl d dl d e tf e df e df N

Q e k k k

     

] ) ( 7 ) ( * 8 )) , ( ) ( * 9 . 3 . ( )) , ( * 2 . 2 ( ][ ) 5 . ) ( ( ) 5 . ) ( ( [log e qtf e qtf d e tf avdl d dl d e tf e df e df N

Q e k k k

     

slide-36
SLIDE 36

“Structured Queries”

  • Weight of term a in a document i depends on:

– TF(a,i): Frequency of term a in document i – DF(a): How many documents term a occurs in

  • Build pseudo-terms from alternate translations

– TF (syn(a,b),i) = TF(a,i)+TF(b,i) – DF (syn(a,b) = |{docs with a}U{docs with b}|

  • Downweight terms with any common translation

– Particularly effective for long queries

slide-37
SLIDE 37

(Query Terms: 1: 2: 3: )

Computing Weights

  • Unbalanced:

– Overweights query terms that have many translations

  • Balanced (#sum):

– Sensitive to rare translations

  • Pirkola (#syn):

– Deemphasizes query terms with any common translation

] [ 3 1

3 3 2 2 1 1

DF TF DF TF DF TF  

] ) ( 2 1 [ 2 1

3 3 2 2 1 1

DF TF DF TF DF TF  

] [ 2 1

3 3 2 1 2 1

DF TF DF DF TF TF   

slide-38
SLIDE 38

Ranked Retrieval English/English Translation Lexicon

Measuring Coverage Effects

Ranked List 113,000 CLEF English News Stories CLEF Relevance Judgments Evaluation Measure of Effectiveness 33 English Queries (TD)

slide-39
SLIDE 39

35 Bilingual Term Lists

  • Chinese (193, 111)
  • German (103, 97, 89, 6)
  • Hungarian (63)
  • Japanese (54)
  • Spanish (35, 21, 7)
  • Russian (32)
  • Italian (28, 13, 5)
  • French (20, 17, 3)
  • Esperanto (17)
  • Swedish (10)
  • Dutch (10)
  • Norwegian (6)
  • Portuguese (6)
  • Greek (5)
  • Afrikaans (4)
  • Danish (4)
  • Icelandic (3)
  • Finnish (3)
  • Latin (2)
  • Welsh (1)
  • Indonesian (1)
  • Old English (1)
  • Swahili (1)
  • Eskimo (1)
slide-40
SLIDE 40

Size Effect

String matching Stem matching 7% OOV

slide-41
SLIDE 41

Out-of-Vocabulary Distribution

slide-42
SLIDE 42

Measuring Named Entity Effect

Compute Term Weights Build Index English Documents Compute Term Weights Compute Document Score Sort Scores Ranked List English Query Translation Lexicon

  • Named

Entities

+ Named

Entities

slide-43
SLIDE 43

Named entities removed Named entities from term list Named entities added Full Query

slide-44
SLIDE 44

Hieroglyphic Egyptian Demotic Greek

slide-45
SLIDE 45

Types of Bilingual Corpora

  • Parallel corpora: translation-equivalent pairs

– Document pairs – Sentence pairs – Term pairs

  • Comparable corpora: topically related

– Collection pairs – Document pairs

slide-46
SLIDE 46

Exploiting Parallel Corpora

  • Automatic acquisition of translation lexicons
  • Statistical machine translation
  • Corpus-guided translation selection
  • Document-linked techniques
slide-47
SLIDE 47

Some Modern Rosetta Stones

  • News:

– DE-News (German-English) – Hong-Kong News, Xinhua News (Chinese-English)

  • Government:

– Canadian Hansards (French-English) – Europarl (Danish, Dutch, English, Finnish, French, German, Greek, Italian, Portugese, Spanish, Swedish) – UN Treaties (Russian, English, Arabic, …)

  • Religion

– Bible, Koran, Book of Mormon

slide-48
SLIDE 48

Parallel Corpus

  • Example from DE-News (8/1/1996)

Diverging opinions about planned tax reform Unterschiedliche Meinungen zur geplanten Steuerreform The discussion around the envisaged major tax reform continues . Die Diskussion um die vorgesehene grosse Steuerreform dauert an . The FDP economics expert , Graf Lambsdorff , today came out in favor of advancing the enactment of significant parts of the overhaul , currently planned for 1999 . Der FDP - Wirtschaftsexperte Graf Lambsdorff sprach sich heute dafuer aus , wesentliche Teile der fuer 1999 geplanten Reform vorzuziehen . English: German: English: German: English: German:

slide-49
SLIDE 49

Word-Level Alignment

Diverging opinions about planned tax reform Unterschiedliche Meinungen zur geplanten Steuerreform English German Madam President , I had asked the administration … English Señora Presidenta, había pedido a la administración del Parlamento … Spanish

slide-50
SLIDE 50

A Translation Model

  • From word-aligned bilingual text, we

induce a translation model

  • Example:

) | ( e f p

i

1 ) | ( 

i

f i e

f p

where,

p(探测|survey) = 0.4 p(试探|survey) = 0.3 p(测量|survey) = 0.25 p(样品|survey) = 0.05

slide-51
SLIDE 51

Using Multiple Translations

  • Weighted Structured Query Translation

– Takes advantage of multiple translations and translation probabilities

  • TF and DF of query term e are computed

using TF and DF of its translations:

 

i

f k i i k

D f TF e f p D e TF ) , ( ) | ( ) , (

 

i

f i i

f DF e f p e DF ) ( ) | ( ) (

slide-52
SLIDE 52

Evaluating Corpus-Based Techniques

  • Within-domain evaluation (upper bound)

– Partition a bilingual corpus into training and test – Use the training part to tune the system – Generate relevance judgments for evaluation part

  • Cross-domain evaluation (fair)

– Use existing corpora and evaluation collections – No good metric for degree of domain shift

slide-53
SLIDE 53

40% 50% 60% 70% 80% 90% 100% 110% 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 Cumulative Probability Threshold MAP: CLIR/Monolingual

DAMM IMM PSQ

Retrieval Effectiveness

CLEF French

slide-54
SLIDE 54

Exploiting Comparable Corpora

  • Blind relevance feedback

– Existing CLIR technique + collection-linked corpus

  • Lexicon enrichment

– Existing lexicon + collection-linked corpus

  • Dual-space techniques

– Document-linked corpus

slide-55
SLIDE 55

Bilingual Query Expansion

source language query

Query Translation

results

Source Language IR Target Language IR source language collection target language collection expanded source language query expanded target language terms

Pre-translation expansion Post-translation expansion

slide-56
SLIDE 56

0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 5,000 10,000 15,000 Unique Dutch Terms Mean Average Precision Both Post Pre None

Query Expansion Effect

Paul McNamee and James Mayfield, SIGIR-2002

slide-57
SLIDE 57

Blind Relevance Feedback

  • Augment a representation with related terms

– Find related documents, extract distinguishing terms

  • Multiple opportunities:

– Before doc translation: Enrich the vocabulary – After doc translation: Mitigate translation errors – Before query translation: Improve the query – After query translation: Mitigate translation errors

  • Short queries get the most dramatic improvement
slide-58
SLIDE 58

Indexing Time: Doc Translation

100 200 300 400 500

1 1 5 2 2 5 3 5 4 4 5

Thousands of documents Indexing time (sec)

monolingual cross-language

slide-59
SLIDE 59

Post-Translation “Document Expansion”

Mandarin Chinese Documents Term-to-Term Translation English Corpus IR System Top 5 Automatic Segmentation Term Selection IR System Results English Query

Document to be Indexed Single Document

slide-60
SLIDE 60

Why Document Expansion Works

  • Story-length objects provide useful context
  • Ranked retrieval finds signal amid the noise
  • Selective terms discriminate among documents

– Enrich index with low DF terms from top documents

  • Similar strategies work well in other applications

– CLIR query translation – Monolingual spoken document retrieval

slide-61
SLIDE 61

Lexicon Enrichment

… Cross-Language Evaluation Forum … … Solto Extunifoc Tanixul Knadu …

?

slide-62
SLIDE 62

Lexicon Enrichment

  • Use a bilingual lexicon to align “context regions”

– Regions with high coincidence of known translations

  • Pair unknown terms with unmatched terms

– Unknown: language A, not in the lexicon – Unmatched: language B, not covered by translation

  • Treat the most surprising pairs as new translations
slide-63
SLIDE 63

Cognate Matching

  • Dictionary coverage is inherently limited

– Translation of proper names – Translation of newly coined terms – Translation of unfamiliar technical terms

  • Strategy: model derivational translation

– Orthography-based – Pronunciation-based

slide-64
SLIDE 64

Matching Orthographic Cognates

  • Retain untranslatable words unchanged

– Often works well between European languages

  • Rule-based systems

– Even off-the-shelf spelling correction can help!

  • Character-level statistical MT

– Trained using a set of representative cognates

slide-65
SLIDE 65

Matching Phonetic Cognates

  • Forward transliteration

– Generate all potential transliterations

  • Reverse transliteration

– Guess source string(s) that produced a transliteration

  • Match in phonetic space
slide-66
SLIDE 66

Leveraging Cognates

String Comparison Written Form Written Form Alphabetic Transliteration Pronunciation Phonetic Transliteration Pronunciation Spoken Form Spoken Form Phonetic Comparison Similarity Similarity

slide-67
SLIDE 67

Cross-Language “Retrieval”

Search

Translated Query Ranked List

Query Translation

Query

slide-68
SLIDE 68

Interactive Translingual Search

Search

Translated Query

Selection

Ranked List

Examination

Document

Use

Document

Query Formulation Query Translation

Query Query Reformulation

MT Translated “Headlines” English Definitions

slide-69
SLIDE 69

Selection

  • Goal: Provide information to support decisions
  • May not require very good translations

– e.g., Term-by-term title translation

  • People can “read past” some ambiguity

– May help to display a few alternative translations

slide-70
SLIDE 70
slide-71
SLIDE 71

Merging Ranked Lists

  • Types of Evidence

– Rank – Score

  • Evidence Combination

– Weighted round robin – Score combination

  • Parameter tuning

– Condition-based – Query-based 1 voa4062 .22 2 voa3052 .21 3 voa4091 .17 … 1000 voa4221 .04 1 voa4062 .52 2 voa2156 .37 3 voa3052 .31 … 1000 voa2159 .02 1 voa4062 2 voa3052 3 voa2156 … 1000 voa4201

slide-72
SLIDE 72

Examination Interface

  • Two goals

– Refine document delivery decisions – Support vocabulary discovery for query refinement

  • Rapid translation is essential

– Document translation retrieval strategies are a good fit – Focused on-the-fly translation may be a viable alternative

slide-73
SLIDE 73

Uh oh…

slide-74
SLIDE 74

Translation for Assessment

Indonesian City of Bali in October last year in the bomb blast in the case of imam accused India of the sea on Monday began to be averted. The attack

  • n getting and its plan to make the charges and

decide if it were found guilty, he death sentence of

  • May. Indonesia of the police said that the imam sea

bomb blasts in his hand claim to be accepted. A night Club and time in the bomb blast in more than 200 people were killed and several injured were in which most foreign nationals. …

slide-75
SLIDE 75

MT in a Month

50 60 70 80 90 best competing ISI public ISI public+ ISI unrestricted ISI late Human 6 Human 5 Percent Human Cased NISTr3n4 score

slide-76
SLIDE 76

Experiment Design

Participant 1 2 3 4 Task Order Narrow: Broad:

Topic Key System Key

System B: System A:

Topic11, Topic17 Topic13, Topic29 Topic11, Topic17 Topic13, Topic29 Topic17, Topic11 Topic29, Topic13 Topic17, Topic11 Topic29, Topic13 11, 13 17, 29

slide-77
SLIDE 77

Maryland Experiments

  • MT is almost always better

– Significant overall and for narrow topics alone (one-tailed t-test, p<0.05)

  • F measure is less insightful for narrow topics

– Always near 0 or 1

0.2 0.4 0.6 0.8 1 1.2 umd01 umd02 umd03 umd04 umd01 umd02 umd03 umd04 Participant Average F_0.8 on Two Topics MT GLOSS

|---------- Broad topics -----------| |--------- Narrow topics -----------|

slide-78
SLIDE 78

Delivery

  • Use may require high-quality translation

– Machine translation quality is often rough

  • Route to best translator based on:

– Acceptable delay – Required quality (language and technical skills) – Cost

slide-79
SLIDE 79

Interactive Question Answering

1 2 3 4 5 6 7 8 8 11 13 4 16 6 14 7 2 10 15 12 1 3 9 5

Users with Correct Answers Question Number

slide-80
SLIDE 80

Questions, Grouped by Difficulty

8 Who is the managing director of the International Monetary Fund? 11 Who is the president of Burundi? 13 Of what team is Bobby Robson coach? 4 Who committed the terrorist attack in the Tokyo underground? 16 Who won the Nobel Prize for Literature in 1994? 6 When did Latvia gain independence? 14 When did the attack at the Saint-Michel underground station in Paris occur? 7 How many people were declared missing in the Philippines after the typhoon “Angela”? 2 How many human genes are there? 10 How many people died of asphyxia in the Baku underground? 15 How many people live in Bombay? 12 What is Charles Millon's political party? 1 What year was Thomas Mann awarded the Nobel Prize? 3 Who is the German Minister for Economic Affairs? 9 When did Lenin die? 5 How much did the Channel Tunnel cost?

slide-81
SLIDE 81

81

slide-82
SLIDE 82

82

slide-83
SLIDE 83

83

Side-by-side Translation

slide-84
SLIDE 84

Process User

slide-85
SLIDE 85

85

Task Scenario

Task Scenario: Hezbollah (abridged version) Time: 60 min. Foreign (U.S., Canadian, Australian, and European) citizens are evacuating Lebanon as a result of the recent armed conflict between Israel, Palestinian fighters, and Hezbollah [Hizbullah]. You are assisting with the extraction of US citizens. Compile sites of recent armed conflict (in the last month) in this area. Your supervisor will use these data to develop evacuation plans. For each attack you find, place a number on the map and complete as much as you can of the following template: Location: Date: Type of attack: Number killed/wounded: Include attacks in an areas not shown on the map. For multiple attacks, list each occurrence.

slide-86
SLIDE 86

User Success

1 2 6 7 Correct/Reported 59/91 49/51 37/53 64/76 Precision 65% 96% 70% 84% Relative Recall 29% 24% 18% 32%

Hezbollah scenario: number of attacks reported

slide-87
SLIDE 87

Where Things Stand

  • Ranked retrieval works well across languages

– Bonus: easily extended to text classification – Caveat: mostly demonstrated on news stories

  • Machine translation is okay for niche markets

– Keep an eye on this: accuracy is improving fast

  • Building explainable systems seems possible
slide-88
SLIDE 88
  • Cross-Language IR Algorithms

– Levow et al., IP&M 2005 – Wang and Oard, SIGIR 2006

  • Interactive CLIR

– Oard et al., IP&M 2007 – Oard et al., in Olive et al, Springer 2011

For More Information