Golden Retriever: an information retrieval project by Peter Peerdeman and Timen Olthof




SLIDE 1

Information Retrieval Group 13

Golden Retriever

an information retrieval project by Peter Peerdeman and Timen Olthof

SLIDE 2

Project Progress: Introductory work

  • 1. Read the assignment and tutorial
  • 2. Downloaded and unzipped the data files
  • 3. Used Lucene to index the dataset
  • 4. Tried out Luke to examine the index
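To illustrate what "indexing the dataset" means, here is a minimal sketch of an inverted index in plain Python. It is not Lucene's API and not the project's code; the function name `build_index` and the toy documents are assumptions made up for this example.

```python
from collections import defaultdict


def build_index(docs):
    """Map each lowercased token to the set of ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.lower().split():
            index[token].add(doc_id)
    return index


# Hypothetical toy documents, not the assignment's dataset.
docs = {
    1: "golden retriever puppies",
    2: "information retrieval with an inverted index",
    3: "golden age of information",
}
index = build_index(docs)

# Looking up a term returns the ids of the documents that contain it.
golden_hits = index["golden"]
information_hits = index["information"]
```

A tool such as Luke lets you inspect this kind of term-to-document mapping in a real Lucene index.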

SLIDE 3

Project Progress: Assignment

  • 1. Found a Lucene API tutorial online
  • 2. Got the example code running (porting)
  • 3. Adjusted the example code to fit the information needs of the assignment

SLIDE 4

Project Progress: Findings

  • Lucene code structure
  • Lucene API is easy to use
  • Very different results for the plain and XML corpora, because their fields differ
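One possible reading of the last finding, sketched in plain Python rather than Lucene: when the same text is indexed once as a single field and once split into XML fields, a query against one field can return different hits. The field names `contents`, `title` and `body`, the sample document, and the helper functions are all assumptions for illustration; the slides do not say which fields the corpora use.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def index_fields(doc_id, fields, index):
    """Add every (field, token) pair of one document to the index."""
    for field, text in fields.items():
        for token in text.lower().split():
            index[(field, token)].add(doc_id)


def search(index, field, term):
    """Return the ids of documents whose given field contains the term."""
    return index.get((field, term.lower()), set())


# Hypothetical document, stored once as plain text and once as XML.
plain_text = "Golden Retriever a report on retrieval"
xml_text = "<doc><title>Golden Retriever</title><body>a report on retrieval</body></doc>"

# Plain corpus: everything goes into one assumed field, "contents".
plain_index = defaultdict(set)
index_fields(1, {"contents": plain_text}, plain_index)

# XML corpus: one field per child element of the document root.
xml_index = defaultdict(set)
root = ET.fromstring(xml_text)
index_fields(1, {child.tag: child.text for child in root}, xml_index)

plain_hits = search(plain_index, "contents", "retrieval")  # whole text searched
title_hits = search(xml_index, "title", "retrieval")       # only the title searched
body_hits = search(xml_index, "body", "retrieval")         # only the body searched
```

Here the plain index finds the document, while the XML index finds it only when the query targets the field that actually holds the term.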

SLIDE 5

Project Progress: Plans

  • Add multiple keyword input and stemming / case folding to the query
  • Write a report on our topic exploration process and findings
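A rough sketch of the planned query handling, in plain Python rather than Lucene: split the input into several keywords, fold case, and stem each term. The suffix list is a deliberately tiny, hypothetical rule set (not the Porter stemmer or any Lucene analyzer), and combining keywords with AND is an assumption; the slides do not specify either.

```python
# Hypothetical, deliberately tiny suffix list; a real stemmer has many more rules.
SUFFIXES = ("ing", "ers", "er", "es", "s")


def stem(token):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def normalize_query(query):
    """Split on whitespace, fold to lower case, and stem each keyword."""
    return [stem(token) for token in query.lower().split()]


def matches(query, text):
    """True if every normalized keyword occurs in the normalized text (AND)."""
    doc_terms = set(normalize_query(text))
    return all(term in doc_terms for term in normalize_query(query))


terms = normalize_query("Golden Retrievers")
hit = matches("golden retrievers", "Retrieving the Golden ball")
miss = matches("golden index", "Retrieving the Golden ball")
```

Because the same normalization is applied to both the query and the text, "Retrievers" and "Retrieving" reduce to the same term and match each other.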

SLIDE 6

Information Retrieval Group 13

Golden Retriever

an information retrieval project by Peter Peerdeman and Timen Olthof