Information Retrieval Modeling
Russian Summer School in Information Retrieval
Djoerd Hiemstra
http://www.cs.utwente.nl/~hiemstra

PART 1: the basics

Goal
Gain basic knowledge of IR
Intuitive understanding of difficulty of
[Diagram labels: representation (twice), comparison, feedback]
this is what IR models are about
Massachusetts dumps Microsoft Office
Massachusetts
The people who brought you the Boston tea party, have joined in another revolution against good King Billy’s Office software. The state government has decided that all electronic documents saved and created by state employees have to use open formats. Microsoft is clearly worried. A lot of people live in Massachusetts and that is a big thumbs up for open sauce. However, it is hoping to get around the problem by applying recognition from an industry standards body for recognition of its own formats as open standards.
apply big billi bodi boston brought creat decid docum dump electron employe format good govern hope industri join king live lot massachusett microsoft offic softwar standard state tea thumb worri
Today's weather forecast
Clear periods leading to a moderate frost in many parts away from the east coast. The northeast will be cloudier, as will the far south, here the risk of a few snow flurries. The bitterly cold easterly wind persisting. Plenty of sunshine around, but rather cloudy in northeast, here some wintry showers. The south also rather cloudy, perhaps sleet or snow edging into southwestern and central southern parts later in day.
bitterli central clear cloudi cloudier coast cold dai east easterli edg flurri forecast frost lead moder northeast part period persist plenti risk shower sleet snow south southern southwestern sunshin todai weather wind wintri
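The two example documents are each paired with an alphabetical list of index terms. A minimal sketch of how such a list might be produced is given here, assuming lowercasing, a simple letters-only tokenizer, and a deliberately tiny, hypothetical stop list. The terms on the slides also appear to be stemmed (for example "offic", "softwar"); stemming is left out of this sketch, so its output is unstemmed.

```python
import re

# Hypothetical, deliberately tiny stop list; real systems use longer ones.
STOP_WORDS = {"a", "all", "and", "as", "by", "for", "has", "have",
              "in", "is", "of", "that", "the", "to"}

def index_terms(text):
    """Return the sorted set of lowercased tokens that are not stop words.

    Tokens are maximal runs of the letters a-z, so punctuation and digits
    act as separators. No stemming is applied.
    """
    tokens = re.findall(r"[a-z]+", text.lower())
    return sorted({t for t in tokens if t not in STOP_WORDS})

sentence = ("The state government has decided that all electronic "
            "documents have to use open formats.")
print(index_terms(sentence))
```

Representing a document as a set of terms discards word order and frequency, which is the "bag of words" view that the term lists on the slides illustrate.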
[Formula: a sum over k ∈ matching terms; the summand was lost in extraction]
containing "social")
R = 11 (number of relevant docs)
n = 1000 (number of docs containing "social")
N = 10000 (total number of docs)
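Counts like these are the inputs to a probabilistic relevance weight for a query term. The sketch below assumes the widely used Robertson/Sparck Jones form with a 0.5 correction, which may differ from the exact formula on the slide. The number of relevant documents containing the term (called `r` here) is not legible in the extracted text, so the values of `r` tried below are hypothetical.

```python
import math

def rsj_weight(r, R, n, N):
    """Relevance weight of a term, with the common 0.5 correction.

    r: relevant docs containing the term (hypothetical here)
    R: relevant docs, n: docs containing the term, N: all docs
    """
    numerator = (r + 0.5) * (N - n - R + r + 0.5)
    denominator = (n - r + 0.5) * (R - r + 0.5)
    return math.log(numerator / denominator)

R, n, N = 11, 1000, 10000       # counts given for the term "social"
for r in (0, 1, 5, 11):         # assumed values, not taken from the slide
    print(r, round(rsj_weight(r, R, n, N), 3))
```

The weight grows with `r`: the more of the relevant documents contain the term, the more strongly its presence suggests relevance, and a term that occurs in none of them receives a negative weight.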
[Formula: a sum over I linking to D]
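The fragment "I linking to D" reads like the index of a sum over pages I that link to a page D, as in PageRank-style link analysis. That reading is an assumption. Under it, a minimal power-iteration sketch could look as follows; the damping value 0.85, the iteration count, and the four-page link graph are all hypothetical choices for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Iteratively score pages; links maps a page to the pages it links to."""
    pages = sorted(set(links) | {t for ts in links.values() for t in ts})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page starts each round with the uniform "random jump" share.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Page I passes an equal share of its score to each page D
                # it links to, so D sums over all I linking to D.
                share = damping * rank[page] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling page: spread its score evenly over all pages.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank

# Hypothetical link graph: C has three in-links, D has none.
links = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(links)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 4))
```

In this toy graph the page with the most in-links ends up with the highest score, and the page nobody links to keeps only the uniform share.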