Retrieval Models
Probability Ranking Principle
Web Search
1
Slides based on the books:
Retrieval Models Probability Ranking Principle Web Search Slides - - PowerPoint PPT Presentation
Retrieval Models Probability Ranking Principle Web Search Slides based on the books: 1 Retrieval models Geometric/linear spaces Vector space model Probability ranking principle Language models approach to IR An important
Probability Ranking Principle
1
Slides based on the books:
2
๐๐๐ก๐ข๐๐ ๐๐๐ = ๐๐ ๐๐๐ โ ๐๐๐๐๐๐โ๐๐๐ ๐๐ค๐๐๐๐๐๐ ึ ๐ ๐ต ๐ถ = ๐ ๐ต ๐ ๐ถ ๐ต ๐ ๐ถ ๐ ๐ต, ๐ถ = ๐ ๐ต ๐ถ ๐ ๐ถ = ๐ ๐ถ ๐ต ๐ ๐ต ๐ ๐ต ๐ถ = ๐ ๐ต, ๐ถ ๐(๐ถ) = ๐ ๐ต ๐ ๐ถ ๐ต ๐ ๐ถ
3
๐ ๐ต = ๐(๐ต) ๐ าง ๐ต = ๐(๐ต) 1 โ ๐ ๐ต ๐ ๐ต ๐ถ = ๐ ๐ต ๐ ๐ถ ๐ต ๐ ๐ถ = ๐ ๐ต ฯ๐ ๐ ๐๐|๐ต ฯ๐ ๐ ๐๐ ๐ ๐ต ๐ถ = ๐ ๐ต ๐ถ ๐ าง ๐ต ๐ถ = ๐ ๐ต ๐(๐ถ|๐ต) ๐ ๐ถ ๐ าง ๐ต ๐(๐ถ| าง ๐ต) ๐ ๐ถ = ๐ ๐ต ๐(๐ถ|๐ต) ๐ าง ๐ต ๐(๐ถ| าง ๐ต)
4
5
๐ ๐๐๐ถ = ๐๐๐๐๐รฃ๐ ๐๐๐ข๐ = ๐ ๐๐๐ถ = ๐๐๐๐๐รฃ๐ ๐ ๐๐๐ข๐ ๐๐๐ถ = ๐๐๐๐๐รฃ๐ ๐ ๐๐๐ข๐ ๐๐๐๐ก๐ข๐๐ ๐๐๐ ๐ = ๐๐๐ ๐๐๐ ๐ โ ๐ค๐๐ ๐๐ก๐๐๐๐โ๐๐รง๐ ๐๐ค๐๐๐๐๐๐๐ ๐ ๐ต ๐๐๐ข๐ = ๐ ๐ต ๐ ๐๐๐ข๐ ๐ต ๐ ๐๐๐ข๐
is attempted in a semantically imprecise space of index terms. Probabilities provide a principled foundation for uncertain reasoning. Can we use probabilities to quantify our uncertainties?
User Information Need Documents Document Representation Query Representation
How to match? Uncertain guess of whether document has relevant content
Understanding
uncertain
6
7
P(R=1|document, query)
8
๐ ๐ = 1|๐, ๐ = ๐ ๐, ๐ ๐ = 1 ๐(๐ = 1) ๐(๐, ๐) ๐ ๐ = 0|๐, ๐ = ๐ ๐, ๐ ๐ = 0 ๐(๐ = 0) ๐(๐, ๐)
under 1/0 loss
9
๐ ๐ |๐, ๐ = ๐ ๐, ๐ ๐ ๐(๐ ) ๐(๐, ๐) O ๐ ๐, ๐ = ๐ ๐ = 1|๐, ๐ ๐ ๐ = 0|๐, ๐
under 1/0 loss
10
๐ ๐ |๐, ๐ = ๐ ๐, ๐ ๐ ๐(๐ ) ๐(๐, ๐) O ๐ ๐, ๐ = ๐ ๐ = 1|๐, ๐ ๐ ๐ = 0|๐, ๐ โ ๐ ๐ ๐, ๐ = 1 ๐ ๐ ๐, ๐ = 0)
O ๐ ๐, ๐ = ๐ ๐ = 1|๐, ๐ ๐ ๐ = 0|๐, ๐ โ log ๐ ๐|๐, ๐ ๐ ๐ |๐ ๐ ๐|๐, าง ๐ ๐ าง ๐ |๐
11
12
O ๐ ๐, ๐ = ๐ ๐ = 1|๐, ๐ ๐ ๐ = 0|๐, ๐ ๐ ๐ ๐, ๐ โ log ๐ ๐|๐, ๐ ๐ ๐ |๐ ๐ ๐|๐, าง ๐ ๐ าง ๐ |๐ ๐ ๐ ๐, ๐ โ ๐ ๐ ๐, ๐ = 1 ๐ ๐ ๐, ๐ = 0) Probability Ranking Principle Probabilistic Retrieval Models Language Models