Retrieval Models Probability Ranking Principle Web Search Slides - - PowerPoint PPT Presentation

โ–ถ
retrieval models
SMART_READER_LITE
LIVE PREVIEW

Retrieval Models Probability Ranking Principle Web Search Slides - - PowerPoint PPT Presentation

Retrieval Models Probability Ranking Principle Web Search Slides based on the books: 1 Retrieval models Geometric/linear spaces Vector space model Probability ranking principle Language models approach to IR An important


slide-1
SLIDE 1

Retrieval Models

Probability Ranking Principle

Web Search

1

Slides based on the books:

slide-2
SLIDE 2

Retrieval models

  • Geometric/linear spaces
  • Vector space model
  • Probability ranking principle
  • Language models approach to IR
  • An important emphasis in recent work
  • Probabilistic retrieval model
  • Binary independence model
  • Okapiโ€™s BM25

2

slide-3
SLIDE 3

Recall a few probability basics

  • For events A and B, the Bayesโ€™ Rule is:
  • Interpretation:

๐‘ž๐‘๐‘ก๐‘ข๐‘“๐‘ ๐‘—๐‘๐‘  = ๐‘ž๐‘ ๐‘—๐‘๐‘  โˆ™ ๐‘š๐‘—๐‘™๐‘“๐‘š๐‘—โ„Ž๐‘๐‘๐‘’ ๐‘“๐‘ค๐‘—๐‘’๐‘“๐‘œ๐‘‘๐‘“ ึž ๐‘ž ๐ต ๐ถ = ๐‘ž ๐ต ๐‘ž ๐ถ ๐ต ๐‘ž ๐ถ ๐‘ž ๐ต, ๐ถ = ๐‘ž ๐ต ๐ถ ๐‘ž ๐ถ = ๐‘ž ๐ถ ๐ต ๐‘ž ๐ต ๐‘ž ๐ต ๐ถ = ๐‘ž ๐ต, ๐ถ ๐‘ž(๐ถ) = ๐‘ž ๐ต ๐‘ž ๐ถ ๐ต ๐‘ž ๐ถ

3

slide-4
SLIDE 4

Recall a few probability basics

  • Independence assumption:
  • Odds:

๐‘ƒ ๐ต = ๐‘ž(๐ต) ๐‘ž าง ๐ต = ๐‘ž(๐ต) 1 โˆ’ ๐‘ž ๐ต ๐‘ž ๐ต ๐ถ = ๐‘ž ๐ต ๐‘ž ๐ถ ๐ต ๐‘ž ๐ถ = ๐‘ž ๐ต ฯ‚๐‘— ๐‘ž ๐‘๐‘—|๐ต ฯ‚๐‘— ๐‘ž ๐‘๐‘— ๐‘ƒ ๐ต ๐ถ = ๐‘ž ๐ต ๐ถ ๐‘ž าง ๐ต ๐ถ = ๐‘ž ๐ต ๐‘ž(๐ถ|๐ต) ๐‘ž ๐ถ ๐‘ž าง ๐ต ๐‘ž(๐ถ| าง ๐ต) ๐‘ž ๐ถ = ๐‘ž ๐ต ๐‘ž(๐ถ|๐ต) ๐‘ž าง ๐ต ๐‘ž(๐ถ| าง ๐ต)

4

slide-5
SLIDE 5

Recall a few probability basics

5

๐‘ž ๐‘‡๐‘€๐ถ = ๐‘‘๐‘๐‘›๐‘ž๐‘“รฃ๐‘ ๐‘’๐‘๐‘ข๐‘ = ๐‘ž ๐‘‡๐‘€๐ถ = ๐‘‘๐‘๐‘›๐‘ž๐‘“รฃ๐‘ ๐‘ž ๐‘’๐‘๐‘ข๐‘ ๐‘‡๐‘€๐ถ = ๐‘‘๐‘๐‘›๐‘ž๐‘“รฃ๐‘ ๐‘ž ๐‘’๐‘๐‘ข๐‘ ๐‘๐‘ž๐‘๐‘ก๐‘ข๐‘“๐‘ ๐‘—๐‘๐‘ ๐‘— = ๐‘๐‘ž๐‘ ๐‘—๐‘๐‘ ๐‘— โˆ™ ๐‘ค๐‘“๐‘ ๐‘๐‘ก๐‘—๐‘›๐‘—๐‘šโ„Ž๐‘๐‘œรง๐‘ ๐‘“๐‘ค๐‘—๐‘’๐‘“๐‘œ๐‘‘๐‘—๐‘ ๐‘ž ๐ต ๐‘’๐‘๐‘ข๐‘ = ๐‘ž ๐ต ๐‘ž ๐‘’๐‘๐‘ข๐‘ ๐ต ๐‘ž ๐‘’๐‘๐‘ข๐‘

slide-6
SLIDE 6

Why probabilities in IR?

  • In traditional IR systems, matching between each document and query

is attempted in a semantically imprecise space of index terms. Probabilities provide a principled foundation for uncertain reasoning. Can we use probabilities to quantify our uncertainties?

User Information Need Documents Document Representation Query Representation

How to match? Uncertain guess of whether document has relevant content

Understanding

  • f user need is

uncertain

6

slide-7
SLIDE 7

The document ranking problem

  • We have a collection of documents
  • User issues a query
  • A list of documents needs to be returned
  • Ranking method is the core of an IR system:
  • In what order do we present documents to the user?
  • We want the โ€œbestโ€ document to be first, second best second, etcโ€ฆ.

Idea: Rank by probability of relevance of the document w.r.t. information need

7

slide-8
SLIDE 8

Modeling relevance

P(R=1|document, query)

  • Let d represent a document in the collection.
  • Let R represent relevance of a document w.r.t. to a query q
  • Let R=1 represent relevant and R=0 not relevant.
  • Our goal is to estimate:

8

๐‘ž ๐‘  = 1|๐‘Ÿ, ๐‘’ = ๐‘ž ๐‘’, ๐‘Ÿ ๐‘  = 1 ๐‘ž(๐‘  = 1) ๐‘ž(๐‘’, ๐‘Ÿ) ๐‘ž ๐‘  = 0|๐‘Ÿ, ๐‘’ = ๐‘ž ๐‘’, ๐‘Ÿ ๐‘  = 0 ๐‘ž(๐‘  = 0) ๐‘ž(๐‘’, ๐‘Ÿ)

slide-9
SLIDE 9

Probability Ranking Principle (PRP)

  • PRP in action: Rank all documents by ๐‘ž ๐‘  = 1|๐‘Ÿ, ๐‘’
  • Theorem: Using the PRP is optimal, in that it minimizes the loss (Bayes risk)

under 1/0 loss

  • Provable if all probabilities correct, etc. [e.g., Ripley 1996]
  • Using odds, we reach a more convenient formulation of ranking :

9

๐‘ž ๐‘ |๐‘Ÿ, ๐‘’ = ๐‘ž ๐‘’, ๐‘Ÿ ๐‘  ๐‘ž(๐‘ ) ๐‘ž(๐‘’, ๐‘Ÿ) O ๐‘† ๐‘Ÿ, ๐‘’ = ๐‘ž ๐‘  = 1|๐‘Ÿ, ๐‘’ ๐‘ž ๐‘  = 0|๐‘Ÿ, ๐‘’

slide-10
SLIDE 10

Probabilistic retrieval models interpretation

  • PRP in action: Rank all documents by ๐‘ž ๐‘  = 1|๐‘Ÿ, ๐‘’
  • Theorem: Using the PRP is optimal, in that it minimizes the loss (Bayes risk)

under 1/0 loss

  • Provable if all probabilities correct, etc. [e.g., Ripley 1996]
  • Using odds, we reach a more convenient ranking formulation:

10

๐‘ž ๐‘ |๐‘Ÿ, ๐‘’ = ๐‘ž ๐‘’, ๐‘Ÿ ๐‘  ๐‘ž(๐‘ ) ๐‘ž(๐‘’, ๐‘Ÿ) O ๐‘† ๐‘Ÿ, ๐‘’ = ๐‘ž ๐‘  = 1|๐‘Ÿ, ๐‘’ ๐‘ž ๐‘  = 0|๐‘Ÿ, ๐‘’ โˆ ๐‘ž ๐‘’ ๐‘Ÿ, ๐‘  = 1 ๐‘ž ๐‘’ ๐‘Ÿ, ๐‘  = 0)

slide-11
SLIDE 11

Language models interpretation

  • In language models, we do a different formulation towards

the query posterior given the document as a model.

O ๐‘† ๐‘Ÿ, ๐‘’ = ๐‘ž ๐‘  = 1|๐‘Ÿ, ๐‘’ ๐‘ž ๐‘  = 0|๐‘Ÿ, ๐‘’ โˆ log ๐‘ž ๐‘Ÿ|๐‘’, ๐‘  ๐‘ž ๐‘ |๐‘’ ๐‘ž ๐‘Ÿ|๐‘’, าง ๐‘  ๐‘ž าง ๐‘ |๐‘’

11

slide-12
SLIDE 12

The two families of Retrieval Models

12

O ๐‘† ๐‘Ÿ, ๐‘’ = ๐‘ž ๐‘  = 1|๐‘Ÿ, ๐‘’ ๐‘ž ๐‘  = 0|๐‘Ÿ, ๐‘’ ๐‘ƒ ๐‘† ๐‘Ÿ, ๐‘’ โˆ log ๐‘ž ๐‘Ÿ|๐‘’, ๐‘  ๐‘ž ๐‘ |๐‘’ ๐‘ž ๐‘Ÿ|๐‘’, าง ๐‘  ๐‘ž าง ๐‘ |๐‘’ ๐‘ƒ ๐‘† ๐‘Ÿ, ๐‘’ โˆ ๐‘ž ๐‘’ ๐‘Ÿ, ๐‘  = 1 ๐‘ž ๐‘’ ๐‘Ÿ, ๐‘  = 0) Probability Ranking Principle Probabilistic Retrieval Models Language Models

  • Vector Space Model
  • Binary Independent Model
  • BM25
  • LM Dirichlet
  • LM Jelineck-Mercer