Relevance Feedback in Web Search
Sergei Vassilvitskii (Stanford University) Eric Brill (Microsoft Research)
Relevance Feedback in Web Search Sergei Vassilvitskii (Stanford - - PowerPoint PPT Presentation
Relevance Feedback in Web Search Sergei Vassilvitskii (Stanford University) Eric Brill (Microsoft Research) Introduction Web search is a non-interactive system. Exceptions are spell checking and query suggestions By design search
Sergei Vassilvitskii (Stanford University) Eric Brill (Microsoft Research)
suggestions
(TFiDF) + cosine similarity scores.
structure of the web tell us?
pages.
➡ Similar to Pagerank.
pages.
➡ Similar to Pagerank.
irrelevant pages.
➡ “Reverse Pagerank” ➡ Those who point to web spam are likely to be
spammers.
(perfect)
dataset
0.1 0.2 0.3 0.4 1 2 3 4 5
Baseline
dataset
URLs that are targets
0.1 0.2 0.3 0.4 1 2 3 4 5
Baseline Perfect Targets
url1 url2 url3 url4 url5 url6
url1 url2 url3 url4 url5 url6 bad result good result unrated result
url6 url3
url1 url2 url4 url5 bad result good result unrated result
url6 url3
url1 url2 url4 url5 bad result good result unrated result
url1 url2 url3 url4 url5 url6 bad result good result unrated result url1 url2 url3 url4 url5 url6
relevance of (Baseline histogram)
relevance information u u v v → u u v u → v u
relevance of . u u
static score vs. static + RF scores.
Cumulative Gain):
NDCG ∝
2rel(i) − 1 log(1 + i)
three subsets of pages.
1 2 3 4
Alg Rocchio
Roccio: Demotes the best result
three subsets of pages.
NDCG < 100
1 2 3 4
Alg Rocchio
three subsets of pages.
NDCG < 100
NDCG < 85
1 2 3 4
Alg Rocchio
Increased performance for harder queries
datasets.
NDCG < 100
NDCG < 85
7.5 15.0 22.5 30.0
Alg Rocchio
results?
based RF approaches
RF approaches
Any Questions?