
Relevance Feedback
CISC489/689-010, Lecture #15
Monday, April 13th
Ben Carterette


Query Process

[Diagram: the user's query is processed and sent to the server(s), which rank documents from the corpus / accessible data store with a scoring function f(Q,D); the results feed into evaluation (precision, recall, clicks, …).]




User Interaction

• User inputs a query
• Gets a ranked list of results
• Interaction doesn't have to end there!
  – A typical engine-user interaction: the user looks at the results and reformulates the query
  – What if the engine could do it automatically?


Example




Interaction Model

• Relevance feedback
  – User indicates which documents were relevant, which were nonrelevant
    • Possibly using check boxes or some other button
  – System takes this feedback and uses it to find other relevant documents
  – Typical approach: query expansion
  – Add "relevant terms" to the query with weights


Example Feedback Interface

[Screenshot of a results page with per-result feedback controls: "Promote result", "Remove result", and "Find similar pages".]




Models for Relevance Feedback

• Retrieval models <-> relevance feedback models
• A model for relevance feedback needs to take marked relevant documents and use them to update the query or results
  – The Google model is very simple: move a result to the top on a "promote" click, move it to the bottom on a "remove" click
  – A slightly more complex Google model: use one document as a relevant document for a "similar pages" click
  – Query expansion is a more common approach


Vector Space Feedback

• Documents and queries are vectors
• Add the relevant document vectors together to obtain a "relevant vector"
• Add the nonrelevant document vectors together to obtain a "nonrelevant vector"
• We want a new query vector Q' that is closer to the relevant vector than to the nonrelevant vector




VSM Feedback Illustration

[Diagram: relevant and nonrelevant documents plotted in a two-term space; the original query Q = t1 is moved toward the relevant documents, giving Q' = 3t2 − 3t1.]


Relevance Feedback

• Rocchio algorithm
• Optimal query
  – Maximizes the difference between the average vector representing the relevant documents and the average vector representing the non-relevant documents
• Modifies the query according to the Rocchio update:

    Q' = α·Q + β·(1/|Rel|)·Σ_{D∈Rel} D − γ·(1/|Nonrel|)·Σ_{D∈Nonrel} D

  – α, β, and γ are parameters
    • Typical values: 8, 16, 4



Rocchio Feedback in Practice

• Might add the top k terms only
• Could ignore the nonrelevant part
  – Has not consistently been shown to improve performance
• Might choose to include some documents but not others
  – Most certain, most uncertain, highest quality, …
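
A minimal sketch of the Rocchio update over sparse term-weight vectors, using plain Python dicts; the function and variable names are illustrative, not from the slides:

    from collections import defaultdict

    def rocchio(query, rel_docs, nonrel_docs, alpha=8.0, beta=16.0, gamma=4.0, k=10):
        # Q' = alpha*Q + beta*avg(relevant) - gamma*avg(nonrelevant),
        # with each vector represented as a dict mapping term -> weight
        q_new = defaultdict(float)
        for t, w in query.items():
            q_new[t] += alpha * w
        for d in rel_docs:
            for t, w in d.items():
                q_new[t] += beta * w / len(rel_docs)
        for d in nonrel_docs:  # the nonrelevant part is often omitted (see above)
            for t, w in d.items():
                q_new[t] -= gamma * w / len(nonrel_docs)
        # keep only the top k positively weighted terms for the new query
        top = sorted(q_new.items(), key=lambda tw: -tw[1])[:k]
        return {t: w for t, w in top if w > 0}

    # e.g. a query expanded with terms from one marked-relevant document
    query = {"insider": 1.0, "trading": 1.0}
    relevant = [{"insider": 0.4, "trading": 0.3, "sec": 0.2, "enforcement": 0.1}]
    expanded = rocchio(query, relevant, nonrel_docs=[])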


Rocchio Expanded Query Example

• TREC topic 106:

    Title: U.S. Control of Insider Trading
    Description: Document will report proposed or enacted changes to U.S. laws and regulations designed to prevent insider trading.

• Original query (automatically generated):

    #wsum( 2.0 #uw50( Control of Insider Trading )
           2.0 #1( #USA Control )
           5.0 #1( Insider Trading )
           1.0 proposed 1.0 enacted 1.0 changes 1.0 #1( #USA laws )
           1.0 regulations 1.0 designed 1.0 prevent )

• Expanded query:

    #wsum( 3.88 #uw50( control inside trade ) 2.21 #1( #USA control )
           145.57 #1( inside trade )
           0.54 propose 2.46 enact 0.99 change 4.35 #1( #USA law )
           10.35 regulate 0.80 design 1.73 prevent
           4.60 drexel 2.05 fine 1.85 subcommittee 1.69 surveillance 1.60 markey
           1.53 senate 1.19 manipulate 1.10 pass 1.06 scandal 0.92 edward )




Probabilistic Feedback

• Recall probabilistic models:
  – Relevant class versus nonrelevant class
    • P(R | D, Q) versus P(NR | D, Q)
  – Optimal ranking is in decreasing order of probability of relevance
• Basic probabilistic model assumes no knowledge of the classes
  – e.g. BIM: score(D; Q) = Σ_{i: term i in D and Q} log [ p_i(1 − s_i) / (s_i(1 − p_i)) ], where p_i = P(term i | R) and s_i = P(term i | NR) must be estimated without any class information


Illustration

[Diagram: feedback provides information about the classes; the user's relevant documents are samples from the relevant class, and the user's nonrelevant documents are samples from the nonrelevant class.]




Contingency Table

For term i:

                            relevant      nonrelevant          total
    contains term i         r_i           n_i − r_i            n_i
    does not contain i      R − r_i       N − n_i − R + r_i    N − n_i
    total                   R             N − R                N

  where r_i = number of relevant documents that contain term i
        n_i = number of documents that contain term i
        R   = number of relevant documents
        N   = number of documents

Gives the BIM feedback scoring function; for each query term i appearing in the document:

    log [ (r_i + 0.5) / (R − r_i + 0.5) ] − log [ (n_i − r_i + 0.5) / (N − n_i − R + r_i + 0.5) ]
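
A minimal sketch of the feedback weight this table defines, with the usual 0.5 smoothing added to each cell; the function name and example counts are illustrative:

    import math

    def bim_feedback_weight(r_i, n_i, R, N):
        # log-odds of term i occurring in the relevant class
        # versus the nonrelevant class, smoothed with 0.5
        rel_odds = (r_i + 0.5) / (R - r_i + 0.5)
        nonrel_odds = (n_i - r_i + 0.5) / (N - n_i - R + r_i + 0.5)
        return math.log(rel_odds / nonrel_odds)

    # a term in 8 of 10 relevant documents but only 50 of 10,000 documents
    # overall gets a large positive weight: a good discriminator of relevance
    w = bim_feedback_weight(r_i=8, n_i=50, R=10, N=10000)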



BIM Feedback

• Not query expansion
  – It does not add terms to the query
• It modifies term weights based on presence or absence in relevant documents
  – Terms that appear much more often in the relevant class than in the nonrelevant class are good discriminators of relevance
  – i.e. r_i > n_i − r_i  =>  good discriminator




Language Model Feedback

• Recall the query-likelihood language model:

    P(Q|D) = Π_{t∈Q} P(t|D)

  – Where's the relevance?
• A relevance model is a language model for the information need
  – P(t | R)
  – What is the probability that the author of some relevant document would use the term t?
  – Or, what is the probability that the user with the information need would describe it using t?

Relevance Models

• The query and the relevant documents are samples from the relevance model
• P(D|R): the probability of generating the text in a document given a relevance model
  – document likelihood model
  – less effective than query likelihood due to difficulties comparing across documents of different lengths
• Original motivation was to incorporate relevance into the language model




Estimating the Relevance Model

• The probability of pulling a word w out of the "bucket" representing the relevance model depends on the n query words we have just pulled out
• By definition:

    P(w|R) ≈ P(w | q1 … qn) = P(w, q1 … qn) / P(q1 … qn)


Estimating the Relevance Model

• The joint probability is

    P(w, q1 … qn) = Σ_D P(D) · P(w, q1 … qn | D)

• Assume conditional independence given D:

    P(w, q1 … qn | D) = P(w|D) · Π_{i=1..n} P(qi|D)

• Gives

    P(w, q1 … qn) = Σ_D P(D) · P(w|D) · Π_{i=1..n} P(qi|D)

  Look familiar? Π_i P(qi|D) is the query-likelihood score. Set to 0 for nonrelevant docs.




Estimating the Relevance Model

• P(D) is usually assumed to be uniform
• P(w, q1 … qn) is then simply a weighted average of the language model probabilities for w in a set of documents, where the weights are the query likelihood scores for those documents
• A formal model for relevance feedback in the language model
  – a query expansion technique


Relevance Models in Practice

• In theory:
  – Use all the documents in the collection, weighted by query-likelihood score of relevance
  – Expand the query with every term in the vocabulary
• In practice:
  – Use only the feedback documents, or the top k documents, or a subset
  – Expand the query with only the n highest-probability terms (as in the sketch below)
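
A minimal sketch of relevance-model estimation as just described, restricted to a set of feedback documents and the n highest-probability terms; document language models are assumed to be pre-smoothed dicts mapping term -> P(t|D), and all names are illustrative:

    from collections import defaultdict

    def estimate_relevance_model(query, doc_models, n=10):
        # P(w, q1..qn) = sum_D P(D) P(w|D) prod_i P(qi|D), with P(D) uniform;
        # the product over query terms is each document's query-likelihood score
        rm = defaultdict(float)
        for lm in doc_models:
            ql = 1.0
            for q in query:
                ql *= lm.get(q, 1e-10)
            for w, p in lm.items():
                rm[w] += ql * p
        total = sum(rm.values())        # normalize to get P(w|R)
        rm = {w: p / total for w, p in rm.items()}
        # expand the query with only the n highest-probability terms
        return dict(sorted(rm.items(), key=lambda wp: -wp[1])[:n])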




Example RMs from Top 10 Docs

[Figure: highest-probability terms of relevance models estimated from the top 10 retrieved documents.]

Example RMs from Top 50 Docs

[Figure: highest-probability terms of relevance models estimated from the top 50 retrieved documents.]




KL-Divergence

• Given the true probability distribution P and another distribution Q that is an approximation to P,

    KL(P ‖ Q) = Σ_x P(x) · log [ P(x) / Q(x) ]

  – Use negative KL-divergence for ranking, and assume the relevance model R is the true distribution (KL-divergence is not symmetric); dropping the Σ_w P(w|R) log P(w|R) term, which is the same for every document, gives the scoring function

    score(D; Q) = Σ_w P(w|R) · log P(w|D)

    (P(w|R): relevance model; P(w|D): document language model)


KL-Divergence

• Given a simple maximum likelihood estimate for P(w|R), based on the frequency of each word in the query text, the ranking score is

    score(D; Q) = Σ_{w∈V} (f_{w,Q} / |Q|) · log P(w|D)

  – rank-equivalent to the query likelihood score
• The query likelihood model is a special case of retrieval based on relevance models
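
A minimal sketch of KL-divergence ranking under the assumptions above; names are illustrative:

    import math

    def kl_score(relevance_model, doc_model):
        # rank-equivalent to -KL(R || D): sum_w P(w|R) * log P(w|D)
        return sum(p * math.log(doc_model.get(w, 1e-10))
                   for w, p in relevance_model.items())

    # with a maximum likelihood relevance model taken from the query text,
    # this reduces to a rank-equivalent of the query-likelihood score
    query = ["insider", "trading"]
    mle_rm = {w: query.count(w) / len(query) for w in set(query)}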




Language Model Feedback: Another Perspective

• The language model uses smoothing:

    P(Q|D) = Π_{t∈Q} P(t|D) = Π_{t∈Q} [ α_D · tf_{t,D} / |D| + (1 − α_D) · ctf_t / |C| ]

• Smoothing "expands" the document with terms that were not originally included
• Document expansion
  – Instead of modifying the query representation, modify the document representation
• The language model performs expansion by default
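
A minimal sketch of the smoothed query-likelihood score above, assuming every query term occurs somewhere in the collection; names are illustrative:

    import math

    def p_term(t, doc_tf, doc_len, coll_tf, coll_len, alpha=0.5):
        # P(t|D) = alpha * tf_{t,D}/|D| + (1 - alpha) * ctf_t/|C|
        return (alpha * doc_tf.get(t, 0) / doc_len
                + (1 - alpha) * coll_tf.get(t, 0) / coll_len)

    def query_likelihood(query, doc_tf, doc_len, coll_tf, coll_len):
        # log P(Q|D); the collection part gives unseen terms nonzero
        # probability, implicitly "expanding" the document
        return sum(math.log(p_term(t, doc_tf, doc_len, coll_tf, coll_len))
                   for t in query)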

Testing Relevance Feedback

• Let's say we implement relevance feedback
  – Our index allows us to find all of the terms contained in a document
  – The interface allows the user to specify "relevant" or "not relevant" for each document
  – We have implemented some query expansion method like Rocchio
• How do we determine whether it's useful?



Testing Relevance Feedback

• System-based measures (precision, recall, etc.) can tell us whether relevance feedback is effective
• User studies can tell us whether users actually like it or not


A User Study

• Koenemann and Belkin, "A Case for Interaction: A Study of Interactive Information Retrieval Behavior and Effectiveness", CHI 1996
• User study with 64 subjects
• Three different types of feedback:
  – System does pseudo-feedback without the user's knowledge ("opaque")
  – System does pseudo-feedback and shows the expanded query to the user ("transparent")
  – System does pseudo-feedback but allows the user to modify the expanded query before reranking ("penetrable")




Experimental Procedure

• Users submit a query
  – First without relevance feedback
  – Second, based on one of the three feedback approaches (selected randomly)
• System evaluation is based on the last query submitted
• With no RF, there was no difference between users


Effectiveness With Feedback

• RF gives a clear improvement
• "Opaque" and "transparent" have the same effectiveness
• "Penetrable" is best



Number of Queries

• How many queries did users try before stopping?
• "Transparent" resulted in one additional query
• "Penetrable" resulted in one fewer


Feedback Uptake

• Users used short queries
• But they often "copied" words from the expanded terms
• Shorter queries with more transparent feedback




User Reactions

• Subjects liked being able to see and select feedback terms ("penetrable")
• Those in the "opaque" setting wanted to be able to see what was happening
• Subjects used feedback to put less effort into formulating queries, instead putting effort into choosing terms


Pseudo-Relevance Feedback

• Instead of making the user give feedback, just assume the top-ranked documents are relevant
• Use those to expand the query
• Re-rank documents with the new query; show only the final results to the user



Pseudo-Feedback Algorithm for RM

[Figure: algorithm for pseudo-relevance feedback with relevance models; the steps are sketched in code below.]
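
A minimal sketch of the whole pseudo-feedback loop, reusing estimate_relevance_model and kl_score from the earlier sketches; the index layout (doc id -> smoothed language model) is an assumption:

    import math

    def rm_pseudo_feedback(query, index, k=10, n=10):
        # 1. initial ranking by query likelihood
        def ql(lm):
            return sum(math.log(lm.get(t, 1e-10)) for t in query)
        ranked = sorted(index, key=lambda d: ql(index[d]), reverse=True)
        # 2. assume the top k documents are relevant; estimate the relevance model
        rm = estimate_relevance_model(query, [index[d] for d in ranked[:k]], n=n)
        # 3. re-rank by KL-divergence; only this final ranking is shown to the user
        return sorted(index, key=lambda d: kl_score(rm, index[d]), reverse=True)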


Testing Pseudo-Relevance Feedback

• Does it work?
  – Effectiveness measures only; the user does not need to be involved
• Common result at TREC:
  – Small but statistically significant improvement in mean average precision
    • e.g. Rocchio improved MAP from 0.373 to 0.407 at TREC in 1993
  – Relevance models improve MAP significantly at recent TRECs
  – Some queries improve, some get much worse




Experimental Results

50 TREC topics (numbers 401-450)
Evaluated with mean average precision

[Table of results.]