Understanding Web Search Satisfaction in a Heterogeneous Environment
Yiqun Liu
Department of Computer Science and Technology, Tsinghua University, China

What's the Gold Standard in Web Search?
[Diagram: a user with an information need issues queries to a search engine and receives search results]
What's the Gold Standard in Web Search?
- Is the information need SATISFIED OR NOT?
- Questionnaire, quiz, concept map (Egusa et al., 2010), etc.
- Problem: requires extra effort from users and may disrupt the search experience
What's the Gold Standard in Web Search?
- Are results RELEVANT TO the user query?
- Cranfield-like approach: relevance judgments and
evaluation metrics (nDCG, ERR, TBG, etc.)
- Problem: behavior assumptions behind metrics
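As a concrete illustration of the offline metrics named above, here is a minimal Python sketch of DCG@k and nDCG@k over graded relevance judgments; the grades below are invented for illustration, not data from the talk.

```python
import math

def dcg_at_k(grades, k):
    """Discounted cumulative gain over the top-k graded results."""
    return sum(g / math.log2(i + 2) for i, g in enumerate(grades[:k]))

def ndcg_at_k(grades, k):
    """DCG normalized by the DCG of the ideal (descending) ranking."""
    ideal = dcg_at_k(sorted(grades, reverse=True), k)
    return dcg_at_k(grades, k) / ideal if ideal > 0 else 0.0

# Hypothetical 4-level relevance grades for a ranked result list.
grades = [3, 2, 3, 0, 1]
print(round(ndcg_at_k(grades, k=5), 3))  # ~0.972 for this toy ranking
```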
What's the Gold Standard in Web Search?
- Can we keep the boss HAPPY?
- Various online metrics: CTR, SAT click,
interleaving, etc.
- Problem: strong assumptions behind metrics
What's the Gold Standard in Web Search?
- Is the user SATISFIED OR NOT?
- Post-search questionnaire; annotation by assessors (Huffman et al., 2007)
- Implicit feedback signals: satisfaction prediction (Jiang et al., 2015)
- Physiological signals: skin conductance response (SCR), facial muscle
movement (EMG-CS) (Ángeles et al., 2015)
Satisfaction Perception of Search Users
RQ1: Satisfaction perception vs. relevance judgment
RQ2: How heterogeneous results affect user satisfaction
RQ3: Satisfaction prediction with interaction features
Outline
- Satisfaction vs. Relevance judgment
Can we use relevance scores to infer satisfaction?
- Satisfaction vs. Heterogeneous results
Do vertical results help improve user satisfaction?
- Satisfaction vs. User interaction
Can we predict satisfaction with implicit signals?
Relevance
- A central concept in information retrieval (IR)
"It (relevance) expresses a criterion for assessing effectiveness in retrieval
of information, or to be more precise, of objects (texts, images, sounds ...)
potentially conveying information." [Saracevic, 1996]
Tefko Saracevic
Former president of ASIS; SIGIR Gerard Salton Award (1997); ASIS Award of Merit (1995)
Relevance Judgment in Web Search
- The role of relevance in IR evaluation
[Diagram: a paradigm of Web search. Users with information needs issue queries to a search engine; user satisfaction depends on the returned search results]
Relevance Judgment in Web Search
- The role of relevance in IR evaluation
[Diagram: a paradigm of Cranfield-like Web search evaluation. Assessors produce relevance judgments over queries and search results, which feed evaluation metrics such as MAP, nDCG, and ERR]
Relevance Judgment in Web Search
Idea (first-tier annotation): relevance is expected to represent users' opinions about whether a retrieved document meets their needs [Voorhees and Harman, 2001].
Practice (second-tier annotation): relevance judgments are made by external assessors who do not:
- originate or fully understand the information needs
- have access to search context
Relevance judgments are often limited to the topical aspect, and differ from user-perceived usefulness.
Example: Relevance vs. Usefulness
You are going to the US by air and want to know the restrictions for both checked and carry-on baggage during air travel.
Queries:
- Q1: baggage restrictions
- Q2: carry-on baggage liquids
Clicked documents (each rated for relevance and usefulness on the slide):
- C1: Air Canada – Baggage Information
- C2: Checked baggage policy – American Airlines
- C3: The Best Way to Pack a Suitcase
Relevance judgments ≠ perceived usefulness
Research Questions
Satisfaction: gold standard; user feedback; query or session level
Relevance: assessor annotated; without session context; document level (query-doc pair)
Usefulness: user feedback; with session context; document level (information need vs. document)
Research Questions
- RQ1.1: Difference between annotated relevance and perceived usefulness
Research Questions
- RQ1.2: Correlation between satisfaction and relevance/usefulness
Research Questions
- RQ1.3: Can perceived usefulness be annotated by external assessors?
Research Questions
- RQ1.4: Can perceived usefulness be predicted with relevance judgments?
Collecting Data
- I. User Study:
- 29 participants
- 15 female, 14 male
- Undergraduate students
from different majors
- 12 search tasks
- From TREC session track
- Collect:
- Users’ behavior logs
- Users' explicit feedback on
usefulness and satisfaction
- II. Data Annotation:
- 24 assessors
- Graduate or senior
undergraduate students
- 9 assessors assigned to label
document relevance
- 15 assessors assigned to label
usefulness and satisfaction
- Collect:
- Relevance annotations
- Usefulness annotations
- Satisfaction annotations
User Study Process
I.1 Pre-experiment training
I.2 Task description reading and rehearsal
I.3 Task completion with the experimental search engine
I.4 Satisfaction and usefulness feedback
I.5 Post-experiment questionnaire

We collect query-level satisfaction feedback (QSATu), usefulness feedback (Uu), and task-level satisfaction feedback (TSATu).
Data Annotation Process
- Relevance annotation (R)
- Four-level relevance score
- For all clicked documents and top-5 documents
- Only query and document are shown to assessors
- Each query-doc pair is judged by 3 assessors
Data Annotation Process
- Usefulness and satisfaction annotations
- Each search session is judged by 3 assessors
Annotation instructions (shown to assessors):
Search Task: You are going to the US by air, so you want to know what restrictions there are for both checked and carry-on baggage during air travel.
The left part shows the queries issued and documents clicked while a user performed the search task with a search engine; complete the following 3-step annotation:
STEP 1: Annotate the usefulness of each clicked document for accomplishing the search task (1 star: not useful at all; 2 stars: somewhat useful; 3 stars: fairly useful; 4 stars: very useful).
STEP 2: Annotate query-level satisfaction for each query (1 star: most unsatisfied; 5 stars: most satisfied).
STEP 3: Annotate task-level satisfaction (1 star: most unsatisfied; 5 stars: most satisfied).
- 4-level usefulness annotation: Ua
- 5-level query satisfaction annotation: QSATa
- 5-level task satisfaction annotation: TSATa
RQ1.1. Usefulness vs. Relevance
- Relevance (assessor, R) / Usefulness (user, Uu) / Usefulness (assessor, Ua)
Finding #1: Only a few documents are not relevant; many more are not useful.
Finding #2: A large share of documents are relevant; far fewer are useful.
RQ1.1. Usefulness vs. Relevance
- Joint distribution of R, Uu and Ua
- Positive correlation (Pearson's r: 0.332, weighted κ: 0.209) between R and Uu
Some relevant documents are not useful to users; irrelevant documents are unlikely to be useful.
Finding: Relevance is necessary but not sufficient for usefulness.
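To reproduce this kind of agreement analysis, standard routines suffice; a minimal sketch computing Pearson's r and a weighted kappa between paired ordinal labels (the label arrays are invented, and the exact weighting scheme used in the study is an assumption):

```python
from scipy.stats import pearsonr
from sklearn.metrics import cohen_kappa_score

# Hypothetical paired labels: 4-level relevance (R) and usefulness (Uu)
# for the same set of clicked documents.
relevance  = [3, 2, 3, 1, 0, 2, 3, 1]
usefulness = [2, 1, 3, 0, 0, 1, 2, 2]

r, p_value = pearsonr(relevance, usefulness)
# "linear" weighting is one common choice for ordinal labels (an assumption).
kappa = cohen_kappa_score(relevance, usefulness, weights="linear")
print(f"Pearson's r = {r:.3f} (p = {p_value:.3f}), weighted kappa = {kappa:.3f}")
```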
RQ1.2. Correlation with Satisfaction
- Correlation with query-level satisfaction QSATu
- Offline metrics (based on relevance annotation R)
- Results are ranked by original positions
- MAP@5, DCG@5, ERR@5, weighted relevance
- Online metrics (based on R or usefulness Uu)
- Results are ranked by click behavior sequences
- Click-sequence metrics: let CS = (d1, ..., d|CS|) be the sequence of clicked documents under a query and M(di) the R or Uu score of the i-th click. The metrics aggregate M over all clicks:

cCG(CS, M) = \sum_{i=1}^{|CS|} M(d_i)

cDCG(CS, M) = \sum_{i=1}^{|CS|} \frac{M(d_i)}{\log_2(i + 1)}

cMAX(CS, M) = \max(M(d_1), M(d_2), ..., M(d_{|CS|}))

cMAX assumes that the user's satisfaction is largely determined by the best document clicked.
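A minimal Python sketch of the three click-sequence metrics just defined, where `measure` maps each clicked document to its R or Uu score; the click sequence and scores below are illustrative:

```python
import math

def cCG(click_seq, measure):
    """Cumulative gain over the click sequence CS = (d1, ..., d_|CS|)."""
    return sum(measure[d] for d in click_seq)

def cDCG(click_seq, measure):
    """Discounted cumulative gain: the i-th click is discounted by log2(i+1)."""
    return sum(measure[d] / math.log2(i + 2)        # i is 0-based, so i+2 = (i+1)+1
               for i, d in enumerate(click_seq))

def cMAX(click_seq, measure):
    """Maximum score among all clicked documents."""
    return max(measure[d] for d in click_seq)

# Hypothetical click sequence and per-document usefulness scores Uu.
clicks = ["d3", "d1", "d7"]
Uu = {"d1": 2, "d3": 3, "d7": 1}
print(cCG(clicks, Uu), round(cDCG(clicks, Uu), 3), cMAX(clicks, Uu))  # 6 4.762 3
```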
RQ1.2. Correlation with Satisfaction
- Correlation with query-level satisfaction QSATu
Metrics based on Uu correlate better with QSATu than those based on R. Click-sequence-based metrics are better than rank-based ones.
RQ1.2. Correlation with Satisfaction
- Correlation with task-level satisfaction TSATu
- Online metrics (based on R or usefulness Uu)
For a session of n queries q1, ..., qn with click sequences CS1, ..., CSn:

sCG(M) = \sum_{j=1}^{n} gain(q_j) = \sum_{j=1}^{n} cCG(CS_j, M)

sDCG(M) = \sum_{j=1}^{n} \frac{gain(q_j)}{1 + \log(j)} = \sum_{j=1}^{n} \frac{cCG(CS_j, M)}{1 + \log(j)}
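Extending the sketch to session level, with `sessions` holding each query's click sequence in issue order (cCG is repeated here so the snippet runs standalone; data are again illustrative):

```python
import math

def cCG(click_seq, measure):
    """Per-query cumulative gain, as defined earlier."""
    return sum(measure[d] for d in click_seq)

def sCG(sessions, measure):
    """Session cumulative gain: sum of per-query cCG over the n queries."""
    return sum(cCG(cs, measure) for cs in sessions)

def sDCG(sessions, measure):
    """Session DCG: the j-th query's gain is discounted by 1 + log(j)."""
    return sum(cCG(cs, measure) / (1.0 + math.log(j))
               for j, cs in enumerate(sessions, start=1))

# Hypothetical session: two queries with their click sequences and Uu scores.
Uu = {"d1": 2, "d3": 3, "d7": 1}
sessions = [["d3", "d1"], ["d7"]]
print(sCG(sessions, Uu), round(sDCG(sessions, Uu), 3))  # 6 and ~5.591
```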
            Uu     R
sCG         0.110  -0.046
sCG/#query  0.437  0.330
sCG/#click  0.525  0.320
sDCG        0.317  0.142

Metrics based on Uu correlate better with TSATu than those based on R.
RQ1.2. Major Findings
- 1. Metrics based on usefulness feedback are
strongly correlated with QSATu and moderately correlated with TSATu
- 2. Click-sequence-based metrics correlate
better with satisfaction than rank-position-based ones
- 3. Usefulness has a stronger correlation with
satisfaction than relevance across all metrics
RQ1.3. Collecting Usefulness Labels
- It is NOT practical to collect usefulness labels from real users at scale.
Can they instead be collected from external assessors?
- An augmented search log for assessors
Assessors annotate the augmented log using the same 3-step instructions shown earlier.

Statistics of annotation data:
              Rnc    Rc     Ua     QSATa  TSATa
#Annotations  1,944  1,161  1,512  935    225
Weighted κ    0.344  0.413  0.530  0.535  0.274
RQ1.3. Collecting Usefulness Labels
- Comparing Ua with Uu, and QSATa with QSATu
- Gold standard: satisfaction annotated by the user, QSATu

Correlation with QSATu:

             Pearson's r (df = 933)   Pref. agreement ratio
             Ua        Uu     R       Ua         Uu     R
cCG          .466H/∗   .572   .425    .701H/∗∗   .751   .669
cDCG         .518H/∗   .724   .498    .742H/∗∗   .826   .698
cMAX         .580H/∗   .751   .563    .681H/∗∗   .779   .632
cCG/#clicks  .548H     .733   .551    .716H/∗    .807   .689
QSATa        .508                     .584

H: difference between Ua and Uu is significant (p < 0.01); ∗/∗∗: difference between Ua and R is significant (p < 0.05 / p < 0.01). Differences between Ua-based metrics and QSATa are significant at p < 0.01 or p < 0.05.

Finding #1: Direct satisfaction annotation (QSATa) is not as good as metrics computed with Ua.
Finding #2: Ua is not as good as user feedback, but still better than R.
RQ1.4. Predicting Usefulness Labels
- Prediction method: learn usefulness from user behavior signals in the logs
- Features: query features (Q), session features (S), and user features (U) capturing search context and behavior
- Annotations: metrics based on relevance annotation (R) or usefulness annotation (A)
Query features (Q)
  rank: the rank of the clicked document in the result list
  #clicks: the number of clicks in the query
  query length: the length of the query, in words and in characters
  click position: whether the click is the first/last/intermediate click in a query with more than one click, and whether the query has only one click
  dwell time: click dwell time and query dwell time
Session features (S)
  #queries: the number of queries in the search session
  #queries w/o click: the number of queries without clicks in the session
  query position: whether the query is the first/last/intermediate query in a session with more than one query, and whether the session has only one query
  time to completion: the total time spent on the search session
  query reformulation: whether the query is generated from a specification/generalization/parallel reformulation, and whether the query leads to one
User features (U)
  user #clicks: the average/max/min/standard deviation of #clicks per query for the user
  user #queries: the average/max/min/standard deviation of #queries per session for the user
  user dwell time: the average/max/min/standard deviation of query/click dwell time for the user
- Evaluation uses cross-validation over search sessions to ensure the results are reliable
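To make the feature definitions concrete, here is a minimal sketch extracting a few of the query- and session-level features above from a toy behavior log; the log schema is an assumption, not the study's actual format:

```python
from statistics import mean

# Toy behavior log: one session as an ordered list of queries; each query
# records its text, the clicked ranks, and per-click dwell times in seconds.
session = [
    {"query": "baggage restrictions", "clicks": [1, 3], "dwell": [45.0, 12.0]},
    {"query": "carry-on baggage liquids", "clicks": [2], "dwell": [90.0]},
]

def query_features(q):
    """A few of the query features (Q) from the table above."""
    return {
        "num_clicks": len(q["clicks"]),
        "query_len_words": len(q["query"].split()),
        "mean_click_dwell": mean(q["dwell"]) if q["dwell"] else 0.0,
    }

def session_features(sess):
    """A few of the session features (S) from the table above."""
    return {
        "num_queries": len(sess),
        "num_queries_wo_click": sum(1 for q in sess if not q["clicks"]),
        "total_dwell_time": sum(sum(q["dwell"]) for q in sess),
    }

print([query_features(q) for q in session])
print(session_features(session))
```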
RQ1.4. Predicting Usefulness Labels
- Results: with user feedback Uu as gold standard
(∗: significant at p < 0.05; ∗∗: significant at p < 0.01)

Model     Pearson's r  MSE      MAE
UQ        0.398∗       1.198∗∗  0.894∗∗
UQ+S      0.410∗∗      1.186∗∗  0.889∗∗
UAll      0.461∗∗      1.103∗∗  0.851∗∗
UAll+A    0.467∗∗      1.105∗∗  0.845∗∗
UAll+R    0.519∗∗      1.021∗∗  0.815∗∗
UAll+A+R  0.521∗∗      1.023∗∗  0.803∗∗
Ua        0.413        1.512    0.852
R         0.332        1.786    1.020
Difference between U(.) and Ua is significant (p < 0.01 or p < 0.05); difference between U(.) and R is significant (p < 0.01 or p < 0.05).
Finding #1: The prediction model UAll is comparable to or better than Ua and R.
Finding #2: Search context and behavior features help enhance assessors' annotations, especially the relevance annotation R.
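The slides do not name the learner behind the U(.) models, so the sketch below stands in a gradient-boosted regressor for usefulness prediction and evaluates it with the same three measures as the table; features and labels are random placeholders, and a simple train/test split replaces the study's cross-validation over sessions:

```python
import numpy as np
from scipy.stats import pearsonr
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.metrics import mean_absolute_error, mean_squared_error
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 12))                      # placeholder Q+S+U feature vectors
y = np.clip(X[:, 0] + rng.normal(size=500), 0, 3)   # placeholder 4-level-ish Uu scores

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
model = GradientBoostingRegressor(random_state=0).fit(X_tr, y_tr)
pred = model.predict(X_te)

# Report the same three measures as the results table.
r, _ = pearsonr(y_te, pred)
print(f"r={r:.3f}  MSE={mean_squared_error(y_te, pred):.3f}  "
      f"MAE={mean_absolute_error(y_te, pred):.3f}")
```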
RQ1.4. Predicting Usefulness Labels
- Results: for prediction of user satisfaction
Correlation with query-level satisfaction QSATu:

                   UAll        UAll+A+R    Ua      Uu
cCG                0.459H      0.490∗∗/H   0.466   0.572
cDCG               0.580∗∗/H   0.612∗∗/H   0.518   0.724
cMAX               0.601H      0.635∗∗/H   0.580   0.751
cCG/#clicks        0.571H      0.608∗∗/H   0.548   0.733
QSATa              0.508
Jiang et al. [23]  0.539
Difference between U(.) and Ua is significant (p < 0.01 or p < 0.05); difference between U(.) and Jiang et al. is significant (p < 0.01 or p < 0.05); difference between U(.) and Uu is significant (p < 0.01).
Finding #1: Prediction results are not as good as users' feedback.
Finding #2: Prediction results are better than assessors' annotations.
Finding #3: Context and behavior features can improve annotations.
Finding #4: Metrics based on predicted usefulness are better than direct prediction or assessors' direct annotation of satisfaction.
Take-Home Messages
- Why we should use usefulness labels:
- Relevance is necessary but not sufficient for usefulness
- Click-sequence-based metrics with usefulness scores
strongly correlate with user satisfaction
- Usefulness annotation is more consistent than
relevance annotation among assessors
- How to collect usefulness labels:
- External assessors can produce reliable and valid
usefulness labels when context information is provided
- We can automatically generate valid usefulness labels
Limitations and Discussion
- Relevance annotation cannot be replaced with
usefulness annotation
- Reusability: usefulness annotation cannot be reused to
evaluate previously unseen systems
- Efficiency: more information and more effort are required
for usefulness annotation
- A possible evaluation paradigm
- Generate usefulness scores from relevance judgments
and context/behavior information
- Evaluate with click-sequence-based metrics
Outline
- Satisfaction vs. Relevance judgment
Can we use relevance scores to infer satisfaction?
- Satisfaction vs. Heterogeneous results
Do vertical results help improve user satisfaction?
- Satisfaction vs. User interaction
Can we predict satisfaction with implicit signals?
Heterogeneous Search Results
- Vertical results are everywhere (on over 80% of SERPs)
[Example SERP screenshot: an Encyclopedia vertical embedded among organic results]
RQ2: How do vertical results affect users’ search satisfaction?
User Study: SERP Preparation
- 30 search tasks sampled from query logs
- Original queries (e.g., "nike basketball shoes") and off-target queries (e.g., "nike football shoes") are submitted to commercial search engines to collect organic results, on-topic verticals, and off-topic verticals
User Study: SERP Preparation
- Controlled Variables:
- Vertical relevance: on-topic or off-topic
- Presentation style: Textual, Encyclopedia, Image,
Download, and News
- Presentation position: rank 1, 3, 5, and without vertical
- SERPs are generated by combining organic results with on-topic or off-topic verticals
User Study: Procedure and Data Collection
- Procedure: pre-experiment training, task description, task completion on the generated SERPs, satisfaction feedback
- 35 participants
- Collected: 5-level satisfaction feedback, eye-tracking logs, mouse behavior logs, screen recordings
Results: Effect of Vertical Relevance
[Figure (a): users' satisfaction feedback grouped by vertical relevance]
Finding #1: Users are less satisfied with SERPs containing off-topic verticals.
Finding #2: Users are less likely to be unsatisfied with on-topic verticals.
Results: Effect of Presentation Style

Users' satisfaction feedback:

                 w/o vertical  w/ on-topic vertical  w/ off-topic vertical  on-off difference
Textual          5.15          5.10 (-0.05)          4.95 (-0.20**)         +0.15*
Image & Textual  4.46          4.99 (+0.53**)        4.67 (+0.21)           +0.32**
Image            5.17          5.07 (-0.10)          4.58 (-0.59**)         +0.49**
Download         4.75          5.25 (+0.50**)        4.60 (-0.15)           +0.65**
News             4.43          4.34 (-0.09)          4.38 (-0.05)           -0.04

[A corresponding table of external assessors' satisfaction annotations appears on the slide.]

Finding #1: Some kinds of on-topic verticals help improve satisfaction.
Finding #2: Some kinds of off-topic verticals hurt user satisfaction.
Finding #3: News verticals have no strong impact on user satisfaction.
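The significance stars in these tables come from comparing satisfaction scores across conditions; a minimal sketch assuming a two-sample t-test (the deck does not state the exact test) on invented ratings:

```python
from scipy.stats import ttest_ind

# Hypothetical satisfaction ratings for SERPs with and without an
# on-topic vertical; the study's real data are per-task feedback scores.
with_vertical    = [5, 4, 5, 5, 3, 4, 5, 4, 5, 5]
without_vertical = [4, 3, 4, 5, 3, 4, 4, 3, 4, 4]

t, p = ttest_ind(with_vertical, without_vertical)
print(f"t = {t:.2f}, p = {p:.3f}")  # stars in the tables mark p < 0.05 / 0.01
```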
Results: Effect of Result Position

Users' satisfaction feedback:

        w/o vertical  w/ on-topic vertical  w/ off-topic vertical  on-off difference
Rank 1  4.79          5.06 (+0.27**)        4.43 (-0.36**)         +0.63**
Rank 3  4.79          4.93 (+0.14)          4.63 (-0.16)           +0.29**
Rank 5  4.79          4.87 (+0.08*)         4.85 (+0.06)           +0.02

[A corresponding table of external assessors' satisfaction annotations appears on the slide.]

Finding #1: On-topic verticals ranked 1st help improve satisfaction.
Finding #2: Off-topic verticals ranked 1st hurt user satisfaction.
Finding #3: Lower-ranked verticals have no strong impact on user satisfaction.
Take-Home Messages
- Vertical results will affect users’ satisfaction
- On-topic Encyclopedia and Download verticals will
bring more satisfaction to users
- Relevant Image verticals have a limited positive effect,
while irrelevant Image verticals negatively affect satisfaction
- News verticals have no significant effect on satisfaction
- Vertical results have a larger effect when presented at
higher positions
Outline
- Satisfaction vs. Relevance judgment
Can we use relevance scores to infer satisfaction?
- Satisfaction vs. Heterogeneous results
Do vertical results help improve user satisfaction?
- Satisfaction vs. User interaction
Can we predict satisfaction with implicit signals?
Satisfaction Prediction
- Based on coarse-grained features
- Click-through on SERP components [Guo et al., 2010]
- Based on fine-grained features
- Cursor positions, scrolling speeds, mouse hovers, etc.
[Guo et al., 2012]
- Based on benefit-cost framework
- Benefit: information gain measured by NDCG, MAP, etc.
- Cost: time/effort spent. [Jiang et al., 2015]
- RQ1.4: satisfaction prediction is possible with
context, behavior signals and relevance judgment
Satisfaction Prediction
- A new information source: mouse movement
- A surrogate for eye-tracking data (the "poor man's eye tracker")
- Practical: can be collected at large scale with low cost
Motif Extraction
- Motif: a frequently appearing sequence of mouse
positions [Lagun et al., 2014]
- Extraction of motifs from mouse data: sliding window +
dynamic time warping [Sakoe and Chiba, 1978]
[Figure: example mouse movement traces from a satisfied and an unsatisfied user session]
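Motif extraction compares candidate windows of a mouse trajectory under dynamic time warping; here is a minimal DTW distance in plain Python (1-D positions for brevity, whereas real mouse traces are 2-D coordinates):

```python
def dtw_distance(a, b):
    """Classic O(len(a) * len(b)) dynamic time warping distance."""
    n, m = len(a), len(b)
    INF = float("inf")
    D = [[INF] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            D[i][j] = cost + min(D[i - 1][j],      # insertion
                                 D[i][j - 1],      # deletion
                                 D[i - 1][j - 1])  # match
    return D[n][m]

# Two mouse-position traces with similar shape but different speeds.
trace_a = [0, 1, 2, 4, 4, 3]
trace_b = [0, 0, 1, 2, 4, 3]
print(dtw_distance(trace_a, trace_b))  # small distance despite the time shift
```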
Motif Selection
- Examples of predictive motifs:
- Quickly going through the SERP
- Revisiting a previous result
- Carefully reading a result
- After carefully reading certain results, going back to the top results and starting over
Satisfaction Prediction Based on Motifs
- Prediction power of motifs across users/queries
- Baselines: fine-grained features from (Guo et al., 2012) and the
benefit-cost framework from (Jiang et al., 2015)
Finding #1: Motif features work as well as other behavior features.
Finding #2: Motif information can improve existing prediction frameworks that have not used mouse movement information.
Take-Home Messages
- RQ1. Satisfaction vs. Relevance judgment
- A new evaluation paradigm based on usefulness
annotation/prediction may better represent user satisfaction (the gold standard for Web search)
- RQ2. Satisfaction vs. Heterogeneous results
- User satisfaction is affected by vertical results
- RQ3. Satisfaction vs. User interaction
- User satisfaction can be predicted with implicit
behavior features, e.g., mouse movement patterns
References
- (RQ1) Jiaxin Mao, Yiqun Liu, Ke Zhou, Jian-Yun Nie, et al. When does
Relevance Mean Usefulness and User Satisfaction in Web Search?
In Proceedings of the 39th ACM SIGIR Conference (SIGIR 2016).
- (RQ2) Ye Chen, Yiqun Liu, Ke Zhou, et al. Does Vertical Bring More
Satisfaction? Predicting Search Satisfaction in a Heterogeneous
Environment. In Proceedings of the 24th ACM CIKM Conference (CIKM 2015).
- (RQ3) Yiqun Liu, Ye Chen, Jinhui Tang, Jiashen Sun, Min Zhang,
Shaoping Ma, Xuan Zhu. Different Users, Different Opinions: Predicting
Search Satisfaction with Mouse Movement Information. In Proceedings of
the 38th ACM SIGIR Conference (SIGIR 2015).
- Data and code are available at http://www.thuir.cn/group/~yqliu
- The dataset is available for academic use: eye fixations, mouse movement
features, clicks, relevance annotations, examination feedback, ...