M d li U B h i d I t ti Modeling User Behavior and Interactions
Lecture 5: Search Interfaces + New Directions
Eugene Agichtein Emory University
1
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
Modeling User Behavior and Interactions M d li U B h i d I t ti - - PowerPoint PPT Presentation
Modeling User Behavior and Interactions M d li U B h i d I t ti Lecture 5: Search Interfaces + New Directions Eugene Agichtein Emory University 1 Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia) Lecture 5 Plan 1.
1
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
2
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
3
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
4
– This description is crucial. – User can identify good/relevant hits based on description.
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
5
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 6
where sd is the number of sentences in document d
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
7
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
8
– whether the sentence is a heading – whether it is the first or second line of the document – the total number of query terms occurring in the sentence the total number of query terms occurring in the sentence – the number of unique query terms in the sentence – the longest contiguous run of query words in the sentence – a density measure of query words (significance factor)
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
9
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
10
11
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
Dumais, S, E. Cutrell, and H. Chen. Optimizing search by
List Organization Category Org (SWISH)
Query: jaguar showing results in context, CHI 2001
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
Dumais, S, E. Cutrell, and H. Chen. Optimizing search by web showing results in context, CHI 2001 web search results training (offline) running (online) classified SVM SVM web pages model classified Search
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
results
13
Dumais, S, E. Cutrell, and H. Chen. Optimizing search by
showing results in context, CHI 2001
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
14
Dumais, S, E. Cutrell, and H. Chen. Optimizing search by
showing results in context, CHI 2001
harley, car, auto, honda, porsche …
p , , p , , user, users, pc, hosting, os, downloads ...
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
15
Dumais, S, E. Cutrell, and H. Chen. Optimizing search by
showing results in context, CHI 2001
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 16
Dumais, S, E. Cutrell, and H. Chen. Optimizing search by showing results in context, CHI 2001 17
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
Dumais, S, E. Cutrell, and H. Chen. Optimizing search by Category Interface List Interface showing results in context, CHI 2001
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 18
Dumais, S, E. Cutrell, and H. Chen. Optimizing search by showing results in context, CHI 2001 19
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
Dumais, S, E. Cutrell, and H. Chen. Optimizing search by 7-point rating scale (1=disagree; 7=agree)
Question Category List significance
showing results in context, CHI 2001
Question Category List significance It was easy to use this software. 6.4 3.9 p<.001 I liked using this software 6.7 4.3 p<.001 I prefer this to my usual Web Search engine 6.4 4.3 p<.001 It t t d f th f lt ti 6 4 4 2 < 001 It was easy to get a good sense of the range of alternatives 6.4 4.2 p<.001 I was confident that I could find information if it was there. 6.3 4.4 p<.001 The "More" button was useful 6.5 6.1 n.s. The display of summaries was useful 6.5 6.4 n.s.
Average Number of Uses of Feature per Task
Interface Features Category List significance Expansing / Collapsing Structure 0.78 0.48 p<.003 Viewing Summaries in Tooltips 2 99 4 60 p< 001
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
Viewing Summaries in Tooltips 2.99 4.60 p<.001 Viewing Web Pages 1.23 1.41 p<.053
20
Dumais, S, E. Cutrell, and H. Chen. Optimizing search by
RT for Category vs. List RT by Interface and Query Difficulty
showing results in context, CHI 2001
60 80 100 edian RT
100 120 140 160
dian RT 20 40 60 verage Me
20 40 60 80 100
verage Med
Easy (Top20) Hard (NotTop20)
Category List Interface Condition Av Category List Interface Condition Av
Category: 56 secs List: 85 secs p < .002 50% faster with Category interface Top20: 57 secs NotTop20: 98 secs
No reliable interaction between query difficulty and interface condition
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
50% faster with Category interface
interface condition Category interface is helpful for both easy and difficult queries
21
Marti Hearst, SUI 2009
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 22
Marti Hearst SUI 2009 Marti Hearst, SUI 2009
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 23
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 24
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
non-rewrite mic amps -> create taxi 53.2% insertions game codes -> video game codes 9.1% substitutions john wayne bust -> john wayne statue 8.7% deletions skateboarding pics → skateboarding 5.0% ll ti l t t l t t 7 0% spell correction real eastate
7.0% mixture huston's restaurant -> houston's 6.2% specialization jobs -> marine employment 4.6% p j p y generalization gm reabtes -> show me all the current auto rebates 3.2%
thansgiving -> dia de acconde gracias 2.4%
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 26 SIGIR
[Jones & Fain, 2003]
Platonic concept f
Correct Spelling Typos/spelling errors Typing quickly Distracted p g Forgot how to spell Distracted
27
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
correct correct error error correct
Character level: p(m|n) p(s|z) etc Query level: p(“sigir 2008”), p(“sigir iraq”)…
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
28
[Cucerzan and Brill 2004] [Cucerzan and Brill, 2004]
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
29
[Cucerzan and Brill 2004]
It ti l t f th i t [Cucerzan and Brill, 2004] – Iteratively transform the query into
likely queries. Use statistics from query logs to – Use statistics from query logs to determine likelihood.
misspelled misspelled
is, the more frequent it is, and correct > incorrect
– ditroitigers ->
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
g – detroit tigers
30
[Cucerzan and Brill 2004]
alternatives for each word in the query
[Cucerzan and Brill, 2004]
alternatives for each word in the query
– Stats on word unigrams, bigrams from logs Handles word concatenation and splitting – Handles word concatenation and splitting
to the input
U difi d Vit bi l ith – Use modified Viterbi algorithm
– No 2 adjacent in-vocabulary words can h i lt l change simultaneously – Short queries have further (unstated) restrictions In vocabulary words can’t be changed in
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
– In-vocabulary words can t be changed in the first round of iteration
31
[Cucerzan and Brill 2004]
– Damerau-Levenshtein edit distance:
[Cucerzan and Brill, 2004]
Damerau Levenshtein edit distance:
A rule that allows only one letter change can’t fix: – A rule that allows only one letter change can t fix:
– A too permissive rule makes too many errors:
log wood > dog food
– “a modified context-dependent weighted Damerau-Levenshtein edit f nction” function”
movement of letters
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 32
[Cucerzan and Brill 2004]
[Cucerzan and Brill, 2004]
– 2002 kawasaki ninja zx6e 2002 kawasaki ninja zx6r j j
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
33
[Cucerzan and Brill 2004]
[Cucerzan and Brill, 2004]
p p q g
– Must be sent by same user – Differ from one another by a small edit distance
P h i h f ifi i d
placement into the gold standard
– Paper doesn’t say how many total
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 34
[Cucerzan and Brill 2004]
[Cucerzan and Brill, 2004] – 73.1% accuracy – Disagreed with gold standard 99 times; 80 suggestions
gg (p g p g )
– 85.5% correct: speller correct or reasonable – Sent an unspecified subset of the errors to Google’s spellchecker
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
Its agreement with the gold standard was slightly lower
35
[Slides adapted from Jones et al 2006] [Slides adapted from Jones et al., 2006]
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 36
[Slides adapted from Jones et al 2006] [Slides adapted from Jones et al., 2006]
37
[Slides adapted from Jones et al 2006] [Slides adapted from Jones et al., 2006]
38
[Slides adapted from Jones et al 2006]
[Slides adapted from Jones et al., 2006]
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 39
[Slides adapted from Jones et al 2006]
Spell correction Name →profession
[Slides adapted from Jones et al., 2006]
g
di b h
Spell correction Delete terms, generalize Try second approach, using his address
edinburgh
Spell correction y seco d app oac , us g s add ess Try looking up addresses rephrase
edinburgh scotland uk
edinburgh
rephrase specialization Generalize to location
40 Switch to new topic
[Slides adapted from Jones et al 2006]
[Slides adapted from Jones et al., 2006]
non-rewrite mic amps -> create taxi 53.2% insertions game codes -> video game codes 9.1% substitutions john wayne bust -> john wayne statue 8.7% deletions skateboarding pics → skateboarding 5.0% spell correction real eastate
7.0% mixture huston's restaurant -> houston's 6.2% specialization jobs -> marine employment 4.6% generalization gm reabtes -> show me all the current auto rebates 3.2% 41
thansgiving -> dia de acconde gracias 2.4% [Jones & Fain SIGIR 2003]
[Slides adapted from Jones et al 2006]
– 5086 times in a sample
[Slides adapted from Jones et al., 2006]
5086 times in a sample
– 4826 times
– 2613 times
1677 i – 1677 times
– 428 times
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 42
[Slides adapted from Jones et al 2006]
[Slides adapted from Jones et al., 2006]
P(b i |b i ) P(b i ) P(britney spears|brittney spears) >> P(britney spears) 8% >> 0.01% Log likelihood ratio test (GLRT) gives
2
distributed score 43 About 90% of query pairs are related after filtering with LLR > 100
[Slides adapted from Jones et al 2006]
dog -> dogs 9185 pluralization
[Slides adapted from Jones et al., 2006]
dog -> cat 5942 both instances of 'pet‘ dog -> dog breeds 5567 generalization dog -> dog pictures 5292 more specific dog -> 80 2420 random junk in query processing dog -> pets 1719 generalization -- hypernym dog -> puppy 1553 specification -- hyponym d d i t 1416 ifi dog -> dog picture 1416 more specific dog -> animals 1363 generalization -- hypernym
44
dog -> pet 920 generalization -- hypernym
[Slides adapted from Jones et al 2006]
[Slides adapted from Jones et al., 2006]
castles in Edinburgh
g medieval castles near Glasgow
Represents initial query Represent rewrite query
45
[Slides adapted from Jones et al 2006]
1- Precise A near-certain match. E.g.: automotive insurance -
[Slides adapted from Jones et al., 2006]
Match automobile insurance; 2- Approximate A probable, but inexact match with user intent. E.g.: apple pp Match p , g pp music player - ipod shuffle 3 Marginal A distant but plausible match to a related topic E g : 3- Marginal Match A distant, but plausible match to a related topic. E.g.: glasses - contact lenses 4 Mismatch A clear mismatch 4- Mismatch A clear mismatch.
Call {1,2} Precise and {1,2,3} Broad
46
{ , } { , , }
[Slides adapted from Jones et al 2006]
[Slides adapted from Jones et al., 2006]
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 47
[Slides adapted from Jones et al 2006]
[Slides adapted from Jones et al., 2006]
2 ' 1p
segmentation
' 2 1 2 ' ' 1 2 1
' 2 ' 1 2 1 2 1
48
[Slides adapted from Jones et al 2006]
[Slides adapted from Jones et al., 2006]
catholic names, …}
Find a model to
... ' '
2 ' ' 1
< > − < > − Q Q score u u Q score
) ' ( | ' Q Q i Q Q P
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 49
) ' ( | ' Q Q score correct is Q Q P > − > −
[Slides adapted from Jones et al 2006]
[Slides adapted from Jones et al., 2006]
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 50
[Slides adapted from Jones et al 2006]
T i d 37 f f d l
[Slides adapted from Jones et al., 2006]
– Lexical features including
– Statistical features including
y
– Other
– Whole query = 0 q y – Replace one phrase = 1 – Replace two phrases = 2
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 51
[Slides adapted from Jones et al 2006] [Slides adapted from Jones et al., 2006]
wordsInCommon > 0 Yes No Class={1,2} prefixOverlap>0 Yes No Class={1,2} Class={3,4} Interpretation of the decision tree:
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 52
[Slides adapted from Jones et al 2006]
Regression: continuous output in [1,4]
[Slides adapted from Jones et al., 2006]
=
+ =
features f f f
w tercept in LMScore .
Classification: If(LMScore < T) then Good else Bad If(LMScore < T) then Good, else Bad For each T, we have a precision and a recall Evaluation: A i i / ll 100 ti 10 f ld lid ti
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 53
Average precision / recall on 100 times 10-fold cross validation
[Slides adapted from Jones et al 2006]
2 1 2 1
[Slides adapted from Jones et al., 2006]
2 1 2 1 2 1
2 1 q
– Prefer few edits Prefer few word changes – Prefer few word changes – Prefer whole-query or few phrase changes
54
[Slides adapted from Jones et al 2006
100% 2 levels DT
[Slides adapted from Jones et al., 2006
90% 95% bag of 100 DTs SVM Linear model 80% 85%
precision
70% 75% 60% 65% 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
55
recall
[Slides adapted from Jones et al 2006]
Initial Query Substitution Hand- Alg
[Slides adapted from Jones et al., 2006]
Initial Query Substitution Hand- label Alg. Prob anne klien watches anne klein watches 1 92% sea world san diego sea world san diego tickets 2 90% restaurants in washington dc restaurants in washington 2 89% nash county wilson county 3 66% as cou ty so cou ty 3 66% frank sinatra birth certificate elvis presley birth 4 17% 56
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 57
be more effective as a collaboration than as a solitary activity.
– Different perspectives, experiences, expertise, and vocabulary to the search
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 58
vocabulary to the search process.
UIST 2007
pp ( ) g g different individuals or let multiple people share a single user interface and cooperatively formulate queries
engine level to focus and enhance the team’s search and communication activities
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 59
, y, , , Algorithmic mediation for collaborative exploratory search, SIGIR 2008
Algorithmic mediation for collaborative exploratory search SIGIR 2008
f ld
Algorithmic mediation for collaborative exploratory search, SIGIR 2008
– Prospector: opens new fields for exploration into a data collection. – Miner: view and assess the documents returned by documents returned by Prospector.
– User Interface Layer
A i t f f P t t i i
– Regulator Layer
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
60
p g p g y pp p
– Weight Definition
Algorithmic mediation for collaborative exploratory search, SIGIR 2008
Miner Algorithm – Miner Algorithm
to the whole results collection (L).
y p q to assess their relevance. The queue is ordered by the following formula in which borda() is a function to measure the importance
the weights (wf and wr) will change over time. A lt th d t ith hi h ill h
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
61
chances to be evaluated by the Miner.
– Prospector focuses on coming up with new avenues for exploration
Algorithmic mediation for collaborative exploratory search, SIGIR 2008
p g p p into the collection. This is accomplished by real-time query term suggestion. – Each term in the whole document corpus has a score which is defined by the following formula. rlf() function means the number of y g () documents in Lk in which term t is found. – As Miner’s algorithm affect wf and wr ,the system will reorder term suggestions.
d h h d ll g terms associated with those documents will appear in term suggestion.
will be gradually replaced by others.
l d l l
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
62
value and relevance value.
more effective searching capability than simple merging of
Algorithmic mediation for collaborative exploratory search, SIGIR 2008
g p y p g g independently produced results
p y for one topic in two ways:
– simple merging and mediated collaboration search. Each experiment lasts 15 min.
24 i f TREC ll i i b d h l
number of relevant documents available for that topic.
– Topics that fell below the median (130) were deemed “sparse” (average of 60 relevant documents per topic) relevant documents per topic). – Topics above the median were “plentiful” (average of 332 relevant documents per topic). – Searching “sparse” topics is an exploratory search process, more difficult
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
63
Algorithmic mediation for collaborative exploratory search, SIGIR 2008
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
64
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 65
66 66
http://answers.yahoo.com/question/index;_ylt=3?qid=20071008115118AAh1HdO
67 67
68
E Agichtein C Castillo D Donato A Gionis
and G. Mishne, Finding High Quality Content in Social Media, in WSDM 2008
69
70
70 70
71
71 71
72
72 72
73 73
Question 1 Answer 1 User 3 User 1 User 3 Question 2 Answer 2 Answer 3 User 1 User 4 User 5 User 1 User 4 User 5 Question 2 Answer 4 Answer 3 User 2 User 6 User 5 Question 3 User 2 User 6 Answer 5 Answer 6 Question 3
=
M i
..
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
74
=
K j
..
Hub (asker) Authority (answerer)
Random forest classifier
75
75 75
76 76
40
30 35
15 20 25
5 10
1 2 3 4 5 6 7 8 9 10
Time to close a question (hours)
77 77
Yandong Liu Jiang Bian
78
Topic Questions Answers A per Q Satisfied Asker Time to close p p f rating by asker 2006 FIFA W ld C 1194 35,659 329.86 55.4% 2.63 47 i t World Cup minutes Mental Health 151 1159 7.68 70.9% 4.30 1.5 days Mathematics 651 2329 3.58 44.5% 4.48 33 minutes Diet & Fitness 450 2436 5.41 68.4% 4.30 1.5 days
79
80
Classifier With Text Without Text Selected Features Features ASP_SVM 0.69 0.72 0.62
ASP_C4.5 0.75 0.76 0.77
ASP_RandomForest 0.70 0.74 0.68 ASP_Boosting 0.67 0.67 0.67 ASP_NB 0.61 0.65 0.58
Best Human Perf 0.61 Baseline (random) 0.66 Baseline (random) 0.66
Human F1 is lower than the random baseline!
81
ASP is significantly more effective than humans
82
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia 84
– BS/BA (4-years) MS (2 years) PhD (4-6 years 5 year MLE) BS/BA (4 years) MS (2 years) PhD (4 6 years, 5 year MLE) – BS/BA (4-years) MS + PhD (4-7 years, 5 year MLE)
Deadline: Late Dec Mid January – Deadline: Late Dec Mid January – Standard Exam Scores:
TOEFL
– Application:
– Pavel Dmitriev page:
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
Pavel Dmitriev page: http://www.pavel-dmitriev.org/faq/question001_ru.xml
85
In collaboration with:
Abli i Aji
( gy)
Phil Wolff (Psychology)
Qi Guo (3rd year Phd) Ablimit Aji (2nd year PhD)
1st year graduate students: Julia Ki l D it L Qi li Li
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
86 Kiseleva, Dmitry Lagun, Qiaoling Liu, Wang Yu
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
87
88
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
88
88
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
89
Hearst, Marti, Search User Interfaces, 2009, Chapters 5, 6, 8, : “Presentation of Search Results”, “Query Reformulation” http://searchuserinterfaces.com/ Croft, Bruce, Metzler D, and Strohman, T, Search Engines: Information Retrieval in Practice, 2009, Chapters 6 and 10: “Queries and Interfaces”, “Social Search”, http://www.search-engines-book.com/
Dumais, S, E. Cutrell, and H. Chen. Optimizing search by showing results in context, CHI 2001
Cucerzan, S and Brill, E, Spelling Correction as an Iterative Process that Exploits the C ll ti K l d f W b U EMNLP 2004 Collective Knowledge of Web Users, EMNLP 2004 Jones, R., Rey, B., Madani, O., and Greiner, W. Generating query substitutions, WWW 2006 Pickens J G Golovchinsky C Shah P Qvarfordt and M Back Algorithmic Pickens, J, G. Golovchinsky, C. Shah, P. Qvarfordt, and M. Back., Algorithmic mediation for collaborative exploratory search, SIGIR 2008 Agichtein, E, Gabrilovich, E, and Zha, H, E. Agichtein, E. Gabrilovich, and H. Zha,
The Social Future of Web Search: Modeling Exploiting and Searching
Eugene Agichtein, RuSSIR 2009, September 11-15, Petrozavodsk, Russia
The Social Future of Web Search: Modeling, Exploiting, and Searching Collaboratively Generated Content, in IEEE Data Engineering Bulletin, 2009
90