Working at (and pushing) the boundaries of IR: how other fields can - - PowerPoint PPT Presentation

working at and pushing the boundaries of ir how other
SMART_READER_LITE
LIVE PREVIEW

Working at (and pushing) the boundaries of IR: how other fields can - - PowerPoint PPT Presentation

Working at (and pushing) the boundaries of IR: how other fields can influence your IR research. Dr. David Elsweiler Chair for Information Science Faculty of Language, Literature and Cultural Sciences david@elsweiler.co.uk Boundaries -


slide-1
SLIDE 1
  • Dr. David Elsweiler

Chair for Information Science Faculty of Language, Literature and Cultural Sciences david@elsweiler.co.uk

Working at (and pushing) the boundaries of IR: how other fields can influence your IR research.

slide-2
SLIDE 2

Boundaries - image

slide-3
SLIDE 3

Blurred boundaries image

slide-4
SLIDE 4

C

  • n

t e x t Wo r k

  • T

a s k

P l a n T r i p t

  • F

D I A 2 1 1

I n f

  • r

m a t i

  • n

S e e k i n g T a s k

E x p l

  • r

e p

  • s

s i b l e h

  • t

e l s “Hotels in Koblenz” Retrieval Model

Systems IR

Results

I n g w e r s e n a n d J ä r v e l i n 2 5

  • d

i f f e r e n t l e v e l s n e e d d i f f e r e n t a p p r

  • a

c h e s , d i f f e r e n t e v a l u a t i

  • n

m e t h

  • d

s

slide-5
SLIDE 5

Information Retrieval

HCI Information Science Statistics Information Seeking Mathematics L i n g u i s t i c s

P s y c h

  • l
  • g

y A f f e c t i v e C

  • m

p u t i n g E t h n

  • g

r a p h y S

  • c

i

  • l
  • g

y N e u r

  • s

c i e n c e Me d i c i n e N u t r i t i

  • n

L e i s u r e S t u d i e s Q u a n t u m Me c h a n i c s A r c h i t e c t u r e C i t y P l a n n i n g We b s c i e n c e D a t a m i n i n g E c

  • n
  • m

i c s

slide-6
SLIDE 6
  • Interesting problems
  • Very challenging problems
  • Standard IR methods are not

enough (need to combine, be inspired by other fields)

  • Examples from my own research

When you move to the boundaries:

slide-7
SLIDE 7

Personal Search “Stuff I've Seen”

[Dumais et al., 2003]

slide-8
SLIDE 8

C re a te d in fo rm a tio n

Things they created

slide-9
SLIDE 9

G iv e n in fo rm a tio n

Things they received

slide-10
SLIDE 10

re a d in fo rm a tio n

Things they've read

slide-11
SLIDE 11
slide-12
SLIDE 12

Q u e r y R e t r i e v a l Mo d e l Results I n t e r f a c e

Personal Search

slide-13
SLIDE 13

Personal Search

slide-14
SLIDE 14

Search

Find out where my hotel is for FDIA

Koblenz

Personal Search

slide-15
SLIDE 15

Q u e r y R e t r i e v a l Mo d e l Results I n t e r f a c e

G u i d e d b y U s e r R e c

  • l

l e c t i

  • n

s

Personal Search

slide-16
SLIDE 16

Memory is Important

Systems should take memory into account:

  • Support what people are likely to

remember / not remember

  • Help people remember more

There isn't much IR literature on memory!

slide-17
SLIDE 17

Cognitive Psychology

  • 130+ years of literature
  • Theories / models on (for starters):
  • Spatial recollection
  • Episodic recollection
  • Semantic recollection
  • Recollection for Texts
  • Cue-based recall
  • Experimental Methods
slide-18
SLIDE 18
slide-19
SLIDE 19

Importance of Evaluation

  • Lots of people have been building tools
  • Very few of these have actually been tested
  • Major problem in the field (and related fields)
  • Boardman 2004; Capra & Perez-Quinones 2006;

Cutrell et al. 2006; Elsweiler & Ruthven, 2007; Chernov et al., 2007; Elsweiler et al.,2011

  • Few evaluations because it is difficult
slide-20
SLIDE 20

#1 E v a lu a tin g in th e w ild ...

Dumais et al., 2003; Cutrell et al., 2006

slide-21
SLIDE 21

#2 E v a lu a tin g in th e la b ...

slide-22
SLIDE 22
  • Elsweiler and Ruthven, 2007

Task taxonomy for re-finding

  • 1. Split population into groups
  • 2. Perform investigatory studies

(diary studies, tours, interviews)

  • 3. Derive task pools for each group

Lab-based approaches

slide-23
SLIDE 23
  • Recollection for personal

information

  • Learn about task perception /

success

  • Learn about user behaviour
  • Evaluate system designs

This has been used for:

slide-24
SLIDE 24

What about systems IR experiments?

slide-25
SLIDE 25

Azzopardi et al., 2007; Kim & Croft, 2009 From: acm@sheridanprinting.com Subj: your registration Body: Dear Author, Thank you for the submission of "Understanding Re-finding behavior in Naturalistic Email Interaction Logs" to ...

Query Simulation

“Thank Understanding”

slide-26
SLIDE 26

Simulated Approaches

Ideal for testing algorithms Low cost Repeatable

+

  • Do they really accurately

represent user behaviour?

slide-27
SLIDE 27

How can we make simulated approaches better reflect real-life queries?

slide-28
SLIDE 28

Seed Simulations with User Study Behaviour

What do queries look like?

  • Length ● Field ● Named Entities ● Spelling Error
  • Advanced Operators

Do they change in different situations?

  • Different kinds of user ● Different kinds of Task
  • Different kinds of Collection

Do different retrieval models work better in different situations?

slide-29
SLIDE 29
  • Personal Search looks like standard

IR problem

  • Look deeper and we see it lies at

the boundaries

  • Creative solutions required
  • Inspiration from other fields
  • Psychology, Ethnography, HCI
slide-30
SLIDE 30

Casual-leisure Search

slide-31
SLIDE 31

L o e w e tv p ic tu reLoewe Project

slide-32
SLIDE 32

What do people need? What problems do they have? How do they use existing systems?

slide-33
SLIDE 33

Diary Study

  • 1 week during

Christmas holidays

  • 38 participants (19

male, 19 female)

  • Ages (10-72, avg. 39.5,

sd=17.4)

  • Mix of educational

levels, occupations and living arrangements

  • 381 recorded needs
slide-34
SLIDE 34
slide-35
SLIDE 35

Differences to our classical understanding of information needs

  • Not in response to a gap in knowledge
  • But in response to a mood or physical

state

  • To a need to be distracted
  • To having some free time
slide-36
SLIDE 36

Different emphasis on what is important for the user

  • The information is not always what is

important

  • Experience is always crucial
  • Success != finding something (specific)
  • It is the journey not the destination that

is key!

slide-37
SLIDE 37

Casual-leisure situations are important

  • Many participants described escaping

(monotonous tasks, stressful situations, boredom)

  • Health (mental and physical)
slide-38
SLIDE 38

Learning about search behaviour

I'm writing on a white board.

slide-39
SLIDE 39

Learning about search behaviour

I think Sir Tim likes my idea

slide-40
SLIDE 40

Learning about search behaviour

Harry Potter

slide-41
SLIDE 41
slide-42
SLIDE 42

Missing Knowledge Gap

slide-43
SLIDE 43

Experience over things found

slide-44
SLIDE 44

What does this mean for building systems?

slide-45
SLIDE 45

How do you build an IR system that deals with the query “Entertain Me”?

slide-46
SLIDE 46

What does this mean for evaluating systems?

slide-47
SLIDE 47
  • We need to better understand what

people want in various Casual- leisure situations

  • How they behave to get this
  • What we can do to provide

assistance

slide-48
SLIDE 48

Casual leisure information needs in mobile context

  • Long nights of music,

museums, science

  • Evenings of entertainment,

distributed over a city from 8pm – 3am

  • Android app to support

people find events of interest and the plan evenings / and routes to events

slide-49
SLIDE 49
  • Munich, May 2011
  • ~160 bands / artists performed at 100

locations around the city (8pm – 3am)

  • 20,000+ visitors with wide-ranging

demographics

  • We had over 500 downloads
  • We logged user interactions
slide-50
SLIDE 50
  • Queries? Genres?
  • Do they like to search for individual

events of interest or

  • Do they prefer to have routes

prepared for them? If so do they edit these afterwards?

  • We can learn a lot about how they

think and what they want / need

Learning about search behaviour

slide-51
SLIDE 51
  • How do they enjoy their evening?
  • How many events do they visit?
  • How long do they stay at the long night?
  • How much time do they spend travelling

between events?

  • What kind geographical coverage do users

have? Long nights of Science and Museums in October

How does search behaviour influence the evening

slide-52
SLIDE 52

Health and Behaviour Change

slide-53
SLIDE 53

In England nearly 1 in 4 adults, and over 1 in 10 children aged 2-10, are obese.

http://www.dh.gov.uk/en/Publichealth/Obesity/index.htm

slide-54
SLIDE 54

In England 2,338,813 are registered diabetic (5.4% of the population)

http://www.diabetes.org.uk/Professionals/Publications-reports-and-resources/Reports-statistics-and-case-studies/Reports/Diabetes- prevalence-2010/

slide-55
SLIDE 55

Picture of a doctor

How can IR / IA help?

slide-56
SLIDE 56

Self-efficacy is key to behavioural change

slide-57
SLIDE 57

Behavioural Change Circle Often People don't know the problem. Even if they do they don't know what to change.

http://www.ted.com/talks/lang/eng/thomas_goetz_it_s_time_to_redesign_medical_data.html

Thomas Goez's TED Talk, 2010

Collect data about an individual and his / her life Present it to them in a way they can relate to Give them appropriate tips

  • r information

Individual can act on these (or not)

slide-58
SLIDE 58

Collecting Personal Data

slide-59
SLIDE 59

How do you best present this information so that the individual can relate to it?

We have a PhD student working on this!

  • Show with temporal context?
  • Show with context of peers?
  • Give warning feedback?
slide-60
SLIDE 60

Blood Pressure BMI Activity Sleep

Too low Normal Too high Too low Normal Too high Too low Normal Too high Too low Normal Too high

slide-61
SLIDE 61
  • Collection of articles

from a German health magazine

  • Medical Professionals
  • Providing relevance

judgements based on sensor values

  • Providing extra

documents they specifically think are relevant

slide-62
SLIDE 62

Open questions and logistical issues

  • How can you measure whether

people have acted on information?

  • Data collection issues! How much

and how long do we need to collect data to detect behavioural change?

slide-63
SLIDE 63

Healthy food picture

Food Recommender

http://m4bu.dyndns.org/bachelor/register/

slide-64
SLIDE 64

Data Collection

  • Collecting recipe ratings
  • 5000 main meals and 500

breakfasts from 140k chefkoch.de

  • 136 users have rated 3422 ratings

after 4 weeks

  • Reasons behind their rating
slide-65
SLIDE 65

Our Idea

  • Analyse the factors behind

ratings

  • Device strategies to move people

towards rating healthier meals higher

slide-66
SLIDE 66

+ =

Healthier food with tomatoes

slide-67
SLIDE 67

=

Healthier meals that are quick to prepare Just ideas!

slide-68
SLIDE 68

Nutrition is incredibly complex! What is healthy? We have a nutritionist on the project Data to show behavioural change toward healthy meals

slide-69
SLIDE 69

Summary

  • At IR boundaries you find

interesting problems

  • Challenging problems
  • Creativity and inspiration from
  • ther fields can provide solutions
slide-70
SLIDE 70

Some tips for young researchers

  • 1. Read broadly
  • 2. Talk to anyone and everyone
  • 3. Build a network
  • 4. Find a niche
  • 5. Publish articles that count