P ROFILING THE SENSORIAL , EMOTIONAL , AND IRONIC LIFE OF A CITY - - PowerPoint PPT Presentation

p rofiling the sensorial emotional and
SMART_READER_LITE
LIVE PREVIEW

P ROFILING THE SENSORIAL , EMOTIONAL , AND IRONIC LIFE OF A CITY - - PowerPoint PPT Presentation

P ROFILING THE SENSORIAL , EMOTIONAL , AND IRONIC LIFE OF A CITY ROSSANO SCHIFANELLA PAN @ CLEF 2017 September 12st, 2017 Eureka Presentation Few words about me Eureka Presentation @UNITO TURIN DISTRIBUTED SYSTEMS RECOMMENDER SYSTEMS Eureka


slide-1
SLIDE 1

Eureka Presentation

ROSSANO SCHIFANELLA

PAN @ CLEF 2017

PROFILING THE SENSORIAL, EMOTIONAL, AND

IRONIC LIFE OF A CITY

September 12st, 2017

slide-2
SLIDE 2

Eureka Presentation

Few words about me

slide-3
SLIDE 3

Eureka Presentation

DISTRIBUTED SYSTEMS RECOMMENDER SYSTEMS

TURIN

@UNITO

slide-4
SLIDE 4

Eureka Presentation

NETWORK SCIENCE HUMAN COMPUTING CROWDSOURCING

BLOOMINGTON

@IU

slide-5
SLIDE 5

Eureka Presentation

LONDON BARCELONA

URBAN COMPUTING MACHINE LEARNING BEHAVIOURAL STUDIES COMPUTATIONAL *

NEW YORK @Yahoo

slide-6
SLIDE 6

Eureka Presentation

CAMBRIDGE

URBAN COMPUTING COMPUTATIONAL SOCIAL SCIENCE

@Nokia Bell Labs

slide-7
SLIDE 7

Eureka Presentation

WORLD URBANIZATION PROSPECTS: THE 2014 REVISION @UNITED NATIONS

HUMANITY IS URBAN

30% 54% 66%

1950 2014 2050

slide-8
SLIDE 8

Eureka Presentation

INFORMATICS ARE PERVASIVE

slide-9
SLIDE 9

Eureka Presentation

SHAKESPEARE, CORIOLANUS

“ WHAT IS A CITY BUT PEOPLE? ”

slide-10
SLIDE 10

Eureka Presentation

EFFICIENT.SUSTAINABLE.SMART

SOCIAL.HEALTHY.HAPPY

+

slide-11
SLIDE 11

Eureka Presentation

slide-12
SLIDE 12

HOW DIGITAL DATA CAN BE USED TO

  • 1. STUDY URBAN PHENOMENA AT SCALE
  • 2. PROFILING
slide-13
SLIDE 13

Eureka Presentation

slide-14
SLIDE 14

coordinates

WHERE WHEN

timestamp

WHAT

text+visual+audio

WHO

author

WHY

intent, context

(lon, lat, t)

TREE PEOPLE GRASS STREET TRASH BIN STREET LIGHT

Wonderful day at the park #nature

slide-15
SLIDE 15

Eureka Presentation

Sensing

slide-16
SLIDE 16

Eureka Presentation

SMELLY MAPS

How does a city smell?

CHATTY MAPS

How does a city sound?

HAPPY MAPS

How visually pleasant is a city?

1 2 3

slide-17
SLIDE 17

HAPPY MAPS

GET THE SHORT AND

PLEASANT ROUTE

HyperText 2014

slide-18
SLIDE 18

A B

SHORTEST SHORT and PLEASANT

slide-19
SLIDE 19

Eureka Presentation

HOW?

COLLECT URBAN PERCEPTIONS

slide-20
SLIDE 20

Eureka Presentation

HTTP://URBANGEMS.ORG/

slide-21
SLIDE 21

Eureka Presentation

HOW?

COLLECT URBAN PERCEPTIONS GENERATE EMOTIONALLY-AWARE

ROUTES

slide-22
SLIDE 22

Eureka Presentation

HOW?

COLLECT URBAN PERCEPTIONS GENERATE EMOTIONALLY-AWARE

ROUTES

EVALUATE

  • Path from Euston Square and

Tate Modern

  • 3 situations (happy, quiet,

beauty scenarios)

  • 4 paths to vote on a Likert

scale (paths are unlabeled)

SURVEY IN LONDON

slide-23
SLIDE 23

Eureka Presentation

HOW?

COLLECT URBAN PERCEPTIONS GENERATE EMOTIONALLY-AWARE

ROUTES

EVALUATE MODEL AESTHETICS WITH

SOCIAL MEDIA DATA

  • For each street segment we extract:
  • Number of pictures (density), number
  • f views, of favorites, of comments,

and tags

  • Tags (LIWC dictionary, 72 categories)
  • Extract features that are significantly

correlated with beauty scores

  • Example: density, ’posemo’, ‘negemo’,

‘swear’, ‘anx’ (anxiety), ‘sad’, and ‘anger’ LIWC categories

  • Build a model to predict beauty

METHOD

7M geolocated photos in London

DATA

slide-24
SLIDE 24

Eureka Presentation

24

slide-25
SLIDE 25

HOW DOES A CITY

SMELL?

SMELLY MAPS

ICWSM 2015

slide-26
SLIDE 26

Eureka Presentation 26

Humans discriminates millions of odors

Science 2014

slide-27
SLIDE 27

Eureka Presentation

Yet, city planning can discriminate

  • nly a few bad odors

Why this negative perspective?

slide-28
SLIDE 28

Eureka Presentation 28

slide-29
SLIDE 29

Eureka Presentation 29

Smell Walks

Amsterdam, Pamplona, Glasgow, Edinburgh, Newport, Paris, New York, Singapore

slide-30
SLIDE 30

Eureka Presentation 30

PHOTOS

436K

TWEETS

1.7M

PHOTOS

17M

DATA

London + Barcelona

slide-31
SLIDE 31

DOG CAT

30

BIRD

10 5

CO-OCCURENCE NETWORK

slide-32
SLIDE 32
slide-33
SLIDE 33

ANIMALS EMISSIONS FOOD NATURE

EMERGENCE OF CLUSTERS

slide-34
SLIDE 34

Eureka Presentation

Urban Smellscape Aroma Wheel

slide-35
SLIDE 35
slide-36
SLIDE 36

Eureka Presentation 36

DEMO

GOODCITYLIFE.ORG

slide-37
SLIDE 37

Eureka Presentation

HOW DOES THE URBAN SMELLSCAPE CHANGE THROUGH TIME AND SPACE?

ICWSM 2016

slide-38
SLIDE 38

CHATTY MAPS

HOW DOES A CITY

SOUND?

RSOS 2016

slide-39
SLIDE 39

Eureka Presentation

Urban Soundscape Wheel

slide-40
SLIDE 40

Eureka Presentation

VALIDATION

Presence of nature, food, etc. tags OPEN STREET MAPS Air quality indicators CITY OFFICIALS

slide-41
SLIDE 41

SONIC OLFACTORY VISUAL URBAN FABRIC

slide-42
SLIDE 42

Eureka Presentation

Socio-economic Indicators

London New York

slide-43
SLIDE 43

IMD: Index of Multiple Deprivation

  • Income deprivation
  • Employment deprivation
  • Health deprivation and disability
  • Education, skills and training deprivation
  • Barriers to housing and services
  • Living environment deprivation
  • Crime

London

slide-44
SLIDE 44

Eureka Presentation

LSOA (Lower Layer Super Output Area)

slide-45
SLIDE 45

Smell (London)

IMD nature animals emissions waste food cleaning industry smoke 0.24*** 0.16***

  • 0.16***
  • 0.26***
  • 0.1***
  • 0.19***
  • 0.2***
  • 0.15***
slide-46
SLIDE 46

Smell (London)

LIVING ENVIRONMENT nature animals emissions waste food cleaning industry smoke synthetic 0.29*** 0.17***

  • 0.23***
  • 0.35***
  • 0.4***
  • 0.35***
  • 0.24***
  • 0.3***
  • 0.15***
slide-47
SLIDE 47

Smell (London)

LIVING ENVIRONMENT INCOME animals nature emissions waste cleaning industry 0.12*** 0.2***

  • 0.1***
  • 0.18***
  • 0.12***
  • 0.15***
slide-48
SLIDE 48

Smell (London)

LIVING ENVIRONMENT INCOME HEALTH animals nature waste food cleaning industry smoke 0.12*** 0.21***

  • 0.23***
  • 0.14***
  • 0.17***
  • 0.18***
  • 0.12***
slide-49
SLIDE 49

Smell (London)

animals waste cleaning 0.1***

  • 0.19***
  • 0.14***

LIVING ENVIRONMENT INCOME HEALTH CRIME

slide-50
SLIDE 50

Smell (London)

animals nature emissions waste industry smoke 0.14*** 0.17***

  • 0.15***
  • 0.19***
  • 0.16***
  • 0.12***

LIVING ENVIRONMENT INCOME HEALTH CRIME HOUSING

slide-51
SLIDE 51

Sound (London)

IMD human nature mechanical motorised music 0.11*** 0.11***

  • 0.14***
  • 0.17***
  • 0.17***
slide-52
SLIDE 52

Sound (London)

LIVING ENVIRONMENT nature mechanical motorized music indoor 0.12***

  • 0.27***
  • 0.22***
  • 0.36***
  • 0.31***
slide-53
SLIDE 53

Crime 2008-2016 (London)

nature animals emissions waste metro cleaning industry smoke food synthetic

  • 0.38
  • 0.24

0.43 0.35 0.35 0.32 0.3 0.27 0.19 0.19

slide-54
SLIDE 54

Crime 2008-2016 (London)

nature humans motorised mechanical music

  • 0.21
  • 0.16

0.36 0.17 0.3

slide-55
SLIDE 55

Census Bureau ACS Economic Profile

  • Employment status
  • Commuting to work
  • Occupation
  • Industry
  • Class of worker
  • Health Insurance coverage
  • Poverty level

New York (NTA level)

slide-56
SLIDE 56

Eureka Presentation

MEDIAN HOUSEHOLD INCOME (DOLLARS)

LOW HIGH

slide-57
SLIDE 57

Smell (NYC)

MEDIAN HOUSEHOLD INCOME MEDIAN NON-FAMILY INCOME nature food metro +tobacco 0.38*** 0.16***

  • 0.43***

0.22**

slide-58
SLIDE 58

Income per household (NYC)

emissions nature waste, industry, synthetic, smoke metro <10K 0.16

  • 0.39

0.45 10K-15K 0.15

  • 0.36

0.44 15K-25K

  • 0.38

0.15 (waste) 0.44 25K-35K

  • 0.29

0.31 35K-50K

  • 0.2
  • 0.25 (smoke)

0.26 50K-75K

  • 0.2

0.21

  • 0.27 (industry), -0.25 (smoke)
  • 0.15

75K-100K

  • 0.21

0.41

  • 0.52

100K-150K 0.41

  • 0.54

150K-200K 0.36 0.15 (smoke)

  • 0.45

>200K 0.3 0.25 (smoke)

  • 0.35
slide-59
SLIDE 59

Eureka Presentation

LOW HIGH

COMMUTING TO WORK - CAR

slide-60
SLIDE 60

Eureka Presentation

LOW HIGH

COMMUTING TO WORK - PUBLIC TRANSPORT

slide-61
SLIDE 61

Commuting (NYC)

CAR PUBLIC TRANSPORTATION WALKED WORKED AT HOME nature (0.3) waste (-0.15) cleaning (-0.25) emissions (-0.2) food (-0.2) metro (-0.49) synthetic (-0.24) smoke (-0.38) nature (-0.32) waste (0.18) cleaning (0.24) industry (0.17) metro (0.54) smoke (0.22) nature (-0.25) food (0.15) industry (0.27) metro (0.41) synthetic (0.32) smoke (0.28) cleaning (0.16) emissions (0.17) industry (0.2) metro (0.15) synthetic (0.32) smoke (0.36)

slide-62
SLIDE 62

Sound (NYC)

MEDIAN HOUSEHOLD INCOME MEDIAN NON-FAMILY INCOME nature motorized +music 0.3***

  • 0.43***

0.17**

slide-63
SLIDE 63

Income per household (NYC)

human nature motorized <10K 0.3

  • 0.29

0.24 10K-15K 0.29

  • 0.27

0.29 15K-25K 0.29

  • 0.3

0.34 25K-35K 0.27

  • 0.26

0.29 35K-50K 0.3

  • 0.19

0.29 50K-75K 0.28

  • 0.22
  • 75K-100K
  • 0.33

0.36

  • 0.28

100K-150K

  • 0.18

0.37

  • 0.36

>200K 0.17 0.24

  • 0.37
slide-64
SLIDE 64

Commuting (NYC)

CAR PUBLIC TRANSPORTATION WALKED WORKED AT HOME nature (0.37) human (-0.45) mechanical (-0.17) music (-0.5) nature (-0.36) human (0.28) mechanical (0.23) music (0.28) nature (-0.3) human (0.39) music (0.41) nature (-0.25) motorised (-0.17) human (0.28) indoor (0.29) music (0.35)

slide-65
SLIDE 65

Eureka Presentation

Walkability+Activities+Ambiance

slide-66
SLIDE 66

Eureka Presentation

IS WALKABILITY QUANTIFIABLE?

WWW 2015

slide-67
SLIDE 67

Eureka Presentation

“The General Theory of Walkability explains how, to be favored, a walk has to satisfy four main conditions: it must be useful, safe, comfortable, and interesting. Each of these qualities is essential and none alone is sufficient.”

Public space surrendered to cars

Jeff Speck

WALKABLE CITY

slide-68
SLIDE 68

Questions (safety)

  • Can safe streets be identified by night activity?
  • Safe streets are photographed not only during the day but also at

night, while unsafe ones mostly during the day

  • Can safe streets be identified by activity segmented by

gender or age?

  • Safe streets are predominantly visited by a male population

(r = 0.58)

  • Safe streets are predominantly visited by an adult population

(r = 0.32)

slide-69
SLIDE 69

Questions (walkability)

  • Can walkable streets be identified by the presence
  • f specific types of places?
  • Can walkability be predicted?
  • yes!
slide-70
SLIDE 70

Eureka Presentation

Activities

slide-71
SLIDE 71

Profiling urban activities

  • Identifying activity words
  • From Flickr
  • From web documents
  • Expansion of activity words
  • Focus on private activities
  • indoor vision tag
  • Clustering of activity words in a hierarchical taxonomy
slide-72
SLIDE 72

Eureka Presentation

Urban Activities Wheel

slide-73
SLIDE 73

Results (some)

+work&study +protest +self +show +education(NYC) +housing (L)

  • education (L), -income (L/NYC)
  • income
  • crime (L), -education, -employment,
  • income (NYC)
slide-74
SLIDE 74

Diversity

London New York economic development is associated with activity diversity

slide-75
SLIDE 75

Eureka Presentation

slide-76
SLIDE 76

Eureka Presentation

slide-77
SLIDE 77

Limitations

  • Not exhaustive list of activities
  • Population-demographic bias
  • Self-selection bias
  • well-to-do areas might be over-represented
  • Results do not speak to causality
slide-78
SLIDE 78

Eureka Presentation

AMBIENCE

Can the ambience of a place be predicted from pictures?

slide-79
SLIDE 79

Eureka Presentation

Urban Ambiance Wheel

slide-80
SLIDE 80

SONIC OLFACTORY VISUAL URBAN FABRIC AMBIANCE ACTIVITIES WALKABILITY

slide-81
SLIDE 81

Eureka Presentation

Emotions

slide-82
SLIDE 82

Eureka Presentation

82

slide-83
SLIDE 83

Eureka Presentation 83

EMOTIONS

To model sentiment we adopt the EmoLex lexicon that follows the 8 primary emotions from Plutchik’s psychoevolutionary theory.

slide-84
SLIDE 84

Eureka Presentation

EMISSIONS WASTE

CORRELATION BETWEEN

EMOTIONS AND SMELLS

FOOD NATURE

slide-85
SLIDE 85

Eureka Presentation

MUSIC HUMAN TRAFFIC

CORRELATION BETWEEN

EMOTIONS AND SOUNDS

slide-86
SLIDE 86

Eureka Presentation

MUSIC

Music triggers both joy and sadness

slide-87
SLIDE 87

Example (London)

sadness negative joy

  • income, -employment, -health, -crime, -housing,
  • living environment
  • income, -health, -education, -employment

+education, +housing

slide-88
SLIDE 88

SONIC OLFACTORY VISUAL URBAN FABRIC AMBIANCE ACTIVITIES WALKABILITY EMOTIONS

slide-89
SLIDE 89

Eureka Presentation

Ongoing work

slide-90
SLIDE 90

SONIC OLFACTORY VISUAL URBAN FABRIC AMBIANCE ACTIVITIES WALKABILITY EMOTIONS SARCASM EMOJIS

slide-91
SLIDE 91

Eureka Presentation

SARCASM

slide-92
SLIDE 92

Eureka Presentation

LITERAL INTENDED ≠

slide-93
SLIDE 93

Some previous work

  • Lexical and linguistic markers
  • Context
  • hashtags, emojis
  • previous posts
  • author profile, propensity to sarcastic utterances
slide-94
SLIDE 94

Eureka Presentation

SOCIAL MEDIA IS MULTIMODAL

slide-95
SLIDE 95

Eureka Presentation

METADATA VISUALS TEXT

slide-96
SLIDE 96

Eureka Presentation

Great day today

slide-97
SLIDE 97

Eureka Presentation

Great day today

slide-98
SLIDE 98

Eureka Presentation

Text+Image

Image as a contextual clue

slide-99
SLIDE 99

Eureka Presentation

POSTS CONTAINING #SARCASM OR #SARCASTIC

DATA

517K

63K

20K

99% 40% 7.56%

TEXT+IMAGE TEXT+IMAGE TEXT+IMAGE

slide-100
SLIDE 100

Eureka Presentation

CHARACTERISE THE ROLE OF IMAGES

Study of the interplay between textual and visual components

1

slide-101
SLIDE 101

Eureka Presentation

COLLECT A GROUND TRUTH FOR SARCASM

  • A. Evaluate the impact of visuals as a source for context
  • B. Identify sarcastic posts with a high level of agreement

CHARACTERISE THE ROLE OF IMAGES

Study of the interplay between textual and visual components

1 2

slide-102
SLIDE 102

Eureka Presentation

ASK THE CROWD!

1K POSTS 5 JUDGEMENTS

slide-103
SLIDE 103

Eureka Presentation

SECOND EXPERIMENT For all the posts that are judged not sarcastic in the previous step, show the text and the image FIRST EXPERIMENT Show only the textual component of a post

slide-104
SLIDE 104

Eureka Presentation

Text+Image 37,4%

Text Only 37,8%

Not Sarcastic 24,8%

Text+Image 44,5%

Text Only 23,6%

Not Sarcastic 31,9%

\

slide-105
SLIDE 105

Eureka Presentation

COLLECT A GROUND TRUTH FOR SARCASM

  • A. Evaluate the impact of visuals as a source for context
  • B. Identify sarcastic posts with a high level of agreement

DETECT SARCASM

SVM Fusion+Deep learning fusion approaches

CHARACTERISE THE ROLE OF IMAGES

Study of the interplay between textual and visual components

1 2 3

slide-106
SLIDE 106

How can we detect sarcasm in multimodal posts?

  • Different fusion approaches
  • SVM based
  • Deep learning
  • Open questions:
  • Does the use of figurative language change according to socio-

demographic variables?

  • Does the use of figurative language change in different areas of the city?
slide-107
SLIDE 107

Eureka Presentation

107

slide-108
SLIDE 108

Eureka Presentation

Questions?

@rschifan http://www.di.unito.it/~schifane schifane@di.unito.it