The Wisdom of Crowds:
Network effects, and the Importance of Experts
Aris Anagnostopoulos Sapienza University of Rome
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
The Wisdom of Crowds: Network effects, and the Importance of - - PowerPoint PPT Presentation
The Wisdom of Crowds: Network effects, and the Importance of Experts Aris Anagnostopoulos Sapienza University of Rome Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015 Online collaboration systems
Aris Anagnostopoulos Sapienza University of Rome
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Systems creating knowledge by massive online collaboration:
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
What does the ox weigh? (1198 pounds)
At a 1906 country fair in Plymouth, UK, Sir Francis Galton made an experiment, asking people to estimate the weight of a slaughtered ox. He asked 800 participants. The answers’ median was 1207 pounds (1% error)
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
The premise of the wisdom of crowds is that averaging the
answers. Examples and applications:
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
We will look at three dimensions of the problem:
spreading of (mis)information
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
We will look at three dimensions of the problem:
spreading of (mis)information
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Main requirement: Independence of opinions and diversity What happens when we talk and influence each other? Answer: Often bad things – Think about democracy:
terrible governments
– GroupThink – Spread of conspiracy theories We want to study the network effect on the wisdom of crowds in a natural setting
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Instructions: Phase 1:
Phase 2
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
We can use RFID tags to track sustained face-to-face proximity among people.
RFID Reader RFID Tag
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
550
I think… Bla bla bla… I want a steak! Trust me…
A typical scenario…
Each participant wears an RFID tag
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Innate/Learnt Ability (Class 1)
Knowledge and Reasoning (Class 2) Prediction (Class 3)
the Trevi fountain in 2012?
the first round (3 games each) of the 2014 Mundial? Brazil, Spain, Greece, Italy, France, Argentina, Germany, Russia (asked before the mundial… )
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Feb 2013 (69 attendees)
May 2013 (37 attendees)
May 2014 (60 attendees)
May 2014 (25 attendees)
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
An interaction graph 𝑯 = 𝑾, 𝑭 represents the interactions between the people.
node edge
E V
(interaction)
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Priverno fair
Undirected graph Nodes: 60 Edges: 128 Density: 0.072 Network Diameter: 9 Communities: 15
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Priverno fair (the others are similar): Normalized true value Average in 1st round Average in 2nd round
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Priverno fair (the others are similar):
0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 Q1 Q2 Q3 Q4 Round 1 Round 2 Normalized standard deviation (std)
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Having all these data we want to design models for
Why?
Hard: different people, lots of noise, missing info
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
DeGroot model:
𝐵′(𝑣) = 𝐵 𝑣 + 𝐵 𝑤1 + 𝐵 𝑤2 + 𝐵 𝑤3 + 𝐵(𝑤4) 1 + 4 𝐵′(𝑣) = 𝛽 𝐵 𝑣 + 𝐵 𝑤1 + 𝐵 𝑤2 + 𝐵 𝑤3 + 𝐵(𝑤4) 𝛽 + 4
Generalized DeGroot model: But how can we explain the improvement?
𝐵(𝑣): answer of u at R1 𝐵′(𝑣): answer of u at R2 u v1 v2 v3 v4
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
where interaction was imposed
harm?
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
We will look at three dimensions of the problem:
spreading of (mis)information
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Numerous examples where large part of the population believes false info:
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Posts from 79 italian facebook group pages:
Crawled the network of likers and found their connections:
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
180K likes 26K shares
We have 1.2M users who have liked science/conspiracy posts. Are they consistent with the content they like? For each user 𝑣 define user polarization 𝝇(𝒗): 𝜍 𝑣 = 𝒅𝒑𝒐𝒕𝒒 𝒅𝒑𝒐𝒕𝒒 + 𝒕𝒅𝒋 𝒅𝒑𝒐𝒕𝒒: # conspiracy posts 𝑣 liked 𝒕𝒅𝒋: # science posts 𝑣 liked
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
We have 1.2M users who have liked science/conspiracy posts. Are they consistent with the content they like? For each user 𝑣 define user polarization 𝝇(𝒗): 𝜍 𝑣 = 𝒅𝒑𝒐𝒕𝒒 𝒅𝒑𝒐𝒕𝒒 + 𝒕𝒅𝒋 𝒅𝒑𝒐𝒕𝒒: # conspiracy posts 𝑣 liked 𝒕𝒅𝒋: # science posts 𝑣 liked
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
We can select two subsets of users: Science users: {𝑣: 𝜍 𝑣 ≤ 5%} Conspiracy users: {𝑣: 𝜍 𝑣 ≥ 95%}
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
We can select two subsets of users: Science users: {𝑣: 𝜍 𝑣 ≤ 5%} Conspiracy users: {𝑣: 𝜍 𝑣 ≥ 95%}
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Post statistics Post lifetime
Science and conspiracy posts and users show very similar behavior:
User lifetime User subgraph statistics
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Homophily: tendency of individuals to associate with similar others
𝜔 𝑣 # 𝑚𝑗𝑙𝑓𝑡: Normalized liking activity of 𝑣
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
We can predict the ratio
the same polarization with 𝑣 as a function of 𝑣’s #likes:
𝜄 𝑣 = #𝑚𝑗𝑙𝑓𝑡: Liking activity of 𝑣
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
How does the average user of a viral post look?
deg (𝑣): # friends of node 𝑣 𝜔 𝑣 # 𝑚𝑗𝑙𝑓𝑡: Normalized liking activity of 𝑣
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
We also downloaded info about 4.7K troll posts: posts with clearly useless or wrong information:
“The Italian Senate voted and accepted (257 in favor and 165 abstentions) a law proposed by Senator Cirenga aimed at funding with 134 billion Euro the policy makers to find a job in case of defeat in the political competition.”
36K shares 1.1K likes
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
We also downloaded info about 4.7K troll posts: posts with clearly useless or wrong information:
“The Italian Senate voted and accepted (257 in favor and 165 abstentions) a law proposed by Senator Cirenga aimed at funding with 134 billion Euro the policy makers to find a job in case of defeat in the political competition.”
36K shares 1.1K likes
Italian senate!
exist!
French GDP!
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
deg (𝑣): # friends of node 𝑣 𝜔 𝑣 # 𝑚𝑗𝑙𝑓𝑡: Normalized liking activity of 𝑣
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
info
ambiguity and arrive at definite conclusions (sometimes irrationally)
believe, and remember info in a way that is aligned with ones beliefs
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
We will look at three dimensions of the problem:
spreading of (mis)information
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Wisdom of crowds and wisdom of experts:
trusted
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Systems creating knowledge by massive online collaboration:
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Systems creating knowledge by massive online collaboration:
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Crowdsourcing: is the process of obtaining information by using contributions from a large group of people.
There are tasks hard for computers but easy for humans (human tasks):
Colosseum)
Crowdsourcing platforms: Online services that allow, through APIs, to get answers from humans at a low cost
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Requester Human Intelligent Tasks (HITs) Workers
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
[ Ma
[relative distance], #questions [relative distance], #questions
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
[relative distance], #questions [relative distance], #questions
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
– If 𝑓𝑗 −𝑓
𝑘 ≥ 𝜄 worker returns correct answer
– If 𝑓𝑗 −𝑓
𝑘 < 𝜄 worker returns arbitrary answer
Note that if the difference is < 𝜄 no matter how many workers we ask, we cannot obtain a more accurate response
𝑓7 𝑓6 𝑓5 𝑓4 𝑓8 𝑓1 𝑓2 𝑓3 𝜄
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Usually workers are untrained An expert is a more capable worker:
Experts have started being offered by crowdsourcing systems
When should we use regular workers and when experts? Think of ‘Who wants to be a millionaire”
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
– If 𝑓𝑗 −𝑓
𝑘 ≥ 𝜄 worker returns correct answer
– If 𝑓𝑗 −𝑓
𝑘 < 𝜄 worker returns arbitrary answer
𝑓7 𝑓6 𝑓5 𝑓4 𝑓8 𝑓1 𝑓2 𝑓3 𝜄
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
– If 𝑓𝑗 −𝑓
𝑘 ≥ 𝜄 worker returns correct answer
– If 𝑓𝑗 −𝑓
𝑘 < 𝜄 worker returns arbitrary answer
Experts have a lower error threshold 𝜄𝐹
𝑓7 𝑓6 𝑓5 𝑓4 𝑓8 𝑓1 𝑓2 𝑓3 𝜄 𝜄𝐹
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
A model allows us to formalize and analyze the problem
the max as possible
possible Feel free to ask for details after the talk.
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Tested on Crowdsourcing platform with 3 datasets:
Goal: find more dots
Goal: find most expensive
Goal: find most relevant result for a given query
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
In all our 3 sets of experiments: The combination of nonexpert and expert users finds the best results with a low cost.
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Understand better when we have wisdom or ignorance of the crowds
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015
Questions, comments, etc.: http://aris.me
Aris Anagnostopoulos The Wisdom of Crowds School for Advanced Sciences of Luchon, 2015