The Multidimensional Wisdom of Crowds
Welinder P., Branson S., Belongie S., Perona P.
Experiment Presentation, [CS395T] Visual Recognition, Fall 2012
Presented by: Niveda Krishnamoorthy
Slides from http://videolectures.net/nips2010_welinder_mwc/
[Figure: annotators plotted by rate of correct detection (fraction of true positives) vs. rate of correct rejection (fraction of true negatives), with the 50%-error line marked. Annotator archetypes shown: bots, competent, pessimistic, optimistic, and adversarial.]
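The annotator archetypes in the figure above can be illustrated with a small simulation. This is a sketch, not the authors' model: the archetypes are parameterised by two assumed probabilities (correct detection on positives, correct rejection on negatives), and the specific values below are hypothetical.

```python
import random

def simulate_annotator(p_detect, p_reject, labels, seed=0):
    """Simulate an annotator with probability p_detect of correctly
    labelling a positive image and p_reject of correctly rejecting
    a negative one."""
    rng = random.Random(seed)
    answers = []
    for y in labels:
        if y == 1:
            answers.append(1 if rng.random() < p_detect else 0)
        else:
            answers.append(0 if rng.random() < p_reject else 1)
    return answers

def empirical_rates(labels, answers):
    """Fraction of true positives and true negatives actually achieved."""
    pos = [a for y, a in zip(labels, answers) if y == 1]
    neg = [a for y, a in zip(labels, answers) if y == 0]
    return sum(pos) / len(pos), (len(neg) - sum(neg)) / len(neg)

labels = [1] * 500 + [0] * 500
# Hypothetical parameter values for the archetypes on the slide:
archetypes = {
    "bot":         (0.5, 0.5),    # coin-flipping, 50% error
    "competent":   (0.9, 0.9),    # high on both axes
    "pessimistic": (0.4, 0.95),   # rarely answers "present"
    "optimistic":  (0.95, 0.4),   # answers "present" too often
    "adversarial": (0.1, 0.1),    # systematically wrong
}
for name, (pd, pr) in archetypes.items():
    tp, tn = empirical_rates(labels, simulate_annotator(pd, pr, labels))
    print(f"{name:12s} TP={tp:.2f} TN={tn:.2f}")
```

Each annotator then lands at a point in the same 2-D space as the figure, which is exactly the space the paper's model infers from data.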
Complex Images
Vision-based measure: predicted time* to label an image, as a proxy for image complexity.
*What's It Going to Cost You?: Predicting Effort vs. Informativeness for Multi-Label Image Annotations. S. Vijayanarasimhan and K. Grauman. CVPR 2009.
Approach:
- Extract 2804-d feature vectors for the MSRC dataset
- Train a regressor on the top 200 features selected using ReliefF
- Predict the time to label images in the bluebirds dataset
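The pipeline above can be sketched as follows. This is not the presenter's code: the data is a synthetic stand-in for the 2804-d MSRC features and labelling times, the ReliefF here is a simplified classification variant run on a binarised (slow/fast) time target rather than full RReliefF, and the regressor is plain least squares.

```python
import numpy as np

def relieff_scores(X, y, n_neighbors=5):
    """Simplified ReliefF: a feature scores high when it differs little
    between same-class neighbours (hits) and a lot between
    different-class neighbours (misses). Assumes binary y in {0, 1}."""
    n, d = X.shape
    w = np.zeros(d)
    for i in range(n):
        diff = np.abs(X - X[i])          # per-feature distance to every sample
        dist = diff.sum(axis=1)
        dist[i] = np.inf                 # exclude the sample itself
        hits = np.where(y == y[i])[0]
        hits = hits[hits != i]
        misses = np.where(y != y[i])[0]
        nh = hits[np.argsort(dist[hits])[:n_neighbors]]
        nm = misses[np.argsort(dist[misses])[:n_neighbors]]
        w += diff[nm].mean(axis=0) - diff[nh].mean(axis=0)
    return w / n

# Synthetic stand-in data: labelling time depends on the first two features.
rng = np.random.default_rng(0)
X = rng.random((200, 50))
t = 5.0 * X[:, 0] + 4.0 * X[:, 1] + 0.1 * rng.standard_normal(200)  # seconds
y = (t > np.median(t)).astype(int)       # binarise time into slow/fast

scores = relieff_scores(X, y)
top = np.argsort(scores)[::-1][:10]      # keep the top-scoring features
print("top features:", top[:5])

# Least-squares regressor on the selected features predicts labelling time.
A = np.c_[X[:, top], np.ones(len(X))]
coef, *_ = np.linalg.lstsq(A, t, rcond=None)
pred = A @ coef
```

The slide's version keeps 200 of 2804 features; the sketch keeps 10 of 50 to stay small, but the selection-then-regression structure is the same.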
[Figure: learned image complexity vs. predicted time to label. Images at the ends of the complexity range take less time to label; images towards the center take longer.]
Recreated in MATLAB: the size of each point is proportional to the predicted time needed to label it.
[Scatter plots: images along the cluster boundary take longer to label; similarly, images in the wrong clusters take longer to label; images at the center can also take longer to label.]
Is vision-based image complexity a good indicator of difficulty in labeling an image? What are the other factors?
- Bird pose
- Occlusions
- Lighting
1. The authors experiment only with a 2-dimensional model of human expertise. How would this model perform with more intrinsic dimensions?
Descriptions from different annotators for one video (http://youtu.be/FYyqIJ36dSU):
- A French bulldog is playing with a big ball
- A small dog chases a big ball.
- A French bulldog is running fast and playing with a blue yoga ball all by himself in a field.
- The little dog pushed a big blue ball.
- A dog is playing with a very large ball.
- A dog chases a giant rubber ball around
- A dog is playing with ball
The YouTube corpus is not cut out for this task. Consider predicting the presence of the activity "run".
Ground-truth labels were assigned accordingly. Each video had a variable number of annotators; we picked the 20 most frequent annotators.
Subsampled “RUN” data
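The annotator-selection and labelling setup above can be sketched as follows. The data shape is hypothetical (the slides do not show the corpus format); it assumes (video, annotator, label) triples, keeps only the most frequent annotators, and uses a simple majority vote as the ground-truth estimate.

```python
from collections import Counter, defaultdict

# Hypothetical (video_id, annotator_id, label) triples; each video is
# labelled by a variable number of annotators, 1 = "run" is present.
annotations = [
    ("v1", "a1", 1), ("v1", "a2", 1), ("v1", "a3", 0),
    ("v2", "a1", 0), ("v2", "a2", 0),
    ("v3", "a1", 1), ("v3", "a2", 1), ("v3", "a4", 1),
]

# Keep only the k most frequent annotators (the slides use k = 20).
k = 2
counts = Counter(a for _, a, _ in annotations)
keep = {a for a, _ in counts.most_common(k)}
filtered = [t for t in annotations if t[1] in keep]

# Majority vote per video as a simple ground-truth estimate.
votes = defaultdict(list)
for vid, _, lab in filtered:
    votes[vid].append(lab)
ground_truth = {vid: int(sum(v) > len(v) / 2) for vid, v in votes.items()}
print(ground_truth)   # → {'v1': 1, 'v2': 0, 'v3': 1}
```

Majority vote is only the baseline the paper argues against; the paper's model instead infers ground truth jointly with annotator parameters.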
Vision-based measure*:
- Number of STIPs in the video
- STIP density
*Learning Realistic Human Actions from Movies. I. Laptev, M. Marszałek, C. Schmid and B. Rozenfeld. CVPR 2008.
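The two measures above are simple to compute once a STIP detector (Laptev et al.) has run on each video. The per-video counts and lengths below are made-up placeholders; only the normalisation is the point.

```python
# Hypothetical per-video data: number of detected space-time interest
# points (STIPs) and video length in seconds.
videos = {
    "run_clip":  {"n_stips": 840, "length_s": 12.0},
    "walk_clip": {"n_stips": 210, "length_s": 15.0},
    "play_clip": {"n_stips": 455, "length_s": 7.0},
}

def stip_density(n_stips, length_s):
    """STIPs per second: normalises the raw count by video length so a
    long video is not judged 'complex' just for being long."""
    return n_stips / length_s

for name, v in videos.items():
    print(name, stip_density(v["n_stips"], v["length_s"]))
```

Density rather than raw count is the fairer comparison across clips of different lengths, which is why both appear on the slides.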
[Scatter plots: learned complexity vs. number of STIPs, and learned complexity vs. STIP density, with PLAY/WALK and RUN videos marked.]
Not much correlation, but false negatives seem to have a higher STIP density.
How can we quantify the complexity of a video?
- STIP density?
- Video length?
- Variety in STIPs?
- Confusion amongst multiple annotators?
How can we quantify the effort involved in labeling a video? How do these relate to video ambiguity?
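One of the candidate measures above, confusion amongst annotators, has a natural quantification: the entropy of the label distribution for a video. This is my suggestion for making the question concrete, not something from the slides.

```python
import math

def label_entropy(labels):
    """Shannon entropy (bits) of the annotators' label distribution for
    one video: 0 when everyone agrees, 1 for a 50/50 binary split."""
    n = len(labels)
    ent = 0.0
    for lab in set(labels):
        p = labels.count(lab) / n
        ent -= p * math.log2(p)
    return ent

print(label_entropy([1, 1, 1, 1]))   # full agreement → 0.0
print(label_entropy([1, 1, 0, 0]))   # maximal confusion → 1.0
print(label_entropy([1, 1, 1, 0]))   # mild disagreement
```

Unlike STIP density, this measure needs no vision at all, which mirrors the paper's point that difficulty can be inferred purely from annotations.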
[Example videos plotted by learned complexity vs. STIP density: http://youtu.be/NKm8c_7mgx4, http://youtu.be/abiezv1p7SY, http://youtu.be/1l9Hx1kX_tQ, http://youtu.be/8miosT-Fs1k]
1. Each annotator is modeled as a multi-dimensional entity: competence, expertise, bias.
2. The approach can be extended to any domain to estimate the ground truth with the least error.
3. The model captures image complexity without even seeing the image.
4. The model discovers groups of annotators with varying skill sets.
1. Image difficulties are learned from human annotations only, which is great! But would the model perform better if image difficulty were incorporated as a known parameter (using some vision-based technique) into the graphical model?