Learning from Crowds in the Presence of Schools of Thought Yuandong - - PowerPoint PPT Presentation

learning from crowds in the presence of schools of thought
SMART_READER_LITE
LIVE PREVIEW

Learning from Crowds in the Presence of Schools of Thought Yuandong - - PowerPoint PPT Presentation

Learning from Crowds in the Presence of Schools of Thought Yuandong Tian 1 and Jun Zhu 2 1 Carnegie Mellon University 2 Tsinghua University 1 Crowd-sourcing Worker 1 Worker 2 Worker 3 Worker 4 Task 1 x x x x x Task 2 Task 3 x x x 2


slide-1
SLIDE 1

1

Learning from Crowds in the Presence of Schools of Thought

Yuandong Tian1 and Jun Zhu2

1Carnegie Mellon University 2Tsinghua University

slide-2
SLIDE 2

2

Crowd-sourcing

Worker 1 Worker 2 Worker 3 Worker 4 Task 1 x x x Task 2 x x Task 3 x x x

slide-3
SLIDE 3

3

Crowd-sourcing

Objective Tasks Subjective Tasks

E.g. Demographical Survey Personal Opinions Creative thoughts Ill-designed ambiguous tasks. E.g. Labeling dataset Knowledge Test

slide-4
SLIDE 4

4

Crowd-sourcing

Objective Tasks Subjective Tasks

Noise

slide-5
SLIDE 5

5

Crowd-sourcing

Objective Tasks Subjective Tasks Task clarity Worker reliability

Noise

slide-6
SLIDE 6

6

Previous works

Objective Tasks Subjective Tasks

Majority Voting [J. Whitehill et al., NIPS’09] [V.C. Raykar et al., JMLR’10] [P. Welinder et al., NIPS’10] ….. Gold standard Worker Reliability

slide-7
SLIDE 7

7

Our Contribution

Objective Tasks Subjective Tasks Contributions:

  • 1. Applicable to both objective and subjective tasks.
  • 2. Simple , no iterative procedure, no initial guess.
slide-8
SLIDE 8

8

Two Principles A worker is reliable

if he agrees with other workers in many tasks.

A task is clear

if it has only a few answers.

slide-9
SLIDE 9

9

Clustering Analysis

Task k

A B

1 1 1 1 1 1

C D E F G H L Workers

slide-10
SLIDE 10

Group-size Matrix #Z

A D E L G B C F H

Task k

Worker Assign. Cluster size

A I 5 B II 3 C II 3 D I 5 E I 5 F II 3 G I 5 H III 1 L I 5

I II III

slide-11
SLIDE 11

12

Group-size Matrix #Z

#Z

Task 1 Task 2 Task 3 Task 4 Task 5 Task 6 Task 7 Worker A

5 3 2 3 4 2 6

Worker B

3 3 4 5 4 3 6

Worker C

3 2 2 5 2 4 6

Worker D

5 3 4 5 4 4 6

Worker E

5 2 2 5 2 3 2

Worker F

3 2 2 5 2 4 2

Worker G

5 2 4 3 1 3 6

Worker H

1 1 1 1 2 2 1

Worker L

5 1 4 3 4 4 6

slide-12
SLIDE 12

Task 1 Task 2 Task 3 Task 4 Task 5 Task 6 Task 7 Worker A

5 3 2 3 4 2 6

Worker B

3 3 4 5 4 3 6

Worker C

3 2 2 5 2 4 6

Worker D

5 3 4 5 4 4 6

Worker E

5 2 2 5 2 3 2

Worker F

3 2 2 5 2 4 2

Worker G

5 2 4 3 1 3 6

Worker H

1 1 1 1 2 2 1

Worker L

5 1 4 3 4 4 6

13

Worker Reliability

slide-13
SLIDE 13

14

Task Clarity

Task 1 Task 2 Task 3 Task 4 Task 5 Task 6 Task 7 Worker A

5 3 2 3 4 2 6

Worker B

3 3 4 5 4 3 6

Worker C

3 2 2 5 2 4 6

Worker D

5 3 4 5 4 4 6

Worker E

5 2 2 5 2 3 2

Worker F

3 2 2 5 2 4 2

Worker G

5 2 4 3 1 3 6

Worker H

1 1 1 1 2 2 1

Worker L

5 1 4 3 4 4 6

slide-14
SLIDE 14

18

Factorization

=

Worker Reliability Task clarity

#Z > 0  λ > 0 and μ > 0

T 1 T 2 T 3 T4 T5 T6 T 7 WA

5 3 2 3 4 2 6

WB

3 3 4 5 4 3 6

WC

3 2 2 5 2 4 6

WD

5 3 4 5 4 4 6

WE

5 2 2 5 2 3 2

WF

3 2 2 5 2 4 2

WG

5 2 4 3 1 3 6

WH

1 1 1 1 2 2 1

WL

5 1 4 3 4 4 6

#Z

Perron-Frobenius theorem:

slide-15
SLIDE 15

19

Clustering Model

Task k

slide-16
SLIDE 16

20

Clustering Model

Task k

N M

slide-17
SLIDE 17

N M

21

Clustering Model

Task k

answers cluster centers cluster labels

slide-18
SLIDE 18

22

Clustering Model

Task k

N M

slide-19
SLIDE 19

24

Clustering Model

Label assignment Clustering Model

#Z

A D E L G B C F H

T 1 T 2 T 3 T4 T5 T6 T 7 W1 5 3 2 3 4 2 6 W2 3 3 4 5 4 3 6 W3 3 2 2 5 2 4 6 W4 5 3 4 5 4 4 6 W5 5 2 2 5 2 3 2 W6 3 2 2 5 2 4 2 W7 5 2 4 3 1 3 6 W8 1 1 1 1 2 2 1 W9 5 1 4 3 4 4 6
slide-20
SLIDE 20

25

Clustering Model

Label assignment Clustering Model

#Z

T 1 T 2 T 3 T4 T5 T6 T 7 W1 5 3 2 3 4 2 6 W2 3 3 4 5 4 3 6 W3 3 2 2 5 2 4 6 W4 5 3 4 5 4 4 6 W5 5 2 2 5 2 3 2 W6 3 2 2 5 2 4 2 W7 5 2 4 3 1 3 6 W8 1 1 1 1 2 2 1 W9 5 1 4 3 4 4 6

A D E L G B C F H

slide-21
SLIDE 21

30

Close form solution to #Z

slide-22
SLIDE 22

31

Close form solution to #Z

Squared Euclidean Distance between worker i and worker j in task k

slide-23
SLIDE 23

32

Hyper-Parameters Estimation

Hyper-parameters:

σ σ = 0.2

slide-24
SLIDE 24

33

Experiments Setting

Mission I: Image Classification (Sky/Building/Computer) Mission II: Counting Objects Mission III: Images Aesthetics Do these images contain sky? Do these images look pretty?

slide-25
SLIDE 25

34

Statistics

402 workers

Mission I

Sky Building Computer (12) (12) (12)

Mission II

Counting (4)

Mission III

Images Aesthetics (12 + 12) http://www.cs.cmu.edu/~yuandong/kdd2012-dataset.zip

Dataset link:

slide-26
SLIDE 26

35

The Groupsize Matrix

Small Group Size Large Group Size

Tasks Workers

slide-27
SLIDE 27

36

Rank-1 Factorization

= 0.27

slide-28
SLIDE 28

37

Rank-1 Factorization

Worker Reliability

slide-29
SLIDE 29

38

Count 2: Clarity = 69.4

Tasks’ clarity

slide-30
SLIDE 30

39

Beauty1 and Beauty2: Clarity = 12.4/11.8

Task’s clarity

slide-31
SLIDE 31

40

Task’s clarity

Count 4: Clarity = 10.2

slide-32
SLIDE 32

41

Workers’ Reliability

10 20 30 40 50 60 70

1.52 6.62

1.5 6.5 5 65 workers ~ 20% 337 workers ~ 80%

Count

slide-33
SLIDE 33

42

Ranking Workers

Mission I

Sky Building Computer (12) (12) (12)

Mission II

Counting (4)

Mission III

Images Aesthetics (12 + 12)

D most unreliable D most reliable

slide-34
SLIDE 34

43

Ranking Workers

2 4 6 8 10 12 14 16 18 Count1 Count2 Count3 Count4 Std of D best Std of D worst

D = 10

Std.

slide-35
SLIDE 35

44

Ranking Workers

2 4 6 8 10 12 14 16 Count1 Count2 Count3 Count4 Std of D best Std of D worst

D = 30

Std.

slide-36
SLIDE 36

45

Comparison with Clustering

Difference in Variance (a) Our Approach (b) Spectral Clustering (c) PCA-Kmeans (d) Gibbs Sampling

slide-37
SLIDE 37

46

Time Cost

Methods Time (sec) (a) Our approach 1.41± 0.05 (b) Spectral Clustering 3.90±0.36 (c) PCA-Kmeans 0.19±0.06 (d) Gibbs Sampling 53.63±0.19

slide-38
SLIDE 38

47

Predicting Ground truth

Count1 Count2 Count3 Count4

Ours, D = 5/10

65 5 8 26

Majority Voting

53.7 5.0 9.9 22.9

Majority Voting (Median)

60 5.0 8 24

Learning from Crowd [JMLR’10]

56 5 8 24

Multidimensional Wisdom of Crowds [NIPS’10]

63.7 5 8 26.0

Ground truth

65 5 8 27

slide-39
SLIDE 39

49

Conclusion and Future Work

Handling possible missing entries Improving the scalability.

Future Work Conclusion

  • 1. Estimating workers’ reliability and tasks’ clarity in

the presence of schools of thought.

  • 2. Applicable to both objective and subjective tasks.
  • 3. Simple solution without iteration, no initial guess.
slide-40
SLIDE 40

50

Thanks!