crowdsourcing quality control
play

Crowdsourcing Quality Control Madalina and Jao-ke CS 286r '12 - PowerPoint PPT Presentation

Crowdsourcing Quality Control Madalina and Jao-ke CS 286r '12 Motivation What do we do when we want to answer a question? Motivation What do we do when we want to answer a question? We ask the questions to lots of people! Motivation


  1. Crowdsourcing Quality Control Madalina and Jao-ke CS 286r '12

  2. Motivation What do we do when we want to answer a question?

  3. Motivation What do we do when we want to answer a question? We ask the questions to lots of people! ●

  4. Motivation What do we do when we want to answer a question? We ask the questions to lots of people! ● What do we want to do when some workers are better than others?

  5. Motivation What do we do when we want to answer a question? We ask the questions to lots of people! ● What do we want to do when some workers are better than others? ● We want to weight the good workers' answers more heavily than those of the bad workers.

  6. Motivation What do we do when we want to answer a question? We ask the questions to lots of people! ● What do we want to do when some workers are better than others? ● We want to weight the good workers' answers more heavily than those of the bad workers. How can we tell which workers are "good" and which workers are "bad"? Proof by majority? ●

  7. Motivation What do we do when we want to answer a question? We ask the questions to lots of people! ● What do we want to do when some workers are better than others? ● We want to weight the good workers' answers more heavily than those of the bad workers. How can we tell which workers are "good" and which workers are "bad"? Proof by majority? ● Nope. (Sorry, democracy.) ●

  8. Set-up Tasks have binary answers ● Each worker performs same number of tasks; each task performed ● by same number of workers Each worker w j has a certain reliability p j ●

  9. Set-up Tasks have binary answers ● Each worker performs same number of tasks; each task performed ● by same number of workers Each worker w j has a certain reliability p j or probability of answering ● a task correctly, for all tasks Example: ○

  10. Set-up Tasks have binary answers ● Each worker performs same number of tasks; each task performed ● by same number of workers Each worker w j has a certain reliability p j or probability of answering ● a task correctly, for all tasks Example: spammer-hammer model ○ Reasonable? How so? How not? ○

  11. Set-up Tasks have binary answers ● Each worker performs same number of tasks; each task performed ● by same number of workers Each worker w j has a certain reliability p j or probability of answering ● a task correctly, for all tasks Example: spammer-hammer model ○ Reasonable? How so? How not? ○ Crowd has average quality ●

  12. Set-up Tasks have binary answers ● Each worker performs same number of tasks; each task performed ● by same number of workers Each worker w j has a certain reliability p j or probability of answering ● a task correctly, for all tasks Example: spammer-hammer model ○ Reasonable? How so? How not? ○ Crowd has average quality q := E[(2 p j -1) 2 ] ●

  13. Set-up Tasks have binary answers ● Each worker performs same number of tasks; each task performed ● by same number of workers Each worker w j has a certain reliability p j or probability of answering ● a task correctly, for all tasks Example: spammer-hammer model ○ Reasonable? How so? How not? ○ Crowd has average quality q := E[(2 p j -1) 2 ] ● Roles of reliability and quality in algorithm? ●

  14. Graph Theory G(V,E) Bipartite Graph: ● ● ● ● ● ● ● ● ● ●

  15. Graph Theory Tasks Workers ● ● ● ● ● ● ● ● ● ● m = 4 n = 6 l = ? r = ?

  16. Graph Theory Workers Tasks ● 1. ● ● 1. 2. ● ● 2. 3. ● ● 3. 4. ● ● 4. 5. ● 6.

  17. Algorithm

  18. Algorithm Example: Updates Workers Tasks ● 1. ● ● 1. 2. ● ● 2. 3. ● ● 3. 4. ● ● 4. 5. ● 6.

  19. Commercial Break Say you have a biased coin that with probability p =/= 1/2 comes up Heads and with probability 1- p comes up Tails. How can you estimate p ?

  20. Algorithm Example: Updates

  21. Optimality Discussion Oracle error: The minimax error rate achieved by the best possible graph G in G(m; l) using the best possible inference algorithm is at least Majority vote: Iterative algorithm:

  22. Algorithm Properties does not require prior, unlike belief propagation ● performance guarantees, unlike expectation maximization ● Performance: ●

  23. Worker Bias Worker A always reverses answers; Worker B always gives same ● answer

  24. Worker Bias Worker A always reverses answers; Worker B always gives same ● answer How can we separate out bias from error? ●

  25. Worker Bias Main idea: Given each worker's error rates, error costs, and priors for the correct answer distributions, we transform workers' "hard" answers into "soft" answers that have minimal (error-associated) costs.

  26. Thank You!

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend