SLIDE 1
Combining Crowd and Expert Labels using Decision Theoretic Active Learning
An T. Nguyen¹, Byron C. Wallace, Matthew Lease
University of Texas at Austin
HCOMP, 2015
¹Presenter
SLIDE 2
The Problem: Label Collection
◮ Have some unlabeled data.
◮ Want labels of high quality at low cost.
SLIDE 3
The Problem: Label Collection
◮ Have some unlabeled data.
◮ Want labels of high quality at low cost.
Finite Pool Setting
◮ Care about label quality of the current data.
◮ Don't care (much) about future data.
SLIDE 4
Some Solutions
SLIDE 5
Some Solutions
◮ Hire a domain expert to give labels.
SLIDE 6
Some Solutions
◮ Hire a domain expert to give labels.
◮ Crowdsource the labeling.
SLIDE 7
Some Solutions
◮ Hire a domain expert to give labels.
◮ Crowdsource the labeling.
◮ Build a Prediction Model (Classifier).
SLIDE 8
Some Solutions
◮ Hire a domain expert to give labels.
◮ Crowdsource the labeling.
◮ Build a Prediction Model (Classifier).
Our work: A principled way to combine these:
SLIDE 9
Some Solutions
◮ Hire a domain expert to give labels.
◮ Crowdsource the labeling.
◮ Build a Prediction Model (Classifier).
Our work: A principled way to combine these:
◮ Which item? Which labeler?
◮ How to use the classifier?
SLIDE 10
Method: Previous work
Roy and McCallum 2001
◮ ‘Optimal’ Active Learning.
SLIDE 11
Method: Previous work
Roy and McCallum 2001
◮ ‘Optimal’ Active Learning.
◮ Select which item to label next by:
SLIDE 12
Method: Previous work
Roy and McCallum 2001
◮ ‘Optimal’ Active Learning.
◮ Select which item to label next by:
1. Consider each item.
2. Consider each possible label.
SLIDE 13
Method: Previous work
Roy and McCallum 2001
◮ ‘Optimal’ Active Learning.
◮ Select which item to label next by:
1. Consider each item.
2. Consider each possible label.
3. Add that (item, label) to the training set.
4. Retrain and evaluate.
SLIDE 14
Method: Previous work
Roy and McCallum 2001
◮ ‘Optimal’ Active Learning.
◮ Select which item to label next by:
1. Consider each item.
2. Consider each possible label.
3. Add that (item, label) to the training set.
4. Retrain and evaluate.
5. Weight outcomes by (predictive) probabilities.
6. Select the one with the best expected outcome.
SLIDE 15
Method: Previous work
Roy and McCallum 2001
◮ ‘Optimal’ Active Learning.
◮ Select which item to label next by:
1. Consider each item.
2. Consider each possible label.
3. Add that (item, label) to the training set.
4. Retrain and evaluate.
5. Weight outcomes by (predictive) probabilities.
6. Select the one with the best expected outcome.
◮ Basically a one-step look-ahead.
◮ A (perhaps) better name: Decision Theoretic Active Learning.
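To make the look-ahead concrete, here is a minimal sketch of steps 1-6 in Python. It is not Roy & McCallum's implementation: the scikit-learn-style classifier, binary labels, and the self-estimated expected-loss proxy (the model's own uncertainty over the pool, since no gold labels are available) are illustrative assumptions.

```python
import numpy as np
from sklearn.base import clone
from sklearn.linear_model import LogisticRegression

def expected_loss(model, X_pool):
    """Self-estimated proxy for future error: expected 0/1 loss of the
    model's predictions under its own predictive probabilities."""
    probs = model.predict_proba(X_pool)
    return np.sum(1.0 - probs.max(axis=1))

def select_item(X_train, y_train, X_pool):
    """One-step look-ahead: return the index of the pool item whose
    (simulated) labeling gives the lowest expected loss."""
    base = LogisticRegression().fit(X_train, y_train)
    pool_probs = base.predict_proba(X_pool)  # weights over possible labels
    best_idx, best_score = None, np.inf
    for i, x in enumerate(X_pool):           # 1. consider each item
        score = 0.0
        for label in (0, 1):                 # 2. consider each possible label
            # 3. add the hypothetical (item, label) to the training set
            X_aug = np.vstack([X_train, x])
            y_aug = np.append(y_train, label)
            # 4. retrain and evaluate
            m = clone(base).fit(X_aug, y_aug)
            # 5. weight the outcome by the predictive probability
            score += pool_probs[i, label] * expected_loss(m, X_pool)
        if score < best_score:               # 6. keep the best expected outcome
            best_idx, best_score = i, score
    return best_idx
```

Note the cost: each selection retrains the classifier twice per pool item, which is why the paper's heuristics to make this fast matter in practice.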
SLIDE 16
Method: Our ideas
The key idea: Extend their algorithm to include expert/crowd/classifier.
SLIDE 17
Method: Our ideas
The key idea: Extend their algorithm to include expert/crowd/classifier.
◮ Consider (item, label, labeler).
SLIDE 18
Method: Our ideas
The key idea: Extend their algorithm to include expert/crowd/classifier.
◮ Consider (item, label, labeler).
◮ Have a Crowd Accuracy Model:
Pr(true label | crowd label) = ?
SLIDE 19
Method: Our ideas
The key idea: Extend their algorithm to include expert/crowd/classifier.
◮ Consider (item, label, labeler).
◮ Have a Crowd Accuracy Model:
Pr(true label | crowd label) = ?
Strategy: Loss Prediction/Minimization
◮ Loss for expert labels = 0.
◮ Predict loss for crowd labels.
◮ Predict loss for the classifier's predictions.
SLIDE 20
Method: Our ideas
The key idea: Extend their algorithm to include expert/crowd/classifier.
◮ Consider (item, label, labeler).
◮ Have a Crowd Accuracy Model:
Pr(true label | crowd label) = ?
Strategy: Loss Prediction/Minimization
◮ Loss for expert labels = 0.
◮ Predict loss for crowd labels.
◮ Predict loss for the classifier's predictions.
◮ Predict the loss reduction after adding a label by a labeler.
Decision Criterion: Loss Reduction / Cost
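A hedged sketch of how these pieces could fit together. The function names, the simple per-item losses, and the single scalar crowd-accuracy value are assumptions for illustration; the paper's crowd model and loss estimates are richer.

```python
# Expert labels cost 100x crowd labels, mirroring the prices used in the talk.
EXPERT_COST, CROWD_COST = 100.0, 1.0

def item_loss(p_pos, label_source, p_crowd_correct=None):
    """Predicted loss of a single item, per the strategy above.
    - 'expert':     trusted, so loss 0
    - 'crowd':      1 - Pr(true label | crowd label), from the crowd model
    - 'classifier': probability that the model's own prediction is wrong
    """
    if label_source == "expert":
        return 0.0
    if label_source == "crowd":
        return 1.0 - p_crowd_correct
    return min(p_pos, 1.0 - p_pos)  # unlabeled: rely on the classifier

def decision_score(loss_now, expected_loss_after, labeler):
    """Decision criterion: predicted loss reduction per unit cost.
    The (item, labeler) pair with the highest score is queried next."""
    cost = EXPERT_COST if labeler == "expert" else CROWD_COST
    return (loss_now - expected_loss_after) / cost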
SLIDE 21
Evaluation: Application
Evidence Based Medicine (EBM)
aims to inform patient care using the entirety of the evidence.
SLIDE 22
Evaluation: Application
Evidence Based Medicine (EBM)
aims to inform patient care using the entirety of the evidence.
Biomedical Citation Screening
is the first step in EBM: identify relevant citations (paper abstracts, titles, keywords ...).
SLIDE 23
Evaluation: Application
Evidence Based Medicine (EBM)
aims to inform patient care using the entirety of the evidence.
Biomedical Citation Screening
is the first step in EBM: identify relevant citations (paper abstracts, titles, keywords ...).
Two characteristics:
◮ Very imbalanced (2-15% positive).
◮ Recall is far more important than precision.
SLIDE 24
Evaluation: Application
Evidence Based Medicine (EBM)
aims to inform patient care using the entirety of the evidence.
Biomedical Citation Screening
is the first step in EBM: identify relevant citations (paper abstracts, titles, keywords ...).
Two characteristics:
◮ Very imbalanced (2-15% positive).
◮ Recall is far more important than precision.
The expert
◮ MD, specialist.
◮ Very expensive: paid 100 times as much as a crowd worker.
SLIDE 25
Evaluation: Data
Four Biomedical Citation Screening Datasets
SLIDE 26
Evaluation: Data
Four Biomedical Citation Screening Datasets
◮ Have expert gold labels.
◮ Have crowd labels (5 for each item), collected via Amazon Mechanical Turk.
SLIDE 27
Evaluation: Data
Four Biomedical Citation Screening Datasets
◮ Have expert gold labels.
◮ Have crowd labels (5 for each item), collected via Amazon Mechanical Turk.
Strategy to use
1. Test/refine our methods using only the first and second datasets.
SLIDE 28
Evaluation: Data
Four Biomedical Citation Screening Datasets
◮ Have expert gold labels.
◮ Have crowd labels (5 for each item), collected via Amazon Mechanical Turk.
Strategy to use
1. Test/refine our methods using only the first and second datasets.
2. Finalize all details (e.g. hyper-parameters).
SLIDE 29
Evaluation: Data
Four Biomedical Citation Screening Datasets
◮ Have expert gold labels.
◮ Have crowd labels (5 for each item), collected via Amazon Mechanical Turk.
Strategy to use
1. Test/refine our methods using only the first and second datasets.
2. Finalize all details (e.g. hyper-parameters).
3. Test on the third and fourth datasets.
SLIDE 30
Evaluation: Data
Four Biomedical Citation Screening Datasets
◮ Have expert gold labels.
◮ Have crowd labels (5 for each item), collected via Amazon Mechanical Turk.
Strategy to use
1. Test/refine our methods using only the first and second datasets.
2. Finalize all details (e.g. hyper-parameters).
3. Test on the third and fourth datasets.
4. Purpose: see how the method performs on real future data.
SLIDE 31
Evaluation: Setup
Active Learning Baseline: Uncertainty Sampling (US)
Select the item whose predicted probability is closest to 0.5.
SLIDE 32
Evaluation: Setup
Active Learning Baseline: Uncertainty Sampling (US)
Select the item whose predicted probability is closest to 0.5.
Compare Four Algorithms
◮ US-Crowd: use only crowd labels.
◮ US-Expert: use only expert labels.
◮ US-Crowd+Expert: crowd first; expert if the crowd disagrees.
◮ Decision Theory: our method.
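For reference, the US baseline is essentially a one-liner. A minimal sketch, assuming a scikit-learn-style `predict_proba` and binary labels (illustrative, not the experimental code):

```python
import numpy as np

def uncertainty_sample(model, X_pool):
    """Binary uncertainty sampling: pick the pool item whose predicted
    positive-class probability is closest to 0.5."""
    p_pos = model.predict_proba(X_pool)[:, 1]
    return int(np.argmin(np.abs(p_pos - 0.5)))
```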
SLIDE 33
Evaluation: Metric
Compare collected labels vs. gold labels
SLIDE 34
Evaluation: Metric
Compare collected labels vs. gold labels
Collected labels include:
◮ Expert labels.
◮ Crowd labels (majority voting).
◮ Classifier predictions (trained on crowd & expert labels).
SLIDE 35
Evaluation: Metric
Compare collected labels vs. gold labels
Collected labels include:
◮ Expert labels.
◮ Crowd labels (majority voting).
◮ Classifier predictions (trained on crowd & expert labels).
We present: Cost-Loss Learning Curve
◮ One expert label = 100; one crowd label = 1.
◮ Loss = # False Positives + 10 × # False Negatives.
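A small sketch of how one point on the cost-loss curve could be computed under these prices and weights. The function names and the aggregation order (expert label over crowd majority over classifier prediction) are assumptions for illustration:

```python
import numpy as np

def majority_vote(crowd_labels):
    """Aggregate one item's crowd votes (a list of 0/1 labels)."""
    return int(np.mean(crowd_labels) >= 0.5)

def curve_point(gold, collected, n_expert_labels, n_crowd_labels):
    """Return (cost, loss) for the current collected labels, where
    `collected` holds one final 0/1 label per item."""
    gold, collected = np.asarray(gold), np.asarray(collected)
    false_pos = np.sum((collected == 1) & (gold == 0))
    false_neg = np.sum((collected == 0) & (gold == 1))
    cost = 100.0 * n_expert_labels + 1.0 * n_crowd_labels
    loss = false_pos + 10.0 * false_neg   # FN weighted 10x, per the metric
    return cost, loss
```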
SLIDE 36
Evaluation: Result: First Dataset
SLIDE 37
Evaluation: Result: Second Dataset
SLIDE 38
Evaluation: Result: Third (real future) Dataset
SLIDE 39
Evaluation: Result: Fourth (real future) Dataset
SLIDE 40
Discussion
Our method
◮ Overall effective; consistently good in the beginning.
◮ On ‘real future’ datasets: loses slightly at some points.
SLIDE 41
Discussion
Our method
◮ Overall effective; consistently good in the beginning.
◮ On ‘real future’ datasets: loses slightly at some points.
Future work
◮ Better worker model.
◮ Multi-step look-ahead.
◮ Quality assurance/guarantees.
SLIDE 42
Summary
We have presented
◮ High-level ideas of our method.
◮ Evaluation and results.
SLIDE 43
Summary
We have presented
◮ High-level ideas of our method.
◮ Evaluation and results.
We have omitted
◮ Full algorithms. Implementation details.
◮ Heuristics to make this fast.
◮ Crowd Model. Active Sampling Correction.
◮ More results.
SLIDE 44
Summary
We have presented
◮ High-level ideas of our method.
◮ Evaluation and results.
We have omitted
◮ Full algorithms. Implementation details.
◮ Heuristics to make this fast.
◮ Crowd Model. Active Sampling Correction.
◮ More results.
◮ See the paper.
SLIDE 45
Summary
We have presented
◮ High-level ideas of our method.
◮ Evaluation and results.
We have omitted
◮ Full algorithms. Implementation details.
◮ Heuristics to make this fast.
◮ Crowd Model. Active Sampling Correction.
◮ More results.
◮ See the paper.
Questions?
SLIDE 46
References I
Roy, Nicholas and Andrew McCallum (2001). “Toward Optimal Active Learning through Sampling Estimation of Error Reduction”. In: Proceedings of the 18th International Conference on Machine Learning (ICML).