Making Better Use of the Crowd∗
Jennifer Wortman Vaughan Microsoft Research, New York City jenn@microsoft.com December 5, 2016
1 Introduction
Over the last decade, crowdsourcing has been used to harness the power of human computation to solve tasks that are notoriously difficult to solve with computers alone, such as determining whether or not an image contains a tree, rating the relevance of a website, or verifying the phone number of a business. The machine learning community was early to embrace crowdsourcing as a tool for quickly and inexpen- sively obtaining the vast quantities of labeled data needed to train machine learning systems. For example, in their highly influential paper, Snow et al. [59] used crowdworkers to annotate linguistic data for common natural language processing tasks such as word sense disambiguation and affect recognition. Similar ideas were applied to problems like annotating medical images [53] and discovering and labeling image attributes
- r features [51, 52, 73]. This simple idea—that crowds could be used to generate training data for machine
learning algorithms—inspired a flurry of algorithmic work on how to best elicit and aggregate potentially noisy labels [15, 23, 28–30, 42, 58, 67, 71, 72], and is probably what many people in the machine learning community think of when they think of crowdsourcing. In the majority of this work, it is assumed that once collected, the labeled data is handed off to a machine learning algorithm for use in training a model. This handoff is typically where the interaction with the crowd
- ends. The idea is that the learned model should be able to make autonomous predictions or actions. In other
words, the crowd provides the data, but the ultimate goal is to eventually take humans out of the loop. This might lead one to ask: What other problems could the crowd solve? In the first half of this tutorial, I will showcase innovative uses of crowdsourcing that go far beyond the collection of labeled data. These fall into three basic categories:
- Direct applications to machine learning. For example, the crowd can be used to evaluate machine
learning models [9], cluster data [18, 62], and debug the large and complex machine learning models used in fields like computer vision and speech recognition [46, 47, 50].
∗These notes—part survey, part position paper, part best practice guide—were written to accompany the NIPS 2016 tutorial