Crowdsourcing Nickolai Riabov, Kenneth Tiong Brown University Fall - - PowerPoint PPT Presentation

crowdsourcing
SMART_READER_LITE
LIVE PREVIEW

Crowdsourcing Nickolai Riabov, Kenneth Tiong Brown University Fall - - PowerPoint PPT Presentation

Definition Mechanical Turk Quality Control Techniques CrowdDB Crowdsourcing Nickolai Riabov, Kenneth Tiong Brown University Fall 2013 Nickolai Riabov, Kenneth Tiong Crowdsourcing Definition Mechanical Turk Quality Control Techniques


slide-1
SLIDE 1

Definition Mechanical Turk Quality Control Techniques CrowdDB

Crowdsourcing

Nickolai Riabov, Kenneth Tiong

Brown University

Fall 2013

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-2
SLIDE 2

Definition Mechanical Turk Quality Control Techniques CrowdDB

Structure of the Talk

Definition Mechanical Turk Quality Control CrowdDB

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-3
SLIDE 3

Definition Mechanical Turk Quality Control Techniques CrowdDB

What is Crowdsourcing?

The practice of obtaining needed services, ideas, or content by soliciting contributions from a large group of people, and especially from an online community, rather than from traditional employees or suppliers Allows for large-scale and on-demand invocation of human input for data-gathering and analysis Distinct from outsourcing in that the work comes from an undefined public rather from a specific group

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-4
SLIDE 4

Definition Mechanical Turk Quality Control Techniques CrowdDB

Crowdsourcing Overview

Requester:

People who submit tasks and collect answers

Platform:

Performs task management

Worker:

People who work on tasks

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-5
SLIDE 5

Definition Mechanical Turk Quality Control Techniques CrowdDB

Machine Translation

Problem:

Manual Evaluation of quality is slow and expensive

Crowdsourcing:

Low costs of non-experts, $0.10 to translate a sentence High agreement between experts and non-experts Good framework for complex tasks like human-assisted translation edit rate (i.e. how much editing a human would have to perform to change a system output so that it exactly matches a reference translation)

Nickolai Riabov, Kenneth Tiong Crowdsourcing

Li, Guoliang, Crowdsourcing @ HotDB2012

slide-6
SLIDE 6

Definition Mechanical Turk Quality Control Techniques CrowdDB

Painting Similarity

How similar is the artistic style in the paintings above? Very Similar Similar Somewhat Dissimilar Very Dissimilar

Nickolai Riabov, Kenneth Tiong Crowdsourcing

Lease, M and Kovashka, A., Human and Machine Detection of Stylistic Similarity in Art. CrowdConf 2010

slide-7
SLIDE 7

Definition Mechanical Turk Quality Control Techniques CrowdDB

Image Search

Nickolai Riabov, Kenneth Tiong Crowdsourcing

Tingxin Yan, Vikas Kumar, Deepak Ganesan: CrowdSearch: exploiting crowds for accurate real-time image search on mobile phones. MobiSys 2010:77-90

slide-8
SLIDE 8

Definition Mechanical Turk Quality Control Techniques CrowdDB

Examples of Crowdsourcing Platforms

Most Famous: Wikipedia Mechanical Turk: Marketplace for (usually small) tasks CrowdDB: Uses crowd to answer DB queries

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-9
SLIDE 9

Definition Mechanical Turk Quality Control Techniques CrowdDB

When to Crowdsource

Computers cannot do the task (e.g. translation) A single person cannot do the task The work can be split into many small tasks

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-10
SLIDE 10

Definition Mechanical Turk Quality Control Techniques CrowdDB

Different Slide Deck

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-11
SLIDE 11

Definition Mechanical Turk Quality Control Techniques CrowdDB

Different Slide Deck

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-12
SLIDE 12

Definition Mechanical Turk Quality Control Techniques CrowdDB

CrowdDB

Relational Database Fail ❙❊▲❊❈❚ ♠❛r❦❡t❴❝❛♣✐t❛❧✐③❛t✐♦♥ ❋❘❖▼ ❝♦♠♣❛♥② ❲❍❊❘❊ ♥❛♠❡ ❂ ✧■✳❇✳▼✳✧❀ Query returns an empty answer if the company table instance in the database does not contain a record for "I.B.M." Why?

Could have been deleted by accident Could be under I.B.N. Could be under International Business Machines

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-13
SLIDE 13

Definition Mechanical Turk Quality Control Techniques CrowdDB

Issues with Relational Databases

Closed World Assumption

Information not in database is either false or nonexistent

Relational databases are extremely literal

Expect data to have been properly cleaned and validated before entry; no native tolerance of inconsistency in data or queries

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-14
SLIDE 14

Definition Mechanical Turk Quality Control Techniques CrowdDB

Issues with Relational Databases

Let’s say you were to run a query like the one below: ❙❊▲❊❈❚ ✐♠❛❣❡ ❋❘❖▼ ♣✐❝t✉r❡ ❲❍❊❘❊ t♦♣✐❝ ❂ ✧❇✉s✐♥❡ss ❙✉❝❝❡ss✧ ❖❘❉❊❘ ❇❨ r❡❧❡✈❛♥❝❡ ▲■▼■❚ ✶❀ Unless someone had previously sorted the pictures by specific topic, there is no good way to run a query like this

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-15
SLIDE 15

Definition Mechanical Turk Quality Control Techniques CrowdDB

CrowdDB

Use the crowd to answer DB queries

Find missing data Make a subjective comparison

Recognize patterns Main operations

Join Sort

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-16
SLIDE 16

Definition Mechanical Turk Quality Control Techniques CrowdDB

CrowdSQL

An SQL extension that supports crowdsourcing (and is therefore the language for crowdDB) Involve missing data and subjective comparisons For traditional databases, equivalent to SQL Developers don’t have to be aware that their code involves crowdsourcing

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-17
SLIDE 17

Definition Mechanical Turk Quality Control Techniques CrowdDB

CrowdSQL

SQL DDL Extensions Specific attributes of tuples can be crowdsourced Entire tuples can be crowdsourced Keyword: CROWD

CrowdDB does not impose any limitations with regard to SQL types and integrity constraints CROWD tables must have a primary key

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-18
SLIDE 18

Definition Mechanical Turk Quality Control Techniques CrowdDB

CrowdDB

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-19
SLIDE 19

Definition Mechanical Turk Quality Control Techniques CrowdDB

Sample Code

Column "url" marked as crowdsourced

❈❘❊❆❚❊ ❚❆❇▲❊ ❉❡♣❛rt♠❡♥t ✭✉♥✐✈❡rs✐t② ❙❚❘■◆●✱ ♥❛♠❡ ❙❚❘■◆●✱ ✉r❧ ❈❘❖❲❉ ❙❚❘■◆●✱ ♣❤♦♥❡ ❙❚❘■◆●✱ P❘■▼❆❘❨ ❑❊❨ ✭✉♥✐✈❡rs✐t②✱ ♥❛♠❡✮✮❀

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-20
SLIDE 20

Definition Mechanical Turk Quality Control Techniques CrowdDB

Sample Code

"Professor" table to be crowdsourced

❈❘❊❆❚❊ ❈❘❖❲❉ ❚❆❇▲❊ Pr♦❢❡ss♦r ✭ ♥❛♠❡ ❙❚❘■◆● P❘■▼❆❘❨ ❑❊❨✱ ❡♠❛✐❧ ❙❚❘■◆● ❯◆■◗❯❊✱ ✉♥✐✈❡rs✐t② ❙❚❘■◆●✱ ❞❡♣❛rt♠❡♥t ❙❚❘■◆●✱ ❋❖❘❊■●◆ ❑❊❨ ✭✉♥✐✈❡rs✐t②✱ ❞❡♣❛rt♠❡♥t✮ ❘❊❋ ❉❡♣❛rt♠❡♥t✭✉♥✐✈❡rs✐t②✱ ♥❛♠❡✮ ✮❀

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-21
SLIDE 21

Definition Mechanical Turk Quality Control Techniques CrowdDB

Comparisons

CROWDEQUAL – ask the crowd if two objects are equal

❙❊▲❊❈❚ ♣r♦❢✐❧❡ ❋❘❖▼ ❞❡♣❛rt♠❡♥t ❲❍❊❘❊ ♥❛♠❡ ∼= ✧❈❙✧❀

CROWDORDER – ask the crowd to arrange the objects in order of importance

❈❘❊❆❚❊ ❚❆❇▲❊ ♣✐❝t✉r❡ ✭ ♣ ■▼❆●❊✱ s✉❜❥❡❝t ❙❚❘■◆● ✮❀ ❙❊▲❊❈❚ ♣ ❋❘❖▼ ♣✐❝t✉r❡ ❲❍❊❘❊ s✉❜❥❡❝t ❂ ✧●♦❧❞❡♥ ●❛t❡ ❇r✐❞❣❡✧ ❖❘❉❊❘ ❇❨ ❈❘❖❲❉❖❘❉❊❘✭♣✱ ✧❲❤✐❝❤ ♣✐❝t✉r❡ ✈✐s✉❛❧✐③❡s ❜❡tt❡r ✪s✉❜❥❡❝t✧✮❀

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-22
SLIDE 22

Definition Mechanical Turk Quality Control Techniques CrowdDB

User Interface Generation

Automatically generates user interfaces Two-step process in CrowdDB User interfaces are in HTML and JavaScript

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-23
SLIDE 23

Definition Mechanical Turk Quality Control Techniques CrowdDB

What the worker sees

The title of the HTML is the name of the table Fields ask the worker to input the missing information Copies the known field values into the HTML form Generates JavaScript code to check for correct types of input

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-24
SLIDE 24

Definition Mechanical Turk Quality Control Techniques CrowdDB

Multi-Relation Interfaces

Foreign key references a non-crowdsourced table Generated user interface shows a drop-down box CrowdDB supports two types of user interfaces

Normalized Denormalized

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-25
SLIDE 25

Definition Mechanical Turk Quality Control Techniques CrowdDB

Crowd Operators

Implements all operators of the relational algebra, just like any traditional database system Initialized with a user interface template and the standard HIT parameters Quality control carried out by majority vote

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-26
SLIDE 26

Definition Mechanical Turk Quality Control Techniques CrowdDB

CrowdDB has three crowd operators

CrowdProbe: Crowdsources missing information of crowd columns CrowdJoin: Implements an index nested-loop join over two tables CrowdCompare: Implements the CROWDEQUAL and CROWDORDER functions

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-27
SLIDE 27

Definition Mechanical Turk Quality Control Techniques CrowdDB

CrowdSQL in practice

Minimal extension to SQL CrowdSQL changes the closed-world to an open-world assumption Cost and response time of queries can be unbounded Provide a way to define a budget for a query – using the LIMIT operator

Constrains the number of tuples returned as a result of the query Implicit constraint on cost and result time of query

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-28
SLIDE 28

Definition Mechanical Turk Quality Control Techniques CrowdDB

Shortcomings of CrowdSQL

No explicit constraint on budget

LIMIT only constrains number of responses to query

No accounting for lineage

Turker #5 is a spammer. Currently no way to identify and remove all data from him

No entity resolution of crowdsourced data.

Not a problem if all the workers use exactly the same literals In general, makes data from different sources difficult to clean

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-29
SLIDE 29

Definition Mechanical Turk Quality Control Techniques CrowdDB

Benchmarks

Workers were asked to fill in missing data for a table with two crowdsourced columns For 3607 business names in 40 cities, the turkers had to find the phone # and address of the business

❈❘❊❆❚❊ ❚❆❇▲❊ ❜✉s✐♥❡ss❡s ✭ ♥❛♠❡ ❱❆❘❈❍❆❘ P❘■▼❆❘❨ ❑❊❨✱ ♣❤♦♥❡❴♥✉♠❜❡r ❈❘❖❲❉ ❱❆❘❈❍❆❘✭✸✷✮✱ ❛❞❞r❡ss ❈❘❖❲❉ ❱❆❘❈❍❆❘✭✷✺✻✮ ✮❀

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-30
SLIDE 30

Definition Mechanical Turk Quality Control Techniques CrowdDB

Experiment: Vary hit groups, track response time

Response times decrease dramatically as size of HIT groups increases

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-31
SLIDE 31

Definition Mechanical Turk Quality Control Techniques CrowdDB

Experiment: Vary hit groups, track response time

But, there is a tradeoff between size of HIT group and how much of that HIT group is actually completed

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-32
SLIDE 32

Definition Mechanical Turk Quality Control Techniques CrowdDB

Experiment: Responsiveness, vary reward

For the particular task the experimenters assigned, paying the turkers more resulted in increased performance

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-33
SLIDE 33

Definition Mechanical Turk Quality Control Techniques CrowdDB

Experiment: Worker affinity and quality

Analysis of the distribution of work among workers and answer quality Some workers begin to specialize in a particular requester’s requests This does not decrease error frequency Reward and group size also has no effect on error frequency

Nickolai Riabov, Kenneth Tiong Crowdsourcing

slide-34
SLIDE 34

Definition Mechanical Turk Quality Control Techniques CrowdDB

Observations

Crowd resources have long-term memory that impact performance

If the requester rejects too many HITs, workers stop working for requester Bugs leading to error messages can alarm the turkers

User interface design and precise instructions can greatly increase reliability of results

Nickolai Riabov, Kenneth Tiong Crowdsourcing