How to crowdsource behavioural data in the social sciences Jon W. - - PowerPoint PPT Presentation

how to crowdsource behavioural data in the social sciences
SMART_READER_LITE
LIVE PREVIEW

How to crowdsource behavioural data in the social sciences Jon W. - - PowerPoint PPT Presentation

How to crowdsource behavioural data in the social sciences Jon W. Carr & Jasmeen Kanwal Centre for Language Evolution School of Philosophy, Psychology and Language Sciences University of Edinburgh Todays workshop About Us Why run


slide-1
SLIDE 1

Jon W. Carr & Jasmeen Kanwal Centre for Language Evolution School of Philosophy, Psychology and Language Sciences University of Edinburgh

How to crowdsource behavioural data in the social sciences

slide-2
SLIDE 2

Today’s workshop

About Us Why run experiments online? CrowdFlower & Mechanical Turk Ethics & Payment Web Programming Basics Live Demo & Hands-on Question & Answer

slide-3
SLIDE 3

Language

What shapes language?

slide-4
SLIDE 4

Language

Expressivity

What shapes language?

slide-5
SLIDE 5

Language

Learnability Expressivity

What shapes language?

slide-6
SLIDE 6

Language Evolution in the Lab

Generation 1 Generation 2 Generation 3 Generation 4

slide-7
SLIDE 7

Why Run Experiments Online?

  • Get results fast. You can have a few hundred people do your

experiment within half an hour.

  • Save time. You don’t need to spend weeks in the lab explaining

the task over and over to participants.

  • Cheaper. Each participant spends less time because they don’t

need to come into the lab.

  • More diverse population. Not as WEIRD?
  • As good as lab results. Reproduce classic results.
slide-8
SLIDE 8

Mechanical Turk

slide-9
SLIDE 9

Mechanical Turk

  • Requires a US bank account, or use a 3rd party middle-man
  • Participants also need a US bank account, so mostly Americans
  • This means a slow-replenishing participant pool, and an big

proportion of non-naïve participants

  • Participants get paid at the full wage you set
  • Participants do not get paid until you approve their responses, so

they tend to be highly attentive and keen to perform well

slide-10
SLIDE 10

CrowdFlower

  • Open to people outside US
  • Participants from all over the world, so potentially a more diverse

population than MTurk

  • CF mostly outsources to other (possibly questionable) companies,

and the fee taken by these companies is not transparent

  • The platform is not ideal for running your own custom

experiments, so you have to resort to slightly hacky methods to get it to work

slide-11
SLIDE 11

Ethics & Payment

  • Standard ethics apply. You still need to get ethical approval.
  • Store online participants data securely and anonymously just like

you would with lab participants.

  • Set wages equivalent to minimum wage or minimum wage set by

your ethics body.

  • Generate a completion code and give it to your participant, so

that you can verify that they completed the task.

  • Be aware of potential payment issues.
slide-12
SLIDE 12

Server-Client Basics

slide-13
SLIDE 13

Web Programming

Python JavaScript PHP Client-side Server-side HTML JavaScript CSS Perl Ruby Java Go

slide-14
SLIDE 14

Client-Side

HTML: Markup language describing the elements

  • f the page (DOM)

CSS: Markup language describing how things should be styled (colours, sizes, positions, etc) JavaScript: Programming language where you do the client-side logic

slide-15
SLIDE 15

Subitizing

Rapid, accurate, and confident estimation of numerosity, especially for small numbers

slide-16
SLIDE 16

Subitizing

Rapid, accurate, and confident estimation of numerosity, especially for small numbers

slide-17
SLIDE 17

Subitizing

Rapid, accurate, and confident estimation of numerosity, especially for small numbers

slide-18
SLIDE 18

Subitizing

slide-19
SLIDE 19

Subitizing

slide-20
SLIDE 20

Subitizing

slide-21
SLIDE 21

Subitizing

slide-22
SLIDE 22

Hands-on

If you’re feeling adventurous… play with the code github.com/jwcarr/subitizingOnline Then set up the task crowdflower.com OR mturk.com Try the experiment blake.ppls.ed.ac.uk/~s1153197/sub/cf.php

slide-23
SLIDE 23

Results…

slide-24
SLIDE 24

Results reported in Dehaene (1992) Our CrowdFlower results Our Mechanical Turk results