how to crowdsource behavioural data in the social sciences
play

How to crowdsource behavioural data in the social sciences Jon W. - PowerPoint PPT Presentation

How to crowdsource behavioural data in the social sciences Jon W. Carr & Jasmeen Kanwal Centre for Language Evolution School of Philosophy, Psychology and Language Sciences University of Edinburgh Todays workshop About Us Why run


  1. How to crowdsource behavioural data in the social sciences Jon W. Carr & Jasmeen Kanwal Centre for Language Evolution School of Philosophy, Psychology and Language Sciences University of Edinburgh

  2. Today’s workshop About Us Why run experiments online? CrowdFlower & Mechanical Turk Ethics & Payment Web Programming Basics Live Demo & Hands-on Question & Answer

  3. What shapes language? Language

  4. What shapes language? Language Expressivity

  5. What shapes language? Language Expressivity Learnability

  6. Language Evolution in the Lab Generation 1 Generation 3 Generation 4 Generation 2

  7. Why Run Experiments Online? • Get results fast. You can have a few hundred people do your experiment within half an hour. • Save time. You don’t need to spend weeks in the lab explaining the task over and over to participants. • Cheaper. Each participant spends less time because they don’t need to come into the lab. • More diverse population. Not as WEIRD? • As good as lab results. Reproduce classic results.

  8. Mechanical Turk

  9. Mechanical Turk • Requires a US bank account, or use a 3rd party middle-man • Participants also need a US bank account, so mostly Americans • This means a slow-replenishing participant pool, and an big proportion of non-naïve participants • Participants get paid at the full wage you set • Participants do not get paid until you approve their responses, so they tend to be highly attentive and keen to perform well

  10. CrowdFlower • Open to people outside US • Participants from all over the world, so potentially a more diverse population than MTurk • CF mostly outsources to other (possibly questionable) companies, and the fee taken by these companies is not transparent • The platform is not ideal for running your own custom experiments, so you have to resort to slightly hacky methods to get it to work

  11. Ethics & Payment • Standard ethics apply. You still need to get ethical approval. • Store online participants data securely and anonymously just like you would with lab participants. • Set wages equivalent to minimum wage or minimum wage set by your ethics body. • Generate a completion code and give it to your participant, so that you can verify that they completed the task. • Be aware of potential payment issues.

  12. Server-Client Basics

  13. Web Programming Client-side Server-side Perl Java HTML CSS Go Ruby Python PHP JavaScript JavaScript

  14. Client-Side HTML: Markup language describing the elements of the page (DOM) CSS: Markup language describing how things should be styled (colours, sizes, positions, etc) JavaScript: Programming language where you do the client-side logic

  15. Subitizing Rapid, accurate, and confident estimation of numerosity, especially for small numbers

  16. Subitizing Rapid, accurate, and confident estimation of numerosity, especially for small numbers

  17. Subitizing Rapid, accurate, and confident estimation of numerosity, especially for small numbers

  18. Subitizing

  19. Subitizing

  20. Subitizing

  21. Subitizing

  22. Hands-on Try the experiment blake.ppls.ed.ac.uk/~s1153197/sub/cf.php Then set up the task crowdflower.com OR mturk.com If you’re feeling adventurous… play with the code github.com/jwcarr/subitizingOnline

  23. Results…

  24. Results reported in Dehaene (1992) Our CrowdFlower results Our Mechanical Turk results

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend