An automated R tool for identifying individuals with difficulties - PowerPoint PPT Presentation

An automated R tool for identifying individuals with difficulties in a large pool of raters Pete Meyer and Shaun Lysen Google, Santa Monica, California - USA Meyer and Lysen useR! 2008 - 2008-08-12

Overview •The User Experience •How raters assess quality •Identifying raters that are having difficulties •Process flowchart •Summary Meyer and Lysen useR! 2008 - 2008-08-12

The User Experience Google's Mission: organize the world's information and make it universally accessible and useful. Google primarily funds the service it provides with advertising. “Eyeballs” drive the value for advertisers. The User Experience is key to retaining eyeballs. Ads should contribute to the User Experience, not detract from it. Meyer and Lysen useR! 2008 - 2008-08-12

Raters assess quality Raters are trained to assign ratings to query-ad pairs according to common guidelines There are a variety of ways raters might diverge from the guidelines, whose detection would require reference to statistical distributions. • assigning scores randomly • assigning scores that are inconsistent with the • assigning the same score guidelines over and over • assigning the same score to • assigning scores without more than one measure doing due diligence with respect to the landing page Meyer and Lysen useR! 2008 - 2008-08-12

Example: Do a series of ratings appear to be random? Idea: Assuming the rater really is rating tasks randomly, then any configuration of his ratings is equally good. Thus under any permutation of his ratings, his error rate should on average be the same. Meyer and Lysen useR! 2008 - 2008-08-12

Example: Are unusually long runs of the same score assigned? Idea: Given the proportions of each rating occurring over a week and the number of ratings submitted for a given rater, how unusual is it to see run lengths as long as those observed? Simulated run lengths: 1 2 3 4 5 6 7 8 9 10 11 353289 52483 9511 1914 437 87 21 5 2 0 1 Longer observed run lengths: 12 13 14 15 18 22 25 6 1 1 1 2 1 1 Meyer and Lysen useR! 2008 - 2008-08-12

Notifying managers Construct an HTML results file and send a plain text email system(paste('mail -s',subj, ' ',paste(recipients, collapse=','),' < temp0001.txt', sep='')) Send an HTML email paste("mutt -e 'set content_type=\"text/html\"'", paste(recipients, collapse=","), "-s", paste("'",subj, "'", sep=""), "<", fileName) Meyer and Lysen useR! 2008 - 2008-08-12

Process flowchart database DBI RMySQL R crontab HTML mail reports messages R2HTML # m h dom mon dow command 1 2 * * 1 . <home directory>/.bashrc; R --vanilla < RaterFlagging-6.R Meyer and Lysen useR! 2008 - 2008-08-12

Credits (and many thanks!) go to ... R Core DBI: R-Databases Special Interest Group RMySQL: David A. James <dj@bell-labs.com> Saikat DebRoy <saikat@stat.wisc.edu> R2HTML: Eric Lecoutre Meyer and Lysen useR! 2008 - 2008-08-12

Summary R (with DBI, RMySQL, and R2HTML) enabled us to leverage statistical insights that are not accessible through standard database tools in order to identify raters that are having difficulties and communicate the results to colleagues in a production environment. Meyer and Lysen useR! 2008 - 2008-08-12

An automated R tool for identifying individuals with difficulties - PowerPoint PPT Presentation

An automated R tool for identifying individuals with difficulties in a large pool of raters Pete Meyer and Shaun Lysen Google, Santa Monica, California - USA Meyer and Lysen useR! 2008 - 2008-08-12 Overview The User Experience How

SynAthina Onli line Tools 1. . A mapping tool 2. A Community Tool 3. An Archive Tool 3. An

Automated Design of Digital Automated Design of Digital Automated Design of Digital Automated

Individuals and Relations It is useful to view the world as consisting of individuals (objects,

Individuals and Relations It is useful to view the world as consisting of individuals (objects,

Individuals and Relations It is useful to view the world as consisting of individuals (objects,

Overview of Automated Bus Consortium Program Accelerating automated technology for transit

Automated Reasoning: Some Successes and New Challenges Predrag Jani ci c

Week 3 Video 4 Automated Feature Generation Automated Feature Selection Automated Feature

Automated Reasoning Course Presentation Summary Automated Reasoning Motivations Course Plan

PPP Loans For Self Employed Individuals PPP LOANS FOR SELF EMPLOYED INDIVIDUALS Self employed

Improved Communication Feed Forward A tool to help individuals to be better at giving and

Black Box Scanning Tool + White Box Testing Tool Toshis Black Box Scanning Tool Same

Workflow Plus Signature Capture Tool for Synergy Enterprise What is This Tool ? This tool

Workflow Plus URL Hyperlinks Tool for Synergy Enterprise What is This Tool ? This tool will

Vulnerability Screening Tool Identifying and addressing vulnerability: A tool for asylum and

ABILITIES OF SERVICE STATISTICAL DECLARING OF AUTOMATED SOFTWARE TOOL FOREIGN TRADE

LIFE IN THE FUND LANE BRIAN ROBBINS PRESIDENT, RIPTIDE GAMES GAME DEVELOPERS CONFERENCE - MARCH

The Cathedral and the Bazaar Background by Eric S. Raymond Eric Raymond, by 1993, had

DNS NS Adv dvancing ancing the he Net Networ ork k where weve been where were headed

Loci of Compe++on, Market Power, and the Evolving Internet

Local ocal-Aware ware IC ICN-Based Based Co Cont ntent ent Deli eliver very Patrick

Data Transparency: Managing the legislative risk 7th International Digital Curation Conference

What is a Security Flaw? Colin Percival cperciva@freebsd.org Colin Percival May 13, 2006

Attracting eyeballs is hard work! 3 It helps to look through this end! Eyeballs Cost $ !