cs160. cs160. valkyriesavage.com valkyriesavage.com data analysis - PowerPoint PPT Presentation

cs160. cs160. valkyriesavage.com valkyriesavage.com data analysis July 22, 2015 Valkyrie Savage

thanks for the feedback!

Data Analysis 41057893@N02 on flickr

Start by counting 5680 trials total � � normal: bubble: mean time 976.1 ms, mean time 809.4 ms, mean errors 2.560 mean errors 0.287 �

Start by counting 71 users completed condition normal, size 10 71 users completed condition bubble, size 10 mean time: 1123.43 ms, mean errors: 3.408 mean time: 852.75 ms, mean errors: 0.296 median time: 1039 ms, median errors: 3 median time: 804 ms, median errors: 0 � � 70 users completed condition normal, size 25 72 users completed condition bubble, size 25 mean time: 826.64 ms, mean errors: 1.700 mean time: 766.58 ms, mean errors: 0.014 median time: 785 ms, median errors: 1 median time: 725 ms, median errors: 0

Descriptive Statistics Continuous data: N Shape of distribution ∑ X i Central tendency i = 1 µ = Skew, Kurtosis N mean,median,mode Mean Categorical data: Dispersion 2 ∑ ( ) X i − µ Frequency σ = Range (max-min) N distributions Standard Standard deviation

Understanding Y our Data Exploratory Data Analysis (EDA): Look at your data from different perspectives to get better intuition for it. Show the raw data! � Use different visualizations: Histograms, scatterplots, box plots, …

1D Scatter Plot with Jitter

1D Scatter Plot with Jitter colored by condition

1D Scatter Plot with Jitter separated by condition

Cleaning Data Don’t discard data just because it doesn’t fit your expectation! Maybe your assumptions were wrong � In online experiments, discarding extreme outliers can make sense if you believe they reflect users not following normal task protocol (e.g., multitasking in a reaction-time study)

Median vs. Mean For normally distributed data, mean=median. Many data sets gathered online are strongly skewed Outliers pull the mean to the right/left Median is more robust!

Power Law Distributions From C. Shirky, Here Comes Everybody

Power Law Distribution Source: Ed Chi

Confidence interval confidence interval (also called margin of error) is the plus-or-minus figure usually reported in newspaper or television opinion poll results. � if you use a confidence interval of 4 and 47% percent of your sample picks an answer you can be "sure" that if you had asked the question of the entire relevant population between 43% (47-4) and 51% (47+4) would have picked that answer

Sample size 1000 people in population � 95% confidence level � Confidence interval of +-5 � https://www.qualtrics.com/blog/ Need to sample 278 people determining-sample-size/ � Confidence interval of +-1 � …you need to sample 906 people

Effect Sizes: Time � Normal vs. Bubble cursor at target size 10:   Target size for normal cursor:   1123ms vs. 852ms: Bubble cursor 31% 1123ms vs 826ms: Larger targets 35% faster faster Normal vs. Bubble cursor at target size 25:   Target size for Bubble cursor:   826ms vs. 766ms: Bubble cursor 8% 852ms vs. 766ms: Larger targets 11% faster faster �

Effect Sizes: Error Normal vs. Bubble cursor, target size 10:   3.4 vs. 0.3 Errors per 20 trials: 1033% fewer errors Normal vs. Bubble cursor, target size 25:   1.7 vs. 0.3 Errors per 20 trials: 466% fewer errors

break!

Interaction Effects Relationship between one IV and DV depends on the level of another IV

Example of Interactions Group problem solving Independent variable: Leadership [example from Martin 04]

Example of Interactions Group problem solving Independent variable: Leadership Independent variable: Group size [example from Martin 04]

Example of Interactions Group problem solving Change in time due to leadership is same regardless of group size [example from Martin 04]

Example of Interactions Group problem solving Change in time due to leadership is same regardless of group size Change in time due to group size is same regardless of leadership Independent variables do not interact [example from Martin 04]

Example of Interactions Multiple IVs affect DV non-additively Change in time due to leadership differs with changes in group size Independent variables do interact [example from Martin 04]

Population versus Sample

Are the Results Meaningful? p < 0.05 usually considered significant Hypothesis testing (Sometimes p < 0.01) Hypothesis: Manipulation of IV effects DV Means that < 5% chance that null in some way hypothesis is true Null hypothesis: Manipulation of IV has Statistical tests no effect on DV T-test (1 factor, 2 levels) Null hypothesis assumed true unless statistics allow us to reject it Correlation Statistical significance (p value) ANOVA (1 factor, > 2 levels, multiple factors) Likelihood that results are due to chance variation MANOVA ( > 1 dependent variable)

T -test Compare means of 2 groups Population variances are equal (between subjects tests) Null hypothesis: No difference between means Reasonably robust for differing variances Assumptions Individual observations in Samples are normally samples are independent distributed Important! Very robust in practice

ANOV A Repeated measures analysis of variance   Single factor analysis of variance (ANOVA) (RM-ANOVA) Compare means for 3 or more levels of a Use when > 1 observation per subject single independent variable (within subjects experiment) Multi-Way Analysis of variance (n-Way Multi-variate analysis of variance (MANOVA) ANOVA) Compare between more than one Compare more than one independent dependent var. variable ANOVA tests whether means differ, but does Can find interactions between not tell us which means differ – for this we independent variables   must perform pairwise t-tests

t-test? ANOV A? n-way ANOV A? MANOV A?

Our Example Two-Way ANOVA (Cursor, Size) for time: Main effect for cursor F(1,5676) = 424.9, p<0.001 is statistically significant. Main effect for size F(1,5676)=556.2, p<0.001 is statistically significant. Interaction cursor x size F(1,5676)=169.5, p<0.001 is statistically significant.

Our Example Two-Way ANOVA (Cursor, Size) for errors: Main effect for cursor F(1,564) = 314.04, p<0.001 is statistically significant. Main effect for size F(1,564)=44.65, p<0.001 is statistically significant. Interaction cursor x size F(1,564)=43.40, p<0.001 is statistically significant.

errors in Bubble Cursor case only F(1,2038) = 0.009, p=0.92 – NOT significant

What does p > 0.05 mean? No statistically significant (at 5% level) Does that mean that the two conditions are equivalent? No! We did observe differences. But we can’t be confident they weren’t due to chance.

Draw Conclusions What is the scope of the finding? Are there other parameters at play? Internal validity Does the experiment reflect real use? External validity

Summary Pros/Cons Quantitative evaluations Objective measurements Repeatable, reliable evaluation of interface elements Good internal validity -> repeatability To control properly, usually limited to low- But, real-world implications may be level issues difficult to foresee Menu selection method A faster than Statistically significant results doesn’t method B imply real-world importance � 3.05s versus 3.00s for menu selection

assignments! collegedegrees360 on flickr

Midterm Exam Midterm July 27 (Monday!!) 80 minute exam: be here on time! Covers lectures & studios up to now (plus readings, assignments, …) Closed book. No notes, no tech.

midterm reviews: today in section, tomorrow in studio

GRP05 : interactive prototype due Monday after midterm (3 August)

PRG03 framer license details are on Piazza

another judge : Anca Mosoiu founder of community tech hub in oakland

cs160. cs160. valkyriesavage.com valkyriesavage.com data analysis July 22, 2015 Valkyrie Savage

cs160. cs160. valkyriesavage.com valkyriesavage.com data analysis - PowerPoint PPT Presentation

cs160. cs160. valkyriesavage.com valkyriesavage.com data analysis July 22, 2015 Valkyrie Savage thanks for the feedback! Data Analysis 41057893@N02 on flickr Start by counting 5680 trials total normal: bubble: mean time 976.1

Samsung Galaxy Gear - A Long Time Coming cs160. cs160. valkyriesavage.com valkyriesavage.com

Jugger - 5pm Thursdays on Memorial Glade cs160. cs160. valkyriesavage.com valkyriesavage.com

cs160. cs160. valkyriesavage.com valkyriesavage.com personas, scenarios, & storyboards

cs160. cs160. valkyriesavage.com valkyriesavage.com modality , heuristics, and studies, oh my!

cs160. cs160. valkyriesavage.com valkyriesavage.com design cycle and critique June 24, 2015

cs160. cs160. valkyriesavage.com valkyriesavage.com prototyping July 8, 2015 Valkyrie Savage

CS160: INFORMATION VISUALIZATION Prof. Marti Hearst August 4, 2015 INFORMATION VISUALIZATION

Index Compression David Kauchak cs160 Fall 2009 adapted from:

HTML/CSS Basics Forrest Huang CS160 Summer 2019 Week 1 // He/His/Him // 2nd Year Ph.D. HCI + ML

CS160 Midterm Exam Spring 2007, version 1 User Interface Design, Prototyping, and Evaluation Total

DataCamp Data Types for Data Science DataCamp Data Types for Data Science Data types Data type

Environmental Health Science Data Streams Data Streams Health Data Health Data Brian S.

Diagnose data for cleaning Cleaning Data in Python Cleaning data Prepare data for analysis

CS378 Introduction to Data Mining Data Exploration and Data Preprocessing Li Xiong Data

Data Preparation Discretization Data cleaning (Data pre-processing) Data

Business Statistics CONTENTS The role of data The data matrix Data types Aspects of data

Kanban - Crossing the line, pushing the limit or rediscovering the agile vision? Jesper Boeg,

Welcome! (please log in to a computer) Madison and Adams Professional Development December 15,

Use of New Industrial By-products and Mixtures for Reducing the Environmental Cost of

Input Performance KLM, Fitts Law, Pointing Interaction Techniques 1 CS 349 - Input

Lattice QCD Approach to HVP and Muon g-2 Kohtaroh Miura (GSI Helmholtz-Instute Mainz,

HVP lattice finite-volume Giusti corrections OUTLINE Motivations Second Plenary Workshop of

QED in muon g 2 , hadron spectroscopy, and beyond The RBC & UKQCD collaborations

SPC: muon g-2 session Aida X. El-Khadra (University of Illinois) USQCD All Hands meeting, JLab,

Sambuz

Useful Links

Newsletter

Mail Us

cs160. cs160. valkyriesavage.com valkyriesavage.com data analysis - PowerPoint PPT Presentation

cs160. cs160. valkyriesavage.com valkyriesavage.com data analysis July 22, 2015 Valkyrie Savage thanks for the feedback! Data Analysis 41057893@N02 on flickr Start by counting 5680 trials total normal: bubble: mean time 976.1

Samsung Galaxy Gear - A Long Time Coming cs160. cs160. valkyriesavage.com valkyriesavage.com

Jugger - 5pm Thursdays on Memorial Glade cs160. cs160. valkyriesavage.com valkyriesavage.com

cs160. cs160. valkyriesavage.com valkyriesavage.com personas, scenarios, &amp; storyboards

cs160. cs160. valkyriesavage.com valkyriesavage.com modality , heuristics, and studies, oh my!

cs160. cs160. valkyriesavage.com valkyriesavage.com design cycle and critique June 24, 2015

cs160. cs160. valkyriesavage.com valkyriesavage.com prototyping July 8, 2015 Valkyrie Savage

CS160: INFORMATION VISUALIZATION Prof. Marti Hearst August 4, 2015 INFORMATION VISUALIZATION

Index Compression David Kauchak cs160 Fall 2009 adapted from:

HTML/CSS Basics Forrest Huang CS160 Summer 2019 Week 1 // He/His/Him // 2nd Year Ph.D. HCI + ML

CS160 Midterm Exam Spring 2007, version 1 User Interface Design, Prototyping, and Evaluation Total

DataCamp Data Types for Data Science DataCamp Data Types for Data Science Data types Data type

Environmental Health Science Data Streams Data Streams Health Data Health Data Brian S.

Diagnose data for cleaning Cleaning Data in Python Cleaning data Prepare data for analysis

CS378 Introduction to Data Mining Data Exploration and Data Preprocessing Li Xiong Data

Data Preparation Discretization Data cleaning (Data pre-processing) Data

Business Statistics CONTENTS The role of data The data matrix Data types Aspects of data

Kanban - Crossing the line, pushing the limit or rediscovering the agile vision? Jesper Boeg,

Welcome! (please log in to a computer) Madison and Adams Professional Development December 15,

Use of New Industrial By-products and Mixtures for Reducing the Environmental Cost of

Input Performance KLM, Fitts Law, Pointing Interaction Techniques 1 CS 349 - Input

Lattice QCD Approach to HVP and Muon g-2 Kohtaroh Miura (GSI Helmholtz-Instute Mainz,

HVP lattice finite-volume Giusti corrections OUTLINE Motivations Second Plenary Workshop of

QED in muon g 2 , hadron spectroscopy, and beyond The RBC &amp; UKQCD collaborations

SPC: muon g-2 session Aida X. El-Khadra (University of Illinois) USQCD All Hands meeting, JLab,

Sambuz

Useful Links

Newsletter

Mail Us

cs160. cs160. valkyriesavage.com valkyriesavage.com personas, scenarios, & storyboards

QED in muon g 2 , hadron spectroscopy, and beyond The RBC & UKQCD collaborations