Unit 1: Introduction to data Lecture 3: EDA (cont.) and Introduction to statistical inference via simulation Statistics 101
Nicole Dalzell May 15, 2015
Spread
Measures of Spread
The population Variance, σ2, measures each observation’s deviation from the mean. The population Standard Deviation, σ, is the square root of the variance. The Inner Quartile Range (IQR) measures the spread of the middle 50% of your data, and is visually depicted in Boxplots.
Link Statistics 101 ( Nicole Dalzell) U1 - L3: EDA + Inference May 15, 2015 2 / 1 Spread
Box Plot
The box in a box plot represents the middle 50% of the data, and the thick line in the box is the median.
# of study hours / week 10 20 30 40
Statistics 101 ( Nicole Dalzell) U1 - L3: EDA + Inference May 15, 2015 3 / 1 Spread
Anatomy of a Box Plot
# of study hours / week 10 20 30 40 lower whisker Q1 (first quartile) median Q3 (third quartile) upper whisker max whisker reach suspected outliers
Statistics 101 ( Nicole Dalzell) U1 - L3: EDA + Inference May 15, 2015 4 / 1