SLIDE 8 Constructing a normal probability plot
We construct a normal probability plot for the heights of a sample of 100 men as follows:
- 1. Order the observations.
- 2. Determine the percentile of each observation in the ordered data
set.
- 3. Identify the Z scores corresponding to the each percentile for a Z
distribution.
- 4. Create a scatterplot of the observations (vertical) against the Z
scores (horizontal) Observation i 1 2 3 · · · 100 xi 61 63 63 · · · 78 Percentile , i/(n + 1) 0.99% 1.98% 2.97% · · · 99.01% zi
· · · 2.33 How are the Z scores corresponding to each percentile determined?
28
Below is a histogram and normal probability plot for the heights of Duke men’s basketball players (from 1990s and 2000s). Do these data appear to follow a normal distribution?
height (in.) 70 75 80 85 Theoretical Quantiles Sample Quantiles −2 −1 1 2 70 75 80 85
Source: GoDuke.com
29
Normal probability plot and skewness
2 4 6 8 10 0.0 0.1 0.2 0.3 0.4 0.5 Theoretical Quantiles Sample Quantiles −3 −2 −1 1 2 3 2 4 6 8 10
Right Skew - Points bend up and to the left
2 4 6 8 10 0.0 0.1 0.2 0.3 0.4 0.5 Theoretical Quantiles Sample Quantiles −3 −2 −1 1 2 3 2 4 6 8 10
Left Skew - Points bend down and to the right
−2 −1 1 2 0.0 0.1 0.2 0.3 0.4 0.5 Theoretical Quantiles Sample Quantiles −3 −2 −1 1 2 3 −1.5 −0.5 0.0 0.5 1.0 1.5
Skinny Tails - S shaped-curve indicating shorter than normal tails (narrower, less variable, than expected)
−6 −4 −2 2 4 6 8 0.00 0.10 0.20 0.30 Theoretical Quantiles Sample Quantiles −3 −2 −1 1 2 3 −4 −2 2 4 6 8
Fat Tails - Curve starting below the normal line, bends to follow it, and ends above it (wider, more variable, than expected)
30
Summary of main ideas
- 1. Two types of probability distributions: discrete and continuous
- 2. Normal distribution is unimodal, symmetric, and follows the
69-95-99.7 rule
- 3. Z scores serve as a ruler for any distribution
- 4. Z distribution is normal with µ = 0 and σ = 1
- 5. Normally distributed data plot as a straight line on the normal
probability plot
31