Chapter 3 Chapter 3
Data Description
1 McGraw-Hill, Bluman, 7th ed, Chapter 3
Chapter 3 Chapter 3 Data Description McGraw-Hill, Bluman, 7 th ed, - - PowerPoint PPT Presentation
Chapter 3 Chapter 3 Data Description McGraw-Hill, Bluman, 7 th ed, Chapter 3 1 Ch Chapter 3 Overview t 3 O i Introduction Introduction 3-1 Measures of Central Tendency 3-2 Measures of Variation 3-3 Measures of Position 3
1 McGraw-Hill, Bluman, 7th ed, Chapter 3
3-1 Measures of Central Tendency 3-2 Measures of Variation 3-3 Measures of Position 3 4 Exploratory Data Analysis 3-4 Exploratory Data Analysis
2 Bluman, Chapter 3
3 Bluman, Chapter 3
Average Variation Position
4 Bluman, Chapter 3
A statistic
A statistic
A parameter
5 Bluman, Chapter 3
6 Bluman, Chapter 3
Mean Median Mode Mode Midrange Weighted Mean
7 Bluman, Chapter 3
The mean
The mean
The symbol is used for sample mean.
1 2 3 n
X X X X X X + + + + = = ∑
X n n = =
1 2 3 N
X X X X X μ + + + + = = ∑
N μ = =
8 Bluman, Chapter 3
9 Bluman, Chapter 3
1 2 3 n
X X X X X X + + + + = = ∑
n n 20 26 40 36 23 42 35 24 30 276 30.7 X + + + + + + + + = = = 9 9
10 Bluman, Chapter 3
11 Bluman, Chapter 3
The mean for grouped data is calculated
m
f X X n ⋅ = ∑
12 Bluman, Chapter 3
13 Bluman, Chapter 3
Class Boundaries Frequency
q y 5.5 - 10.5 10.5 - 15.5 1 2 15.5 - 20.5 20.5 - 25.5 25 5 30 5 3 5 4 25.5 - 30.5 30.5 - 35.5 35.5 - 40.5 4 3 2 Σf = 20
14 Bluman, Chapter 3
Class Frequency, f Midpoint, Xm f ·Xm 5.5 - 10.5 10.5 - 15.5 15 5 20 5 1 2 3 8 13 18 8 26 54 15.5 - 20.5 20.5 - 25.5 25.5 - 30.5 3 5 4 18 23 28 54 115 112 30.5 - 35.5 35.5 - 40.5 3 2 33 38 Σf = 20 99 76 Σ f ·Xm = 490 Σf 20 Σ f Xm 490
490 24.5 miles
m
f X X ⋅ = = =
15 Bluman, Chapter 3
24.5 miles 20 X n
The median
The median will be one of the data values
The median will be the average of two
16 Bluman, Chapter 3
17 Bluman, Chapter 3
292, 300, 311, 401, 596, 618, 713 292, 300, 311, 401, 596, 618, 713
MD = 401
18 Bluman, Chapter 3
19 Bluman, Chapter 3
684 764 656 702 856 1133 1132 1303 684, 764, 656, 702, 856, 1133, 1132, 1303
656, 684, 702, 764, 856, 1132, 1133, 1303
764 856 1620 +
764 856 1620 MD 810 2 2 + = = =
20 Bluman, Chapter 3
The mode
It is sometimes said to be the most typical
There may be no mode, one mode
21 Bluman, Chapter 3
22 Bluman, Chapter 3
18 0 14 0 34 5 10 11 3 10 12 4 10 18.0, 14.0, 34.5, 10, 11.3, 10, 12.4, 10
10, 10, 10, 11.3, 12.4, 14.0, 18.0, 34.5
23 Bluman, Chapter 3
24 Bluman, Chapter 3
110 731 1031 84 20 118 1162 1977 103 752 110, 731, 1031, 84, 20, 118, 1162, 1977, 103, 752
25 Bluman, Chapter 3
26 Bluman, Chapter 3
104 104 104 104 104 107 109 109 109 110 109 111 112 111 109 104 104 104 104 104 107 109 109 109 110 109 111 112 111 109
27 Bluman, Chapter 3
28 Bluman, Chapter 3
Class Frequency 5.5 – 10.5 1
10.5 – 15.5 2 15.5 – 20.5 3 20.5 – 25.5 5 25.5 – 30.5 4 30.5 – 35.5 3 35 5 40 5 2
35.5 – 40.5 2
29 Bluman, Chapter 3
The midrange
30 Bluman, Chapter 3
31 Bluman, Chapter 3
2, 3, 6, 8, 4, 1
1 8 9 MR 4.5 2 2 + = = =
32 Bluman, Chapter 3
Find the weighted mean
1 1 2 2
1 2 2 1 2 n n n
Bluman, Chapter 3
34 Bluman, Chapter 3
Course Credits, w Grade, X English Composition 3 A (4 points) g p 3 A (4 points) Introduction to Psychology 3 C (2 points) Biology 4 B (3 points) Ph i l Ed ti 2 D (1 i t)
wX
X = ∑
Physical Education 2 D (1 point)
32 2 7
3 4 3 2 4 3 2 1
=
⋅ + ⋅ + ⋅ + ⋅ = =
w
X = ∑
2.7 12
3 3 4 2
=
= = + + +
35 Bluman, Chapter 3
Uses all data values. Varies less than the median or mode Used in computing other statistics such as Used in computing other statistics, such as
Unique usually not one of the data values Unique, usually not one of the data values Cannot be used with open-ended classes Affected by extremely high or low values,
36 Bluman, Chapter 3
Gives the midpoint Used when it is necessary to find out
Can be used for an open-ended Can be used for an open ended
Affected less than the mean by extremely Affected less than the mean by extremely
37 Bluman, Chapter 3
Used when the most typical case is
Easiest average to compute
Can be used with nominal data Not always unique or may not exist Not always unique or may not exist
38 Bluman, Chapter 3
Easy to compute. Gives the midpoint. Affected by extremely high or low values in Affected by extremely high or low values in
39 Bluman, Chapter 3
40 Bluman, Chapter 3
Range
Variance Standard Deviation Coefficient of Variation Chebyshev’s Theorem Empirical Rule (Normal) Empirical Rule (Normal)
41 Bluman, Chapter 3
The range
42 Bluman, Chapter 3
43 Bluman, Chapter 3
Brand A Brand B 10 35 10 35 60 45 50 30 30 35 40 40 20 25
44 Bluman, Chapter 3
20 25
Brand A Brand B 10 35
210 35 Brand A: 6 X N
μ
= =
= ∑
10 35 60 45 50 30
6 60 10 50 N R = − = 210 X
30 35 40 40 20 25
210 35 Brand B: 6 45 25 20 X R
N μ
= = = =
= ∑
45 25 20 R = − =
The average for both brands is the same, but the range The average for both brands is the same, but the range for Brand A is much greater than the range for Brand B. Which brand would you buy?
45 Bluman, Chapter 3
Which brand would you buy?
The variance
The standard deviation
The standard deviation
The standard deviation is a measure of
46 Bluman, Chapter 3
To determine the spread of the data. To determine the consistency of a To determine the consistency of a
To determine the number of data values To determine the number of data values
Used in inferential statistics.
47 Bluman, Chapter 3
The population variance
2 2
X N μ σ − = ∑
The population standard deviation
N
2
2
X N μ σ − = ∑
48 Bluman, Chapter 3
49 Bluman, Chapter 3
Months, X µ X - µ (X - µ)2
2 2
X μ σ − = ∑
10 60 50 35 35 35
25 15 625 625 225
1750 6 n =
50 30 40 35 35 35 15
5 225 25 25
1750 6 291.7 =
20 35
225 1750
1750 6 17 1 σ =
50 Bluman, Chapter 3
17.1 =
The sample variance
2 2
1 X X s n − = −
The sample standard deviation
1 n
2
2
1 X X s n − = −
51 Bluman, Chapter 3
Is mathematically equivalent to the
Saves time when calculating by hand Saves time when calculating by hand Does not use the mean Is more accurate when the mean has
52 Bluman, Chapter 3
The sample variance
2
2 2
X X
The sample standard deviation
2
53 Bluman, Chapter 3
54 Bluman, Chapter 3
11.2, 11.9, 12.0, 12.8, 13.4, 14.3 X X 2 11.2 125.44
2
2 2
X X
11.9 12.9 12.8 141.61 166.41 163.84
2 2
75.6
2
958 94 12.8 13.4 14.3 163.84 179.56 204.49 75 6
2 2
958.94
55 Bluman, Chapter 3
75.6
2 2
56 Bluman, Chapter 3
57 Bluman, Chapter 3
5 100% 5 7% S l CV 100% 5.7% Sales 87 CVar = ⋅ = 773 100% 14 8% Commissions CVar
100% 14.8% Commissions 5225 CVar = ⋅ =
58 Bluman, Chapter 3
59 Bluman, Chapter 3
60 Bluman, Chapter 3
# of standard Minimum Proportion within k standard Minimum Percentage within k standard standard deviations, k within k standard deviations within k standard deviations
61 Bluman, Chapter 3
# of Minimum Proportion Minimum Percentage standard deviations, k within k standard deviations within k standard deviations
2 1-1/4=3/4 75% 3 1-1/9=8/9 88.89% 4 1-1/16=15/16 93.75%
62 Bluman, Chapter 3
63 Bluman, Chapter 3
64 Bluman, Chapter 3
50,000 – 2(10,000) = 30,000 ( ) 50,000 + 2(10,000) = 70,000 At least 75% of all homes sold in the area will have a
65 Bluman, Chapter 3
price range from $30,000 and $75,000.
66 Bluman, Chapter 3
2 2
2 2
At least 84% of the data values will fall between $0 20 d $0 30
67 Bluman, Chapter 3
$0.20 and $0.30.
# of standard d i ti k Proportion within k standard d i ti deviations, k deviations
68 Bluman, Chapter 3
69 Bluman, Chapter 3
Z-score Percentile Quartile Outlier
70 Bluman, Chapter 3
A z-
A z-score represents the number of
71 Bluman, Chapter 3
72 Bluman, Chapter 3
She has a higher relative position in the Calculus class.
73 Bluman, Chapter 3
She has a higher relative position in the Calculus class.
Percentiles
A percentile rank for a datum represents
74 Bluman, Chapter 3
75 Bluman, Chapter 3
76 Bluman, Chapter 3
18, 15, 12, 6, 8, 2, 3, 5, 20, 10
2, 3, 5, 6, 8, 10, 12, 15, 18, 20
6 values
# of values below 0.5 100% total # of values X Percentile + = ⋅
A student whose score was 12 did better than
6 0.5 100% 10 + = ⋅
77 Bluman, Chapter 3
65% of the class.
65% =
78 Bluman, Chapter 3
18, 15, 12, 6, 8, 2, 3, 5, 20, 10
2, 3, 5, 6, 8, 10, 12, 15, 18, 20
100 n p c ⋅ = 10 25 2.5 100 ⋅ = = 3 ≈
The value 5 corresponds to the 25th percentile.
79 Bluman, Chapter 3
p p
Deciles
Quartiles
Q2 = median(Low,High)
The Interquartile
The Interquartile
80 Bluman, Chapter 3
81 Bluman, Chapter 3
15, 13, 6, 5, 12, 50, 22, 18
5 6 12 13 15 18 22 50 5, 6, 12, 13, 15, 18, 22, 50
6 12 Q 9 median Low MD + = = =
13 15 Q 14 median Low High + = = =
1
Q , 9 2 median Low MD = = =
2
Q , 14 2 median Low High = = =
3
18 22 Q , 20 median MD High + = = =
82 Bluman, Chapter 3
3
Q , 2 g
An outlier
A data value less than Q1 – 1.5(IQR) or
83 Bluman, Chapter 3
The Five
The Five-Number Summary can be
84 Bluman, Chapter 3
85 Bluman, Chapter 2
86 Bluman, Chapter 3
89, 47, 164, 296, 30, 215, 138, 78, 48, 39 30, 39, 47, 48, 78, 89, 138, 164, 215, 296 30, 39, , 8, 8, 89, 38, 6 , 5, 96
Q1 Q3 MD Low High
30 47 83.5 164 296 30 296
87 Bluman, Chapter 3