S A M P L I N G MPA 630: Data Science for Public Management - - PowerPoint PPT Presentation

s a m p l i n g
SMART_READER_LITE
LIVE PREVIEW

S A M P L I N G MPA 630: Data Science for Public Management - - PowerPoint PPT Presentation

S A M P L I N G MPA 630: Data Science for Public Management November 1, 2018 Fill out your reading report on Learning Suite P L A N F O R T O D A Y Exam 2 Sampling vocabulary Sampling in real life Sampling with computers E X A M 2 S A


slide-1
SLIDE 1

S A M P L I N G

MPA 630: Data Science for Public Management November 1, 2018

Fill out your reading report

  • n Learning Suite
slide-2
SLIDE 2

P L A N F O R T O D A Y Exam 2 Sampling vocabulary Sampling in real life Sampling with computers

slide-3
SLIDE 3

E X A M 2

slide-4
SLIDE 4

S A M P L I N G V O C A B U L A R Y

slide-5
SLIDE 5

D E F I N I N G T H E P O P U L A T I O N Population

A collection of things in the world

Population parameter

Something we want to know about the population

slide-6
SLIDE 6

C O U N T I N G T H E P O P U L A T I O N Census

Count every single thing in the whole population

Sampling

Select parts of the population and count those

slide-7
SLIDE 7

M E A S U R E T H E S A M P L E Sample statistic or point estimate

The population parameter, but for the sample

Uses the hat sign; p-hat

slide-8
SLIDE 8

I S T H E S A M P L E G O O D ?

Representativeness

Does the sample look like the population?

Generalizability

Is p-hat a good guess of p?

Bias and randomness

Does every part of the population have the same chance of being sampled?

slide-9
SLIDE 9

W H Y E V E N D O T H I S ?

If a sample is taken at random… …it will be unbiased and representative… …and the sample estimates can generalize to the whole population

(within a confidence interval)

Censuses are expensive and often impossible

slide-10
SLIDE 10

W H A T I F * Y O U * A R E N ’ T C O U N T E D ?

Sampling gets us accurate estimates

  • f population parameter—even if

samples seem small!

Statistical power

slide-11
SLIDE 11

S A M P L I N G I N R E A L L I F E

slide-12
SLIDE 12

M & M S A M P L I N G

Define the population

What thing are we counting? What parameter are we measuring?

Count the population

Census or sample?

Measure the sample

What is our p-hat?

Is the sample good?

Is the sample representative? Is the sample biased? Is p-hat a good guess?

slide-13
SLIDE 13
slide-14
SLIDE 14

T H E T R U E P

Plant City Blue Brown Green Orange Red Yellow CLV Cleveland, OH 20.7% 12.4% 19.8% 20.5% 13.1% 13.5% HKP Hackettstown, NJ 25.0% 12.5% 12.5% 25.0% 12.5% 12.5%

“Our color blends were selected by conducting consumer preference tests, which indicate the assortment of colors that pleased the greatest number of people and created the most attractive overall effect.” “Each large production batch is blended to those ratios and mixed thoroughly. However, since the individual packages are filled by weight on high-speed equipment, and not by count, it is possible to have an unusual color distribution”

slide-15
SLIDE 15

I M P R O V I N G P

What can we do to get a better estimate

  • f the whole population of M&Ms?

Bigger samples? More samples? Bigger sample size = better sampling

slide-16
SLIDE 16

S A M P L I N G W I T H C O M P U T E R S