STAT 373/ STAT 814_STAT714 Week 9
1
2019
1
Week 9: Example
1996 Australian Bureau of Statistics census data
- We will use the data from Australian
census 1996 as an example.
- Units: Local Government Areas (LGAs) of
NSW (182) at the time
- Data are in file LGA.MTW, available on
the unit iLearn.
2
LGAs
Albury, Armidale, Ashfield, Auburn, Ballina, Balranald, Bankstown, Barraba, Bathurst, Baulkham Hills, Bega Valley, Bellingen, Berrigan, Bingara, Blacktown, …….., Wollongong, Woollahra, Wyong, Yallaroi, Yarrowlumla, Yass,Young
3
Variables in LGA.mtw
Variable Mean Median Total M 18335 6997 Total F 18803 6893 Total P 37138 13890 GE15 M 14286 5233 GE15 F 14944 5193 GE15 P 29231 10426 Aborig M 269.9 146.5 Aborig F 277.5 142.0 Aborig P 547.3 292.0 Variable Mean Median AusBorn 13302 5991 AusBorn 13716 6010 AusBorn 27018 12001 OSBorn M 4246 512 OSBorn F 4256 432 OSBorn P 8502 935 AusCit M 16171 6274 AusCit F 16599 6330 AusCit P 32769 12605 Unempl M 944 345 Unempl F 609.7 209.5 Unempl P 1554 586
4
- We will be using these data to illustrate
sampling and estimation techniques.
- Sampling frame: list of 182 LGAs with IDs
from 1 to 182 (N=182 LGAs)
- We will estimate quantities such as total
- verseas-born population of NSW on the
basis of a random sample, and compare our answer with the actual total population.
5
Overseas-born population
- Say we wish to estimate the mean LGA OS-
born population, , and the total NSW OS- born population , on the basis of a simple random sample of size n = 30 LGAs. (Note: You can find instructions on obtaining a SRS in Minitab on slides 46 and 47 of this lecture)
- Histogram of number of OS-born in all 182
LGAs, ie, the population (see next page).
6
Very skewed population... normal approximation for sample mean (for n = 30) may be unlikely.
50000 100000 50 100
OSBorn P Frequency