STAT373/ STAT814_STAT714 Week 10
2019 1
1
Week 10: STRATIFIED SAMPLING LGA example
- We return to the problem of estimating the
mean number of overseas-born people per NSW LGA (1996).
- It seems plausible that overseas-born people
would be more likely to settle in urban rather than rural areas.
- So perhaps a stratification based on broad
geographical groupings of LGAs would be sensible.
2 SD id SD name Number of LGAs
5 Sydney 46 10 Hunter 14 15 Illawarra 4 20 Richmond-Tweed 7 25 Mid-North Coast 11 30 Northern 20 35 North Western 14 40 Central West 14 45 South Eastern 19 50 Murrumbidgee 14 55 Murray 16 60 Far West 3
Statistical Divisions (SD)
NOTE: LGAs (182 of them) are grouped into 12 Statistical Divisions (SDs). These become our strata.
3
Descriptive statistics of OS born by SD_id
Descriptive Statistics
Variable SD_id N Mean Median TrMean StDev OSBorn P 5 46 26426 20093 24728 19966 10 14 9731 1192 4070 23028 15 4 15311 7439 15311 19457 20 7 3048 3263 3048 3000 25 11 2113 1281 1823 2193 30 20 1084 350 554 2554 35 14 476 239 388 555 40 14 913 407 796 1001 45 19 1420 879 1267 1498 50 14 644 264 427 994 55 16 511 262 297 947 60 3 1486 972 1486 1333
4
55 50 60 35 30 20 25 10 5 15 45 40
5
Number of OS born in NSW LGAs by SD
5 10 15 20 25 30 35 40 45 50 55 60 50000 100000
SD_id OSBorn P
6
A reasonable stratification strategy for sampling LGAs would be to have the following three strata:
- Stratum 1:
5 (Sydney), 15 (Illawarra)
- Stratum 2:
10 (Hunter), 20 (Richmond-Tweed), 25 (Mid- North Coast), 45 (South Eastern)
- Stratum 3:
rest of NSW Note: Alternatively, Hunter (10), Sydney (5) or Illawarra (15) may be considered as a separate stratum due to its difference in variability.