postal code conversion for data analysis
play

Postal Code Conversion for Data Analysis An overview of the PCCF - PDF document

26/11/2015 Postal Code Conversion for Data Analysis An overview of the PCCF and PCCF+ Saeeda Khan Michael Tjepkema Health Analysis Division, Statistics Canada December 1, 2015 www.statcan.gc.ca Outline 1. Postal codes Components of a


  1. 26/11/2015 Postal Code Conversion for Data Analysis An overview of the PCCF and PCCF+ Saeeda Khan Michael Tjepkema Health Analysis Division, Statistics Canada December 1, 2015 www.statcan.gc.ca Outline 1. Postal codes • Components of a postal code • Uses of small-area data 2. Introduction to the Postal Code Conversion File (PCCF) and the Postal Code Conversion File Plus (PCCF+) 3. Single link indicator geocoding versus population- weighting 4. Why PCCF+? 5. Limitations of PCCF & PCCF+ Statistics Canada • Statistique Canada 11/26/2015 2 1

  2. 26/11/2015 1. Postal Codes Statistics Canada • Statistique Canada 11/26/2015 3 What are postal codes? • An identifier managed by Canada Post Corporation for the efficient sorting and delivery of mail. • They are not created as units for the analysis or mapping of population, business or dwelling characteristics. • However, postal codes are part of most administrative data sets and are usually the only variable available for geographic identification • Thus, they are important identifiers for geocoding Statistics Canada • Statistique Canada 11/26/2015 4 2

  3. 26/11/2015 Components of a postal code • The postal code is a six-character alphanumeric code • Postal codes are not geographic attributes • Only spatial in that mail is delivered by geographic area • Six character code ‘ANA NAN’ • First 3 – Forward Sortation Area (FSA) • Last 3 – Local Delivery Unit (LDU) Statistics Canada. Postal Codes Conversion File (PCCF), Reference Guide . Catalogue no. 92-153-G, no 02. Ottawa, ON: Statistics Canada, 2011. Statistics Canada • Statistique Canada 11/26/2015 5 What is a postal code? Province / Territory / Region First Character Newfoundland and Labrador A Nova Scotia B Prince Edward Island C ANA NAN New Brunswick E Eastern Québec G Forward Local Metropolitan Montréal H Sortation Delivery Area Unit Western Québec J Eastern Ontario K Central Ontario L if 0 then rural Metropolitan Toronto M if 1-9 then urban Southwestern Ontario N Northern Ontario P Manitoba R Saskatchewan S Alberta T British Columbia V Northwest Territories and Nunavut X Yukon Y Statistics Canada • Statistique Canada 11/26/2015 6 3

  4. 26/11/2015 Components of a postal code Statistics Canada • Statistique Canada 11/26/2015 7 Components of a postal code • Local Delivery Unit (LDU) • Letter carrier delivery to ordinary urban address • Community mailbox • Apartment building • Business building • Large firm or organisation (Foothills Medical Centre: T2N2T9; CBC: M5W 1E6) • Federal department or agency (Statistics Canada: K1A 0T6) • Mail delivery route (suburban, rural, or mobile) • General delivery and post office boxes (large or small) Statistics Canada. Postal Codes Conversion File (PCCF), Reference Guide . Catalogue no. 92-153-G, no 02. Ottawa, ON: Statistics Canada, 2011. Statistics Canada • Statistique Canada 11/26/2015 8 4

  5. 26/11/2015 Components of a postal code Haydu G. The Postal Code – Geographic classification code conversion file, a tool for social science research . Paper presented at the 1979 annual meeting of the Canadian Association of Geographers, Victoria, BC, Canada. Statistics Canada • Statistique Canada 11/26/2015 9 How can postal codes be used for analysis • Postal codes are part of most administrative data sets • PCCF, PCCF+, and related tools are now the standard • Allows for the conversion of address and postal code attributes to standard geographical codes • Used in data collection, processing, and analysis, e.g., dissemination area (DA), census tract (CT), health region (HR) • Resulting small-area geography have a variety of uses • Familiarity with the methods, strengths, and limitations will help researchers exploit the potential Statistics Canada • Statistique Canada 11/26/2015 10 5

  6. 26/11/2015 Uses of small area data • Add policy relevance by aggregating to admin areas • Health Regions, School Districts, etc… • Deal with changes over time (boundary shifts) • Assign neighbourhood socio-economic status (SES) and other confounders • Determine point-distance, road distance, travel time • Allow for studies of migration over time (longitudinal) • Help in the imputation of missing data • Obtain additional identifiers for record linkage Statistics Canada • Statistique Canada 11/26/2015 11 2. Introduction to the PCCF and PCCF+ Statistics Canada • Statistique Canada 11/26/2015 12 6

  7. 26/11/2015 What is the PCCF? • A flat file that links postal codes (active and retired) to standard geographic areas • Allows for: • Association of postal codes to standard geographic areas • Selection of statistical units by geographic areas • Provides linkages (including a single link indicator (SLI)) to block face (BF), dissemination block (DB), and dissemination area (DA) • However, some postal codes are only linked to post office locations, many serve multiple DAs, and some are non-residential (government offices, etc) Statistics Canada. Postal Codes Conversion File (PCCF), Reference Guide . Catalogue no. 92-153-G, no 02. Ottawa, ON: Statistics Canada, 2011. Statistics Canada • Statistique Canada 11/26/2015 13 What is the PCCF+? • The PCCF+ consists of: 1. SAS control program, 2. reference files primarily derived from the PCCF 3. postal code population-weight file derived from the Census of Population • Assigns geographic identifiers based on postal codes • Full diagnostic output (troublesome postal codes, precision of geocoding, etc.) • Provides residential & institutional coding separately Wilkins R, Peters PA. PCCF+ Version 5K User’s Guide: Automated geocoding based on the Statistics Canada Postal Code Conversion File . Catalogue no. 82F0086-XDB. Ottawa, ON: Statistics Canada, 2011. Statistics Canada • Statistique Canada 11/26/2015 14 7

  8. 26/11/2015 Importance of Identifying Non-residential PCs • PCCF+ is able to identify non-residential postal codes • Government Offices, e.g., Statistics Canada • Coroners Offices • Children’s Aid Societies • Hospitals in a Birth File • Tax preparers office in a Tax File • UPS Store, Mailboxes Etc , Statistics Canada • Statistique Canada 11/26/2015 15 How does the PCCF+ geocode postal codes? • Assigns geographic identifiers based on postal codes in a staged approached: 1. assigns 6-digit postal codes in rural areas to disseminations areas (DA) and dissemination blocks (DB) using population- weighted random allocation 2. assigns 6-digit postal codes with an exact match to a PCCF unique record 3. randomly assigns 6-digit postal codes with an exact match to a PCCF duplicate record 4. imputes full geography for the first 5-, first 4- and first 3- digit postal codes using census population weights 5. imputes partial geography for the first 2-digit postal codes Wilkins R, Peters PA. PCCF+ Version 5K User’s Guide: Automated geocoding based on the Statistics Canada Postal Code Conversion File . Catalogue no. 82F0086-XDB. Ottawa, ON: Statistics Canada, 2011. Statistics Canada • Statistique Canada 11/26/2015 16 8

  9. 26/11/2015 Uses of the PCCF and the PCCF+ • A 2011 literature review for publications using the PCCF and PCCF+ resulted in 622 publications • Health Sciences 463 (74%) • Social Sciences & Economics 93 (15%) • Education, data, & statistics 34 (6%) • Natural & applied sciences 12 (2%) • Other 20 (3%) • Articles appeared in 233 different journals, top two: • Canadian Medical Association Journal (23) • Canadian Journal of Public Health (19) Peller P. An analysis of the Postal Code Conversion File’s use in research . DLI research paper series, 2011. Calgary, AB: University of Calgary. Statistics Canada • Statistique Canada 11/26/2015 17 3. PCCF-SLI vs. PCCF+ Statistics Canada • Statistique Canada 11/26/2015 18 9

  10. 26/11/2015 Single-link (PCCF-SLI) vs. PCCF+ • PCCF-SLI forces each postal code to be assigned to a single dissemination area (DA) & dissemination block (DB), regardless of how large the actual service area may be • For most research purposes, the distribution of the population across the entire service area is needed • PCCF+ uses a population-weighted method of geocoding where multiple-matches are possible • As such, the distribution of respondents more accurately reflects the underlying population • “Numerator - denominator consistency” Statistics Canada • Statistique Canada 11/26/2015 19 A1A 1A1 DA 1 DA 3 60% 10% DA 2 30% PCCF (SLI) PCCF+ A1A 1A1 A1A 1A1 10 6 0 1 0 3 Of 10 records reporting this postal code, Of 10 records reporting this postal code, 6 all 10 will be assigned to DA 1 using the will be assigned to DA 1, 3 to DA2 and 1 to PCCF single link indicator (SLI) DA 3 using the PCCF+ Statistics Canada • Statistique Canada 11/26/2015 20 10

  11. 26/11/2015 Population assignment using PCCF-SLI Saskatchewan Manitoba Alberta Statistics Canada • Statistique Canada 11/26/2015 21 Population assignment using PCCF+ Saskatchewan Manitoba Alberta Statistics Canada • Statistique Canada 11/26/2015 22 11

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend