canada data and resources
play

Canada Data and Resources Hugh McCague Valerie Preston Walter - PowerPoint PPT Presentation

Accessing Statistics Canada Data and Resources Hugh McCague Valerie Preston Walter Giesbrecht Sara Tumpane Outline Survey Terminology Research Data Centre (RDC) RDC versus Public Use Microdata Files (PUMF) Accessing the RDC


  1. Accessing Statistics Canada Data and Resources Hugh McCague Valerie Preston Walter Giesbrecht Sara Tumpane

  2. Outline • Survey Terminology • Research Data Centre (RDC) • RDC versus Public Use Microdata Files (PUMF) • Accessing the RDC • Statistics Canada Surveys and Data • Statistical Software • Statistical Consulting Service • Resources

  3. Some Survey Terminology • Population • Elements • Sample: Simple Random Sample, Probability Sample • Response Rate • Weights: Simple Weights 3

  4. Some Survey Terminology • Demographics • Strata • Clusters (primary sampling units, PSUs) • Complex Sample • Complex Weights, Bootstrap and Jackknife Replicate Weights 4

  5. Some Survey Terminology • Cross-sectional data • Longitudinal data : periods, waves, cycles, trajectory, life course • Attrition : attrition rate. • Helpful reference : Ornstein, Michael. A Companion to Survey Research . London; Thousand Oaks, CA: SAGE, 2013. 5

  6. Research Data Center (RDC) • Access to Statistics Canada data and statistical software • Microdata & administrative data • For York students and faculty, access is free • A “secure” environment • Researchers are “deemed employees” of Statistics Canada • Must work in RDC • CRDCN Network

  7. The CRDCN Network

  8. York RDC • 282 York Lanes • Staffed by: Analyst Sara Tumpane (yorkrdc2@yorku.ca) • Assistant Theresa Kim (yorkrdc3@yorku.ca) • • 8 workstations • Open 3 days/ wk • http://www.isr.yorku.ca/rdc / 8

  9. Before you apply to the RDC… • Consider your options • Is what you need in some more readily accessible source (either PUMF or aggregate file)

  10. RDC or PUMF? Confidential Microdata in Research Public Use Microdata Files accessed Data Centres online Characteristics: Characteristics: o Contains most of the original o Manipulated by aggregating, information collected during the capping, or deleting variables that survey could be “identifiers”; survey o Continuous variables are accessible respondents cannot be identified o Longitudinal identifiers provided o Many continuous variables o Contains bootstrap weights used for transformed into categorical calculating exact variance variables o Longitudinal identifiers stripped Access is appropriate when: Access is appropriate when: o Sensitive variables not provided in o Immediate data access is required o Analysis is for a course paper or PUMF o A PUMF does not exist equivalent o Longitudinal data is necessary o Data exploration o Analytical work is complex in nature

  11. Labour Force Survey PUMF Master file • Demographic variables • Demographic variables o o Geography Geography o o Age Age o o Sex Sex o o Marital status Marital status o Country of birth o Country completed highest post- secondary degree/certificate/diploma o Landed immigrant status o Detailed Aboriginal status

  12. CCHS 2012 Example 1 PUMF Master File • 1815 variables • 1381 variables • Sources of personal income • Sources of personal income o wages and salaries o Employment inc. o income from self-employment o o EI/Worker's comp dividends and interest o employment insurance o Senior benefits o worker's compensation o Other o CPP or QPP o job related retirement pensions o RRSP/RRIF o OAS and GIS o social assistance/welfare o child tax benefits o child support o alimony o other o none

  13. CCHS 2012 Example 2 PUMF Master File • Geography • Geography o Province of residence of respondent o Province of residence of respondent-(G) o Postal code - (D) o Health Region - (G) o Health region of residence of respondent - (D) o B.C. Health Authority (BCHA) - (D) o Sub-health region (Québec only) - (D) o Nova Scotia district health authority o British Columbia local health authority - (D) o Regional health authority (RHA) - Alberta - (D) o British Columbia health authority - (D) o Local health integrated networks - Ontario - (D) o 2006 census dissemination area o Federal electoral district - (D) o Census subdivision - (D) o Census division - (D) o Statistical area classification type - (D) o 2006 Census metropolitan area (CMA) o Health region peer group o Urban and rural areas o Urban and rural areas - 2 levels - (D) o Subzones for Alberta o Manitoba health authority - (D)

  14. Accessing PUMFs & master file metadata • Statistics Canada Nesstar data portal o metadata only, for PUMFs and master files o http://www62.statcan.ca/webview/ • YUL: Data & Statistics library guide o http://researchguides.library.yorku.ca/data • <odesi> (OCUL) o http://www.library.yorku.ca/e/resolver/id/1165738

  15. http://www.andertoons.com/data/cartoon/6543/things-good-stuff-ok-i-reiterate-request-for-specific-data

  16. How to apply to an RDC and available datasets • RDC Application Pages • Data available in the RDCs • SSHRC Website

  17. Accessing the RDC Action Timeline Notes Provide list of academic Apply through the 1-2 Hours contributions, project SSHRC website proposal Approval based on Evaluation of the relevance of methods and 2-4 Weeks data, and demonstrated proposal need for microdata Security screening 1-3 Weeks for approval process Sign Microdata Research 1-3 Weeks for approval Contract

  18. Project Proposal • The project proposal includes the following elements: o Title of the Project o Rationale and objectives of the study o Proposed data analysis and software requirements o Data requirements o Expected project start and end dates o Expected products o References

  19. Data at the RDC • Labour Force Survey (LFS): 1976 - 2014 o Monthly estimates of employment & unemployment o Rotating 6 month panel, N= ~ 16,500 • Paper: Seasonal Adjustment, Demography, and GDP Growth , Dunbar, G.R. (2013), Canadian Journal of Economics • Survey of Labour and Income Dynamics (SLID): 1993 – 2011 o Changes in well-being over time o Overlapping 6 year panels, N= ~ 17,000 • Paper: An Empirical Model of Tax Convexity and Self-Employment, Wen, J-F. & Gordon, D. (2014), The Review of Economics and Statistics • Workplace and Employee Survey (WES): 1999 – 2006 o Employer: competitiveness, innovation, technology use: N= ~ 6,300 o Employee: training, job stability, earnings: N= ~24,000 • Paper: Organizational Redesign, Information Technologies and Workplace Productivity , Dostie, B. & Jayaraman, R. (2012), The B.E. Journal of Economic Analysis and Policy

  20. Data (continued) • Survey of Household Spending (SHS): 1986 - 2012 o Spending, investments, and savings: household and person o Cross-sectional: N= ~17,000 (households) • Paper: Does One Size Fit All? The CPI and Canadian Seniors , Brzozowski, M. (2006), Canadian Public Policy • Survey of Financial Security (SFS): 1999 – 2012 o Net worth (wealth) of Canadian families: assets, debt, employment, income, education o Cross-sectional: N= ~20,000 (households) • Paper: New Evidence on Taxes and Portfolio Choice , Atalay, K. et al. (2009), Social and Economic Dimensions of an Aging Population (SEDAP) Research Papers • Census & National Household Survey (NHS): 1911 – 2011 o Demographic, social, and economic characteristics o Cross-sectional (mandatory): 20% sample, N= ~6,000,000 • Paper: Quality of Life, Firm Productivity, and the Value of Amenities Across Canadian Cities , Albouy, D. Leibovici, F. & Warman, C. (2013), Canadian Journal of Economics

  21. Data by Themes • Health and Health Care • National Population Health Survey (NPHS) • Participation and Activity Limitation Survey (PALS) • Canadian Tobacco, Alcohol and Drugs Survey (CTADS) • Occupations and Organizations • Workplace and Employee Survey (WES) • Survey of Labour and Income Dynamics (SLID) • Census • Education • Youth in Transition Survey (YITS) • National Graduates Survey (NGS) • Race and Ethnicity • Aboriginal Peoples Survey (APS) • Longitudinal Survey of Immigrants to Canada (LSIC) • Ethnic Diversity Survey (EDS)

  22. Pilot Data • Canadian Cancer Registry (CCR) • Vital Statistics • Uniform Crime Reporting • Homicide Survey • Hate Crime Data • Ministry of Community and Social Services (MCSS) • Citizenship and Immigration Canada (CIC)

  23. Which Statistical Software to use at the York RDC? Features to Consider • SPSS 23 • SAS 9.4 • Stata 13 • R 3.0.3 Statistical Software Resources: Institute for Digital Research and Educations (idre), UCLA http://www.ats.ucla.edu/stat/

  24. Statistical Consulting Service (SCS ) • Statistical Consulting provided by a group of York faculty and graduate students with staff at the Institute for Social Research (ISR). • Usually, no fee for York faculty and student researchers • Online appointment scheduler 24

  25. http://truthfacts.com/truthfacts/2014/04/09

  26. Statistical Consulting Service (SCS ) • ISR/SCS Short Courses and Spring Seminar Series on data analysis, qualitative research methods, survey methods, and related software • More details: http://www.isryorku.ca/centres/scs/ 26

  27. Contact Information and Resources • http://www.isryorku.ca/econ

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend