aine@sensiblecode.io duncan@sensiblecode.io 2021 Census Outputs - - PowerPoint PPT Presentation

aine sensiblecode io duncan sensiblecode io 2021 census
SMART_READER_LITE
LIVE PREVIEW

aine@sensiblecode.io duncan@sensiblecode.io 2021 Census Outputs - - PowerPoint PPT Presentation

aine@sensiblecode.io duncan@sensiblecode.io 2021 Census Outputs and Dissemination Update Suzie Dunsmith & Neil Townsend ONS June 2019 Were committed to delivering 2021 Census results earlier, more flexibly and with greater


slide-1
SLIDE 1

aine@sensiblecode.io duncan@sensiblecode.io

slide-2
SLIDE 2

Suzie Dunsmith & Neil Townsend ONS

2021 Census Outputs Update and Dissemination

June 2019

slide-3
SLIDE 3
  • Using innovative methods developed by our Statistical Disclosure

methods into a “proof of concept” prototype across several workstreams

We’re committed to delivering 2021 Census results earlier, more flexibly and with greater accessibility

Control experts we designed an approach to dissemination which meets these aims

  • Last year we worked with Sensible Code Company who built these
  • We are now developing methods, processes and specifications

June 2019

slide-4
SLIDE 4
  • Origin-destination outputs
  • Metadata incl W

NS accreditation – OSR consultation UK data

  • Output content – derived variables, classifications, geography

etc Analysing table design to inform dissemination development

  • Microdata samples

Admin data integration

  • elsh language requirements
  • Analysis and data visualisation
  • June 2019
slide-5
SLIDE 5

About the 2021 Census and/or other areas of ONS: Respond to the Of s: Any questions or feedback please contact: fice for Statistics Regulation’ user consultation

June 2019

slide-6
SLIDE 6

Intr ase S active session/ Q&A AGENDA

  • ductions SensibleCode/Welsh Government

Census 2021 ONS, UK C tudy Inter

6 | Sensible Code

slide-7
SLIDE 7

7 | Sensible Code

“To learn more about the challenges being faced by professionals who are considering privacy issues on a regular basis; how they address these issues given the desire to open data and the fact that many more sources of data are being made available. What's being considered and the factors influencing these decisions”

Our Challenge

slide-8
SLIDE 8

We make products that modernise the processing and dissemination of data

8 | Sensible Code

slide-9
SLIDE 9
  • sur

easing capacity is a challenge e t ving tech is new & landscape is foggy Problems disclosure control is a manual process

  • ge of new data sources
  • incr
  • pressur
  • publish more and sooner
  • privacy preser

9 | Sensible Code

slide-10
SLIDE 10

10 | Sensible Code

slide-11
SLIDE 11
  • NSIs want to
  • the collection date

more gr . modernise and automate SDC.

  • Disseminating data closer t

increases their value to the economy.

  • Users expect to be able to see

anular data for more diverse populations.

  • Users want to query the data more flexibly

11 | Sensible Code

slide-12
SLIDE 12

Flexible dissemination through real-time application of disclosure control techniques in response to user queries

12 | Sensible Code

slide-13
SLIDE 13

13 | Sensible Code

slide-14
SLIDE 14

14 | Sensible Code

slide-15
SLIDE 15

TableBuilder: what does it do?

  • Allow users t

Fle eal-time e Contr edact data if necessar Best-in-class aggregation speed using an optimized data format

  • choose “any” output table within limits

○ dubbed “ xible Dissemination”

  • In r

: ○ apply perturbative Statistical Disclosur

  • l (SDC)

○ use SDC rules post perturbation and r y

15 | Sensible Code

slide-16
SLIDE 16

16 | Sensible Code

How it works

slide-17
SLIDE 17

Census Data

Person 41 mappings dataset 28 variables 57 million rows 11 mappings 11 variables Household dataset 22 million rows Join both datasets to associate household variables and mappings with people

17 | Sensible Code

slide-18
SLIDE 18

2 (O (aver

  • wer Layer

A) (MSO 7,200 L

Geographical Data

Countries 10 Regions 350

  • cal Authorities (LA)

Middle Layer A) 35,000 L (LSO 180,000 Output Areas A) age about 300 people)

| 18 Sensible Code

slide-19
SLIDE 19

○ consist turbation must pass all of the rules

  • TableBuilder does perturbation using the cell-key method

some modifications for ONS ○ ent zero perturbation: always query whole data set

  • Apply post-per

rules ○ a publishable table

Statistical Disclosure Control (SDC)

19 | Sensible Code

slide-20
SLIDE 20

○ P ableBuilder:

  • count at
  • Naive approach

Force zero the “impossible” combinations of categories ○ roblem: enumerating all the combinations

  • T

automatic preservation of structural zeros ○ Use zer higher geographic level as indicator ○ Sensitive to geographic variation

SDC: Handling “Structural” Zeros

Sensible Code 20 |

slide-21
SLIDE 21

21 | Sensible Code

slide-22
SLIDE 22

○ Selective by e tables ar e

  • Formalise SDC “rules”

Publishable tables must pass all of the rules

  • geography

○ mor e available in areas with diverse population

  • Data controllers can xperiment with rule parameters

SDC: Which tables can be published?

22 | Sensible Code

slide-23
SLIDE 23

23 | Sensible Code

slide-24
SLIDE 24

Demonstration

slide-25
SLIDE 25

aine@sensiblecode.io

Q & A - Thank you