Big Data and Privacy Engineering & Public Policy Rebecca - - PowerPoint PPT Presentation

big data and privacy
SMART_READER_LITE
LIVE PREVIEW

Big Data and Privacy Engineering & Public Policy Rebecca - - PowerPoint PPT Presentation

CyLab Big Data and Privacy Engineering & Public Policy Rebecca Balebako y & c S a e v c i u r P r Advisor: Dr. Lorrie Cranor i t e y l b L a a s b U o b r a a t L o y r C y U H D T T E P . U :


slide-1
SLIDE 1

1

C y L a b U s a b l e P r i v a c y & S e c u r i t y L a b

  • r

a t

  • r

y H T T P : / / C U P S . C S . C M U . E D U

Engineering & Public Policy

CyLab

Big Data and Privacy

Rebecca Balebako Advisor: Dr. Lorrie Cranor

slide-2
SLIDE 2

2

Agenda

  • Quiz
  • Important dates
  • Exam feedback
  • Benefits of big data
  • Concerns about big data
  • What is PCAST
  • PCAST big data recommendations
slide-3
SLIDE 3

3

slide-4
SLIDE 4

4

Important Dates

  • November 18th – Homework 7 due

– Time consuming: start today

  • November 20 – Draft project paper due
  • Dec 4th – Poster Fair
  • Dec 11th – Final exam 1-4pm
  • Dec 12th – Final project due
slide-5
SLIDE 5

5

Comments on Exam

slide-6
SLIDE 6

6

Benefits of big data

  • Scientific American “How Big Data Can Transform

Society for the Better’ Oct 13

  • Understanding the spread of Malaria in Kenya

through mobile phone usage patterns (Wesolowski, Science 2012)

  • Better public transportation through GPS tracking
  • Better public health through search queries
slide-7
SLIDE 7

7

Concerns about big data

  • Incremental Effect
  • Automated Decision-Making
  • Predictive Analysis
  • Lack of Access and Exclusion
  • Analytics
  • Chilling Effect

Omer Tene and Jules Polonetsky, Big Data for All: Privacy and User Control in the Age of Analytics, 11 Nw. J. Tech. & Intell.

  • Prop. 239 (2013).
slide-8
SLIDE 8

8

Solutions to the concerns?

slide-9
SLIDE 9

9

PCAST report on big data

slide-10
SLIDE 10

10

PCAST = President’s advisory committee on science and tech.

  • PCAST, staffed by OSTP
  • Since FDR in 1933, many US Presidents have had

a science and technology advisory board

  • PCAST named and chartered by George H. W.

Bush in 1990

  • Topics: Climate, MOOCs, agriculture, nuclear

security, big data

Seed Magazine,Recasting PCAST by Robert Koenig President's Advisory Committee Straddles Worlds Of Politics, Science Politics, Science (The Scientist, Vol:10, #6, p. 3 & 5, March 18, 1996)

slide-11
SLIDE 11

11

20 current PCAST members

  • Three Nobel laureates,
  • Four MacArthur Prize fellows,
  • Two university presidents
  • 4 members of the national scientific, engineering,
  • r medical academies
  • Google’s chief executive, Eric Schmidt,
  • Microsoft’s research chief, Craig Mundie.
slide-12
SLIDE 12

12

Decisions about and by PCAST

  • Who is a member
  • Number of members
  • Access to president
  • Funding available
  • Private or public meetings?

– The next public PCAST meeting will take place on November 14, 2014.

  • Big studies or small reports
slide-13
SLIDE 13

13

PCAST report on big data

slide-14
SLIDE 14

14

What’s new about big data

  • The quantity and variety of data that are available

to be processed.

  • The scale of analysis, inferences, and conclusions
  • Data fusion: “when data from different sources are

brought into contact and new facts emerge”

slide-15
SLIDE 15

15

~6000 mega Data centers

http://www.Google.com/about/datacenters

slide-16
SLIDE 16

16

Data: “Born digital” versus “born analog”

  • Born digital - data packets, such as intentionally

typed text, clicks, GPS location

– Overcollection of data is a red flag

  • Born analog – pictures, video, health info, voice

calls

– Signal to noise = More information than is strictly needed – Overcollection of data may be a good design choice

slide-17
SLIDE 17

17

Security Options for Big Data

  • Technologies that allow encrypted databases to be

queried

– E.g. Zero-knowledge systems – difficult to extend to complex/unstructured systems

  • Differential privacy

– Trade-off between anonymity and accuracy

  • Anonymization

– “PCAST does not see it as being a useful basis for policy”

  • Deletion

– “The only viable assumption …is that data, once created, are permanent”

slide-18
SLIDE 18

18

Other FIPPs that don’t seem viable

  • Notice and Consent
  • Collection Limitation

– Except perhaps attention to collection practices

  • A right to forget/ expiration
slide-19
SLIDE 19

19

Specific options that might work

  • Privacy profiles

– E.g. Developed by ACLU or Consumer Reports – standard, machine‐readable interfaces

  • Focus on use

– With metatags

slide-20
SLIDE 20

20

PCAST Policy Recommendations

  • 1. Focus more on use of data than collection and

analysis

  • 2. Policy should be on intended outcomes, not

technology solutions

slide-21
SLIDE 21

21

PCAST Policy Recommendations cont.

  • 3. Strengthen U.S. research in privacy‐related

technologies

  • 4. Encourage increased education and training
  • pportunities concerning privacy protection
slide-22
SLIDE 22

22

PCAST Policy Recommendations, cont.

  • 5. US should take the lead through standards and

procurement practices