DATA PROTECTION RIGHT Prof. dr. Mireille Hildebrandt Interfacing - PowerPoint PPT Presentation

GETTING DATA PROTECTION RIGHT Prof. dr. Mireille Hildebrandt Interfacing Law & Technology Vrije Universiteit Brussel Smart Environments, Data Protection & the Rule of Law Radboud University

21/2/17 Hildebrandt SNS seminar Stockholm 2

what’s next? 1. 1. From m online ine to onlif ife 2. 2. Machine ne Learning rning 3. 3. Data a Protec ectio tion 21/2/17 Hildebrandt SNS seminar Stockholm 3

what’s next? 1. From om on onli line to o on onli life 21/2/17 Hildebrandt SNS seminar Stockholm 4

online → onli f e ■ internet: packet switching & routing, network structure, ■ world wide web: hyperlinking ■ search engines, blogs, social media, web portals ■ web platforms [network effects & filter bubbles; reputation & fake news] ■ mobile applications [moving towards IoT, wearables] ■ IoT: cyberphysical infrastructures [connected cars, smart energy grids] ■ cloud computing, fog computing & edge computing 21/2/17 Hildebrandt SNS seminar Stockholm 5

onli f e: data driven agency ■ creating added value from big data or small data ■ predicting behaviours ■ pre-empting behaviours ■ interplay of backend & frontend of computing systems ■ interfaces enable but they also hide , nudge and force [AB testing, ‘by design’ paradigms] 21/2/17 Hildebrandt SNS seminar Stockholm 9

onli f e: digital unconscious Big Data Space: ce: ■ accumulation of behavioural and other data ■ mobile and polymorphous data & hypothesis spaces ■ distributed storage [once data has been shared, control becomes a challenge] ■ distributed access [access to data or to the inferences, to training set & algos] 21/2/17 Hildebrandt SNS seminar Stockholm 10

onli f e: digital unconscious Big Data Space: ce: the e envelop elop of big data space drives human agency, providing convenience & resilience Weiser’s calm computi uting , IBM’s auton onom omic ic computi uting :  increasing dependence on the dynamics of interacting data driven cyberphysical systems 21/2/17 Hildebrandt SNS seminar Stockholm 12

what’s next? 2. Machin ine Learni ning ng 21/2/17 Hildebrandt SNS seminar Stockholm 14

big data, open data, personal data ■ BIG – volume (but, n=all is nonsense) – variety (unstructured in sense of different formats) – velocity (real time, streaming) ■ OP OPEN EN as opposed to proprietary? reuse? repurposing? public-private? – creating added value is hard work, not evident, no guarantees for return on investment ■ PER ERSONA ONAL data: IoT will contribute to a further explosion of personal data – high risk high gain (think DPIA)? anonymisation will mostly be pseudonymisation! 21/2/17 Hildebrandt SNS seminar Stockholm 16

machine Learning (ML) “ we say that a machine learns: - with respect to a particular task T, - performance metric P, and - type of experience E, if if - the system reliably improves its performance P - at task T, - following experience E .” (Tom Mitchell) http://www.cs.cmu.edu/~tom/mlbook.html 21/2/17 Hildebrandt SNS seminar Stockholm 17

types of machine learning ■ super pervi vised sed (lear arning ning from om exam ample les – requi uire res s labelli elling, ng, doma main in exper ertise tise) ■ reinf inforce rcement ment (lea earnin rning g by correcti rection on - requires uires prior r doma omain in exper erti tise) se) ■ uns nsuper upervised vised (bott ttum up up, induc ucti tive e – danger nger of overfitt tting) ing) 21/2/17 Hildebrandt SNS seminar Stockholm 18

bias optimisation spurious correlations ■ 2. have a network trained to recognize animal faces ■ 1. present it with a picture of a flower ■ 2. run the algorithms ■ 3. check the output (see what it sees) http://www.nature.com/news/can-we-open-the-black-box-of-ai-1.20731 21/2/17 Hildebrandt SNS seminar Stockholm 20

Wol olper pert: : no no free ee lu lunc nch h theor orem em Wher here d = trainin ning g set; et; f = ‘target’ input -ou outp tput ut relat ationshi ionships; s; h = hypo poth thesi esis (the he algori rith thm's m's gue uess ss for f made de in response ponse to d); ; and C = off-trai training ng- set ‘loss’ associated with f and h (‘generalization error’) How well you do is determined by how ‘aligned’ your learning algorithm P( h|d) is with the actual posterior, P(f|d). Check http://www.no-free-lunch.org 21/2/17 Hildebrandt SNS seminar Stockholm 21

Wol olper pert: : no no free ee lu lunc nch h theor orem em Summary: – The bias that is necessary to mine the data will co-determine the results – This relates to the fact that the data used to train an algorithm is finite – ‘Reality’, whatever that is, escapes the inherent reduction – Data is not the same as what it refers to or what it is a trace of 21/2/17 Hildebrandt SNS seminar Stockholm 22

trade-offs ■ NFL FL theo eorem rem – overfitting, overgeneralization ■ trainin ning g set, et, domai main n kno nowled wledge, ge, hypo poth theses ses space, ce, test st set et – accuracy, precision, speed, iteration ■ low w hanging ging frui uit t – may be cheap and/or available but not very helpfull ■ data nor algori rith thms s are object jectiv ive e – bias in the data, bias of the algos, guess what: bias in the output ■ the e more re data, a, the e larger er the e hypo poth theses es sp space, e, the e more ore pattern erns – spurious correlations, computational artefacts 21/2/17 Hildebrandt SNS seminar Stockholm 25

data hoarding & obesitas ■ data obes esitas itas: : lots of data, but often incorrect, incomplete, irrelevant (low hanging fruit) – any personal data stored presents security and other risks sks (need for DPIA, DPbD) – pu purpose rpose limitati tion on is crucial: select ect before re you ou collect lect (and while, and after) ■ pattern ern obesi esitas tas: : trained algorithms can see patterns anywhere, added value? – training set and algorithms ne necessari essarily ly contain bias, this may be problematic (need for DPIA, DPbD) – purpose pu rpose limitati tion on is crucial: to prevent spurious correlations, to test t rele levance nce 21/2/17 Hildebrandt SNS seminar Stockholm 26

agile and lean computing ■ agile e softw tware are developme elopment: nt: – iteration instead of waterfall – collaboration domain experts, data scientists, whoever invests – initial purpose (prediction of behaviour, example: tax office, car insurance) – granular purposing (testing specific patterns, AB testing to nudge specific behaviour) ■ lean n com omputing: uting: – less data = more effective & more efficient ■ meth ethodo dologi logica cal l integri egrity ty: – make your software testable and contestable: mathematical & empirical software verification – secure logging, open source 21/2/17 Hildebrandt SNS seminar Stockholm 27

what’s next? 4. Data a Protect ection ion Law 21/2/17 Hildebrandt SNS seminar Stockholm 28

pr priv ivacy acy and nd aut utonomy onomy ■ th the im impli lica cati tion ons s of of pre-empti tive co computi ting: – AB testing & nudging – pre-emption of our intent, playing with our autonomy – we become subject to decisions of data-driven agents – this choice architecture may generate manipulability 21/2/17 Hildebrandt SNS seminar Stockholm 29

no non-discrimination discrimination ■ three ee type pes of bi bias: – bias inherent in any action-perception-system (APS) – bias that some would qualify as unfair – bias that discriminates on the basis of prohibited legal grounds 21/2/17 Hildebrandt SNS seminar Stockholm 30

the opacity argument in ML: 1. 1. intent ntional nal conceal alment ment – trade de secre rets ts, , IP right hts, s, pub ublic c security urity 2. 2. we we have learned d to read and write, , not ot to code or do machine hine learning ing – monopoly of the new ‘clerks’, the end of democracy 3. 3. mismatc match h betwee etween mathe hematic matical al optimi miza zation tion and human an semant ntics ics – when it comes to law and justice we cannot settle for ‘computer says no’ – inspired by: Jenna Burrell, How the machine ‘thinks’: Understanding opacity in machine learning algorithms’, in Big Data ta & Society ty, January-June 2016, 1-12 21/2/17 Hildebrandt SNS seminar Stockholm 31

DATA PROTECTION RIGHT Prof. dr. Mireille Hildebrandt Interfacing - PowerPoint PPT Presentation

GETTING DATA PROTECTION RIGHT Prof. dr. Mireille Hildebrandt Interfacing Law & Technology Vrije Universiteit Brussel Smart Environments, Data Protection & the Rule of Law Radboud University 21/2/17 Hildebrandt SNS seminar Stockholm

The Data Protection Landscape Before and aft fter GDPR: General Data Protection Regulation Data

Tier 1 Water Budget CTC SPC Meeting # 2/09 Agenda Item # 6.1 February 17, 2009 Gayle

Groundwater Quality Vulnerability Analysis - WHPA delineation & vulnerability CTC SWP

Protection Issues I/O protection Protection and System Calls Prevent users from

I. Asset Protection Trusts Foreign Asset Protection Trusts Offshore Asset Protection

Module 18: Protection Goals of Protection Domain of Protection Access Matrix

Module 18: Protection Goals of Protection Domain of Protection Access Matrix

Data Protection & Availability Kaushal Devater Data Protection & Availability Discipline

Data Protection Reform preparing for the General Data Protection Regulation By Philip Brining

GDPR T owards Compliance 25 May 2018 Wha hat t is GDPR? EU Data Protection Directive EU

5 THINGS HR MUST DO IN THE ROLE OF THE DATA PROTECTION OFFICER GILLIAN ACHESON DATA PROTECTION

Privacy, Data Protection Law and Privacy, Data Protection Law and Flow Data Anonymisation

Invention-Con 2017 - International 2 Protection - Patents International Protection: Patents

Bill Protection Introduction 1 Agenda 1.Review Decision Parameters & Principles 2.TOU Bill

The Protection Mainstreaming Mobile Application (ProM) The Protection Mainstreaming App is

Wave Sound Space Tourist On-board Protection Space Tourist On-board Protection Space Tourist

Slot clouds getting more from orbital slots with networking Lloyd Wood Global Defense and Space

VoIP switching and billing suite Break through your data VoIP switching and billing suite

Evaluation of variance for TCP throughput Olga I. Bogoiavlenskaia PetrSU, Department of Computer

A networked-FPGA platform o ff ering fm exible Ethernet switching from Layer 1 all the way to full

Enabling GPU-as-a-Service Providers with Red Hat OpenShift @jeremyeder Senior Principal Software

ATO Journey IMechE - May 2019 Mainline ATO over ETCS development CR1238 - ATO B3 R2 TEN-T

The Future of Network Flow Monitoring Prague Embedded Systems Workshop (PESW 2019) Friday 28 th

Routing in packet-switching networks Circuit switching vs. Packet switching Most of WANs based on

DATA PROTECTION RIGHT Prof. dr. Mireille Hildebrandt Interfacing - PowerPoint PPT Presentation

GETTING DATA PROTECTION RIGHT Prof. dr. Mireille Hildebrandt Interfacing Law & Technology Vrije Universiteit Brussel Smart Environments, Data Protection & the Rule of Law Radboud University 21/2/17 Hildebrandt SNS seminar Stockholm

The Data Protection Landscape Before and aft fter GDPR: General Data Protection Regulation Data

Tier 1 Water Budget CTC SPC Meeting # 2/09 Agenda Item # 6.1 February 17, 2009 Gayle

Groundwater Quality Vulnerability Analysis - WHPA delineation &amp; vulnerability CTC SWP

Protection Issues I/O protection Protection and System Calls Prevent users from

I. Asset Protection Trusts Foreign Asset Protection Trusts Offshore Asset Protection

Module 18: Protection Goals of Protection Domain of Protection Access Matrix

Module 18: Protection Goals of Protection Domain of Protection Access Matrix

Data Protection &amp; Availability Kaushal Devater Data Protection &amp; Availability Discipline

Data Protection Reform preparing for the General Data Protection Regulation By Philip Brining

GDPR T owards Compliance 25 May 2018 Wha hat t is GDPR? EU Data Protection Directive EU

5 THINGS HR MUST DO IN THE ROLE OF THE DATA PROTECTION OFFICER GILLIAN ACHESON DATA PROTECTION

Privacy, Data Protection Law and Privacy, Data Protection Law and Flow Data Anonymisation

Invention-Con 2017 - International 2 Protection - Patents International Protection: Patents

Bill Protection Introduction 1 Agenda 1.Review Decision Parameters &amp; Principles 2.TOU Bill

The Protection Mainstreaming Mobile Application (ProM) The Protection Mainstreaming App is

Wave Sound Space Tourist On-board Protection Space Tourist On-board Protection Space Tourist

Slot clouds getting more from orbital slots with networking Lloyd Wood Global Defense and Space

VoIP switching and billing suite Break through your data VoIP switching and billing suite

Evaluation of variance for TCP throughput Olga I. Bogoiavlenskaia PetrSU, Department of Computer

A networked-FPGA platform o ff ering fm exible Ethernet switching from Layer 1 all the way to full

Enabling GPU-as-a-Service Providers with Red Hat OpenShift @jeremyeder Senior Principal Software

ATO Journey IMechE - May 2019 Mainline ATO over ETCS development CR1238 - ATO B3 R2 TEN-T

The Future of Network Flow Monitoring Prague Embedded Systems Workshop (PESW 2019) Friday 28 th

Routing in packet-switching networks Circuit switching vs. Packet switching Most of WANs based on

Groundwater Quality Vulnerability Analysis - WHPA delineation & vulnerability CTC SWP

Data Protection & Availability Kaushal Devater Data Protection & Availability Discipline

Bill Protection Introduction 1 Agenda 1.Review Decision Parameters & Principles 2.TOU Bill