Privacy Protections as an Incentive for Collaborative Research on - PowerPoint PPT Presentation

DIMACS 1 / 23 Privacy Protections as an Incentive for Collaborative Research on Human Health Anand D. Sarwate Department of Electrical and Computer Engineering Rutgers, the State University of New Jersey April 24, 2017 Rutgers Sarwate

DIMACS > Human health research 2 / 23 Human health research There are many data sharing challenges in human health research • Secondary use of clinical data for research • Multi-site studies on QA or comparative effectiveness • Joint (secondary) analyses on aggregated research data Rutgers Sarwate

DIMACS > Human health research 3 / 23 Institutions often want to share data Rutgers Sarwate

DIMACS > Human health research 3 / 23 Institutions often want to share data • Different research groups using the same type of measurements want to do a joint analysis. Rutgers Sarwate

DIMACS > Human health research 3 / 23 Institutions often want to share data • Different research groups using the same type of measurements want to do a joint analysis. • Sharing requires lawyers at each institution to generate Data Use Agreements. Rutgers Sarwate

DIMACS > Human health research 3 / 23 Institutions often want to share data • Different research groups using the same type of measurements want to do a joint analysis. • Sharing requires lawyers at each institution to generate Data Use Agreements. • Resulting months of negotiation makes even small-scale collaboration too complicated. Rutgers Sarwate

DIMACS > Human health research 4 / 23 Collaborative research systems Research consortia are common in many research areas involving human health: Rutgers Sarwate

DIMACS > Human health research 4 / 23 Collaborative research systems Research consortia are common in many research areas involving human health: • Foster collaborative research about a particular condition (Alzheimer’s, autism, breast cancer, etc.) Rutgers Sarwate

DIMACS > Human health research 4 / 23 Collaborative research systems Research consortia are common in many research areas involving human health: • Foster collaborative research about a particular condition (Alzheimer’s, autism, breast cancer, etc.) • Automated sharing is challenging, but this is changing. Rutgers Sarwate

DIMACS > Human health research 4 / 23 Collaborative research systems Research consortia are common in many research areas involving human health: • Foster collaborative research about a particular condition (Alzheimer’s, autism, breast cancer, etc.) • Automated sharing is challenging, but this is changing. Goal: use privacy protections to encourage consortium growth. Rutgers Sarwate

DIMACS > Human health research 5 / 23 CO llaborative I nformatics N euroimaging S uite • End-to-end system for managing data for studies on the brain • Current usage: 37,903 participants in 42,961 scan sessions from 612 studies for a total of 486,955 clinical assessments. • Data from 34 states, 38 countries • Partners with research consortia such as the Autism Brain Imaging Data Exchange (ABIDE) Rutgers Sarwate

DIMACS > Human health research 6 / 23 Example: schizophrenia research � R d D 0 Private D 1 w priv , 1 � SVM R M � P r i v a t e S V M w priv Private D 2 A g g r e g a t o r SVM w priv , 2 x i = W > x i ˜ � final classification rule w priv ,M Private D M y = sgn( w > priv W > x ) ˆ SVM Rutgers Sarwate

DIMACS > Human health research 6 / 23 Example: schizophrenia research � R d D 0 Private D 1 w priv , 1 � SVM R M � P r i v a t e S V M w priv Private D 2 A g g r e g a t o r SVM w priv , 2 x i = W > x i ˜ � final classification rule w priv ,M Private D M y = sgn( w > priv W > x ) ˆ SVM • Goal: build a system that can identify schizophrenia. Rutgers Sarwate

DIMACS > Human health research 6 / 23 Example: schizophrenia research � R d D 0 Private D 1 w priv , 1 � SVM R M � P r i v a t e S V M w priv Private D 2 A g g r e g a t o r SVM w priv , 2 x i = W > x i ˜ � final classification rule w priv ,M Private D M y = sgn( w > priv W > x ) ˆ SVM • Goal: build a system that can identify schizophrenia. • Data: MRIs from multiple studies (healthy controls and schizophrenics). Rutgers Sarwate

DIMACS > Human health research 6 / 23 Example: schizophrenia research � R d D 0 Private D 1 w priv , 1 � SVM R M � P r i v a t e S V M w priv Private D 2 A g g r e g a t o r SVM w priv , 2 x i = W > x i ˜ � final classification rule w priv ,M Private D M y = sgn( w > priv W > x ) ˆ SVM • Goal: build a system that can identify schizophrenia. • Data: MRIs from multiple studies (healthy controls and schizophrenics). • Algorithm: classification using machine learning (e.g. support vector machine). Rutgers Sarwate

DIMACS > Human health research 6 / 23 Example: schizophrenia research � R d D 0 Private D 1 w priv , 1 � SVM R M � P r i v a t e S V M w priv Private D 2 A g g r e g a t o r SVM w priv , 2 x i = W > x i ˜ � final classification rule w priv ,M Private D M y = sgn( w > priv W > x ) ˆ SVM • Goal: build a system that can identify schizophrenia. • Data: MRIs from multiple studies (healthy controls and schizophrenics). • Algorithm: classification using machine learning (e.g. support vector machine). • Privacy risk: each study has to allow access to sensitive subject data. Rutgers Sarwate

DIMACS > Status quo ante 7 / 23 State of the art: ENIGMA http://enigma.ini.usc.edu “The ENIGMA Network brings together researchers in imaging genomics to understand brain structure, function, and disease, based on brain imaging and genetic data.” • MA = meta analysis : focused on • Goals: improve reproducibility, sample sizes • Validation: found genetic variations associated with neurophysiological characteristics (e.g. hippocampal/intercranial volumes) Rutgers Sarwate

DIMACS > Status quo ante 8 / 23 Workflows in ENIGMA http://enigma.ini.usc.edu ENIGMA has 30+ working groups on diseases, genomics, population variation, and methods. To do a study: • Study proposal is approved by ENIGMA managers. • Analyses performed on local sites and emailed to ENIGMA manager as Excel spreadsheets. • Manager has to perform “manual” meta-analysis. Rutgers Sarwate

DIMACS > Status quo ante 9 / 23 Low-hanging fruit: automate this COINSTAC works in a different way: data is registered in the system and analyses are performed/aggregated automatically through message passing. Rutgers Sarwate

DIMACS > Status quo ante 9 / 23 Low-hanging fruit: automate this COINSTAC works in a different way: data is registered in the system and analyses are performed/aggregated automatically through message passing. • Study is proposed specifying data needed. Rutgers Sarwate

DIMACS > Status quo ante 9 / 23 Low-hanging fruit: automate this COINSTAC works in a different way: data is registered in the system and analyses are performed/aggregated automatically through message passing. • Study is proposed specifying data needed. • Local sites approve access to data. Rutgers Sarwate

DIMACS > Status quo ante 9 / 23 Low-hanging fruit: automate this COINSTAC works in a different way: data is registered in the system and analyses are performed/aggregated automatically through message passing. • Study is proposed specifying data needed. • Local sites approve access to data. • Analyses are run and aggregated automatically. Rutgers Sarwate

DIMACS > Status quo ante 9 / 23 Low-hanging fruit: automate this COINSTAC works in a different way: data is registered in the system and analyses are performed/aggregated automatically through message passing. • Study is proposed specifying data needed. • Local sites approve access to data. • Analyses are run and aggregated automatically. This can be significantly faster than the ENIGMA approach. Rutgers Sarwate

DIMACS > COINSTAC 10 / 23 The COINSTAC workflow In COINSTAC, research groups install the software and register their data in the system: • Form ongoing and ad-hoc “consortia” (slow, requires approval) • Once established, consortium members can initiate a joint analysis • Computation is performed locally and messages passed between sites Rutgers Sarwate

DIMACS > COINSTAC 11 / 23 What’s in the medium term COINSTAC prototype is currently “demo-able” but not up and running. • Compute more than summary statistics, ridge regression, etc. • Improve user interface and usability for practitioners, including visualization tools. • Initial subject focus for new results: addiction studies. • Incorporate/test differentially private methods for machine learning. Rutgers Sarwate

DIMACS > COINSTAC 12 / 23 Focusing on “old” algorithms Because the focus is on usability, we are working on methods popular in neuroimaging: • Feature discovery: ICA, IVA, NMF, deep learning, etc. • Regression and classification: ridge regression, LASSO, SVM, etc. • Visualization: t-SNE, network visualization, etc. Rutgers Sarwate

DIMACS > COINSTAC 13 / 23 COINSTAC vs. other health data systems COINSTAC is a solution that works for typical neuroimaging research initiatives. Rutgers Sarwate

Privacy Protections as an Incentive for Collaborative Research on - PowerPoint PPT Presentation

DIMACS 1 / 23 Privacy Protections as an Incentive for Collaborative Research on Human Health Anand D. Sarwate Department of Electrical and Computer Engineering Rutgers, the State University of New Jersey April 24, 2017 Rutgers Sarwate

RTU protections - news Ostrava 2015 Energetics RTU protections RTU RTUs for control,

Data Privacy Law Overview Privacy Protections (D) Working Group Jennifer McAdam Senior Counsel

Data privacy: Privacy models Vicen c Torra March, 2019 Hamilton Institute, Maynooth

$ Lesson Fourteen Consumer Privacy 04/09 privacy and information information privacy: privacy

$ Lesson Ten Consumer Privacy 04/09 privacy and information information privacy: privacy that

CS305 Topic Privacy Concept Evolution Rights to Privacy Privacy and Technologies

Privacy Protection privacy notions and metrics; privacy in RFID systems; location privacy in

Due Process Protections In Due Process Protections In DBE Decertification Proceedings DBE

Exceeding Domestic Standards: The gap between treaty protections and protections under US law

Introduction to Cybersecurity Database Privacy Review: Anonymity vs. Privacy Privacy -

Database Privacy Review: Anonymity vs. Privacy Privacy - Privacy is the claim of individuals,

CS573 Data Privacy and Security Data Privacy and Security in Healthcare Data Privacy and Security

Privacy engineering, CyLab privacy by design, privacy impact assessments, and privacy governance

Privacy Enhancing Technologies Spring 2006 Outline Privacy Overview Course Topics

Privacy engineering, CyLab privacy by design, privacy impact assessments, and privacy governance

Privacy in Wireless Networks privacy notions and metrics; privacy in RFID systems; location

Using LOD to crowdsource Dutch WW2 underground newspapers on Wikipedia Olaf Janssen, National

Feedly is a news aggregator application It compiles news feeds from a variety of online sources

On the impact of social cost in opinion dynamics Panagiotis Liakos Katia Papakonstantinopoulou

Mobile and Ubiquitous Computing: Informed Mobile Prefetching Brett Levasseur Computer Science

Cloud Native Camel Design Patterns (Tips for Running Apache Camel on Kubernetes) ApacheCon

Why I chose mongodb for guardian.co.uk Mat Wall Lead Software Architect, guardian.co.uk It is

Automated and Scalable QoS Control - For Network Convergence Wonho Kim (Princeton Univ.) Puneet

Oracles Thus Far Bitcoin Ethereum Recap Bitcoin Script: Dest Address in UTXO

Sambuz

Useful Links

Newsletter

Mail Us