Yaniv Erlich
@erlichya 10/26/15
Genome Hacking
Outline: Intro., Methodology, The Venter case, Anonymous datasets, Summary
We need to share genetic information
Hereditary Spastic Paraparesis (Erlich et al.)
Joubert syndrome (Endevson et al.)
Hemifacial Microsomia (Zielinski,.., & Erlich)
Genetic Privacy
IT department
bank
[Figure: diagram with the labels "Y" and "Smith"]
Estimating the time to most recent common ancestor
surname
Surname inference algorithm
Y-chr of a real person
Querying Ysearch and SMGF
Inferring surname
Comparing the predicted surname to the true one
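The steps above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not the method used in the talk: the in-memory `DATABASE`, the marker values, the `max_mismatches` threshold, and the helper names `mismatches` and `infer_surname` are all hypothetical. The slides do not show how Ysearch or SMGF are actually queried, so a toy list of records stands in for them.

```python
from collections import Counter

# Hypothetical stand-in for a genealogy database: (surname, Y-STR haplotype).
# Each haplotype maps a marker name to its repeat count. Values are made up.
DATABASE = [
    ("Smith", {"DYS19": 14, "DYS390": 24, "DYS391": 11, "DYS458": 17}),
    ("Smith", {"DYS19": 14, "DYS390": 24, "DYS391": 10, "DYS458": 17}),
    ("Jones", {"DYS19": 15, "DYS390": 22, "DYS391": 10, "DYS458": 16}),
]


def mismatches(query, record):
    """Count markers present in both haplotypes whose repeat counts differ."""
    shared = query.keys() & record.keys()
    if not shared:
        return None  # nothing to compare
    return sum(1 for marker in shared if query[marker] != record[marker])


def infer_surname(query, database, max_mismatches=1):
    """Return the most common surname among close matches, or None."""
    hits = []
    for surname, haplotype in database:
        distance = mismatches(query, haplotype)
        if distance is not None and distance <= max_mismatches:
            hits.append(surname)
    if not hits:
        return None
    return Counter(hits).most_common(1)[0][0]


# Step 1: Y-chromosome haplotype of a person whose surname is known.
query = {"DYS19": 14, "DYS390": 24, "DYS391": 11, "DYS458": 17}
true_surname = "Smith"

# Steps 2 and 3: query the (toy) database and infer the surname.
predicted = infer_surname(query, DATABASE)

# Step 4: compare the predicted surname to the true one.
is_correct = predicted == true_surname
```

A simple mismatch count is used here only to keep the sketch short. A real pipeline would more plausibly score matches with a mutation model, such as an estimate of the time to the most recent common ancestor.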
[Figure: frequency by age for the surname Adams, 100,000 rounds. Series: "Age+state+surname" and "Only age+state". x-axis: Age (0 to 90); y-axis: Freq. (0.0% to 2.0%)]
www.ysearch.org:
DYS458: 17 repeats
surname predictions
10 CEU (Utah) genomes
Winfield Utah
*Some of the details in this slide were modified to protect the identity of the family
5 successful surname recoveries, breaching the privacy of close to 50 CEU samples.
[Figure legend: Successful surname recovery (targeted individual); Patrilineal line from source to target; Person tested by genetic genealogy service (source)]
p-values shown in the figure: p < 5x10^-9, p < 10^-5, p < 5x10^-6, p < 5x10^-6, p < 10^-5
NIH response
@erlichya
Andria and Paul Heafy
Team
Genetic Privacy
Melissa Gymrek (HST – Harvard/MIT)
Amy McGuire (Baylor)
David Golan (Tel-Aviv University)
Eran Halperin (Tel-Aviv University)