Using DNA from many samples to distinguish pedigree relationships of - PowerPoint PPT Presentation

Using DNA from many samples to distinguish pedigree relationships of close relatives Amy L. Williams @amythewilliams February 24, 2020 Family History Technology Workshop

Massive datasets: Many close relatives / small pedigrees >100,000 samples > 9 million samples ~500,000 samples >14 million samples 𝑜 𝑜−1 In dataset with 𝑜 individuals, have 𝑜 = 𝒫 𝑜 2 pairs 2 = 2

Goal: detect and reconstruct pedigrees using only DNA …

Signal: Identical by descent (IBD) sharing • Close (and some distant) relatives share large regions identical by descent (IBD) – Represented here as same color • Each generation, parents transmit random ½ of their genome to children  Relatives separated by 𝑁 generations 1 share average of 2 𝑁 of genome • Average IBD sharing fractions: – Full siblings: 50%, Aunt-nephew: 25%, First cousins: 12.5%

Second degree relatives: All share ~25% of genome IBD Grandparent- Avuncular (AV) Half-sibling (HS) grandchild (GP)  Difficult to distinguish using only data from the pairs

IBD sharing rates for these relationships heavily overlap

Idea: analyze IBD sharing of pair to other relatives

CREST: Classification of Relationship Types Ying Qiao Jens Sannerud

Approach: ratios of IBD sharing in three samples versus two 𝑆 1 = 𝑀𝑓𝑜𝑕𝑢ℎ 𝐽𝐶𝐸 𝑦 1 ,𝑧 ∩ 𝐽𝐶𝐸 𝑦 2 ,𝑧 𝑀𝑓𝑜𝑕𝑢ℎ 𝐽𝐶𝐸 𝑦 1 ,𝑧 𝑆 2 = 𝑀𝑓𝑜𝑕𝑢ℎ 𝐽𝐶𝐸 𝑦 1 ,𝑧 ∩ 𝐽𝐶𝐸 𝑦 2 ,𝑧 𝑀𝑓𝑜𝑕𝑢ℎ 𝐽𝐶𝐸 𝑦 2 ,𝑧 𝑦 1 For GP, expect 𝑆 1 = 1/4, 𝑆 2 = 1 𝑧 𝑦 2 Ying Qiao

Approach: ratios of IBD sharing in three samples versus two 𝑆 1 = 𝑀𝑓𝑜𝑕𝑢ℎ 𝐽𝐶𝐸 𝑦 1 ,𝑧 ∩ 𝐽𝐶𝐸 𝑦 2 ,𝑧 𝑀𝑓𝑜𝑕𝑢ℎ 𝐽𝐶𝐸 𝑦 1 ,𝑧 𝑆 2 = 𝑀𝑓𝑜𝑕𝑢ℎ 𝐽𝐶𝐸 𝑦 1 ,𝑧 ∩ 𝐽𝐶𝐸 𝑦 2 ,𝑧 𝑀𝑓𝑜𝑕𝑢ℎ 𝐽𝐶𝐸 𝑦 2 ,𝑧 For GP, expect 𝑆 1 = 1/4, 𝑆 2 = 1 For AV, expect 𝑆 1 = 1/4, 𝑆 2 = 1/2 𝑦 1 𝑧 𝑦 2 Ying Qiao

Approach: ratios of IBD sharing in three samples versus two 𝑆 1 = 𝑀𝑓𝑜𝑕𝑢ℎ 𝐽𝐶𝐸 𝑦 1 ,𝑧 ∩ 𝐽𝐶𝐸 𝑦 2 ,𝑧 𝑀𝑓𝑜𝑕𝑢ℎ 𝐽𝐶𝐸 𝑦 1 ,𝑧 𝑆 2 = 𝑀𝑓𝑜𝑕𝑢ℎ 𝐽𝐶𝐸 𝑦 1 ,𝑧 ∩ 𝐽𝐶𝐸 𝑦 2 ,𝑧 𝑀𝑓𝑜𝑕𝑢ℎ 𝐽𝐶𝐸 𝑦 2 ,𝑧 For GP, expect 𝑆 1 = 1/4, 𝑆 2 = 1 For AV, expect 𝑆 1 = 1/4, 𝑆 2 = 1/2 For HS, expect 𝑆 1 = 1/2, 𝑆 2 = 1/2 𝑧 𝑦 1 𝑦 2 Ying Qiao

CREST uses kernel density estimators to infer relationships Trained kernel density estimators (KDEs) using simulated data Features: 𝑆 1 , 𝑆 2

Can combine multiple relatives by taking union of IBD sharing 𝑧 𝑘 ’s 𝑀𝑓𝑜𝑕𝑢ℎ 𝑘 𝐽𝐶𝐸 𝑦 1 ,𝑧 𝑘 ∩ 𝑘 𝐽𝐶𝐸 𝑦 2 ,𝑧 𝑘 ∩ 𝐽𝐶𝐸 𝑦 1 ,𝑦 2 𝑆 𝑗 = 𝑀𝑓𝑜𝑕𝑢ℎ 𝑘 𝐽𝐶𝐸 𝑦 𝑗 ,𝑧 𝑘

CREST highly sensitive, highly specific Ran PADRE, CREST on 200 replicates of various pedigree structures : CREST : PADRE Qiao, Sannerud et al. (in revision, 2019)

CREST infers relative types in Generation Scotland data Generation Scotland data: 205 GP, 1,949 AV, and 121 HS pairs with at least one mutual relative Given data equivalent to one first cousin (10% of genome covered by IBD regions), CREST’s sensitivity is 0.99 in GP, 0.86 in AV, and 0.95 in HS pairs Qiao, Sannerud et al. (in revision, 2019)

Secondary aim: infer whether relatives are paternal or maternal Paternal Maternal Grandparent Half-siblings

Key insight: males / females have different crossover locations Female rate (cM/Mb) Data from human chromosome 10 Average number of crossovers: Male rate (cM/Mb) • Females: 2.04 • Males: 1.27 Physical position (Mb) Genetic map from Bhérer et al. (2017)

CREST infers maternal / paternal type in Generation Scotland Analyzed all 848 GP and 381 HS pairs in Generation Scotland Using 𝑀𝑃𝐸 = 0 as Half-siblings boundary: • 99.7% of HS • 93.5% of GP Inferred correctly Grandparent-grandchild Qiao, Sannerud et al. (in revision, 2019)

Conclusions • CREST classifies second degree relationship types – Enabled by multi-way IBD sharing • Male / female crossovers reveal the paternal / maternal type of half-siblings and grandparent-grandchild pairs • Can apply to pedigree reconstruction: other methods subject to ambiguities for second degree pairs • Preliminary results indicate CREST also applies to third degree pairs

Acknowledgements Generation Scotland Caroline Hayward Archie Campbell Ying Qiao Jens Sannerud Nancy E. and Peter C. Meinig

Approach: IBD segment ends approximate crossover locations • Model IBD segments as regions flanked by two crossovers No-crossover interval: interior of IBD segment 𝑗int 𝑥 0 𝑥 1 Locations of crossovers: window surrounding IBD segment ends • For each IBD segment 𝑗, likelihood of parent being 𝑇 ∈ {𝐺, 𝑁} is 𝑄 𝑗 𝑇 = 𝑄 𝑥 0 𝑇 ⋅ 𝑄 𝑗int 𝑇 ⋅ 𝑄 𝑥 1 𝑇 • Taking all IBD segments to be independent, we compute 𝑗 𝑄(𝑗|𝐺) 𝑀𝑃𝐸 = log 10 𝑗 𝑄 𝑗 𝑁 Jens Sannerud

Using DNA from many samples to distinguish pedigree relationships of - PowerPoint PPT Presentation

Using DNA from many samples to distinguish pedigree relationships of close relatives Amy L. Williams @amythewilliams February 24, 2020 Family History Technology Workshop Massive datasets: Many close relatives / small pedigrees >100,000

Pedigree subpartitioning May 27, 2005 1 Terminology A pedigree is a set of individuals along

DNA D DNA Double bl Helix DNA stands for: DNA stands for: U d Under a Deoxyribose

Table of Contents Why DNA Computing? The Structure of DNA DNA Computing Operations on DNA

Take out your DNA model DNA and the Human Genome DNA Model How was your How was your model

Table of Contents Why DNA Computing? The Structure of DNA DNA Computing Operations on

DNA Computing Information Processing with DNA Molecules Christian Jacob, 01/2002. Table of

Eastern Shores (GHOTES) DNA A Family Tree DNA Project Family Tree DNA Family Tree DNA or

Visualizing Large Pedigree Visualizing Large Pedigree Charts in 3D Space Charts in 3D Space

Automatic Merging of Automatic Merging of Pedigree Information Pedigree Information Annual

DNA IN OUR FOOD? EXTRACTION OF DNA FROM STRAWBERRIES (GETTING THE DNA OUT OF STRAWBERRIES) -OR

The Design of Autonomous DNA The Design of Autonomous DNA Nanomechanical Devices: Devices:

DNA evidence: two important features match between two DNA profiles frequency of the DNA profile in

DNA Nucleus Contains cells genetic info (DNA) controls cell functions DNA Structure

Self-Assembling DNA Self-Assembling DNA N. Jonoska Jonoska, N. C. , N. C. Seeman Seeman, DNA

Go Bananas! Introduction Tell you about DNA Show you how to extract DNA from a Banana

Analysis and classification of the DNA Analysis and classification of the DNA sequence of TARA

2016 BPC Update 287-2731 www.thinkfirstspraylast.org pesticides@maine.gov What I plan to cover

I am Ferdie and I am Hilarys eldest grandchild. I have a sister Aurora who is 3 and a cousin

THE PRESENTATION SECRETS OF STEVE JOBS: HOW TO BE INSANELY GREAT IN FRONT OF ANY AUDIENCE

FROM RESEARCH-LED TEACHING TO RESEARCH-LED LEARNING: EDUCATION FOR AN UNKNOWN FUTURE Emeritus

Interventions for patients with suicidal ideation Dr. Eduardo Chachamovich, MD, PhD Associate

PRO WPF 4.5 IN VB: WINDOWS PRESENTATION FOUNDATION IN .NET 4.5 Download Free Author: Matthew

A BPA Biosensor A BPA Biosensor The Bisphenolics : Jason Gardiner, Saul Godkewitsch, Benny Hung,

Catalogue-presentation Our company LLC Siberian Ecological Products (Russia) is exporting

Sambuz

Useful Links

Newsletter

Mail Us