Population-based detection of Structural Variants in normal and aberrant genomes.
Jean Monlong, PhD2
Guillaume Bourque’s group
Research Day - June 5, 2014 Human Genetics Dept.
1 / 13
Population-based detection of Structural Variants in normal and - - PowerPoint PPT Presentation
Population-based detection of Structural Variants in normal and aberrant genomes. Jean Monlong, PhD2 Guillaume Bourques group Research Day - June 5, 2014 Human Genetics Dept. 1 / 13 What is structural variation ? Genetic variation
Guillaume Bourque’s group
1 / 13
Baker 2012, Nature Methods. Raphael Lab, Brown University.
Structural Variant: SV; Copy Number Variation: CNV.
2 / 13
◮ Major role in evolution. ◮ Population Genetics: widespread variation across humans. ◮ Association with diseases and cancer.
◮ Sample is sequenced. ◮ Reads are mapped to the reference genome. ◮ Unexpected patterns could be explain by presence of SVs.
3 / 13
Baker 2012, Nature Methods. 4 / 13
Low mappability
◮ Noisy or reduced signal in repeat-rich regions, centromeres, telomeres. ◮ Unpredictable segmentation → reduced sensitivity/specificity. ◮ Filtering problematic regions reduces the genome range tested. genomic window number of reads mapped genomic window number of reads mapped
5 / 13
genomic window number of reads mapped
sample reference tested 6 / 13
genomic window number of reads mapped
sample reference tested
7 / 13
◮ Normal samples → reference samples. ◮ 10kb bins. ◮ Only properly paired and mapped read pairs.
◮ Germline events detected in tumor samples ? ◮ Concordant with SNP-array calls ? ◮ Twin dataset: concordant with the pedigree ? ◮ Concordant when using different bin sizes ?
8 / 13
4000 100.75 100.80 100.85 100.90 100.95 101.00
position (Mb) read coverage
tumor sample: D000GMU
normal normal samples
Chr.1, overlapping CDC14A gene (cell division cycle), not detected by other approaches. 9 / 13
4000 6000 135.11 135.13 135.15
position (Mb) read coverage
normal sample: D000GQ9
normal normal samples
Chr.10, overlapping genes (PRAP1, CALY), not detected by other approaches. 10 / 13
◮ Increased resolution in regions of interest. ◮ Promising results: enrichment in centromere/telomere.
◮ Detect excess of discordant reads. ◮ Promising results, including on repeats.
11 / 13
◮ Detection in low mappability regions and partial tumoral signal. ◮ Superior to other Read-Depth methods. ◮ Wider range of the genome tested.
◮ Explore results and application to other projects (e.g. Pan-Cancer Analysis of
Whole Genome).
◮ Custom binning: repeat annotation, Whole-Exome Sequencing. ◮ More than an CNV caller.
◮ Excess of discordant read pairs. ◮ Combination with orthogonal approaches (PEM, Assembly). 12 / 13
◮ Guillaume Bourque ◮ Mathieu Bourgey ◮ Louis Letourneau ◮ Francois Lefebvre ◮ Eric Audemard ◮ Toby Hocking ◮ Simon Gravel ◮ Mathieu Blanchette ◮ Mehran Karimzadeh Reghbati
13 / 13