Discovery of Genomic Structural Variations with Next-Generation - - PowerPoint PPT Presentation

discovery of genomic structural variations with next
SMART_READER_LITE
LIVE PREVIEW

Discovery of Genomic Structural Variations with Next-Generation - - PowerPoint PPT Presentation

Discovery of Genomic Structural Variations with Next-Generation Sequencing Data Advanced Topics in Computational Genomics Slides from Marcel H. Schulz, Tobias Rausch (EMBL), and Kai Ye (Leiden University) Genomic Rearrangements/ Structural


slide-1
SLIDE 1

Discovery of Genomic Structural Variations with Next-Generation Sequencing Data

Advanced Topics in Computational Genomics

Slides from Marcel H. Schulz, Tobias Rausch (EMBL), and Kai Ye (Leiden University)

slide-2
SLIDE 2

Genomic Rearrangements/ Structural Variations (SVs)

  • 1 Kb to several Mb in size

courtesy of Tobias Rausch (EMBL)

slide-3
SLIDE 3
  • 1 Kb to several Mb in size
  • Copy number variants

(CNVs)

– Deletion – Duplication

Genomic Rearrangements/ Structural Variations (SVs)

courtesy of Tobias Rausch (EMBL)

slide-4
SLIDE 4

Genomic Rearrangements/ Structural Variations (SVs)

  • 1 Kb to several Mb in size
  • Copy number variants

(CNVs)

– Deletion – Duplication

  • Insertion

courtesy of Tobias Rausch (EMBL)

slide-5
SLIDE 5

Genomic Rearrangements/ Structural Variations (SVs)

  • 1 Kb to several Mb in size
  • Copy number variants

(CNVs)

– Deletion – Duplication

  • Insertion, Inversion

courtesy of Tobias Rausch (EMBL)

slide-6
SLIDE 6
  • 1 Kb to several Mb in size
  • Copy number variants

(CNVs)

– Deletion – Duplication

  • Insertion, Inversion, Translocation

Genomic Rearrangements/ Structural Variations

courtesy of Tobias Rausch (EMBL)

slide-7
SLIDE 7
  • 1 Kb to several Mb in size
  • Copy number variants

– Deletion – Duplication

  • Insertion, Inversion, Translocation
  • More abundant than SNPs

Genomic Rearrangements/ Structural Variations

…ACGATACG… …ACGAGACG…

courtesy of Tobias Rausch (EMBL)

slide-8
SLIDE 8
  • 1 Kb to several Mb in size
  • Copy number variants

– Deletion – Duplication

  • Insertion, Inversion, Translocation
  • More abundant than SNPs
  • Either neutral or non-neutral in function
  • Non-neutral mechanisms

– Disrupting genes – Creating fusion genes – Copy number changes of dosage-sensitive genes

Genomic Rearrangements/ Structural Variations

courtesy of Tobias Rausch (EMBL)

slide-9
SLIDE 9

Why Structural Variation Discovery

  • Finding disease causal genes
  • Trace evolutionary genome history
  • Analyze the mechanisms of SVs occurrence
  • Understand Repetitive Element spreading

(LINEs, ALUs, etc.)

slide-10
SLIDE 10

Technologies to Discover Structural Variations

slide-11
SLIDE 11

Technologies

  • Fluorescent in situ hybridization (FISH)

– Fluorescent probes (≈100kb) detect and localize the presence or absence of specific DNA sequence – Probe should be large enough for a specific hybridization

 Perry et al. (2007)

courtesy of Tobias Rausch (EMBL)

slide-12
SLIDE 12

Technologies

  • Fluorescent in situ hybridization (FISH)
  • Comparative Genomic Hybridization (CGH)

– Test vs. reference sample – 2.1 million probes – Different types

  • Whole-Genome Tiling Arrays
  • Whole-Genome Exon-Focused Arrays
  • CNV Arrays

courtesy of Tobias Rausch (EMBL)

slide-13
SLIDE 13

Technologies

  • Fluorescent in situ hybridization (FISH)
  • Comparative Genomic Hybridization (CGH)
  • Genome-Wide Human SNP Array 6.0

– 1.8 million genetic markers

  • 906,600 SNPs
  • 946,000 probes for CNVs

courtesy of Tobias Rausch (EMBL)

slide-14
SLIDE 14

Technologies

  • Fluorescent in situ hybridization (FISH)
  • Comparative Genomic Hybridization (CGH)
  • Genome-Wide Human SNP Array 6.0
  • Human 1M-Duo DNA Analysis BeadChip

– 1.2 million genetic markers

  • Markers for SNPs and CNV regions

– Targeted studies

  • 60,800 additional custom SNPs
  • 60,000 custom CNV-targets

courtesy of Tobias Rausch (EMBL)

slide-15
SLIDE 15

Technologies

  • Fluorescent in situ hybridization (FISH)
  • Comparative Genomic Hybridization (CGH)
  • Genome-Wide Human SNP Array 6.0
  • Human 1M-Duo DNA Analysis BeadChip
  • Next-Generation Sequencing (NGS)

– Whole-genome sequencing – Targeted, e.g. RNA-Seq

courtesy of Tobias Rausch (EMBL)

slide-16
SLIDE 16

Focus on NGS

  • Limitations of Arrays

– Lower resolution for genomic rearrangements – Balanced events (e.g., inversions) cannot be detected using signal intensity differences – No breakpoint information

courtesy of Tobias Rausch (EMBL)

slide-17
SLIDE 17

Paired-end data

  • Two protocols for paired-end data

– mate-pair sequencing by circularization (traditional Sanger sequencing) – paired-end NGS

  • verview protocol
slide-18
SLIDE 18

Paired-end data

– paired-end NGS (insert distribution known due to fragment size selection)