Next-generation DNA sequencing Diana Le Duc, M.D. Biochemistry - - PowerPoint PPT Presentation

next generation dna sequencing
SMART_READER_LITE
LIVE PREVIEW

Next-generation DNA sequencing Diana Le Duc, M.D. Biochemistry - - PowerPoint PPT Presentation

Next-generation DNA sequencing Diana Le Duc, M.D. Biochemistry Institute, Medical Faculty, University of Leipzig Statistical Analysis of RNA-Seq Data , University of Leipzig, 18 th of April 2012 Gabriela-Diana.LeDuc@medizin.uni-leipzig.de


slide-1
SLIDE 1

Next-generation DNA sequencing

Diana Le Duc, M.D. Biochemistry Institute, Medical Faculty, University of Leipzig

Statistical Analysis of RNA-Seq Data , University of Leipzig, 18th of April 2012

Gabriela-Diana.LeDuc@medizin.uni-leipzig.de

slide-2
SLIDE 2

Deoxyribonucleic acid (DNA)

n Discovery (Miescher, 1869) n Carrier of genetic information

(Avery/MacLeod/ McCarty, 1944)

n Structural model (Watson/

Crick/Wilkins/Franklin, 1953)

n Replication using

complementary base pairing

n Reading its information start

early 1970s

Picture from http://en.wikipedia.org/wiki/DNA

slide-3
SLIDE 3

Why Sequencing?

n Medicine n Forensics n Biology n Agriculture

slide-4
SLIDE 4

DNA Sequencing

cancerdiscovery.aacrjournals.org

slide-5
SLIDE 5

Sanger sequencing

n DNA Sequencing = determining the

  • rder of the nucleotide bases

n single-stranded DNA template n DNA primer n DNA polymerase n Normal dNTPs n Terminating nucleotide

Sanger Video

Image from http://users.rcn.com/jkimball.ma.ultranet/BiologyPages/D/DNAsequencing.html

slide-6
SLIDE 6

Sanger sequencing overview

  • genomic DNA is fragmented
  • cloned to a plasmid vector -> transform
  • E. coli
  • a single bacterial colony is picked ->

plasmid DNA isolated

Image from http://en.wikipedia.org/wiki/DNA_sequencing

slide-7
SLIDE 7

Sequencing technologies – Sequencing Revolution

DOI: 10.1002/anie.201003880

Improved technologies:

  • Higher throughput 1500 x
  • Reduced costs / Mb
  • Common method:

sequencing by extension

slide-8
SLIDE 8

NGS – What Platforms are there?

n Illumina/Solexa reversible terminator

chemistry

n Principle of SOLiD sequencing by ligation n 454 Pyrosequencing n Ion Torrent Personal genome Machine n Single Molecule Sequencing

slide-9
SLIDE 9

Sequencing technologies – shared attributes

n Template preparation n Sequencing and imaging n Data analysis

slide-10
SLIDE 10

Sequencing technologies – NGS template preparation

  • A. Clonally amplified templates - cell free

system:

  • 1. Emulsion PCR Emulsion PCR Video

Picture on http://www.seqtech.com/2011/11/08/454-life-sciences-2/

  • standard microscope slide (Polonator)
  • aminocoated glass surface (Life/APG; Polonator)
  • PicoTiterPlate (PTP) wells (Roche/454)
  • microchip sensor (Ion Torrent)

Metzker, M. L. Sequencing technologies - the next generation. Nat Rev Genet 11, 31-46.

slide-11
SLIDE 11

Sequencing technologies – NGS template preparation

  • A. Clonally amplified templates - cell free

system:

  • 2. Solid-phase amplification Bridge PCR Video

Metzker, M. L. Sequencing technologies - the next generation. Nat Rev Genet 11, 31-46.

slide-12
SLIDE 12

Sequencing technologies – NGS template preparation

  • B. Single-molecule templates:

n Require less starting material n Immobilized on the solid surface by

§ Primers: Helicos BioSciences § Template: Helicos BioSciences § Polymerase: Pacific Biosciences, Life/Visigen, LI- COR Biosciences

slide-13
SLIDE 13

Sequencing technologies – NGS sequencing and imaging

  • 1. Cyclic reversible termination

Modified polymerase incorporates nucleotides

  • after each nucleotide incorporation process stops
  • camera reads fluorophore signal (filter for each

nucleotide type)

  • terminator and labeling is removed and cycle starts

again Illumina/Solexa Genome Analyzer

Illumina Video

IMPRS EVA Genetics Core Seminar Week – Janet Kelso, Martin Kircher

slide-14
SLIDE 14

Sequencing technologies – NGS sequencing and imaging

  • 1. Cyclic reversible termination

Metzker, M. L. Sequencing technologies - the next generation. Nat Rev Genet 11, 31-46.

  • Substitutions with higher frequency when the

previous base is ‘G’

  • Underrepresentation of GC- rich regions
slide-15
SLIDE 15

Sequencing technologies – NGS sequencing and imaging

  • 1. Cyclic reversible termination

n 3’-unblocked reversible terminators n LaserGen – Lightning Terminators n Helicos BioSciences – Virtual Terminators n Cleavage of only one bond

slide-16
SLIDE 16

Sequencing technologies – NGS sequencing and imaging

  • 2. Sequencing by ligation

n Difference – DNA ligase n Hybridization of a fluorescently labelled

probe

n SOLiD cycle of 1,2-probe hybridization

slide-17
SLIDE 17

Picture by ABI/Life Technologies

SOLiD Video

slide-18
SLIDE 18

Sequencing technologies – NGS sequencing and imaging

  • 2. Sequencing by ligation errors:

n Substitutions n Underrepresentation of AT- and GC- rich regions

slide-19
SLIDE 19

Sequencing technologies – NGS sequencing and imaging

  • 3. Pyrosequencing 454 Video
slide-20
SLIDE 20
  • 3. Pyrosequencing errors:

§

For homopolymeric reads -> unreliable sequence

§

Insertions

§

Deletions

Sequencing technologies – NGS sequencing and imaging

slide-21
SLIDE 21

Sequencing technologies – NGS sequencing and imaging

  • 4. Real-time sequencing:

n Pacific Biosciences n Continuous imaging of dye-labelled nucleotides

incorporation

slide-22
SLIDE 22
  • 5. Ion Semiconductor

Sequencing Ion Torrent Video

  • incorporation of dNTP

into DNA strand -> release of H+

  • ΔpH detected by an ion-

sensitive field-effect transistor

Sequencing technologies – NGS sequencing and imaging

slide-23
SLIDE 23

Comparison of different NGS platforms

Adapted from IMPRS EVA Genetics Core Seminar Week – Janet Kelso, Martin Kircher

Throughput Length Quality Costs Sanger

6 Mb/day 800nt 10-4 - 10-5 500$/Mb

454

750Mb/day 400nt 10-3 - 10-4 ~20$/Mb

Ion Torrent

1600Mb/day 200nt 10-2 - 10 -3 ~10$/Mb

Illumina

100000Mb/day 125nt 10-2 - 10 -3 ~0.40$/Mb

SOLiD 4

100000Mb/day 125nt 10-2 - 10 -3 ~0.40$/Mb

Helicos

5000Mb/day 32nt 10-2 ~0.40$/Mb

slide-24
SLIDE 24

http://www.omicsmaps.com/

Sequencing around the World

slide-25
SLIDE 25
  • micsmaps.com/stats
slide-26
SLIDE 26

Leipzig

  • 10 Sequencing

Machines,

  • 4th place in

Germany

http://www.omicsmaps.com/

slide-27
SLIDE 27
  • micsmaps.com/stats
slide-28
SLIDE 28

Sequencing technologies – Data analysis

n Bioinformatics tools for:

n Alignment n Base calling/polymorphism detection n De novo assembly n Genome browsing or annotation

n Challenging problems:

n De novo assembly of short reads -> mate-paired

libraries required

n Reads in repetitive regions

slide-29
SLIDE 29

Sequencing technologies – Data analysis

slide-30
SLIDE 30

Sequencing technologies – Data analysis

slide-31
SLIDE 31

Sequencing technologies – Data analysis

n $ 1000 genome sequencing and n $ 1000000 data analysis

slide-32
SLIDE 32

NGS applications

n Genome resequencing: polymorphism and

mutation discovery in humans (1000 Genomes Project)

n “Omics”: transcriptomics, proteomics,

metabolomics, microbiomes

slide-33
SLIDE 33

NGS applications

n Transcriptome sequencing:

§ Gene expression § Alternative splicing § Transcript annotation § SNPs § Somatic mutations

slide-34
SLIDE 34

NGS applications Future

n Throughput and costs of sequencing will

allow to characterize genetic variation within and between species in great detail

n Will become routine n Greatest challenge is extracting biologically or

clinically meaningful information

slide-35
SLIDE 35

My Projects

1.

Kiwi sequencing Illumina HiScan 2

2.

Transcriptome analysis and comparison GPCR 34 knock out – wild type C57BL/6

slide-36
SLIDE 36

Kiwi

Goals:

1.

Assessment of wing development genes:

  • Mutations
  • Signatures of

selection

  • Functional

assessment 2.

G protein coupled receptors

Ensembl Gene ID Associated Gene Name ENSGALG00000001532 F1NPH2_CHICK ENSGALG00000006379 SHH ENSGALG00000007562 FGF4 ENSGALG00000007706 Q90696_CHICK ENSGALG00000007834 SALL4 ENSGALG00000008253 TBX5_CHICK ENSGALG00000009495 FGFR2 ENSGALG00000010863 TWISTNB ENSGALG00000011630 GLI2 ENSGALG00000012329 GLI3 ENSGALG00000014872 FGF10 ENSGALG00000023904 FIBIN

slide-37
SLIDE 37

Further goals:

3.

Phylogeny tree

4.

Genome assembly Scientific Partners:

n

BGI-G10K: Prof. Guojie Zhang

n

MPI EVA: Bioinformatics group Janet Kelso

n

Allan Wilson Centre for Molecular Ecology and Evolution, School of Biological Sciences, University

  • f Auckland, Auckland, New Zealand: Prof. David

Lambert

Kiwi

slide-38
SLIDE 38

G Protein Coupled Receptor Image from fossilmuseum.net C57/Bl6 mouse image from http://www.criver.com Image from http://labrat.fieldofscience.com

slide-39
SLIDE 39

Transcriptome analysis

Goals:

n Differences in gene expression KO vs. WT n Involved metabolic pathways n Assess genes with immunologic involvement

slide-40
SLIDE 40

BGI

Thank you!