De Novo Genome Analysis . . . . . Ketil Malde Analysis - - PowerPoint PPT Presentation

de novo genome analysis
SMART_READER_LITE
LIVE PREVIEW

De Novo Genome Analysis . . . . . Ketil Malde Analysis - - PowerPoint PPT Presentation

De Novo . Institute of Marine Research Ketil Malde De Novo Genome Analysis . . . . . Ketil Malde Analysis Annotation evaluation Assembly Gene prediction Assembly Introduction October 2, 2012 De Novo . Annotation Assembly


slide-1
SLIDE 1

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

De Novo Genome Analysis

Ketil Malde

Institute of Marine Research

October 2, 2012

slide-2
SLIDE 2

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Overview

Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

slide-3
SLIDE 3

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Processing a new genome

  • 1. Construct draft genome assembly
  • 2. Map known sequences

(ESTs, RNAseq, proteins)

  • 3. Build/train stochastic gene models
  • 4. Predict genes and transcripts
  • 5. Annotate transcripts with function
  • 6. Do awesome biology stuff - yay!
slide-4
SLIDE 4

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

e pipeline

DNA sequences RNA sequences Assembled genome Genome assembly Predicted transcriptome Gene prediction T ranscript assembly Genome evaluation Related proteomes Annotated genes Annotation

slide-5
SLIDE 5

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Assembly

DNA sequences RNA sequences Assembled genome Genome assembly Predicted transcriptome Gene prediction T ranscript assembly Genome evaluation Related proteomes Annotated genes Annotation

slide-6
SLIDE 6

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Assembly

Shotgun reads Paired reads Contig assembly

Overlaps:

  • Celera/CABOG
  • Newbler

DeBruijn:

  • ABYSS
  • CLC
  • SRA

Contigs Scaffolding

  • SSPACE
  • PCAP

Scaffolded contigs

slide-7
SLIDE 7

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Assembly

Shotgun reads Contigs

slide-8
SLIDE 8

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Assembly

Shotgun reads Contigs Paired reads Scaffold

slide-9
SLIDE 9

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Gene prediction

DNA sequences RNA sequences Assembled genome Genome assembly Predicted transcriptome Gene prediction T ranscript assembly Genome evaluation Related proteomes Annotated genes Annotation

slide-10
SLIDE 10

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Gene prediction

Stochastic methods

Scaffold

slide-11
SLIDE 11

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Gene prediction

Stochastic methods

Scaffold

RNA mapping

RNA seq

Training

slide-12
SLIDE 12

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Gene prediction

Stochastic methods

Scaffold

RNA mapping

RNA seq

Training Protein mapping

Proteomes

slide-13
SLIDE 13

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Gene prediction

Stochastic methods

Scaffold

RNA mapping

RNA seq

Training Protein mapping

Proteomes Gene model

slide-14
SLIDE 14

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Assembly evaluation

DNA sequences RNA sequences Assembled genome Genome assembly Predicted transcriptome Gene prediction T ranscript assembly Genome evaluation Related proteomes Annotated genes Annotation

slide-15
SLIDE 15

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Assembly evaluation

◮ Accuracy

◮ read mapping scores

◮ Completeness

◮ mapped reads ◮ mapped proteins

◮ Fragmentation

◮ N50 etc. ◮ mapped pairs

◮ Redundancy

◮ total scaffold size ◮ multiply mapped reads ◮ mulitply mapped proteins

slide-16
SLIDE 16

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Annotation

DNA sequences RNA sequences Assembled genome Genome assembly Predicted transcriptome Gene prediction T ranscript assembly Genome evaluation Related proteomes Annotated genes Annotation

slide-17
SLIDE 17

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Annotation

Putative transcripts BLAST GO KEGG EC UniProt

slide-18
SLIDE 18

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Transitive alignments

AGCT

p q

r

ACGT TGCA

slide-19
SLIDE 19

De Novo Ketil Malde Introduction Assembly Gene prediction Assembly evaluation Annotation Analysis

. . . . . .

Awesome biology stuff!