Strategies for Bulk RNA-seq Analysis Genome Transcriptome - - PowerPoint PPT Presentation

strategies for bulk rna seq analysis genome transcriptome
SMART_READER_LITE
LIVE PREVIEW

Strategies for Bulk RNA-seq Analysis Genome Transcriptome - - PowerPoint PPT Presentation

Strategies for Bulk RNA-seq Analysis Genome Transcriptome Assembly Mapping Mapping Reads Reads Reads RSEM, Trinity, STAR, Kallisto, Scripture, HISAT2 Sailfish, Stringtie Salmon Splice-aware Transcript mapping Assembly into


slide-1
SLIDE 1

Strategies for Bulk RNA-seq Analysis

slide-2
SLIDE 2

Genome Mapping Transcriptome Mapping Assembly

Reads Reads Reads Transcript mapping and quantification Splice-aware Genome mapping RSEM, Kallisto, Sailfish, Salmon STAR, HISAT2 Gene counting Transcript discovery & counting htseq-count, featureCounts StringTie Novel transcript annotation Homology-based BLAST2GO Assembly into transcripts Trinity, Scripture, Stringtie Novel transcript annotation Trinotate

slide-3
SLIDE 3

multiple BAMs (+known GTF)

Sequence reads Quality control Alignment to Genome: HISAT2, STAR DGE with R: DESeq2, EdgeR, limma:voom Count reads associated with genes: htseq-count, featureCounts

FASTQ FASTQ Count Matrix (+known GTF, optional) (+reference genome index)

✓ Genome ✓ GTF (annotation)

slide-4
SLIDE 4

multiple BAMs (+known GTF)

Sequence reads Quality control Alignment to Genome: HISAT2, STAR DGE with R: DESeq2, EdgeR, limma:voom Count reads associated with genes: htseq-count, featureCounts Reference-based transcriptome assembly and quantitation with StringTie

FASTQ FASTQ Count Matrix (+known GTF, optional) BAM (+reference genome index)

✓ Genome ✓ GTF (annotation)?

DGE with CuffDiff, Ballgown

slide-5
SLIDE 5

(+reference genome index) multiple BAMs (+known GTF)

Sequence reads Quality control Alignment to Genome: HISAT2, STAR DGE with R: DESeq2, EdgeR, limma:voom Count reads associated with genes: htseq-count, featureCounts Reference-based transcriptome assembly and quantitation with StringTie

FASTQ FASTQ Count Matrix BAM

✓ Transcriptome

(FASTA)

DGE with CuffDiff, Ballgown

(nown GTF, optional) (+reference transcriptome index) FASTQ

DGE with Sleuth Pseudocounts with Kallisto, Sailfish, Salmon

Count Matrix generated using tximport

slide-6
SLIDE 6

Reference-based assembly De novo assembly

Martin J.A. and Wang Z., Nat. Rev. Genet. (2011) 12:671–682

✓ Genome ✓ GTF (annotation)? ✓ Genome? ✓ GTF (annotation)?

slide-7
SLIDE 7

Quantitation from assembled reads

Alignment to new transcriptome: Bowtie2, BWA DGE with R: DESeq2, EdgeR, limma:voom Count reads associated with genes

Count Matrix SAM/BAM

Reads Assembly into transcripts Trinity, Scripture RSEM, Kallisto. Salmon, eXpress Transcript mapping & quantification Novel transcript annotation Trinotate

slide-8
SLIDE 8

These materials have been developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC). These are open access materials distributed under the terms of the Creative Commons Attribution license (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.