Visualization of results Mary Piper Bioinformatics Consultant and - - PowerPoint PPT Presentation

visualization of results
SMART_READER_LITE
LIVE PREVIEW

Visualization of results Mary Piper Bioinformatics Consultant and - - PowerPoint PPT Presentation

DataCamp RNA-Seq Differential Expression Analysis RNA - SEQ DIFFERENTIAL EXPRESSION ANALYSIS Visualization of results Mary Piper Bioinformatics Consultant and Trainer DataCamp RNA-Seq Differential Expression Analysis Visualizing results -


slide-1
SLIDE 1

DataCamp RNA-Seq Differential Expression Analysis

Visualization of results

RNA-SEQ DIFFERENTIAL EXPRESSION ANALYSIS

Mary Piper

Bioinformatics Consultant and Trainer

slide-2
SLIDE 2

DataCamp RNA-Seq Differential Expression Analysis

Visualizing results - Expression heatmap

# Subset normalized counts to significant genes sig_norm_counts_wt <- normalized_counts_wt[wt_res_sig$ensgene, ] # Choose a color palette from RColorBrewer library(RColorBrewer) heat_colors <- brewer.pal(6, "YlOrRd") display.brewer.all()

slide-3
SLIDE 3

DataCamp RNA-Seq Differential Expression Analysis

Visualizing results - Expression heatmap

# Run pheatmap pheatmap(sig_norm_counts_wt, color = heat_colors, cluster_rows = T, show_rownames = F, annotation = select(wt_metadata, condition), scale = "row")

slide-4
SLIDE 4

DataCamp RNA-Seq Differential Expression Analysis

Visualizing results - Volcano plot

# Obtain logical vector regarding whether padj values are less than 0.05 wt_res_all <- wt_res_all %>% rownames_to_column(var = "ensgene") %>% mutate(threshold = padj < 0.05) # Volcano plot ggplot(wt_res_all) + geom_point(aes(x = log2FoldChange, y = -log10(padj), color = threshold)) + xlab("log2 fold change") + ylab("-log10 adjusted p-value") + theme(legend.position = "none", plot.title = element_text(size = rel(1.5), hjust = 0.5), axis.title = element_text(size = rel(1.25)))

slide-5
SLIDE 5

DataCamp RNA-Seq Differential Expression Analysis

Visualizing results - Volcano plot

ggplot(wt_res_all) + geom_point(aes(x = log2FoldChange, y = -log10(padj), color = threshold)) xlab("log2 fold change") + ylab("-log10 adjusted p-value") + ylim=c(0, 15) + theme(legend.position = "none", plot.title = element_text(size = rel(1.5), hjust = 0.5), axis.title = element_text(size = rel(1.25)))

slide-6
SLIDE 6

DataCamp RNA-Seq Differential Expression Analysis

Visualizing results - Expression plot

Significant results: Normalized counts for significant genes:

slide-7
SLIDE 7

DataCamp RNA-Seq Differential Expression Analysis

Visualizing results - Expression plot

top_20 <- data.frame(sig_norm_counts_wt)[1:20, ] %>% rownames_to_column(var = "ensgene") top_20 <- gather(top_20, key = "samplename", value = "normalized_counts", 2:8)

slide-8
SLIDE 8

DataCamp RNA-Seq Differential Expression Analysis

Visualizing results - Expression plot

top_20 <- inner_join(top_20, rownames_to_column(wt_metadata, var = "samplename"), by = "samplename") ggplot(top_20) + geom_point(aes(x = ensgene, y = normalized_counts, color = condition)) + scale_y_log10() + xlab("Genes") + ylab("Normalized Counts") + ggtitle("Top 20 Significant DE Genes") + theme_bw() + theme(axis.text.x = element_text(angle = 45, hjust = 1)) + theme(plot.title = element_text(hjust = 0.5))

slide-9
SLIDE 9

DataCamp RNA-Seq Differential Expression Analysis

slide-10
SLIDE 10

DataCamp RNA-Seq Differential Expression Analysis

Let's practice!

RNA-SEQ DIFFERENTIAL EXPRESSION ANALYSIS

slide-11
SLIDE 11

DataCamp RNA-Seq Differential Expression Analysis

RNA-Seq DE analysis summary

RNA-SEQ DIFFERENTIAL EXPRESSION ANALYSIS

Mary Piper

Bioinformatics Consultant and Trainer

slide-12
SLIDE 12

DataCamp RNA-Seq Differential Expression Analysis

slide-13
SLIDE 13

DataCamp RNA-Seq Differential Expression Analysis

slide-14
SLIDE 14

DataCamp RNA-Seq Differential Expression Analysis

slide-15
SLIDE 15

DataCamp RNA-Seq Differential Expression Analysis

RNA-Seq Workflow: Raw data quality control

slide-16
SLIDE 16

DataCamp RNA-Seq Differential Expression Analysis

slide-17
SLIDE 17

DataCamp RNA-Seq Differential Expression Analysis

RNA-Seq Workflow: Alignment

slide-18
SLIDE 18

DataCamp RNA-Seq Differential Expression Analysis

slide-19
SLIDE 19

DataCamp RNA-Seq Differential Expression Analysis

RNA-Seq Workflow: Quantitation

slide-20
SLIDE 20

DataCamp RNA-Seq Differential Expression Analysis

slide-21
SLIDE 21

DataCamp RNA-Seq Differential Expression Analysis

Preparation for differential expression analysis: DESeq2 object

dds <- DESeqDataSetFromMatrix(countData = rawcounts, colData = metadata, design = ~ condition)

slide-22
SLIDE 22

DataCamp RNA-Seq Differential Expression Analysis

Let's practice!

RNA-SEQ DIFFERENTIAL EXPRESSION ANALYSIS

slide-23
SLIDE 23

DataCamp RNA-Seq Differential Expression Analysis

RNA-Seq DE analysis summary 2

RNA-SEQ DIFFERENTIAL EXPRESSION ANALYSIS

Mary Piper

Bioinformatics Consultant and Trainer

slide-24
SLIDE 24

DataCamp RNA-Seq Differential Expression Analysis

DESeq workflow - normalization

dds <- estimateSizeFactors(dds) normalized_counts <- counts(dds, normalized=TRUE)

slide-25
SLIDE 25

DataCamp RNA-Seq Differential Expression Analysis

Unsupervised clustering analyses: log transformation

# Log transformation of normalized counts vsd <- vst(dds, blind=TRUE)

slide-26
SLIDE 26

DataCamp RNA-Seq Differential Expression Analysis

Unsupervised clustering analyses

slide-27
SLIDE 27

DataCamp RNA-Seq Differential Expression Analysis

Unsupervised clustering analyses - heatmap

vsd %>% assay() %>% # Extract the vst matrix from the object cor() %>% # Compute pairwise correlation values pheatmap(annotation = metadata[ , c("column_name1", "column_name2])

slide-28
SLIDE 28

DataCamp RNA-Seq Differential Expression Analysis

Unsupervised clustering analyses - pca

# PCA plotPCA(vsd, intgroup="condition")

slide-29
SLIDE 29

DataCamp RNA-Seq Differential Expression Analysis

Running the DE analysis

slide-30
SLIDE 30

DataCamp RNA-Seq Differential Expression Analysis

Running the DE analysis

# Create DESeq object dds <- DESeqDataSetFromMatrix(countData = rawcounts, colData = metadata, design = ~ source_of_variation + condition) # Run analysis dds <- DESeq(dds)

slide-31
SLIDE 31

DataCamp RNA-Seq Differential Expression Analysis

DESeq2 workflow - model

# Plot dispersion estimates plotDispEsts(dds)

slide-32
SLIDE 32

DataCamp RNA-Seq Differential Expression Analysis

DESeq2 workflow - contrasts and LFC shrinkage

slide-33
SLIDE 33

DataCamp RNA-Seq Differential Expression Analysis

DESeq2 workflow - contrasts and LFC shrinkage

# Extract results for comparison of interest res <- results(dds, contrast = c("condition_factor", "level_to_compare", "base_level"), alpha = 0.05) # Shrink the log2 foldchanges res <- lfcShrink(dds, contrast = c("condition_factor", "level_to_compare", "base_level"), res = res)

slide-34
SLIDE 34

DataCamp RNA-Seq Differential Expression Analysis

DESeq2 workflow - LFC shrinkage

# Extract all results as a data frame res_all <- data.frame(res) %>% rownames_to_column(var = "ensgene") # Add gene annotations res_all <- left_join(x = res_all, y = grcm38[, c("ensgene", "symbol", "description")], by = "ensgene") res_all <- arrange(res_all, padj)

slide-35
SLIDE 35

DataCamp RNA-Seq Differential Expression Analysis

DESeq2 workflow - results exploration

slide-36
SLIDE 36

DataCamp RNA-Seq Differential Expression Analysis

DESeq2 workflow - results exploration

# Identify significant genes res_sig <- subset(res_all, padj < 0.05)

slide-37
SLIDE 37

DataCamp RNA-Seq Differential Expression Analysis

slide-38
SLIDE 38

DataCamp RNA-Seq Differential Expression Analysis

Let's practice!

RNA-SEQ DIFFERENTIAL EXPRESSION ANALYSIS

slide-39
SLIDE 39

DataCamp RNA-Seq Differential Expression Analysis

RNA-Seq next steps

RNA-SEQ DIFFERENTIAL EXPRESSION ANALYSIS

Mary Piper

Bioinformatics Consultant and Trainer

slide-40
SLIDE 40

DataCamp RNA-Seq Differential Expression Analysis

RNA-Seq next steps

Vignette: Bioconductor support site: (tag 'DESeq2') http://bioconductor.org/packages/devel/bioc/vignettes/DESeq2/inst/doc/DESeq2.html https://support.bioconductor.org

slide-41
SLIDE 41

DataCamp RNA-Seq Differential Expression Analysis

slide-42
SLIDE 42

DataCamp RNA-Seq Differential Expression Analysis

Overview of goals

slide-43
SLIDE 43

DataCamp RNA-Seq Differential Expression Analysis

Significant genes interpretation

slide-44
SLIDE 44

DataCamp RNA-Seq Differential Expression Analysis

Significant genes interpretation

slide-45
SLIDE 45

DataCamp RNA-Seq Differential Expression Analysis

Significant genes interpretation

slide-46
SLIDE 46

DataCamp RNA-Seq Differential Expression Analysis

Conclusion

slide-47
SLIDE 47

DataCamp RNA-Seq Differential Expression Analysis

Congratulations!

RNA-SEQ DIFFERENTIAL EXPRESSION ANALYSIS