Normalization of f kazak RNA-seq data usin ing TMM and DESeq
- Ying Sha, Lu Wang
1
usin ing TMM and DESeq -Ying Sha, Lu Wang 1 Extreme low library - - PowerPoint PPT Presentation
Normalization of f kazak RNA-seq data usin ing TMM and DESeq -Ying Sha, Lu Wang 1 Extreme low library size of two samples before filtering after filtering Density Density Log2 (raw count) Log2 (raw count) Sen_treated_1 and
1
criteria that in each of the library, raw read count should be more than 3.
2
Log2 (raw count) Log2 (raw count)
Density Density before filtering after filtering
3
should be more than 3(the dashed line represents three read counts).
log2 Observed ERCC Counts log2 Expected ERCC Counts
4
TMM normalization RPKM normalization
Log2(normalized counts)
5
DESeq normalization LOESS normalization RPKM + LOESS normalization
A B C D E F Log2(normalized counts)
w/ ERCC
Raw Counts (filtered)
TMM normalization Raw Counts (filtered) RPKM normalization
Density
6
DESeq normalization LOESS normalization RPKM + LOESS normalization
A B C D E F Log2 counts
w/ ERCC
7
TMM normalization Raw Counts RPKM normalization
Log2(normalized counts)
8
DESeq normalization LOESS normalization RPKM + LOESS normalization
A B C D E F Log2(normalized counts)
w/ ERCC
TMM normalization Raw Counts RPKM normalization
Density
9
DESeq normalization LOESS normalization RPKM + LOESS normalization
A B C D E F Log2 counts
w/ ERCC
10
log2 Normalized ERCC Counts log2 Expected ERCC Counts log2 Normalized ERCC Counts log2 Expected ERCC Counts TMM Normalization LOESS Normalization
w/ ERCC
Both normalization methods with or without ERCC did not change linear relationship between expected ERCC counts/concentrations and normalized counts.
11
normalized counts
TMM and DESeq normalization did not adjust the housekeeping genes very well. LOESS with ERCC actually increased the variation.
normalized counts
A B C D E F G H
Not normalized w/ ERCC
TMM and DESeq normalization did not adjust the housekeeping genes very well.
12
normalized counts
A B C D
Not normalized w/ ERCC
13
14
normalized counts normalized counts
A B C D E F G H
w/ ERCC Not normalized
15
normalized counts normalized counts
A B C D E F G H TMM and DESeq normalization reduced the variation for new housekeeping genes.
w/ ERCC Not normalized
16
normalized counts normalized counts
A B C D E F G H TMM and DESeq normalization reduced the variation for new housekeeping genes.
w/ ERCC Not normalized
17
normalized counts normalized counts
A B C D E F G H TMM and DESeq normalized data is consistent with our previous findings about these genes of interest.
w/ ERCC Not normalized
18
sequenced on the same day.
variance between each other.
replicates from a given condition, for example SR_treated_1 and SR_treated_2 showed reduced difference in variance between replicates. However some replicates, such as SEN_treated_1 and SEN_treated_2 still shows large difference in variance.
were reduced after TMM normalization, however, the we can see that the error bars for SEN_treated and SEN_untreated are still large.
such as SR_treated_1 and SR_treated_2 were from the same patient
19
normalization.
expression analysis to call differential expression
SEN_treated_2(the larger library) to represent SEN_treated or the combination of SEN_treated_1 and SEN_treated_2
with missing replicates.