Exploring short read sequences
Martin Morgan1 Fred Hutchinson Cancer Research Institute, Seattle, WA June 27-July 1, 2011
1mtmorgan@fhcrc.org
Exploring short read sequences Martin Morgan 1 Fred Hutchinson - - PowerPoint PPT Presentation
Exploring short read sequences Martin Morgan 1 Fred Hutchinson Cancer Research Institute, Seattle, WA June 27-July 1, 2011 1 mtmorgan@fhcrc.org Topics RNA-seq Experimental design Quality assessment Counting reads Microbiome
1mtmorgan@fhcrc.org
◮ Replication ◮ Randomization and
◮ Statistical power ◮ Library complexity
◮ Estimation biases ◮ Legitimate comparison
◮ Replication ◮ Randomization and
◮ Statistical power ◮ Library complexity
◮ Estimation biases ◮ Legitimate comparison
◮ Replication ◮ Randomization and
◮ Statistical power ◮ Library complexity
◮ Estimation biases ◮ Legitimate comparison
Number of occurrences of each read (log10) Cumulative proportion of reads
0.0 0.2 0.4 0.6 0.8 1.0 1 2 3 4
1 2
1 2 3 4
3 4 5
1 2 3 4
6 7
1 2 3 4 0.0 0.2 0.4 0.6 0.8 1.0
8
◮ Replication ◮ Randomization and
◮ Statistical power ◮ Library complexity
◮ Estimation biases ◮ Legitimate comparison
0.0 0.2 0.4 0.6 0.8 1.0 2.0 2.2 2.4 2.6
◮ Replication ◮ Randomization and
◮ Statistical power ◮ Library complexity
◮ Estimation biases ◮ Legitimate comparison
◮ Replication ◮ Randomization and
◮ Statistical power ◮ Library complexity
◮ Estimation biases ◮ Legitimate comparison
G1 F1 Case I & II : Single read, single gene, single feature Case III, IV & V : Single read, single gene, multiple features Case VI : Single read, multiple genes, multiple features F9 G6 Case VII : Split read, single gene, single feature Case VIII & IX : Split read, single or multiple genes, multiple features G2 F2 G3 F3 F4 G4 F5 F6 G5 F7 F8 F10 F11 G7 G8 F12 G9 F13 G8 F12 F14 G8 F12 G10 F15 G11 F16
◮ "none" → discard ◮ "divide" → equal divsion
◮ "uniqueDisjoint" → ◮ Unique disjoint overlap →
◮ Otherwise discard
◮ Bar codes ◮ Primers