Introduction to transcriptome analysis using high- throughput sequencing technologies
- D. Puthier 2015
Introduction to transcriptome analysis using high- throughput - - PowerPoint PPT Presentation
Introduction to transcriptome analysis using high- throughput sequencing technologies D. Puthier 2015 Main objectives of transcriptome analysis Understand the molecular mechanisms underlying gene expression Interplay between
landscape on gene expression
Cyanine 5 (Cy5) Cyanine 3 (Cy3) Scanning (ex: Genepix)
Cy-3:
Cy-5:
■ Gene: IL2RA
http://www.bioconductor.org/help/course-materials/2009/EMBLJune09/Talks/RNAseq-Paul.pdf
(a) Comparison of two brain technical replicate RNA- Seq determinations for all mouse gene models (from the UCSC genome database), measured in reads per kilobase of exon per million mapped sequence reads (RPKM), which is a normalized measure of exonic read density; R2 = 0.96. (c) Six in vitro–synthesized reference transcripts of lengths 0.3–10 kb were added to the liver RNA sample (1.2 104 to 1.2 109 transcripts per sample; R2 > 0.99).
http://bgiamericas.com/wp-content/uploads/2011/12/RNA-Aeq-100-ng-20111209. pdf
http://www.bioconductor.org/help/course-materials/2009/EMBLJune09/Talks/RNAseq-Paul. pdf
○ ”The relationship is not quite linear … but the vast majority of the expression values are similar between the methods. Scatter increases at low expression … as background correction methods for arrays are complicated when signal levels approach noise
events become a source of error in the quantification of rare transcripts ” ○ ”Given the substantial agreement between the two methods, the array data in the literature should be durable”
Comparison of array and RNA-Seq data for measuring differential gene expression in the heads of male and female D. pseudoobscura
■ Header ■ Sequence ■ + (optional header) ■ Quality (default Sanger-style)
@QSEQ32.249996 HWUSI-EAS1691:3:1:17036:13000#0/1 PF=0 length=36 GGGGGTCATCATCATTTGATCTGGGAAAGGCTACTG + =.+5:<<<<>AA?0A>;A*A################ @QSEQ32.249997 HWUSI-EAS1691:3:1:17257:12994#0/1 PF=1 length=36 TGTACAACAACAACCTGAATGGCATACTGGTTGCTG + DDDD<BDBDB??BB*DD:D#################
Quality Position in read Nb Reads Mean Phred Score Position in read Look also at over-represented sequences
1 2 3 4 5 6 7 acaacg$ caacg$a aacg$ac acg$aca cg$acaa g$acaac $acaacg acaacg$ $acaacg aacg$ac acaacg$ acg$aca caacg$a cg$acaa g$acaac T BWT (T) gc$aaac 7 3 1 4 2 5 6
7 3 1 4 2 5 6
Read pair Gapped alignment