The Massive Parallel Sequencing era: "Global sequencing"
Richard Christen CNRS UMR 6543 & Université de Nice christen@unice.fr http://bioinfo.unice.fr
At the end of 2007, three next-generation sequencing platforms appeared:
using de Bruijn graphs. Genome Res. 18:821-829.
Nusbaum, and D. B. Jaffe. 2008. ALLPATHS: de novo assembly of whole-genome shotgun microreads. Genome Res. 18:810-820.
bacterial genome sequencing: millions of very short reads assembled on a desktop
highly accurate short-read assembly algorithm for de novo genomic sequencing. Genome Res. 17:1697-1706.
8 cells / ml
8 cells / ml
– they are many, everywhere
Genome Res. 2006 16: 316-322
[Figure: plot for samples FS396 and FS312, with a logarithmic axis from 1 to 100,000 and a linear axis from 1 to 19,691. Axis titles not recoverable.]
BMC Microbiology 2007, 7:108
Experiment        FS312   FS396   FS312   FS396     138    115R    112R     55R     53R
Total tags       442061  247825    4834   17665   14373   11004    9281   13901    4999
Unique tags       21529   10613    2769    8699    7167    5776    5751    7186    2655
Singleton tags    13251    7185    2396    7587    6237    5009    5040    6217    2297
% Singletons          3       3      50      43      43      46      54      45      46

Experiment           Fl      Ca      Br      Il
Total tags        28247   53245   26115   31745
Unique tags        8779   14885    7683    9486
Singleton tags     6792   11638    5598    7337
% Singletons         24      22      21      23
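The "% Singletons" figures above appear to be singleton tags divided by total tags, rounded to the nearest percent. The slides do not state the formula, so this is an inference from the numbers. A minimal Python sketch that recomputes the percentages from the transcribed counts and flags any disagreement (the names ROWS, percent_singletons and mismatches are illustrative only):

```python
# Tag counts transcribed from the slides above.
# Each row: (experiment, total tags, unique tags, singleton tags, % singletons as shown).
ROWS = [
    ("FS312", 442061, 21529, 13251, 3),
    ("FS396", 247825, 10613, 7185, 3),
    ("FS312", 4834, 2769, 2396, 50),
    ("FS396", 17665, 8699, 7587, 43),
    ("138", 14373, 7167, 6237, 43),
    ("115R", 11004, 5776, 5009, 46),
    ("112R", 9281, 5751, 5040, 54),
    ("55R", 13901, 7186, 6217, 45),
    ("53R", 4999, 2655, 2297, 46),
    ("Fl", 28247, 8779, 6792, 24),
    ("Ca", 53245, 14885, 11638, 22),
    ("Br", 26115, 7683, 5598, 21),
    ("Il", 31745, 9486, 7337, 23),
]


def percent_singletons(singletons, total):
    """Singleton tags as a percentage of all tags, rounded to an integer.

    Assumption: the denominator is total tags, not unique tags.
    """
    return round(100 * singletons / total)


def mismatches(rows):
    """Return the experiments whose recomputed percentage differs from the slide."""
    return [
        name
        for name, total, _unique, singletons, shown in rows
        if percent_singletons(singletons, total) != shown
    ]


if __name__ == "__main__":
    for name, total, _unique, singletons, shown in ROWS:
        pct = percent_singletons(singletons, total)
        print(f"{name:>6}  total={total:>7}  singletons={pct:>3}%  (slide: {shown}%)")
    print("mismatches:", mismatches(ROWS))
```

Under this assumption every recomputed value agrees with the slide. The two deeply sequenced FS312 and FS396 runs (several hundred thousand tags each) show about 3% singletons, against 21 to 54% in the smaller experiments.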
length >100 nt presently deposited have a taxonomic description down to the genus level, while 383,570 sequences (57 %) have "environmental samples" as sole description.
– The raw data.
– Data with final annotations.
– Intermediate calculations and results.
– Entrez is barely usable.
– SRS is problematic.
– ACNUC works quite well but is not widely supported.