1 / 23 Genome Informatics
Massively parallel read mapping on graphics cards
Johannes Köster
May 15, 2014
Outline
1 Next-Generation-Sequencing of DNA
2 Read Mapping
3 Algorithm
4 Results
Illumina, 2013
◮ Smith-Waterman algorithm
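As a reminder of what the Smith-Waterman algorithm computes, here is a minimal sketch of local alignment scoring between a read and a reference. The scoring values (`match=2`, `mismatch=-1`, `gap=-2`) and the function name are assumptions chosen for illustration; the slides do not state them. The quadratic running time of this dynamic program is the reason read mappers typically put a filtration step in front of it.

```python
def smith_waterman(read, reference, match=2, mismatch=-1, gap=-2):
    """Return (best_score, end_in_read, end_in_reference) of the best local alignment.

    End positions are 1-based (exclusive end offsets into the strings).
    Scoring parameters are illustrative assumptions, not taken from the slides.
    """
    cols = len(reference) + 1
    prev = [0] * cols  # previous row of the DP matrix
    best = (0, 0, 0)
    for i in range(1, len(read) + 1):
        curr = [0] * cols
        for j in range(1, cols):
            sub = match if read[i - 1] == reference[j - 1] else mismatch
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, prev[j - 1] + sub, prev[j] + gap, curr[j - 1] + gap)
            if curr[j] > best[0]:
                best = (curr[j], i, j)
        prev = curr
    return best


if __name__ == "__main__":
    print(smith_waterman("ACGT", "TTACGTTT"))
```

Only two rows of the matrix are kept, which is enough for the score and end position; recovering the full alignment would need a traceback over the complete matrix.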
◮ PEANUT – the ParallEl AligNment UTility
◮ size $4^q + |T|$
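The size bound above matches a classical q-gram index: an address table with one entry per possible q-gram plus one position per text offset. The following is a minimal sketch under that assumption; the function names and the base encoding (A=0, C=1, G=2, T=3) are hypothetical and not taken from the slides.

```python
def qgram_code(qgram):
    """Encode a q-gram over ACGT as a base-4 integer (assumed encoding)."""
    code = 0
    for base in qgram:
        code = code * 4 + "ACGT".index(base)  # raises ValueError on other symbols
    return code


def build_qgram_index(text, q):
    """Return (starts, positions): 4**q + 1 bucket offsets and all q-gram positions."""
    codes = [qgram_code(text[p:p + q]) for p in range(len(text) - q + 1)]
    starts = [0] * (4 ** q + 1)
    for c in codes:
        starts[c + 1] += 1
    for c in range(4 ** q):  # prefix sums turn counts into bucket offsets
        starts[c + 1] += starts[c]
    positions = [0] * len(codes)
    fill = starts[:-1]  # slicing copies, so starts stays intact
    for p, c in enumerate(codes):
        positions[fill[c]] = p
        fill[c] += 1
    return starts, positions


def qgram_lookup(starts, positions, qgram):
    """Return all text positions at which the given q-gram occurs."""
    c = qgram_code(qgram)
    return positions[starts[c]:starts[c + 1]]


if __name__ == "__main__":
    starts, positions = build_qgram_index("GAAAGAAA", 4)
    print(qgram_lookup(starts, positions, "GAAA"))
```

The address table dominates memory for large q even when most q-grams never occur in the text.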
◮ size $2/w \cdot 4^q + \min\{4^q, |T|\} + |T|$
[Figure: lookup of q-gram code 228 via word 228 / 32 and bit 228 % 32 (found)]
[Figure: index example for the q-gram GAAA, showing the arrays I, S, S' and O]
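The slides name the arrays I, S, S' and O and give a size bound, but not their exact contents. The sketch below is one plausible layout, assumed here for illustration: I holds one bit per possible q-gram packed into words of w bits, S holds for each word the number of set bits before it, S' holds for each occurring q-gram its start in O, and O holds the occurrence positions. All function names and the base encoding are hypothetical.

```python
W = 32  # assumed word size, as in the 228 / 32 and 228 % 32 example


def qgram_code(qgram):
    """Encode a q-gram over ACGT as a base-4 integer (assumed encoding)."""
    code = 0
    for base in qgram:
        code = code * 4 + "ACGT".index(base)
    return code


def popcount(x):
    return bin(x).count("1")


def build_qgroup_index(text, q, w=W):
    """Build (I, S, S_prime, O) under the layout assumed in the lead-in."""
    occ = {}
    for p in range(len(text) - q + 1):
        occ.setdefault(qgram_code(text[p:p + q]), []).append(p)
    n_groups = (4 ** q + w - 1) // w
    I = [0] * n_groups
    for code in occ:
        I[code // w] |= 1 << (code % w)  # mark q-gram as occurring
    S, rank = [0] * n_groups, 0
    for g in range(n_groups):
        S[g] = rank  # occurring q-grams in all earlier groups
        rank += popcount(I[g])
    S_prime, O = [], []
    for code in sorted(occ):
        S_prime.append(len(O))
        O.extend(occ[code])
    S_prime.append(len(O))  # sentinel closing the last bucket
    return I, S, S_prime, O


def qgroup_lookup(index, qgram, w=W):
    """Return the positions of a q-gram, or [] if its bit in I is not set."""
    I, S, S_prime, O = index
    group, bit = divmod(qgram_code(qgram), w)
    if not (I[group] >> bit) & 1:
        return []
    # Rank of this q-gram among all occurring q-grams.
    rank = S[group] + popcount(I[group] & ((1 << bit) - 1))
    return O[S_prime[rank]:S_prime[rank + 1]]


if __name__ == "__main__":
    index = build_qgroup_index("GAAAGAAA", 4)
    print(qgroup_lookup(index, "GAAA"))
```

Under this layout, I and S each take $4^q / w$ words, S' has one entry per occurring q-gram (at most $\min\{4^q, |T|\}$, plus a sentinel) and O has at most $|T|$ entries, which is consistent with the size bound stated on the slide.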
[Figure: bit-parallel alignment example on the letters T G T C T A T G T A, with +1 score changes and the bit operations +, <<, & and ^]
1 Myers, 1999. J. ACM 46.
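The operations +, <<, & and ^ in the figure are the building blocks of the bit-vector algorithm of Myers (1999). The following is a minimal sketch of its approximate-matching variant, reporting every text position where the pattern ends with edit distance at most k. The function name and the return format are assumptions for illustration, not the interface used in the talk.

```python
def myers_search(pattern, text, k):
    """Return [(end_index, distance)] for text positions where the pattern
    matches with edit distance <= k (the match may start anywhere in text)."""
    m = len(pattern)
    if m == 0:
        raise ValueError("pattern must not be empty")
    mask = (1 << m) - 1   # keeps Python's unbounded ints at m bits
    high = 1 << (m - 1)   # bit of the last pattern row
    peq = {}              # per-symbol match bit masks
    for i, c in enumerate(pattern):
        peq[c] = peq.get(c, 0) | (1 << i)
    pv, mv, score = mask, 0, m  # vertical +1 / -1 deltas, score of last row
    hits = []
    for j, c in enumerate(text):
        eq = peq.get(c, 0)
        xv = eq | mv
        xh = ((((eq & pv) + pv) ^ pv) | eq) & mask
        ph = (mv | ~(xh | pv)) & mask  # horizontal +1 deltas
        mh = pv & xh                   # horizontal -1 deltas
        if ph & high:
            score += 1
        elif mh & high:
            score -= 1
        ph = (ph << 1) & mask
        mh = (mh << 1) & mask
        pv = (mh | ~(xv | ph)) & mask
        mv = ph & xv
        if score <= k:
            hits.append((j, score))
    return hits


if __name__ == "__main__":
    print(myers_search("TGTCT", "ATGTCTA", 1))
```

Python integers are arbitrary precision, so the explicit masking emulates a machine word of m bits; on hardware with fixed-width words, patterns longer than the word size need a blocked variant.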
[Figure: processing timeline from start to stop with the stages load read sequences, filtration, validation (filtration and validation repeated several times), postprocessing and write hits]
[Figure: plot over block size (x-axis, 100 to 600) with y-axis values from 0.2 to 1.0; series: filter_reference, create_queries_index, validate_hits]
2 Holtgrewe et al. 2011. BMC Bioinformatics.