1 / 36 Genome Informatics
Detecting SNVs with Next-generation-Sequencing
Johannes K¨
- ster
Detecting SNVs with Next-generation-Sequencing Johannes K oster - - PowerPoint PPT Presentation
Detecting SNVs with Next-generation-Sequencing Johannes K oster Genome Informatics, University Duisburg-Essen February 25, 2014 1 / 36 Genome Informatics Outline 1 Next-Generation-Sequencing 2 Workflow 3 Read Mapping 4 Post-Processing Reads
1 / 36 Genome Informatics
2 / 36 Genome Informatics
3 / 36 Genome Informatics
4 / 36 Genome Informatics
5 / 36 Genome Informatics
Illumina, 2013
6 / 36 Genome Informatics
7 / 36 Genome Informatics
8 / 36 Genome Informatics
9 / 36 Genome Informatics
10 / 36 Genome Informatics
11 / 36 Genome Informatics
11 / 36 Genome Informatics
11 / 36 Genome Informatics
11 / 36 Genome Informatics
11 / 36 Genome Informatics
12 / 36 Genome Informatics
13 / 36 Genome Informatics
14 / 36 Genome Informatics
15 / 36 Genome Informatics
16 / 36 Genome Informatics
Before indel re-alignment Aer indel re-alignment Read 1 Read 1 Read 2 Read 2
17 / 36 Genome Informatics
18 / 36 Genome Informatics
19 / 36 Genome Informatics
A C G T A
0.17 0.25 C 0.35
0.54 G 0.32 0.05
T 0.46 0.22 0.32
20 / 36 Genome Informatics
21 / 36 Genome Informatics
22 / 36 Genome Informatics
23 / 36 Genome Informatics
24 / 36 Genome Informatics
25 / 36 Genome Informatics
26 / 36 Genome Informatics
m=0 P(q = m)P(D|q = m)
26 / 36 Genome Informatics
m=0 P(q = m)P(D|q = m)
27 / 36 Genome Informatics
28 / 36 Genome Informatics
Wikipedia, 2013
29 / 36 Genome Informatics
30 / 36 Genome Informatics
31 / 36 Genome Informatics
32 / 36 Genome Informatics
33 / 36 Genome Informatics
Known variant Feature Transcript Gene Annotation Variant Call Run Capture kit Disease Sample Patient Library Source Unit
1 1 1 1 1 1 1 1 1 1 1 1 1 1 n n n n n n n n n n n n n n
34 / 36 Genome Informatics
Gene Common Sample Transcripts Nucleotide Codon Amino acid Type Region Splice Quality 1 ARID1A 1 M45227 001, 201, 002, more C → T CGA→ TGA R → * Nonsense Coding 75.19 1:27106354 2 ARID1B 1 M45224 201, 203, 009, more C → T CGA→ TGA R → * Nonsense Coding 37.98 6:157406006 3 SMARCB1 1 M44263 005, 001, 002, 003 G → A CGG→ CAG R → Q Missense Coding Splice 224.79 22:24176330 Location
35 / 36 Genome Informatics
36 / 36 Genome Informatics