SLIDE 32 Where are the indels coming from?
- Allowing a small number of indels we assigned 3,748,614 (over 99.5%) of the
alignments to genomic fragments flanked by pairs of identical 4-cutter sites
- Only 1% of the fragments attributed to the sticky ends 4-cutter,GATC,
contained indels, whereas the three blunt ends 4-cutter (GTAC8,GGCC8, AGCT) had much higher rates of indels 29.5-54%
- If a base is lost at the sticky end ligation is significantly compromised:
- However, a lost bp at the blunt end has no effect on ligation:
- Consistently, GATC libraries gave much lower cloning efficiencies than the 3
blunt cutters
- The indels are most likely not generated during sequencing, but then why
haven’t we observed them when using Sanger sequencing?
CTAG 5’ 3’ GATC 3’ 5’ CTAG GATC double stranded vector with ligated insert 5’ 3’ 3’ 5’ double stranded vector with ligated insert CC GG GG CC