Heuristic Alignment and Searching
Mark Voorhies 3/28/2012
Mark Voorhies Heuristic Alignment and Searching
Heuristic Alignment and Searching Mark Voorhies 3/28/2012 Mark - - PowerPoint PPT Presentation
Heuristic Alignment and Searching Mark Voorhies 3/28/2012 Mark Voorhies Heuristic Alignment and Searching Types of alignments Global Alignment Each letter of each sequence is aligned to a letter or a gap ( e.g. , Needleman-Wunsch). Local
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
f o r i i n 50 100 150 200 250 300 350 400 450; do head −n $ i −q G217B iron . f a s t a Pb01 iron . f a s t a > temp . f a s t a ; time c l u s t a l w −i n f i l e =temp . f a s t a −type= DNA −a l i g n ; done
Sequences (1:2) Aligned. Score: Guide tree file created: [temp.dnd] There are 1 groups Start of Multiple Alignment Aligning... Group 1: Delayed Alignment Score 7238 CLUSTAL-Alignment file created [temp.aln] real 0m3.400s user 0m3.388s sys 0m0.012s Mark Voorhies Heuristic Alignment and Searching
#!/ usr / bin / env python # Time−stamp : <ParseTimes . py 2011−03−29 21:10:59 Mark Voorhies> ””” Parse w a l l times from a log f i l e
s t d i n and w r i t e them as a CSV formatted column f o r Excel / OpenOffice / etc
stdout . I f command l i n e arguments are given , t r e a t them as a second
column . ””” from csv import w r i t e r import re t i m e r e = re . compile ( ”ˆ r e a l .∗(?P <minutes >[\d]+)m(?P <seconds >[\d ]+\.[\ d]+) s ” , re .M) i f ( name == ” m a i n ” ) : import s y s args = s y s . argv [ 1 : ]
i = 0 f o r t i n t i m e r e . f i n d i t e r ( s y s . s t d i n . read ( ) ) : t r y : y = args [ i ] i += 1 except I n d e x E r r o r : y = ””
( f l o a t ( t . group ( ” minutes ”))∗60+ f l o a t ( t . group ( ” seconds ” ) ) , y ) ) del
Mark Voorhies Heuristic Alignment and Searching
data < − read . csv ( ” t i m i n g s . csv ” , header = FALSE , c o l . names = c ( ” t ” , ”n” )) x < − log ( data $n∗80) y < − log ( data $ t / 60) f < − lm ( y ˜ x ) x0 < − 0:40000 a < − exp ( f $ c o e f f [ 1 ] ) b < − f $ c o e f f [ 2 ] pdf ( ” ClustalwTimings . pdf ” ) p l o t ( data $n∗80 , data$ t / 60 , x la b = ” l e n g t h /bp” , y la b = ” time / minutes ” , main = ”CLUSTALW t i m i n g s
I n t e l Core2 T7300@2 .00GHz , 32 b i t ” ) p o i n t s ( x0 , a∗x0ˆb , c o l = ” blue ” , type = ” l ” ) legend ( ” t o p l e f t ” , c ( ”y = ( 1 . 8 e−9)x ˆ ( 2 . 0 8 ) ” ) , c o l = ” blue ” , l t y = 1) dev . o f f ( ) Mark Voorhies Heuristic Alignment and Searching
10000 15000 20000 25000 30000 35000 1 2 3 4 5
CLUSTALW timings on Intel Core2 T7300@2.00GHz, 32bit
length/bp time/minutes y = (1.8e−9)x^(2.08)
Mark Voorhies Heuristic Alignment and Searching
Dec 1982 Dec 1990 Dec 1994 Dec 1998 Dec 2002 Dec 2006 Dec 2010 0.0e+00 2.0e+10 4.0e+10 6.0e+10 8.0e+10 1.0e+11 1.2e+11
source: ftp://ftp.ncbi.nih.gov/genbank/gbrel.txt
Genbank Release Size/bp
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
time bl2seq -p blastn -i G217B_iron.fasta -j Pb01_iron.fasta -e 1e-6 > temp.blastn real 0m0.342s user 0m0.080s sys 0m0.032s Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching
Mark Voorhies Heuristic Alignment and Searching