smith waterman algorithm
play

Smith-Waterman Algorithm AMPP 0708-Q1 Eduard Ayguade Juan J. - PowerPoint PPT Presentation

Outline Introduction Smith-Waterman Algorithm Smith-Waterman Algorithm AMPP 0708-Q1 Eduard Ayguade Juan J. Navarro Dani Jimenez-Gonzalez October 4, 2007 Outline Introduction Smith-Waterman Algorithm Introduction 1 Why compare sequences


  1. Outline Introduction Smith-Waterman Algorithm Smith-Waterman Algorithm AMPP 0708-Q1 Eduard Ayguade Juan J. Navarro Dani Jimenez-Gonzalez October 4, 2007

  2. Outline Introduction Smith-Waterman Algorithm Introduction 1 Why compare sequences of aminoacids? How to compare sequences? Alignment Scoring the relationships How to find the best alignment? Smith-Waterman Algorithm 2

  3. Outline Introduction Smith-Waterman Algorithm Why compare sequences of aminoacids? Proteins are made by aminoacid sequences t:c g g g t a t c c a a Similar sequences of aminoacids → similar protein structures t:c g g g t a t c c a a s:c c c t a g g t c c c a Evolutionary perspective: Mutations?, insertions?, etc. t 1 = g mutated to s 1 = c ? s 1 = c has been an insertion? Some evolution are more important/likely than others

  4. Outline Introduction Smith-Waterman Algorithm How to compare sequences? Alignment An alignment of two sequences t and s must satisfy: All symbols (residues) in the two sequences have to be in the alignment, and in the same order they appear in the sequences We can align one symbol from one sequence with one from the another A symbol can be aligned with a blank (’-’) Two blanks cannot be aligned t: c g g g t a t c c a a s: c c c t a g g t c c c a t: c g g g t a - - t - c c a a s: c c c - t a g g t c c c - a

  5. Outline Introduction Smith-Waterman Algorithm What is the BEST alignment? Example t: c g g g t a t c c a a s: c c c t a g g t c c c a

  6. Outline Introduction Smith-Waterman Algorithm What is the BEST alignment? Example t: c g g g t a t c c a a s: c c c t a g g t c c c a t: c g g g t a - - t - c c a a s: c c c - t a g g t c c c - a

  7. Outline Introduction Smith-Waterman Algorithm What is the BEST alignment? Example t: c g g g t a t c c a a s: c c c t a g g t c c c a t: c g g g t a - - t - c c a a s: c c c - t a g g t c c c - a t: c g g g t a - - - t c c a a s: c c - - c t a g g t c c c a

  8. Outline Introduction Smith-Waterman Algorithm What is the BEST alignment? Example t: c g g g t a t c c a a s: c c c t a g g t c c c a t: c g g g t a - - t - c c a a s: c c c - t a g g t c c c - a Which is the best? t: c g g g t a - - - t c c a a s: c c - - c t a g g t c c c a t: c - g g g t a - - t c c a a s: c c - - c t a g g t c c c a

  9. Outline Introduction Smith-Waterman Algorithm Scoring the relationships Needed a scoring matrix We will be able to find a optimal solution for the scoring matrix at hand Figure: BLOSUM scoring matrix, S.

  10. Outline Introduction Smith-Waterman Algorithm What is the BEST alignment (for that Score Matrix)? Example t: c g g g t a t c c a a s: c c c t a g g t c c c a t : c g g g t a t c c a a − − − +12 − 3 − 3 − 1 +5 +5 − 1 − 1 +5 − 1 +12 +12 − 1 +5 45 s : c c c t a g g t c c c a − − t : c g g g t a t c c a a − − − +12 − 3 − 1 − 1 − 1 +0 − 1 − 1 − 1 +5 +12 +12 − 1 +5 36 s : c c c t a g g t c c c a − − t : c g g g t a t c c a a − − − +12 − 1 − 1 − 1 − 3 +5 +5 − 1 − 1 +5 +12 +12 − 1 +5 47 s : c c c t a g g t c c c a − −

  11. Outline Introduction Smith-Waterman Algorithm How to find the best alignment? Homology search methods begin with DP algorithms Needleman-Wusch: global search Smith-Waterman (SW): local search Faster but less sensitive for larger datasets FASTA BLAST Optimal spaced seeds of pattern-writer increase Speed and sensitivity Similar to SW Examples: Pattern Hunter and BLAT SW sensitivity BLAST speed

  12. Outline Introduction Smith-Waterman Algorithm Smith-Waterman Algorithm Figure: Computation Matrix alginment, M N x N integer matrix N is sequence length (both s and t ) Compute M [ i ][ j ] based on Score Matrix and optimum score compute so far (DP)

  13. Outline Introduction Smith-Waterman Algorithm Smith-Waterman Algorithm: Understanding Matrix Alignment t : − − − − − − − − s : c c c t a g g t Figure: Aligning s to gaps

  14. Outline Introduction Smith-Waterman Algorithm Smith-Waterman Algorithm: Understanding Matrix Alignment t : c g g g t a t ... s : − − − − − − − ... Figure: Aligning t to gaps

  15. Outline Introduction Smith-Waterman Algorithm Smith-Waterman Algorithm: How to compute cell score? How to find M [ i ][ j ]? Three ways to finish the alignment of s 0 .. i and t 0 .. j s i 1 Score t j 2 Gap s i in t t j − 3 Gap s i − in s t j

  16. Outline Introduction Smith-Waterman Algorithm Smith-Waterman Algorithm: How to compute cell score? How to find M [ i ][ j ]? Three ways to finish the alignment of s 0 .. i and t 0 .. j s i 1 M [ i − 1][ j − 1] + S [ s i ][ t j ] t j s i 2 M [ i − 1][ j ] − g t j − s i − 3 M [ i ][ j − 1] − g t j

  17. Outline Introduction Smith-Waterman Algorithm Smith-Waterman Algorithm: Scoring Process Element Computation M [ i ][ j ]: M [ i ][0] = 0 M [0][ j ] = 0  0   M [ i − 1][ j − 1] + S [ s i ][ t j ] if s i t j  M [ i ][ j ] = max M [ i − 1][ j ] − d if s i -   M [ i ][ j − 1] − d if - t j 

  18. Outline Introduction Smith-Waterman Algorithm Smith-Waterman Algorithm: Backtracking Process If we want to find BEST local alignment... Find Score opt and then traceback N Score opt = i , j =1 M [ i ][ j ] max

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend