Pasta Modifications
Kodi Collins CS 466
Pasta Modifications Kodi Collins CS 466 Motivation: Multiple - - PowerPoint PPT Presentation
Pasta Modifications Kodi Collins CS 466 Motivation: Multiple Sequence Alignment Evolution Detection of Selection Alleles in populations MSA on Coding Regions Proportion synonymous and non-synonymous substitutions 3
Kodi Collins CS 466
○ Alleles in populations
○ Advantageous ○ Deleterious ○ Neutral
○ Positive ○ Negative ○ Balancing ○ Diversifying ○ Stabilizing
○ MSA on Coding Regions ○ Proportion synonymous and non-synonymous substitutions ○ Synonymous = Neutral ○ Differing rates means some selection
○ Substitution favored over indels ○ Substitutions not neutral ○ False Positive Detection of Selection
○ Assumed opposite effect but we don’t know
1. Build Guide Tree 2. Decompose 3. Align 4. Merge 5. Transitivity 6. Repeat 1-5
Image: http://www.cs.utexas.edu/~phylo/software/pasta/pasta.pdf
1. Build Guide Tree 2. Decompose 3. Align 4. Merge 5. Transitivity 6. Repeat 1-5
Image: http://www.cs.utexas.edu/~phylo/software/pasta/pasta.pdf
= 4.5 = 3.5 = 2.5 = 4 = 1 = 3 = 2 = 5 = 4 = 1.5 B A D C E
AC BC CD DE
A A A A B B C D E C B D B E C D C E D E
Percentage of gaps:
Other Potential Considerations:
Maximum Likelihood:
○ Default Pasta Best ○ no/little improvement
○ Local alignments where transitivity ‘fails’ ○ Use Muscle not Opal ○ …
Mirarab, S., N. Nguyen, and T. Warnow, 2014. “PASTA: ultra-large multiple sequence alignment.” Proceedings RECOMB
Warnow, Tandy. Computational Phylogenetics: An Introduction to Designing Methods for Phylogeny Estimation. N.p.: Cambridge U Press, 2017. Print. Mirarab, S. Presentation on Pasta at RECOMB 2014: http://www.cs.utexas.edu/~phylo/software/pasta/pasta.pdf