FastTree 2 Approximately Maximum-Likelihood Trees for Large - PowerPoint PPT Presentation

Nov 18, 2022 •170 likes •304 views

FastTree 2 Approximately Maximum-Likelihood Trees for Large Alignments Morgan N. Price, Paramvir S. Dehal, Adam P. Arkin Presented by Arjun P. Athreya April 21, 2015 CS 598AGB Fast Tree 2 Five stages of computation Heuristic

FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments Morgan N. Price, Paramvir S. Dehal, Adam P. Arkin Presented by Arjun P. Athreya April 21, 2015 CS 598AGB
Fast Tree 2  Five stages of computation – Heuristic neighbor-joining (NJ) – Tree length reductions • Nearest-neighbor interchanges (NNI) • Subtree-prune-regraft (SPR) moves • Distance model – Maximum Likelihood with NNIs – Local support values
Heuristic NJ C A  Produces rough topology B D  Optimization: – Profile for internal nodes instead of a distance-matrix (space saving!) – Remembers best join for each node – Remembers top pair-wise distances (space saving!) – Updates best join for a node as it traverses
Tree-length reductions : NNI  Topology refinement C A ? B D A C B A ? ? B D C D  Optimization: work with profiles, than pairwise distances (space saving!) – – 2 log(N) rounds of NNI  Space: Time:
SPR moves  A subtree is removed from the tree, reinserted somewhere else B A A C D C B E E D  Optimization: – Consider shortest SPRs first, and then extends the promising candidates (space savings!) – For each subtree, only two SPR moves (time saving!)
Maximum Likelihood  Improve tree-topology and branch lengths  Jukes-Cantor model, accounts for variable rates (20 categories, geometrically distributed)  Operation: – Likelihood of trees generated using NNI – Estimate branch lengths  Optimizations: – Stop NNI if likelihood of rearrangements are not improving – NNI restricted to 2log(N) – Skip SPR in parts of tree that did not improve in recent rounds
Results: Metric: RF distances FastTree outperforms other tools which don’t use SPR’s
Results: likelihoods on biological data  RAxML still better  Exhaustive ML search still wins
Results: RAxML vs FastTree2 • But, FastTree found 96-98% of splits RAxML found • Heuristics did not affect the results much and performed as expected compared to simulated data
Results: Runtime Would take years!
Results: Likelihood over time RAxML with same starting tree as FastTree shows similar improvement in likelihood with time
Conclusion  FastTree2 makes intelligent decisions on improving speed while maintaining pretty good accuracy  Impact of heuristics, computational tricks do not impact results a lot  RAxML is still a winner for accuracy, but at the cost of time (may never complete for large datasets) – Personal experience on running FastTree 2 and RAxML for course project, 1 minute vs 30 minutes on small amino acid data

Recommend

RAxML vs. FastTree: A Comparison of Two Maximum Likelihood Phylogeny Estimation Methods Mia

RAxML vs. FastTree: A Comparison of Two Maximum Likelihood Phylogeny Estimation Methods Mia Schoening RAxML vs. FastTree RAxML FastTree Implements standard SPR-based Uses combination of Neighbor-Joining, hill-climbing

1.08k views • 12 slides

Maximum Likelihood properties Maximum parsimony Maximum likelihood Experimental design

N. Salamin c Sept 2007 Lecture outline Maximum likelihood in phylogenetics Definition Phylogenetics and bioinformatics for evolution Maximum likelihood and models Likelihood of a tree Computational complexity Statistical Maximum

703 views • 30 slides

Phylogenetic trees IV Maximum Likelihood Gerhard Jger ESSLLI 2016 Gerhard Jger Maximum

Phylogenetic trees IV Maximum Likelihood Gerhard Jger ESSLLI 2016 Gerhard Jger Maximum Likelihood ESSLLI 2016 1 / 50 Theory Theory Gerhard Jger Maximum Likelihood ESSLLI 2016 2 / 50 Theory Recap: Continuous time Markov model

838 views • 55 slides

Phylogenetic trees IV Maximum Likelihood Gerhard Jger Words, Bones, Genes, Tools February 28,

Phylogenetic trees IV Maximum Likelihood Gerhard Jger Words, Bones, Genes, Tools February 28, 2018 Gerhard Jger Maximum Likelihood WBGT 1 / 20 Theory Theory Gerhard Jger Maximum Likelihood WBGT 2 / 20 Theory Recap: Continuous

512 views • 21 slides

Trees Trees CSE, IIT KGP Trees and Spanning Trees Trees and Spanning Trees A graph having

Trees Trees CSE, IIT KGP Trees and Spanning Trees Trees and Spanning Trees A graph having no cycles is A graph having no cycles is acyclic acyclic. . A A forest forest is an is an acyclic acyclic graph. graph. A A

226 views • 9 slides

Maximum likelihood models Tues. Feb. 27, 2018 1 Overview of today Informal notion of

COMP 546 Lecture 14 Maximum likelihood models Tues. Feb. 27, 2018 1 Overview of today Informal notion of likelihood Formal definition of likelihood as conditional probability Maximum likelihood problems (sketch) 2 Scene

961 views • 40 slides

Curve Fitting Re-visited, Bishop1.2.5 Maximum Likelihood Bishop 1.2.5 Model Likelihood

Curve Fitting Re-visited, Bishop1.2.5 Maximum Likelihood Bishop 1.2.5 Model Likelihood differentiation Maximum Likelihood N N t n | y ( x n , w ) , 1 p ( t | x , w , ) = . (1.61) n =1 As we did in the

598 views • 49 slides

Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one

Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one main meta-algorithm) this semester: (Regularized) ERM principle: pick the model that minimizes an average loss over training data. 1 / 76

646 views • 45 slides

Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Maximum likelihood

Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Maximum likelihood estimation We now estimate the values of the unknown parameters 1 ,. . . , K from a sample of observations drawn at random from the population. Each

764 views • 8 slides

Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one

Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one main meta-algorithm this semester: (Regularized) ERM principle: pick the model that minimizes an average loss over training data. 1 / 70

602 views • 42 slides

Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation

Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation We explain here the various outputs from the maximum likelihood estimation procedure. Solution of the maximum likelihood estimation The main

172 views • 5 slides

15-388/688 - Practical Data Science: Maximum likelihood estimation, nave Bayes J. Zico Kolter

15-388/688 - Practical Data Science: Maximum likelihood estimation, nave Bayes J. Zico Kolter Carnegie Mellon University Spring 2018 1 Outline Maximum likelihood estimation Naive Bayes Machine learning and maximum likelihood 2 Outline

352 views • 22 slides

Maximum likelihood parameter estimation Maximum likelihood parameter estimation For an HMM

Maximum likelihood parameter estimation Maximum likelihood parameter estimation For an HMM with observed state data X , and s states, For some observed data O = o 1 o n , and a model, we do the same: here a bigram model,

272 views • 3 slides

( ( ) ) ( ) ( ) = = Work = h log t n B- B -Trees Trees B B- -Trees

B- B -Trees Trees B B- -Trees Trees Search for key R ( ( ) ) ( ) ( ) = = Work = h log t n B- B -Trees Trees B B- -Trees Trees Each Disk-Read or Disk-Write = one Basic unit of work O(1) Typical Node x

287 views • 4 slides

Trees Chapter 11 Chapter Summary Introduction to Trees Applications of Trees Tree

Trees Chapter 11 Chapter Summary Introduction to Trees Applications of Trees Tree Traversal Spanning Trees Minimum Spanning Trees Introduction to Trees Section 11.1 Section Summary Introduction to Trees Rooted Trees

639 views • 42 slides

MAXIMUM CARDS MAXIMUM CARDS What is a Maximum Card ? The Maximum Card is the one which contains a

MAXIMUM CARDS MAXIMUM CARDS What is a Maximum Card ? The Maximum Card is the one which contains a Picture Postcard, a Postage Stamp affixed on the picture side of the card and a Cancellation on it and should confirm maximum possible concordance

386 views • 23 slides

Toward a Better Understanding of the Hispanic Paradox* Robert A. Hummer University of Texas at

Toward a Better Understanding of the Hispanic Paradox* Robert A. Hummer University of Texas at Austin * Presentation given to the Department of Sociology, University of Nebraska, March 1, 2013 Aims of Current Talk Produce independent (from

578 views • 39 slides

Large-scale EXecution for Industry & Society www.lexis-project.eu HPC, BIG DATA, IOT AND AI

Co-funded by the Horizon 2020 Framework Programme of the European Union Grant Agreement Number 825532 Large-scale EXecution for Industry & Society www.lexis-project.eu HPC, BIG DATA, IOT AND AI FUTURE INDUSTRY-DRIVEN COLLABORATIVE

599 views • 7 slides

What we have learnt and what What we have learnt and what we have to consider we have to

Short Course for Secondary School Teachers on Teaching the Key Learning Areas of Technology Education Arts Education Physical Education in the English Medium Teachers Voices: Teaching Technology, Arts & PE through English What we have

355 views • 12 slides

Coast Lean Log Handling Project 11/10/2017 TOPFN Division 1 WHAT IS LEAN? LEAN is a

Coast Lean Log Handling Project 11/10/2017 TOPFN Division 1 WHAT IS LEAN? LEAN is a philosophy that focuses on fostering and enabling a continual improvement culture by encouraging efficiency 11/10/2017 through the elimination of

853 views • 37 slides

Juvenile Justice Policy and Data Board Community-Based Interventions Subcommittee *Virtual

Juvenile Justice Policy and Data Board Community-Based Interventions Subcommittee *Virtual Meeting* March 25, 2020 1pm 3pm Agenda Welcome and Introductions Virtual Meeting Guidelines Review/Approval of February meeting minutes

474 views • 23 slides

From Research to Practice: New Models for Data-sharing and Collaboration to Improve Health and

From Research to Practice: New Models for Data-sharing and Collaboration to Improve Health and Healthcare Joe Selby, MD, MPH, Executive Director, PCORI Francis Collins, MD, PhD, Director, National Institutes of Health Philip Bourne, PhD,

715 views • 56 slides

Measuring the extent to which residents are able to participate in the urban planning and

Indicator 11.3.2 : Proportion of cities with a direct participation structure of civil society in urban planning and management that operates regularly and democratically Robert Ndugwa UN-Habitat Measuring the extent to which residents

343 views • 10 slides

Active Citizen E-Participation in Local Governance: Do Individual Social Capital and

1 Active Citizen E-Participation in Local Governance: Do Individual Social Capital and E-Participation Management Matter? Jooho Lee, Ph.D. Assistant Professor University of Nebraska at Omaha & Soonhee Kim, Ph. D. Professor Syracuse

466 views • 22 slides

FastTree 2 Approximately Maximum-Likelihood Trees for Large - PowerPoint PPT Presentation

FastTree 2 Approximately Maximum-Likelihood Trees for Large Alignments Morgan N. Price, Paramvir S. Dehal, Adam P. Arkin Presented by Arjun P. Athreya April 21, 2015 CS 598AGB Fast Tree 2 Five stages of computation Heuristic

RAxML vs. FastTree: A Comparison of Two Maximum Likelihood Phylogeny Estimation Methods Mia

Maximum Likelihood properties Maximum parsimony Maximum likelihood Experimental design

Phylogenetic trees IV Maximum Likelihood Gerhard Jger ESSLLI 2016 Gerhard Jger Maximum

Phylogenetic trees IV Maximum Likelihood Gerhard Jger Words, Bones, Genes, Tools February 28,

Trees Trees CSE, IIT KGP Trees and Spanning Trees Trees and Spanning Trees A graph having

Maximum likelihood models Tues. Feb. 27, 2018 1 Overview of today Informal notion of

Curve Fitting Re-visited, Bishop1.2.5 Maximum Likelihood Bishop 1.2.5 Model Likelihood

Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one

Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Maximum likelihood

Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one

Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation

15-388/688 - Practical Data Science: Maximum likelihood estimation, nave Bayes J. Zico Kolter

Maximum likelihood parameter estimation Maximum likelihood parameter estimation For an HMM

( ( ) ) ( ) ( ) = = Work = h log t n B- B -Trees Trees B B- -Trees

Trees Chapter 11 Chapter Summary Introduction to Trees Applications of Trees Tree

MAXIMUM CARDS MAXIMUM CARDS What is a Maximum Card ? The Maximum Card is the one which contains a

Toward a Better Understanding of the Hispanic Paradox* Robert A. Hummer University of Texas at

Large-scale EXecution for Industry & Society www.lexis-project.eu HPC, BIG DATA, IOT AND AI

What we have learnt and what What we have learnt and what we have to consider we have to

Coast Lean Log Handling Project 11/10/2017 TOPFN Division 1 WHAT IS LEAN? LEAN is a

Juvenile Justice Policy and Data Board Community-Based Interventions Subcommittee *Virtual

From Research to Practice: New Models for Data-sharing and Collaboration to Improve Health and

Measuring the extent to which residents are able to participate in the urban planning and

Active Citizen E-Participation in Local Governance: Do Individual Social Capital and

Sambuz

Useful Links

Newsletter

Mail Us

FastTree 2 Approximately Maximum-Likelihood Trees for Large - PowerPoint PPT Presentation

FastTree 2 Approximately Maximum-Likelihood Trees for Large Alignments Morgan N. Price, Paramvir S. Dehal, Adam P. Arkin Presented by Arjun P. Athreya April 21, 2015 CS 598AGB Fast Tree 2 Five stages of computation Heuristic

RAxML vs. FastTree: A Comparison of Two Maximum Likelihood Phylogeny Estimation Methods Mia

Maximum Likelihood properties Maximum parsimony Maximum likelihood Experimental design

Phylogenetic trees IV Maximum Likelihood Gerhard Jger ESSLLI 2016 Gerhard Jger Maximum

Phylogenetic trees IV Maximum Likelihood Gerhard Jger Words, Bones, Genes, Tools February 28,

Trees Trees CSE, IIT KGP Trees and Spanning Trees Trees and Spanning Trees A graph having

Maximum likelihood models Tues. Feb. 27, 2018 1 Overview of today Informal notion of

Curve Fitting Re-visited, Bishop1.2.5 Maximum Likelihood Bishop 1.2.5 Model Likelihood

Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one

Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Maximum likelihood

Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one

Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation

15-388/688 - Practical Data Science: Maximum likelihood estimation, nave Bayes J. Zico Kolter

Maximum likelihood parameter estimation Maximum likelihood parameter estimation For an HMM

( ( ) ) ( ) ( ) = = Work = h log t n B- B -Trees Trees B B- -Trees

Trees Chapter 11 Chapter Summary Introduction to Trees Applications of Trees Tree

MAXIMUM CARDS MAXIMUM CARDS What is a Maximum Card ? The Maximum Card is the one which contains a

Toward a Better Understanding of the Hispanic Paradox* Robert A. Hummer University of Texas at

Large-scale EXecution for Industry &amp; Society www.lexis-project.eu HPC, BIG DATA, IOT AND AI

What we have learnt and what What we have learnt and what we have to consider we have to

Coast Lean Log Handling Project 11/10/2017 TOPFN Division 1 WHAT IS LEAN? LEAN is a

Juvenile Justice Policy and Data Board Community-Based Interventions Subcommittee *Virtual

From Research to Practice: New Models for Data-sharing and Collaboration to Improve Health and

Measuring the extent to which residents are able to participate in the urban planning and

Active Citizen E-Participation in Local Governance: Do Individual Social Capital and

Sambuz

Useful Links

Newsletter

Mail Us

Large-scale EXecution for Industry & Society www.lexis-project.eu HPC, BIG DATA, IOT AND AI