A new phylo-HMM paradigm to search for sequences


Jean-Baka Domelevo Entfellner & Olivier Gascuel
LIRMM (CNRS - UM2), Montpellier
June 10th, 2008
What is at stake?

Goal
Search a databank for sequences homologous to a query protein family.

Existing approaches
1. Blast: poor results when identity rate is too low (≲ 30%)
2. Profile HMMs:
   • allow lower percentage of identity between query & target
   • but make no use of the phylogeny

Proposed solution
Design a model which takes advantage of:
1. the possible presence in the family of a sequence close to the target
2. the global information (e.g. hydrophilic/phobic columns) conveyed by the alignment
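Profile HMMs

[Figure: profile HMM with insertion states I1 and I2, match states M2 and M3, and deletion state D2; an example emission distribution reads A 0.2, C 0.05, D 0.08, E 0.01, ...]

Each match and insertion state generates a single amino acid.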
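As a rough illustration of what "generates a single amino acid" means, the sketch below draws residues from one emitting state. The four probabilities are the ones visible on the slide; spreading the remaining mass evenly over the other residues, and the helper names `complete_emissions` and `emit`, are assumptions made only for this example.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def complete_emissions(known):
    """Spread the mass not listed in `known` evenly over the other residues.

    This even spreading is an assumption for the example, not part of the slides.
    """
    rest = [a for a in AMINO_ACIDS if a not in known]
    share = (1.0 - sum(known.values())) / len(rest)
    return {a: known.get(a, share) for a in AMINO_ACIDS}

def emit(emissions, rng):
    """Draw the single amino acid generated by one visit to an emitting state."""
    residues = list(emissions)
    weights = [emissions[a] for a in residues]
    return rng.choices(residues, weights=weights, k=1)[0]

# Values shown on the slide for one state; the rest is filled in by assumption.
match_state = complete_emissions({"A": 0.2, "C": 0.05, "D": 0.08, "E": 0.01})

rng = random.Random(0)  # fixed seed so the draw is reproducible
sample = [emit(match_state, rng) for _ in range(10)]
```

Deletion states are absent from the sketch because they emit nothing.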
phylo-HMMs

Seminal works: Goldman et al. 1996, Siepel & Haussler 2003

[Figure: the same HMM architecture (states I1, I2, M2, M3, D2), with "?" in place of the emission distributions.]

• each node is populated by a phylogeny which defines a probability distribution over a column of the alignment
• typical use: prediction of the conservation or secondary structure of the sites
How we use phylo-HMMs

Knowing the phylogeny, we fill in each match state with the posterior probability distribution of amino acids for the target, given the corresponding column of the alignment.
→ Felsenstein's pruning algorithm

Arabidopsis thaliana   ADRDSKR
Anopheles gambiae      PERESKR
Ciona savignyi         PSPVASR
Homo sapiens           ???????
[Figure: columns of the alignment are mapped onto the match states of the profile HMM (states I1, I2 and D2 also shown). The columns displayed are R/R/P/?, S/S/A/? and D/E/V/?, where "?" is the unknown Homo sapiens residue.]
[Figure: the same match states, now filled with posterior amino-acid distributions for the Homo sapiens target. Values shown include A 0.6, N 0.02, D 0.2, C 0.01, P 0.6, E 0.2, Q 0.02, S 0.3, V 0.5, R 0.2, ...]
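The posterior computation behind these slides can be sketched with Felsenstein's pruning algorithm. The code below is only a minimal illustration under stated assumptions: the tree topology and branch lengths are hypothetical, the substitution model is a simple equal-rates (Poisson) model chosen because its transition probabilities have a closed form, and the function names are invented for this example. The slides do not say which model or implementation the authors used.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
N = len(AMINO_ACIDS)

def transition_prob(a, b, t):
    """P(b at the child | a at the parent) over branch length t, equal-rates model."""
    decay = math.exp(-N * t / (N - 1))
    return 1.0 / N + (1.0 - 1.0 / N) * decay if a == b else (1.0 - decay) / N

def partial_likelihoods(node, column):
    """Felsenstein pruning: entry x is P(observed leaves below node | node in state x)."""
    if isinstance(node, str):  # leaf: indicator vector of the observed residue
        return [1.0 if x == column[node] else 0.0 for x in AMINO_ACIDS]
    result = [1.0] * N
    for child, t in node:  # internal node: list of (child, branch_length) pairs
        below = partial_likelihoods(child, column)
        for i, x in enumerate(AMINO_ACIDS):
            result[i] *= sum(transition_prob(x, y, t) * below[j]
                             for j, y in enumerate(AMINO_ACIDS))
    return result

def target_posterior(tree, column, target):
    """Posterior distribution of the target leaf's residue given the other leaves."""
    joint = []
    for a in AMINO_ACIDS:
        # Fix the target to residue a and compute the joint likelihood of the column.
        root = partial_likelihoods(tree, {**column, target: a})
        joint.append(sum(root) / N)  # uniform root frequencies under this model
    total = sum(joint)
    return {a: p / total for a, p in zip(AMINO_ACIDS, joint)}

# Hypothetical topology and branch lengths (not taken from the slides).
TREE = [("Arabidopsis thaliana", 1.0),
        ([("Anopheles gambiae", 0.6),
          ([("Ciona savignyi", 0.5), ("Homo sapiens", 0.4)], 0.2)], 0.3)]

ALIGNMENT = {"Arabidopsis thaliana": "ADRDSKR",
             "Anopheles gambiae": "PERESKR",
             "Ciona savignyi": "PSPVASR"}

# One posterior distribution per alignment column, i.e. per match state.
posteriors = [target_posterior(TREE,
                               {sp: seq[i] for sp, seq in ALIGNMENT.items()},
                               "Homo sapiens")
              for i in range(7)]
best = "".join(max(p, key=p.get) for p in posteriors)
```

Fixing the target to each candidate residue in turn and renormalising is the simplest correct way to obtain the posterior. A production implementation would more likely re-root the tree at the target and run a single pruning pass.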
Experimenting

• test data: 690 protein families from the Treefam database (Vertebrates + Insects + 1 Tunicate, 4 worms, 2 yeasts and 2 plants).
• phylogeny is assumed (calculated with PhyML, matches NCBI consensus).

Experimental setup:
1. take those 690 complete families from Treefam
2. gradually prune to remove all Vertebrates, Insects, ...
3. realign the remaining sequences
4. build the profile HMM with hmmbuild
5. phylogenise it to scan for human proteins
6. scan the human proteome with resulting phylo-HMM to find the original protein
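Step 2 of the setup, removing taxa from a family tree, might look like the following sketch. The nested-list tree encoding, the `prune` function and the toy tree are all assumptions made for illustration; the slides do not describe how pruning was actually implemented.

```python
def prune(node, removed):
    """Return a copy of the tree without the leaves in `removed` (None if empty).

    A leaf is a name (str); an internal node is a list of (child, branch_length)
    pairs. Internal nodes left with a single child are collapsed and the two
    branch lengths are summed.
    """
    if isinstance(node, str):
        return None if node in removed else node
    kept = []
    for child, length in node:
        sub = prune(child, removed)
        if sub is None:
            continue  # the whole subtree was removed
        if not isinstance(sub, str) and len(sub) == 1:
            grandchild, extra = sub[0]
            sub, length = grandchild, length + extra
        kept.append((sub, length))
    return kept or None

# Toy tree with hypothetical branch lengths.
tree = [("plant", 1.0),
        ([("insect", 0.6),
          ([("tunicate", 0.5), ("human", 0.4)], 0.2)], 0.3)]

without_human = prune(tree, {"human"})
without_animals = prune(tree, {"human", "tunicate", "insect"})
```

In this encoding the root itself can end up with a single child, as in `without_animals`; a caller would need to decide how to treat that case before realigning and rebuilding the profile.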
Pruned trees (1/3)

Method                  # of true positives   Sensitivity
standard profile HMM    1345                  0.88
Blast                   1434                  0.94
phylo-HMM               1435                  0.94

# expected detections: 1526
Pruned trees (2/3)

Method                  # of true positives   Sensitivity
standard profile HMM    1280                  0.86
Blast                   1293                  0.87
phylo-HMM               1348                  0.91

# expected detections: 1489
Pruned trees (3/3)

Method                  # of true positives   Sensitivity
Blast                   25                    0.38
standard profile HMM    38                    0.58
phylo-HMM               52                    0.80

# expected detections: 65
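The sensitivities in the three tables are consistent with the ratio of true positives to expected detections. The short check below recomputes them from the counts on the slides; the dictionary layout and variable names are just a convenient choice for this example.

```python
# (expected detections, {method: true positives}) for each results slide.
RESULTS = {
    "Pruned trees (1/3)": (1526, {"standard profile HMM": 1345,
                                  "Blast": 1434,
                                  "phylo-HMM": 1435}),
    "Pruned trees (2/3)": (1489, {"standard profile HMM": 1280,
                                  "Blast": 1293,
                                  "phylo-HMM": 1348}),
    "Pruned trees (3/3)": (65, {"Blast": 25,
                                "standard profile HMM": 38,
                                "phylo-HMM": 52}),
}

def sensitivity(true_positives, expected):
    """Fraction of the expected detections that were actually found."""
    return true_positives / expected

table = {title: {method: round(sensitivity(tp, expected), 2)
                 for method, tp in counts.items()}
         for title, (expected, counts) in RESULTS.items()}

for title, row in table.items():
    print(title, row)
```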
Conclusion

Our model uses phylogenetic information to contextualize a profile HMM.
• first results look promising
• a good combination of the Blast and profile HMM paradigms, robust to remote phylogenetic relations
