Towards Characterization of Identifiability of Profile HMMs - PowerPoint PPT Presentation

Apr 21, 2023 •119 likes •240 views

Towards Characterization of Identifiability of Profile HMMs Srilakshmi Pattabiraman University of Illinois, Urbana-Champaign April 26, 2018 Joint work with Prof. Tandy Warnow. 1/11 Introduction Statistically consistent estimator 0

Towards Characterization of Identifiability of Profile HMMs Srilakshmi Pattabiraman University of Illinois, Urbana-Champaign April 26, 2018 Joint work with Prof. Tandy Warnow. 1/11
Introduction ◮ Statistically consistent estimator ˆ θ 0 (asymptotic estimator) of a parameter θ 0 is one that identifies the correct parameter θ 0 when the data available is arbitrarily large. ◮ A necessary condition for any estimator’s asymptotic consistency is that the evolutionary model has to be identifiable. ◮ Identifiability - given the set of sequence profiles that are generated on a model tree, and the probabilities of their occurrences, can the underlying evolutionary model be identified correctly? ◮ Trivially, if there are two models that generate the same sequence profiles with matched probabilities, the models are not identifiable! 2/11
Central Question Are all profile HMMs identifiable? Figure 1: The standard profile HMM. ◮ φ : 1 path, A : 2 n + 1 paths, AA : n ( n − 1) + (2 n + 1)( n + 1) 3/11 2
Profile HMMs without deletion nodes Figure 2: Profile HMM with no deletion nodes. Theorem The model is identifiable iff no match state has the same distribution as the insertion states. 4/11
Proof Theorem The model is identifiable iff no match state has the same distribution as the insertion states. ◮ the sequence with the minimum length defines the topology p ?[ i − 1] A ?[ n − i ] ◮ z i A = � X ∈ A , T , G , C p ?[ i − 1] X ?[ n − i ] ◮ p A ∗ = x 1 z 1 A + (1 − x 1 ) 1 4 Figure 3: Finding x 1 . 5/11
Proof x 2 z 2 A + (1 − x 2 ) 1 y 1 z 1 A + (1 − y 1 ) 1 ◮ p ? A ∗ = x 1 � � � � + (1 − x 1 ) 4 4 x 2 z 2 T + (1 − x 2 ) 1 y 1 z 1 T + (1 − y 1 ) 1 ◮ p ? T ∗ = x 1 � � � � + (1 − x 1 ) 4 4 Figure 4: Finding x 2 , y 1 . ◮ p ? [ m − 1] A ∗ = x m − 1 , y m − 2 A + (1 − x m ) 1 � � + p ( m : m − 1) � x m z m � + f m , 1 1 1 4 p ( i : m − 2) � y m − 1 z m − 1 + (1 − y m − 1 ) 1 � A 4 ◮ p ? [ m − 1] T ∗ = x m − 1 , y m − 2 + p ( m : m − 1) � x m z m T + (1 − x m ) 1 � � � f m , 2 + 1 1 4 y m − 1 z m − 1 + (1 − y m − 1 ) 1 p ( i : m − 2) � � 4 T 6/11
Proof Figure 5: Two models that produce the same sequence profiles. ◮ p A = x 1 1 4 x 2 ◮ p AA = (1 − x 1 ) 1 4 y 1 1 4 x 2 + x 1 1 4 (1 − x 2 ) 1 4 y 2 ◮ p A [ n ] = n − 2 +(1 − x 1 ) 1 n − 2 y 1 1 x 1 1 4 (1 − x 2 ) 1 4 (1 − y 2 ) n − 2 1 4 (1 − y 1 ) n − 2 1 4 x 2 + 4 4 n 1 y 1 1 n 2 y 2 n 1 + n 2 = n − 3 (1 − x 1 ) 1 4 (1 − y 1 ) n 1 1 4 (1 − x 2 ) 1 4 (1 − y 2 ) n 2 1 � 4 4 7/11
Proof Figure 6: Two models that produce the same sequence profiles.. 8/11
What about the standard profile HMMs? ◮ Unfortunately, these methods don’t extend. ◮ Finding the number of match states itself is non-trivial. ◮ Standard ML tricks may not work! ◮ Maybe they are unidentifiable? 9/11
Bad news! Figure 7: Standard profile HMM with one match state. ◮ If we knew that the profile HMM had only one match state, then the model can be completely characterized. 10/11
Thank you! 11/11

Recommend

Set-membership identifiability and estimation of parameters for uncertain nonlinear systems

Set-membership identifiability Methods to analyse set-membership identifiability and numerical applications Conclusion Set-membership identifiability and estimation of parameters for uncertain nonlinear systems Nathalie Verdire 1 , Carine

765 views • 48 slides

Identifiability, Integro-Differential Equations and Neurobiology F. Boulier, F. Lemaire, A.

Identifiability Integro-Differential Equations Neurobiology Identifiability, Integro-Differential Equations and Neurobiology F. Boulier, F. Lemaire, A. Poteaux, A. Quadrat, N. Verdi` ere, N. Corson, V. Lanza, H. Castel, P. Gandolfo, V. Comp`

488 views • 15 slides

Applications of computer algebra in the identifiability and diagnosability studies Nathalie

Applications of computer algebra in the identifiability and diagnosability studies Nathalie Verdire 1 1 Normandie Univ, UNIHAVRE, LMAH, FR-CNRS-3335, ISCN, 76600 Le Havre, France CUNY, 2019 My topics/works: Identifiability study and parameter

488 views • 45 slides

A Simple and Efficient Solution of the Identifiability Problem for Hidden Markov Models and

Guideline Introduction String Functions Solution of the Identifiability Problem A Simple and Efficient Solution of the Identifiability Problem for Hidden Markov Models and Quantum Random Walks Alexander Schnhuth Pacific Institute for the

648 views • 37 slides

Structural identifiability: An Introduction Mike Chappell & Neil Evans

Motivation Structural identifiability Techniques for nonlinear models Structural identifiability: An Introduction Mike Chappell & Neil Evans m.j.chappell@warwick.ac.uk AMR Summer School, University of Warwick, July 2016 MJ Chappell

1.68k views • 130 slides

Characterization of the Household Electricity Characterization of the Household Electricity

Characterization of the Household Electricity Characterization of the Household Electricity Characterization of the Household Electricity Characterization of the Household Electricity Consumption in the EU, Potential Energy Savings Consumption

770 views • 35 slides

SITE CHARACTERIZATION Part 1. Non-Intrusive Site Characterization Technologies Tyler E. Gass,

SITE CHARACTERIZATION Part 1. Non-Intrusive Site Characterization Technologies Tyler E. Gass, CPG Tetra Tech, Inc. Louisville, CO Site Characterization Non-intrusive Technologies Factors to Consider When Designing a Site Characterization

1.17k views • 70 slides

Geomaterial Characterization Sub-topics Chemical characterization pH, TDS, EC, BOD, COD

Geomaterial Characterization Sub-topics Chemical characterization pH, TDS, EC, BOD, COD Sulphite and Chloride contents Cation-Exchange Capacity Pore-solution sampling Corrosion potential Sorption-Desorption Thermal Characterization

738 views • 8 slides

Sub-topics Chemical characterization Sorption-Desorption (Contaminant Transport in Porous

Geomaterial Characterization Sub-topics Chemical characterization Sorption-Desorption (Contaminant Transport in Porous Media) Thermal Characterization Electrical Characterization Dispersion (thinning out/scattering/spreading) The

463 views • 10 slides

Towards a Characterization of the Double Category of Spans Evangelia Aleiferi Dalhousie

Motivation Cartesian double categories Eilenberg-Moore Objects Towards the characterization of spans Further Questions Towards a Characterization of the Double Category of Spans Evangelia Aleiferi Dalhousie University Category Theory 2017

719 views • 51 slides

Identifiability and Unmixing of Latent Parse Trees Daniel Hsu, Sham Kakade, Percy Liang NIPS

Identifiability and Unmixing of Latent Parse Trees Daniel Hsu, Sham Kakade, Percy Liang NIPS 2012 Jan Gasthaus Tea talk January 8th, 2013 1 / 15 Parsing 2 / 15 Big Picture Generative parsing models define joint distributions P ( x , z )

368 views • 20 slides

Improving the parameter identifiability of a watershed scale onsite wastewater infiltration model

Improving the parameter identifiability of a watershed scale onsite wastewater infiltration model Bjrn Helm TU Dresden, Chair of Urban Watermanagement Athens, 16.09.2016 Motivation Infiltration based wastewater disposal globally most

405 views • 14 slides

Identifiability of Blind Deconvolution with Subspace or Sparsity Constraints Yanjun Li Joint

Identifiability of Blind Deconvolution with Subspace or Sparsity Constraints Yanjun Li Joint work with Kiryung Lee and Yoram Bresler Coordinated Science Laboratory Department of Electrical and Computer Engineering University of Illinois,

568 views • 30 slides

Identifiability in dynamic network identification Harm Weerts 1 Arne Dankers 2 Paul Van den Hof 1 1

Identifiability in dynamic network identification Harm Weerts 1 Arne Dankers 2 Paul Van den Hof 1 1 Control Systems, Department of Electrical Engineering, Eindhoven University of Technology, The Netherlands. 2 Department of Electrical Engineering,

664 views • 39 slides

Identifiability of Gaussian DAG models with one latent source Hisayuki Hara Niigata University

Identifiability of Gaussian DAG models with one latent source Hisayuki Hara Niigata University http://www.econ.niigata-u.ac.jp/hara/ hara@econ.niigata-u.ac.jp Joint work with Dennis Leung and Mathias Drton H. Hara (Niigata U.)

527 views • 26 slides

Identifiability and Transportability in Dynamic Causal Networks Gilles Blondel, Marta Arias,

Identifiability and Transportability in Dynamic Causal Networks Gilles Blondel, Marta Arias, Ricard Gavald Universitat Politcnica de Catalunya, Barcelona KDD 2016 - Workshop on Causal Discovery San Francisco - August 2016 Contact:

152 views • 13 slides

Random Matrices in Wireless Communications M erouane Debbah Eurecom Institute debbah@eurecom.fr

Random Matrices in Wireless Communications M erouane Debbah Eurecom Institute debbah@eurecom.fr MIMO System Model 2 MIMO Representation T x R x y ( t ) = H n rx n tx ( ) x ( t ) d + n ( t ) n t x and y ( f )

445 views • 13 slides

Advanced Section #2 Model Selection & Information Criteria Akaike Information Criterion

Advanced Section #2 Model Selection & Information Criteria Akaike Information Criterion Marios Mattheakis and Pavlos Protopapas CS109A Introduction to Data Science Pavlos Protopapas and Kevin Rader 1 Outline Maximum Likelihood

673 views • 26 slides

Understanding and communicating widespread flood risk Ross Towe 1 , 2 Jonathan Tawn 1 Rob Lamb 1 ,

Understanding and communicating widespread flood risk Ross Towe 1 , 2 Jonathan Tawn 1 Rob Lamb 1 , 3 Chris Sherlock 1 Ye Liu 4 1 Dept. Mathematics and Statistics, Lancaster University, Lancaster, UK 2 JBA Trust, Broughton Hall, Skipton, UK 3

713 views • 47 slides

PERSISTENCE IN TURKISH REAL EXCHANGE RATES: PANEL APPROACHES Haluk Erlat Department of Economics

PERSISTENCE IN TURKISH REAL EXCHANGE RATES: PANEL APPROACHES Haluk Erlat Department of Economics Middle East Technical University 06531 Ankara, Turkey email: herlat@metu.edu.tr Real Exchange Rate: = + * (1) q e p p it it t it

328 views • 20 slides

The ergodic high SNR capacity of the Introduction spatially-correlated non-coherent MIMO System

ITW Jeju, South Korea, 2015 1 / 18 Ramy Gohary and Halim Yanikomeroglu The ergodic high SNR capacity of the Introduction spatially-correlated non-coherent MIMO System Model channel within an SNR-independent gap The right singular

292 views • 18 slides

Doru Caraeni CD-adapco, USA CFD Futures Conference, August 6-8, 2012 Why I did Residual-based

Doru Caraeni CD-adapco, USA CFD Futures Conference, August 6-8, 2012 Why I did Residual-based schemes research ? - (1996) Leading the CFD/CAE group (Centrifugal Compressors) at COMOTI Bucharest - Challenge: to perform LES of turbulence inside

668 views • 41 slides

The pion-photon transition form factor in QCD: Facts and fancy P. Kroll Fachbereich Physik,

The pion-photon transition form factor in QCD: Facts and fancy P. Kroll Fachbereich Physik, Univ. Wuppertal and Univ. Regensburg Dubna, September 2010 Outline: The trans. form factor in coll. factorization The new BaBar data

737 views • 18 slides

Large Deviations for a Randomly Indexed Branching Process with Applications in Finance Sheng-Jhih

Large Deviations for a Randomly Indexed Branching Process with Applications in Finance Sheng-Jhih Wu NCSU April 5, 2012 Sheng-Jhih Wu (NCSU) Large Deviations for a RIBP April 5, 2012 1 / 29 Outline Introduction Branching Process Large

578 views • 29 slides

Towards Characterization of Identifiability of Profile HMMs - PowerPoint PPT Presentation

Towards Characterization of Identifiability of Profile HMMs Srilakshmi Pattabiraman University of Illinois, Urbana-Champaign April 26, 2018 Joint work with Prof. Tandy Warnow. 1/11 Introduction Statistically consistent estimator 0

Set-membership identifiability and estimation of parameters for uncertain nonlinear systems

Identifiability, Integro-Differential Equations and Neurobiology F. Boulier, F. Lemaire, A.

Applications of computer algebra in the identifiability and diagnosability studies Nathalie

A Simple and Efficient Solution of the Identifiability Problem for Hidden Markov Models and

Structural identifiability: An Introduction Mike Chappell & Neil Evans

Characterization of the Household Electricity Characterization of the Household Electricity

SITE CHARACTERIZATION Part 1. Non-Intrusive Site Characterization Technologies Tyler E. Gass,

Geomaterial Characterization Sub-topics Chemical characterization pH, TDS, EC, BOD, COD

Sub-topics Chemical characterization Sorption-Desorption (Contaminant Transport in Porous

Towards a Characterization of the Double Category of Spans Evangelia Aleiferi Dalhousie

Identifiability and Unmixing of Latent Parse Trees Daniel Hsu, Sham Kakade, Percy Liang NIPS

Improving the parameter identifiability of a watershed scale onsite wastewater infiltration model

Identifiability of Blind Deconvolution with Subspace or Sparsity Constraints Yanjun Li Joint

Identifiability in dynamic network identification Harm Weerts 1 Arne Dankers 2 Paul Van den Hof 1 1

Identifiability of Gaussian DAG models with one latent source Hisayuki Hara Niigata University

Identifiability and Transportability in Dynamic Causal Networks Gilles Blondel, Marta Arias,

Random Matrices in Wireless Communications M erouane Debbah Eurecom Institute debbah@eurecom.fr

Advanced Section #2 Model Selection & Information Criteria Akaike Information Criterion

Understanding and communicating widespread flood risk Ross Towe 1 , 2 Jonathan Tawn 1 Rob Lamb 1 ,

PERSISTENCE IN TURKISH REAL EXCHANGE RATES: PANEL APPROACHES Haluk Erlat Department of Economics

The ergodic high SNR capacity of the Introduction spatially-correlated non-coherent MIMO System

Doru Caraeni CD-adapco, USA CFD Futures Conference, August 6-8, 2012 Why I did Residual-based

The pion-photon transition form factor in QCD: Facts and fancy P. Kroll Fachbereich Physik,

Large Deviations for a Randomly Indexed Branching Process with Applications in Finance Sheng-Jhih

Sambuz

Useful Links

Newsletter

Mail Us

Towards Characterization of Identifiability of Profile HMMs - PowerPoint PPT Presentation

Towards Characterization of Identifiability of Profile HMMs Srilakshmi Pattabiraman University of Illinois, Urbana-Champaign April 26, 2018 Joint work with Prof. Tandy Warnow. 1/11 Introduction Statistically consistent estimator 0

Set-membership identifiability and estimation of parameters for uncertain nonlinear systems

Identifiability, Integro-Differential Equations and Neurobiology F. Boulier, F. Lemaire, A.

Applications of computer algebra in the identifiability and diagnosability studies Nathalie

A Simple and Efficient Solution of the Identifiability Problem for Hidden Markov Models and

Structural identifiability: An Introduction Mike Chappell &amp; Neil Evans

Characterization of the Household Electricity Characterization of the Household Electricity

SITE CHARACTERIZATION Part 1. Non-Intrusive Site Characterization Technologies Tyler E. Gass,

Geomaterial Characterization Sub-topics Chemical characterization pH, TDS, EC, BOD, COD

Sub-topics Chemical characterization Sorption-Desorption (Contaminant Transport in Porous

Towards a Characterization of the Double Category of Spans Evangelia Aleiferi Dalhousie

Identifiability and Unmixing of Latent Parse Trees Daniel Hsu, Sham Kakade, Percy Liang NIPS

Improving the parameter identifiability of a watershed scale onsite wastewater infiltration model

Identifiability of Blind Deconvolution with Subspace or Sparsity Constraints Yanjun Li Joint

Identifiability in dynamic network identification Harm Weerts 1 Arne Dankers 2 Paul Van den Hof 1 1

Identifiability of Gaussian DAG models with one latent source Hisayuki Hara Niigata University

Identifiability and Transportability in Dynamic Causal Networks Gilles Blondel, Marta Arias,

Random Matrices in Wireless Communications M erouane Debbah Eurecom Institute debbah@eurecom.fr

Advanced Section #2 Model Selection &amp; Information Criteria Akaike Information Criterion

Understanding and communicating widespread flood risk Ross Towe 1 , 2 Jonathan Tawn 1 Rob Lamb 1 ,

PERSISTENCE IN TURKISH REAL EXCHANGE RATES: PANEL APPROACHES Haluk Erlat Department of Economics

The ergodic high SNR capacity of the Introduction spatially-correlated non-coherent MIMO System

Doru Caraeni CD-adapco, USA CFD Futures Conference, August 6-8, 2012 Why I did Residual-based

The pion-photon transition form factor in QCD: Facts and fancy P. Kroll Fachbereich Physik,

Large Deviations for a Randomly Indexed Branching Process with Applications in Finance Sheng-Jhih

Sambuz

Useful Links

Newsletter

Mail Us

Structural identifiability: An Introduction Mike Chappell & Neil Evans

Advanced Section #2 Model Selection & Information Criteria Akaike Information Criterion