

SLIDE 1

Detecting Outliers in HMM modeling through Relative Entropy with Applications to Change-Point Detection

  • V. Perduca, G. Nuel

JOBIM 2012

  • V. Perduca (MAP 5 - Paris Descartes)

Detecting Outliers in HMM modeling JOBIM 2012 1 / 17

SLIDE 2

Change-point detection, HMMs and outliers

◮ Given a heterogeneous sequence: find the segments in which the signal is homogeneous

◮ Here: Hidden Markov modeling

◮ Segmentation models are sensitive to the presence of outliers

[Figure: two simulated signals (Index vs X, n = 1000): left panel “No outliers”, right panel “With outliers”.]


SLIDE 3

Hidden Markov Models

◮ Xi observed variable, Si hidden variable, for i = 1, . . . , n

[Graphical model: chain S1 → S2 → ⋯ → S5 with emissions Si → Xi.]

Factorization of the joint probability distribution

P(S1:n = s1:n, X1:n = x1:n) = P(S1 = s1) ∏_{i=2}^{n} P(Si = si|Si−1 = si−1) ∏_{i=1}^{n} P(Xi = xi|Si = si)



SLIDE 5

Hidden Markov Models

◮ Xi observed variable, Si hidden variable, for i = 1, . . . , n

[Graphical model: chain S1 → S2 → ⋯ → S5 with emissions Si → Xi.]

Factorization of the joint probability distribution

P(S1:n, X1:n) = P(S1) ∏_{i=2}^{n} P(Si|Si−1) ∏_{i=1}^{n} P(Xi|Si)

Example (Application to change-point detection)

◮ Level-based: Si = underlying level of observation Xi

◮ Segment-based: Si = segment of Xi, with S1 = 1 and Sn = # segments
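As a concrete illustration, the factorization above can be sampled directly. A minimal sketch, assuming a 2-state homoscedastic Gaussian HMM; all names and parameter values are illustrative, not taken from the talk:

```python
# Sampling (S_1:n, X_1:n) from P(S_1) * prod P(S_i|S_{i-1}) * prod P(X_i|S_i).
import numpy as np

rng = np.random.default_rng(0)
pi0 = np.array([0.5, 0.5])                 # P(S_1)
A = np.array([[0.99, 0.01],
              [0.01, 0.99]])               # P(S_i | S_{i-1})
mu, sigma = np.array([0.0, 50.0]), 10.0    # X_i | S_i = s ~ N(mu_s, sigma^2)

def sample_hmm(n):
    """Draw one hidden path and one observed signal, one factor at a time."""
    s = np.empty(n, dtype=int)
    s[0] = rng.choice(2, p=pi0)
    for i in range(1, n):
        s[i] = rng.choice(2, p=A[s[i - 1]])    # sample from P(S_i | S_{i-1})
    x = rng.normal(mu[s], sigma)               # sample from P(X_i | S_i)
    return s, x

s, x = sample_hmm(1000)
```

With the sticky transition matrix above, the sampled signal shows long homogeneous segments of the kind pictured on slide 2.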


SLIDE 6

Inference in HMMs

If E = {X1:n = x1:n} observed, compute P(E), P(Si|E), P(Si|Si−1, E)...

Backward and Forward recursions (Baum-Welch algorithm)

Standard inference problems solved by combining the Forward and Backward quantities

◮ Fi(Si) := P(Si, X1:i = x1:i)
◮ Bi(Si) := P(Xi+1:n = xi+1:n|Si),

which are computed recursively:

◮ Fi(Si) = Σ_{Si−1} Fi−1(Si−1) P(Si|Si−1) P(Xi = xi|Si)
◮ Bi−1(Si−1) = Σ_{Si} P(Si|Si−1) P(Xi = xi|Si) Bi(Si).

E.g. P(Si, E) = Fi(Si)Bi(Si)
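The recursions above translate almost line for line into code. A minimal, unscaled sketch (assumed 2-state Gaussian HMM; all names and parameter values are illustrative; without rescaling or log-space arithmetic this is only safe for short sequences):

```python
# Forward/Backward recursions and the posterior P(S_i | E).
import numpy as np

def gauss(x, m, s2):
    return np.exp(-((x - m) ** 2) / (2 * s2)) / np.sqrt(2 * np.pi * s2)

def forward_backward(x, pi0, A, mu, s2):
    n, K = len(x), len(pi0)
    e = gauss(x[:, None], mu[None, :], s2)   # e[i, s] = P(X_i = x_i | S_i = s)
    F = np.zeros((n, K))
    B = np.ones((n, K))
    F[0] = pi0 * e[0]
    for i in range(1, n):                    # F_i(s) = sum_r F_{i-1}(r) A[r, s] e[i, s]
        F[i] = (F[i - 1] @ A) * e[i]
    for i in range(n - 2, -1, -1):           # B_i(r) = sum_s A[r, s] e[i+1, s] B_{i+1}(s)
        B[i] = A @ (e[i + 1] * B[i + 1])
    post = F * B                             # F_i(s) B_i(s) = P(S_i = s, E)
    post /= post.sum(axis=1, keepdims=True)  # P(S_i | E); the normaliser is P(E)
    return F, B, post

rng = np.random.default_rng(1)
pi0 = np.array([0.5, 0.5])
A = np.array([[0.9, 0.1], [0.1, 0.9]])
mu = np.array([0.0, 5.0])
x = np.concatenate([rng.normal(0, 1, 20), rng.normal(5, 1, 20)])
F, B, post = forward_backward(x, pi0, A, mu, 1.0)
```

For long sequences the standard fix is to normalize Fi at each step and accumulate the log of the scaling constants.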

Parameter estimation

EM algorithm: Backward/Forward quantities provide explicit update formulas


SLIDE 7

Ad hoc model for outlier detection in HMMs

◮ Xi is an outlier if it is not generated by the underlying HMM

◮ ⇒ extend the HMM with variables for the outlier status [Shah 2006]

Topology and conditional dependencies (homoscedastic Gaussian case)

[Graphical model: chain S1 → ⋯ → S5 with emissions Xi; each outlier indicator Oi is also a parent of Xi.]

◮ Oi = 1 iff Xi is an outlier; P(Oi = 1) = ρ
◮ P(S1), P(Si|Si−1): same as for the underlying HMM
◮ P(Xi|Si, Oi = 0) = N(µSi, σ²): same as for the underlying HMM
◮ P(Xi|Si, Oi = 1) = N(µSi, σ²) + N(0, δ²), i.e. extra independent noise, hence variance σ² + δ²




SLIDE 10

Inference in the ad hoc model

Inferring outlier posterior probabilities

P(Oi = 1|E) = Σ_{Si} [ ρ P(Xi = xi|Si, Oi = 1) / ( ρ P(Xi = xi|Si, Oi = 1) + (1 − ρ) P(Xi = xi|Si, Oi = 0) ) ] · P(Si|E)

where

P(Si|E) = Fi(Si)Bi(Si) / Σ_{Si} Fi(Si)Bi(Si)

Fi(Si) = Σ_{Si−1} Fi−1(Si−1) P(Si|Si−1) Σ_{o=0,1} P(Xi = xi|Si, Oi = o) P(Oi = o)

Bi−1(Si−1) = Σ_{Si} P(Si|Si−1) Σ_{o=0,1} P(Xi = xi|Si, Oi = o) P(Oi = o) Bi(Si)
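Putting the pieces together, a sketch of the whole computation, assuming the homoscedastic Gaussian ad hoc model of slide 7 (so P(Xi|Si, Oi = 1) has variance σ² + δ²). Names and parameter values are illustrative, and the recursions are unscaled, so this is for short sequences only:

```python
# Forward/Backward pass with O_i summed out of the emission, then P(O_i=1 | E).
import numpy as np

def gauss(x, m, s2):
    return np.exp(-((x - m) ** 2) / (2 * s2)) / np.sqrt(2 * np.pi * s2)

def outlier_posterior(x, pi0, A, mu, s2, d2, rho):
    n, K = len(x), len(pi0)
    e0 = gauss(x[:, None], mu[None, :], s2)       # P(x_i | S_i, O_i = 0)
    e1 = gauss(x[:, None], mu[None, :], s2 + d2)  # P(x_i | S_i, O_i = 1)
    e = (1 - rho) * e0 + rho * e1                 # emission with O_i marginalised out
    F = np.zeros((n, K)); B = np.ones((n, K))
    F[0] = pi0 * e[0]
    for i in range(1, n):
        F[i] = (F[i - 1] @ A) * e[i]
    for i in range(n - 2, -1, -1):
        B[i] = A @ (e[i + 1] * B[i + 1])
    post_s = F * B
    post_s /= post_s.sum(axis=1, keepdims=True)   # P(S_i | E)
    w = rho * e1 / e                              # P(O_i = 1 | S_i, x_i)
    return (w * post_s).sum(axis=1)               # P(O_i = 1 | E)

rng = np.random.default_rng(2)
pi0 = np.array([0.5, 0.5]); A = np.array([[0.95, 0.05], [0.05, 0.95]])
mu = np.array([0.0, 5.0])
x = np.concatenate([rng.normal(0, 1, 25), rng.normal(5, 1, 25)])
x[10] += 15.0                                     # inject one artificial outlier
p_out = outlier_posterior(x, pi0, A, mu, 1.0, 36.0, 0.05)
```

The injected point receives posterior outlier probability close to 1 while ordinary points stay low.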


SLIDE 11

EM algorithm for the ad hoc model

Parameter updates: 2 new parameters (ρ, δ²)

◮ Transition parameters have the same update formulas as in the plain HMM

◮ ρ = (1/n) Σ_{i=1}^{n} P(Oi = 1|E)

◮ µs, σ², δ² found as fixed points:

µs = Σ_i xi [P(Si = s, Oi = 1|E) σ² + P(Si = s, Oi = 0|E)(σ² + δ²)] / Σ_i [P(Si = s, Oi = 1|E) σ² + P(Si = s, Oi = 0|E)(σ² + δ²)]

σ² = Σ_i Σ_s (xi − µs)² P(Si = s, Oi = 0|E) / Σ_i Σ_s P(Si = s, Oi = 0|E)

σ² + δ² = Σ_i Σ_s (xi − µs)² P(Si = s, Oi = 1|E) / Σ_i Σ_s P(Si = s, Oi = 1|E)

where

P(Si, Oi = 1|E) = [ ρ P(Xi = xi|Si, Oi = 1) / ( ρ P(Xi = xi|Si, Oi = 1) + (1 − ρ) P(Xi = xi|Si, Oi = 0) ) ] · P(Si|E)

Initialization

k-means algorithm and z-score
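One fixed-point sweep of the M-step above can be sketched as follows. The joint posteriors P(Si = s, Oi = o|E) are taken as given (in practice they come from the Forward/Backward pass), and the hard-assignment toy values below are purely illustrative:

```python
# One sweep of the fixed-point updates for (rho, mu_s, sigma^2, delta^2).
import numpy as np

def m_step(x, p0, p1, s2, d2):
    """p0[i, s] = P(S_i=s, O_i=0 | E), p1[i, s] = P(S_i=s, O_i=1 | E)."""
    n = len(x)
    rho = p1.sum() / n                              # rho = (1/n) sum_i P(O_i=1 | E)
    w = p1 * s2 + p0 * (s2 + d2)                    # weights from the mu_s formula
    mu = (x[:, None] * w).sum(axis=0) / w.sum(axis=0)
    r2 = (x[:, None] - mu[None, :]) ** 2
    s2_new = (r2 * p0).sum() / p0.sum()             # sigma^2 update
    d2_new = (r2 * p1).sum() / p1.sum() - s2_new    # from the sigma^2 + delta^2 update
    return rho, mu, s2_new, max(d2_new, 0.0)

rng = np.random.default_rng(3)
x = np.concatenate([rng.normal(0, 1, 50), rng.normal(5, 1, 50)])
resp = np.zeros((100, 2)); resp[:50, 0] = 1.0; resp[50:, 1] = 1.0  # hard P(S_i=s | E)
p1 = 0.02 * resp                                    # pretend P(O_i=1 | E) = 0.02
p0 = resp - p1
rho, mu, s2_new, d2_new = m_step(x, p0, p1, 1.0, 25.0)
```

In the full EM loop this sweep alternates with a Forward/Backward E-step until the parameters stabilise.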


SLIDE 12

Validation of the ad hoc model on a toy example

◮ Simulations done with the homoscedastic Gaussian ad hoc model:

◮ H0: no outliers (δ = 0)
◮ H1: presence of outliers (δ ≠ 0)

◮ Global statistic: T = max_{i=1,...,n} P(Oi = 1|E)
◮ P(Oi|E) computed using the true parameters

δ     AUC(T)
0.00  0.52 [0.48, 0.55]
0.50  0.49 [0.46, 0.53]
1.00  0.56 [0.52, 0.59]
1.50  0.74 [0.71, 0.78]
2.00  0.87 [0.85, 0.89]
2.50  0.93 [0.91, 0.95]
3.00  0.97 [0.95, 0.98]
3.50  0.99 [0.98, 0.99]
4.00  0.99 [0.99, 1.00]
4.50  0.99 [0.99, 1.00]


SLIDE 13

Application of the ad hoc model to real data

◮ CNV dataset from breast cancer cell line BT474 [Snijders 2001]
◮ Level-based model: Si = level of observation Xi
◮ Parameters in the ad hoc model estimated with the EM algorithm

[Figure: left panel “Original data” (Index vs X); right panel “Original data” (Index vs −log(1 − post_out)).]


SLIDE 14

Application of the ad hoc model to real data

◮ CNV dataset from breast cancer cell line BT474 [Snijders 2001]
◮ Level-based model: Si = level of observation Xi
◮ Parameters in the ad hoc model estimated with the EM algorithm

[Figure: left panel “Original data with outliers” (Index vs X, outliers marked); right panel “Original data” (Index vs −log(1 − post_out)).]


SLIDE 15

Application of the ad hoc model to real data

◮ CNV dataset from breast cancer cell line BT474 [Snijders 2001]
◮ Level-based model: Si = level of observation Xi
◮ Parameters in the ad hoc model estimated with the EM algorithm

[Figure: left panel “Original data with outliers” (Index vs X, outliers marked); right panel “Original data with outliers” (Index vs −log(1 − post_out)).]


SLIDE 16

Outlier detection through relative entropy

Intuition

◮ If Xi = xi is an outlier then it must have a strong influence on P(S1:n|E) = P(S1:n|X1:n = x1:n)

◮ As a consequence, P(S1:n|X1:n = x1:n) must differ significantly from P(S1:n|X−i = x−i)

◮ We can try to use the relative entropy

Ki := Σ_{S1:n} P(S1:n|X−i = x−i) log [ P(S1:n|X−i = x−i) / P(S1:n|X1:n = x1:n) ]

for outlier detection: “the higher Ki, the more likely Xi = xi is an outlier”

◮ Technical problem: how to compute Ki?


SLIDE 17

Computing Ki

◮ By naively using the backward/forward recursions, the complexity of computing Ki for a given i is O(n) ⇒ the overall complexity is O(n²)

Linear-time algorithm for computing Ki for all i = 1, . . . , n

Ki = Σ_{Si} P(Si|X−i = x−i) log [ P(Si|X−i = x−i) / P(Si|X1:n = x1:n) ], with

P(Si|X−i = x−i) = F*_i(Si) Bi(Si) / Σ_{Si} F*_i(Si) Bi(Si)

where B and F are the standard backward/forward quantities for HMMs and

F*_i(Si) = Σ_{Si−1} Fi−1(Si−1) P(Si|Si−1).

⇒ the overall complexity is O(n)
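A sketch of the O(n) computation of all the Ki (assumed Gaussian HMM; names and parameter values are illustrative; unscaled recursions, so short sequences only):

```python
# Linear-time relative-entropy scores: one Forward, one Backward, one F* pass.
import numpy as np

def gauss(x, m, s2):
    return np.exp(-((x - m) ** 2) / (2 * s2)) / np.sqrt(2 * np.pi * s2)

def relative_entropy_scores(x, pi0, A, mu, s2):
    n, K = len(x), len(pi0)
    e = gauss(x[:, None], mu[None, :], s2)
    F = np.zeros((n, K)); B = np.ones((n, K)); Fs = np.zeros((n, K))
    Fs[0] = pi0                                     # F*_1(s) = P(S_1 = s)
    F[0] = Fs[0] * e[0]
    for i in range(1, n):
        Fs[i] = F[i - 1] @ A                        # F*_i(s) = sum_r F_{i-1}(r) A[r, s]
        F[i] = Fs[i] * e[i]
    for i in range(n - 2, -1, -1):
        B[i] = A @ (e[i + 1] * B[i + 1])
    p = F * B;  p /= p.sum(axis=1, keepdims=True)   # P(S_i | X_1:n)
    q = Fs * B; q /= q.sum(axis=1, keepdims=True)   # P(S_i | X_{-i})
    return (q * np.log(q / p)).sum(axis=1)          # K_i for all i, O(n) overall

rng = np.random.default_rng(4)
pi0 = np.array([0.5, 0.5]); A = np.array([[0.95, 0.05], [0.05, 0.95]])
mu = np.array([0.0, 5.0])
x = np.concatenate([rng.normal(0, 1, 25), rng.normal(5, 1, 25)])
x[40] -= 15.0                                       # inject one artificial outlier
Ki = relative_entropy_scores(x, pi0, A, mu, 1.0)
```

Note that F*_i only drops the emission at position i, so the same Backward pass serves both posteriors.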


SLIDE 18

Validation of the ad hoc model on the toy example and comparison

◮ Global statistic: S = max_{i=1,...,n} Ki

δ     AUC(S)             AUC(T)
0.00  0.50 [0.47, 0.54]  0.52 [0.48, 0.55]
0.50  0.50 [0.46, 0.53]  0.49 [0.46, 0.53]
1.00  0.52 [0.49, 0.56]  0.56 [0.52, 0.59]
1.50  0.55 [0.52, 0.59]  0.74 [0.71, 0.78]
2.00  0.69 [0.66, 0.73]  0.87 [0.85, 0.89]
2.50  0.76 [0.73, 0.79]  0.93 [0.91, 0.95]
3.00  0.84 [0.81, 0.87]  0.97 [0.95, 0.98]
3.50  0.87 [0.85, 0.90]  0.99 [0.98, 0.99]
4.00  0.92 [0.91, 0.94]  0.99 [0.99, 1.00]
4.50  0.96 [0.95, 0.97]  0.99 [0.99, 1.00]


SLIDE 19

Application of the relative entropy based method to real data

◮ Same CNV dataset as before
◮ Parameters in the underlying HMM estimated with the EM algorithm

[Figure: “Original data” (Index vs X).]


SLIDE 20

Application of the relative entropy based method to real data

◮ Same CNV dataset as before
◮ Parameters in the underlying HMM estimated with the EM algorithm

[Figure: left panel “Original data” (Index vs Relative Entropy); right panel “Original data” (Index vs −log(1 − post_out)).]


SLIDE 21

Application of the relative entropy based method to real data

◮ Same CNV dataset as before
◮ Parameters in the underlying HMM estimated with the EM algorithm

[Figure: “Original data with outliers” (Index vs X, outliers marked).]


SLIDE 22

Application of the relative entropy based method to real data

◮ Same CNV dataset as before
◮ Parameters in the underlying HMM estimated with the EM algorithm

[Figure: left panel “Original data with outliers” (Index vs Relative Entropy); right panel “Original data with outliers” (Index vs −log(1 − post_out)).]


SLIDE 23

Comparison on real CNV dataset

◮ Original data: n = 120 observations
◮ H0: random samples of n/2 observations from the original dataset
◮ H1: ρ × n/2 outliers added with N(0, δ²) noise (ρ = 0.05, δ = 6)
◮ Parameters estimated

[Figure: “CNV dataset” (Index vs X).]



SLIDE 25

Results of comparison on real data

ROC curves

[Figure: ROC curves (Sensitivity vs Specificity). AUC: KLD 0.85 [0.83, 0.87]; ad hoc model 0.77 [0.74, 0.79]; z-score 0.67 [0.65, 0.69].]

Discussion

◮ When the data are not generated according to the ad hoc model, the method based on relative entropy performs better

◮ The method based on the z-score is not satisfactory


SLIDE 26

Final word

Conclusions

◮ Ad hoc model:

◮ + Explicit modeling of outliers, convenient for simulating
◮ − Intricate EM algorithm
◮ − Very sensitive, false positives

◮ Method based on relative entropy:

◮ + Model free
◮ + Parameter estimation simple to implement and fast
◮ + Robust

Perspectives

◮ Local statistics for outlier detection based on relative entropy
◮ Application to biological data
◮ Extension to Bayesian networks with complex topologies: influence of observations

SLIDE 27

References

  • J. Fridlyand et al. Hidden Markov models approach to the analysis of array CGH data. Journal of Multivariate Analysis, 2004.
  • A. Olshen et al. Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics, 2004.
  • J. Bilmes. A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models. International Computer Science Institute, 1998.
  • S.S. Shah et al. Integrating copy number polymorphisms into array CGH analysis using a robust HMM. Bioinformatics, 2006.
  • A. Snijders et al. Assembly of microarrays for genome-wide measurement of DNA copy number by CGH. Nature Genetics, 2001.
