How to Take into Account the Discrete Parameters in the BIC - PowerPoint PPT Presentation

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives How to Take into Account the Discrete Parameters in the BIC Criterion? V. Vandewalle University Lille 2, IUT STID COMPSTAT 2010 Paris August 23th, 2010

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives Intorduction Issue • Some models involve discrete parameters. • The discrete parameters play a part in the likelihood overfitting . • But, they cannot be penalized using standard BIC approximation. Study • Study the influence of the discrete parameters in the BIC approximation • Focus on a simple model : the modal modality model • Study the accuracy of differents approximations

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives Outline The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives The modal modality model Model • X ∼ M ( 1 , α 1 , . . . , α m ) ( � m h = 1 α h = 1, α h > 0). • x = ( x 1 , x 2 , . . . , x n ) an n i.i.d. sample coming from X • Constraint proposed by Biernacki et al. (2006) : � 1 − ε if h = h ∗ α h = otherwise, ε m − 1 h ∗ the location of the modal modality and 0 ≤ ε ≤ m − 1 m . • Two parameters must be estimated : ε which is continuous and h ∗ which is discrete. Comments • Intuitive interpretation. • Useful to get parsimonious models in clustering.

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives The modal modality model • Both continuous and discrete parameters, in a simple case. • In a Bayesian setting integration over both continuous and discrete parameters.

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives Integrated likelihood • Prior on ( ε, h ∗ ) : p ( ε, h ∗ ) = 1 mp ( ε ) . • Integrated likelihood : � m m − 1 � p ( x ) = 1 m p ( x | ε, h ∗ ) p ( ε ) d ε. m 0 h ∗ = 1 • Truncated Dirichlet prior for p ( ε ) p ( ε ) = C ε − 1 2 ( 1 − ε ) − 1 2 1 [ 0 , m − 1 m ]( ε ) , with C some normalization constant.

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives Integrated likelihood n � Let n h = x ih , the logarithm of the integrated likelihood (IL) is i = 1 � � � � n − n h � m m − 1 � 1 m ε C ε − 1 2 ( 1 − ε ) − 1 ( 1 − ε ) n h 2 d ε IL = log m m − 1 0 h = 1 How can we approximate this integral ? • Neglect discrete parameters. • Make Laplace approximation for each term of the sum. • Take into account the number of states of the discrete variable into account in the penalization.

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives Standard BIC approximation • Maximum likelihood estimator of the parameters � � n − n h ε ε, � ε, h ( 1 − ε ) n h (ˆ h ∗ ) = arg max , m − 1 n c h ∗ = arg max h n h and ˆ which gives � ε = 1 − h ∗ n . • If the discrete parameters are not taken into account, the BIC criterion is : � h ∗ � � � n − n c ˆ − 1 ε ε ) n c BIC 1 = log ( 1 − ˆ 2 log n , h ∗ m − 1 • However this approximation is not justified when considering discrete parameters.

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives Taking the discrete parameters into account For the sum into IL, there are terms for which the maximum in reached on the border for which we need the following proposition. Proposition Let L : [ a , b ] �→ R , such that L be one time differentiable on [ a , b ] and that it reaches its maximum at b with L ′ ( b ) > 0 . Then �� b � e nL ( u ) du log = nL ( b ) − log n + O ( 1 ) . a For a comparison note that �� b � = nL ( c ) − 1 e nL ( u ) du log 2 log n + O ( 1 ) , a if L would reach its maximum for c ∈ ] a , b [ .

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives Taking the discrete parameters into account • Applying the previous proposition �� n − n h m − 1 ε m C ε − 1 2 ( 1 − ε ) − 1 ( 1 − ε ) n h 2 d ε log = m − 1 0 ε, h ) − 1 + s h log p ( x | ˆ log n + O ( 1 ) 2 ε = m − 1 where s h = 1 if the constraint is saturated (i.e. ˆ m ) and 0 otherwise. • Then replacing these approximations in IL we get � � � � n − n h m � 1 ˆ ε h n − 1 + sh ε h ) n h BIC 2 = log ( 1 − ˆ 2 m m − 1 h = 1 where ˆ ε h is the maximum likelihood estimator of ε when h is constrained to be the modal modality.

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives Taking the discrete parameters into account • Simplify BIC 2 to avoid the integration on the states of the discrete variable, which gives the alternative criterion � h ∗ � � � n − n c ˆ − 1 ε c h ∗ ) n c h ∗ BIC 3 = log ( 1 − ˆ 2 log n − log m . ε c h ∗ m − 1 • It is the standard BIC criterion penalized by the logarithm of the number of possible states of the discrete variable.

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives Numerical experiments • Study the accuracy of the approximation in a simple case. • Study the accuray for parsimonious models on binary data.

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives X ∼ M ( 1 , 0 . 40 , 0 . 30 , 0 . 30 ) X ∼ M ( 1 , 0 . 40 , 0 . 35 , 0 . 25 ) Behavior of each criterion according to the number of data when M2 is true Behavior of each criterion according to the number of data when M1 is true 10000 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● IL ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● BIC1 Number of selection of the right model (M1) ● ● ● ● ● ● ● BIC2 ● ● ● ● ● ● ● ● ● ● ● Number of selection of the right model (M1) 8000 ● ● ● ● ● ● ● ● BIC3 ● ● ● 9500 ● ● ● ● ● ● ● ● ● ● 6000 ● ● ● ● ● IL BIC1 ● 9000 ● BIC2 ● BIC3 ● 4000 ● ● ● ● ● ● 8500 ● 2000 ● ● ● ● ● 0 200 400 600 800 1000 0 200 400 600 800 1000 Number of data Number of data F IG .: Number of times where F IG .: Number of times where the parsimonious model is the true model is selected. selected.

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives Binary simulated data Model • Binary data in the mulvariate case (in dimension d ). • x i ( i ∈ { 1 , . . . , n } ) with x i = ( x 1 i , x 2 i , . . . , x d i ) . • x j i drawn from a Bernoulli distribution. • Equality of ε for each variable (Celeux and Govaert (1991)). Experimental setting • If d is large it is not possible to perform the integration over all the states of the discrete variable. • Importance sampling (IS) to compute the sum. • Compare the different approximations of the integrated likelihood without considering the model choice issue. • d = 5, d = 10 and d = 20 variables. • ε = 0 . 45 for each variable. • 100 datasets, simulate 10 , 000 modal positions for IS.

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives Binary simulated data Crit \ n 20 50 100 1000 d = 5 dimensions IL − 70 . 91 ( 0 . 9 ) − 174 . 96 ( 1 . 2 ) − 347 . 77 ( 1 . 7 ) − 3448 . 18 ( 7 . 2 ) BIC 1 − 68 . 77 ( 1 . 4 ) − 172 . 59 ( 1 . 7 ) − 345 . 03 ( 2 . 2 ) − 3444 . 38 ( 7 . 2 ) BIC 2 − 70 . 50 ( 0 . 8 ) − 174 . 59 ( 1 . 2 ) − 347 . 41 ( 1 . 6 ) − 3447 . 84 ( 7 . 2 ) BIC 3 − 72 . 23 ( 1 . 4 ) − 176 . 05 ( 1 . 7 ) − 348 . 49 ( 2 . 2 ) − 3447 . 85 ( 7 . 2 ) d = 10 dimensions IL − 140 . 24 ( 1 . 0 ) − 348 . 15 ( 1 . 2 ) − 693 . 71 ( 2 . 4 ) − 6891 . 66 ( 10 ) BIC 1 − 135 . 98 ( 2 . 1 ) − 343 . 32 ( 2 . 1 ) − 688 . 22 ( 3 . 3 ) − 6884 . 02 ( 10 ) BIC 2 − 139 . 49 ( 1 . 0 ) − 347 . 44 ( 1 . 2 ) − 693 . 01 ( 2 . 3 ) − 6890 . 97 ( 10 ) BIC 3 − 142 . 91 ( 2 . 1 ) − 350 . 25 ( 2 . 1 ) − 695 . 15 ( 3 . 3 ) − 6890 . 95 ( 10 ) d = 20 dimensions IL − 279 . 01 ( 0 . 8 ) − 694 . 51 ( 1 . 4 ) − 1385 . 87 ( 2 . 4 ) − 13795 . 88 ( 14 ) − 271 . 06 ( 2 . 6 ) − 685 . 31 ( 3 . 2 ) − 1374 . 98 ( 3 . 5 ) − 13765 . 95 ( 11 ) BIC 1 − 277 . 93 ( 0 . 8 ) − 693 . 46 ( 1 . 4 ) − 1384 . 84 ( 2 . 4 ) − 13794 . 85 ( 14 ) BIC 2 BIC 3 − 284 . 93 ( 2 . 6 ) − 699 . 18 ( 3 . 2 ) − 1388 . 85 ( 3 . 5 ) − 13779 . 81 ( 11 ) T AB .: Mean value of the criterion according the values of n and d , the standard deviation is given into parenthesis.

How to Take into Account the Discrete Parameters in the BIC - PowerPoint PPT Presentation

The modal modality model Integrated likelihood and BIC approximations Numerical experiments Conclusion and perspectives How to Take into Account the Discrete Parameters in the BIC Criterion? V. Vandewalle University Lille 2, IUT STID

Camera Parameters INEL 6088 Computer Vision Camera Parameters Extrinsic parameters: define

Account Compliance Trust Account Reconciliation Agenda Trust Account Overview Top

Discrete Mathematics Jeremy Siek Spring 2010 Jeremy Siek Discrete Mathematics 1 / 118 Jeremy

Cyber-Physical Systems Discrete Dynamics IECE 553/453 Fall 2019 Prof. Dola Saha 1 Discrete

CMSC 222: Discrete Mathematics Prof S Fall 2018 What is Discrete Mathematics? Discrete

Cyber-Physical Systems Discrete Dynamics ICEN 553/453 Fall 2018 Prof. Dola Saha 1 Discrete

Plan Discrete paths as Heyting algebras Discrete paths as categories Discrete paths as quantales

Discrete-time Systems in the Time Domain Chaiwoot Boonyasiriwat August 21, 2020 Discrete-time

Evidence evaluation for discrete data Evidence evaluation for discrete data Evidence evaluation

Two-Port Networks Definitions Impedance Parameters Admittance Parameters Hybrid

+ + Review n function parts: n parameters n no parameters n return type n multiple

Chapter 4 Parameters 1 Parameters T wo methods of passing arguments to parameters

One-Port v Admittance Parameters Network - Hybrid Parameters i' 1 Transmission

Account Home Revamp AB 1296 January 23 rd 2020 | 1 Account Home Fixer-Upper We redesigned

LANDING ACCOUNT PROCEDURES. LANDING ACCOUNT The Landing Account is a report of all the cargo that

TDC Special Project Accounts Development Account Contingency Account Donations and

Frequentist Statistics DS GA 1002 Probability and Statistics for Data Science

Safety Assurance in in Cyber-Physical Systems buil ilt wit ith Le Learning-Enabled Components

INTEGRAL PRIVACY COMPLIANT STATISTICS COMPUTATION NAVODA SENAVIRATHNE UNIVERSITY OF SKVDE,

Learning Fair Policies in Multiobjective (Deep) Reinforcement Learning with Average and Discounted

Compressive Extreme Learning Machines Improved Models Through Exploiting Time-Accuracy Trade-offs

Menu Concerns about the quality of the predictive distributions Augmentation: a bit more

First International Workshop on Learning over Multiple Contexts LMCE 2014 Nancy, 19 September

Event Calendar SHIMA Daio,

Sambuz

Useful Links

Newsletter

Mail Us