fNML Criterion for Learning Bayesian Network Structures
Tomi Silander Teemu Roos Petri Kontkanen Petri Myllymaki
Helsinki Institute for Information Technology HIIT FINLAND PGM‐08 Hirtshals September 17‐19 2008
Conditional independence assumptions correspond to a factorization of the joint probability distribution:
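The factorization in question is the standard Bayesian network chain rule (with pa_i denoting the parents of variable X_i in the graph; notation assumed, not taken from the slides):

```latex
P(X_1, \dots, X_m) \;=\; \prod_{i=1}^{m} P(X_i \mid \mathrm{pa}_i)
```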
NAME       GENDER  PROFESSION  CHILDREN
Teemu      male    researcher  2
Clark      male    reporter
Margrethe  female  queen       2
:          :       :           :
The state-of-the-art model selection criterion: the Bayesian Dirichlet equivalent (BDe) score.
- Assumes a Dirichlet prior on the model parameters θ.
- Evaluates the marginal likelihood of the data given the model.
- Depends on a hyper-parameter α.
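A minimal sketch of the local BDeu score (the common uniform-prior variant of BDe) for one variable given its parents; the function name and signature are illustrative, not from the paper:

```python
import math
from collections import Counter

def bdeu_local_score(child_col, parent_cols, r_child, parent_cards, alpha=1.0):
    """Log marginal likelihood of one variable given its parents under a
    BDeu prior with equivalent sample size alpha (a sketch, assuming
    discrete data given as lists of integer values)."""
    n = len(child_col)
    # One tuple of parent values per data row (empty tuple if no parents).
    rows = list(zip(*parent_cols)) if parent_cols else [()] * n
    q = math.prod(parent_cards)   # number of parent configurations (1 if none)
    a_j = alpha / q               # Dirichlet mass per parent configuration
    a_jk = a_j / r_child          # ... per (configuration, child value) cell
    n_j = Counter(rows)                    # counts per parent configuration
    n_jk = Counter(zip(rows, child_col))   # joint counts with the child value
    score = sum(math.lgamma(a_j) - math.lgamma(a_j + c) for c in n_j.values())
    score += sum(math.lgamma(a_jk + c) - math.lgamma(a_jk) for c in n_jk.values())
    return score
```

For example, a binary variable with no parents and data [0, 1] gets score log(1/8) under alpha = 1: the first observation has marginal probability 1/2 and the second, given the first, 1/4.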
BIC: asymptotic approximation of the marginal likelihood.
AIC: asymptotic approximation of the estimated prediction error.
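In standard form, with d(G) the number of free parameters of structure G, θ̂ the maximum likelihood parameters, and n the sample size (the slide's own formulas did not survive extraction; these are the usual definitions):

```latex
\mathrm{BIC}(G, D) = \log P(D \mid \hat{\theta}, G) - \frac{d(G)}{2} \log n,
\qquad
\mathrm{AIC}(G, D) = \log P(D \mid \hat{\theta}, G) - d(G)
```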
Minimum Description Length (MDL) Principle: choose the model that yields the shortest description of the data together with the model.
- Too simple model: data part long, model part short.
- "Just right" model: data part short, model part short.
- Too complex model: data part short, model part long.
- Asymptotic two-part code: code-length asymptotically the same as BIC.
- Bayesian marginal likelihood.
- Modern (minimax-regret-optimal) code: normalized maximum likelihood (NML). Problem: NML is computationally very hard.
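For reference, the NML distribution for model class M normalizes the maximized likelihood over all data sets of the same size n:

```latex
P_{\mathrm{NML}}(x^n \mid \mathcal{M}) \;=\;
\frac{P\bigl(x^n \mid \hat{\theta}(x^n), \mathcal{M}\bigr)}
     {\sum_{y^n} P\bigl(y^n \mid \hat{\theta}(y^n), \mathcal{M}\bigr)}
```

The denominator ranges over every possible data set of size n, which is what makes NML hard to compute for general Bayesian network models.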
The Bayesian decision principle is minimization of expected loss:
  min_A E_X [loss(A, X)]
MDL (especially NML) is based on minimization of worst-case regret:
  min_A max_X [loss(A, X) − min_A' loss(A', X)]
The bracketed quantity, the excess loss over the best action in hindsight, is the "regret".
We propose a new MDL score, factorized NML (fNML), which is efficiently computable, hyper-parameter-free, and consistent.
NML: minimax code applied to the whole data as one block.
fNML: minimax code applied column by column
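Applying the minimax code per column gives the factorized score (a sketch in the paper's spirit, with D_i denoting column i of the data and D_pa_i the columns of its parents in G):

```latex
P_{\mathrm{fNML}}(D \mid G) \;=\;
\prod_{i=1}^{m} P_{\mathrm{NML}}\bigl(D_i \mid D_{\mathrm{pa}_i}\bigr)
```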
fNML: Conditional minimax code when parent(s) exist.
Each column is encoded using the minimax code for multinomials. Using fast NML algorithms, this takes O(n log n) per column.
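The key quantity per column is the normalizing sum (regret term) of the multinomial NML distribution. One way to compute it is the linear recurrence of Kontkanen and Myllymäki; a sketch, with an illustrative function name:

```python
import math

def multinomial_nml_complexity(n, K):
    """Normalizing sum C(K, n) of the NML distribution for a K-valued
    multinomial over n observations, via the recurrence
    C(K+2, n) = C(K+1, n) + (n/K) * C(K, n).
    The fNML column cost subtracts log C from the maximized log-likelihood."""
    if n == 0 or K == 1:
        return 1.0
    # Binary base case by direct summation (0**0 == 1 in Python covers the ends).
    c2 = sum(math.comb(n, h) * (h / n) ** h * ((n - h) / n) ** (n - h)
             for h in range(n + 1))
    if K == 2:
        return c2
    prev, cur = 1.0, c2           # C(1, n), C(2, n)
    for k in range(1, K - 1):     # builds C(3, n), ..., C(K, n)
        prev, cur = cur, cur + (n / k) * prev
    return cur
```

For instance, with one observation every one of the K singleton sequences has maximized likelihood 1, so the sum is K: `multinomial_nml_complexity(1, 3)` returns 3.0.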
(Haughton, 1988): Any penalized likelihood score of the form
  log P(D | θ̂, G) − a_n · d(G),
where a_n satisfies a_n → ∞ and a_n / n → 0, is consistent.
Theorem: fNML behaves asymptotically like BIC, i.e., its penalty term is (d(G)/2) · log n + O(1).
Hence, fNML is consistent.
[Plot: prediction performance of BIC, BDe, and fNML.] BDe is optimal when the prior is "correct"; fNML is almost as good.
[Plot: prediction performance with an "incorrect" prior.] BDe is much worse when the prior is "incorrect"; fNML is more robust.
Problem: super-exponential search space.
Solution: decomposable scores:
  SCORE(G, D) = Σ_{i=1}^{m} S(D_i, D_{G_i})
For decomposable scores, exact search for the globally optimal structure is feasible (Koivisto & Sood, 2004; Silander and Myllymäki, 2006).