
Graphical Models: Independence & Factorization



  1. Graphical Models (slides adapted from Sandro Schönborn)

     Graphical Models
     • Independence & Factorization
       • Including structure
       • Complexity of multivariate problems
       • Independence assumptions
     • Graphical Models
       • Graphs to depict factorizations
       • Topological properties
       • Causal modeling
       • Factor graphs

  2. Graphical Models
     • Independence & Factorization
       • Including structure
       • Complexity of multivariate problems
       • Independence assumptions
     • Graphical Models
       • Graphs to depict factorizations
       • Topological properties
       • Causal modeling
       • Factor graphs
     With examples from chapters 13 & 14 of Russell, Norvig, Artificial Intelligence – A Modern Approach, 3rd ed., Pearson 2010.

     Missing Structure
     • Until now: put everything into a large feature vector, then find the best classification or learn a full joint probability distribution
     • Knowledge about the domain? → features, pre-processing
     • Knowledge about feature dependencies? → classification method
     • How can we integrate specialist knowledge? It surely helps to make the problem easier!
     • How do we construct a composite system when only parts are available for training?

  3. Structured Problems
     • Relations among pixels (dependencies)
     • Genetic code → function
     • Image → facial expression

     Structure in Probabilistic Models
     • The Bayes formalism needs a likelihood and a prior, or a joint probability density/table
     • Both contain "structure/knowledge" information:
       • Likelihood: a likelihood is assigned to each possible combination of features → contains every possible form of structure among the features
       • Prior: prior belief → contains our knowledge about the model/domain before seeing data
     • Structure is complicated; it can render models intractable
     • Too much structure is also undesired: classifiers should not be entirely hand-designed

  4. Multivariate Problems: Complexity
     • Most problems involve many variables: images > 10^6 pixels (1 MP), DNA data, web consumer data, …
     • Structure involves interdependencies among many variables
       Image pixels show strong correlations with each other → complicates inference
     • Estimation of densities is susceptible to high dimensionality
       e.g. the covariance matrix has dimension d × d and captures only linear relations
       A joint probability table needs one entry for every possible combination → size grows as exp(d) → complicates density estimation

     Example: Dentist Diagnosis with Joint Probability
     • Dentist diagnosis considering 4 binary variables:
       • toothache: the patient has a toothache
       • cavity: the patient has a cavity
       • probe: the dentist's probe catches in the tooth
       • rain: it is currently raining
     • The joint probability table (JPT) gives the occurrence probability of each combination; complexity of estimation ∝ 2^d

                           toothache           ¬toothache
                         probe   ¬probe      probe   ¬probe
        rain    cavity   0.036   0.004       0.024   0.003
               ¬cavity   0.005   0.021       0.048   0.192
       ¬rain    cavity   0.072   0.008       0.048   0.005
               ¬cavity   0.011   0.043       0.096   0.384

     • The table contains a lot of structure, although it is not easily extractable
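As a minimal sketch (Python; the dictionary layout and variable names are mine, the numbers are the slide's), the full JPT is just one entry per combination of the four binary variables:

```python
# Dentist joint probability table (JPT) from the slide.
# Keys are (cavity, toothache, probe, rain) with 1 = true, 0 = false.
JPT = {
    (1, 1, 1, 1): 0.036, (1, 1, 0, 1): 0.004, (1, 0, 1, 1): 0.024, (1, 0, 0, 1): 0.003,
    (0, 1, 1, 1): 0.005, (0, 1, 0, 1): 0.021, (0, 0, 1, 1): 0.048, (0, 0, 0, 1): 0.192,
    (1, 1, 1, 0): 0.072, (1, 1, 0, 0): 0.008, (1, 0, 1, 0): 0.048, (1, 0, 0, 0): 0.005,
    (0, 1, 1, 0): 0.011, (0, 1, 0, 0): 0.043, (0, 0, 1, 0): 0.096, (0, 0, 0, 0): 0.384,
}

assert len(JPT) == 2 ** 4                    # one entry per combination: 2^d = 16
assert abs(sum(JPT.values()) - 1.0) < 1e-9   # a valid distribution sums to 1
```

The exponential table size (2^d entries for d binary variables) is exactly the estimation problem the slide points out.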

  5. Example: Dentist Inference
     • "What is the probability of a cavity if the probe catches?"
       We do not know: toothache T, rain R. Abbreviations: c = cavity, p = probe catches.

       P(cavity | probe) = P(c, p) / P(p)

       P(p, c) = Σ_{R,T} P(p, c, R, T)
               = P(p, c, r, t) + P(p, c, r, ¬t) + P(p, c, ¬r, t) + P(p, c, ¬r, ¬t)   (marginalization)

       P(p) = Σ_{R,T,C} P(R, T, C, p) = …

       Nasty complexity ⇒ P(cavity | probe) ≈ 0.53

     Multivariate Problems
     • Most problems involve many variables
     • Estimation of densities is susceptible to high dimensionality: exponential requirement of samples
     • Inference with many variables? Impractical complexity
     • How to handle joint probability tables? Encode and decode structure in the JPT?
     • Are probabilities practically useless??
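The marginalization above can be sketched as brute-force summation over the JPT (the `marginal` helper is my own illustration, not from the slides):

```python
# P(cavity | probe) by summing out the hidden variables toothache and rain.
# JPT keys: (cavity, toothache, probe, rain), numbers from the dentist example.
JPT = {
    (1, 1, 1, 1): 0.036, (1, 1, 0, 1): 0.004, (1, 0, 1, 1): 0.024, (1, 0, 0, 1): 0.003,
    (0, 1, 1, 1): 0.005, (0, 1, 0, 1): 0.021, (0, 0, 1, 1): 0.048, (0, 0, 0, 1): 0.192,
    (1, 1, 1, 0): 0.072, (1, 1, 0, 0): 0.008, (1, 0, 1, 0): 0.048, (1, 0, 0, 0): 0.005,
    (0, 1, 1, 0): 0.011, (0, 1, 0, 0): 0.043, (0, 0, 1, 0): 0.096, (0, 0, 0, 0): 0.384,
}

def marginal(cavity=None, probe=None):
    """Sum out every variable that is not fixed (marginalization)."""
    return sum(v for (c, t, p, r), v in JPT.items()
               if (cavity is None or c == cavity)
               and (probe is None or p == probe))

posterior = marginal(cavity=1, probe=1) / marginal(probe=1)  # 0.18 / 0.34
print(round(posterior, 2))  # 0.53, as on the slide
```

The cost is the problem: every query touches the whole exponential table.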

  6. Independence and Factorization
     • Help comes through independence assumptions:
       • Marginal independence
       • Conditional independence
     • Independence assumptions lead to factorizations
       • Lowers the complexity of estimation and inference drastically
     • Explicit "non-structure statements" are a way of expressing structure
       • to deal with intermediate forms of dependence (anywhere from none to full)
       • which is easy to work with and can be used by specialists

     Marginal Independence
     • Full statistical independence among variables:
       P(X, Y) = P(X) · P(Y)        P(X | Y) = P(X)
     • Expert knowledge: no relation between X and Y, not linear, not higher order, none at all
     • Affects complexity drastically (n variables with k values each):
       Full independence: k^n → n · k
       For each independent variable: k^n → k^(n-1) + k
     • Unfortunately not very common: independent variables usually do not appear in the first place since they are irrelevant
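The parameter counts quoted above can be sketched numerically (the function names are mine):

```python
# Table sizes for a distribution over n variables with k values each.
def joint_size(n, k):
    return k ** n                    # full joint table: one entry per combination

def fully_independent(n, k):
    return n * k                     # n separate marginal tables

def one_independent_variable(n, k):
    return k ** (n - 1) + k          # split off a single independent variable

# 20 binary variables:
print(joint_size(20, 2))             # 1048576
print(fully_independent(20, 2))      # 40
print(one_independent_variable(20, 2))  # 524290
```

Even a single independence assumption halves the table; full independence collapses it from exponential to linear.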

  7. Marginal Independence
     • The dentist probabilities should not depend on the weather
     • Assume independence of rain from all the other variables:
       P(t, p, c, r) = P(t, p, c) · P(r)

                    toothache           ¬toothache            rain   ¬rain
                  probe   ¬probe      probe   ¬probe         0.333   0.667
        cavity    0.108   0.012       0.072   0.008
       ¬cavity    0.016   0.064       0.144   0.576

     • Lowered complexity of estimation; the structure is visible

     Conditional Independence
     • Independent conditional probabilities: independent if we know the value of a third variable
       P(X, Y | Z) = P(X | Z) · P(Y | Z)        P(X | Y, Z) = P(X | Z)
     • Lowers complexity:
       Full conditional independence: k^(n-1) · k → (n - 1) · k · k
       For each conditionally independent variable: k^(n-1) · k → (k^(n-2) + k) · k
     • Very useful:
       • Occurs more often than marginal independence
       • Causal modeling: effects of a common cause
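A small sketch (Python; table layout and names are mine) showing that the two factored tables regenerate the full 16-entry JPT:

```python
# Factored tables from the slide: P(t, p, c) with 8 entries and P(r) with 2.
P_tpc = {  # (cavity, toothache, probe) -> probability
    (1, 1, 1): 0.108, (1, 1, 0): 0.012, (1, 0, 1): 0.072, (1, 0, 0): 0.008,
    (0, 1, 1): 0.016, (0, 1, 0): 0.064, (0, 0, 1): 0.144, (0, 0, 0): 0.576,
}
P_rain = {1: 0.333, 0: 0.667}

# Marginal independence: P(t, p, c, r) = P(t, p, c) * P(r)
joint = {key + (r,): v * P_rain[r] for key, v in P_tpc.items() for r in (1, 0)}

assert len(joint) == 16
assert abs(sum(joint.values()) - 1.0) < 1e-9
# 8 + 2 stored numbers instead of 16: lower estimation complexity.
```

The reconstructed entries match the original JPT up to rounding, e.g. 0.108 · 0.333 ≈ 0.036 for (cavity, toothache, probe, rain).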

  8. Example: Conditional Independence
     • Expectation: catching of the probe should be "independent" of toothache
     • But they are not; they occur with strong correlation:
       P(t, p) ≠ P(t) · P(p)
     • The dependency can be "reduced" to a common cause: cavity (c → p, c → t)
     • Knowing about cavity renders toothache and probe independent:
       P(t, p | c) = P(t | c) · P(p | c)
       P(t | p, c) = P(t | c)
       P(p | t, c) = P(p | c)

     Example: Conditional Independence
     • Factorization into 4 factors (4 tables):

       P(c):   cavity 0.2     ¬cavity 0.8
       P(r):   rain   0.333   ¬rain   0.667

       P(t | c)  ("conditional probability table", CPT):
                  toothache   ¬toothache
        cavity      0.6          0.4
       ¬cavity      0.1          0.9

       P(p | c):
                  probe   ¬probe
        cavity     0.9      0.1
       ¬cavity     0.2      0.8

       P(p, t, c, r) = P(p | c) · P(t | c) · P(c) · P(r)

     • Lowered complexity of estimation; the structure is visible
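A sketch (Python; names are mine, CPT values chosen to be consistent with the dentist joint table) showing that the four-factor model reproduces the 8-entry dental table:

```python
# Four factors: P(c), P(t | c), P(p | c); rain is handled by a separate P(r).
P_c = {1: 0.2, 0: 0.8}
P_t_c = {(1, 1): 0.6, (0, 1): 0.4, (1, 0): 0.1, (0, 0): 0.9}  # (t, c) -> P(t | c)
P_p_c = {(1, 1): 0.9, (0, 1): 0.1, (1, 0): 0.2, (0, 0): 0.8}  # (p, c) -> P(p | c)

def joint_tpc(t, p, c):
    """P(t, p, c) = P(t | c) * P(p | c) * P(c) under conditional independence."""
    return P_t_c[(t, c)] * P_p_c[(p, c)] * P_c[c]

# The factorization reproduces the table entries exactly, e.g.:
assert abs(joint_tpc(1, 1, 1) - 0.108) < 1e-9   # cavity, toothache, probe
assert abs(joint_tpc(1, 0, 0) - 0.064) < 1e-9   # no cavity, toothache, no probe
```

Only 2 + 4 + 4 (+2 for rain) conditional entries are stored instead of 16 joint entries, and each factor can be estimated separately.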

  9. A Discriminative Shortcut?
     • The Bayes classifier only needs the posterior … direct estimation?
     • Posterior: diagnostic knowledge ("Toothache indicates a cavity.")
     • Likelihoods: causal knowledge ("A cavity causes toothache.")
     • Diagnostic information is what we want in the end → classification using the posterior
     • Generative models waste resources on modeling irrelevant details: details within the classes are not relevant for classification

     Why Generative?
     • Causal knowledge is more robust in structured domains:
       • More flexible model, e.g. add gum disease to the dentist diagnosis model
       • Individual parts of causal knowledge can change independently, e.g. use of a new, more precise probe
       • Expert knowledge is most often available in causal form (conditional independence relations)
     • Using factored causal knowledge has more advantages than just better complexity
     • Careful: generative models are prone to
       • over-structuring → bad estimation & inference quality
       • over-simplification → the model cannot capture necessary relations
     • Generative models are the usual way of Bayesian modeling

  10. Structure in Bayesian Models
     • Bayes classifiers / models use likelihood & prior to calculate the posterior:

       P(c | x) = P(x | c) · P(c) / Σ_c' P(x | c') · P(c')

     • Uncertainty is handled through probabilistic models
     • Structure:
       • The likelihood factorizes according to knowledge expressed through (conditional) independence relations
       • The prior captures knowledge about the model
     • Causal knowledge in the likelihood: generative model

     Graphical Models
     • Independence & Factorization
       • Including structure
       • Complexity of multivariate problems
       • Independence assumptions
     • Graphical Models
       • Graphs to depict factorizations
       • Topological properties
       • Causal modeling
       • Factor graphs
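The posterior formula can be sketched with the dentist numbers (P(cavity) = 0.2, P(probe | cavity) = 0.9; P(probe | ¬cavity) = 0.2 is the value consistent with the joint table):

```python
# P(c | x) = P(x | c) P(c) / sum over c' of P(x | c') P(c')
# Class c = cavity, observation x = "probe catches".
prior = {1: 0.2, 0: 0.8}          # P(c)
likelihood = {1: 0.9, 0: 0.2}     # P(probe | c)

unnorm = {c: likelihood[c] * prior[c] for c in prior}
evidence = sum(unnorm.values())   # denominator P(x) = 0.18 + 0.16 = 0.34
posterior = {c: v / evidence for c, v in unnorm.items()}

print(round(posterior[1], 2))  # 0.53, matching the earlier brute-force inference
```

The same answer as the JPT marginalization, but computed from small causal factors instead of an exponential table.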
