A Brief Introduction to Graphical Models and How to Learn Them from - PowerPoint PPT Presentation

A Brief Introduction to Graphical Models and How to Learn Them from Data Christian Borgelt Dept. of Knowledge Processing and Language Engineering Otto-von-Guericke-University of Magdeburg Universit¨ atsplatz 2, D-39106 Magdeburg, Germany E-mail: borgelt@iws.cs.uni-magdeburg.de WWW: http://fuzzy.cs.uni-magdeburg.de/~borgelt Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 1

Overview ✎ Graphical Models: Core Ideas and Notions ✎ A Simple Example: How does it work in principle? ✎ Conditional Independence Graphs ✍ conditional independence and the graphoid axioms ✍ separation in (directed and undirected) graphs ✍ decomposition/factorization of distributions ✎ Evidence Propagation in Graphical Models ✎ Building Graphical Models ✎ Learning Graphical Models from Data ✍ quantitative (parameter) and qualitative (structure) learning ✍ evaluation measures and search methods ✍ learning by conditional independence tests ✍ learning by measuring the strength of marginal dependences ✎ Summary Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 2

Graphical Models: Core Ideas and Notions ✎ Decomposition: Under certain conditions a distribution ✍ (e.g. a probability distribution) on a multi-dimensional domain, which encodes prior or generic knowledge about this domain, can be decomposed into a set ❢ ✍ 1 ❀ ✿ ✿ ✿ ❀ ✍ s ❣ of (overlapping) distributions on lower-dimensional subspaces. ✎ Simplified Reasoning: If such a decomposition is possible, it is sufficient to know the distributions on the subspaces to draw all inferences in the domain under consideration that can be drawn using the original distribution ✍ . ✎ Such a decomposition can nicely be represented as a graph (in the sense of graph theory), and therefore it is called a Graphical Model . ✎ The graphical representation ✍ encodes conditional independences that hold in the distribution, ✍ describes a factorization of the probability distribution, ✍ indicates how evidence propagation has to be carried out. Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 3

A Simple Example Example World Relation color shape size small medium small medium medium large medium medium medium ✎ 10 simple geometrical objects, 3 attributes. large ✎ One object is chosen at random and examined. ✎ Inferences are drawn about the unobserved attributes. Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 4

✥ ✩ ✥ ✥ ✥ ✩ ✩ ✪ ✩ ✥ ✩ ✩ ✩ ✩ ✩ ✪ ✏ ✥ The Reasoning Space large medium medium small ✎ The reasoning space consists of a finite set Ω of world states. ✎ The world states are described by a set of attributes ❆ ✐ , whose domains ❢ ❛ ( ✐ ) 1 ❀ ✿ ✿ ✿ ❀ ❛ ( ✐ ) ❦ i ❣ can be seen as sets of propositions or events. ✎ The events in a domain are mutually exclusive and exhaustive. ✎ The reasoning space is assumed to contain the true, but unknown state ✦ 0 . Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 5

★ ✦ ★ ★ ✦ ★ ★ ✥ ✧ ✩ ✩ ★ ✦ ★ The Relation in the Reasoning Space Relation Relation in the Reasoning Space color shape size small medium small medium medium large large medium medium small medium medium large Each cube represents one tuple. Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 6

★ ★ ❆ ✧❆ ❅ ❅ ✩ ✩ ✩ ✩ ✩ ✩ ✥ ✦ ✦ Reasoning ✎ Let it be known (e.g. from an observation) that the given object is green. This information considerably reduces the space of possible value combinations. ✎ From the prior knowledge it follows that the given object must be ✍ either a triangle or a square and ✍ either medium or large. large large medium medium small small Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 7

✩ ❆ ❅ ❅ ❇ ❇ ✩ ✩ ✩ ✩ ✩ ✩ ✩ ✪❆ ❆ ✩ ❆ ❆ ❆ ❆ ❆ ❇ ❇ ✪❇ ✩ ✩ ✩ ✩ ✩ ❅ ❅ ❅ ❅ ✩ ✩ ✧ ✥ ★ ★ ✦ ★ ★ ★ ★ ✦ ✦ ★ ✩ ✩ ✩ ✩ ✩ ✩ ✩ ✪ Prior Knowledge and Its Projections large large medium medium small small large large medium medium small small Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 8

✦ ✩ ✦ ★ ★ ✧ ✩ ✩ ✂ ✄ ✩ ✧ ★ ✥ ★ ★ ✦ ★ ★ ★ ★ ✦ ✦ ✦ ★ ✥ ✩ ★ ★ ★ ✪ ✪ ✪ ✪ ✪ ✩ ✩ ✥ ✥ ✥ ✧ ✧ ✧ ✥ Cylindrical Extensions and Their Intersection Intersecting the cylindrical extensions of the projection to the subspace formed by color and shape and of the projection to the subspace formed by large shape and size yields the origi- medium nal three-dimensional relation. small large large medium medium small small Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 9

❈ ❈❅ ❅ ❅ ❅ ❅ ❅ ❅ ❈ ❈ ❈ ❈ ❈ ✁ ❈ ❅ ❈ ❈ ❈ � ❅ ❅ � ❅ ✄ ❈ ❈ ❈ Reasoning with Projections The same result can be obtained using only the projections to the subspaces without reconstructing the original three-dimensional space: s m l size color project extend shape project extend s m l ✛ ✘ ✛ ✘ ✛ ✘ This justifies a network representation: color shape size ✚ ✙ ✚ ✙ ✚ ✙ Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 10

✩ ✦ ✦ ★ ✦ ✦ ★ ✥ ✩ ✥ ✥ ✥ ✥ ✥ ✥ ✦ ✦ ★ ✦ ✦ ✦ ✦ ✦ ✦ ★ ✥ ✧ ✧ ✩ ✩ ✥ ✥ ✦ ★ ✧ ✦ ✂ ✄ ✩ ✩ ✥ ✧ ✥ ✥ ✥ ✧ ✧ ✦ ✦ ★ ✦ ★ ✦ ★ ★ ★ ★ ✦ ✩ ✩ ✧ ✥ ★ ★ ✦ ★ ✥ Using other Projections large large medium medium small small large large medium medium small small Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 11

❅ ✩ ✩ ✪ ❅ ❅ ❅ ❅ ✩ ❅ ✩ ✩ ✩ ✩ ✩ ✩ ✩ ✩ ✩ ✩ ✪❆ ❆ ❆ ❆ ❆ ❆ ❆ ✩ ✩ ✩ ★ ❇ ❇ ✩ ✩ ✧ ✥ ★ ★ ✦ ★ ★ ★ ✩ ✦ ✦ ★ ✦ ✦ ❇ ❇ ✪❇ ✩ ✩ ✩ ✩ ❆ Is Decomposition Always Possible? 2 large large medium medium 1 small small large large medium medium small small Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 12

✫ ✩ ✩ ✧ ✩ ✩ ✧ ✫ A Probability Distribution all numbers in 220 330 170 280 parts per 1000 20 90 10 80 400 2 1 20 17 240 28 24 5 3 360 large 18 81 9 72 s m l 300 8 4 80 68 20 180 200 56 48 10 6 40 160 40 medium 2 9 1 8 180 120 60 460 2 1 20 17 84 72 15 9 small 240 40 180 20 160 50 115 35 100 large 12 6 120 102 82 133 99 146 medium 168 144 30 18 88 82 36 34 small ✎ The numbers state the probability of the corresponding value combination. Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 13

✩ ✩ ✧ ✩ ✩ ✧ ✫ ✫ Reasoning: Computing Conditional Probabilities all numbers in 0 0 0 1000 parts per 1000 0 0 0 286 572 0 0 0 61 364 0 0 0 11 64 large 0 0 0 257 s m l 358 0 0 0 242 29 257 286 0 0 0 21 61 242 61 medium 0 0 0 29 32 21 11 520 0 0 0 61 0 0 0 32 small 122 0 0 0 572 0 0 0 358 large 0 0 0 364 0 0 0 531 medium 0 0 0 64 0 0 0 111 small ✎ Using the information that the given object is green. Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 14

Probabilistic Decomposition ✎ As for relational networks, the three-dimensional probability distribution can be decomposed into projections to subspaces, namely the marginal distribution on the subspace formed by color and shape and the marginal distribution on the subspace formed by shape and size. ✎ The original probability distribution can be reconstructed from the marginal distributions using the following formulae ✽ ✐❀ ❥❀ ❦ : � � � � � � ❛ (color) ❀ ❛ (shape) ❀ ❛ (size) ❛ (color) ❀ ❛ (shape) ❛ (size) � ❛ (shape) � P = P ) ✁ P ✐ ❥ ✐ ❥ ❥ ❦ ❦ � � ❛ (shape) ❀ ❛ (size) P � � ❛ (color) ❀ ❛ (shape) ❥ ❦ = P ✁ � � ✐ ❥ ❛ (shape) P ❥ ✎ These equations express the conditional independence of attributes color and size given the attribute shape , since they only hold if ✽ ✐❀ ❥❀ ❦ : � � � � � � ❛ (size) � ❛ (shape) ❛ (size) � ❛ (color) ❀ ❛ (shape) � � P = P ❥ ✐ ❥ ❦ ❦ Christian Borgelt A Brief Introduction to Graphical Models and How to Learn Them from Data 15

A Brief Introduction to Graphical Models and How to Learn Them from - PowerPoint PPT Presentation

A Brief Introduction to Graphical Models and How to Learn Them from Data Christian Borgelt Dept. of Knowledge Processing and Language Engineering Otto-von-Guericke-University of Magdeburg Universit atsplatz 2, D-39106 Magdeburg, Germany

Graphical Models Graphical Models Bayesian Networks Siamak Ravanbakhsh Fall 2019 Previously on

Transforming Graphical System Models to Graphical Attack Models ! Joint work with Marieta

You will learn what git is . You will learn how you can use git . You will learn how to learn more

Probabilistic Graphical Models Probabilistic Graphical Models Variable elimination Siamak

Probabilistic Graphical Models CMSC 678 UMBC Probabilistic Graphical Models A graph G that

Probabilistic Graphical Models Probabilistic Graphical Models introduction to learning Siamak

Undirected Graphical Models Aaron Courville, Universit de Montral 2 (UNDIRECTED) GRAPHICAL

Graphical models Review Graphical models (Bayes nets, Markov random fields, factor graphs) !

Probabilistic Graphical Models CMSC 691 UMBC Two Problems for Graphical Models 1 ,

Learn Blackboard Learn Learn with others Learn in your own time, pace, space Learn through

Probabilistic Graphical Models Probabilistic Graphical Models Gaussian Network Models Fall 2019

Graphical Models Graphical Models Relationship between the directed & undirected models

Probabilistic Graphical Models Probabilistic Graphical Models Undirected Models Fall 2019

Probabilistic Graphical Models Probabilistic Graphical Models parameter learning in undirected

Graphical Screen Design Grids are an essential tool for graphical design Important graphical

Graphical Screen Design Grids are an essential tool for graphical design Important graphical

CSE 421 Algorithms Summer 2007 Huffman Codes: An Optimal Data Compression Method 1 a 45% b

CSE 417 c 12% Compression Example d 16% e 9% Algorithms f 5% 100k file, 6 letter

How the Concept of Shannons Derivation: . . . Shannons Derivation . . . Case of a

A Conditional Information Inequality and Its Combinatorial Applications Nikolay Vereshchagin 1

Intrabody Communication: Applications and Practical Issues Kurt Partridge University of

Elements of a Nonstochastic Information Theory Girish Nair Dept. Electrical & Electronic

Wireless Multimedia System (Topic 5) Wireless Link I: Multiple

Prof Anita Heis iss Wiradjuri Nation Welcoming Cities, Brisbane, 2019 Gadigal /Eora Cowra

A Brief Introduction to Graphical Models and How to Learn Them from - PowerPoint PPT Presentation

A Brief Introduction to Graphical Models and How to Learn Them from Data Christian Borgelt Dept. of Knowledge Processing and Language Engineering Otto-von-Guericke-University of Magdeburg Universit atsplatz 2, D-39106 Magdeburg, Germany

Graphical Models Graphical Models Bayesian Networks Siamak Ravanbakhsh Fall 2019 Previously on

Transforming Graphical System Models to Graphical Attack Models ! Joint work with Marieta

You will learn what git is . You will learn how you can use git . You will learn how to learn more

Probabilistic Graphical Models Probabilistic Graphical Models Variable elimination Siamak

Probabilistic Graphical Models CMSC 678 UMBC Probabilistic Graphical Models A graph G that

Probabilistic Graphical Models Probabilistic Graphical Models introduction to learning Siamak

Undirected Graphical Models Aaron Courville, Universit de Montral 2 (UNDIRECTED) GRAPHICAL

Graphical models Review Graphical models (Bayes nets, Markov random fields, factor graphs) !

Probabilistic Graphical Models CMSC 691 UMBC Two Problems for Graphical Models 1 ,

Learn Blackboard Learn Learn with others Learn in your own time, pace, space Learn through

Probabilistic Graphical Models Probabilistic Graphical Models Gaussian Network Models Fall 2019

Graphical Models Graphical Models Relationship between the directed &amp; undirected models

Probabilistic Graphical Models Probabilistic Graphical Models Undirected Models Fall 2019

Probabilistic Graphical Models Probabilistic Graphical Models parameter learning in undirected

Graphical Screen Design Grids are an essential tool for graphical design Important graphical

Graphical Screen Design Grids are an essential tool for graphical design Important graphical

CSE 421 Algorithms Summer 2007 Huffman Codes: An Optimal Data Compression Method 1 a 45% b

CSE 417 c 12% Compression Example d 16% e 9% Algorithms f 5% 100k file, 6 letter

How the Concept of Shannons Derivation: . . . Shannons Derivation . . . Case of a

A Conditional Information Inequality and Its Combinatorial Applications Nikolay Vereshchagin 1

Intrabody Communication: Applications and Practical Issues Kurt Partridge University of

Elements of a Nonstochastic Information Theory Girish Nair Dept. Electrical &amp; Electronic

Wireless Multimedia System (Topic 5) Wireless Link I: Multiple

Prof Anita Heis iss Wiradjuri Nation Welcoming Cities, Brisbane, 2019 Gadigal /Eora Cowra

Graphical Models Graphical Models Relationship between the directed & undirected models

Elements of a Nonstochastic Information Theory Girish Nair Dept. Electrical & Electronic