
slide-1
SLIDE 1

Graphical models

Sunita Sarawagi IIT Bombay http://www.cse.iitb.ac.in/~sunita

1

slide-3
SLIDE 3

Probabilistic modeling

Given: several variables x1, . . . , xn, where n is large.
Task: build a joint distribution function Pr(x1, . . . , xn).
Goal: answer several kinds of projection queries on the distribution.

Basic premise

◮ The explicit joint distribution is dauntingly large
◮ Queries are simple marginals (sum or max) over the joint distribution

2

slide-4
SLIDE 4

Examples of Joint Distributions So far

Naive Bayes: P(x1, . . . , xd | y), with d large; assumes conditional independence.
Multivariate Gaussian.
Recurrent neural networks for sequence labeling and prediction.

3
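
To make the conditional-independence assumption concrete, here is a minimal sketch (not from the slides; all probability values are made up) of the Naive Bayes factorization P(x1, . . . , xd | y) = ∏i P(xi | y) over two binary features:

```python
# A minimal sketch (not from the slides): the Naive Bayes factorization
# P(x1, ..., xd | y) = prod_i P(xi | y); all probability values are made up.
p_y = {0: 0.5, 1: 0.5}                     # class prior P(y)
p_x_given_y = [                            # one table per feature i: [i][y][x_i]
    {0: {0: 0.8, 1: 0.2}, 1: {0: 0.3, 1: 0.7}},
    {0: {0: 0.6, 1: 0.4}, 1: {0: 0.1, 1: 0.9}},
]

def joint(y, x):
    """P(y, x1, ..., xd) under the conditional-independence assumption."""
    p = p_y[y]
    for i, xi in enumerate(x):
        p *= p_x_given_y[i][y][xi]
    return p

print(joint(1, [1, 0]))  # 0.5 * 0.7 * 0.1 = 0.035
```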

slide-5
SLIDE 5

Example

Variables are attributes of people: Age (10 ranges), Income (7 scales), Experience (7 scales), Degree (3 scales), Location (30 places).
An explicit joint distribution over all columns is not tractable: number of combinations = 10 × 7 × 7 × 3 × 30 = 44100.
Queries: estimate the fraction of people with

◮ Income > 200K and Degree = "Bachelors"
◮ Income < 200K, Degree = "PhD" and Experience > 10 years
◮ Many, many more

4
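
A small illustration of both points, with a made-up joint distribution and arbitrary index choices for "top income scale" and "Bachelors" (these encodings are assumptions, not from the slides): the explicit joint over the five attributes has 44100 entries, and a projection query is just a sum over part of that table.

```python
# A minimal sketch (not from the slides): size of the explicit joint table over
# the five attributes, and a projection query answered by brute-force summation.
# The joint itself and the index positions used in the query are made up.
import numpy as np

sizes = {"Age": 10, "Income": 7, "Experience": 7, "Degree": 3, "Location": 30}
print(int(np.prod(list(sizes.values()))))   # 44100 entries in the explicit joint

rng = np.random.default_rng(0)
joint = rng.random(tuple(sizes.values()))
joint /= joint.sum()                        # a made-up joint distribution

# Fraction with Income in its top scale (index -1) and Degree "Bachelors" (index 0):
print(joint[:, -1, :, 0, :].sum())          # sums out Age, Experience, Location
```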

slide-8
SLIDE 8

Alternatives to an explicit joint distribution

Assume all columns are independent of each other: a bad assumption.
Use data to detect highly correlated column pairs and estimate their pairwise frequencies

◮ Many highly correlated pairs, e.g., (income, age), (income, experience), (age, experience)
◮ Ad hoc methods of combining these into a single estimate

Go beyond pairwise correlations: conditional independencies

◮ income and age are not independent, but income ⊥⊥ age | experience
◮ experience and degree are not independent, but experience ⊥⊥ degree | income

Graphical models make explicit an efficient joint distribution built from these independencies

5

slide-9
SLIDE 9

More examples of CIs

The grades of a student in various courses are correlated, but they become CI given attributes of the student (hard-working, intelligent, etc.).
Health symptoms of a person may be correlated but are CI given the latent disease.
Words in a document are correlated, but may become CI given the topic.
The color of a pixel in an image becomes CI of distant pixels given nearby pixels.

6

slide-11
SLIDE 11

Graphical models

Model the joint distribution over several variables as a product of smaller factors that is

1. Intuitive to represent and visualize
   ◮ Graph: represents the structure of dependencies
   ◮ Potentials over subsets: quantify the dependencies
2. Efficient to query
   ◮ given values of any variable subset, reason about the probability distribution of the others
   ◮ many efficient exact and approximate inference algorithms

Graphical models = graph theory + probability theory.

7

slide-12
SLIDE 12

Graphical models in use

Roots in statistical physics for modeling interacting atoms in gases and solids [~1900]
Early usage in genetics for modeling properties of species [~1920]
AI: expert systems (1970s–80s)
Now many new applications:

◮ Error-correcting codes: turbo codes, an impressive success story (1990s)
◮ Robotics and vision: image denoising, robot navigation
◮ Text mining: information extraction, duplicate elimination, hypertext classification, help systems
◮ Bio-informatics: secondary structure prediction, gene discovery
◮ Data mining: probabilistic classification and clustering

8

slide-13
SLIDE 13

Part I: Outline

1. Representation
   Directed graphical models: Bayesian networks
   Undirected graphical models
2. Inference
   Queries
   Exact inference on chains
   Variable elimination on general graphs
   Junction trees
3. Approximate inference
   Generalized belief propagation
   Sampling: Gibbs, particle filters
4. Constructing a graphical model
   Graph structure
   Parameters in potentials
5. General framework for parameter learning in graphical models
6. References

9

slide-14
SLIDE 14

Representation

Structure of a graphical model: Graph + Potential

Graph

Nodes: variables x = x1, . . . xn

◮ Continuous: sensor temperatures, income
◮ Discrete: Degree (one of Bachelors, Masters, PhD), levels of age, labels of words

Edges: direct interaction

◮ Directed edges: Bayesian networks
◮ Undirected edges: Markov random fields

[Figure: an example directed graph and an example undirected graph]

10

slide-16
SLIDE 16

Representation

Potentials: ψc(xc)

Scores for assignment of values to subsets c of directly interacting variables. Which subsets? What do the potentials mean?

◮ Different for directed and undirected graphs

Probability

Factorizes as a product of potentials: Pr(x = x1, . . . , xn) ∝ ∏S ψS(xS)

11
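
A minimal sketch of this factorization for a toy undirected model (the variables, subsets, and potential values are invented for illustration): the joint is the normalized product of potentials, and a marginal query is a sum over the joint.

```python
# A minimal sketch (not from the slides): a tiny model over three binary
# variables x1, x2, x3 with pairwise potentials psi_12 and psi_23.
# Pr(x) is proportional to the product of potentials; the constant Z is found
# here by brute-force summation, which is only feasible for tiny models.
from itertools import product

psi_12 = {(0, 0): 4.0, (0, 1): 1.0, (1, 0): 1.0, (1, 1): 4.0}   # favors x1 == x2
psi_23 = {(0, 0): 3.0, (0, 1): 1.0, (1, 0): 1.0, (1, 1): 3.0}   # favors x2 == x3

def score(x1, x2, x3):
    """Unnormalized score: product of potentials over the interacting subsets."""
    return psi_12[(x1, x2)] * psi_23[(x2, x3)]

Z = sum(score(*x) for x in product([0, 1], repeat=3))           # partition function
pr = {x: score(*x) / Z for x in product([0, 1], repeat=3)}      # joint distribution

# Marginal query Pr(x1 = 1) by summing out x2 and x3
print(sum(p for x, p in pr.items() if x[0] == 1))
```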

slide-20
SLIDE 20

Directed graphical models: Bayesian networks

Graph G: directed acyclic

◮ Parents of a node: Pa(xi) = set of nodes in G pointing to xi

Potentials: defined at each node in terms of its parents: ψi(xi, Pa(xi)) = Pr(xi | Pa(xi))

Probability distribution: Pr(x1, . . . , xn) = ∏i=1..n Pr(xi | Pa(xi))

12

slide-27
SLIDE 27

Example of a directed graph

ψ1(L) = Pr(L)

    L:      NY    CA    London  Other
    Pr(L):  0.2   0.3   0.1     0.4

ψ2(A) = Pr(A)

    A:      20–30  30–45  > 45
    Pr(A):  0.3    0.4    0.3

or, a Gaussian distribution with (µ, σ) = (35, 10)

ψ3(E, A) = Pr(E | A)

    A \ E:   0–10  10–15  > 15
    20–30    0.9   0.1    0.0
    30–45    0.4   0.5    0.1
    > 45     0.1   0.1    0.8

ψ4(I, E, D) = Pr(I | D, E)

    A 3-dimensional table, or a histogram approximation.

Probability distribution

Pr(x = (L, D, A, E, I)) = Pr(L) Pr(D) Pr(A) Pr(E | A) Pr(I | D, E)

13
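
A sketch of how this example network could be encoded and evaluated: the tables for Pr(L), Pr(A), and Pr(E | A) follow the slide, while Pr(D) and Pr(I | D, E) are placeholders I made up, since the slide only describes them.

```python
# A minimal sketch (my own encoding, not code from the slides) of the example
# Bayesian network: Location L, Degree D, Age A, Experience E (parent A),
# Income I (parents D and E).  Values for Pr(D) and Pr(I | D, E) are assumed.
pr_L = {"NY": 0.2, "CA": 0.3, "London": 0.1, "Other": 0.4}
pr_A = {"20-30": 0.3, "30-45": 0.4, ">45": 0.3}
pr_D = {"Bachelors": 0.5, "Masters": 0.3, "PhD": 0.2}           # assumed values
pr_E_given_A = {                                                # Pr(E | A)
    "20-30": {"0-10": 0.9, "10-15": 0.1, ">15": 0.0},
    "30-45": {"0-10": 0.4, "10-15": 0.5, ">15": 0.1},
    ">45":   {"0-10": 0.1, "10-15": 0.1, ">15": 0.8},
}
def pr_I_given_DE(i, d, e):
    """Stand-in for the 3-dimensional table Pr(I | D, E); returns a dummy value."""
    return 0.25                                                 # placeholder

def joint(l, d, a, e, i):
    # Pr(L, D, A, E, I) = Pr(L) Pr(D) Pr(A) Pr(E | A) Pr(I | D, E)
    return pr_L[l] * pr_D[d] * pr_A[a] * pr_E_given_A[a][e] * pr_I_given_DE(i, d, e)

print(joint("NY", "PhD", "30-45", "10-15", ">200K"))  # 0.2*0.2*0.4*0.5*0.25 = 0.002
```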

slide-30
SLIDE 30

Conditional Independencies

Given three sets of variables X, Y, Z, set X is conditionally independent of Y given Z (X ⊥⊥ Y | Z) iff Pr(X | Y, Z) = Pr(X | Z).

Local conditional independencies in a BN: for each xi, xi ⊥⊥ ND(xi) | Pa(xi), where ND(xi) denotes the non-descendants of xi.

In the example: L ⊥⊥ {E, D, A, I};  A ⊥⊥ {L, D};  E ⊥⊥ {L, D} | A;  I ⊥⊥ A | {E, D}

14
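
A small numerical illustration (not from the slides) of the definition: for a toy distribution constructed so that X and Y interact only through Z, conditioning on Y does not change the conditional distribution of X once Z is given.

```python
# A minimal sketch (not from the slides): numerically checking X ⊥⊥ Y | Z on a
# toy joint built so that X and Y interact only through Z:
# Pr(x, y, z) = Pr(z) Pr(x | z) Pr(y | z).  All numbers are made up.
from itertools import product

pr_z = {0: 0.6, 1: 0.4}
pr_x_given_z = {0: {0: 0.9, 1: 0.1}, 1: {0: 0.2, 1: 0.8}}
pr_y_given_z = {0: {0: 0.7, 1: 0.3}, 1: {0: 0.4, 1: 0.6}}
joint = {(x, y, z): pr_z[z] * pr_x_given_z[z][x] * pr_y_given_z[z][y]
         for x, y, z in product([0, 1], repeat=3)}

def pr_x(x, cond):
    """Pr(X = x | cond), where cond maps axis index (1 for Y, 2 for Z) to a value."""
    rows = [(k, p) for k, p in joint.items() if all(k[i] == v for i, v in cond.items())]
    return sum(p for k, p in rows if k[0] == x) / sum(p for _, p in rows)

# Pr(X=1 | Y=0, Z=1) equals Pr(X=1 | Z=1), as the definition of X ⊥⊥ Y | Z requires
print(pr_x(1, {1: 0, 2: 1}), pr_x(1, {2: 1}))  # both 0.8
```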
slide-31
SLIDE 31

CIs and Factorization

Theorem

Given a distribution P(x1, . . . , xn) and a DAG G, if P satisfies the local CIs induced by G, then P factorizes as per the graph: Local-CI(P, G) ⇒ Factorize(P, G).

Proof.

Let x1, x2, . . . , xn be topologically ordered (parents before children) in G.
Local CI(P, G): P(xi | x1, . . . , xi−1) = P(xi | PaG(xi))
Chain rule: P(x1, . . . , xn) = ∏i P(xi | x1, . . . , xi−1) = ∏i P(xi | PaG(xi))
⇒ Factorize(P, G)

15
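
The chain-rule step of the proof, typeset for clarity (the same argument as above, in my own notation):

```latex
P(x_1,\dots,x_n) \;=\; \prod_{i=1}^{n} P(x_i \mid x_1,\dots,x_{i-1})
                 \;=\; \prod_{i=1}^{n} P\big(x_i \mid \mathrm{Pa}_G(x_i)\big)
\;\;\Longrightarrow\;\; \text{Factorize}(P, G)
```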

slide-32
SLIDE 32

CIs and Factorization

Theorem

Given a distribution P(x1, . . . , xn) and a DAG G, if P can be factorized as per G, then P satisfies the local CIs induced by G: Factorize(P, G) ⇒ Local-CI(P, G). Proof skipped (refer to the book).

16

slide-33
SLIDE 33

Drawing a BN starting from a distribution

Given a distribution P(x1, . . . , xn) of which we can ask any CI query of the form "Is X ⊥⊥ Y | Z?" and get a yes/no answer. Goal: draw a minimal, correct BN G that represents P.

17
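
The construction algorithm itself is shown graphically on the original slides; the following is only a sketch of the standard procedure (fix an ordering, then give each variable a minimal parent set among its predecessors), assuming an `is_ci(X, Y, Z)` oracle that answers the CI queries described above.

```python
# A sketch (assumed, not transcribed from the slides) of the standard BN
# construction procedure: fix a variable ordering, and give each variable a
# smallest subset of its predecessors as parents, chosen so that the variable
# is CI of its remaining predecessors given that subset.
from itertools import combinations

def build_bn(variables, is_ci):
    """variables: list of variable names in a chosen order.
    is_ci(X, Y, Z): oracle answering "Is X ⊥⊥ Y | Z in P?" for sets of variables
    (assumed to return True when Y is empty).
    Returns a dict mapping each variable to its chosen parent set."""
    parents = {}
    for i, x in enumerate(variables):
        preds = set(variables[:i])
        parents[x] = set(preds)          # fall back: all predecessors as parents
        # choose a smallest subset U of the predecessors with x ⊥⊥ preds \ U | U
        for size in range(len(preds) + 1):
            u = next((set(c) for c in combinations(sorted(preds), size)
                      if is_ci({x}, preds - set(c), set(c))), None)
            if u is not None:
                parents[x] = u
                break
    return parents
```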

slide-34
SLIDE 34

Why minimal

Theorem

G constructed by the above algorithm is minimal, that is, we cannot remove any edge from the BN while maintaining the correctness of the BN for P

Proof.

By construction: when the parent set of each xi was chosen, only (a subset of) its non-descendants were available as candidates, and the parent set was chosen minimally.

18

slide-35
SLIDE 35

Why Correct

Theorem

G constructed by the above algorithm is correct, that is, the local-CIs induced by G hold in P

Proof.

The construction process makes sure that the factorization property holds. Since factorization implies the local CIs, the constructed BN satisfies the local CIs of P.

19

slide-36
SLIDE 36

Order is important

20

slide-37
SLIDE 37

Examples of CIs that hold in BN but not covered by local-CI

21

slide-39
SLIDE 39

Global CIs in a BN

Three sets of variables X, Y, Z: if Z d-separates X from Y in the BN, then X ⊥⊥ Y | Z.
In a directed graph H, Z d-separates X from Y if every path P from a node in X to a node in Y is blocked by Z. A path P is blocked by Z when it contains

1. a chain x1 → x2 → . . . → xk with some xi ∈ Z, or
2. a chain x1 ← x2 ← . . . ← xk with some xi ∈ Z, or
3. a fork x1 . . . ← xi → . . . xk with xi ∈ Z, or
4. a collider x1 . . . → xi ← . . . xk with xi ∉ Z and Desc(xi) ∩ Z = ∅

Theorem

The d-separation test identifies the complete set of conditional independencies that hold in all distributions that conform to a given Bayesian network.

22
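
A sketch (not from the slides) of a d-separation test using the equivalent moralized-ancestral-graph criterion rather than the path-by-path rules above; the example graph mirrors the Age → Experience → Income, Degree → Income fragment of the running example.

```python
# A minimal sketch (not from the slides) of a d-separation test via the
# standard moralized-ancestral-graph criterion: X ⊥⊥ Y | Z holds in the DAG
# iff X and Y are disconnected after (1) restricting to ancestors of X∪Y∪Z,
# (2) moralizing (marrying co-parents, dropping directions), (3) deleting Z.
from collections import deque
from itertools import combinations

def d_separated(parents, X, Y, Z):
    """parents: dict node -> set of parent nodes (defines the DAG)."""
    X, Y, Z = set(X), set(Y), set(Z)
    # 1. ancestral set of X ∪ Y ∪ Z
    anc, stack = set(), list(X | Y | Z)
    while stack:
        v = stack.pop()
        if v not in anc:
            anc.add(v)
            stack.extend(parents.get(v, ()))
    # 2. moralize: undirected parent-child edges, plus edges between co-parents
    nbrs = {v: set() for v in anc}
    for v in anc:
        ps = parents.get(v, set()) & anc
        for p in ps:
            nbrs[v].add(p); nbrs[p].add(v)
        for a, b in combinations(ps, 2):
            nbrs[a].add(b); nbrs[b].add(a)
    # 3. delete Z and check reachability from X to Y
    seen, queue = set(X - Z), deque(X - Z)
    while queue:
        v = queue.popleft()
        if v in Y:
            return False            # connected => not d-separated
        for w in nbrs[v] - Z:
            if w not in seen:
                seen.add(w); queue.append(w)
    return True

# Example: A → E → I and D → I ("explaining away"): A ⊥⊥ D, but not A ⊥⊥ D | I
pa = {"A": set(), "D": set(), "E": {"A"}, "I": {"E", "D"}}
print(d_separated(pa, {"A"}, {"D"}, set()))   # True
print(d_separated(pa, {"A"}, {"D"}, {"I"}))   # False
```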

slide-40
SLIDE 40

Global CIs Examples

23

slide-41
SLIDE 41

Global CIs and Local-CIs

In a BN, the set of local CIs combined with the axioms of probability can be used to derive the global CIs. The proof is long but easy to follow; a hard copy of the proof is available on request.

24

slide-42
SLIDE 42

Popular Bayesian networks

Hidden Markov Models: speech recognition, information extraction

◮ State variables: discrete (phoneme, entity tag)
◮ Observation variables: continuous (speech waveform) or discrete (word)

Kalman Filters: state variables are continuous
◮ Discussed later

Topic models for text data
1. Principled mechanism to categorize multi-labeled text documents while incorporating priors in a flexible generative framework
2. Application: news tracking

QMR (Quick Medical Reference) system
PRMs: Probabilistic Relational Models

25
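
As an illustration of the first item, a minimal sketch (with made-up numbers) of an HMM written as a Bayesian network, where the joint factorizes into start, transition, and emission CPTs:

```python
# A minimal sketch (not from the slides): an HMM viewed as a Bayesian network.
# The joint over states y_1..y_T and observations x_1..x_T factorizes as
# Pr(y, x) = Pr(y_1) Pr(x_1|y_1) * prod_t Pr(y_t|y_{t-1}) Pr(x_t|y_t).
# All numbers below are made up for illustration.
start = {"H": 0.6, "L": 0.4}                      # Pr(y_1)
trans = {"H": {"H": 0.7, "L": 0.3},               # Pr(y_t | y_{t-1})
         "L": {"H": 0.4, "L": 0.6}}
emit  = {"H": {"a": 0.5, "b": 0.5},               # Pr(x_t | y_t)
         "L": {"a": 0.1, "b": 0.9}}

def hmm_joint(states, obs):
    p = start[states[0]] * emit[states[0]][obs[0]]
    for t in range(1, len(states)):
        p *= trans[states[t - 1]][states[t]] * emit[states[t]][obs[t]]
    return p

print(hmm_joint(["H", "L"], ["a", "b"]))  # 0.6*0.5*0.3*0.9 = 0.081
```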