Graphical models
Sunita Sarawagi IIT Bombay http://www.cse.iitb.ac.in/~sunita
1
Graphical models Sunita Sarawagi IIT Bombay - - PowerPoint PPT Presentation
Graphical models Sunita Sarawagi IIT Bombay http://www.cse.iitb.ac.in/~sunita 1 Probabilistic modeling Given: several variables: x 1 , . . . x n , n is large. Task: build a joint distribution function Pr( x 1 , . . . x n ) Goal: Answer several
1
2
◮ Explicit joint distribution is dauntingly large ◮ Queries are simple marginals (sum or max) over the joint
2
3
◮ Income > 200K and Degree=”Bachelors”, ◮ Income < 200K, Degree=”PhD” and experience > 10 years. ◮ Many, many more. 4
5
◮ Many highly correlated pairs
◮ Ad hoc methods of combining these into a single estimate 5
◮ Many highly correlated pairs
◮ Ad hoc methods of combining these into a single estimate
◮ income ⊥
◮ experience ⊥
5
6
1
◮ Graph: represent structure of dependencies ◮ Potentials over subsets: quantify the dependencies 2
◮ given values of any variable subset, reason about probability
◮ many efficient exact and approximate inference algorithms 7
1
◮ Graph: represent structure of dependencies ◮ Potentials over subsets: quantify the dependencies 2
◮ given values of any variable subset, reason about probability
◮ many efficient exact and approximate inference algorithms
7
◮ Error Correcting Codes: Turbo codes, impressive success story
◮ Robotics and Vision: image denoising, robot navigation. ◮ Text mining: information extraction, duplicate elimination,
◮ Bio-informatics: Secondary structure prediction, Gene discovery ◮ Data mining: probabilistic classification and clustering. 8
1
2
3
4
5
6
9
◮ Continuous: Sensor temperatures, income ◮ Discrete: Degree (one of Bachelors,
◮ Directed edges: Bayesian networks ◮ Undirected edges: Markov Random fields
◮ Different for directed and undirected graphs 11
◮ Different for directed and undirected graphs
11
◮ Parents of a node: Pa(xi) = set of nodes in G pointing to xi 12
◮ Parents of a node: Pa(xi) = set of nodes in G pointing to xi 12
◮ Parents of a node: Pa(xi) = set of nodes in G pointing to xi
12
◮ Parents of a node: Pa(xi) = set of nodes in G pointing to xi
n
12
NY CA London Other 0.2 0.3 0.1 0.4
13
NY CA London Other 0.2 0.3 0.1 0.4
20–30 30–45 > 45 0.3 0.4 0.3
13
NY CA London Other 0.2 0.3 0.1 0.4
20–30 30–45 > 45 0.3 0.4 0.3
13
NY CA London Other 0.2 0.3 0.1 0.4
20–30 30–45 > 45 0.3 0.4 0.3
0–10 10–15 > 15 20–30 0.9 0.1 30–45 0.4 0.5 0.1 > 45 0.1 0.1 0.8
13
NY CA London Other 0.2 0.3 0.1 0.4
20–30 30–45 > 45 0.3 0.4 0.3
0–10 10–15 > 15 20–30 0.9 0.1 30–45 0.4 0.5 0.1 > 45 0.1 0.1 0.8
13
NY CA London Other 0.2 0.3 0.1 0.4
20–30 30–45 > 45 0.3 0.4 0.3
0–10 10–15 > 15 20–30 0.9 0.1 30–45 0.4 0.5 0.1 > 45 0.1 0.1 0.8
13
14
14
i P(xi|x1, . . . , xi−1) = i P(xi|PaG(xi))
15
16
17
18
19
20
21
1
2
3
4
22
1
2
3
4
22
23
24
◮ Observation variables: continuous (speech waveform), discrete
◮ Discussed later
1
2
25