Causal Discovery of Dynamic Bayesian Networks
Cora Perez-Ariza1, Ann Nicholson2, Kevin Korb2, Steven Mascaro2 and Chao Heng Hu2
1
- Dept. of Computer Science and Artificial Intelligence
University of Granada
2
Clayton School of IT Monash University
Dynamic Bayesian Networks Cora Perez-Ariza 1 , Ann Nicholson 2 , - - PowerPoint PPT Presentation
Causal Discovery of Dynamic Bayesian Networks Cora Perez-Ariza 1 , Ann Nicholson 2 , Kevin Korb 2 , Steven Mascaro 2 and Chao Heng Hu 2 1 Dept. of Computer Science and Artificial Intelligence University of Granada 2 Clayton School of IT Monash
1
University of Granada
2
Clayton School of IT Monash University
Performs independence tests e.g. PC algorithm (Spirtes et al., 1993) Tests all pairs for direct dependencies
Performs independence tests e.g. PC algorithm (Spirtes et al., 1993) Tests all pairs for direct dependencies Finding a graph pattern Finding head to head arcs and orient the rest
(Stochastic) search and score Learning programs/packages: e.g. CaMML (Causal discovery via
MML), BNT (Bayes Net Toolbox).
(Stochastic) search and score Learning programs/packages: e.g. CaMML (Causal discovery via
MML), BNT (Bayes Net Toolbox).
Score and rank M → M' : Add/remove/reverse arcs add remove reverse
Uses BIC/BDe scoring
Hill-climbing Learn the prior/initial network and the transition network
Prior network Transition network
The corresponding DBN
Written by Kevin Murphy (2001) Supports DBN learning and inference
Uses BIC/ML scoring Guarantees that Only learns arcs between slices (temporal arcs)
Motivation, SES, Education ≺ Motivation1, SES1, Education1
Learn transitional arcs t0 Data BN for t0 Learn from Data Learned DBN Copy Network t0 t1
Mutual information Use mutual information to score strength of arcs
Too many variables to show!
Known Models Tier or BN Priors Learned Models Generate Data Learn DBNs Test with ED/CKL
Count 1 if an arc is missing/added/reversed in the learned model Our modification for DBNs: EDDBN = Ws.Ns + Wt.Nt
Computes the distance of probability distribution between model P and model Q
Datasize CaMML w/ Tiers GeNIe (PC) BNT 500 6.8 (0.98) 13 7.5 (0.50) 5000 5.4 (1.62) 10 6.2 (0.74) 50000 1.0 (0.0) 10 3.7 (0.33)
5 10 15 20 25 30 35 total errors (tiers) total errors (Alg)
500 5000 50000 Data size Errors
5 10 15 20 25 static errors (tiers) static errors (Alg)
Errors
500 5000 50000 Data size
2 4 6 8 10 12 14 temporal errors (tiers) temporal errors (Alg)
500 5000 50000 Data size Errors