Efficient MCMC for Continuous Time Discrete State Systems
Vinayak Rao and Yee Whye Teh
Gatsby Computational Neuroscience Unit, University College London
Efficient MCMC for Continuous Time Discrete State Systems Vinayak - - PowerPoint PPT Presentation
Efficient MCMC for Continuous Time Discrete State Systems Vinayak Rao and Yee Whye Teh Gatsby Computational Neuroscience Unit, University College London Overview Continuous time discrete space systems: applications in physics, chemistry,
Gatsby Computational Neuroscience Unit, University College London
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 2 / 40
Ω .
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 3 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 4 / 40
n
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 5 / 40
ΩA on these event times.
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 6 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 7 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 8 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 9 / 40
◮ independent of the observations, ◮ produced by ‘thinning’ a rate Ω Poisson process with probability
AS(t)S(t) Ω
◮ thus, distributed according to a inhomogeneous Poisson process
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 9 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 9 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 10 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 11 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 12 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 13 / 40
ii .
◮ This state must account for the likelihood of children nodes’
◮ The state must also explain relevant observations.
ΩAP), sample new trajectory using forward-filtering
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 13 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 14 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 15 / 40
10 20 30 40 50 10
−1
10 10
1
10
2
10
3
Number of nodes in CTBN chain CPU time in seconds Uniformization El Hay et al. El Hay et al. (Matrix exp.)
10 10
1
10
2
10
−2
10
−1
10 10
1
10
2
10
3
Dimensionality of nodes in CTBN chain CPU time in seconds Uniformization El Hay et al. El Hay et al. (Matrix exp.)
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 16 / 40
10
1
10
2
10
3
10 10
1
10
2
10
3
10
4
Length of CTBN time−interval CPU time in seconds Uniformization El Hay et al. El Hay et al. (Matrix exp.)
10 100 1000 10000 Number of samples 10−1 100 101 Uniformization El Hay et al. Average Relative Error
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 17 / 40
500 1000 1500 2000 2500 3000 5 10 15 20 25 30 True path Mean−field approx. MCMC approx. 500 1000 1500 2000 2500 3000 5 10 15 20 25 30 True path Mean−field approx. MCMC approx.
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 18 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 19 / 40
0 g(u)du
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 20 / 40
x uγ−1e−udu
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 21 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 22 / 40
t,τ h(τ)λ(t)
◮ Yi = Yi−1 → reject Ei, ◮ Yi = i → keep Ei.
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 23 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 24 / 40
20 40 60 80 100 0.05 0.1 0.15 0.2 Intensity Time
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 25 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 26 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 27 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 28 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 28 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 28 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 29 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 30 / 40
10 20 30 40 50 −0.5 0.5 1 1.5 2 2.5 Intensity Truth MRP Exp MRP Gam3 MRP Full Disc100 1 2 3 4 5 −2 2 4 6 8 10 12 20 40 60 80 100 −0.5 0.5 1 1.5 2 2.5 3
2 4 0.05 0.1 0.15 0.2 2 4 0.05 0.1 0.15 0.2 2 4 0.1 0.2 0.3 0.4
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 31 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 32 / 40
1850 1900 1950 −1 1 2 3 4 Intensity 1850 1900 1950 −1 1 2 3 4 1 1.5 2 0.05 0.1 0.15 0.2
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 33 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 34 / 40
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 35 / 40
◮ semi-Markov jump processes, ◮ inhomogeneous MJPs, MJPs with infinite state spaces etc. ◮ renewal processes with unbounded hazard rates,
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 36 / 40
Adams, R. P., Murray, I., and MacKay, D. J. C. (2009). Tractable nonparametric Bayesian inference in Poisson processes with gaussian process intensities. In Bottou, L. and Littman, M., editors, Proceedings of the 26th International Conference on Machine Learning (ICML), pages 9–16, Montreal. Omnipress. Cohn, I., El-Hay, T., Friedman, N., and Kupferman, R. (2010). Mean field variational approximation for continuous-time bayesian networks.
Cunningham, J. P., Yu, B. M., Shenoy, K. V., and Sahani, M. (2008). Inferring neural firing rates from spike trains using gaussian processes. In Advances in Neural Information Processing Systems,20. El-Hay, T., Friedman, N., and Kupferman, R. (2008). Gibbs sampling in factorized continuous-time Markov processes. In UAI, pages 169–178. Fan, Y. and Shelton, C. R. (2008). Sampling for approximate inference in continuous time Bayesian networks. In Tenth International Symposium on Artificial Intelligence and Mathematics. Fearnhead, P. and Sherlock, C. (2006). An exact Gibbs sampler for the Markov-modulated Poisson process. Journal Of The Royal Statistical Society Series B, 68(5):767–784. Hobolth, A. and Stone, E. A. (2009). Simulation from endpoint-conditioned, continuous-time Markov chains on a finite state space, with applications to molecular evolution. Ann Appl Stat, 3(3):1204. Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 37 / 40
Jarrett, B. Y. R. G. (1979). A note on the intervals between coal-mining disasters. Biometrika, 66(1):191–193. Jensen, A. (1953). Markoff chains as an aid in the study of Markoff processes.
Murray, I., Adams, R. P., and MacKay, D. J. (2010). Elliptical slice sampling. JMLR: W&CP, 9:541–548. Nodelman, U., Koller, D., and Shelton, C. (2005). Expectation propagation for continuous time Bayesian networks. In Proceedings of the Twenty-first Conference on Uncertainty in AI (UAI), pages 431–440, Edinburgh, Scottland, UK. Nodelman, U., Shelton, C., and Koller, D. (2002). Continuous time Bayesian networks. In Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI), pages 378–387. Opper, M. and Sanguinetti, G. (2007). Variational inference for Markov jump processes. In NIPS. Rao, V. and Teh, Y. W. (2011a). Fast MCMC sampling for Markov jump processes and continuous time Bayesian networks. In Proceedings of the International Conference on Uncertainty in Artificial Intelligence. Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 38 / 40
Rao, V. and Teh, Y. W. (2011b). Gaussian process modulated renewal processes. In Advances in Neural Information Processing Systems 23. Rasmussen, C. E. and Williams, C. K. I. (2006). Gaussian Processes for Machine Learning. MIT Press. Rokem, A., Watzl, S., Gollisch, T., Stemmler, M., Herz, A. V. M., Watzl, S., Gollisch, T., Stemmler, M., and Herz, A.
Spike-Timing Precision Underlies the Coding Efficiency of Auditory Receptor Neurons. Journal of Neurophysiology, pages 2541–2552. Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 39 / 40
Gnew of the GP on G ∪ ˜
Gprev.
ˆ λσ(l(a))h(a−Gi−1) Ω
Gnew be the restriction of lA to
Gprev.
Gnew using, for example, elliptical slice sampling.
Vinayak Rao, Yee Whye Teh (Gatsby Unit) MCMC for Continuous Time Systems 40 / 40