bayesian linear regression

Bayesian Linear Regression Seung-Hoon Na Chonbuk National - PowerPoint PPT Presentation

Bayesian Linear Regression Seung-Hoon Na Chonbuk National University Bayesian Linear Regression Compute the full posterior over and 2 Case 1) the noise variance 2 is known Use Gaussian prior Case 2) the noise


  1. Bayesian Linear Regression Seung-Hoon Na Chonbuk National University

  2. Bayesian Linear Regression β€’ Compute the full posterior over 𝒙 and 𝜏 2 β€’ Case 1) the noise variance 𝜏 2 is known – Use Gaussian prior β€’ Case 2) the noise variance 𝜏 2 is unknown – Use normal inverse gamma (NIG) prior

  3. Posterior: 𝜏 2 is known β€’ The likelihood offset putting an improper prior on 𝜈 Further assume that the output is centered: β€’ The conjugate prior:

  4. Posterior: 𝜏 2 is known β€’ The posterior: – If and then the posterior mean reduces to the ridge estimate with βˆ’1 𝜏 2 𝜐 2 𝑱 + 𝒀 𝑼 𝒀 𝒀 π‘ˆ 𝒛 𝒙 𝑂 =

  5. Posterior: 𝜏 2 is known β€’ 1D example – the true parameters: π‘₯ 0 = βˆ’0.3 , π‘₯ 1 = 0.5 β€’ Sequential Bayesian inference β€’ Posterior given the first n data points

  6. π‘œ = 0 π‘œ = 1 π‘œ = 2 π‘œ = 20 π‘₯ 0 = βˆ’0.3 , π‘₯ 1 = 0.5

  7. Posterior Predictive: 𝜏 2 is known β€’ The posterior predictive distribution at a test point x : Gaussian β€’ The plug-in approximation: constant error bar

  8. Posterior Predictive: 𝜏 2 is known

  9. Posterior Predictive: 𝜏 2 is known 10 samples from the posterior predictive 10 samples from the plugin approximation to posterior predictive.

  10. Bayesian linear regression: 𝜏 2 is unknown β€’ The likelihood: β€’ The natural conjugate prior:

  11. Inverse Wishart Distribution β€’ Similarly, If D = 1, the Wishart reduces to the Gamma distribution

  12. Inverse Wishart Distribution If D = 1, this reduces to the inverse Gamma

  13. Bayesian linear regression: 𝜏 2 is unknown β€’ The posterior: β€’ The posterior marginals

  14. Bayesian linear regression: 𝜏 2 is unknown β€’ The posterior predictive: Student T distribution β€’ Given new test inputs

  15. Bayesian linear regression: 𝜏 2 is unknown – Uninformative prior β€’ It is common to set 𝑏 0 = 𝑐 0 = 0 , corresponding to an uninformative prior for 𝜏 2 , and to set β€’ The unit information prior:

  16. Bayesian linear regression: 𝜏 2 is unknown – Uninformative prior β€’ An uninformative prior: use the uninformative limit of the conjugate g-prior, which corresponds to setting 𝑕 = ∞

  17. Bayesian linear regression: 𝜏 2 is unknown – Uninformative prior β€’ The marginal distribution of the weights:

  18. Bayesian linear regression: Evidence procedure β€’ Evidence procedure – an empirical Bayes procedure for picking the hyper- parameters – Choose to maximize the marginal likelihood, where πœ‡ = 1/𝜏 2 is the precision of the observation noise and 𝛽 is the precision of the prior – Provides an alternative to using cross validation

  19. Bayesian linear regression: Evidence procedure

Recommend


More recommend