Hierarchical models Dr. Jarad Niemi STAT 544 - Iowa State - PowerPoint PPT Presentation

Hierarchical models Dr. Jarad Niemi STAT 544 - Iowa State University February 21, 2019 Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 1 / 35

Outline Motivating example Independent vs pooled estimates Hierarchical models General structure Posterior distribution Binomial hierarchial model Posterior distribution Prior distributions Stan analysis of binomial hierarchical model informative prior default prior integrating out θ across seasons Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 2 / 35

Modeling Andre Dawkin’s three-point percentage Suppose Y i are the number 3-pointers Andre Dawkin’s makes in season i , and assume ind Y i ∼ Bin ( n i , θ i ) where n i are the number of 3-pointers attempted and θ i is the probability of making a 3-pointer in season i . Do these models make sense? The 3-point percentage every season is the same, i.e. θ i = θ . The 3-point percentage every season is independent of other seasons. The 3-point percentage every season should be similar to other seasons. Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 3 / 35

Modeling Andre Dawkin’s three-point percentage Suppose Y i are the number of 3-pointers Andre Dawkin’s makes in game i , and assume ind Y i ∼ Bin ( n i , θ i ) where n i are the number of 3-pointers attempted in game i and θ i is the probability of making a 3-pointer in game i . Do these models make sense? The 3-point percentage every game is the same, i.e. θ i = θ . The 3-point percentage every game is independent of other games. The 3-point percentage every game should be similar to other games. Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 4 / 35

Modeling Andre Dawkin’s 3-point percentage 95% Credible Intervals 0 5 10 Estimate game Combined Individual 15 20 25 0.00 0.25 0.50 0.75 1.00 θ Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 5 / 35

Modeling Andre Dawkin’s 3-point percentage date opponent made attempts game 1 16017.00 davidson 0 0 1 2 16021.00 kansas 0 0 2 3 16024.00 florida atlantic 5 8 3 4 16027.00 unc asheville 3 6 4 5 16028.00 east carolina 0 1 5 6 16033.00 vermont 3 9 6 7 16036.00 alabama 0 2 7 8 16038.00 arizona 1 1 8 9 16042.00 michigan 2 2 9 10 16055.00 gardner-webb 4 8 10 11 16058.00 ucla 1 5 11 12 16067.00 eastern michigan 6 10 12 13 16070.00 elon 5 7 13 14 16074.00 notre dame 1 4 14 15 16077.00 georgia tech 1 5 15 16 16081.00 clemson 0 4 16 17 16083.00 virginia 1 1 17 18 16088.00 nc state 3 7 18 19 16092.00 miami 2 6 19 20 16095.00 florida state 3 6 20 21 16097.00 pitt 6 7 21 22 16102.00 syracuse 4 9 22 23 16105.00 wake forest 4 7 23 24 16109.00 boston college 0 1 24 Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 6 / 35

Hierarchical models Hierarchical models Consider the following model ind y i ∼ p ( y | θ i ) ind θ i ∼ p ( θ | φ ) φ ∼ p ( φ ) where y i is observed, θ = ( θ 1 , . . . , θ n ) and φ are parameters, and only φ has a prior that is set. This is a hierarchical or multilevel model. Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 7 / 35

Binomial hierarchical model Three-pointer example Our statistical model ind Y i ∼ Bin ( n i , θ i ) ind θ i ∼ Be ( α, β ) α, β ∼ p ( α, β ) In this example, φ = ( α, β ) Be ( α, β ) describes the variability in 3-point percentage across games, and we are going to learn about this variability. Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 9 / 35

Binomial hierarchical model Decomposed posterior ind ind Y i ∼ Bin ( n i , θ i ) θ i ∼ Be ( α, β ) α, β ∼ p ( α, β ) Conditional posterior for θ : n n � � p ( θ | α, β, y ) = p ( θ i | α, β, y i ) = Be ( θ i | α + y i , β + n i − y i ) i =1 i =1 Marginal posterior for ( α, β ) : p ( α, β | y ) ∝ p ( y | α, β ) p ( α, β ) = � n i =1 p ( y i | α, β ) = � n � p ( y | α, β ) p ( y i | θ i ) p ( θ i | α, β ) dθ i i =1 = � n � Bin ( y i | n i , θ i ) Be ( θ i | α, β ) dθ i i =1 � 1 i (1 − θ i ) n i − y i θ α − 1 (1 − θ i ) β − 1 = � n � n i θ y i � dθ i i i =1 0 y i B ( α,β ) � 1 = � n 0 θ α + y i − 1 � n i � 1 (1 − θ i ) β + n i − y i − 1 dθ i i =1 y i B ( α,β ) i � B ( α + y i ,β + n i − y i ) = � n � n i i =1 B ( α,β ) y i Thus y i | α, β ind ∼ Beta-binomial ( n i , α, β ) . Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 10 / 35

Binomial hierarchical model Prior A prior distribution for α and β Recall the interpretation: α : prior successes β : prior failures A more natural parameterization is α prior expectation: µ = α + β prior sample size: η = α + β Place priors on these parameters or transformed to the real line: logit µ = log( µ/ [1 − µ ]) = log( α/β ) log η Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 11 / 35

Binomial hierarchical model Prior A prior distribution for α and β It seems reasonable to assume the mean ( µ ) and size ( η ) are independent a priori : p ( µ, η ) = p ( µ ) p ( η ) Let’s construct a prior that has P (0 . 1 < µ < 0 . 5) ≈ 0 . 95 since most college basketball players have a three-point percentage between 10% and 50% and is somewhat diffuse for η but has more mass for smaller values. Let’s assume an informative prior for µ and η perhaps µ ∼ Be (6 , 14) η ∼ Exp (0 . 05) a = 6 b = 14 e = 1/20 Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 12 / 35

Binomial hierarchical model Prior Prior draws n = 1e4 prior_draws = data.frame(mu = rbeta(n, a, b), eta = rexp(n, e)) %>% mutate(alpha = eta* mu, beta = eta*(1-mu)) prior_draws %>% tidyr::gather(parameter, value) %>% group_by(parameter) %>% summarize(lower95 = quantile(value, prob = 0.025), median = quantile(value, prob = 0.5), upper95 = quantile(value, prob = 0.975)) # A tibble: 4 x 4 parameter lower95 median upper95 <chr> <dbl> <dbl> <dbl> 1 alpha 0.129 3.87 23.9 2 beta 0.359 9.61 51.4 3 eta 0.514 13.8 72.4 4 mu 0.124 0.292 0.511 cor(prior_draws$alpha, prior_draws$beta) [1] 0.7951507 Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 13 / 35

Stan model_informative_prior = " data { int<lower=0> N; // data int<lower=0> n[N]; int<lower=0> y[N]; real<lower=0> a; // prior real<lower=0> b; real<lower=0> e; } parameters { real<lower=0,upper=1> mu; real<lower=0> eta; real<lower=0,upper=1> theta[N]; } transformed parameters { real<lower=0> alpha; real<lower=0> beta; alpha = eta* mu ; beta = eta*(1-mu); } model { mu ~ beta(a,b); eta ~ exponential(e); // implicit joint distributions theta ~ beta(alpha,beta); y ~ binomial(n,theta); } " Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 14 / 35

Stan Stan dat = list(y = d$made, n = d$attempts, N = nrow(d),a = a, b = b, e = e) m = stan_model(model_code = model_informative_prior) r = sampling(m, dat, c("mu","eta","alpha","beta","theta"), iter = 10000) Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 15 / 35

Hierarchical models Dr. Jarad Niemi STAT 544 - Iowa State - PowerPoint PPT Presentation

Hierarchical models Dr. Jarad Niemi STAT 544 - Iowa State University February 21, 2019 Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 1 / 35 Outline Motivating example Independent vs pooled estimates Hierarchical models

What is a hierarchical model? Richard Erickson Quantitative Ecologist DataCamp Hierarchical

Bayesian hierarchical models in Stata Nikolay Balov StataCorp LP 2016 Stata Conference Nikolay

Bayesian hierarchical models Bruno Nicenboim / Shravan Vasishth 2020-03-14 1 Bayesian

Hierarchical Bounding Volume October 11, 2005 () Hierarchical Bounding Volume October 11, 2005

Hierarchical models (cont.) Dr. Jarad Niemi STAT 544 - Iowa State University February 21, 2019

OLD Paired t-test Richard Erickson Quantitative Ecologist DataCamp Hierarchical and Mixed

Crash course on GLMs Richard Erickson Quantitative Ecologist DataCamp Hierarchical and Mixed

Hierarchical models Dr. Jarad Niemi Iowa State University August 31, 2017 Jarad Niemi (Iowa

Unsupervised Learning and Clustering Owen Roberts, Zach Busser, Ganesh Sugunan Hierarchical

Hierarchical linear models Dr. Jarad Niemi STAT 544 - Iowa State University April 30, 2019

Estimating Distributional Parameters in Hierarchical Models Introduction: Variability in

Part 7 Bayesian hierarchical modelling, simulation and MCMC by Gero Walter 252 Bayesian

Hierarchical Models Applied Bayesian Statistics Dr. Earvin Balderama Department of Mathematics

Linear mixed effect model- Birth rates data Richard Erickson Quantitative Ecologist DataCamp

Multilevel Mixed (hierarchical) models Christopher F Baum ECON 8823: Applied Econometrics Boston

Multilevel Mixed (hierarchical) models Christopher F Baum EC 823: Applied Econometrics Boston

IceCube A-333 Fieldwork Plans Kael Hanson and the IceCube M&O Team IceCube Management and

Seasonal ARIMA Models Many time series collected on a monthly or quarterly basis have seasonal

Dr Stephen I. Thomson Some little things In the submission scripts, there is an option to

The seasons Astronomy 101 Syracuse University, Fall 2020 Walter Freeman September 8, 2020

Earth-Sun Relationships Energy received from the Sun drives weather and climate, so it is

Improving predictive accuracy using Smart-Data rather than Big-Data : A case study of soccer

Expressing experience: Not necessarily stoned, but beautiful Chris Kennedy and Malte

Status and prospect of the NEWS-G experiment Alexis Brossard, on behalf of the NEWS-G

Sambuz

Useful Links

Newsletter

Mail Us

Hierarchical models Dr. Jarad Niemi STAT 544 - Iowa State - PowerPoint PPT Presentation

Hierarchical models Dr. Jarad Niemi STAT 544 - Iowa State University February 21, 2019 Jarad Niemi (STAT544@ISU) Hierarchical models February 21, 2019 1 / 35 Outline Motivating example Independent vs pooled estimates Hierarchical models

What is a hierarchical model? Richard Erickson Quantitative Ecologist DataCamp Hierarchical

Bayesian hierarchical models in Stata Nikolay Balov StataCorp LP 2016 Stata Conference Nikolay

Bayesian hierarchical models Bruno Nicenboim / Shravan Vasishth 2020-03-14 1 Bayesian

Hierarchical Bounding Volume October 11, 2005 () Hierarchical Bounding Volume October 11, 2005

Hierarchical models (cont.) Dr. Jarad Niemi STAT 544 - Iowa State University February 21, 2019

OLD Paired t-test Richard Erickson Quantitative Ecologist DataCamp Hierarchical and Mixed

Crash course on GLMs Richard Erickson Quantitative Ecologist DataCamp Hierarchical and Mixed

Hierarchical models Dr. Jarad Niemi Iowa State University August 31, 2017 Jarad Niemi (Iowa

Unsupervised Learning and Clustering Owen Roberts, Zach Busser, Ganesh Sugunan Hierarchical

Hierarchical linear models Dr. Jarad Niemi STAT 544 - Iowa State University April 30, 2019

Estimating Distributional Parameters in Hierarchical Models Introduction: Variability in

Part 7 Bayesian hierarchical modelling, simulation and MCMC by Gero Walter 252 Bayesian

Hierarchical Models Applied Bayesian Statistics Dr. Earvin Balderama Department of Mathematics

Linear mixed effect model- Birth rates data Richard Erickson Quantitative Ecologist DataCamp

Multilevel Mixed (hierarchical) models Christopher F Baum ECON 8823: Applied Econometrics Boston

Multilevel Mixed (hierarchical) models Christopher F Baum EC 823: Applied Econometrics Boston

IceCube A-333 Fieldwork Plans Kael Hanson and the IceCube M&amp;O Team IceCube Management and

Seasonal ARIMA Models Many time series collected on a monthly or quarterly basis have seasonal

Dr Stephen I. Thomson Some little things In the submission scripts, there is an option to

The seasons Astronomy 101 Syracuse University, Fall 2020 Walter Freeman September 8, 2020

Earth-Sun Relationships Energy received from the Sun drives weather and climate, so it is

Improving predictive accuracy using Smart-Data rather than Big-Data : A case study of soccer

Expressing experience: Not necessarily stoned, but beautiful Chris Kennedy and Malte

Status and prospect of the NEWS-G experiment Alexis Brossard, on behalf of the NEWS-G

Sambuz

Useful Links

Newsletter

Mail Us

IceCube A-333 Fieldwork Plans Kael Hanson and the IceCube M&O Team IceCube Management and