[PPT] - A GPU Implementation of a Spatial GLMM: Assessing Spatially Varying PowerPoint Presentation

SLIDE 1

A GPU Implementation of a Spatial GLMM: Assessing Spatially Varying Coefficients of Multiple Sclerosis Lesions

Timothy D. Johnson

Department of Biostatistics University of Michigan

Johnson (University of Michigan) MS-GPU Jan 6, 2014 1 / 18

SLIDE 2

Outline

1

Disease Background

2

The Model

3

Intro to GPUs

4

Results

5

Acknowledgements

Johnson (University of Michigan) MS-GPU Jan 6, 2014 2 / 18

SLIDE 3

Disease Background

Multiple Sclerosis

Autoimmune disease affecting the central nervous system

Johnson (University of Michigan) MS-GPU Jan 6, 2014 3 / 18

SLIDE 4

Disease Background

Multiple Sclerosis

Autoimmune disease affecting the central nervous system Affects women more than men

Johnson (University of Michigan) MS-GPU Jan 6, 2014 3 / 18

SLIDE 5

Disease Background

Multiple Sclerosis

Autoimmune disease affecting the central nervous system Affects women more than men Commonly diagnosed between 20 and 40 years of age.

Johnson (University of Michigan) MS-GPU Jan 6, 2014 3 / 18

SLIDE 6

Disease Background

Multiple Sclerosis

Autoimmune disease affecting the central nervous system Affects women more than men Commonly diagnosed between 20 and 40 years of age. Immune system attacks the myelin sheath surrounding axons

Slows nerve impulse

Johnson (University of Michigan) MS-GPU Jan 6, 2014 3 / 18

SLIDE 7

Disease Background

Multiple Sclerosis

Autoimmune disease affecting the central nervous system Affects women more than men Commonly diagnosed between 20 and 40 years of age. Immune system attacks the myelin sheath surrounding axons

Slows nerve impulse

Immune system attacks the myelin

Johnson (University of Michigan) MS-GPU Jan 6, 2014 3 / 18

SLIDE 8

Disease Background

Multiple Sclerosis

Autoimmune disease affecting the central nervous system Affects women more than men Commonly diagnosed between 20 and 40 years of age. Immune system attacks the myelin sheath surrounding axons

Slows nerve impulse

Immune system attacks the myelin Causes inflammation visible on MRI

Johnson (University of Michigan) MS-GPU Jan 6, 2014 3 / 18

SLIDE 9

Disease Background

Multiple Sclerosis

Autoimmune disease affecting the central nervous system Affects women more than men Commonly diagnosed between 20 and 40 years of age. Immune system attacks the myelin sheath surrounding axons

Slows nerve impulse

Immune system attacks the myelin Causes inflammation visible on MRI Cause of autoimmune disease is unknown, theories include

Viral infection Genetics Environmental factors

Johnson (University of Michigan) MS-GPU Jan 6, 2014 3 / 18

SLIDE 10

Disease Background

Multiple Sclerosis

Autoimmune disease affecting the central nervous system Affects women more than men Commonly diagnosed between 20 and 40 years of age. Immune system attacks the myelin sheath surrounding axons

Slows nerve impulse

Immune system attacks the myelin Causes inflammation visible on MRI Cause of autoimmune disease is unknown, theories include

Viral infection Genetics Environmental factors

No known cure

Johnson (University of Michigan) MS-GPU Jan 6, 2014 3 / 18

SLIDE 11

Disease Background

Multiple Sclerosis Functional Systems Scores

FSS

Johnson (University of Michigan) MS-GPU Jan 6, 2014 4 / 18

SLIDE 12

Disease Background

Multiple Sclerosis Functional Systems Scores

FSS Visual function — blind spots, decrease in visual acuity

Johnson (University of Michigan) MS-GPU Jan 6, 2014 4 / 18

SLIDE 13

Disease Background

Multiple Sclerosis Functional Systems Scores

FSS Visual function — blind spots, decrease in visual acuity Cerebellar function — coordination

Johnson (University of Michigan) MS-GPU Jan 6, 2014 4 / 18

SLIDE 14

Disease Background

Multiple Sclerosis Functional Systems Scores

FSS Visual function — blind spots, decrease in visual acuity Cerebellar function — coordination Brainstem function — difficulty in speech, swallowing, ...

Johnson (University of Michigan) MS-GPU Jan 6, 2014 4 / 18

SLIDE 15

Disease Background

Multiple Sclerosis Functional Systems Scores

FSS Visual function — blind spots, decrease in visual acuity Cerebellar function — coordination Brainstem function — difficulty in speech, swallowing, ... Pyramidal function — walking

Johnson (University of Michigan) MS-GPU Jan 6, 2014 4 / 18

SLIDE 16

Disease Background

Multiple Sclerosis Functional Systems Scores

FSS Visual function — blind spots, decrease in visual acuity Cerebellar function — coordination Brainstem function — difficulty in speech, swallowing, ... Pyramidal function — walking Sensory function — touch and pain

Johnson (University of Michigan) MS-GPU Jan 6, 2014 4 / 18

SLIDE 17

Disease Background

Multiple Sclerosis Functional Systems Scores

FSS Visual function — blind spots, decrease in visual acuity Cerebellar function — coordination Brainstem function — difficulty in speech, swallowing, ... Pyramidal function — walking Sensory function — touch and pain Bowel and Bladder function

Johnson (University of Michigan) MS-GPU Jan 6, 2014 4 / 18

SLIDE 18

Disease Background

Multiple Sclerosis Functional Systems Scores

FSS Visual function — blind spots, decrease in visual acuity Cerebellar function — coordination Brainstem function — difficulty in speech, swallowing, ... Pyramidal function — walking Sensory function — touch and pain Bowel and Bladder function Cerebral function — memory, mood, dementia

Johnson (University of Michigan) MS-GPU Jan 6, 2014 4 / 18

SLIDE 19

Disease Background

Multiple Sclerosis Functional Systems Scores

FSS Visual function — blind spots, decrease in visual acuity Cerebellar function — coordination Brainstem function — difficulty in speech, swallowing, ... Pyramidal function — walking Sensory function — touch and pain Bowel and Bladder function Cerebral function — memory, mood, dementia PASAT score Paced auditory serial addition test

Assesses auditory speed/flexibility + calculation ability

Johnson (University of Michigan) MS-GPU Jan 6, 2014 4 / 18

SLIDE 20

Disease Background

MS high-resolution imaging

Data are T2 hyperintense lesions

Johnson (University of Michigan) MS-GPU Jan 6, 2014 5 / 18

SLIDE 21

Disease Background

MS high-resolution imaging

Data are T2 hyperintense lesions

Johnson (University of Michigan) MS-GPU Jan 6, 2014 5 / 18

SLIDE 22

The Model

Notation

sj, j = 1, . . . , M will denote the jth voxel at location (x, y, z) (lexicographical ordering)

Johnson (University of Michigan) MS-GPU Jan 6, 2014 6 / 18

SLIDE 23

The Model

Notation

sj, j = 1, . . . , M will denote the jth voxel at location (x, y, z) (lexicographical ordering) Yi(sj) ∈ {0, 1}: outcome for subject i = 1, . . . , N at voxel, or site, sj

Johnson (University of Michigan) MS-GPU Jan 6, 2014 6 / 18

SLIDE 24

The Model

Notation

sj, j = 1, . . . , M will denote the jth voxel at location (x, y, z) (lexicographical ordering) Yi(sj) ∈ {0, 1}: outcome for subject i = 1, . . . , N at voxel, or site, sj Xi: subject specific covariate vector.

Johnson (University of Michigan) MS-GPU Jan 6, 2014 6 / 18

SLIDE 25

The Model

Notation

sj, j = 1, . . . , M will denote the jth voxel at location (x, y, z) (lexicographical ordering) Yi(sj) ∈ {0, 1}: outcome for subject i = 1, . . . , N at voxel, or site, sj Xi: subject specific covariate vector. β(sj): parameter vector at voxel sj.

Johnson (University of Michigan) MS-GPU Jan 6, 2014 6 / 18

SLIDE 26

The Model

Notation

sj, j = 1, . . . , M will denote the jth voxel at location (x, y, z) (lexicographical ordering) Yi(sj) ∈ {0, 1}: outcome for subject i = 1, . . . , N at voxel, or site, sj Xi: subject specific covariate vector. β(sj): parameter vector at voxel sj. MRF (Markov Random Field) model

Johnson (University of Michigan) MS-GPU Jan 6, 2014 6 / 18

SLIDE 27

The Model

Notation

sj, j = 1, . . . , M will denote the jth voxel at location (x, y, z) (lexicographical ordering) Yi(sj) ∈ {0, 1}: outcome for subject i = 1, . . . , N at voxel, or site, sj Xi: subject specific covariate vector. β(sj): parameter vector at voxel sj. MRF (Markov Random Field) model

Voxels are rectangular prisms (commonly cubes)

Johnson (University of Michigan) MS-GPU Jan 6, 2014 6 / 18

SLIDE 28

The Model

Notation

sj, j = 1, . . . , M will denote the jth voxel at location (x, y, z) (lexicographical ordering) Yi(sj) ∈ {0, 1}: outcome for subject i = 1, . . . , N at voxel, or site, sj Xi: subject specific covariate vector. β(sj): parameter vector at voxel sj. MRF (Markov Random Field) model

Voxels are rectangular prisms (commonly cubes) If sj and sk are neighboring sites, we denote this by sj ∼ sk

Johnson (University of Michigan) MS-GPU Jan 6, 2014 6 / 18

SLIDE 29

The Model

Notation

sj, j = 1, . . . , M will denote the jth voxel at location (x, y, z) (lexicographical ordering) Yi(sj) ∈ {0, 1}: outcome for subject i = 1, . . . , N at voxel, or site, sj Xi: subject specific covariate vector. β(sj): parameter vector at voxel sj. MRF (Markov Random Field) model

Voxels are rectangular prisms (commonly cubes) If sj and sk are neighboring sites, we denote this by sj ∼ sk The set of neighbors of sj is denoted by ∂sj

Johnson (University of Michigan) MS-GPU Jan 6, 2014 6 / 18

SLIDE 30

The Model

Notation

sj, j = 1, . . . , M will denote the jth voxel at location (x, y, z) (lexicographical ordering) Yi(sj) ∈ {0, 1}: outcome for subject i = 1, . . . , N at voxel, or site, sj Xi: subject specific covariate vector. β(sj): parameter vector at voxel sj. MRF (Markov Random Field) model

Voxels are rectangular prisms (commonly cubes) If sj and sk are neighboring sites, we denote this by sj ∼ sk The set of neighbors of sj is denoted by ∂sj The number of neighbors of sj is N(sj)

Johnson (University of Michigan) MS-GPU Jan 6, 2014 6 / 18

SLIDE 31

The Model

Notation

sj, j = 1, . . . , M will denote the jth voxel at location (x, y, z) (lexicographical ordering) Yi(sj) ∈ {0, 1}: outcome for subject i = 1, . . . , N at voxel, or site, sj Xi: subject specific covariate vector. β(sj): parameter vector at voxel sj. MRF (Markov Random Field) model

Voxels are rectangular prisms (commonly cubes) If sj and sk are neighboring sites, we denote this by sj ∼ sk The set of neighbors of sj is denoted by ∂sj The number of neighbors of sj is N(sj) Two voxels are neighbors if they shared a common face

each voxel has at most 6 neighbors

Johnson (University of Michigan) MS-GPU Jan 6, 2014 6 / 18

SLIDE 32

The Model

Notation

sj, j = 1, . . . , M will denote the jth voxel at location (x, y, z) (lexicographical ordering) Yi(sj) ∈ {0, 1}: outcome for subject i = 1, . . . , N at voxel, or site, sj Xi: subject specific covariate vector. β(sj): parameter vector at voxel sj. MRF (Markov Random Field) model

Voxels are rectangular prisms (commonly cubes) If sj and sk are neighboring sites, we denote this by sj ∼ sk The set of neighbors of sj is denoted by ∂sj The number of neighbors of sj is N(sj) Two voxels are neighbors if they shared a common face

each voxel has at most 6 neighbors

If two voxels are not neighbors, they are cond. indep. given their neighbors (Markov property)

Johnson (University of Michigan) MS-GPU Jan 6, 2014 6 / 18

SLIDE 33

The Model

The Multivariate Gaussian MRF

Let βT = (βT(s1), . . . , βT(sM)) where each β(si) is a p-col vector

Johnson (University of Michigan) MS-GPU Jan 6, 2014 7 / 18

SLIDE 34

The Model

The Multivariate Gaussian MRF

Let βT = (βT(s1), . . . , βT(sM)) where each β(si) is a p-col vector Following Mardia (1988) the full conditionals are given by [β(si) | β(s−i), Σ] ∼ MVN

sj∈∂si β(sj)

N(si) , Σ N(si)

Johnson (University of Michigan)

MS-GPU Jan 6, 2014 7 / 18

SLIDE 35

The Model

The Multivariate Gaussian MRF

Let βT = (βT(s1), . . . , βT(sM)) where each β(si) is a p-col vector Following Mardia (1988) the full conditionals are given by [β(si) | β(s−i), Σ] ∼ MVN

sj∈∂si β(sj)

N(si) , Σ N(si)

By Brook’s lemma the joint dist has form

π [β | Σ] ∝ exp   −1 2

si∼sj
β(si) − β(sj)

T Σ−1 β(si) − β(sj)



 

Johnson (University of Michigan) MS-GPU Jan 6, 2014 7 / 18

SLIDE 36

The Model

The Multivariate Gaussian MRF

Let βT = (βT(s1), . . . , βT(sM)) where each β(si) is a p-col vector Following Mardia (1988) the full conditionals are given by [β(si) | β(s−i), Σ] ∼ MVN

sj∈∂si β(sj)

N(si) , Σ N(si)

By Brook’s lemma the joint dist has form

π [β | Σ] ∝ exp   −1 2

si∼sj
β(si) − β(sj)

T Σ−1 β(si) − β(sj)



  We write [β | Σ] ∼ G-MRF(Σ)

Johnson (University of Michigan) MS-GPU Jan 6, 2014 7 / 18

SLIDE 37

The Model

Spatially Varying Coefficient Model

Probit Regression Model Φ−1 Pr(Yi(sj) = 1 | ·)

=

X T

i α + X T i β(sj)

Johnson (University of Michigan) MS-GPU Jan 6, 2014 8 / 18

SLIDE 38

The Model

Spatially Varying Coefficient Model

Probit Regression Model Φ−1 Pr(Yi(sj) = 1 | ·)

=

X T

i α + X T i β(sj)

Xi: vector of p covariates α: associated parameter vector (fixed effects, intercepts and slopes)

Johnson (University of Michigan) MS-GPU Jan 6, 2014 8 / 18

SLIDE 39

The Model

Spatially Varying Coefficient Model

Probit Regression Model Φ−1 Pr(Yi(sj) = 1 | ·)

=

X T

i α + X T i β(sj)

Xi: vector of p covariates β(sj): associated parameter vector (spatial random effects) Let βT = (βT(s1), . . . , βT(sM))

Johnson (University of Michigan) MS-GPU Jan 6, 2014 8 / 18

SLIDE 40

The Model

Spatially Varying Coefficient Model

Probit Regression Model Φ−1 Pr(Yi(sj) = 1 | ·)

=

X T

i α + X T i β(sj)

= X T

i β∗(sj)

Xi: vector of p covariates Reparametrize: β∗(sj) = α + β(sj)—aids in mixing Let β∗T = (β∗T(s1), . . . , β∗T(sM)) [β∗ | Σ] ∼ G-MRF(Σ) Σ−1 ∼ W(ν, I) ν = 0 gives an improper, uninformative prior

Johnson (University of Michigan) MS-GPU Jan 6, 2014 8 / 18

SLIDE 41

The Model

Latent Variable Representation

We use the Latent Variable rep introduced by Albert & Chib (1993)

1

Introduce latent variable di(sj) ∼ N(X T

i β∗(sj), 1)

Johnson (University of Michigan) MS-GPU Jan 6, 2014 9 / 18

SLIDE 42

The Model

Latent Variable Representation

We use the Latent Variable rep introduced by Albert & Chib (1993)

1

Introduce latent variable di(sj) ∼ N(X T

i β∗(sj), 1)

2

Define Pr[Yi(sj) = 1 | di(sj)] = 1 : di(sj) > 0 : di(sj) ≤ 0

binary regression transformed into a cont. regression problem

Johnson (University of Michigan) MS-GPU Jan 6, 2014 9 / 18

SLIDE 43

The Model

Latent Variable Representation

We use the Latent Variable rep introduced by Albert & Chib (1993)

1

Introduce latent variable di(sj) ∼ N(X T

i β∗(sj), 1)

2

Define Pr[Yi(sj) = 1 | di(sj)] = 1 : di(sj) > 0 : di(sj) ≤ 0

binary regression transformed into a cont. regression problem

3

Given the di(sj) our model now a spatial (continuous) mixed effects model with normal errors.

Johnson (University of Michigan) MS-GPU Jan 6, 2014 9 / 18

SLIDE 44

The Model

Latent Variable Representation

We use the Latent Variable rep introduced by Albert & Chib (1993)

1

Introduce latent variable di(sj) ∼ N(X T

i β∗(sj), 1)

2

Define Pr[Yi(sj) = 1 | di(sj)] = 1 : di(sj) > 0 : di(sj) ≤ 0

binary regression transformed into a cont. regression problem

3

Given the di(sj) our model now a spatial (continuous) mixed effects model with normal errors.

4

Update the full conditional of each di(sj) from its truncated normal distribution

Johnson (University of Michigan) MS-GPU Jan 6, 2014 9 / 18

SLIDE 45

The Model

Latent Variable Representation

We use the Latent Variable rep introduced by Albert & Chib (1993)

1

Introduce latent variable di(sj) ∼ N(X T

i β∗(sj), 1)

2

Define Pr[Yi(sj) = 1 | di(sj)] = 1 : di(sj) > 0 : di(sj) ≤ 0

binary regression transformed into a cont. regression problem

3

Given the di(sj) our model now a spatial (continuous) mixed effects model with normal errors.

4

Update the full conditional of each di(sj) from its truncated normal distribution

5

Given di(sj), model parameters can be drawn from full conditionals due to conjugacy

Johnson (University of Michigan) MS-GPU Jan 6, 2014 9 / 18

SLIDE 46

The Model

Latent Variable Representation

We use the Latent Variable rep introduced by Albert & Chib (1993)

1

Introduce latent variable di(sj) ∼ N(X T

i β∗(sj), 1)

2

Define Pr[Yi(sj) = 1 | di(sj)] = 1 : di(sj) > 0 : di(sj) ≤ 0

binary regression transformed into a cont. regression problem

3

Given the di(sj) our model now a spatial (continuous) mixed effects model with normal errors.

4

Update the full conditional of each di(sj) from its truncated normal distribution

5

Given di(sj), model parameters can be drawn from full conditionals due to conjugacy

6

Marginalizing over di(sj), retrieve probit regression model.

Johnson (University of Michigan) MS-GPU Jan 6, 2014 9 / 18

SLIDE 47

The Model

Why GPU?

15 Subject Specific Covariates 7 functional system scores, PASAT score MS subtype (4 distinct subtypes) age, disease duration, gender—nuisance parameters

Johnson (University of Michigan) MS-GPU Jan 6, 2014 10 / 18

SLIDE 48

The Model

Why GPU?

15 Subject Specific Covariates 7 functional system scores, PASAT score MS subtype (4 distinct subtypes) age, disease duration, gender—nuisance parameters Problem Size 3D image Approx 66 million observed outcomes

275K voxels times 239 subjects

Approx 41 million spatially varying coefficients

275K voxels times 15 covariates

Johnson (University of Michigan) MS-GPU Jan 6, 2014 10 / 18

SLIDE 49

Intro to GPUs

CPU/GPU Comparison

CPUs dedicate more space to Cache/Control

GPUs dedicate more space to ALUs

Johnson (University of Michigan) MS-GPU Jan 6, 2014 11 / 18

SLIDE 50

Intro to GPUs

GPU Memory Structure

Johnson (University of Michigan)

MS-GPU Jan 6, 2014 12 / 18

SLIDE 51

Intro to GPUs

GPU Coding Guidelines

Three basic strategies for performance optimization

1

maximize parallel execution

find parts of code that can execute in parallel

Johnson (University of Michigan) MS-GPU Jan 6, 2014 13 / 18

SLIDE 52

Intro to GPUs

GPU Coding Guidelines

Three basic strategies for performance optimization

1

maximize parallel execution

find parts of code that can execute in parallel

2

ptimize memory usage

restructure data reduce memory transfers (esp CPU ⇐ ⇒ GPU) load global into shared memory and process

Johnson (University of Michigan) MS-GPU Jan 6, 2014 13 / 18

SLIDE 53

Intro to GPUs

GPU Coding Guidelines

Three basic strategies for performance optimization

1

maximize parallel execution

find parts of code that can execute in parallel

2

ptimize memory usage

restructure data reduce memory transfers (esp CPU ⇐ ⇒ GPU) load global into shared memory and process

3

ptimize instruction usage

avoid conditional statements (if, switch, do, for, while) replace regular functions by intrinsic functions

e.g. replace x/y by fdividef(x, y) however a little less precise

Johnson (University of Michigan) MS-GPU Jan 6, 2014 13 / 18

SLIDE 54

Intro to GPUs

SGLMM Implementation

My GPU has 2496 processors 5 Gb DRAM All data and parameters fit in 0.53 Gb DRAM Combination of GPU/CPU computation

Johnson (University of Michigan) MS-GPU Jan 6, 2014 14 / 18

SLIDE 55

Intro to GPUs

Parallelization of Spatial Coefficient Updates

2 4 6 8 2 4 6 8 Johnson (University of Michigan) MS-GPU Jan 6, 2014 15 / 18

SLIDE 56

Intro to GPUs

Increase in Computation Efficiency

Update CPU (ms) GPU (ms) Efficiency Gain Latent Var 25059 56 447

Spat. Coef

10944 130 84

Spat. Prec

870 24 36

Post. Prb.

301 14 22 Total 37174 224 166 Times are per iteration For 30K iterations

CPU approx 13 days GPU approx 2 hrs 25 minutes

Johnson (University of Michigan) MS-GPU Jan 6, 2014 16 / 18

SLIDE 57

Results

Spatially Varying Coefficients: PASAT Score

Coronal View β

Johnson (University of Michigan) MS-GPU Jan 6, 2014 17 / 18

SLIDE 58

Results

Spatially Varying Coefficients: PASAT Score

Coronal View β

Johnson (University of Michigan) MS-GPU Jan 6, 2014 17 / 18

SLIDE 59

Acknowledgements

Tian Ge

Department of Computer Science, University of Warwick

Thomas E. Nichols

Department of Statistics, University of Warwick

Ernst Wilhelm Rad¨ u

Department of Neurology, University Hospital Basel

US NIH/NINDS grant: 1 R01 NS075066-01A1

Johnson (University of Michigan) MS-GPU Jan 6, 2014 18 / 18