Stéphanie Allassonnière
with J.B. Schiratti, J. Chevallier, I. Koval, V. Debavalaere and
- S. Durrleman
Mixed effect model for the spatiotemporal analysis of longitudinal - - PowerPoint PPT Presentation
Mixed effect model for the spatiotemporal analysis of longitudinal manifold valued data Stphanie Allassonnire with J.B. Schiratti, J. Chevallier, I. Koval, V. Debavalaere and S. Durrleman Universit Paris Descartes & Ecole
2
3
population from the atlas
4
– One observation per subject – Image or shape (viewed as currents) – Deformations either linearized or diffeomorphic – Homogeneous or heterogeneous populations (mixture models)
5
6
7
– One observation per subject – Image or shape (viewed as currents) – Deformations either linearized or diffeomorphic – Homogeneous or heterogeneous populations (mixture models)
Ø One observation per subject Ø Corresponding acquistion time
8
– Several observation per subject – Image, shape, etc – Atlas = representative trajectory and population variability
9
Temporal marker of progression
(e.g. time since drug injection, seeding, birth, etc..)
subject #1 subject #2 subject #3 Time (age) Regression
(e.g. compare measurements at same time-point)
No temporal marker of progression
(e.g. in aging, neurodegenerative diseases, etc..)
How to learn representative trajectories of data changes from longitudinal data?
Learning spatiotemporal distribution of trajectories
stages of progression
Linear mixed-effects models
[Laird&Ware’82, Diggle et al., Fitzmaurice et al.]
Needs to disentangle differences in:
changes vector-valued data manifold-valued data
(normalized data, positive matrices, shapes, etc;;)
10
[Schiratti et al. IPMI’15, NIPS’15]
data changes
changes
identifiability (unique space/time decomposition)
random variable
t0 t
v0
p0
T0(t) = Expp0,t0(v0)(t)
vi
pij
vij
yij = Ti(ψi(t)) + εij
Ti(t) = ExpT0(t)(PT0
t0,t(vi))
ψi(t) = t0 + αi(t − t0 − τi)
Acceleration factor Time-shift Space-shift
αi ∼ log N(0, σ2
α)
τi ∼ N(0, σ2
τ)
vi = (A1| . . . |AK) si Ak⊥v0 Random effects: Fixed effects: (p0, t0, v0)
(σ2
α, σ2 τ, A1, ...AK)
and
11
yij = Ti(ψi(t)) + εij
Ti(t) = ExpT0(t)(PT0
t0,t(vi))
T0(t) = Expp0,t0(v0)(t)
ψi(t) = t0 + αi(t − t0 − τi) αi ∼ log N(0, σ2
α)
τi ∼ N(0, σ2
τ)
vi = (A1| . . . |AK) si Ak⊥v0 (p0, t0, v0)
Submanifold value observations Parallel curve Representative trajectory Linear time reparametrization
Hidden random variables: Acceleration factor Time shift Space shift Parameters: Mean trajectory parametrization and prior parameter
(σ2
α, σ2 τ, A1, ...AK)
12
vi
pij
vij
p0
Σ P0tT ΣP0t
v0
p0
vi vi
pij
vij v0
p0
13
y time
b
t0 yij = (a + ai)(ti,j − t0) + b + bi | {z } +εi,j
Measurement of the ith subject at time t0 Laird & Ware (1982) Schiratti et al. (2015)
time
b
t0 yij = (a + ai)(ti,j −t0 − τi | {z }) + b + εi,j
Time at which measurement of the ith subject reaches ¯
b
x x
14
Riemannian log at p0 = 0.5), since p0 is estimated
from Laird&Ware)
M =]0, 1[, g(p)(u, v) = uv p2(1 − p)2 γ0(t) = 1 + (1 − p0)/p0 exp ⇣ −
v0 p0(1−p0) (t − t0)
⌘ yij = γ0 ⇣ t0 + αi(t − t0 − τi) ⌘ + εij
15
à The parallel changes the relative timing of the effect onset across coordinates
M =]0, 1[N, g(p)(u, v) =
N
X
k=1
ukvk p2
k(1 − pk)2
γδ(t) = ⇣ γ0(t), γ0(t − δ1), . . . , γ0(t − δN−1) ⌘
✓ γ0 ✓ t + vi,1 v0 ◆ , γ0 ✓ t − δ1 + vi,2 v0 ◆ , ..., γ0 ✓ t − δN−1 + vi,N v0 ◆◆
16
maxθ p(y|θ) = Z p(y, z|θ)dz p(yi|zi, θ)p(zi|θ) θk+1 = argmaxθ
N
X
i=1
Z log ✓ p(yi, zi|θ) | {z } ◆ p(zi|yi, θk)dzi θk+1 = argmaxθ ( φ(θ)T
N
X
i=1
Z S(yi, zi)p(zi|yi, θk)dzi − N log(C(θ)) )
log p(yi, zi|θ) = φ(θ)T S(yi, zi) − log(C(θ))
y = (y1, ..., yN), z = (z1, ...zN), θ = (σ2
z, σ2 ε, A1, ..., AK, p0, t0, v0)
17
sample from , and a stochastic approximation of the sufficient statistics Maximization step (unchanged)
geometrically ergodic Markov chain targeting the conditional distribution
zi,k+1 p(zi|yi, θk) Sk+1 = (1 − ∆k)Sk + ∆k 1 N
N
X
i=1
S(yi, zi,k+1) ! θk+1 = argmaxθ
[Delyon, Lavielle, Moulines.’99] [Allassonnière et al.’10]
18
sample from , and a stochastic approximation of the sufficient statistics Maximization step (unchanged)
geometrically ergodic Markov chain targeting the conditional distribution As long as “converges towards” as
zi,k+1 p(zi|yi, θk) Sk+1 = (1 − ∆k)Sk + ∆k 1 N
N
X
i=1
S(yi, zi,k+1) ! θk+1 = argmaxθ
[Chevallier & Allassonnière, preprint ‘19]
˜ qk
<latexit sha1_base64="40UCSY71gF6hpiKfKWST7aiH12w=">ACznicjVHLSsNAFD2Nr1pfVZdugkVwVRIt6LgxmUF+4BWSjKd1qHTJCaTQinFrT/gVj9L/AP9C+MKahFdEKSM+ec2fuvX4kRaIc5zVnLS2vrK7l1wsbm1vbO8XdvUYSpjHjdRbKMG75XsKlCHhdCSV5K4q5N/Ilb/rDCx1vjnmciDC4VpOI34y8QSD6gnmKqHZHCdnj07tZd9gtlpyY5a9CNwMlJCtWlh8Qc9hGBIMQJHAEVYwkNCTxsuHETE3WBKXExImDjHDAXypqTipPCIHdJ3QLt2xga01zkT42Z0iqQ3JqeNI/KEpIsJ69NsE09NZs3+lntqcuq7TejvZ7lGxCrcEvuXb678r0/XotDHualBUE2RYXR1LMuSmq7om9tfqlKUISJO4x7FY8LMOd9to0nMbXr3nom/maUmtV7lmlTvOtb0oDdn+NcBI2Tsntadq4qpWolG3UeBzjEMc3zDFVcoa6fgjnvBs1ayxNbPuP6VWLvPs49uyHj4A2kCT/g=</latexit>k → ∞
<latexit sha1_base64="+SPztLGDsSBXAoy1PgIrOiODdQ=">ACz3icjVHLSsNAFD2Nr1pfVZdugkVwVRIt6LgxmUL9gFNkSd1qF5MZkopShu/QG3+lfiH+hfeGdMQS2iE5KcOfeM3Pv9ZKAp9KyXgvGwuLS8kpxtbS2vrG5Vd7eadxJnzW8uMgFl3PTVnAI9aSXAasmwjmhl7AOt74TMU710ykPI4u5CRh/dAdRXzIfVcS5YwdGZsOj4ZyUrosV6yqpZc5D+wcVJCvRlx+gYMBYvjIEIhgiQcwEVKTw82LCTE9TElThDiOs5wixJpM8pilOESO6bviHa9nI1orzxTrfbplIBeQUoTB6SJKU8QVqeZOp5pZ8X+5j3VnupuE/p7uVdIrMQVsX/pZpn/1alaJIY41TVwqinRjKrOz10y3RV1c/NLVZIcEuIUHlBcEPa1ctZnU2tSXbvqravjbzpTsWrv57kZ3tUtacD2z3HOg/ZR1T6uWs1apV7LR13EHvZxSPM8QR3naKBF3gke8YRno2ncGHfG/WeqUcg1u/i2jIcPWcCTwQ=</latexit>Under mild conditions: – Drift property – Small set – Geometric ergodicity uniformly on any compact set of the parameters
– a.s. convergence towards the MAP estimator – Normal asymptotic behaviour: speed – Normal asymptotoc behaviour with optimal speed with averaging sequences
19
1/ p ∆k
1/ √ k
20
The average trajectory of data changes
Gog from ADNI
from MCI to AD
average (min 3, max 11)
with propagation logistic model yij ∈]0, 1[4
[Schiratti et al. IPMI’15, NIPS’15]
[Schiratti et al. IPMI’15, NIPS’15]
Distinguish fast vs. slow progressers Distinguish early vs. late onset individuals
+1σ −1σ
[Schiratti et al. IPMI’15, NIPS’15] Variability in the relative timing and ordering of the events
+1σ −1σ
23
24
True values MCMC-SAEM STAN Monolix
[Schiratti et al. IPMI’15, NIPS’15]
26
27
28
Each snapshot corresponds to the neuronal loss within 5 years (from 70-75 to 85-90) Neuronal loss within 10 years
ADNI Data à Alzheimer’s Disease cohort Measures of the cortical thickness for MCI converters
[Koval et al, Frontiers in Neurosciences’17]
29
[Schiratti et al. IPMI’15, NIPS’15]
t0 t
v0
p0
T0(t) = Expp0,t0(v0)(t)
vi
pij
vij
yij = Ti(ψi(t)) + εij
Ti(t) = ExpT0(t)(PT0
t0,t(vi))
ψi(t) = t0 + αi(t − t0 − τi)
Acceleration factor Time-shift Space-shift
αi ∼ log N(0, σ2
α)
τi ∼ N(0, σ2
τ)
vi = (A1| . . . |AK) si Ak⊥v0 Random effects: Fixed effects: (p0, t0, v0)
(σ2
α, σ2 τ, A1, ...AK)
and
30
[Chevallier et al, NIPS’17]
[Chevallier et al, NIPS’17]
[Chevallier et al, NIPS’17]
35
The individual parameters are related to the real age of conversion of the individuals
36
37
vi
pij
vij
t0 t
v0
p0
T0(t) = Expp0,t0(v0)(t)