Rank-penalized nonnegative spatiotemporal deconvolution and demixing - - PowerPoint PPT Presentation

rank penalized nonnegative spatiotemporal deconvolution
SMART_READER_LITE
LIVE PREVIEW

Rank-penalized nonnegative spatiotemporal deconvolution and demixing - - PowerPoint PPT Presentation

Rank-penalized nonnegative spatiotemporal deconvolution and demixing of calcium imaging data Eftychios A. Pnevmatikakis 1 Joint work with Tim Machado 1 , Logan Grosenick 2 , Ben Poole 2 , Joshua Vogelstein 3 , and Liam Paninski 1 1 Columbia


slide-1
SLIDE 1

Rank-penalized nonnegative spatiotemporal deconvolution and demixing of calcium imaging data

Eftychios A. Pnevmatikakis1 Joint work with Tim Machado1, Logan Grosenick2, Ben Poole2, Joshua Vogelstein3, and Liam Paninski1

1Columbia University 2Stanford University 3Duke University

slide-2
SLIDE 2

Calcium imaging: Opportunities and challenges

◮ Enables imaging of large neuronal ensembles or detailed dendritic structures. ◮ Numerous applications: Neural connectivity, dendritic computation. ◮ Challenges: noisy slow signal, limited temporal resolution, spatial mixing of active

components.

Key problem

Denoise, unmix, and optimally extract spiking signals from spatiotemporal calcium imaging data.

slide-3
SLIDE 3

Spike inference from 1-d fluorescence traces

Nonnegative deconvolution (FOOPSI)

Assume simple autoregressive dynamics: ∆C(t) = −∆t/τC(t − 1) + As(t) Y(t) = C(t) + εt Approximate maximum-a-posteriori (MAP) estimate (Vogelstein et. al. 2010): min

C,s≥0

1 2σ2

  • t

(Y(t) − C(t))2

  • Loss function

+ 1 Aλ

  • t=1

s(t)

  • Spike sparsity

penalty

◮ Outperforms optimal linear (Wiener) filter (nonnegativity is important). ◮ Other approaches: "Peeling" (Grewe et. al. 2010).

slide-4
SLIDE 4

Spike inference from 1-d fluorescence traces

Nonnegative deconvolution (FOOPSI)

Assume simple autoregressive dynamics: ∆C(t) = −∆t/τC(t − 1) + As(t) Y(t) = C(t) + εt Approximate maximum-a-posteriori (MAP) estimate (Vogelstein et. al. 2010): min

C,s≥0

1 2σ2

  • t

(Y(t) − C(t))2

  • Loss function

+ 1 Aλ

  • t=1

s(t)

  • Spike sparsity

penalty

◮ Outperforms optimal linear (Wiener) filter (nonnegativity is important). ◮ Other approaches: "Peeling" (Grewe et. al. 2010).

Our goal

Extend to the high-d setup with multiple (possible overlapping) neurons by sharing information across different pixels optimally.

slide-5
SLIDE 5

Single neuron spatiotemporal fluorescence contribution

Spikes from neuron 1: s1(t)

Neuron 1 Spikes

slide-6
SLIDE 6

Single neuron spatiotemporal fluorescence contribution

Spikes from neuron 1: s1(t) Calcium trace from neuron 1:

C1(t) =

p

  • j=1

γjC1(t − j) + s1(t)

Neuron 1 Spikes Neuron 1 Trace

slide-7
SLIDE 7

Single neuron spatiotemporal fluorescence contribution

Spikes from neuron 1: s1(t) Calcium trace from neuron 1:

C1(t) =

p

  • j=1

γjC1(t − j) + s1(t)

Neuron 1 location: a1

Neuron 1 Spikes Neuron 1 Trace Neuron 1 Location

slide-8
SLIDE 8

Single neuron spatiotemporal fluorescence contribution

Spikes from neuron 1: s1(t) Calcium trace from neuron 1:

C1(t) =

p

  • j=1

γjC1(t − j) + s1(t)

Neuron 1 location: a1

Neuron 1 Spikes Neuron 1 Trace Neuron 1 Location Spatiotemporal fluorescence contribution of neuron 1 Time Pixel

Neuron 1 spatiotemporal fluorescence: F1(t) = a1C1(t)

slide-9
SLIDE 9

Single neuron spatiotemporal fluorescence contribution

Spikes from neuron 2: s2(t) Calcium trace from neuron 2:

C2(t) =

p

  • j=1

γjC2(t − j) + s2(t)

Neuron 2 location: a2

Neuron 2 Spikes Neuron 2 Trace Neuron 2 Location Spatiotemporal fluorescence contribution of neuron 2 Time Pixel

Neuron 2 spatiotemporal fluorescence: F2(t) = a2C2(t)

slide-10
SLIDE 10

Multi-d Case

Spatiotemporal dynamics are of low rank

Key observation

Each neuron contributes a rank-1 term to the spatiotemporal matrix. A C

T N d

X F =

d T N

d: # of pixels. T: # of timesteps. N: # of neurons.

◮ rank(F) ≤ N ≪ d, T (degrees of freedom (d + T)N ≪ dT)

Ci(t) =

p

  • j=1

γjCi(t − j) + si(t) CGT = S Neuron spiking F(t) =

N

  • i=1

aiCi(t) F = AC Noiseless image Y(t) = F(t) + ε, Y = F + E Noisy observations

slide-11
SLIDE 11

Multi-d Case

Rank-penalized denoising

◮ Plain SVD/PCA methods to exploit the low rank structure do not retain

nonnegativity of the spikes.

slide-12
SLIDE 12

Multi-d Case

Rank-penalized denoising

◮ Plain SVD/PCA methods to exploit the low rank structure do not retain

nonnegativity of the spikes.

◮ Instead we want to penalize the rank in an structured way:

minimize

F: FGT ≥ 0

  • Nonnegative spikes

L(Y, F)

  • Quadratic loss function

+ λ1FGT 1

  • Spike sparsity penalty

+ λNNF∗

  • Rank penalty

◮ The nuclear norm (NN) · ∗ (sum of the singular values) is the convex relaxation

  • f the rank(·) function.
slide-13
SLIDE 13

Multi-d Case

Rank-penalized denoising

◮ Plain SVD/PCA methods to exploit the low rank structure do not retain

nonnegativity of the spikes.

◮ Instead we want to penalize the rank in an structured way:

minimize

F: FGT ≥ 0

  • Nonnegative spikes

L(Y, F)

  • Quadratic loss function

+ λ1FGT 1

  • Spike sparsity penalty

+ λNNF∗

  • Rank penalty

◮ The nuclear norm (NN) · ∗ (sum of the singular values) is the convex relaxation

  • f the rank(·) function.

◮ The NN penalty (ideally) shrinks the components due to noise to zero, and

provides a robust estimate on the underlying number of neurons.

◮ The NN convex relaxation method has been used successfully in many machine

learning applications (e.g. low rank matrix completion (Candès and Recht, 2009)).

slide-14
SLIDE 14

Spatiotemporal deconvolution and demixing

Nonnegative Matrix Factorization (NMF)

We can estimate the spatial and temporal components sequentially using alternating matrix regression: Ck+1 = arg min

C:CGT ≥0

L(Y, AkC)

  • Loss function

+ λCGT 1

  • Spike sparsity

Spike deconvolution Ak+1 = arg min

A:A≥0

L(Y, ACk+1)

  • Loss function

+ λ1A1

Component sparsity

Component demixing

◮ The spatial component is initialized from the nuclear norm penalized solution

using clustering methods.

◮ The sparsity penalties can be consistently estimated using AIC like criteria.

slide-15
SLIDE 15

Results: Single-neuron patch

◮ GCaMP6S data, spinal cord neuron, in vitro. ◮ Antidromic stimulation (spike times are known).

slide-16
SLIDE 16

Results: Spinal cord synchronized neurons

◮ GCaMP3 data, spinal cord neurons, in vitro.

slide-17
SLIDE 17

Results: In vivo cortical dendritic data

(rodent barrel cortex, data from Clay Lacefield and Randy Bruno)

slide-18
SLIDE 18

Additional details and extensions

Bayesian methods

◮ Use sampling methods to quantify uncertainty and estimate parameters. ◮ Can be done efficiently (in just O(T) time) in the low-SNR regime, using an

augmented block-Gibbs sampler (Martens and Sutskever 2010).

Compressive fluorescence imaging

◮ Our methods are applicable to “scanless" imaging approaches that can achieve

higher imaging rates (Nikolenko et. al. 2008).

◮ Using compressed sensing ideas, only a few measurements are sufficient.

Parameter estimation

The parameters are estimated using a method-of-moments approach on the autoregressive dynamics.

slide-19
SLIDE 19

Conclusions

Structured approaches to calcium imaging denoising problems

◮ Modern statistical approaches provide flexible and powerful methods for dealing

with spatiotemporal Ca2+ imaging denoising problems in a structured way.

◮ Convex optimization leads to efficient MAP inference and can be used to initialize

computationally more intensive Bayesian methods (Gibbs sampling, particle filtering).

◮ More broadly, the rapid development of novel experimental methods provides

many new challenges and opportunities for breakthroughs based on statistical ideas.

slide-20
SLIDE 20

Thank you - Questions?

slide-21
SLIDE 21

Extensions: Fully Bayesian approaches

Beyond MAP inference

◮ Use sampling methods to quantify

uncertainty and estimate parameters.

◮ Can be done efficiently (in just O(T) time)

in the low-SNR regime, using an augmented block-Gibbs sampler (Martens and Sutskever 2010).

◮ Can introduce intermediate timebins to

achieve spike superresolution.

◮ Challenge: Perform tractable fully

Bayesian inference for the general spatiotemporal case.

0.1 0.2 0.3 0.4 0.5

Calcium trace estimate

Raw Data Standard Error Mean Gibbs MAP

MAP estimate of Spikes

200 400 600 800 1000 1200 1400

Gibbs probabilities of spikes

Frame 0.02 0.03 0.04 0.05 0.06 0.07

Spike amplitude

0.02 0.04 0.06 0.08 0.1

Baseline Repetition #

0.1 0.15 0.2 Mean spike probability

Figure : GCaMP3 data, spinal cord neuron in vitro, antidromic stimulation.

slide-22
SLIDE 22

Extensions: Compressive fluorescence imaging

◮ Compressive fluorescence imaging, e.g.

spatial light modulator (SLM) microscopy (Nikolenko et. al. 2008), takes generalized linear measurements at any frame, allowing for faster imaging rates.

◮ Exploit spike sparsity.

# of measurements ≪ # of neurons In high-SNR: nt ∼ O(pN log(1/p))

◮ Potential to dramatically increase the

number of neurons we can record from given a fixed number of measurements.

◮ Challenge: Accurately estimate the model

parameters and apply to real data. Simulated data, 50 neurons.

slide-23
SLIDE 23

Parameter estimation

γj and σ2 are estimated from the autoregressive statistics

◮ For τ > p it holds for the autocovariance CY :

CY (τ) =

p

  • j=1

γjCY (τ − j).

◮ For 0 < τ ≤ p, becomes

CY (τ) =

p

  • j=1

γjCY (τ − j) − σ2γτ. To estimate b and Aλ use E(y) = b + Aλ 1 − γj , and the approximate relation σ ∝ b.