

  1. Statistical methods for understanding neural coding and dynamics. Liam Paninski, Department of Statistics and Center for Theoretical Neuroscience, Columbia University. http://www.stat.columbia.edu/~liam — liam@stat.columbia.edu — November 18, 2010. Support: NIH/NSF CRCNS, Sloan Fellowship, NSF CAREER, McKnight Scholar award.

  2. The coming statistical neuroscience decade. Some notable recent developments:
  • machine learning / statistics methods for extracting information from high-dimensional data in a computationally tractable, systematic fashion
  • computing (Moore's law, massively parallel computing)
  • optical methods (e.g., two-photon imaging, FLIM) and optogenetics (channelrhodopsin, viral tracers, Brainbow)
  • high-density multielectrode recordings (Litke's 512-electrode retinal readout system; Shepard's 65,536-electrode active array)

  3. Some exciting open challenges:
  • inferring biophysical neuronal properties from noisy recordings
  • reconstructing the full dendritic spatiotemporal voltage from noisy, subsampled observations
  • estimating subthreshold voltage given superthreshold spike trains
  • extracting spike timing from slow, noisy calcium imaging data
  • reconstructing presynaptic conductance from postsynaptic voltage recordings
  • inferring connectivity from large populations of spike trains
  • decoding behaviorally relevant information from spike trains
  • optimal control of neural spike timing
  To solve these, we need to combine the two classical branches of computational neuroscience: dynamical systems and neural coding.

  4. Part 1: modeling correlated spiking in retina Preparation: dissociated macaque retina — extracellularly-recorded responses of populations of RGCs Stimulus: random spatiotemporal visual stimuli (Pillow et al., 2008)

  5. Receptive fields tile visual space

  6. Multineuronal point-process model
  [model schematic: stimulus filter, nonlinearity, probabilistic spiking, and post-spike/coupling filters]
  \lambda_i(t) = f\Big( b_i + \vec{k}_i \cdot \vec{x}(t) + \sum_{i',j} h_{i',j}\, n_{i'}(t - j) \Big)
  — likelihood is easy to compute and to maximize (concave optimization) (Paninski, 2004; Paninski et al., 2007; Pillow et al., 2008)
  — close connections to the noisy integrate-and-fire model
  — captures spike-timing precision and details of spatiotemporal correlations in the retinal ganglion cell network
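
A minimal sketch of how a model of this form can be fit, assuming an exponential nonlinearity f = exp so that the Poisson log-likelihood is concave in the filter parameters. This is a generic GLM fit, not the authors' code; the variable names and toy data are illustrative.

```python
import numpy as np
from scipy.optimize import minimize

def neg_log_lik(theta, X, n):
    """Negative log-likelihood (and gradient) of a discrete-time
    Poisson GLM: n_t ~ Poisson(exp(X_t . theta)), where X holds the
    stimulus and spike-history covariates in each time bin.
    With f = exp the log-likelihood is concave, so the optimizer's
    local optimum is the global maximum-likelihood fit."""
    u = X @ theta
    lam = np.exp(u)
    ll = n @ u - lam.sum()        # up to a theta-free constant
    grad = X.T @ (n - lam)
    return -ll, -grad

# toy data: T bins, d covariates (illustrative sizes)
rng = np.random.default_rng(0)
T, d = 5000, 10
X = rng.standard_normal((T, d))
theta_true = 0.3 * rng.standard_normal(d)
n = rng.poisson(np.exp(X @ theta_true))

res = minimize(neg_log_lik, np.zeros(d), args=(X, n),
               jac=True, method="L-BFGS-B")
print("max filter error:", np.abs(res.x - theta_true).max())
```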

  7. Reconsidering the model

  8. Considering common input effects — universal problem in network analysis: can’t observe all neurons!

  9. Extension: including common input effects

  10. Direct state-space optimization methods. To fit parameters, optimize the approximate marginal likelihood:
  \log p(\text{spikes} \mid \theta) = \log \int p(Q \mid \theta)\, p(\text{spikes} \mid \theta, Q)\, dQ
  \approx \log p(\hat{Q}_\theta \mid \theta) + \log p(\text{spikes} \mid \hat{Q}_\theta) - \frac{1}{2} \log |J_{\hat{Q}_\theta}|,
  \qquad \hat{Q}_\theta = \arg\max_Q \big\{ \log p(Q \mid \theta) + \log p(\text{spikes} \mid Q) \big\}
  — Q is a very high-dimensional latent (unobserved) "common input" term, taken here to be a Gaussian process with autocorrelation time ≈ 5 ms (Khuc-Trong and Rieke, 2008).
  — correlation strength is specified by one parameter per cell pair.
  — all terms can be computed in O(T) via banded matrix methods (Paninski et al., 2010).
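
The banded O(T) trick can be illustrated with a toy version of this setup: a scalar AR(1) Gaussian latent (tridiagonal prior precision) driving Poisson observations, so the posterior Hessian stays tridiagonal and each Newton step is a linear-time sparse solve. A sketch of the computational idea only, not the paper's multi-cell model; all parameter values are illustrative.

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import spsolve

def map_latent(n, rho=0.9, sigma=0.5, n_newton=25):
    """MAP estimate of an AR(1) latent Q_t given Poisson counts
    n_t ~ Poisson(exp(Q_t)), via Newton's method.

    The AR(1) prior Q_{t+1} = rho*Q_t + sigma*eps_t has a tridiagonal
    precision matrix J; the Poisson likelihood adds only a diagonal
    term, so the posterior Hessian is tridiagonal and each Newton
    solve below costs O(T)."""
    T = len(n)
    s2 = sigma ** 2
    main = np.full(T, (1 + rho ** 2) / s2)
    main[0] = main[-1] = 1 / s2          # stationary boundary terms
    off = np.full(T - 1, -rho / s2)
    J = sp.diags([off, main, off], [-1, 0, 1], format="csc")
    Q = np.zeros(T)
    for _ in range(n_newton):
        lam = np.exp(Q)
        grad = (n - lam) - J @ Q         # gradient of the log-posterior
        H = (J + sp.diags(lam)).tocsc()  # minus the Hessian: tridiagonal
        Q = Q + spsolve(H, grad)
    return Q

# toy usage: recover a smooth latent from its spike counts
rng = np.random.default_rng(1)
T = 2000
Q_true = np.zeros(T)
for t in range(T - 1):
    Q_true[t + 1] = 0.9 * Q_true[t] + 0.5 * rng.standard_normal()
Q_hat = map_latent(rng.poisson(np.exp(Q_true)))
```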

  11. Inferred common input effects are strong
  [figure: inferred common input, direct coupling input, stimulus input, and refractory input traces over 1000 ms, with the observed spikes]
  — note that inferred direct coupling effects are now relatively small.

  12. Common-input-only model captures cross-correlations — single-cell and triple-cell activities are captured well, too (Vidne et al., 2009)

  13. Inferring cone locations — cone locations and color identity can be inferred accurately with high-spatial-resolution stimuli via maximum a posteriori estimates (Field et al., 2010).

  14. Inferring cone-to-RGC effective connectivity

  15. Part 2: applications to cortex

  16. Model-based estimation of spike rates Note: each component here can be generalized easily (Vogelstein et al., 2009).

  17. Fast maximum a posteriori (MAP) filter. Start by writing out the posterior:
  \log p(C \mid F) = \log p(C) + \log p(F \mid C) + \text{const.} = \sum_t \log p(C_{t+1} \mid C_t) + \sum_t \log p(F_t \mid C_t) + \text{const.}
  Three basic observations:
  • If \log p(C_{t+1} \mid C_t) and \log p(F_t \mid C_t) are concave, then so is \log p(C \mid F).
  • The Hessian H of \log p(C \mid F) is tridiagonal: \log p(F_t \mid C_t) contributes a diagonal term, and \log p(C_{t+1} \mid C_t) contributes a tridiagonal term (Paninski et al., 2010).
  • C is a linear function of n.
  Newton's method: iteratively solve H C_{dir} = \nabla. The tridiagonal solve requires only O(T) time, and the nonnegativity constraint n_t ≥ 0 can be included (Koyama and Paninski, 2009).
  — Two orders of magnitude faster than the particle filter: can process data from ≈ 100 neurons in real time on a laptop (Vogelstein et al., 2010).
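
A compact sketch of these observations in code, assuming Gaussian fluorescence noise, linear calcium decay C_{t+1} = gamma*C_t + n_t, and an exponential prior on spike amplitudes, with the constraint n_t ≥ 0 handled by a log-barrier. This is a toy reimplementation of the idea, not the published fast filter; all parameter values are illustrative.

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import spsolve

def fast_nonneg_map(F, gamma=0.95, sig=0.2, lam=1.0):
    """Interior-point sketch of the fast nonnegative MAP filter.

    Model (illustrative): C_{t+1} = gamma*C_t + n_t with n_t >= 0,
    and F_t = C_t + sig*eps_t. Writing n = D C for a bidiagonal
    difference operator D, the barrier objective has Hessian
    I/sig^2 + mu * D' diag(1/n^2) D, which is tridiagonal, so each
    Newton step below is an O(T) sparse solve."""
    T = len(F)
    D = sp.diags([np.ones(T), -gamma * np.ones(T - 1)], [0, -1],
                 format="csc")
    C = spsolve(D, np.ones(T))            # feasible start: n = D C = 1 > 0
    for mu in 10.0 ** np.arange(0, -7, -1):   # shrink the barrier weight
        for _ in range(20):                   # damped Newton steps
            n = D @ C
            grad = (C - F) / sig**2 + D.T @ (lam - mu / n)
            H = (sp.eye(T) / sig**2
                 + mu * (D.T @ sp.diags(1.0 / n**2) @ D)).tocsc()
            d = spsolve(H, -grad)
            step = 1.0                        # backtrack to keep n > 0
            while np.any(D @ (C + step * d) <= 0):
                step *= 0.5
            C = C + step * d
    return C, D @ C                        # MAP calcium and spike estimates

# toy usage: simulate fluorescence, then filter it
rng = np.random.default_rng(5)
spikes = (rng.random(1000) < 0.02).astype(float)
calcium = np.zeros(1000)
for t in range(999):
    calcium[t + 1] = 0.95 * calcium[t] + spikes[t + 1]
F = calcium + 0.2 * rng.standard_normal(1000)
C_hat, n_hat = fast_nonneg_map(F)
```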

  18. Example: nonnegative MAP filtering — nonnegative deconvolution is a recurring problem (Vogelstein et al., 2010) (e.g., deconvolution of PSPs in intracellular recordings (Paninski et al., 2010))
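
For the generic version of the recurring problem, a standard nonnegative least-squares solver already goes a long way. A toy example deconvolving nonnegative event weights through an assumed causal exponential-decay kernel; the kernel shape, time constant, and event times are made up for illustration.

```python
import numpy as np
from scipy.optimize import nnls

# toy deconvolution: y = K w + noise with w >= 0, where K is a
# causal exponential-decay kernel (illustrative shape and tau)
T, tau = 200, 5.0
t = np.arange(T)
K = np.tril(np.exp(-(t[:, None] - t[None, :]) / tau))
w_true = np.zeros(T)
w_true[[30, 90, 95, 150]] = [1.0, 0.5, 0.8, 1.2]
rng = np.random.default_rng(2)
y = K @ w_true + 0.05 * rng.standard_normal(T)
w_hat, residual_norm = nnls(K, y)   # nonnegative least squares
```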

  19. Simulated circuit inference
  [figure: inferred vs. actual connection weights under a sparse prior, with histograms of the positive, negative, and zero weights]
  — conductance-based integrate-and-fire networks with biologically plausible connectivity matrices, imaging speed, and SNR (Mishchenko et al., 2009). Good news: MAP connections are inferred with the correct sign, in just a couple of minutes of compute time, if the network is fully observed. Current work focuses on improved Monte Carlo sampling methods, to better quantify uncertainty due to unobserved neurons (Mishchenko and Paninski, 2010).
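
The sparse-prior idea can be sketched with a generic L1-penalized regression: when most candidate connections are absent, an L1 penalty recovers the signed nonzero weights from activity data. This stand-in uses plain linear-Gaussian data rather than the conductance-based integrate-and-fire model of the slide; all sizes and the penalty value are illustrative.

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(3)
N, T = 50, 2000
activity = rng.standard_normal((T, N))   # presynaptic activity per bin
w_true = np.zeros(N)                      # sparse, signed true weights
idx = rng.choice(N, size=8, replace=False)
w_true[idx] = rng.choice([-1.0, 1.0], size=8) * rng.uniform(0.5, 1.0, size=8)
y = activity @ w_true + 0.5 * rng.standard_normal(T)

# L1 penalty shrinks absent connections exactly to zero
w_hat = Lasso(alpha=0.05).fit(activity, y).coef_
print("signs recovered on true edges:",
      np.mean(np.sign(w_hat[idx]) == np.sign(w_true[idx])))
```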

  20. Optimal control of spike timing. To test our results, we want to perturb the network at will. How can we make a neuron fire exactly when we want it to? Assume bounded inputs; otherwise the problem is trivial. Start with a simple model:
  \lambda_t = f(V_t + h_t), \qquad V_{t+dt} = V_t + dt\,(-g V_t + a I_t) + \sqrt{dt}\,\sigma\epsilon_t, \qquad \epsilon_t \sim N(0, 1).
  Now we can just optimize the likelihood of the desired spike train as a function of the input I_t, with I_t bounded. A concave objective function over a convex set of possible inputs, plus a tridiagonal Hessian, gives O(T) optimization.
  — again, can be done in real time (Ahmadian et al., 2010).
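
A sketch of this optimization under simplifying assumptions: the deterministic (mean) voltage path is used, the spike-history term h_t is dropped, and f = exp, so the objective is concave in the bounded input I_t (V is linear in I). All constants are illustrative.

```python
import numpy as np
from scipy.optimize import minimize

def neg_log_lik(I, target, dt=1.0, g=0.1, a=1.0):
    """Negative log-likelihood of the target spike train as a
    function of the input current I, using the mean voltage path
    of the soft-threshold integrate-and-fire model with f = exp."""
    T = len(I)
    V = np.zeros(T)
    for t in range(T - 1):
        V[t + 1] = V[t] + dt * (-g * V[t] + a * I[t])
    return -(target @ V - dt * np.exp(V).sum())

T = 200
target = np.zeros(T)
target[[50, 100, 150]] = 1.0          # desired spike times

# bounded-input optimization: concave objective over a convex box
res = minimize(neg_log_lik, np.zeros(T), args=(target,),
               method="L-BFGS-B", bounds=[(-1.0, 1.0)] * T)
I_opt = res.x                          # the bounded optimal stimulus
```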

  21. Simulated electrical control of spike timing
  [figure: target response, optimal stimulus, and evoked response over 800 ms]

  22. Example: intracellular control of spike timing
  [figure: target spikes and evoked responses for decreasing current bounds I_max = 2.04, 1.76, 1.26, 0.76, over 500 ms] (Ahmadian et al., 2010)

  23. Optical conductance-based control of spiking
  V_{t+dt} = V_t + dt\,\big( -g V_t + g^i_t (V^i - V_t) + g^e_t (V^e - V_t) \big) + \sqrt{dt}\,\sigma\epsilon_t, \qquad \epsilon_t \sim N(0, 1)
  g^i_{t+dt} = g^i_t - dt\, g^i_t/\tau^i + a_{ii} L^i_t + a_{ie} L^e_t; \qquad g^e_{t+dt} = g^e_t - dt\, g^e_t/\tau^e + a_{ei} L^i_t + a_{ee} L^e_t
  [figure: target spike train, voltage trace, excitatory and inhibitory light intensities, and induced spike trains over 200 ms]
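
A forward Euler simulation of this model is straightforward; this sketch (with illustrative reversal potentials, time constants, and light-to-conductance gains, and voltages measured relative to rest) is the kind of generative model the control optimization above would be run against.

```python
import numpy as np

def simulate(L_e, L_i, dt=0.1, g=0.1, V_rev_e=60.0, V_rev_i=-20.0,
             tau_e=5.0, tau_i=5.0, a_ee=0.1, a_ii=0.1,
             a_ei=0.0, a_ie=0.0, sigma=0.5, seed=0):
    """Euler simulation of the optically driven conductance model.
    L_e, L_i are the excitatory/inhibitory light intensity series;
    a_ei, a_ie are cross-talk gains. All values are illustrative."""
    rng = np.random.default_rng(seed)
    T = len(L_e)
    V = np.zeros(T)
    g_e = np.zeros(T)
    g_i = np.zeros(T)
    for t in range(T - 1):
        drift = (-g * V[t] + g_i[t] * (V_rev_i - V[t])
                 + g_e[t] * (V_rev_e - V[t]))
        V[t + 1] = (V[t] + dt * drift
                    + np.sqrt(dt) * sigma * rng.standard_normal())
        g_e[t + 1] = g_e[t] - dt * g_e[t] / tau_e + a_ee * L_e[t] + a_ei * L_i[t]
        g_i[t + 1] = g_i[t] - dt * g_i[t] / tau_i + a_ii * L_i[t] + a_ie * L_e[t]
    return V, g_e, g_i

# usage: a brief excitatory light pulse depolarizes the cell
L_e = np.zeros(2000); L_e[500:520] = 1.0
L_i = np.zeros(2000)
V, g_e, g_i = simulate(L_e, L_i)
```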

  24. Part 3: spatiotemporal filtering on dendrites Spatiotemporal imaging data opens an exciting window on the computations performed by single neurons, but we have to deal with noise and intermittent observations. (Djurisic et al., 2004; Knopfel et al., 2006)

  25. Basic paradigm: the Kalman filter. The variable of interest, q_t, evolves according to a noisy differential equation (Markov process): dq/dt = f(q_t) + \epsilon_t. We make noisy observations: y_t = g(q_t) + \eta_t. We want to infer E(q_t \mid Y): the optimal estimate given the observations. Problem: the Kalman filter requires O(d^3 T) time, where d = \dim(q). Reduction to O(dT): exploit the tree structure of the dendrite (Paninski, 2010). Can be applied to voltage- or calcium-sensitive imaging data (Pnevmatikakis et al., 2010).
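
For reference, the standard Kalman recursion looks like this in the linear-Gaussian case; each step involves a d x d inverse, which is the O(d^3) per-step cost that the tree-structured methods cited above avoid. A generic textbook sketch, not the paper's implementation; the toy model at the bottom is illustrative.

```python
import numpy as np

def kalman_filter(Y, A, C, Q, R, mu0, S0):
    """Standard Kalman filter for q_{t+1} = A q_t + w_t,
    y_t = C q_t + v_t, with w ~ N(0, Q), v ~ N(0, R).
    Returns the filtered means E(q_t | y_1..y_t)."""
    mu, S = mu0.copy(), S0.copy()
    d = len(mu0)
    means = []
    for y in Y:
        mu, S = A @ mu, A @ S @ A.T + Q                 # predict
        K = S @ C.T @ np.linalg.inv(C @ S @ C.T + R)    # Kalman gain
        mu = mu + K @ (y - C @ mu)                      # update mean
        S = (np.eye(d) - K @ C) @ S                     # update covariance
        means.append(mu)
    return np.array(means)

# toy usage: 2-D latent, noisy observation of its first coordinate
A = np.array([[0.99, 0.1], [0.0, 0.95]])
C = np.array([[1.0, 0.0]])
Q, R = 0.01 * np.eye(2), np.array([[0.5]])
rng = np.random.default_rng(4)
q, Y = np.zeros(2), []
for _ in range(500):
    q = A @ q + rng.multivariate_normal(np.zeros(2), Q)
    Y.append(C @ q + rng.multivariate_normal(np.zeros(1), R))
est = kalman_filter(Y, A, C, Q, R, np.zeros(2), np.eye(2))
```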

  26. Example: inferring voltage from subsampled observations [video: low-rank-speckle.mp4]

  27. Example: summed observations [video: low-rank-horiz.mp4]

  28. Applications
  • Optimal experimental design: which parts of the neuron should we image? (Huggins and Paninski, 2010)
  • Estimation of biophysical parameters (e.g., membrane channel densities, axial resistance): reduces to a simple nonnegative regression problem once V(x, t) is known (Huys et al., 2006)
  • Detecting the locations and weights of synaptic inputs (Huggins and Paninski, 2011)

  29. Application: synaptic locations/weights
