Computa(on through dynamics Using recurrent neural networks to - - PowerPoint PPT Presentation

computa on through dynamics
SMART_READER_LITE
LIVE PREVIEW

Computa(on through dynamics Using recurrent neural networks to - - PowerPoint PPT Presentation

Computa(on through dynamics Using recurrent neural networks to unveil mechanism in neural circuits David Sussillo with Valerio Mante and Bill Newsome Table of contents Introduc(on Training recurrent neural networks(RNNs) Understanding how


slide-1
SLIDE 1

Computa(on through dynamics

Using recurrent neural networks to unveil mechanism in neural circuits David Sussillo with Valerio Mante and Bill Newsome

slide-2
SLIDE 2

Table of contents Introduc(on Training recurrent neural networks(RNNs) Understanding how RNNs work Contextual decision making Future direc(ons

slide-3
SLIDE 3

Complex behavior

  • Complex neural data

Foster et al. IEEE EMBS 2012

slide-4
SLIDE 4

spikes

−400 −200 200 400 600 800 5 10 15 20 25

(me (ms) (me (ms) rate (spikes / sec)

−400 −200 200 400 600 800 10 20 30 40 50

firing rate trials

slide-5
SLIDE 5

(me (ms) firing rates of many neurons

400 800 −400

(me (ms) a few principal components

What are the biophysical correlates of these variables?

I work at the level of rates because we can make networks do interes(ng computa(ons!

slide-6
SLIDE 6

Recurrent Neural Networks (RNNs)

time (ms)

−400 −200 200 400 600 800 10 20 30 40 50

firing rate

slide-7
SLIDE 7

Recurrent Neural Networks (RNNs)

... time (ms) rate (spikes / sec)

−400 −200 200 400 600 800 10 20 30 40 50

firing rate

slide-8
SLIDE 8

Recurrent Neural Networks (RNNs) (me (me

...

nonlinear distributed feedback

time (ms) rate (spikes / sec)

−400 −200 200 400 600 800 10 20 30 40 50

firing rate

slide-9
SLIDE 9

Sompolinsky et al., PRL 1988 Rajan et al., PRE 2010

...

Dynamics in RNNs (Spontaneous Ac(vity)

(me (ms)

slide-10
SLIDE 10

Tools to understand how RNNs work

Sussillo* & Barak*, Neural Computa(on 2013

with Omri Barak

slide-11
SLIDE 11

Martens & Sutskever, ICML 2011

How does a sine-wave generator work?

Time Output Frequency

Sussillo* & Barak*, Neural Computa(on 2013

slide-12
SLIDE 12

Time Output Frequency

PC1 PC3 PC2

slide-13
SLIDE 13

What is a fixed point?

Why are they important?

A B C D E F

Any nonlinear dynamical system (e.g. neural circuit) Zero “mo(on”

Sussillo* & Barak*, Neural Computa(on 2013

firing rate 1 firing rate 2

slide-14
SLIDE 14

PC1 PC3 PC2

Time Output Frequency D

slide-15
SLIDE 15

PC1 PC3 PC2

10 20 30 40 50 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6 Fixed Point # Frequency (radians)

Oscilla(on # Frequency (radians)

Input frequency Linear system frequency

The linear system is a very good approxima(on!

Sussillo* & Barak*, Neural Computa(on 2013

slide-16
SLIDE 16

Recurrent neural networks are a natural model class for modeling cor(cal phenomenon: dynamical, nonlinear, distributed. Recent advances have enabled the training of RNNs. In “simple” cases, one can understand how an RNN implements its computa(on in the language of dynamical systems (e.g. fixed points, saddle points,

  • scilla(ons).

One simple descrip(on of an RNN is as a bunch of linear systems (ling the state space.

Conclusions from technical part

slide-17
SLIDE 17

Contextual decision making (data) with Valerio Mante and Bill Newsome

Mante*, Sussillo*, Shenoy & Newsome

slide-18
SLIDE 18

Decision( Sensory( s,muli( Context(

Prefrontal cortex contributes to flexibility of decisions

A2end relevant s4muli Ignore irrelevant s4muli Suppress inappropriate responses Represent context

Computa(ons in cor(cal circuits are flexible

slide-19
SLIDE 19

Context-dependent ga(ng in monkeys

mo7on context

slide-20
SLIDE 20

Context-dependent ga(ng in monkeys

color context

slide-21
SLIDE 21
slide-22
SLIDE 22
slide-23
SLIDE 23

S(muli

slide-24
SLIDE 24

S(muli

slide-25
SLIDE 25

S(muli

slide-26
SLIDE 26

S(muli

slide-27
SLIDE 27

mo(on strength color strength

Averaging over color shows effects of mo(on “Average over”

slide-28
SLIDE 28

mo(on strength color strength

Averaging over mo(on shows effects of color “Average over”

slide-29
SLIDE 29

Behavior

mo(on strength color strength mo(on strength color strength

slide-30
SLIDE 30

Behavior

slide-31
SLIDE 31

COLOR% (V4,IT)(

Where are sensory inputs selected?

MOTION& (MT)%

Sensory evidence DECISION( (LIP,PFC,SC)* Integrated evidence

One could easily frame this work in the context

  • f routing information in the brain.
slide-32
SLIDE 32

Mixed signals in FEF neurons

slide-33
SLIDE 33

0.1 0.1 choice choice motion motion color color

Mixed signals in FEF neurons

Verbal aside on how to make sense of this data via a state-space.

slide-34
SLIDE 34

mo#on% color% choice%

PFC popula(on response during mo(on context

“dots on” to “dots off” (750ms) Correct trials only!

slide-35
SLIDE 35

mo#on% color% choice%

choice% le,% choice% right% N">"250" 50ms"

PFC popula(on response during mo(on context

  • n
  • ff
  • ff

“dots on” to “dots off” (750ms)

slide-36
SLIDE 36

mo#on% color% choice%

  • ff%
  • ff%

choice% le-% choice% right%

PFC popula(on response during mo(on context

i n t e g r a (

  • n

*

mo#on%

slide-37
SLIDE 37

mo#on% color% choice%

  • n%
  • ff%
  • ff%

choice% le-% choice% right%

mo#on% color% choice%

PFC popula(on response during mo(on context

slide-38
SLIDE 38

PFC popula(on response during mo(on context

slide-39
SLIDE 39

mo#on% color% choice%

choice% le,% choice% right%

PFC popula(on response during mo(on context color%

slide-40
SLIDE 40

mo#on% color% choice%

PFC popula(on response during color context

  • n
  • ff
  • ff

choice& le(& choice& right&

slide-41
SLIDE 41

PFC popula(on response during color context

slide-42
SLIDE 42

PFC popula(on response during color context

mo#on% color% choice%

slide-43
SLIDE 43

Choice and input signals in PFC

Color towards Target 1 Color towards Target 2 Motion towards Target 1 Motion towards Target 2 Choice Target 2 Choice Target 1

slide-44
SLIDE 44

Representa(on of context in PFC

Mo#on%context%

Color%context%

context' mo)on' choice'

slide-45
SLIDE 45

Motion context Color context

color choice color choice motion motion motion motion context (4D)

The structure of task related signals in PFC

slide-46
SLIDE 46

The$structure$of$task$related$signals$in$PFC$

Motion context Color context

color c h

  • i

c e color c h

  • i

c e m

  • t

i

  • n

m

  • t

i

  • n

m

  • t

i

  • n

m

  • t

i

  • n

How does selec(ve integra(on occur?

slide-47
SLIDE 47

The$structure$of$task$related$signals$in$PFC$

Motion context Color context

choice motion choice color

Context'dependent* ga#ng*(“a.en/on”)*

How does selec(ve integra(on occur?

slide-48
SLIDE 48

The$structure$of$task$related$signals$in$PFC$

Motion context Color context

color choice motion choice motion color

Context'dependent* input*direc.on*

How does selec(ve integra(on occur?

slide-49
SLIDE 49

The$structure$of$task$related$signals$in$PFC$

Motion context Color context

color choice color motion motion choice

Context'dependent* choice*direc.on*

How does selec(ve integra(on occur?

slide-50
SLIDE 50

How$does$selec*ve$integra*on$occur?$

Motion context Color context

color choice color choice m

  • t

i

  • n

m

  • t

i

  • n

m

  • t

i

  • n

m

  • t

i

  • n

The structure of task related signals in PFC How does selec(ve integra(on occur?

slide-51
SLIDE 51

Task-relevant variables are mixed in the responses of single neurons, but separable and systema(cally represented in the popula(on. Irrelevant inputs are not filtered-out. Selec(on of relevant inputs occurs late, possibly within PFC. Sensory inputs elicit popula(on responses that differ from those corresponding to a choice. The direc(ons of choice and of the inputs are largely independent of context (only shii in state space)

Conclusions from data

slide-52
SLIDE 52

Contextual decision making (model)

How could selec(ve integra(on occur?

slide-53
SLIDE 53

model

guess mechanism

task Tradi(onal Modeling Framework model data system data

slide-54
SLIDE 54

guessed mechanism task Tradi(onal Modeling Framework

But what should the solu(ons look like? Are we too clever? Not clever enough?

system

slide-55
SLIDE 55

task Op(mized Modeling Framework

  • p(mized model

model data system data discover mechanism

slide-56
SLIDE 56

discovered mechanism task Op(mized Modeling Framework system

Fetz, 1993 Zipser & Andersen, 1988

This is a concrete and detailed hypothesis genera(ng mechanism.

slide-57
SLIDE 57

A neural-network model of selec(ve integra(on

750ms&

sensory evidence context choice

slide-58
SLIDE 58

A neural-network model of selec(ve integra(on

750ms&

sensory evidence context choice

1 +1

750ms&

slide-59
SLIDE 59

A neural-network model of selec(ve integra(on

750ms&

sensory evidence context

1

  • 1

750ms&

choice

slide-60
SLIDE 60

Model “Behavior”

25 50 75 100 fraction right choices (%) fraction green choices (%) motion trials 25 50 75 100 25 50 75 100 fraction right choices (%) fraction green choices (%) color trials 25 50 75 100 motion strength right left color strength green strong weak red strong weak strong weak strong weak

slide-61
SLIDE 61

750ms& context sensory evidence

Network Output Bounded Integrator

The trained network creates a bounded integrator

slide-62
SLIDE 62

Model trajectories during color trials

choice& right&

  • n&
  • ff&
  • ff&

strong& weak& weak& strong& choice& le1&

choice& color&

slide-63
SLIDE 63

Model trajectories during color trials

slide-64
SLIDE 64
slide-65
SLIDE 65

choice& mo(on&

choice le; choice right

Model trajectories during color trials

slide-66
SLIDE 66

How does integra(on happen?

choice le; choice right

color choice

slide-67
SLIDE 67

What is a fixed point?

Why are they important?

A B C D E F

Any nonlinear dynamical system (e.g. neural circuit) Zero “mo(on”

+1

  • 1

firing rate 1 firing rate 2

Seung, PNAS 1996

slide-68
SLIDE 68

choice le; choice right

color choice

Fixed points make a line amractor

slide-69
SLIDE 69

color c h

  • i

c e motion

context sensory evidence

Two line amractors for two contexts

The line amractors are context dependent and never exist at the same (me.

slide-70
SLIDE 70

choice le; choice right

color choice

Fixed points make a line amractor

slide-71
SLIDE 71

choice le; choice right

A simulated perturba(on experiment

color context

slide-72
SLIDE 72

choice le; choice right

A simulated perturba(on experiment

color context

slide-73
SLIDE 73

choice le; choice right

A simulated perturba(on experiment

color context

slide-74
SLIDE 74

choice le; choice right

A simulated perturba(on experiment

color context

slide-75
SLIDE 75

choice le; choice right

A simulated perturba(on experiment

color context

slide-76
SLIDE 76

choice le; choice right

A simulated perturba(on experiment

color context

slide-77
SLIDE 77

choice le; choice right

A simulated perturba(on experiment

color context

slide-78
SLIDE 78

choice le; choice right

A simulated perturba(on experiment

color context

slide-79
SLIDE 79

choice le; choice right

A simulated perturba(on experiment

color context

slide-80
SLIDE 80

choice le; choice right

A simulated perturba(on experiment

color context

slide-81
SLIDE 81

choice le; choice right

So what causes this difference between integra(on of color and ignoring mo(on?

color context

slide-82
SLIDE 82

Projec(ons onto the line amractor

slide-83
SLIDE 83
slide-84
SLIDE 84

These vectors we’ve talked about are context independent

slide-85
SLIDE 85
slide-86
SLIDE 86

The dynamics are context dependent

slide-87
SLIDE 87

The dynamics are context dependent

slide-88
SLIDE 88

The dynamics are context dependent

slide-89
SLIDE 89

The dynamics are context dependent

slide-90
SLIDE 90

The dynamics are context dependent

slide-91
SLIDE 91

The dynamics are context dependent

slide-92
SLIDE 92

The dynamics are context dependent

slide-93
SLIDE 93

The dynamics are context dependent

slide-94
SLIDE 94

The dynamics are context dependent

slide-95
SLIDE 95

The dynamics are context dependent

slide-96
SLIDE 96

The dynamics are context dependent

slide-97
SLIDE 97

The dynamics are context dependent

slide-98
SLIDE 98

The dynamics are context dependent

slide-99
SLIDE 99

The dynamics are context dependent

slide-100
SLIDE 100

Flexible selec(on and integra(on

slide-101
SLIDE 101

How$does$selec*ve$integra*on$occur?$

Motion context Color context

color c h

  • i

c e color c h

  • i

c e m

  • t

i

  • n

m

  • t

i

  • n

m

  • t

i

  • n

m

  • t

i

  • n

selection vector s e l e c t i

  • n

v e c t

  • r

Context'dependent* selec%on(vector*

slide-102
SLIDE 102

A predic(on of the model

choice le; choice right

color context

slide-103
SLIDE 103

We trained an abstract model to make a contextual decision based on two noisy input streams. The model made a contextual integrator with bounds. Like the data, the model represents the relevant and irrelevant inputs in separable dimensions. Two context dependent line amractors are responsible for the integra(on. Network dynamics generated through feedback, not input ga(ng, are responsible for context dependent integra(on. The network is flexibly reconfigured by the context input, which is seen as two different line amractors in state space.

Conclusions from model

slide-104
SLIDE 104

Ga7ng of sensory signals: Does not require modula(on of sensory responses. Is not about suppressing the irrelevant input, but about selec(ng the relevant input in state space. Is one aspect of a dynamical process occurring in the same cor(cal circuit as integra(on of evidence. Everything is happening at the popula(on level. Our works suggests a possible mechanism, which is not exclusive of others.

Computa7on through dynamics: mixed, separable representa(ons are contextually and dynamically linked to generate the desired output.

Final conclusions

slide-105
SLIDE 105

Shenoy Lab

Krishna Shenoy

Mark Churchland MaD Kaufman Cindy Chestek Dan O’Shea Cora Ames Werapong Goo Jus(n Foster Paul Nuyujukian Jonathan Kao Joline Fan Eric Trautmann Sergey Stavisky Chand Chandrasekaran

Newsome Lab

Bill Newsome

Valerio Mante Roozbeh Kiani Vince McGinty Daniel Kimmel Leo Segrue Diogo Peixioto Nazli Emadi

Larry AbboD Mark Churchland Omri Barak Valerio Mante

This research was supported by an NIH Director’s Pioneer award, Howard Hughes Medical Ins(tute, and grants from NIH-CRCNS, DARPA REPAIR, NSF, the Helen Hay Whitney Founda(on, the Burroughs Wellcome Fund, and the Christopher and Dana Reeve Founda(on and a NSF GRFP.

Funding Collaborators Acknowledgments