A reinforcement learning model of song acquisition in the bird
Michale Fee
McGovern Institute, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology
9.54 November 12, 2014
Structure of zebra finch song
[Figure: song spectrogram. Frequency axis 0 to 10 kHz; 1 s scale bar. Labeled units: motif, syllable (~100 ms), note (~10 ms).]
[Figure: stages of song development: tutor song, subsong, plastic song, crystallized song. Trends across stages: decreased variability, increased similarity to tutor.]
[Diagram: motor pathway (HVC, RA, nXII). Region labels: cortex, thalamus (Uva).]
Nottebohm et al, 1976, 1982
Hahnloser, Kozhevnikov and Fee, 2002
[Diagram: HVC, RA, Area X; stimulation electrode and extracellular recording electrode.] Lynch, Okubo and Fee, in preparation
[Figure: burst times (t) by burst number (#). Bird A: 66 bursts, 40 neurons. Bird B: 56 bursts, 44 neurons. Bird C: 91 bursts, 64 neurons.]
Yu and Margoliash, 1996
[Figure: recordings during the motif; diagram of HVC and RA with an extracellular recording electrode.] Leonardo and Fee, 2005
Sparse representation of time
Output
Leonardo and Fee, 2005
Brain cooling to localize dynamics
[Diagram: cooling device over HVC (HVC, RA, nXII); conditions 0.0 A and 0.25 A; 5 mm scale bar.]
Bilateral cooling of HVC causes uniform slowing of the song
Long and Fee, Nature 2008
Auditory Memory
Doya and Sejnowski, 1989
[Diagram: motor pathway (HVC, RA, nXII) with LMAN, DLM, and Area X. Region labels: cortex, thalamus, basal ganglia (Area X).]
Anterior Forebrain Pathway (AFP)
for learning (Bottjer, 1984; Scharff and Nottebohm, 1991)
in the motor pathway
[Diagram: HVC, RA, nXII, LMAN; TTX or muscimol.]
Kao et al, 2005; Ölveczky et al, 2005; Aronov et al, 2008; Stepanek and Doupe, 2010
55-day-old bird
[Diagram: HVC, RA, nXII, LMAN.] Ölveczky, Andalman, and Fee, 2005
[Figure: residual pitch (Hz) versus time (ms), LMAN intact and LMAN inactivated.]
[Figure: LMAN intact and LMAN inactivated; scale bars 250 ms, 30 dB.] Goldberg and Fee, 2011
[Diagram: motor pathway (HVC, RA, nXII) and learning pathway (AFP) with LMAN.] Aronov, Andalman and Fee, Science 2008
[Figure: subsong bird, plastic song bird, adult bird; 30 dB scale.]
[Figure: subsong, pre-lesion and post-lesion; 250 ms scale bar.]
[Diagram: HVC, RA, nXIIts, LMAN, Area X, DLM.] Goldberg and Fee, 2011
effect on juvenile song variability.
babbling exploratory vocal variability is generated by local circuit dynamics within LMAN.
[Diagram: HVC, RA, nXII, LMAN, Area X, DLM.]
Tchernichovski, Mitra, Lints, Nottebohm, 2001
[Figure: tutor and pupil, harmonic stack. Tutor: 606 Hz. Days of training: 5, 8, 12, 20, 30. Pitch (Hz): 568, 554, 551, 596, 607.]
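[Figure: pitch-contingent feedback. Vocalized and heard pitch (kHz, 0.5 to 0.6) in a targeted region; feedback noise and pitch threshold marked.] Andalman and Fee 2009; Tumer and Brainard 2007
[Diagram: feedback apparatus with microphone (Mic), DSP, and speaker; labels: brain, cranial airsac.] Tumer and Brainard 2007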
[Figure: pitch (Hz, 550 to 650) of the targeted region at 0 h, 2 h, and 4 h; 25 ms scale bar.] Andalman and Fee 2009
[Figure: pitch (Hz) versus days post hatch (120 to 165), with expanded views of days 141 to 144 and days 161 to 164.]
[Figure: histograms of observations of ΔPitch, Day (Hz) and ΔPitch, Overnight (Hz), for up days and down days.]
[Diagram: motor parameter space. Labels: motor pathway, AFP-driven variability, AFP-driven bias, error gradient (reduced error), plasticity in motor pathway.]
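One way to read the labels above is as a two-stage learning scheme: the AFP adds trial-to-trial variability plus a bias toward reduced error, and plasticity in the motor pathway later absorbs that bias. The Python sketch below is a minimal illustration under stated assumptions, not the model presented in the talk. Pitch is treated as a single motor parameter, "reduced error" is taken to mean escaping feedback noise above a hypothetical 600 Hz threshold, the bias follows a simple reward-covariance update, and a fixed fraction of the bias is transferred to the motor pathway each night. The function name and all parameter values are invented for illustration.

```python
import numpy as np

def simulate_two_stage_learning(n_days=5, trials_per_day=500, seed=0):
    """Toy two-stage learner: fast AFP bias, slow motor-pathway consolidation."""
    rng = np.random.default_rng(seed)
    threshold = 600.0  # Hz, hypothetical pitch threshold for escaping noise
    sigma = 10.0       # Hz, assumed size of AFP-driven variability
    eta = 0.005        # assumed learning rate for the AFP bias
    transfer = 0.5     # assumed fraction of bias consolidated each night
    m = 590.0          # Hz, motor-pathway contribution to pitch
    bias = 0.0         # Hz, AFP-driven bias
    r_bar = 0.0        # running estimate of average reward
    log = []
    for day in range(n_days):
        m_start = m
        for _ in range(trials_per_day):
            explore = sigma * rng.standard_normal()   # AFP-driven variability
            pitch = m + bias + explore
            reward = 1.0 if pitch > threshold else 0.0  # 1 = no feedback noise
            # Push the bias toward variations that did better than average.
            bias += eta * (reward - r_bar) * explore
            r_bar += 0.05 * (reward - r_bar)
        log.append((day, m_start, bias))
        # Overnight: the motor pathway absorbs part of the AFP bias.
        m += transfer * bias
        bias -= transfer * bias
    return log, m

log, m_final = simulate_two_stage_learning()
for day, m_start, bias_end in log:
    print(f"day {day}: motor pathway {m_start:.1f} Hz, AFP bias at end of day {bias_end:.1f} Hz")
```

In this sketch the mean pitch (`m + bias`) does not jump overnight; only the split between the two pathways changes, which is one simple way to make the motor pathway lag behind the AFP bias.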
[Diagram: motor pathway (HVC, RA, nXIIts) and Anterior Forebrain Pathway (AFP: Area X, DLM, LMAN).]
[Figure: pitch (Hz) and ∆Pitch (Hz) with TTX, 2 h; 25 Hz scale.]
[Figure: histograms of observations of ∆Pitch (Hz) for TTX and for vehicle, up days and down days.]
Andalman and Fee, PNAS 2009
[Figure: pitch (Hz, 470 to 600) across TTX and PBS sessions.]
[Diagram: dialysis probe in LMAN: drug reservoir, cap, inflow tube, dialysis membrane, skull, dental acrylic.]
[Diagram: motor parameter space. Labels: motor pathway, AFP-driven variability, AFP-driven error-reducing bias, error gradient (reduced error), plasticity in motor pathway.]
[Figure: pitch (Hz, 500 to 700) versus days post hatch (120 to 165).]
[Figure: pitch (Hz) versus days post-hatch (120 to 165) for baseline, LMAN(+), and LMAN(-).]
[Schematic: pitch across Day 1, night, Day 2, and Day 3; labels: β, Δm.]
Andalman and Fee, 2009; Warren et al, 2011
[Figure: correlation coefficient (r²) versus lag (days).]
[Figure: Δm (Hz, down days inverted) and estimated AFP bias (Hz, down days inverted) versus days, at lags of -2 d, -1 d, and 0 d.]
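A lagged correlation of this kind can be computed by shifting one daily series against the other before correlating. The sketch below is only an illustration of that calculation, not the analysis used in the cited papers. The function name, the synthetic data, and the sign convention (a lag of k pairs the bias on day d with Δm on day d + k) are assumptions and may not match the convention in the figure.

```python
import numpy as np

def lagged_r2(bias, delta_m, lag):
    """r^2 between bias[d] and delta_m[d + lag] (assumed sign convention)."""
    bias = np.asarray(bias, dtype=float)
    delta_m = np.asarray(delta_m, dtype=float)
    n = len(bias)
    if len(delta_m) != n:
        raise ValueError("series must have the same length")
    if lag >= 0:
        x, y = bias[:n - lag], delta_m[lag:]
    else:
        x, y = bias[-lag:], delta_m[:n + lag]
    r = np.corrcoef(x, y)[0, 1]  # Pearson correlation of the aligned pairs
    return r ** 2

# Synthetic example: the motor-pathway change copies the previous day's bias.
rng = np.random.default_rng(1)
bias_series = rng.normal(size=30)
delta_m_series = np.roll(bias_series, 1)  # delta_m[d] = bias[d - 1]
r2_by_lag = {lag: lagged_r2(bias_series, delta_m_series, lag) for lag in (-1, 0, 1)}
print(r2_by_lag)
```

With this synthetic data the correlation peaks at a lag of one day, because the second series was built as a one-day-delayed copy of the first.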
[Diagram: motor parameter space across Day 1, Day 2, and Day 3. Labels: motor pathway, AFP-driven variability, AFP-driven bias, motor pathway plasticity, error gradient (reduced error).]
[Diagram: HVC, RA, nXII, LMAN, Area X.]
lead to better song performance.
VTA (Schultz, 2000)
[Diagram: AIV, VTA, Area X, CM, Aud.] Keller and Hahnloser, 2008; Gale and Perkel, 2008; Mandelblat-Cerf et al, 2014
[Figure: retrograde label from VTA in the ventral intermediate arcopallium (AIV).]
[Diagram: HVC, nXII, LMAN, Area X, VTA.] Las, Denisenko, Mandelblat-Cerf, eLife, 2014
[Timeline, age in days post hatch (40 and 90 marked): bird tutored in home cage, AIV lesion, bird isolated, check imitation.]
[Figure: adult song of AIV-lesioned pupil #2 (Example 1, Example 2) and tutor song.]
[Figure: similarity for lesioned control, unlesioned control, and AIV lesion, with the similarity of unrelated birds marked.]
[Diagram: HVC, nXII, LMAN, Area X, VTA.]
[Figure: voltage (mV, 0.0 to 0.4) versus time from stim (ms, 2 to 14).]
[Figure: noise burst; 200 ms scale bar.] Mandelblat-Cerf, Las, Denisenko, under review
[Diagram: HVC, RA, nXII, LMAN, Area X, DLM, VTA, CM, Aud.]
The AFP forms a classic cortical-BG-thalamo-cortical loop
[Diagram: Area X circuit. HVC input (time sequence 1, 2, 3), LMAN, VTA, MSN, pallidal, thalamus, output to RA.]
[Figure: HVC(X) firing patterns, labeled 1 to 7; 4 kHz and 100 ms scale bars.]
Learning rule: Strengthen HVC synapse after coincidence
Time-dependent bias of one LMAN neuron Goldberg and Fee 2010
[Diagram: Area X circuit with HVC inputs (1, 2, 3), LMAN, VTA, MSN, pallidal, thalamus, output to RA.]
HVC synapses: timing; drive MSNs; plastic; selective for single synapses
LMAN synapses: action; do NOT drive MSNs; not plastic; global signal
[Diagram: HVC, LMAN, and VTA inputs to an MSN, with the eligibility trace E_HVC-X and the weight change ΔW_HVC-X; synapse strengthened.]
HVC on spines, LMAN on dendritic shafts
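The learning rule named on these slides (an eligibility trace at the HVC synapse, converted into a weight change by VTA) can be sketched as a three-factor update. The Python below is a minimal illustration under assumptions that are not in the slides: time is discrete, the eligibility trace is set by coincident HVC and LMAN input and decays exponentially with a hypothetical time constant `tau`, and the weight change is the product of the VTA signal and the trace. The function name, `eta`, and `tau` are invented for illustration.

```python
import numpy as np

def three_factor_update(hvc, lman, vta, w, eta=0.1, tau=5.0):
    """Update HVC-to-MSN weights with an eligibility trace gated by VTA.

    hvc:  (T, N) array, activity of N HVC inputs at each of T time steps.
    lman: (T,) array, LMAN input to the same MSN.
    vta:  (T,) array, reinforcement signal.
    w:    (N,) array, initial HVC-to-MSN weights.
    """
    w = np.array(w, dtype=float)        # work on a copy
    e = np.zeros_like(w)                # eligibility trace, one per HVC synapse
    decay = np.exp(-1.0 / tau)
    for t in range(hvc.shape[0]):
        # Tag synapses whose HVC input coincides with LMAN input.
        e = decay * e + hvc[t] * lman[t]
        # A later VTA signal converts the remaining tag into a weight change.
        w += eta * vta[t] * e
    return w

# Example: 10 HVC inputs active one after another, LMAN active at step 3,
# VTA signal at step 5.
hvc = np.eye(10)
lman = np.zeros(10)
lman[3] = 1.0
vta = np.zeros(10)
vta[5] = 1.0
w0 = np.full(10, 0.5)
dw = three_factor_update(hvc, lman, vta, w0) - w0
print(int(np.argmax(dw)))
```

In this example only the synapse from the HVC input that was active together with LMAN is strengthened, and the size of the change shrinks as the delay to the VTA signal grows.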
Collaboration with Winfried Denk and Jörgen Kornfeld
[Figure: axonal arbor of an LMAN neuron in Area X and axonal arbor of an HVC neuron in Area X; 200 µm scale bars.] Michael Stetner
~94% of synapses onto spines are from HVC-like axons
[Figure: putative LMAN axons and putative HVC axons.]
improved song performance.
carry ‘performance’ error-related information.
better performance and bias the variability in the direction of improved performance.
incorporates an efference copy of cortically-generated motor actions.
Current Lab Members
Former Lab Members
National Institutes of Health - NIMH, NIDCD
[Diagram: stimulating electrode; HVC, RA, nXII, LMAN, Area X, DLM.]
[Figure: instantaneous firing rate (Hz, up to 700) and sound amplitude (20 dB scale) in three panels, one labeled Neuron 14; scale bars 200 ms and 250 ms.]
[Figure: mean firing rate (Hz) versus time relative to offset (ms), with sound amplitude.]