Reinforcement Learning Models
- f the Basal Ganglia
Reinforcement Learning Models of the Basal Ganglia Computational - - PowerPoint PPT Presentation
Reinforcement Learning Models of the Basal Ganglia Computational Models of Neural Systems Lecture 6.2 David S. Touretzky November, 2017 Dopamine Cells Located in SNc (substantia nigra pars compacta) and VTA (ventral tegmental area)
11/20/17 Computational Models of Neural Systems 2
– Unpredicted primary reinforcer (food, juice) – Unpredicted CS (tone, light) that has become a secondary reinforcer
– High intensity or novel stimuli
– For a few cells (less than 20%): aversive stimuli
11/20/17 Computational Models of Neural Systems 3
– Nicely explains response to unpredicted reinforcers – Novelty is somewhat rewarding to animals – Aversive stimuli? (prediction error)
11/20/17 Computational Models of Neural Systems 4
– Maybe the problem is that his animals were only tested on a single task. – More recent experiments have shown that DA neurons can distinguish
11/20/17 Computational Models of Neural Systems 5
– Afferent input + subsequent dopamine input ⇒ LTP.
– 500-5,000 DA synapses – 5,000-10,000 cortical synapses
11/20/17 Computational Models of Neural Systems 6
– What prevents extinction? – Perhaps a separate reinforcer signal in striatum.
11/20/17 Computational Models of Neural Systems 7
11/20/17 Computational Models of Neural Systems 8
i
11/20/17 Computational Models of Neural Systems 9
Time delay
11/20/17 Computational Models of Neural Systems 10 Time delay
11/20/17 Computational Models of Neural Systems 11
11/20/17 Computational Models of Neural Systems 12
11/20/17 Computational Models of Neural Systems 13
11/20/17 Computational Models of Neural Systems 14
– respond to a secondary reinforcer stimulus (indirect path), and also – predict the timing of the primary reward to follow (direct path)
11/20/17 Computational Models of Neural Systems 15
11/20/17 Computational Models of Neural Systems 16
11/20/17 Computational Models of Neural Systems 17
11/20/17 Computational Models of Neural Systems 18
11/20/17 Computational Models of Neural Systems 19
11/20/17 Computational Models of Neural Systems 20
11/20/17 Computational Models of Neural Systems 21
11/20/17 Computational Models of Neural Systems 22
11/20/17 Computational Models of Neural Systems 23
11/20/17 Computational Models of Neural Systems 24
11/20/17 Computational Models of Neural Systems 25
11/20/17 Computational Models of Neural Systems 26
11/20/17 Computational Models of Neural Systems 27
11/20/17 Computational Models of Neural Systems 28
11/20/17 Computational Models of Neural Systems 29
11/20/17 Computational Models of Neural Systems 30
– Markov model = states plus transitions – “Hidden” means the current state must be inferred – “Semi-” means dwell times are drawn from a distribution; transitions do