Title: Striatal Dopamine DA and Learning: Do Category Learning CL data constrain computational models
1Striatal Dopamine (DA) and Learning Do Category
Learning (CL) data constrain computational models?
- Alan Pickering
- Department of Psychology
- a.pickering_at_gold.ac.uk
2Overview
- Classic CL findings and questions
- DA, the striatum and learning
- Generate simple hypothesis about CL deficits in
Parkinsons Disease - Generate simple biologically-constrained neural
net to test hypothesis - Simulate CL data on 2 types of matched CL tasks
- Conclusions why model fails
3Classic Findings and Questions
- Parkinsons Disease (PD) patients are impaired at
CL tasks. - Why?
- -What psychological processes are
- impaired?
- -What brain regions and neuro-
- transmitters are involved?
-
4Category Learning in Parkinsons Disease
- Weather task Knowlton et al, 1996
5Category Learning in Parkinsons Disease
- Main Findings Knowlton et al, 1996
6Key Facts
- PD involves prominent damage to the striatum
- CL may (sometimes) involve procedural/habit
learning - Striatal structures are part of
cortico-striato-pallido-thalamic loops possibly
implicated in procedural learning - The striatum is strongly innervated by ascending
DA projections
7Simple Interpretation
- CL deficits in PD may arise because of damage to
-
- loss of ascending DA signals
- which compromise the functioning of (parts of)
- the striatum
8Three Learning Processes Which Might Be
DA-Related
- Appetitive reinforcement and motivation
- DA cell firing increses/decreases provide a
positive/negative reinforcement signal which is
required for synaptic strengthening/ weakening - 3-factor learning rule
- (e.g., Wickens Brown et al etc.)
9Corticostriatal (Medium Spiny Cell) Synapse
10DA Receptors in Striatum
DA receptors Unfilled rectangles GLU receptors
Filled rectangles
11DA-Related Processes (cont)
- Reward Prediction Error
- Midbrain DA neurons increase firing in response
to unexpected rewards and decrease firing to
nonoccurrence of expected rewards - Firing change reward prediction error
- Schultz, Suri, Dickinson, Dayan etc.
12DA Cell Recordings Evidence For Reward
Prediction Error
CUE
REWARD
13DA-Related Processes (cont)
- Modulation of Neural Signals
- Floresco et al (2001) DA receptor activity
serves to strengthen salient inputs while
inhibiting weaker ones - Also Nicola Malenka J.D.Cohen Ashby
Cassale Salum et al Nakahara Schultz
14Evidence For Modulation
- Nicola Malenka, 1997
- Recorded effect of DA on response of striatal
cells to strong and weak inputs
Strong
Weak
15Linking 3-Factor Learning Reward Prediction
Error
Striatal Cell
Cue
Reward prediction
Reward predictionerror
Reward
DA Cell
Excitatory
Inhibitory
Reinforcement
16Simple Working Hypothesis
- CL is impaired in PD patients (and other
DA-compromised groups) due to reduced DA
function in striatum (tail of caudate) - The loss of ascending DA input reduces the
reinforcing function of the reward prediction
error signal innervating the striatum
17Modelling
- Biologically-constrained neural net
- Data to be simulated taken from Ashby et al
(2003) - Data from young and old controls (YC, OC) and PD
patients - Study used matched CL tasks rule-based (RB) and
Information Integration (II) - Ashby and colleagues believe these tasks are
handled by distinct CL systems
18Ashby et al II Task
- 3 of the 4 dimensions determine categories
- Not readily verbalisable
Cat A
Cat B
19Ashby et al RB Task
- 1 dimension (background colour) determines
category - Readily verbalisable rule
Cat A
Cat B
20Ashby et al Results
- Proportion failing to learning to criterion in
200 trials
RB Task
II Task
21RB Task Results
- Trials to criterion for learners
II Task
RB Task
22Modelling
- Constrained by input and output connections of
striatum (caudate) - Learning rule based on known
- 3-factor form of synaptic plasticity in striatum
- Learning rule consistent with reward prediction
error properties of DA neurons
23Connections of Striatum
24Schematic Model
Reward prediction
25Model Learning Rule
xJout
yK
yKout
wJK
Reward prediction error, E
- When reward present, Egt0
- ?wJK kREykoutxJout
- When reward absent, Elt0
- ?wJK kNEykoutxJout
26Modelling of Reduced DA Function
- Loss of DA input to striatum (tail of caudate)
modelled 2 ways (with same results)- - a) loss of modifiability of cortico-
- striatal weights
- b) proportional reduction of reward
prediction error strength - Mean proportion of weights modifiable-
- YC 0.8 OC 0.5 PD 0.2
- (with s.d. 0.15)
27Modelling Process
- Found parameters which gave good fit to YC
performance on II task and set DA parameters for
PD to produce appropriate level of nonlearners on
same task - Varied OC DA values between YC and PD
- Looked at fit (with these parameters) to all
other data cells esp. RB task
28Modelling II Task Results
Proportion of non-learners
Trials to criterion (learners)
YC
PD
29Modelling II Task Results
Proportion of non-learners
Trials to criterion (learners)
YC
PD
OC
30Modelling II Task Results
Proportion of non-learners
Trials to criterion (learners)
YC
PD
OC
31Model Results II Task
- Performance of learners in blocks of 16 trials
32Modelling RB Task Results
Proportion of non-learners
Trials to criterion (learners)
YC
PD
OC
33Modelling RB Task Results
Proportion of non-learners
Trials to criterion (learners)
YC
PD
OC
34Conclusions Future Work 1
- Simplest realistic model of cortico-striatal
learning captures only limited aspects of the CL
data - Bimodal nature (learn normally vs. fail) of
data simulated only under some paramter settings - No intermediate DA parameter settings in old
controls which can be both PD-like for II task
and YC-like for RB task
35Conclusions Future Work 2
- Model challenges hypothesis under test PD (and
OC) deficits in some CL tasks seem unlikely to be
solely due to reduced DA-related reinforcement in
striatum - Findings are consistent with gt1 CL system
- Future model should add rule system (c.f. Ashbys
COVIS)
36Alan PickeringCL Refs 2001-
- Pickering, A.D., Gray, J.A. (2001). Dopamine,
appetitive reinforcement, and the neuropsychology
of human learning An individual differences
approach. In A. Eliasz A. Angleitner (Eds.),
Advances in individual differences research (pp.
113-149). Lengerich, Germany PABST Science
Publishers.