CHAINING routine sequential action, dynamic decision making, motor - - PowerPoint PPT Presentation

▶

Jun 27, 2023 108 likes •208 views

Activity The bigram frequency effect trace 1 0.9 Conjunctive, 0.8 Proportion items correct distributed 0.7 representations 0.6 0.5 0.4 0.3 Recurrent connectivity 0.2 0.1 0 Dominey, Arbib & Joseph, 1995 Zero order First

SLIDE 1

Beiser & Houk, J Neurophys, 1998

Activity trace Conjunctive, distributed representations Recurrent connectivity

Dominey, Arbib & Joseph, 1995
Beiser & Houk, 1998
O'Reilly & Soto, 2002

Recurrent neural networks

Jordan (1986), Elman (1990, 1991, 1993)
Applied to implicit learning, language

comprehension and production, event parsing, routine sequential action, dynamic decision making, motor control, cognitive development...

Natural account of sensitivity to sequential

domain structure

From: Baddeley, QJEP, 1964

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

Proportion items correct First order Zero order

The bigram frequency effect

CHAINING

SLIDE 2

Initial interest
Ebbinghaus
Wickelgren (associative intrusions)
Murdock (TODAM)
Eventual rejection
Lashley
Baddeley, 1968, sawtooth pattern
Henson, Norris, Page & Baddeley,

1996: “Memory unchained”

The trouble with chaining

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

1 2 3 4 5 6 Position Errors PC PN AC AN

Baddeley, QJEP, 1968

Example AC list: B R D Q P L

Context-association models of serial recall

B T C V G

Hebbian links Context signal

Burgess & Hitch, 1992
Henson, 1996
Houghton, 1990
Brown, Preece & Hulme, 2000

"Interactions between short- and long-term memory pose problems for most models of serial recall" (Henson, Cog. Psych., 1998)

CHAINING

???

Model architecture

List items (input) List items (output)

a b c d … a b c d …

SLIDE 3

Training and testing

Same trial structure (encoding, recall) during training and testing
Actual relevant “training” experience is from language
List lengths 1-9
Trained on small proportion of possible sequences (< 1 / 1,000)
Training: backpropagation learning procedure
Testing: weights frozen (during both encoding and recall)
Three simulations
1. Localist letter units
2. Two-dimensional items (confusable / non-confusable)
3. Bigram frequencies of English

Empirical data: Crannell & Parrish, J Psychol, 1957

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 4 5 6 7 8 9 10 11

List length Accuracy Empirical data: Henson, Norris, Page & Baddeley, QJEP, 1996

Simulation

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

1 2 3 4 5 6

Output position Proportion

Data

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

1 2 3 4 5 6

Output position Proportion

Transposition errors

SLIDE 4

Empirical data: Baddeley, QJEP, 1968

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

1 2 3 4 5 6 Position Errors (proportion)

Data

PC PN AC AN

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

1 2 3 4 5 6

Simulation

Position Empirical data: Baddeley, QJEP, 1964

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

Data Simulation

Proportion items correct

First order Zero order

Data

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

1 2 3 4 5 6 7 8 9 Position

Accuracy

Simulation

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

1 2 3 4 5 6 7 8 9 Position Accuracy High BF Low BF

Empirical data: Kantowitz, Ornstein & Schwartz, JEP, 1972

Grouping effects

Empirical data Model

SLIDE 5

Representational similarity

T R B L . . . T R B L

T1 R2 B3 L4 B2 D3 S3 D2 S2

Similarity chart for isr

0.0 0.2 0.4 0.6 0.8 1.0 1 2 3 Distance Correlation

Same item Confusable Non-confusable

Distance between positions of contributing items Similarity between items’ contributions ( r 2 )

B2 D2 S2 D3 S3

SLIDE 6

Item Position

B3 B4 F2 S3 D2 F1 S1 D3 LRTFJ ZDGBT ZDBGT LRFTJ JFTRL

Cumulative transposition distance

18 16 14 12 10 8 6 4 2

Euclidian distance

6 5 4 3 2 1

Non-confusable Confusable 5 4 3 2

Non-confusable Euclidean distance from reference (Median) Confusable Degree of re-ordering (Cumulative transposition distance)

5 4 3 2

Beiser & Houk, J Neurophys, 1998

SLIDE 7

Weight magnitudes Accuracy depends on:

1. Frequency (goodness) of target
2. Neighborhood relations of target
a. Similar (near) neighbors: lower accuracy
b. High-frequency neighbors: lower accuracy

(analogous to frequency and regularity/consistency effects in single word reading; Andrews, 1982; Taraban & McClelland, 1987)

The good neighbor effect:

Assuming equal target goodness, recall accuracy will be lower for targets with high-goodness near neighbors

(cf. consistency effects among “regular” words; Glushko, 1979; e.g., GAVE is slower than GATE due to interference from HAVE)

A A B

0.75 0.25

A1, A2, A3, B1, B2, B3 A B

0.75 0.25

SLIDE 8

ABABAB BABABA .00402 ABABBA BABAAB .00209 ABAABA BABBAB .00209 AABBAB BBAABA .00106 ABBAAB BAABBA .00106 AABABB BBABAA .00106 ABAABB BABBAA .00106 ABBBAA BAAABB .00052 AABBBA BBAAAB .00052 AAABBB BBBAAA .00026 List structure Probability

AABABB AABBAB ABAABB ABBAAB ABABAB

The good neighbor effect: An experimental test

(Botvinick, submitted)

Six undergraduate participants
Immediate serial recall
Items: Six pseudo-words (dah, fie, poe, noo, tee, kay)
ABABAB grammar
Auditory presentation (100 words / min)
Verbal and keyboard responding (repeats prevented)
Eight training, seven testing sessions (250 lists / session)

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.302 0.15 0.076 0.037 0.019 Stimulus probability Proportion correct 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.302 0.15 0.076 0.037 0.019 Stimulus probability Proportion correct

AABABB 0.54* Others 0.44 AABABB 0.51* Others 0.46 Simulation Empirical data

Stimulus class probability Stimulus class probability

SLIDE 9

Conclusions

Theory
Activation-based coding, recurrent connectivity
Conjunctive representation of item, position
Sequence representation occupies continuous, multidimensional space
Graded similarity structure
Evidence
Fits with neuroanatomic, neurophysiologic data
Fits with behavioral data (this isn't chaining)
Exhibits appropriate sensitivity to domain structure