Finding Structure in Time
Jeffrey L. Elman In Cognitive Science 14, 179 – 211 (1990) presented by Dominic Seyler (dseyler2@illinois.edu)
Finding Structure in Time Jeffrey L. Elman In Cognitive Science 14, - - PowerPoint PPT Presentation
Finding Structure in Time Jeffrey L. Elman In Cognitive Science 14, 179 211 (1990) presented by Dominic Seyler (dseyler2@illinois.edu) Outline Motivation Method Experiments Exclusive-Or Structure in Letter Sequences
Jeffrey L. Elman In Cognitive Science 14, 179 – 211 (1990) presented by Dominic Seyler (dseyler2@illinois.edu)
vector
temporal pattern
position
[ 0 1 1 1 0 0 0 0 0 ] [ 0 0 0 1 1 1 0 0 0 ]
part of the input
the effect it has on processing
see previous output
give the network memory
units (context units)
sequentially, the context units contain the exact values of the hidden units of the previous sequence
external input and previous internal state to desired output
to predict the next bit correctly
chance
be the XOR of the first and second
predictions (multi-bit)
sequence, predict the character word
randomly there is high error
the network can make use of previous information. Thus, error is low.
structured the network can make partial predictions even where the complete prediction is not possible
sequential list of concatenated characters?
predict the following letter
manyyearsago
anyyearsago?
high
error declines
the recurring sequences in the input are and highly correlates with words
linguistic units from input signal
when only the surface forms (i.e. words) are presented to it?
woman; VERB-PERCEPTION -> smell, see)
FOOD)
vectors (e.g. 00010 00100 10000)
nondeterministic RMS error is not a fitting measurement
each word in all possible contexts and average over them
representations is shown in tree
which reflect facts about possible sequential ordering of inputs
conventional sense, since patterns also reflect prior context.
the network is able to predict the approximate likelihood of
the activation pattern in its entirety is meaningful.
(XOR could previously not be learned by single-layer network)
at the beginning of words in sentence)
performance (Increasing number of bits did not hurt performance)
high among words within one class)
Jeffrey L. Elman In Cognitive Science 14, 179 – 211 (1990) presented by Dominic Seyler (dseyler2@illinois.edu)