Productivity, Reuse, and Competition between Generalizations
Timothy J. O’Donnell MIT
Productivity, Reuse, and Competition between Generalizations - - PowerPoint PPT Presentation
Productivity, Reuse, and Competition between Generalizations Timothy J. ODonnell MIT Two Problems 1. Problem of Competition 2. Problem of Productivity The Problem of Competition When multiple ways of expressing a meaning exist, how do
Timothy J. O’Donnell MIT
(e.g., Aronoff, 1976; Plag, 2003; Rainer, 1988; van Marle, 1986)
(Aronoff, 1976)
6
elsewhere condition (subset principle, Pāṇini’s principle, blocking, pre-emption, etc.)
“more general” way.
Schvaneveldt, 1978)
Marcus et al. 1992)
Suffix
Productive (with Adjectives)
Context-Dependent
Unproductive
Suffix
Productive (with Adjectives)
Semi-productive
Unproductive
circuitousness, grandness, orderliness, pretentiousness, cheapness, ...
pine-scentedness
pine-scented
Suffix
Productive (with Adjectives)
Semi-productive
Unproductive
Suffix
Productive (with Adjectives)
Context-Dependent
Unproductive
verticality,tractability,severity, seniority, inanity, electricity, ...
*pine-scentedity
Suffix
Productive (with Adjectives)
Context-Dependent
Unproductive
subsequentiability subsequentiable
Suffix
Productive (with Adjectives)
Context-Dependent
Unproductive
Suffix
Productive (with Adjectives)
Context-Dependent
Unproductive
warmth, width, truth, depth, ...
*coolth
Suffix
Productive (with Adjectives)
Context-Dependent
Unproductive
Suffix
Most Productive
Less Productive
Least Productive
under uncertainty based on an inference which
(computation) and reuse (storage).
productivity and competition.
combinations.
random choices.
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Thursday, February 2, 2012
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Thursday, February 2, 2012
Thursday, February 2, 2012
Thursday, February 2, 2012
Thursday, February 2, 2012
Thursday, February 2, 2012
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Thursday, February 2, 2012
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Thursday, February 2, 2012
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Thursday, February 2, 2012
Thursday, February 2, 2012
Thursday, February 2, 2012
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Thursday, February 2, 2012
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Thursday, February 2, 2012
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Thursday, February 2, 2012
Thursday, February 2, 2012
Thursday, February 2, 2012
Thursday, February 2, 2012
Thursday, February 2, 2012
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Thursday, February 2, 2012
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Thursday, February 2, 2012
prior and likelihood applied to computation and storage problem.
given lexicon in (two part) MDL.
frameworks.
60
P(Data, Fragments) = P(Data | Fragments) * P(Fragments)
61
P(Data, Fragments) = P(Data | Fragments) * P(Fragments)
Likelihood (derivation probabilities)
62
P(Data, Fragments) = P(Data | Fragments) * P(Fragments)
63
P(Fragments | Data) ∝ P(Data | Fragments) * P(Fragments)
64
al., 2007).
De Marcken, 1996.
substitution grammars (e.g., Bod, 2003; Cohn, 2010;
Goodman, 2003; Zuidema, 2007; Post, 2013).
Thursday, February 2, 2012
Thursday, February 2, 2012
literature.
Thursday, February 2, 2012
literature.
Thursday, February 2, 2012
literature.
representations.
Thursday, February 2, 2012
literature.
representations.
Thursday, February 2, 2012
(MAP Multinomial-Dirichlet Context- Free Grammars)
productive.
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Full-Parsing
(FP)
(MAP All-Adapted Adaptor Grammars)
(recursively).
frequencies.
redundancy rules.
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Full-Parsing
(FP)
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Full-Listing
(FL)
(Data-Oriented Parsing)
consistent with input.
Parsing 1 (DOP1; Bod, 1998), Data- Oriented Parsing: Equal-Node Estimator (ENDOP; Goodman, 2003).
syntax.
Full-Parsing
(FP)
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Full-Listing
(FL)
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Exemplar-Based
(EB)
(Fragment Grammars)
which best explains the data.
Grammars (O’Donnell, et al. 2009)
distribution of tokens over types.
variables.
Full-Parsing
(FP)
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Full-Listing
(FL)
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Exemplar-Based
(EB)
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
N Adj V agree
N Adj V count
N Adj V agree
N Adj V agree
Inference-Based
(IB)
Past Tense (Inflectional) Derivational Morphology
Productive +ed (walked) +ness (goodness) Context-Dependent I →æ (sang) +ity (ability) Unproductive suppletion
(go/went)
+th (width)
Past Tense Derivational Morphology
Productive +ed (walked) +ness (goodness) Context-Dependent I →æ (sang) +ity (ability) Unproductive suppletion
(go/went)
+th (width)
High Proportion of Low Frequency Types
High Token Frequency High Type Frequency High Token Frequency High Token Frequency
proportion of frequency-1 words in an input corpus.
P P∗
( values from Hay & Baayen, 2002)
P/P∗
MDPCFG
(Full-parsing)
MAG
(Full-listing)
DOP1
(Exemplar-based) ENDOP (Exemplar-based)
FG
(Inference)
Grammars behave approximately as if they were using hapaxes.
fact that some new words are built, behavior arises automatically.
Past Tense Derivational Morphology
Productive +ed (walked) +ness (goodness) Context-Dependent I →æ (sang) +ity (ability) Unproductive suppletion
(go/went)
+th (width)
Irregulars Regulars
Thursday, February 2, 2012
Irregulars Regulars
Thursday, February 2, 2012
−4 −2 2 4 6 8
Log Odds Correct
Irregular
Regular
Unattested
FP FP FPFL FL FL E1 E1 E1 E2 E2 E2 IB IB IB 98 FP
Full-Parsing
(Multinomial-Dirichlet CFG)
FL
Full-Listing
(Adaptor Grammars)
E1
Exemplar
(Data-Oriented Parsing 1)
E2
Exemplar
(DOP: ENDOP)
IB
Inference-Based
(Fragment Grammars)
−4 −2 2 4 6 8
Log Odds Correct
Irregular
Regular
Unattested
FP FP FPFL FL FL E1 E1 E1 E2 E2 E2 IB IB IB
Preference for Correct Past Form
99 FP
Full-Parsing
(Multinomial-Dirichlet CFG)
FL
Full-Listing
(Adaptor Grammars)
E1
Exemplar
(Data-Oriented Parsing 1)
E2
Exemplar
(DOP: ENDOP)
IB
Inference-Based
(Fragment Grammars)
−4 −2 2 4 6 8
Log Odds Correct
Irregular
Regular
Unattested
FP FP FPFL FL FL E1 E1 E1 E2 E2 E2 IB IB IB
Preference for Incorrect Past Form
100 FP
Full-Parsing
(Multinomial-Dirichlet CFG)
FL
Full-Listing
(Adaptor Grammars)
E1
Exemplar
(Data-Oriented Parsing 1)
E2
Exemplar
(DOP: ENDOP)
IB
Inference-Based
(Fragment Grammars)
−4 −2 2 4 6 8
Log Odds Correct
Irregular
Regular
Unattested
FP FP FPFL FL FL E1 E1 E1 E2 E2 E2 IB IB IB
101 FP
Full-Parsing
(Multinomial-Dirichlet CFG)
FL
Full-Listing
(Adaptor Grammars)
E1
Exemplar
(Data-Oriented Parsing 1)
E2
Exemplar
(DOP: ENDOP)
IB
Inference-Based
(Fragment Grammars)
−4 −2 2 4 6 8
Log Odds Correct
Irregular
Regular
Unattested
FP FP FPFL FL FL E1 E1 E1 E2 E2 E2 IB IB IB
102 FP
Full-Parsing
(Multinomial-Dirichlet CFG)
FL
Full-Listing
(Adaptor Grammars)
E1
Exemplar
(Data-Oriented Parsing 1)
E2
Exemplar
(DOP: ENDOP)
IB
Inference-Based
(Fragment Grammars)
−4 −2 2 4 6 8
Log Odds Correct
Irregular
Regular
Unattested
FP FP FPFL FL FL E1 E1 E1 E2 E2 E2 IB IB IB
103 FP
Full-Parsing
(Multinomial-Dirichlet CFG)
FL
Full-Listing
(Adaptor Grammars)
E1
Exemplar
(Data-Oriented Parsing 1)
E2
Exemplar
(DOP: ENDOP)
IB
Inference-Based
(Fragment Grammars)
−4 −2 2 4 6 8
Log Odds Correct
Irregular
Regular
Unattested
FP FP FPFL FL FL E1 E1 E1 E2 E2 E2 IB IB IB 104 FP
Full-Parsing
(Multinomial-Dirichlet CFG)
FL
Full-Listing
(Adaptor Grammars)
E1
Exemplar
(Data-Oriented Parsing 1)
E2
Exemplar
(DOP: ENDOP)
IB
Inference-Based
(Fragment Grammars)
−4 −2 2 4 6 8
Log Odds Correct
Irregular
Regular
Unattested
FP FP FPFL FL FL E1 E1 E1 E2 E2 E2 IB IB IB
105 FP
Full-Parsing
(Multinomial-Dirichlet CFG)
FL
Full-Listing
(Adaptor Grammars)
E1
Exemplar
(Data-Oriented Parsing 1)
E2
Exemplar
(DOP: ENDOP)
IB
Inference-Based
(Fragment Grammars)
−4 −2 2 4 6 8
Log Odds Correct
Irregular
Regular
Unattested
FP FP FPFL FL FL E1 E1 E1 E2 E2 E2 IB IB IB FP
Full-Parsing
(Multinomial-Dirichlet CFG)
FL
Full-Listing
(Adaptor Grammars)
E1
Exemplar
(Data-Oriented Parsing 1)
E2
Exemplar
(DOP: ENDOP)
IB
Inference-Based
(Fragment Grammars)
106
−4 −2 2 4 6 8
Log Odds Correct
Irregular
Regular
Unattested
FP FP FPFL FL FL E1 E1 E1 E2 E2 E2 IB IB IB
107 FP
Full-Parsing
(Multinomial-Dirichlet CFG)
FL
Full-Listing
(Adaptor Grammars)
E1
Exemplar
(Data-Oriented Parsing 1)
E2
Exemplar
(DOP: ENDOP)
IB
Inference-Based
(Fragment Grammars)
108
109
110
111
112
113
114
(Kiparsky, 1973; Anderson, 1969; Kiparsky, 1982a; Andrews, 1982)
stipulation (cf. subset principle, premption, etc.).
derivations, prefer the one with highest P(form | meaning) more “tightly.”
computed, etc.
Past Tense Derivational Morphology
Productive +ed (walked) +ness (goodness) Context-Dependent I →æ (sang) +ity (ability) Unproductive suppletion
(go/went)
+th (width)
(Hay, 2002; Hay and Plag, 2004; Plag et al, 2009).
words.
anything, including morphologically-complex stored forms.
(Anshen & Aronoff, 1981; Aronoff & Schvaneveldt, 1978; Cutler, 1980)
(Aronoff & Schvaneveldt, 1978).
(Anshen & Aronoff, 1981).
−5 5
ive ive ive ive ive ive ble ble ble ble ble ble
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Predicted
−5 5
ive ive ive ive ive ive ble ble ble ble ble ble
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Predicted
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Predicted
−5 5
ive ive ive ive ive ive ble ble ble ble ble ble
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Predicted
−5 5
ive ive ive ive ive ive ble ble ble ble ble ble
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Predicted
−5 5
ive ive ive ive ive ive ble ble ble ble ble ble
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Predicted
(Multinomial-Dirichlet Context-Free Grammar)
−5 5
ive ive ive ive ive ive ble ble ble ble ble ble
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Predicted
−5 5
ive ive ive ive ive ive ble ble ble ble ble ble
Exemplar
(DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
(Adaptor Grammars)
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Predicted
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
−5 5
ive ive ive ive ive ive ble ble ble ble ble ble
(Data-Oriented Parsing 1)
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(FrMAGment Grammars)
Predicted
−5 5
ive ive ive ive ive ive ble ble ble ble ble ble
(Data-Oriented Parsing: Goodman Estimator)
−5 5
ive ive ive ive ive ive ble ble ble ble ble ble
Full-Parsing
(MDPCFG)
Full-Listing
(Adaptor Grammars) Exemplar (DOP1)
Exemplar
(GDMN)
Inference
(Fragment Grammars)
Predicted
(Fragment Grammars)
types.
compute as an inference.
generalizations.