Grammatical inference and subregular phonology
Adam Jardine Rutgers University December 9, 2019 · Tel Aviv University
“[V]arious formal and substantive universals are intrinsic properties of the language-acquisition system, these providing a schema that is applied to data and that determines in a highly restricted way the general form and, in part, even the substantive features of the grammar that may emerge upon presentation of appropriate data.”
Chomsky, Aspects

“[I]f an algorithm performs well on a certain class of problems then it necessarily pays for that with degraded performance on the set of all remaining problems.”
Wolpert and Macready (1997), NFL Thms.
computational universals
these universals
learning procedures that derive from these universals
– that directly connect linguistic universals with learning
– whose behavior in the general case is well-understood
– that make typological and psycholinguistic predictions
By the end of this course, you should be able to engage with the literature and start your own research project!
Jeff Heinz (Stony Brook), Jim Rogers (Earlham), Rémi Eyraud (Marseilles), Jane Chandlee (Haverford), Kevin McMullin (Ottawa)
...at Rutgers:
Eileen Blum, Chris Oakden, Nate Koser, Dine Mamadou, Wenyue Hua, Huteng Dai
What is learning?
‘learned’ something?
language?
[Diagram: grammar → language → finite sample → learner → grammar′ → language′]
Grammatical inference
[Diagram: an Oracle with model of language MO and a Learner with model of language ML, linked by information and requests (from Heinz et al., 2016)]
Problem: Given a positive sample of a language, return a grammar that describes that language exactly.
What is a pattern?
– Well-formedness (phonotactics): *NC̥
– Transformations (processes): /NC̥/ → [NC̬]
*NC̥
well-formed: {an, anda, amba, lalalalanda, blIk, ffffff, ...}
ill-formed: {anta, ampa, lalalalaNka, ...}
/NC̥/ → [NC̬]
{(an, an), (anda, anda), (anta, anda), (lalalalampa, lalalalamba), ...}
C → [+voice] / N __ ≈ *NC̥ ≫ Id[±voice]
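The transformation above can be read as a function from underlying forms to surface forms. The following is a minimal sketch of that reading, not the slides' own formalization. The segment classes are assumptions made for illustration: `NASALS` treats n, m, and N (the velar nasal in the slides' transcription) as nasals, and `VOICE` pairs each voiceless stop with a voiced counterpart.

```python
# Assumed segment classes (hypothetical, chosen to cover the slide's examples).
NASALS = set("nmN")                      # N stands for the velar nasal
VOICE = {"p": "b", "t": "d", "k": "g"}   # voiceless stop -> voiced counterpart


def postnasal_voicing(underlying: str) -> str:
    """Apply C -> [+voice] / N __ to every segment of an underlying form."""
    surface = []
    for i, seg in enumerate(underlying):
        # A voiceless stop is voiced only when the preceding segment is a nasal.
        if i > 0 and underlying[i - 1] in NASALS and seg in VOICE:
            surface.append(VOICE[seg])
        else:
            surface.append(seg)
    return "".join(surface)


# Build (underlying, surface) pairs like the relation listed on the slide.
pairs = [(w, postnasal_voicing(w)) for w in ["an", "anda", "anta", "lalalalampa"]]
```

Under these assumptions, `pairs` contains the same four pairs that the slide lists as members of the relation.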
then move on to (functional) relations.
Formal languages
{0, 1}
{a, b, c}
{a, b, c, ..., æ, B, O, ..., z}
{N, V, Adj, ..., C}
Σ = {a, b, c}
Σ∗ = {λ, a, b, c, aa, ab, ac, ba, bb, bc, ca, cb, cc, aaa, aab, aac, ..., abbaaacccbabacb, ...}
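Since Σ∗ is infinite, it can only be enumerated lazily. The sketch below lists strings over an alphabet shortest first, which is one way to reproduce the ordering shown above. The function name `sigma_star` is hypothetical, and the empty Python string plays the role of λ.

```python
from itertools import count, islice, product


def sigma_star(alphabet):
    """Yield every string over `alphabet`, shortest first (λ is the empty string)."""
    symbols = sorted(alphabet)
    for n in count(0):                       # lengths 0, 1, 2, ...
        for tup in product(symbols, repeat=n):
            yield "".join(tup)


# Take the first 13 strings: 1 of length 0, 3 of length 1, 9 of length 2.
first = list(islice(sigma_star({"a", "b", "c"}), 13))
```

Here `first` starts with the empty string and ends with `"cc"`, matching the beginning of the listing of Σ∗.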
– {b}
– (ab)^n = {λ, ab, abab, ababab, ...}
– a^n b^n = {λ, ab, aabb, aaabbb, aaaabbbb, ...}
– ...
(ab)^n
λ → ⊤
a → ⊥
b → ⊥
aa → ⊥
ab → ⊤
...
abaa → ⊥
abab → ⊤
abba → ⊥
...
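The table treats a language as a function from strings to truth values. As a small illustration, this membership function for (ab)^n returns `True` for ⊤ and `False` for ⊥. The name `in_ab_n` is hypothetical.

```python
def in_ab_n(w: str) -> bool:
    """Return True iff w consists of zero or more repetitions of 'ab'."""
    n, remainder = divmod(len(w), 2)
    # w must have even length and equal 'ab' repeated n times.
    return remainder == 0 and w == "ab" * n


# Evaluate the function on the strings shown in the table.
table = {w: in_ab_n(w) for w in ["", "a", "b", "aa", "ab", "abaa", "abab", "abba"]}
```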
Formal language classes
[Diagram: nested classes, with Fin inside the computable languages inside all possible languages]
The strictly local languages
*NC̥ language?¹ {an, anda, amba, lalalalanda, blIk, ffffff, ...}
NC̥ sequences! {anta, ampa, lalalalaNka, ...}
¹ Σ = {a, b, c, ..., æ, B, O, ..., z}
[Figure: the string w = a b b a b, decomposed as w = v1 u v2]
[Figure: w = a b b a b with boundary markers: ⋊ a b b a b ⋉, a b b a b ⋉, ⋊ a b b a b]
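One common way to make the substring and boundary-marker pictures concrete is to collect the length-k substrings of ⋊w⋉, often called k-factors in the subregular literature. The sketch below assumes that convention. The function name `k_factors` and the treatment of strings shorter than k (returning the whole padded string) are assumptions, not taken from the slides.

```python
LEFT, RIGHT = "⋊", "⋉"   # word-boundary markers


def k_factors(w: str, k: int) -> set:
    """Return the set of length-k substrings of ⋊w⋉."""
    padded = LEFT + w + RIGHT
    if len(padded) <= k:
        # Assumed convention: a padded string no longer than k is its own factor.
        return {padded}
    return {padded[i:i + k] for i in range(len(padded) - k + 1)}


factors = k_factors("abbab", 2)
```

For w = abbab and k = 2, `factors` contains ⋊a, ab, bb, ba, and b⋉.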
G = {bb, aa}

w       w ⊨ G?
λ       ⊤
a       ⊥
b       ⊥
aa      ⊥
ab      ⊤
aaa     ⊥
aab     ⊥
aba     ⊥
abb     ⊥
baa     ⊥
aaaa    ⊥
...
abab    ⊤
abba    ⊥
baba    ⊤
...
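A minimal sketch of checking w ⊨ G, assuming that G is a set of forbidden 2-factors and that a string satisfies G when none of the 2-factors of ⋊w⋉ is in G. The names `satisfies`, `G_SLIDE`, and `G_ASSUMED` are hypothetical. With only the factors listed above, a and b would be accepted. `G_ASSUMED` therefore adds the boundary factors ⋊b and a⋉, which is an assumption and not something shown on the slide. It agrees with every row of the table except baba, which it rejects.

```python
LEFT, RIGHT = "⋊", "⋉"   # word-boundary markers


def satisfies(w: str, forbidden: set, k: int = 2) -> bool:
    """Return True iff no length-k substring of ⋊w⋉ is in `forbidden`."""
    padded = LEFT + w + RIGHT
    factors = {padded[i:i + k] for i in range(len(padded) - k + 1)}
    return factors.isdisjoint(forbidden)


G_SLIDE = {"bb", "aa"}                            # as listed on the slide
G_ASSUMED = G_SLIDE | {LEFT + "b", "a" + RIGHT}   # assumption: also ban ⋊b and a⋉

# Evaluate a few strings under the assumed grammar.
results = {w: satisfies(w, G_ASSUMED) for w in ["", "a", "b", "ab", "aba", "abab", "abba"]}
```

Comparing `satisfies(w, G_SLIDE)` with `satisfies(w, G_ASSUMED)` on short strings shows how much work the boundary factors do.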
grammar for some k
[Diagram: the classes Fin and SL within the computable languages]
classes:
– Strictly piecewise (SP) languages (Heinz, 2010)
– Tier-based strictly local (TSL) languages (Heinz et al., 2011; McMullin, 2016)
Review
Problem: Given a positive sample of a language, return a grammar that describes that language exactly.
problem
similar solution