Grammatical inference and subregular phonology
Adam Jardine Rutgers University December 11, 2019 · Tel Aviv University
Grammatical inference and subregular phonology Adam Jardine - - PowerPoint PPT Presentation
Grammatical inference and subregular phonology Adam Jardine Rutgers University December 11, 2019 Tel Aviv University Review [V]arious formal and substantive universals are intrinsic properties of the language-acquisition system, these
Adam Jardine Rutgers University December 11, 2019 · Tel Aviv University
“[V]arious formal and substantive universals are intrinsic properties of the language-acquisition system, these providing a schema that is applied to data and that determines in a highly restricted way the general form and, in part, even the substantive features of the grammar that may emerge upon presentation of appropriate data.” Chomsky, Aspects “[I]f an algorithm performs well on a certain class of problems then it necessarily pays for that with degraded performance on the set of all remaining problems.” Wolpert and Macready (1997), NFL Thms.
2
computable languages Reg SL phonotactics computable functions Reg Subseq ISL processes
3
Review
structure that can be used by a learner
⋊ : ⊥ C : ⊥ V : ⊤ C : ⊤ V : ⊤ V : ⊤ C : ⊤ V : ⊥ C : ⊥
4
Review
structure that can be used by a learner data 0 CV 1 V 2 CV CV
⋊ : ⊥ C : ⊥ V : ⊤ C : ⊤ V : ⊤ V : ⊤ C : ⊤ V : ⊥ C : ⊥
4
Review
structure that can be used by a learner
⋊ : λ C : λ V : λ C : C V : V V : V C : C V : V C : C
5
Review
structure that can be used by a learner
⋊ : λ C : V V : λ C : C V : V V : V C : C V : V C : V C
5
Review
structure that can be used by a learner
⋊ : λ C : V V : λ C : C V : V V : V C : C V : λ C : V C
5
Review
structure that can be used by a learner
⋊ : C : V : C : V : V : C : V : C :
5
Today
– ISL functions – SL distributions
6
Learning input strictly local functions
t datum V 1 CV CV 2 CV V CV CV . . .
7
Learning input strictly local functions
t datum V 1 CV CV 2 CV V CV CV . . .
t datum (C, CV ) 1 (CV C, CV CV ) 2 (CV CV, CV CV ) . . .
7
Learning input strictly local functions
t datum (C, CV ) 1 (CV C, CV CV ) 2 (CV CV, CV CV ) 3 (V CV C, V CV CV )
?
− − →
⋊ : C : V : C : V : V : C : V : C :
8
Learning input strictly local functions
a set of strings lcp({CV CV, CV CCV, CV CV C}) =
9
Learning input strictly local functions
a set of strings lcp({CV CV, CV CCV, CV CV C}) = CV C lcp({CV CV, CCV CV, CV CV C}) =
9
Learning input strictly local functions
a set of strings lcp({CV CV, CV CCV, CV CV C}) = CV C lcp({CV CV, CCV CV, CV CV C}) = C
9
Learning input strictly local functions
a set of strings lcp({CV CV, CV CCV, CV CV C}) = CV C lcp({CV CV, CCV CV, CV CV C}) = C
(CV, CV ) (CV C, CV C) (CV CV C, CV CV C) (V CV V C, V CV C) (V CV V, V CV ) dp(w) = lcp(d(wΣ∗))
10
Learning input strictly local functions
a set of strings lcp({CV CV, CV CCV, CV CV C}) = CV C lcp({CV CV, CCV CV, CV CV C}) = C
(CV, CV ) (CV C, CV C) (CV CV C, CV CV C) (V CV V C, V CV C) (V CV V, V CV ) dp(w) = lcp(d(wΣ∗)) dp(CV C) = ...
10
Learning input strictly local functions
a set of strings lcp({CV CV, CV CCV, CV CV C}) = CV C lcp({CV CV, CCV CV, CV CV C}) = C
(CV, CV ) (CV C, CV C) (CV CV C, CV CV C) (V CV V C, V CV C) (V CV V, V CV ) dp(w) = lcp(d(wΣ∗)) dp(CV C) = CV C dp(V CV V ) = ...
10
Learning input strictly local functions
a set of strings lcp({CV CV, CV CCV, CV CV C}) = CV C lcp({CV CV, CCV CV, CV CV C}) = C
(CV, CV ) (CV C, CV C) (CV CV C, CV CV C) (V CV V C, V CV C) (V CV V, V CV ) dp(w) = lcp(d(wΣ∗)) dp(CV C) = CV C dp(V CV V ) = V CV
10
Learning input strictly local functions
(CV, CV ) (CV C, CV C) (CV CV C, CV CV C) (V CV V C, V CV C) (V CV V, V CV ) dp(w) = lcp(d(wΣ∗)) dp(CV C) = CV C dp(V CV V ) = V CV dw(u) = dp(w)−1d(wu)
11
Learning input strictly local functions
(CV, CV ) (CV C, CV C) (CV CV C, CV CV C) (V CV V C, V CV C) (V CV V, V CV ) dp(w) = lcp(d(wΣ∗)) dp(CV C) = CV C dp(V CV V ) = V CV dw(u) = dp(w)−1d(wu) dCV (C) = dp(CV )−1d(CV C) = (CV )−1CV C = C
11
Learning input strictly local functions
(CV, CV ) (CV C, CV C) (CV CV C, CV CV C) (V CV V C, V CV C) (V CV V, V CV ) dp(w) = lcp(d(wΣ∗)) dp(CV C) = CV C dp(V CV V ) = V CV dw(u) = dp(w)−1d(wu) dCV (C) = dp(CV )−1d(CV C) = (CV )−1CV C = C dV CV (V ) = dp(V CV )−1d(V CV V ) = (V CV )−1V CV = λ
11
Learning input strictly local functions
(CV, CV ) (CV C, CV C) (CV CV C, CV CV C) (V CV V C, V CV C) (V CV V, V CV ) dp(w) = lcp(d(wΣ∗)) dp(CV C) = CV C dp(V CV V ) = V CV dw(u) = dp(w)−1d(wu) dCV (C) = dp(CV )−1d(CV C) = (CV )−1CV C = C dV CV (V ) = dp(V CV )−1d(V CV V ) = (V CV )−1V CV = λ dp
w(u) = lcp(dw(uΣ∗))
11
(CV C, CV C) (CV V, CV ) (CV CCV, CV CCV ) (CCV CC, CCV CC) (CCCV CV, CCCV CV ) (CV V CV, CV CV ) (V, V )
⋊ : C : V : C : V : V : C : V : C :
12
(CV C, CV C) (CV V, CV ) (CV CCV, CV CCV ) (CCV CC, CCV CC) (CCCV CV, CCCV CV ) (CV V CV, CV CV ) (V, V )
⋊ : C : V : C : C V : V : C : V : C :
dp
λ(C) = C
12
(CV C, CV C) (CV V, CV ) (CV CCV, CV CCV ) (CCV CC, CCV CC) (CCCV CV, CCCV CV ) (CV V CV, CV CV ) (V, V )
⋊ : C : V : C : C V : V : C : V : C : C
dp
C(C) = C
12
(CV C, CV C) (CV V, CV ) (CV CCV, CV CCV ) (CCV CC, CCV CC) (CCCV CV, CCCV CV ) (CV V CV, CV CV ) (V, V )
⋊ : C : V : C : C V : V : V C : V : C : C
dp
C(V ) = V
12
(CV C, CV C) (CV V, CV ) (CV CCV, CV CCV ) (CCV CC, CCV CC) (CCCV CV, CCCV CV ) (CV V CV, CV CV ) (V, V )
⋊ : C : V : C : C V : V : V C : C V : C : C
dp
CV (C) = C
12
(CV C, CV C) (CV V , CV ) (CV CCV, CV CCV ) (CCV CC, CCV CC) (CCCV CV, CCCV CV ) (CV V CV, CV CV ) (V, V )
⋊ : C : V : C : C V : V : V C : C V : λ C : C
dp
CV (V ) = λ
12
(CV C, CV C) (CV V, CV ) (CV CCV, CV CCV ) (CCV CC, CCV CC) (CCCV CV, CCCV CV ) (CV V CV, CV CV ) (V , V )
⋊ : C : V : C : C V : V V : V C : C V : λ C : C
dp
λ(V ) = V
12
(CV C, CV C) (CV V, CV ) (CV CCV, CV CCV ) (CCV CC, CCV CC) (CCCV CV, CCCV CV ) (CV V CV, CV CV ) (V , V )
⋊ : C : λ V : λ C : C V : V V : V C : C V : λ C : C
dp(CV C)−1d(CV C) = λ, dp(V )−1d(V ) = λ
12
(CV C, CV C) (CV V, CV ) (CV CCV, CV CCV ) (CCV CC, CCV CC) (CCCV CV, CCCV CV ) (CV V CV, CV CV ) (V, V )
⋊ : C : λ V : λ C : C V : V V : V C : C V : λ C : C
12
Learning input strictly local functions
ILPD-learns the ISLk functions
⋊ : C : V : C : V : V : C : V : C :
structure (Jardine et al., 2014)
13
Learning input strictly local functions
properties of phonological patterns
McMullin, 2019) use a similar (yet distinct) method
progress) based on Beros and de la Higuera (2016)
14
Learning strictly local distributions
⋊ : 0.0 C : 0.2 V : 0.5 C : 0.6 V : 0.4 V : 0.6 C : 0.2 V : 0.1 C : 0.4
15
Learning strictly local distributions
CV C CV V CV CCV CV CV C CV CV CV V CV ⋊ : 0 C : 0 V : 0 C : 0 V : 0 V : 0 C : 0 V : 0 C : 0
16
Learning strictly local distributions
CV C CV V CV CCV CV CV C CV CV CV V CV ⋊ : 0 C : 2 V : 4 C : 6 V : 0 V : 10 C : 1 V : 2 C : 6
16
Learning strictly local distributions
CV C CV V CV CCV CV CV C CV CV CV V CV ⋊ : 0
6
C : 2
13
V : 4
12
C : 6
6
V : 0
6
V : 10
13
C : 1
13
V : 2
12
C : 6
12
16
Learning structured distributions
– Learning strictly piecewise distributions: Heinz and Rogers (2010) – Learning SL distributions over features: Heinz and Koirala (2010) – ...
17
Review
identify structural properties for learning: – phonotactics – processes – stochastic generalizations
predictions from learning
18
Open questions
(Jardine, 2016; Strother-Garcia, 2017)
(Strother-Garcia et al., 2016)
(Chandlee et al., 2019)
(wide open)
19
Open questions
https://github.com/alenaks/SigmaPie
20