Grammatical inference and subregular phonology Adam Jardine - - PowerPoint PPT Presentation

grammatical inference and subregular phonology
SMART_READER_LITE
LIVE PREVIEW

Grammatical inference and subregular phonology Adam Jardine - - PowerPoint PPT Presentation

Grammatical inference and subregular phonology Adam Jardine Rutgers University December 11, 2019 Tel Aviv University Review Outline of course Day 1: Learning, languages, and grammars Day 2: Learning strictly local grammars Day 3:


slide-1
SLIDE 1

Grammatical inference and subregular phonology

Adam Jardine Rutgers University December 11, 2019 · Tel Aviv University

slide-2
SLIDE 2

Review

slide-3
SLIDE 3

Outline of course

  • Day 1: Learning, languages, and grammars
  • Day 2: Learning strictly local grammars
  • Day 3: Automata and input strictly local functions
  • Day 4: Learning functions and stochastic patterns, other
  • pen questions

2

slide-4
SLIDE 4

Review of days 1 & 2

  • Phonological patterns are governed by restrictive

computational universals

  • We studied one such universal of strict locality

3

slide-5
SLIDE 5

Review of days 1 & 2

  • We studied learning SL languages under the paradigm of

identification in the limit from positive data

L⋆ t p(t) abab 1 ababab 2 ab . . . . . . i λ . . . . . . p[i] A Gi

4

slide-6
SLIDE 6

Today

  • Learning with finite-state automata for

– strictly local languages – input-strictly local functions 5

slide-7
SLIDE 7

Strictly local acceptors

slide-8
SLIDE 8

Strictly local acceptors

Engelfriet & Hoogeboom, 2001 “It is always a pleasant surprise when two formalisms, intro- duced with different motivations, turn out to be equally pow- erful, as this indicates that the underlying concept is a natural

  • ne.”

(p. 216) 6

slide-9
SLIDE 9

Strictly local acceptors

  • A finite-state acceptor (FSA) is a set of states and

transitions between states 1 a a b b 7

slide-10
SLIDE 10

Strictly local acceptors

1 a a b b a b b a 8

slide-11
SLIDE 11

Strictly local acceptors

1 a a b b a b b a 0 → 1 8

slide-12
SLIDE 12

Strictly local acceptors

1 a a b b a b b a 0 → 1 → 1 8

slide-13
SLIDE 13

Strictly local acceptors

1 a a b b a b b a 0 → 1 → 1 → 1 8

slide-14
SLIDE 14

Strictly local acceptors

1 a a b b a b b a 0 → 1 → 1 → 1 → 0 8

slide-15
SLIDE 15

Strictly local acceptors

1 a a b b a b b a 0 → 1 → 1 → 1 → 0 8

slide-16
SLIDE 16

Strictly local acceptors

1 a a b b b a a b b a 9

slide-17
SLIDE 17

Strictly local acceptors

1 a a b b b a a b b a 0 → 0 9

slide-18
SLIDE 18

Strictly local acceptors

1 a a b b b a a b b a 0 → 0 → 1 9

slide-19
SLIDE 19

Strictly local acceptors

1 a a b b b a a b b a 0 → 0 → 1 → 0 9

slide-20
SLIDE 20

Strictly local acceptors

1 a a b b b a a b b a 0 → 0 → 1 → 0 → 0 9

slide-21
SLIDE 21

Strictly local acceptors

1 a a b b b a a b b a 0 → 0 → 1 → 0 → 0 → 0 9

slide-22
SLIDE 22

Strictly local acceptors

1 a a b b b a a b b a 0 → 0 → 1 → 0 → 0 → 0 → 1 9

slide-23
SLIDE 23

Strictly local acceptors

1 a a b b b a a b b a 0 → 0 → 1 → 0 → 0 → 0 → 1 ✗ 9

slide-24
SLIDE 24

Strictly local acceptors

  • A SLkFSA’s states represent the k − 1 factors of Σ∗

1 a a b b 1 b a Not SLk for any k SL2; 0 = b, 1 = a 10

slide-25
SLIDE 25

Strictly local acceptors

  • Traversing a SLkFSA is equivalent to scanning for k factors

a b b a ⋊ a b a b ⋉ 11

slide-26
SLIDE 26

Strictly local acceptors

  • Traversing a SLkFSA is equivalent to scanning for k factors

a b b a ⋊ a b a b ⋉ 11

slide-27
SLIDE 27

Strictly local acceptors

  • Traversing a SLkFSA is equivalent to scanning for k factors

a b b a ⋊ a b a b ⋉ 11

slide-28
SLIDE 28

Strictly local acceptors

  • Traversing a SLkFSA is equivalent to scanning for k factors

a b b a ⋊ a b a b ⋉ 11

slide-29
SLIDE 29

Strictly local acceptors

  • Traversing a SLkFSA is equivalent to scanning for k factors

a b b a ⋊ a b a b ⋉ 11

slide-30
SLIDE 30

Strictly local acceptors

  • Traversing a SLkFSA is equivalent to scanning for k factors

a b b a ⋊ a b a b ⋉ 11

slide-31
SLIDE 31

Strictly local acceptors

  • Forbidden k-factors are expressed by missing

transitions/accepting states a b b a ⋊ a b b ⋉ ✗ 12

slide-32
SLIDE 32

Strictly local acceptors

  • SLFSAs describe exactly the SL languages
  • Thus, they capture the same concept of locality as SL

grammars, but in a different way 13

slide-33
SLIDE 33

Learning with strictly local acceptors

slide-34
SLIDE 34

Learning with strictly local acceptors

  • Finite-state automata are useful because they have a

number of learning techniques

(de la Higuera, 2010)

  • We’ll use a ‘transition filling’ of Heinz and Rogers (2013)

14

slide-35
SLIDE 35

Learning with strictly local acceptors

1 a b b a 15

slide-36
SLIDE 36

Learning with strictly local acceptors

0 : ⊥ ⊥ ⊥ 1 : ⊤ ⊤ ⊤ a : ⊤ ⊤ ⊤ b : ⊤ ⊤ ⊤ b : ⊥ ⊥ ⊥ a : ⊥ ⊥ ⊥

  • output function

Q × Σ → {⊤, ⊥}

  • ending function

Q → {⊤, ⊥} 15

slide-37
SLIDE 37

Learning with strictly local acceptors

⋊ : ⊥ C : ⊥ V : ⊥ C : ⊥ V : ⊥ V : ⊥ C : ⊥ V : ⊥ C : ⊥

Learning procedure:

  • Start with ‘empty’ SLkFSA
  • Change ⊥ transitions to ⊤ when traversed by input data

16

slide-38
SLIDE 38

Learning with strictly local acceptors

data 0 CV

⋊ : ⊥ C : ⊥ V : ⊥ C : ⊥ V : ⊥ V : ⊥ C : ⊥ V : ⊥ C : ⊥

Learning procedure:

  • Start with ‘empty’ SLkFSA
  • Change ⊥ transitions to ⊤ when traversed by input data

16

slide-39
SLIDE 39

Learning with strictly local acceptors

data 0 CV

⋊ : ⊥ C : ⊥ V : ⊤ C : ⊤ V : ⊥ V : ⊤ C : ⊥ V : ⊥ C : ⊥

Learning procedure:

  • Start with ‘empty’ SLkFSA
  • Change ⊥ transitions to ⊤ when traversed by input data

16

slide-40
SLIDE 40

Learning with strictly local acceptors

data 0 CV 1 V

⋊ : ⊥ C : ⊥ V : ⊤ C : ⊤ V : ⊤ V : ⊤ C : ⊥ V : ⊥ C : ⊥

Learning procedure:

  • Start with ‘empty’ SLkFSA
  • Change ⊥ transitions to ⊤ when traversed by input data

16

slide-41
SLIDE 41

Learning with strictly local acceptors

data 0 CV 1 V 2 CV CV

⋊ : ⊥ C : ⊥ V : ⊤ C : ⊤ V : ⊤ V : ⊤ C : ⊤ V : ⊥ C : ⊥

Learning procedure:

  • Start with ‘empty’ SLkFSA
  • Change ⊥ transitions to ⊤ when traversed by input data

16

slide-42
SLIDE 42

Learning with strictly local acceptors

data 0 CV 1 V 2 CV CV

⋊ : ⊥ C : ⊥ V : ⊤ C : ⊤ V : ⊤ V : ⊤ C : ⊤ V : ⊥ C : ⊥

Learning procedure:

  • Start with ‘empty’ SLkFSA
  • Change ⊥ transitions to ⊤ when traversed by input data

16

slide-43
SLIDE 43

Learning with strictly local acceptors

⋊ : ⊥ C : ⊥ V : ⊥ C : ⊥ V : ⊥ V : ⊥ C : ⊥ V : ⊥ C : ⊥

  • Any SL2 language can be described by varying {⊤, ⊥} on this

structure 17

slide-44
SLIDE 44

Learning with strictly local acceptors

⋊ : ⊥ C : ⊤ V : ⊤ C : ⊤ V : ⊤ V : ⊤ C : ⊤ V : ⊤ C : ⊥

  • Any SL2 language can be described by varying {⊤, ⊥} on this

structure 17

slide-45
SLIDE 45

Learning with strictly local acceptors

⋊ : ⊥ C : ⊥ V : ⊤ C : ⊤ V : ⊤ V : ⊤ C : ⊤ V : ⊥ C : ⊥

  • Any SL2 language can be described by varying {⊤, ⊥} on this

structure 17

slide-46
SLIDE 46

Learning with strictly local acceptors

⋊ : ⊥ C : ⊥ V : ⊥ CC : ⊥ CV : ⊥ V C : ⊥ V V : ⊥ C : ⊥ V : ⊥ C : ⊥ V : ⊥ C : ⊥ V : ⊥ C : ⊥ V : ⊥ C : ⊥ V : ⊥ V : ⊥ C : ⊥ C : ⊥ V : ⊥

  • Any SL3 language can be described by this structure

18

slide-47
SLIDE 47

Learning with strictly local acceptors

  • This procedure ILPD-learns any SLk language for a given k
  • It is distinct, yet based on the same notion of locality

19

slide-48
SLIDE 48

Input strictly local functions

slide-49
SLIDE 49

Input strictly local functions

  • Generative phonology is primarily interested in maps

/kam-pa/ → [kamba]

/kam-pa/ b C → [+voi] / N [kamba] /kam-pa/ Faith *NC ˇ ID(voi) [kampa] *! [kama] *! ☞ [kamba] *

20

slide-50
SLIDE 50

Input strictly local functions

  • Maps are (functional) relations

/NC ˚ / → [NC ˇ] {(an, an), (anda, anda), (anta, anda), (lalalalampa, lalalalamba),...}

  • We can study classes of relations like we studied classes of

formal languages 21

slide-51
SLIDE 51

Input strictly local functions

  • Johnson (1972); Kaplan and Kay (1994): phonological maps are

regular

memory length of w memory length of w

regular non-regular

  • Regular functions = regular languages!

22

slide-52
SLIDE 52

Input strictly local functions

computable functions Reg

  • How do we extend strict locality to functions?

23

slide-53
SLIDE 53

Input strictly local functions

computable functions Reg Subseq

  • How do we extend strict locality to functions?
  • Phonological maps are subsequential...

(Mohri, 1997; Heinz and Lai, 2013, et seq.)

23

slide-54
SLIDE 54

Subsequential transducers

0 : ⊥ ⊥ ⊥ 1 : ⊤ ⊤ ⊤ a : ⊤ ⊤ ⊤ b : ⊤ ⊤ ⊤ b : ⊥ ⊥ ⊥ a : ⊥ ⊥ ⊥ Deterministic acceptor:

  • output function

Q × Σ → {⊤, ⊥}

  • ending function

Q → {⊤, ⊥} 24

slide-55
SLIDE 55

Subsequential transducers

0 : d 1 : λ a : a b : c b : b a : a Subsequential transducer:

  • output function

Q × Σ → Γ∗

  • ending function

Q → Γ∗ 24

slide-56
SLIDE 56

Subsequential transducers

0 : d 1 : λ a : a b : c b : b a : a b a b b 25

slide-57
SLIDE 57

Subsequential transducers

0 : d 1 : λ a : a b : c b : b a : a b a b b 0 → 0 b 25

slide-58
SLIDE 58

Subsequential transducers

0 : d 1 : λ a : a b : c b : b a : a b a b b 0 → 0 → 1 b a 25

slide-59
SLIDE 59

Subsequential transducers

0 : d 1 : λ a : a b : c b : b a : a b a b b 0 → 0 → 1 → 0 b a c 25

slide-60
SLIDE 60

Subsequential transducers

0 : d 1 : λ a : a b : c b : b a : a b a b b 0 → 0 → 1 → 0 → 0 b a c b 25

slide-61
SLIDE 61

Subsequential transducers

0 : d 1 : λ a : a b : c b : b a : a b a b b 0 → 0 → 1 → 0 → 0 b a c b d 25

slide-62
SLIDE 62

Subsequential transducers

Let’s do some examples... 26

slide-63
SLIDE 63

Input strictly local functions

  • ISL transducers are SFSTs whose states represent k − 1

suffixes.

(Chandlee, 2014; Chandlee and Heinz, 2018)

SL acceptor: a b b a ⋊ a b a b ⋉ ISL transducer:

a : λ b : λ b : b a : b a : a b : b

⋊ a b a b ⋉ a b b b 27

slide-64
SLIDE 64

Input strictly local functions

  • 94% of the processes in P-Base (Mielke, 2004) are ISL (Chandlee

and Heinz, 2018).

  • Others are output strictly local (Chandlee, 2014) or are

non-local, but subsequential (Luo, 2017; Payne, 2017) 28

slide-65
SLIDE 65

Review

  • SL acceptors exactly capture the SL notion of locality
  • Learning with SL acceptors takes advantage of their shared

state structure

  • The ISL functions extend this structure from languages to

functions 29