Learning Automata over Large Alphabets Oded Maler Irini Eleftheria - - PowerPoint PPT Presentation

learning automata over large alphabets
SMART_READER_LITE
LIVE PREVIEW

Learning Automata over Large Alphabets Oded Maler Irini Eleftheria - - PowerPoint PPT Presentation

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions Learning Automata over Large Alphabets Oded Maler Irini Eleftheria Mens CNRS-V ERIMAG University of Grenoble EQINOCS Workshop,


slide-1
SLIDE 1

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Learning Automata over Large Alphabets

Oded Maler Irini Eleftheria Mens

CNRS-VERIMAG University of Grenoble

EQINOCS Workshop, Paris

9 Mai 2016

VERIMAG O Maler - IE Mens 1 / 22

slide-2
SLIDE 2

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Outline

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

VERIMAG O Maler - IE Mens 2 / 22

slide-3
SLIDE 3

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Machine Learning in General

  • Given a sample M = {(x, y) | x ∈ X, y ∈ Y}
  • Find f : X → Y such that f(x) = y, ∀(x, y) ∈ M
  • Predict f(x) for all x ∈ X

1 1 2 3 4 5 6 7 1.5 1.0 0.5 0.0 0.5 1.0 1.5

VERIMAG O Maler - IE Mens 3 / 22

slide-4
SLIDE 4

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Learning Regular Languages

Let Σ be an alphabet and let L ⊆ Σ∗ be a regular language (the target

language)

start q0 q4 3, 4 q6 2 q3 0, 1 1, 2, 3, 4 q2 q5 0, 1, 2, 3, 4 3, 4 2 0, 1 0, 1, 2, 3, 4 1 q1 0, 2, 3, 4 2, 3, 4 1

χL : Σ∗ → {0, 1}

?

  • Edward F Moore, Gedanken-experiments on sequential machines, 1956
  • E. Mark Gold, System Identification via State Characterization, 1972
  • Dana Angluin, Learning regular sets from queries and

counterexamples, 1987

VERIMAG O Maler - IE Mens 4 / 22

slide-5
SLIDE 5

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Active Learning

Learner

learning algorithm

start q0 q4 3, 4 q6 2 q3 0, 1 1, 2, 3, 4 q2 q5 0, 1, 2, 3, 4 3, 4 2 0, 1 0, 1, 2, 3, 4 1 q1 0, 2, 3, 4 2, 3, 4 1

hypothesis

Teacher

L ⊆ Σ∗

Memb(·)

w

?

∈ L +/−

Equiv(·)

L(H)

?

= L cex

Return H

True

VERIMAG O Maler - IE Mens 5 / 22

slide-6
SLIDE 6

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Regular Sets and their Syntactic Congruences

Equivalence relation ∼L on Σ∗ induced by L ⊆ Σ∗ u ∼L v iff ∀w ∈ Σ∗ u · w ∈ L ⇔ v · w ∈ L This relation is a right-congruence with respect to concatenation u ∼ v implies u · w ∼ ·w for all u, v, w ∈ Σ∗

  • [u] is the equivalence class of u
  • Σ∗/∼ is the set of all equivalence classes

Theorem (Myhill-Nerode)

The language L is regular iff ∼L has finitely many congruence classes

VERIMAG O Maler - IE Mens 6 / 22

slide-7
SLIDE 7

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Canonical Representation

The minimal automaton for L is AL = (Σ, Q, q0, δ, F) where

  • Q = Σ∗/∼
  • q0 = [ǫ]
  • δ([u], a) = [u · a]
  • F = {[u] : u · ǫ ∈ L}

AL is homomorphic to any other automaton accepting L

VERIMAG O Maler - IE Mens 7 / 22

slide-8
SLIDE 8

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Observation Table - T = (Σ, S, R, E, f)

definition and properties

E ε a S ε − + a + − b − − ba − − R aa − + ab + − bb + − baa − − bab + −

  • S states of the canonical automaton
  • The words/paths correspond to a

spanning tree

  • R cross- and back-edges/transitions

ε a b aa ab ba bb

···

... a b a b a b a b

VERIMAG O Maler - IE Mens 8 / 22

slide-9
SLIDE 9

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Observation Table - T = (Σ, S, R, E, f)

definition and properties

E ε a S ε − + a + − b − − ba − − R aa − + ab + − bb + − baa − − bab + −

  • S states of the canonical automaton
  • The words/paths correspond to a

spanning tree

  • R cross- and back-edges/transitions

ε a b

a b b a b a

VERIMAG O Maler - IE Mens 8 / 22

slide-10
SLIDE 10

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

VERIMAG O Maler - IE Mens 9 / 22

slide-11
SLIDE 11

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε ε − a b

hypothesis automaton

ε

?

a, b

VERIMAG O Maler - IE Mens 9 / 22

slide-12
SLIDE 12

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε ε − a + b −

hypothesis automaton

ε

?

a, b

VERIMAG O Maler - IE Mens 9 / 22

slide-13
SLIDE 13

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε ε − a + b −

hypothesis automaton

ε

?

a, b

VERIMAG O Maler - IE Mens 9 / 22

slide-14
SLIDE 14

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε ε − a + b −

hypothesis automaton

ε a

? a b

a, b

VERIMAG O Maler - IE Mens 9 / 22

slide-15
SLIDE 15

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε ε − a + b − aa ab

hypothesis automaton

ε a

? a b

a, b

VERIMAG O Maler - IE Mens 9 / 22

slide-16
SLIDE 16

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε ε − a + b − aa − ab +

hypothesis automaton

ε a

? a b

a, b

VERIMAG O Maler - IE Mens 9 / 22

slide-17
SLIDE 17

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε ε − a + b − aa − ab +

hypothesis automaton

ε a

? a b

a, b

VERIMAG O Maler - IE Mens 9 / 22

slide-18
SLIDE 18

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε ε − a + b − aa − ab +

hypothesis automaton

ε a

a b b a

VERIMAG O Maler - IE Mens 9 / 22

slide-19
SLIDE 19

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε ε − a + b − aa − ab +

hypothesis automaton

ε a

a b b a

Ask Equivalence Query:

counterexample: −ba a ∼ ba − → a is a new distinguishing string

VERIMAG O Maler - IE Mens 9 / 22

slide-20
SLIDE 20

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε a ε − a + b − aa − ab +

hypothesis automaton

ε a

a b b a

counterexample: −ba a ∼ ba − → a is a new distinguishing string

VERIMAG O Maler - IE Mens 9 / 22

slide-21
SLIDE 21

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε a ε − + a + − b − − aa − + ab + −

hypothesis automaton

ε a

a b b a

counterexample: −ba a ∼ ba − → a is a new distinguishing string

VERIMAG O Maler - IE Mens 9 / 22

slide-22
SLIDE 22

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε a ε − + a + − b − − aa − + ab + −

hypothesis automaton

ε a

a b b a

counterexample: −ba a ∼ ba − → a is a new distinguishing string

VERIMAG O Maler - IE Mens 9 / 22

slide-23
SLIDE 23

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε a ε − + a + − b − − aa − + ab + − ba bb

hypothesis automaton

ε a b

a b b a

VERIMAG O Maler - IE Mens 9 / 22

slide-24
SLIDE 24

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε a ε − + a + − b − − aa − + ab + − ba − − bb + −

hypothesis automaton

ε a b

a b b a

VERIMAG O Maler - IE Mens 9 / 22

slide-25
SLIDE 25

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε a ε − + a + − b − − aa − + ab + − ba − − bb + −

hypothesis automaton

ε a b

a b b a b a

VERIMAG O Maler - IE Mens 9 / 22

slide-26
SLIDE 26

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

L∗ Example (Σ = {a, b})

  • bservation table

ε a ε − + a + − b − − aa − + ab + − ba − − bb + −

hypothesis automaton

ε a b

a b b a b a

Ask Equivalence Query:

True

VERIMAG O Maler - IE Mens 9 / 22

slide-27
SLIDE 27

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Laguages over Large Alphabets

  • Traditionally automata theory is flat, based on small alphabets,

e.g. {a, b}

  • In verification, for example, we have sequences over a huge

state-space like Bn for very large n

  • Or we want to have languages over numbers or vectors
  • We use symbolic automata with a modest number of states
  • We do not want to enumerate all transitions but represent them

symbolically using predicates on the alphabet

  • We will use inequalities (intervals) for numbers or Boolean

functions for Boolean vectors

VERIMAG O Maler - IE Mens 10 / 22

slide-28
SLIDE 28

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Symbolic Automata

q0 q1 q2 q3 q4 [0, 10) [30, 100) [10, 30)

a11 a12 a13

[0, 50) [50, 100) [0, 20) [50, 70) [0, 50), [70, 100) [20, 100) Σ

a01 a02 a22 a32 a31, a33 a21 a41

[ [a01] ] = [0, 50) Σ = [0, 100) ⊆ R [ [a] ] = {a ∈ Σ | ψ(a) = a}

w = 20 · 40 · 60 +

A = (Σ, Σ, ψ, Q, δ, q0, F)

  • Q finite set of states,
  • q0 initial state,
  • F accepting states,
  • Σ large concrete alphabet,
  • δ ⊆ Q × 2Σ × Q
  • Σ finite alphabet (symbols)
  • ψq : Σ → Σq, q ∈ Q

A is complete and deterministic

if ∀q ∈ Q {[ [a] ] | a ∈ Σq} forms a partition of Σ

VERIMAG O Maler - IE Mens 11 / 22

slide-29
SLIDE 29

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Learning using Evidences and Representatives

u

Σ

[ [a] ]

ˆ µ(a) ˆ µ(b)

a1 a2 a3 ak ... [ [b] ]

|

p

? ? ? ?

evidences µ(u · a) = {ˆ µ(u) · ai | ai ∈ [ [a] ]} representatives ˆ µ(u · a) = ˆ µ(u) · ˆ µ(a)

Let Σ be a subset of R

  • To characterize continuations of u, ask queries

about u · a for a finite sample of Σ (evidence)

  • Evidence can be a fixed set, random, or a result
  • f binary search
  • Form evidence compatible partitions
  • All evidences within a partition block

behave the same

  • Estimate boundaries using split, binary

search,. . .

  • Associate a symbol to each partition block
  • Choose one evidence as the representative for

each new symbol

VERIMAG O Maler - IE Mens 12 / 22

slide-30
SLIDE 30

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Symbolic Observation Table - T = (Σ, Σ, S, R, ψ, E, f, µ, ˆ

µ) E ε a S ε − + a1 + − a2 − − R a1a3 + − a1a4 − + a1a5 − − a2a6 + −

  • Prefixes are symbolic words
  • Symbols represent sets of letters (“fat” edges)
  • Suffixes are concrete words (distinguish states)
  • Fill in the table according to the representatives

a1 a2 a3 a4 a5 a6

ε a1 a2 a1a3 a1a4 a1a5 a2a6 VERIMAG O Maler - IE Mens 13 / 22

slide-31
SLIDE 31

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Symbolic Observation Table - T = (Σ, Σ, S, R, ψ, E, f, µ, ˆ

µ) E ε a S ε − + a1 + − a2 − − R a1a3 + − a1a4 − + a1a5 − − a2a6 + −

  • ψ = {ψs}s∈S, ψs : Σ → Σs

semantics

  • [

[a] ] = {a ∈ Σ | ψ(a) = a}

  • µ : Σ → 2Σ

evidences

  • µ(ε) = {ε}, µ(s · a) = ˆ

µ(s) · µ(a)

  • ˆ

µ : Σ → Σ representative

  • ˆ

µ(ε) = ε, ˆ µ(s · a) = ˆ µ(s) · ˆ µ(a)

  • f : (S ∪ R) · E → {−, +} classif. function
  • f(s · e) = f(ˆ

µ(s) · e), f s(e) = f(s · e)

VERIMAG O Maler - IE Mens 13 / 22

slide-32
SLIDE 32

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Counter-example Treatment (Symbolic Breakpoint)

Proposition

If w is a counter-example to AT then there exists an i-factorization of w such that either f(ˆ µ(si−1 · ai) · vi) = f(ˆ µ(si) · vi) (1)

  • r

f(ˆ µ(si−1) · ai · vi) = f(ˆ µ(si−1) · ˆ µ(ai) · vi) (2)

  • If (1), then vi is a new distinguishing word

vertical expansion

  • Table not closed → new state
  • If (2), then ai is a new evidence for ai.

horizontal expansion

  • Evidence incompatibility → new transition / refinement

VERIMAG O Maler - IE Mens 14 / 22

slide-33
SLIDE 33

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Counter-example Treatment (Symbolic Breakpoint)

Let w = a1 · · · · ai · · · a|w| = ui · ai · vi ) be a counter-example. f(ˆ µ(si−1 · ai) · vi) = f(ˆ µ(si) · vi)

si = δ(ε, ui · ai)

ε

s s′

ˆ µ(ai) ˆ µ(ui)

vi vi =

VERIMAG O Maler - IE Mens 15 / 22

slide-34
SLIDE 34

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Counter-example Treatment (Symbolic Breakpoint)

Let w = a1 · · · · ai · · · a|w| = ui · ai · vi ) be a counter-example. f(ˆ µ(si−1 · ai) · vi) = f(ˆ µ(si) · vi)

si = δ(ε, ui · ai)

ε

s s′

ˆ µ(ai) ˆ µ(ui)

vi vi =

ε

s s′

new

ˆ µ(ai) ˆ µ(ui)

vi vi = = s · ai is a new state

VERIMAG O Maler - IE Mens 15 / 22

slide-35
SLIDE 35

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Counter-example Treatment (Symbolic Breakpoint)

Let w = a1 · · · · ai · · · a|w| = ui · ai · vi ) be a counter-example. f(ˆ µ(si−1 · ai) · vi) = f(ˆ µ(si) · vi) f(ˆ µ(si−1) · ai · vi) = f(ˆ µ(si−1) · ˆ µ(ai) · vi)

si = δ(ε, ui · ai)

ε

s s′

ˆ µ(ai) ˆ µ(ui)

vi vi =

ε

s s′

new

ˆ µ(ai) ˆ µ(ui)

vi vi = = s · ai is a new state

ε

s

ˆ µ(ui) ˆ µ(ai) ai

vi vi = =

VERIMAG O Maler - IE Mens 15 / 22

slide-36
SLIDE 36

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Counter-example Treatment (Symbolic Breakpoint)

Let w = a1 · · · · ai · · · a|w| = ui · ai · vi ) be a counter-example. f(ˆ µ(si−1 · ai) · vi) = f(ˆ µ(si) · vi) f(ˆ µ(si−1) · ai · vi) = f(ˆ µ(si−1) · ˆ µ(ai) · vi)

si = δ(ε, ui · ai)

ε

s s′

ˆ µ(ai) ˆ µ(ui)

vi vi =

ε

s s′

new

ˆ µ(ai) ˆ µ(ui)

vi vi = = s · ai is a new state

ε

s

ˆ µ(ui) ˆ µ(ai) ai

vi vi = =

ε

s

ˆ µ(ui) ˆ µ(ai) ai

vi vi = refine [ [ai] ]

VERIMAG O Maler - IE Mens 15 / 22

slide-37
SLIDE 37

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Learning with a Teacher

Σ

. . . ai ai+1 . . .

|

p

error in the partitions

  • Equivalence is checked by an oracle (teacher) returning a

minimal counter-examples (in length and lexicographically)

  • Choose as evidence the min element of the interval (Σ has min)
  • The counter-example indicates the minimal element of a new

transition (in horizontal expansion)

  • The partition bounds are exact and no error is introduced

[ [a] ]

ˆ µ(a) ˆ µ(b)

Σ

[ [b] ]

|

p VERIMAG O Maler - IE Mens 16 / 22

slide-38
SLIDE 38

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε

semantics hypothesis automaton

VERIMAG O Maler - IE Mens 17 / 22

slide-39
SLIDE 39

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

semantics

ε

hypothesis automaton

VERIMAG O Maler - IE Mens 17 / 22

slide-40
SLIDE 40

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0

semantics

ε [ [a0] ] = [1, 100)

hypothesis automaton

VERIMAG O Maler - IE Mens 17 / 22

slide-41
SLIDE 41

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

semantics

ε [ [a0] ] = [1, 100)

hypothesis automaton

VERIMAG O Maler - IE Mens 17 / 22

slide-42
SLIDE 42

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

semantics

ε [ [a0] ] = [1, 100)

hypothesis automaton

VERIMAG O Maler - IE Mens 17 / 22

slide-43
SLIDE 43

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

semantics

ε [ [a0] ] = [1, 100) a0

hypothesis automaton

ε a0

Σ VERIMAG O Maler - IE Mens 17 / 22

slide-44
SLIDE 44

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

1

a0

1

a1

semantics

ε [ [a0] ] = [1, 100) a0 [ [a1] ] = [1, 100)

hypothesis automaton

ε a0

Σ VERIMAG O Maler - IE Mens 17 / 22

slide-45
SLIDE 45

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

1

a0

1

a1 −

semantics

ε [ [a0] ] = [1, 100) a0 [ [a1] ] = [1, 100)

hypothesis automaton

ε a0

Σ Σ

Ask Equivalence Query:

VERIMAG O Maler - IE Mens 17 / 22

slide-46
SLIDE 46

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

1

a0

1

a1 −

semantics

ε [ [a0] ] = [1, 100) a0 [ [a1] ] = [1, 100)

hypothesis automaton

ε a0

Σ Σ

Ask Equivalence Query:

counterexample − 24 24 ∈ [ [a0] ] but 1 ∼ 24 − → refine a0

VERIMAG O Maler - IE Mens 17 / 22

slide-47
SLIDE 47

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

1

a0

1

a1 −

24

a2

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 100)

hypothesis automaton

ε a0

[1, 24) Σ

Ask Equivalence Query:

counterexample − 24 24 ∈ [ [a0] ] but 1 ∼ 24 − → refine a0

VERIMAG O Maler - IE Mens 17 / 22

slide-48
SLIDE 48

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

1

a0

1

a1 −

24

a2 −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 100)

hypothesis automaton

ε a0

[1, 24) Σ

Ask Equivalence Query:

counterexample − 24 24 ∈ [ [a0] ] but 1 ∼ 24 − → refine a0

VERIMAG O Maler - IE Mens 17 / 22

slide-49
SLIDE 49

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

1

a0

1

a1 −

24

a2 −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 100)

hypothesis automaton

ε a0

[24, 100) [1, 24) Σ

Ask Equivalence Query:

VERIMAG O Maler - IE Mens 17 / 22

slide-50
SLIDE 50

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

1

a0

1

a1 −

24

a2 −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 100)

hypothesis automaton

ε a0

[24, 100) [1, 24) Σ

Ask Equivalence Query:

counterexample + 1 · 66 66 ∈ [ [a1] ] but 1 ∼ 66 − → refine a1

VERIMAG O Maler - IE Mens 17 / 22

slide-51
SLIDE 51

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

1

a0

1

a1 −

24

a2 −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 100)

hypothesis automaton

ε a0

[24, 100) [1, 24) Σ

Ask Equivalence Query:

counterexample + 1 · 66 66 ∈ [ [a1] ] but 1 ∼ 66 − → refine a1

VERIMAG O Maler - IE Mens 17 / 22

slide-52
SLIDE 52

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

1

a0

1

a1 −

24

a2 −

1

a0

66

a3

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100)

hypothesis automaton

ε a0

[24, 100) [1, 24) [1, 66)

Ask Equivalence Query:

counterexample + 1 · 66 66 ∈ [ [a1] ] but 1 ∼ 66 − → refine a1

VERIMAG O Maler - IE Mens 17 / 22

slide-53
SLIDE 53

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

1

a0

1

a1 −

24

a2 −

1

a0

66

a3 +

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100)

hypothesis automaton

ε a0

[24, 100) [1, 24) [1, 66)

Ask Equivalence Query:

counterexample + 1 · 66 66 ∈ [ [a1] ] but 1 ∼ 66 − → refine a1

VERIMAG O Maler - IE Mens 17 / 22

slide-54
SLIDE 54

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

1

a0

1

a1 −

24

a2 −

1

a0

66

a3 +

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100)

hypothesis automaton

ε a0

[24, 100) [1, 24) [66, 100) [1, 66)

Ask Equivalence Query:

VERIMAG O Maler - IE Mens 17 / 22

slide-55
SLIDE 55

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε ε −

1

a0 +

1

a0

1

a1 −

24

a2 −

1

a0

66

a3 +

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100)

hypothesis automaton

ε a0

[24, 100) [1, 24) [66, 100) [1, 66)

Ask Equivalence Query:

counterexample

− 24 · 1

1 ∼ 24 · 1 − → add distinguishing suffix 1

VERIMAG O Maler - IE Mens 17 / 22

slide-56
SLIDE 56

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε −

1

a0 +

1

a0

1

a1 −

24

a2 −

1

a0

66

a3 +

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100)

hypothesis automaton

ε a0

[24, 100) [1, 24) [66, 100) [1, 66)

Ask Equivalence Query:

counterexample

− 24 · 1

1 ∼ 24 · 1 − → add distinguishing suffix 1

VERIMAG O Maler - IE Mens 17 / 22

slide-57
SLIDE 57

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

1

a0

1

a1 − +

24

a2 − −

1

a0

66

a3 + −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100)

hypothesis automaton

ε a0

[24, 100) [1, 24) [66, 100) [1, 66)

Ask Equivalence Query:

counterexample

− 24 · 1

1 ∼ 24 · 1 − → add distinguishing suffix 1

VERIMAG O Maler - IE Mens 17 / 22

slide-58
SLIDE 58

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2

hypothesis automaton

ε a0 a2

[1, 24) [66, 100) [1, 66) [24, 100)

Ask Equivalence Query:

counterexample

− 24 · 1

1 ∼ 24 · 1 − → add distinguishing suffix 1

VERIMAG O Maler - IE Mens 17 / 22

slide-59
SLIDE 59

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 100)

hypothesis automaton

ε a0 a2

[1, 24) [66, 100) [1, 66) [24, 100)

Ask Equivalence Query:

counterexample

− 24 · 1

1 ∼ 24 · 1 − → add distinguishing suffix 1

VERIMAG O Maler - IE Mens 17 / 22

slide-60
SLIDE 60

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4 − −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 100)

hypothesis automaton

ε a0 a2

[1, 24) [66, 100) [1, 66) [24, 100)

Ask Equivalence Query:

counterexample

− 24 · 1

1 ∼ 24 · 1 − → add distinguishing suffix 1

VERIMAG O Maler - IE Mens 17 / 22

slide-61
SLIDE 61

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4 − −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 100)

hypothesis automaton

ε a0 a2

[1, 24) [66, 100) [1, 66) [24, 100)

Ask Equivalence Query:

counterexample

− 24 · 1

1 ∼ 24 · 1 − → add distinguishing suffix 1

VERIMAG O Maler - IE Mens 17 / 22

slide-62
SLIDE 62

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4 − −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 100)

hypothesis automaton

ε a0 a2

[1, 24) Σ [66, 100) [1, 66) [24, 100)

Ask Equivalence Query:

VERIMAG O Maler - IE Mens 17 / 22

slide-63
SLIDE 63

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4 − −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 100)

hypothesis automaton

ε a0 a2

[1, 24) Σ [66, 100) [1, 66) [24, 100)

Ask Equivalence Query:

counterexample

+ 24 · 51

VERIMAG O Maler - IE Mens 17 / 22

slide-64
SLIDE 64

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4 − −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 100)

hypothesis automaton

ε a0 a2

[1, 24) Σ [66, 100) [1, 66) [24, 100)

Ask Equivalence Query:

counterexample

+ 24 · 51

51 ∈ [ [a4] ] but 1 ∼ 51 − → refine a4

VERIMAG O Maler - IE Mens 17 / 22

slide-65
SLIDE 65

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4 − −

24

a2

51

a5

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 51) [ [a5] ] = [51, 100)

hypothesis automaton

ε a0 a2

[1, 24) [1, 51) [66, 100) [1, 66) [24, 100)

Ask Equivalence Query:

counterexample

+ 24 · 51

51 ∈ [ [a4] ] but 1 ∼ 51 − → refine a4

VERIMAG O Maler - IE Mens 17 / 22

slide-66
SLIDE 66

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4 − −

24

a2

51

a5 + −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 51) [ [a5] ] = [51, 100)

hypothesis automaton

ε a0 a2

[1, 24) [1, 51) [66, 100) [1, 66) [24, 100)

Ask Equivalence Query:

counterexample

+ 24 · 51

51 ∈ [ [a4] ] but 1 ∼ 51 − → refine a4

VERIMAG O Maler - IE Mens 17 / 22

slide-67
SLIDE 67

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4 − −

24

a2

51

a5 + −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 51) [ [a5] ] = [51, 100)

hypothesis automaton

ε a0 a2

[1, 24) [1, 51) [51, 100) [66, 100) [1, 66) [24, 100)

Ask Equivalence Query:

counterexample

+ 24 · 51

51 ∈ [ [a4] ] but 1 ∼ 51 − → refine a4

VERIMAG O Maler - IE Mens 17 / 22

slide-68
SLIDE 68

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4 − −

24

a2

51

a5 + −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 51) [ [a5] ] = [51, 100)

hypothesis automaton

ε a0 a2

[1, 24) [1, 51) [51, 100) [66, 100) [1, 66) [24, 100)

Ask Equivalence Query:

counterexample

+ 24 · 51

51 ∈ [ [a4] ] but 1 ∼ 51 − → refine a4

VERIMAG O Maler - IE Mens 17 / 22

slide-69
SLIDE 69

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4 − −

24

a2

51

a5 + −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 51) [ [a5] ] = [51, 100)

hypothesis automaton

ε a0 a2

[1, 24) [1, 51) [51, 100) [66, 100) [1, 66) [24, 100)

Ask Equivalence Query: True

VERIMAG O Maler - IE Mens 17 / 22

slide-70
SLIDE 70

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4 − −

24

a2

51

a5 + −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 51) [ [a5] ] = [51, 100)

hypothesis automaton

ε a0 a2

[1, 24) [1, 51) [51, 100) [66, 100) [1, 66) [24, 100)

Ask Equivalence Query: True M = {ε, 1, 24, 1 1, 1 66, 24 1, 24 51, 1 1 1, 1 66 1, 24 1 1, 24 51 1} |M| = 11, |MQ| = 7, |EQ| = 5, |S| = 3, |R| = 4

VERIMAG O Maler - IE Mens 17 / 22

slide-71
SLIDE 71

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example with Teacher (Σ = [1, 100))

Teacher returns minimal counterexamples

  • bservation table

ε 1 ε − +

1

a0 + −

24

a2 − −

1

a0

1

a1 − +

1

a0

66

a3 + −

24

a2

1

a4 − −

24

a2

51

a5 + −

semantics

ε [ [a0] ] = [1, 24) [ [a2] ] = [24, 100) a0 [ [a1] ] = [1, 66) [ [a3] ] = [66, 100) a2 [ [a4] ] = [1, 51) [ [a5] ] = [51, 100)

hypothesis automaton

ε a0 a2

[1, 24) [1, 51) [51, 100) [66, 100) [1, 66) [24, 100)

Ask Equivalence Query: True M = {ε, 1, 24, 1 1, 1 66, 24 1, 24 51, 1 1 1, 1 66 1, 24 1 1, 24 51 1}

L∗ over (Σ ∩ N) → |M| = 790, |MQ| = 789, |EQ| = 2, |S| = 4, |R| = 396

|M| = 11, |MQ| = 7, |EQ| = 5, |S| = 3, |R| = 4

VERIMAG O Maler - IE Mens 17 / 22

slide-72
SLIDE 72

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Learning without a Teacher

Σ

. . . ai ai+1 . . .

|

p

error in the partitions

  • Equivalence is checked by testing random words

selected using a probability distribution D

  • Counter-examples are not minimal

we may have errors in the boundaries

  • Counter-examples may be missed

terminate algorithm and return hypothesis after r(ε, δ, i) random words have been tested, none of which is a counter-example

  • The final hypothesis A is a good approximation of the target

language L with high probability P(d(L, LA) < ε) ≥ 1 − δ

(PAC learnability)

VERIMAG O Maler - IE Mens 18 / 22

slide-73
SLIDE 73

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

semantics hypothesis automaton

VERIMAG O Maler - IE Mens 19 / 22

slide-74
SLIDE 74

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε ε

semantics hypothesis automaton

VERIMAG O Maler - IE Mens 19 / 22

slide-75
SLIDE 75

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε ε −

semantics

ε

hypothesis automaton

VERIMAG O Maler - IE Mens 19 / 22

slide-76
SLIDE 76

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε ε −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

hypothesis automaton

VERIMAG O Maler - IE Mens 19 / 22

slide-77
SLIDE 77

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε ε −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27

hypothesis automaton

VERIMAG O Maler - IE Mens 19 / 22

slide-78
SLIDE 78

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε ε −

13

a1 +

68

a2 −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27

hypothesis automaton

VERIMAG O Maler - IE Mens 19 / 22

slide-79
SLIDE 79

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε ε −

13

a1 +

68

a2 −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27

hypothesis automaton

VERIMAG O Maler - IE Mens 19 / 22

slide-80
SLIDE 80

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε ε −

13

a1 +

68

a2 −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1

hypothesis automaton

VERIMAG O Maler - IE Mens 19 / 22

slide-81
SLIDE 81

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε ε −

13

a1 +

68

a2 −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2

hypothesis automaton

VERIMAG O Maler - IE Mens 19 / 22

slide-82
SLIDE 82

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε ε −

13

a1 +

68

a2 −

13

a1

18

a3 −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2

hypothesis automaton

VERIMAG O Maler - IE Mens 19 / 22

slide-83
SLIDE 83

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε ε −

13

a1 +

68

a2 −

13

a1

18

a3 −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2

hypothesis automaton

ε a1

x ≥ 27 x < 27 Σ VERIMAG O Maler - IE Mens 19 / 22

slide-84
SLIDE 84

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε ε −

13

a1 +

68

a2 −

13

a1

18

a3 −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2

hypothesis automaton

ε a1

x ≥ 27 x < 27 Σ

Ask Equivalence Query:

counterexample − 12 · 73 · 11 add distinguishing string 11

− → new state

VERIMAG O Maler - IE Mens 19 / 22

slide-85
SLIDE 85

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε −

13

a1 +

68

a2 −

13

a1

18

a3 −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2

hypothesis automaton

ε a1

x ≥ 27 x < 27 Σ

Ask Equivalence Query:

counterexample − 12 · 73 · 11 add distinguishing string 11

− → new state

VERIMAG O Maler - IE Mens 19 / 22

slide-86
SLIDE 86

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2

hypothesis automaton

ε a1

x ≥ 27 x < 27 Σ

Ask Equivalence Query:

counterexample − 12 · 73 · 11 add distinguishing string 11

− → new state

VERIMAG O Maler - IE Mens 19 / 22

slide-87
SLIDE 87

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 a2

hypothesis automaton

ε a1

x ≥ 27 x < 27 Σ

Ask Equivalence Query:

counterexample − 12 · 73 · 11 add distinguishing string 11

− → new state

VERIMAG O Maler - IE Mens 19 / 22

slide-88
SLIDE 88

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

hypothesis automaton

ε a1

x ≥ 27 x < 27 Σ VERIMAG O Maler - IE Mens 19 / 22

slide-89
SLIDE 89

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

|

45

hypothesis automaton

ε a1

x ≥ 27 x < 27 Σ VERIMAG O Maler - IE Mens 19 / 22

slide-90
SLIDE 90

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

68

a2

17

a4

68

a2

72

a5

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

|

45

hypothesis automaton

ε a1

x ≥ 27 x < 27 Σ VERIMAG O Maler - IE Mens 19 / 22

slide-91
SLIDE 91

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

|

45

hypothesis automaton

ε a1

x ≥ 27 x < 27 Σ VERIMAG O Maler - IE Mens 19 / 22

slide-92
SLIDE 92

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

|

45

hypothesis automaton

ε a1

x ≥ 27 x < 27 Σ VERIMAG O Maler - IE Mens 19 / 22

slide-93
SLIDE 93

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

|

45

hypothesis automaton

ε a1 a2

x < 27 x < 45 x ≥ 45 Σ x ≥ 27 VERIMAG O Maler - IE Mens 19 / 22

slide-94
SLIDE 94

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

|

45

hypothesis automaton

ε a1 a2

x < 27 x < 45 x ≥ 45 Σ x ≥ 27

Ask Equivalence Query:

counterexample − 12 · 73 · 11 add 73 as evidence of a1

− → new transition

VERIMAG O Maler - IE Mens 19 / 22

slide-95
SLIDE 95

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 ˆ µ(a6) 73 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

|

45

hypothesis automaton

ε a1 a2

x < 27 x < 45 x ≥ 45 Σ x ≥ 27

Ask Equivalence Query:

counterexample − 12 · 73 · 11 add 73 as evidence of a1

− → new transition

VERIMAG O Maler - IE Mens 19 / 22

slide-96
SLIDE 96

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 ˆ µ(a6) 73

|

63 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

|

45

hypothesis automaton

ε a1 a2

x < 27 x < 45 x ≥ 45 Σ x ≥ 27

Ask Equivalence Query:

counterexample − 12 · 73 · 11 add 73 as evidence of a1

− → new transition

VERIMAG O Maler - IE Mens 19 / 22

slide-97
SLIDE 97

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

13

a1

73

a6 + −

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 ˆ µ(a6) 73

|

63 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

|

45

hypothesis automaton

ε a1 a2

x < 27 x < 45 x ≥ 45 Σ x ≥ 27

Ask Equivalence Query:

counterexample − 12 · 73 · 11 add 73 as evidence of a1

− → new transition

VERIMAG O Maler - IE Mens 19 / 22

slide-98
SLIDE 98

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

13

a1

73

a6 + −

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 ˆ µ(a6) 73

|

63 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

|

45

hypothesis automaton

ε a1 a2

x < 27 x < 45 x ≥ 45 x ≥ 63 x < 63 x ≥ 27 VERIMAG O Maler - IE Mens 19 / 22

slide-99
SLIDE 99

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

13

a1

73

a6 + −

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 ˆ µ(a6) 73

|

63 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

|

45

hypothesis automaton

ε a1 a2

x < 27 x < 45 x ≥ 45 x ≥ 63 x < 63 x ≥ 27

Ask Equivalence Query:

counterexample

− 52 · 47

add 47 as evidence of a2

− → refine existing transition

VERIMAG O Maler - IE Mens 19 / 22

slide-100
SLIDE 100

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

13

a1

73

a6 + −

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 ˆ µ(a6) 73

|

63 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27

|

45 47

hypothesis automaton

ε a1 a2

x < 27 x < 45 x ≥ 45 x ≥ 63 x < 63 x ≥ 27

Ask Equivalence Query:

counterexample

− 52 · 47

add 47 as evidence of a2

− → refine existing transition

VERIMAG O Maler - IE Mens 19 / 22

slide-101
SLIDE 101

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

13

a1

73

a6 + −

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 ˆ µ(a6) 73

|

63 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27 47 | 55

hypothesis automaton

ε a1 a2

x < 27 x < 45 x ≥ 45 x ≥ 63 x < 63 x ≥ 27

Ask Equivalence Query:

counterexample

− 52 · 47

add 47 as evidence of a2

− → refine existing transition

VERIMAG O Maler - IE Mens 19 / 22

slide-102
SLIDE 102

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

13

a1

73

a6 + −

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 ˆ µ(a6) 73

|

63 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27 47 | 55

hypothesis automaton

ε a1 a2

x < 27 x < 55 x ≥ 55 x ≥ 63 x < 63 x ≥ 27 VERIMAG O Maler - IE Mens 19 / 22

slide-103
SLIDE 103

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Example without Teacher (Σ = [1, 100))

Counterexamples are not minimal

  • bservation table

ε 11 ε − +

13

a1 + −

68

a2 − −

13

a1

18

a3 − +

13

a1

73

a6 + −

68

a2

17

a4 − −

68

a2

72

a5 + −

semantics

ε ˆ µ(a1) ˆ µ(a2) 13 42 68 78 92

|

27 a1 ˆ µ(a3) 18 26 46 54 2 ˆ µ(a6) 73

|

63 a2 ˆ µ(a4) ˆ µ(a5) 64 72 94 17 27 47 | 55

hypothesis automaton

ε a1 a2

x < 27 x < 55 x ≥ 55 x ≥ 63 x < 63 x ≥ 27

Ask Equivalence Query:

. . .

VERIMAG O Maler - IE Mens 19 / 22

slide-104
SLIDE 104

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Computing the Error

2 4 6 8 10 12 14 16 hypothesis 0.0 0.1 0.2 0.3 0.4 0.5 0.6 error

Run 1 Run 2 Run 3 Run 4

  • The error of the hypotheses along

several runs of the algorithm

The error is measured as volumes of the symmetric difference L between the conjectured and the target language error = D(L) = limk→inf Dk(L), where Dk(L) is the k-volume of L, i.e., Dk(L) = V(Lk)/V(Σk)

VERIMAG O Maler - IE Mens 20 / 22

slide-105
SLIDE 105

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Current Status

  • We implemented the algorithm for the case Σ ⊂ R and Σ ⊂ N

with and without a teacher

  • Experimental results on password rules over ASCII charcters
  • We developed a similar algorithm for Σ = Bn for large n
  • We use bounded complexity alphabet partitions in the style of

k-DNF or decision lists

  • First results are encouraging, will be used to extend to Σ ⊂ Rn

VERIMAG O Maler - IE Mens 21 / 22

slide-106
SLIDE 106

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Discussion

  • Symbolic automata: concrete values are used to select a

transition but their exact value is not remembered by the automaton

  • Is there a niche for it in the time series analysis science?
  • Combination of temporal (automaton) and static concept learning
  • Can be an alternative to (deep) recurrent neural networks
  • Should relax full compatibility with the sample (noise) and be

ready to drop negative examples

VERIMAG O Maler - IE Mens 22 / 22

slide-107
SLIDE 107

Learning Languages The L* Algorithm Learning over Large Alphabets Learning with/without a Teacher Conclusions

Discussion

  • Symbolic automata: concrete values are used to select a

transition but their exact value is not remembered by the automaton

  • Is there a niche for it in the time series analysis science?
  • Combination of temporal (automaton) and static concept learning
  • Can be an alternative to (deep) recurrent neural networks
  • Should relax full compatibility with the sample (noise) and be

ready to drop negative examples Thank you

VERIMAG O Maler - IE Mens 22 / 22