Learning Regular Languages over Large Alphabets

Irini-Eleftheria Mens
VERIMAG, University of Grenoble-Alpes
10 October 2017

Jury Members
Oded Maler, Thesis advisor
Laurent Fribourg, Examiner
Dana Angluin, Reviewer
Eric Gaussier, Examiner
Peter Habermehl, Reviewer
Frits Vaandrager, Examiner

Introduction | Preliminaries | Large Alphabets | Learning Symbolic Automata | Counter-examples | Booleans | Experimental Results | Conclusion

Introduction
Model, Black Box Learning, Language Identification, System Identification, Inductive Inference

A Short Prehistory and History of Automaton Learning

1956  Edward F. Moore. Gedanken-experiments on sequential machines. Defines the problem as black-box model inference.
1967  E. Mark Gold. Language identification in the limit.
1972  E. Mark Gold. System identification via state characterization. Learning finite automata is possible in finite time. First use of the basic idea that underlies table-based methods.
1978  E. Mark Gold. Complexity of automaton identification from given data. Finding the minimal automaton compatible with a given sample is NP-hard.
1987  Dana Angluin. Learning regular sets from queries and counter-examples. The L* active learning algorithm with membership and equivalence queries. Polynomial in the automaton size.
1993  Ronald L. Rivest and Robert E. Schapire. Inference of finite automata using homing sequences. An improved version of the L* algorithm that uses the breakpoint method to treat counter-examples.

Machine Learning
Model: f : X → Y
Sample: a small sample M = {(x, y) : x ∈ X, y ∈ Y} with f(x) = y for all (x, y) ∈ M
Learn: predict or identify f(x) for all x ∈ X

Learning Regular Languages over large or infinite alphabets
Model: f is a language L ⊆ Σ*
• Σ an alphabet
• X = Σ*, the set of words
• Y = {+, −}
Learn: the model is a symbolic automaton

Types of Learning

Off-line vs Online
• Off-line: the sample M is known before the learning procedure starts.
• Online: the sample M is updated during learning.

Passive vs Active
• Passive: the sample M is given.
• Active: the sample M is chosen by the learning algorithm.

Learning using Queries
The learning algorithm can access queries, e.g., membership queries, equivalence queries, etc.
• MQ(w), for a word w ∈ Σ*: is w ∈ L? Answer: Yes / No.
• EQ(H), for a hypothesis H: is L(H) ≡ L? Answer: True / Counter-example (cex).
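The two query types can be sketched as a small teacher object. This is only an illustration under stated assumptions: the target language is a hypothetical three-state DFA over {a, b}, the names mq and eq are made up for this sketch, and the equivalence query is approximated by testing all words up to a bounded length, which a real teacher would not be limited to.

```python
from itertools import product

# Hypothetical target language over {a, b}, given as a small DFA.
# State 0 is initial; state 2 is the only accepting state.
DELTA = {(0, "a"): 1, (0, "b"): 0,
         (1, "a"): 0, (1, "b"): 2,
         (2, "a"): 2, (2, "b"): 1}
ACCEPTING = {2}


def mq(word):
    """Membership query: is `word` in L?"""
    state = 0
    for letter in word:
        state = DELTA[(state, letter)]
    return state in ACCEPTING


def eq(hypothesis, max_len=6, alphabet="ab"):
    """Approximate equivalence query (assumption: bounded exhaustive test).

    `hypothesis` is any function word -> bool. Returns (True, None) if no
    disagreement is found up to max_len, otherwise (False, cex) where cex
    is a shortest word on which hypothesis and target disagree.
    """
    for n in range(max_len + 1):
        for letters in product(alphabet, repeat=n):
            word = "".join(letters)
            if hypothesis(word) != mq(word):
                return False, word
    return True, None


# A hypothesis that rejects everything is refuted by the shortest word in L.
print(eq(lambda w: False))
# The target itself passes the bounded test.
print(eq(mq))
```

A bounded test can miss long counter-examples, so a True answer here is weaker than the True of an exact equivalence oracle.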

Outline

Preliminaries
• Regular Languages and Automata
• The L* Algorithmic Scheme
Large Alphabets
• Motivation
• Symbolic Representation of Transitions - Symbolic Automata
Learning Symbolic Automata
• Why can L* not be applied?
• Our Solution
• The Algorithm
Equivalence Queries and Counter-Examples
Adaptation to the Boolean Alphabet
Experimental Results
Conclusion

Regular Languages and Automata

Σ = {a, b}
L ⊆ Σ* is a language
• Σ is an alphabet
• w = a1 ··· an is a word
• Σ* is the set of all words

Membership table (rows: prefixes, columns: suffixes; + means that prefix·suffix ∈ L)

        ε    a    b    aa   ab   ba   bb   aaa  ...
ε       −    −    −    −    +    −    −    −    ...
a       −    −    +    −    −    +    −    −    ...
b       −    −    −    −    +    −    −    −    ...
aa      −    −    −    −    +    −    −    −    ...
ab      +    +    −    +    −    −    +    +    ...
ba      −    −    +    −    −    +    −    −    ...
bb      −    −    −    −    +    −    −    −    ...
...
aba     +    +    −    +    −    −    +    +    ...
abb     −    −    +    −    −    +    −    −    ...
...

Equivalence relation
u ∼L v iff for all w ∈ Σ*: u·w ∈ L ⇔ v·w ∈ L

Nerode's Theorem
L is a regular language iff ∼L has finitely many equivalence classes.

In the example: ε ∼ b ∼ aa, a ∼ ba ∼ abb, ab ∼ aba
Q = Σ*/∼ (the states in the minimal representation of L)
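The grouping of prefixes into equivalence classes can be reproduced by comparing table rows. The sketch below assumes a hypothetical three-state DFA for L, chosen so that it agrees with the +/− entries shown for this example; the function names are illustrative. Note that equal rows over a finite set of suffixes only suggest equivalence in general; here the chosen suffixes happen to separate all classes.

```python
# Hypothetical DFA for the example language over {a, b} (an assumption:
# it was chosen to agree with the membership table of the example).
DELTA = {(0, "a"): 1, (0, "b"): 0,
         (1, "a"): 0, (1, "b"): 2,
         (2, "a"): 2, (2, "b"): 1}
ACCEPTING = {2}


def in_language(word):
    """Run the DFA on `word` and report acceptance."""
    state = 0
    for letter in word:
        state = DELTA[(state, letter)]
    return state in ACCEPTING


def row(prefix, suffixes):
    """The table row of `prefix`: membership of prefix·suffix for each suffix."""
    return tuple(in_language(prefix + s) for s in suffixes)


prefixes = ["", "a", "b", "aa", "ab", "ba", "bb", "aba", "abb"]
suffixes = ["", "a", "b", "aa", "ab", "ba", "bb", "aaa"]

# Group prefixes whose rows coincide on the chosen suffixes.
classes = {}
for p in prefixes:
    classes.setdefault(row(p, suffixes), []).append(p or "ε")

for members in classes.values():
    print(" ∼ ".join(members))
```

Three groups come out, one per state of the minimal automaton, which is the content of Nerode's theorem for this example.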

Regular Languages and Automata

A sufficient sample that characterizes the language

              E:  ε    a    b
S    ε            −    −    −
     a            −    −    +
     ab           +    +    −
R    b            −    −    −
     aa           −    −    −
     aba          +    +    −
     abb          −    −    +

• S: prefixes (states)
• R: boundary (R = S·Σ \ S)
• E: suffixes (distinguishing strings)
• f : (S ∪ R) × E → {+, −}: classification function
• f_s : E → {+, −}: residual functions

The minimal automaton for L is A_L = (Σ, Q, q0, δ, F) with
- Q = S
- q0 = [ε]
- δ([u], a) = [u·a]
- F = {[u] : (u·ε) ∈ L}
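The construction of A_L from the table can be sketched directly. The table entries below are copied from the sample (S, R, E, f); the helper names build_automaton and accepts are illustrative only, and the sketch assumes the table is closed and that rows of S are pairwise distinct, as they are here.

```python
# Observation table from the sample: each row lists f(u, e) for e in E.
E = ["", "a", "b"]
S = ["", "a", "ab"]
R = ["b", "aa", "aba", "abb"]
f = {
    "":    "---",
    "a":   "--+",
    "ab":  "++-",
    "b":   "---",
    "aa":  "---",
    "aba": "++-",
    "abb": "--+",
}

# [u] is represented by the prefix in S that has the same row as u.
state_of_row = {f[s]: s for s in S}


def build_automaton(alphabet="ab"):
    """Return (q0, delta, F) following Q = S, delta([u], a) = [u·a]."""
    delta = {(s, a): state_of_row[f[s + a]] for s in S for a in alphabet}
    initial = ""  # q0 = [ε]
    final = {s for s in S if f[s][E.index("")] == "+"}  # u·ε ∈ L
    return initial, delta, final


def accepts(word, automaton):
    """Run the constructed automaton on `word`."""
    state, delta, final = automaton
    for letter in word:
        state = delta[(state, letter)]
    return state in final


automaton = build_automaton()
print(accepts("ab", automaton), accepts("abb", automaton))
```

Every word u·a with u ∈ S is either in S or in R, so its row is always available in f; this is why the boundary R is part of the sample.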

The L* Algorithmic Scheme*

Active learning using queries, between a Learner and a Teacher who knows L ⊆ Σ*:
1. Initialize the table.
2. Fill in the table using membership queries MQ(w): is w ∈ L? The teacher answers + / −.
3. Make a hypothesis H from the table.
4. Ask an equivalence query EQ(H): is L(H) = L?
   • Counter-example (cex): treat the cex and go back to step 2.
   • True: return H.

* D. Angluin. Learning regular sets from queries and counter-examples, 1987.
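The loop of the scheme can be sketched in a few lines. Several assumptions apply: the target is a hypothetical three-state DFA over {a, b}; the equivalence query is approximated by a bounded exhaustive test; and counter-examples are treated by adding all their suffixes to E, a common simplification that is not necessarily the treatment used in this talk or in Angluin's original algorithm. All names are illustrative.

```python
from itertools import product

ALPHABET = "ab"
# Hypothetical target DFA (assumption): state 0 initial, state 2 accepting.
DELTA = {(0, "a"): 1, (0, "b"): 0,
         (1, "a"): 0, (1, "b"): 2,
         (2, "a"): 2, (2, "b"): 1}


def mq(word):
    """Membership query answered by the teacher."""
    state = 0
    for letter in word:
        state = DELTA[(state, letter)]
    return state == 2


def eq(accepts, max_len=8):
    """Bounded stand-in for an equivalence query; returns a cex or None."""
    for n in range(max_len + 1):
        for letters in product(ALPHABET, repeat=n):
            word = "".join(letters)
            if accepts(word) != mq(word):
                return word
    return None


def lstar():
    S, E = [""], [""]  # initialize the table

    def row(u):
        # Fill in the table: one membership query per (prefix, suffix) pair.
        return tuple(mq(u + e) for e in E)

    while True:
        # Close the table: each one-letter extension of S must match a row of S.
        changed = True
        while changed:
            changed = False
            s_rows = {row(s) for s in S}
            for s, a in product(list(S), ALPHABET):
                if row(s + a) not in s_rows:
                    S.append(s + a)  # a new state has been discovered
                    changed = True
                    break

        # Make hypothesis H from the closed table.
        state_of = {row(s): s for s in S}
        delta = {(s, a): state_of[row(s + a)] for s in S for a in ALPHABET}
        final = {s for s in S if mq(s)}

        def accepts(word, delta=delta, final=final):
            q = ""
            for a in word:
                q = delta[(q, a)]
            return q in final

        cex = eq(accepts)
        if cex is None:
            return S, delta, final  # True: return H
        # Treat the cex: add all of its non-empty suffixes as new columns.
        for i in range(len(cex)):
            if cex[i:] not in E:
                E.append(cex[i:])


states, delta, final = lstar()
print(states, final)
```

Because a prefix is added to S only when its row differs from every row of S, the rows of S stay pairwise distinct, so the hypothesis is well defined without a separate consistency check.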
