Summary Machine learning in general and of formal languages in - PowerPoint PPT Presentation

Omega Automata: Minimization and Learning 1 Oded Maler CNRS - VERIMAG Grenoble, France 2007 1 Joint work with A. Pnueli, late 80s

Summary ◮ Machine learning in general and of formal languages in particular ◮ States, minimization and learning in finitary automata ◮ Basics of ω -automata ◮ Why minimization/learning does not work for ω -languages in the general case ◮ A solution for the B ∩ ¯ B subclass ◮ Toward a general solution

Machine Learning ◮ Given a sample consisting of a set of pairs ( x , f ( x )) for some unknown function f ◮ Find a (representation of) a function f ′ : X → Y which is compatible with the sample

Machine Learning ◮ Given a sample consisting of a set of pairs ( x , f ( x )) for some unknown function f ◮ Find a (representation of) a function f ′ : X → Y which is compatible with the sample ◮ Many issues and variations: ◮ Validity of inductive inference ◮ Static or dynamic sampling ◮ Passive or active sampling - can we influence the choice of examples ◮ Evaluation criteria: identification in the limit, probabilities, etc.

Learning Formal Languages ◮ For sets of sequences (languages) L ⊆ Σ ∗ , we want to learn the characteristic function χ L : Σ ∗ → { 0 , 1 } ◮ The sample elements are of the form ( u , χ L ( u )) ◮ The goal is to find a representation (say, automaton) compatible with the sample

Learning Formal Languages ◮ For sets of sequences (languages) L ⊆ Σ ∗ , we want to learn the characteristic function χ L : Σ ∗ → { 0 , 1 } ◮ The sample elements are of the form ( u , χ L ( u )) ◮ The goal is to find a representation (say, automaton) compatible with the sample ◮ The problem was first posed in Moore 56: Gedanken experiments on sequential machines ◮ It was solved in Gold 72: System identification via state characterization ◮ Various complexity issues concerning the number of examples as a function of the number of states (Gold, Trakhtenbrot and Barzdins, Angluin)

Regular Sets and their Syntactic Congruences ◮ With every L ⊆ Σ ∗ we can define the following equivalence relation u ∼ L v iff ∀ w ∈ Σ ∗ u · w ∈ L ⇐ ⇒ v · w ∈ L ◮ Two prefixes are equivalent if they “accept” the same suffixes

Regular Sets and their Syntactic Congruences ◮ With every L ⊆ Σ ∗ we can define the following equivalence relation u ∼ L v iff ∀ w ∈ Σ ∗ u · w ∈ L ⇐ ⇒ v · w ∈ L ◮ Two prefixes are equivalent if they “accept” the same suffixes ◮ This relation is a right-congruence with respect to concatenation: u ∼ v implies u · w ∼ v · w for all u , v , w ∈ Σ ∗

Regular Sets and their Syntactic Congruences ◮ With every L ⊆ Σ ∗ we can define the following equivalence relation u ∼ L v iff ∀ w ∈ Σ ∗ u · w ∈ L ⇐ ⇒ v · w ∈ L ◮ Two prefixes are equivalent if they “accept” the same suffixes ◮ This relation is a right-congruence with respect to concatenation: u ∼ v implies u · w ∼ v · w for all u , v , w ∈ Σ ∗ ◮ Myhill-Nerode theorem: a language L is accepted by a finite automaton iff ∼ L has finitely many congruence classes ◮ This relation is sometimes called the syntactic congruence associated with L

The minimal Automaton ◮ Let Σ ∗ / ∼ be the quotient of Σ ∗ by ∼ , that is the set of its equivalence classes and let [ u ] denote the equivalence class of u ◮ The minimal automaton for L is A L = (Σ , Q , q 0 , δ, F ) where ◮ The states are the ∼ -classes: Q = Σ ∗ / ∼ ◮ Ther initial state is the class of the empty word: q 0 = [ ε ] ◮ Transition function: δ ([ u ] , a ) = [ u · a ] ◮ Accepting states are those that accept the empty word: F = { [ u ] : u · ε ∈ L }

The minimal Automaton ◮ Let Σ ∗ / ∼ be the quotient of Σ ∗ by ∼ , that is the set of its equivalence classes and let [ u ] denote the equivalence class of u ◮ The minimal automaton for L is A L = (Σ , Q , q 0 , δ, F ) where ◮ The states are the ∼ -classes: Q = Σ ∗ / ∼ ◮ Ther initial state is the class of the empty word: q 0 = [ ε ] ◮ Transition function: δ ([ u ] , a ) = [ u · a ] ◮ Accepting states are those that accept the empty word: F = { [ u ] : u · ε ∈ L } ◮ This is canonical representation of L based on its I/O semantics ◮ A L is homomorphic to any other automaton accepting L

Observation Tables (Gold 1972) ◮ Given a language L , imagine an infinite two-dimensional table ◮ The rows of the table are indexed by all elements of Σ ∗ ◮ The columns of the table are indexed by all elements of Σ ∗ ◮ Each entry u , v in the table indicates whether u · v ∈ L (whether after reading prefix u we accept v )

Observation Tables (Gold 1972) ◮ Given a language L , imagine an infinite two-dimensional table ◮ The rows of the table are indexed by all elements of Σ ∗ ◮ The columns of the table are indexed by all elements of Σ ∗ ◮ Each entry u , v in the table indicates whether u · v ∈ L (whether after reading prefix u we accept v ) ◮ For finite automata, according to Myhill-Nerode, there will be only finitely-many distinct rows (and columns) ◮ It is sufficient to use tables over Σ n × Σ n

Example b a a b a b ε a b aa ab ba bb · · · ε − − − − + − − · · · a − − + − − + − · · · b − − − − + − − · · · aa − − − − + − − · · · − − − · · · ab + + + + − − + − − + − · · · ba − − − − + − − · · · bb · · · aba + + − + − − + · · · abb − − + − − + − · · · · · · ε ∼ b ∼ aa a ∼ ba ∼ abb ab ∼ aba

A Sufficient Sample to Characterize the Automaton b a a b a b E ε a b ε − − − S a − − + ab + + − b − − − S · Σ aa − − − − S aba + + − − − abb +

A Sufficient Sample to Characterize the Automaton b a a b a b E ε a b ε − − − S a − − + ab + + − b − − − S · Σ aa − − − − S aba + + − − − abb + ◮ The states of the canonical automaton are S = { [ ε ], [ a ] and [ ab ] }

A Sufficient Sample to Characterize the Automaton b a a b a b E ε a b ε − − − S a − − + ab + + − b − − − S · Σ aa − − − − S aba + + − − − abb + ◮ The states of the canonical automaton are S = { [ ε ], [ a ] and [ ab ] } ◮ The words/paths correspond to a spanning tree ◮ Elements of S · Σ − S correspond to cross- and back-edges in the spanning tree

Angluin’s L ∗ Algorithm ◮ An incremental algorithm to construct the table based on two sources of information: ◮ Membership query Member ( u )? where the learner asks whether u ∈ L ◮ Equivalence query Equiv ( A ) where the learner asks whether automaton A is the (minimal) automaton for L ◮ The answer is either “yes” or a counter-example

Angluin’s L ∗ Algorithm ◮ An incremental algorithm to construct the table based on two sources of information: ◮ Membership query Member ( u )? where the learner asks whether u ∈ L ◮ Equivalence query Equiv ( A ) where the learner asks whether automaton A is the (minimal) automaton for L ◮ The answer is either “yes” or a counter-example ◮ The learner asks membership queries until it can build an automaton

Angluin’s L ∗ Algorithm ◮ An incremental algorithm to construct the table based on two sources of information: ◮ Membership query Member ( u )? where the learner asks whether u ∈ L ◮ Equivalence query Equiv ( A ) where the learner asks whether automaton A is the (minimal) automaton for L ◮ The answer is either “yes” or a counter-example ◮ The learner asks membership queries until it can build an automaton ◮ Then it asks an equivalence query and if there is a counter-example it adds its suffixes to the columns, thus discovering new states and so on

Angluin’s L ∗ Algorithm ◮ An incremental algorithm to construct the table based on two sources of information: ◮ Membership query Member ( u )? where the learner asks whether u ∈ L ◮ Equivalence query Equiv ( A ) where the learner asks whether automaton A is the (minimal) automaton for L ◮ The answer is either “yes” or a counter-example ◮ The learner asks membership queries until it can build an automaton ◮ Then it asks an equivalence query and if there is a counter-example it adds its suffixes to the columns, thus discovering new states and so on ◮ Polynomial in the number of states

ω -Languages ◮ Let Σ ω be the set of all infinite sequences over Σ ◮ An ω -language is a subset L ⊆ Σ ω ◮ The ω -regular sets can be written as a finite union of sets of the form U · V ω with U and V finitary regular sets ◮ Every non-empty ω -regular set contains an ultimately-periodic sequence of the form u · v ω

Acceptance of ω -Languages by ω -Automata ◮ Consider a deterministic automaton (Σ , Q , δ, q 0 ) ◮ When an infinite word u is read by the automaton it induces an infinite run, an infinite sequence of states ◮ This run is summarized by Inf ( u ), the set of states visited infinitely-often by the run

Summary Machine learning in general and of formal languages in - PowerPoint PPT Presentation

Omega Automata: Minimization and Learning 1 Oded Maler CNRS - VERIMAG Grenoble, France 2007 1 Joint work with A. Pnueli, late 80s Summary Machine learning in general and of formal languages in particular States, minimization and

Baldwin Space Summary October 25 1 Baldwin School Space Summary 2 Baldwin School Space Summary

1 Product Range Products 2 summary summary summary summary Relays with 8 and 11-Pins

An Ultramarathon Pie with Doge Glaze An Ultramarathon Pie with Doge Glaze Marathon: The Summary

SUMMARY OF 2 0 1 5 BRI TI SH EVENTI NG DATA DATA SUMMARY 2015 68,269 Cross Country Starters

summary(dsm_x_tw) summary(dsm_xyb_tw) summary(dsm_xy_tw) Overview Estimating smooths How

New patent case filings per year 1 Summary Judgment motions per year 2 All courts: 101 Summary

Search Summary Search Summary Some material from: D Lin, J You, JC Latombe 1 Search Summary #

Q3FY18 RESULTS Results Summary Operating Highlights Financial Summary Key Strategies Appendix

Summary 1. Summary of

Preliminary Results For year end 31st July 2019 6 November 2019 SUMMARY & OUTLOOK SUMMARY

EXECUTIVE SUMMARY ABOUT SEMPERTI Semperti Executive Summary Version: v1 // 2016 SEMPERTI

Q1FY18 RESULTS Results Summary Operating Highlights Financial Summary Key Strategies Appendix

How similar are these curves? Jessica Sherette EAPSI Research and Experience Summary of Proposal

Lecture 12: Summary Summary Advanced Digital Communications (EQ2410) 1 Standards Final Exam

Security Summary Michael McCool Intel Osaka, W3C Web of Things F2F, 17 May 2017 Summary

GDRSD FINANCIAL GDRSD FINANCIAL GDRSD FINANCIAL GDRSD FINANCIAL OVERVIEW SUMMARY OVERVIEW

Midterm Scores Min 1Q Median 3Q Max 23 58 68 83 100 The exam will be curved, with

Course on Inverse Problems Albert Tarantola Fourth Lesson: Sampling a Probability Distribution

Multi-parameter models - Metropolis sampling Applied Bayesian Statistics Dr. Earvin Balderama

Probabilistic Graphical Models Lecture 17 EM CS/CNS/EE 155 Andreas Krause Announcements

Stochastic Simulation Markov Chain Monte Carlo Bo Friis Nielsen Institute of Mathematical

Metropolis Sampling Matt Pharr cs348b May 20, 2003 Introduction Unbiased MC method for

Reasonable Accommodations and Modifications for People with Disabilities D E B O R A H T H R O

+ Soo Choi Sr. Manager, eBook Production HarperCollins Publishers Reaching the Same Screen

Summary Machine learning in general and of formal languages in - PowerPoint PPT Presentation

Omega Automata: Minimization and Learning 1 Oded Maler CNRS - VERIMAG Grenoble, France 2007 1 Joint work with A. Pnueli, late 80s Summary Machine learning in general and of formal languages in particular States, minimization and

Baldwin Space Summary October 25 1 Baldwin School Space Summary 2 Baldwin School Space Summary

1 Product Range Products 2 summary summary summary summary Relays with 8 and 11-Pins

An Ultramarathon Pie with Doge Glaze An Ultramarathon Pie with Doge Glaze Marathon: The Summary

SUMMARY OF 2 0 1 5 BRI TI SH EVENTI NG DATA DATA SUMMARY 2015 68,269 Cross Country Starters

summary(dsm_x_tw) summary(dsm_xyb_tw) summary(dsm_xy_tw) Overview Estimating smooths How

New patent case filings per year 1 Summary Judgment motions per year 2 All courts: 101 Summary

Search Summary Search Summary Some material from: D Lin, J You, JC Latombe 1 Search Summary #

Q3FY18 RESULTS Results Summary Operating Highlights Financial Summary Key Strategies Appendix

Summary 1. Summary of

Preliminary Results For year end 31st July 2019 6 November 2019 SUMMARY &amp; OUTLOOK SUMMARY

EXECUTIVE SUMMARY ABOUT SEMPERTI Semperti Executive Summary Version: v1 // 2016 SEMPERTI

Q1FY18 RESULTS Results Summary Operating Highlights Financial Summary Key Strategies Appendix

How similar are these curves? Jessica Sherette EAPSI Research and Experience Summary of Proposal

Lecture 12: Summary Summary Advanced Digital Communications (EQ2410) 1 Standards Final Exam

Security Summary Michael McCool Intel Osaka, W3C Web of Things F2F, 17 May 2017 Summary

GDRSD FINANCIAL GDRSD FINANCIAL GDRSD FINANCIAL GDRSD FINANCIAL OVERVIEW SUMMARY OVERVIEW

Midterm Scores Min 1Q Median 3Q Max 23 58 68 83 100 The exam will be curved, with

Course on Inverse Problems Albert Tarantola Fourth Lesson: Sampling a Probability Distribution

Multi-parameter models - Metropolis sampling Applied Bayesian Statistics Dr. Earvin Balderama

Probabilistic Graphical Models Lecture 17 EM CS/CNS/EE 155 Andreas Krause Announcements

Stochastic Simulation Markov Chain Monte Carlo Bo Friis Nielsen Institute of Mathematical

Metropolis Sampling Matt Pharr cs348b May 20, 2003 Introduction Unbiased MC method for

Reasonable Accommodations and Modifications for People with Disabilities D E B O R A H T H R O

+ Soo Choi Sr. Manager, eBook Production HarperCollins Publishers Reaching the Same Screen

Preliminary Results For year end 31st July 2019 6 November 2019 SUMMARY & OUTLOOK SUMMARY