Regular expressions and Kleenes theorem Informatics 2A: Lecture 5 - PowerPoint PPT Presentation

Closure properties of regular languages Regular expressions Algebra for regular expressions Regular expressions and Kleene’s theorem Informatics 2A: Lecture 5 John Longley School of Informatics University of Edinburgh jrl@inf.ed.ac.uk 29 September, 2011 1 / 20

Closure properties of regular languages Regular expressions Algebra for regular expressions 1 Closure properties of regular languages ǫ -NFAs Closure under concatenation Closure under Kleene star 2 Regular expressions From regular expressions to ǫ -NFAs From NFAs to regular expressions 3 Algebra for regular expressions From DFAs to regular expressions: a practical method 2 / 20

Closure properties of regular languages Regular expressions Algebra for regular expressions Clicker meta-question What would you consider to be the optimal number of clicker questions per lecture? (Not counting meta-questions like this one.) 1 1 2 2 3 3 4 4 5 0 3 / 20

Closure properties of regular languages ǫ -NFAs Regular expressions Closure under concatenation Algebra for regular expressions Closure under Kleene star Closure properties of regular languages We’ve seen that if both L 1 and L 2 are regular languages, so is L 1 ∪ L 2 . We sometimes express this by saying that regular languages are closed under the ‘union’ operation. (‘Closed’ used here in the sense of ‘self-contained’.) We will show that regular languages are closed under other operations too: Concatenation: write L 1 . L 2 for the language { xy | x ∈ L 1 , y ∈ L 2 } Kleene star: let L ∗ denote the language { ǫ } ∪ L ∪ L . L ∪ L . L . L ∪ . . . For these, we’ll need to work with a minor variation on NFAs. All this will lead us to another way of defining regular languages: via regular expressions. 4 / 20

Closure properties of regular languages ǫ -NFAs Regular expressions Closure under concatenation Algebra for regular expressions Closure under Kleene star NFAs with ǫ -transitions We can vary the definition of NFA by also allowing transitions labelled with the special symbol ǫ ( not a symbol in Σ). The automaton may (but doesn’t have to) perform an ǫ -transition at any time, without reading an input symbol. This is quite convenient: for instance, we can turn any NFA into an ǫ -NFA with just one start state and one accepting state: . . . . . . . . . . ε ε ε ε . . . . . ε ε . . . . . (Add ǫ -transitions from new start state to each state in S , and from each state in F to new accepting state.) 5 / 20

Closure properties of regular languages ǫ -NFAs Regular expressions Closure under concatenation Algebra for regular expressions Closure under Kleene star Equivalence to ordinary NFAs Allowing ǫ -transitions is just a convenience: it doesn’t fundamentally change the power of NFAs. If N = ( Q , ∆ , S , F ) is an ǫ -NFA, we can convert N to an ordinary NFA with the same associated language, by simply ‘expanding’ ∆ and S to allow for silent ǫ -transitions. Formally, the ǫ -closure of a transition relation ∆ ⊆ Q × (Σ ∪{ ǫ } ) × Q is the smallest relation ∆ that contains ∆ and satisfies: if ( q , u , q ′ ) ∈ ∆ and ( q ′ , ǫ, q ′′ ) ∈ ∆ then ( q , u , q ′′ ) ∈ ∆; if ( q , ǫ, q ′ ) ∈ ∆ and ( q ′ , u , q ′′ ) ∈ ∆ then ( q , u , q ′′ ) ∈ ∆. Likewise, the ǫ -closure of S under ∆ is the smallest state S ∆ that contains S and satisfies: if q ∈ S ∆ and ( q , ǫ, q ′ ) ∈ ∆ then q ′ ∈ S ∆ . We can then replace the ǫ -NFA ( Q , ∆ , S , F ) with the ordinary NFA ( Q , ∆ ∩ ( Q × Σ × Q ) , S ∆ , F ) 6 / 20

Closure properties of regular languages ǫ -NFAs Regular expressions Closure under concatenation Algebra for regular expressions Closure under Kleene star Concatenation of regular languages We can use ǫ -NFAs to show that regular languages are closed under the concatenation operation: L 1 . L 2 = { xy | x ∈ L 1 , y ∈ L 2 } If L 1 , L 2 are any regular languages, choose ǫ -NFAs N 1 , N 2 that define them. As noted earlier, we can pick N 1 and N 2 to have just one start state and one accepting state. Now hook up N 1 and N 2 like this: ε N1 N2 Clearly, this NFA corresponds to the language L 1 . L 2 . To ponder: do we need the ǫ -transition in the middle? 7 / 20

Closure properties of regular languages ǫ -NFAs Regular expressions Closure under concatenation Algebra for regular expressions Closure under Kleene star Kleene star Similarly, we can now show that regular languages are closed under the Kleene star operation: L ∗ = { ǫ } ∪ L ∪ L . L ∪ L . L . L ∪ . . . (E.g. if L = { aaa , b } , L ∗ contains strings like baaab .) For suppose L is represented by an ǫ -NFA N with one start state and one accepting state. Consider the following ǫ -NFA: ε N ε Clearly, this ǫ -NFA corresponds to the language L ∗ . 8 / 20

Closure properties of regular languages From regular expressions to ǫ -NFAs Regular expressions From NFAs to regular expressions Algebra for regular expressions Regular expressions We’ve been looking at ways of specifying regular languages via machines (often given by diagrams). But it’s also useful to have more textual ways of defining languages. A regular expression is a written mathematical expression that defines a language over a given alphabet Σ. The basic regular expressions are ∅ a (for a ∈ Σ) ǫ From these, more complicated regular expressions can be built up by (repeatedly) applying the binary operations + , . and the unary operation ∗ . Example: ( a . b + ǫ ) ∗ + a We allow brackets to indicate priority. In the absence of brackets, ∗ binds more tightly than . , which itself binds more tightly than +. a + b . a ∗ a + ( b . ( a ∗ )) So means Also the dot is often omitted: ab means a . b 9 / 20

Closure properties of regular languages From regular expressions to ǫ -NFAs Regular expressions From NFAs to regular expressions Algebra for regular expressions How do regular expressions define languages? A regular expression is itself just a written expression (actually in some context-free ‘meta-language’). However, every regular expression α over Σ can be seen as defining an actual language L ( α ) ⊆ Σ ∗ in the following way: L ( ∅ ) = ∅ , L ( ǫ ) = { ǫ } , L ( a ) = { a } . L ( α + β ) = L ( α ) ∪ L ( β ) L ( α.β ) = L ( α ) . L ( β ) L ( α ∗ ) = L ( α ) ∗ Example: a + ba ∗ defines the language { a , b , ba , baa , baaa , . . . } . The languages defined by ∅ , ǫ , a are obviously regular. What’s more, we’ve seen that regular languages are closed under union, concatenation and Kleene star. This means every regular expression defines a regular language. (Proof by induction on the size of the regular expression.) 10 / 20

Closure properties of regular languages From regular expressions to ǫ -NFAs Regular expressions From NFAs to regular expressions Algebra for regular expressions Clicker question Consider again the language { x ∈ 0 , 1 ∗ | x contains an even number of 0’s } Which of the following regular expressions is not a possible definition of this language? 1 (1 ∗ 01 ∗ 01 ∗ ) ∗ 2 (1 ∗ 01 ∗ 0) ∗ 1 ∗ 3 1 ∗ (01 ∗ 0) ∗ 1 ∗ 4 (1 + 01 ∗ 0) ∗ 11 / 20

Closure properties of regular languages From regular expressions to ǫ -NFAs Regular expressions From NFAs to regular expressions Algebra for regular expressions Solution The third expression, 1 ∗ (01 ∗ 0) ∗ 1 ∗ , doesn’t define the language in question. For instance, it doesn’t admit the string 00100. 12 / 20

Closure properties of regular languages From regular expressions to ǫ -NFAs Regular expressions From NFAs to regular expressions Algebra for regular expressions Kleene’s theorem We’ve seen that every regular expression defines a regular language. C onversely, we shall show that every regular language can be defined by a regular expression. So we have the following result, known as Kleene’s theorem: DFAs and regular expressions give rise to exactly the same class of languages (the regular languages). As we’ve already seen, NFAs (with or without ǫ -transitions) also give rise to this class of languages. So the evidence is mounting that the class of regular languages is mathematically a very ‘natural’ class to consider. 13 / 20

Closure properties of regular languages From regular expressions to ǫ -NFAs Regular expressions From NFAs to regular expressions Algebra for regular expressions Proof of Kleene’s theorem: From NFAs to regular expressions Given an NFA N = ( Q , ∆ , S , F ) (without ǫ -transitions), we’ll show how to define a regular expression defining the same language as N . In fact, to build this up, we’ll construct a three-dimensional array of regular expressions α X uv : one for every u ∈ Q , v ∈ Q , X ⊆ Q . Informally, α X uv will define the set of strings that get us from u to v allowing only intermediate states in X . We shall build suitable regular expressions α X u , v by working our way from smaller to larger sets X . At the end of the day, the language defined by N will be given by the sum (+) of the languages α Q sf for all states s ∈ S and f ∈ F . 14 / 20

Regular expressions and Kleenes theorem Informatics 2A: Lecture 5 - PowerPoint PPT Presentation

Closure properties of regular languages Regular expressions Algebra for regular expressions Regular expressions and Kleenes theorem Informatics 2A: Lecture 5 John Longley School of Informatics University of Edinburgh jrl@inf.ed.ac.uk 29

Kleene Algebras: The Algebra of Regular Expressions Adam Braude University of Puget Sound May

Concurrent Kleene Algebra Tobias Kapp e University College London BCTCS 2018 What is Kleene

Regular Languages Today we continue looking at our first class Kleene Theorem I of

Regular Expressions (REs) Regular Expressions (REs) p.1/37 Expressions In arithmetic:

MA/CSSE 474 Theory of Computation Kleene's Theorem Practical Regular Expressions Kleenes

Objectives You should be able to ... Regular Languages Use the syntax of regular expressions

Regexp Lecture 26: Regular Expressions Regular Expressions Regular expressions are a small

Regular expressions and Kleenes theorem Informatics 2A: Lecture 5 Alex Simpson School of

Section 3 Non-Determinism, Regular Expressions, and Kleenes Theorem Automata Theory

Section 3 Non-Determinism, Regular Expressions, and Kleenes Theorem Automata Theory

C++0x Regular Expressions Simon Andreas Frimann Lund Datalogisk Institut Kbenhavns

Regular Expressions = Regular Languages Mark Greenstreet, CpSc 421, Term 1, 2008/09 17

parsing with regular expressions and extensions to kleene algebra Niels Bjrn Bugge Grathwohl

Theory of Computer Science C3. Regular Languages: Regular Expressions, Pumping Lemma Malte

Regular Expressions A regular expression describes a language using three operations. Regular

Chapter 7 Expressions and Statements Expressions Arithmetic Expressions Conditional

Ardens Rule and the Kleene-Sch utzenberger Theorem Joost Winter Centrum Wiskunde &

Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture

GN3plus JRA 1 Network architectures for Horizon 2020 Tony Breach, NORDUnet A/S 2nd TERENA

CSE 116: Fall 2019 Introduction to Functional Programming Formalizing Nano Owen Arden

CMPS 112: Spring 2019 Comparative Programming Languages Formalizing Nano Owen Arden

Efficient & Robust LK for Mobile Vision Instructor - Simon Lucey 16-623 - Designing Computer

gra raph coherent st states tes in loo oop quantum um gra ravi vity ty Mehdi Assanioussi

Synthetic Aperture Radar Backscatter and Coherence in in Tropical Forest Elsa Carla De Grandi,

Regular expressions and Kleenes theorem Informatics 2A: Lecture 5 - PowerPoint PPT Presentation

Closure properties of regular languages Regular expressions Algebra for regular expressions Regular expressions and Kleenes theorem Informatics 2A: Lecture 5 John Longley School of Informatics University of Edinburgh jrl@inf.ed.ac.uk 29

Kleene Algebras: The Algebra of Regular Expressions Adam Braude University of Puget Sound May

Concurrent Kleene Algebra Tobias Kapp e University College London BCTCS 2018 What is Kleene

Regular Languages Today we continue looking at our first class Kleene Theorem I of

Regular Expressions (REs) Regular Expressions (REs) p.1/37 Expressions In arithmetic:

MA/CSSE 474 Theory of Computation Kleene's Theorem Practical Regular Expressions Kleenes

Objectives You should be able to ... Regular Languages Use the syntax of regular expressions

Regexp Lecture 26: Regular Expressions Regular Expressions Regular expressions are a small

Regular expressions and Kleenes theorem Informatics 2A: Lecture 5 Alex Simpson School of

Section 3 Non-Determinism, Regular Expressions, and Kleenes Theorem Automata Theory

Section 3 Non-Determinism, Regular Expressions, and Kleenes Theorem Automata Theory

C++0x Regular Expressions Simon Andreas Frimann Lund Datalogisk Institut Kbenhavns

Regular Expressions = Regular Languages Mark Greenstreet, CpSc 421, Term 1, 2008/09 17

parsing with regular expressions and extensions to kleene algebra Niels Bjrn Bugge Grathwohl

Theory of Computer Science C3. Regular Languages: Regular Expressions, Pumping Lemma Malte

Regular Expressions A regular expression describes a language using three operations. Regular

Chapter 7 Expressions and Statements Expressions Arithmetic Expressions Conditional

Ardens Rule and the Kleene-Sch utzenberger Theorem Joost Winter Centrum Wiskunde &amp;

Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture

GN3plus JRA 1 Network architectures for Horizon 2020 Tony Breach, NORDUnet A/S 2nd TERENA

CSE 116: Fall 2019 Introduction to Functional Programming Formalizing Nano Owen Arden

CMPS 112: Spring 2019 Comparative Programming Languages Formalizing Nano Owen Arden

Efficient &amp; Robust LK for Mobile Vision Instructor - Simon Lucey 16-623 - Designing Computer

gra raph coherent st states tes in loo oop quantum um gra ravi vity ty Mehdi Assanioussi

Synthetic Aperture Radar Backscatter and Coherence in in Tropical Forest Elsa Carla De Grandi,

Ardens Rule and the Kleene-Sch utzenberger Theorem Joost Winter Centrum Wiskunde &

Efficient & Robust LK for Mobile Vision Instructor - Simon Lucey 16-623 - Designing Computer