Theory of Computer Science
C3. Regular Languages: Regular Expressions
Gabriele Röger, University of Basel
March 25, 2020

C3. Regular Languages: Regular Expressions
C3.1 Regular Expressions
C3.2 Summary

C3.1 Regular Expressions

Overview
[Diagram: Automata & Formal Languages; Languages & Grammars; Regular Languages: Regular Grammars, DFAs, NFAs, Regular Expressions, Pumping Lemma, Minimal Automata, properties; Context-free Languages; Context-sensitive & Type-0 Languages]

Formalisms for Regular Languages
◮ DFAs, NFAs and regular grammars can all describe exactly the regular languages.
◮ Are there other concepts with the same expressiveness?
◮ Yes: regular expressions.

Concatenation of Languages and Kleene Star
Concatenation
◮ For two languages $L_1$ (over $\Sigma_1$) and $L_2$ (over $\Sigma_2$), the concatenation of $L_1$ and $L_2$ is the language $L_1 L_2 = \{ w_1 w_2 \in (\Sigma_1 \cup \Sigma_2)^* \mid w_1 \in L_1, w_2 \in L_2 \}$.
Kleene star
◮ For a language $L$ define
  ◮ $L^0 = \{\varepsilon\}$
  ◮ $L^1 = L$
  ◮ $L^{i+1} = L^i L$ for $i \in \mathbb{N}_{>0}$
◮ The definition of Kleene star on $L$ is $L^* = \bigcup_{i \geq 0} L^i$.
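The two operations can be tried out on small finite languages. The following is a minimal Python sketch, assuming languages are represented as Python sets of strings and the empty word as the empty string; the function names `concat`, `power` and `star_up_to` are illustrative and not part of the slides. Since the Kleene star is usually infinite, only its words up to a chosen length are enumerated.

```python
def concat(l1, l2):
    """Concatenation: all words w1 w2 with w1 in l1 and w2 in l2."""
    return {w1 + w2 for w1 in l1 for w2 in l2}


def power(lang, i):
    """i-th power of lang: power 0 is {empty word}, power i+1 is (power i) lang."""
    result = {""}
    for _ in range(i):
        result = concat(result, lang)
    return result


def star_up_to(lang, max_len):
    """All words of the Kleene star of lang whose length is at most max_len."""
    result = {""}
    frontier = {""}
    while frontier:
        # Extend the newly found words by one more factor from lang,
        # keeping only words that are short enough and not seen before.
        frontier = {w for w in concat(frontier, lang)
                    if len(w) <= max_len and w not in result}
        result |= frontier
    return result
```

For example, `star_up_to({"ab"}, 4)` yields the empty word, `ab` and `abab`.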

Regular Expressions: Definition
Definition (Regular Expressions)
Regular expressions over an alphabet $\Sigma$ are defined inductively:
◮ $\emptyset$ is a regular expression
◮ $\varepsilon$ is a regular expression
◮ If $a \in \Sigma$, then $a$ is a regular expression
If $\alpha$ and $\beta$ are regular expressions, then so are:
◮ $(\alpha\beta)$ (concatenation)
◮ $(\alpha \mid \beta)$ (alternative)
◮ $(\alpha^*)$ (Kleene closure)
German: reguläre Ausdrücke, Verkettung, Alternative, kleenesche Hülle
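The inductive definition maps naturally onto a small syntax-tree data type with one constructor per case. The sketch below is one possible encoding in Python, not something taken from the slides; the class names (`Empty`, `Epsilon`, `Symbol`, `Concat`, `Alt`, `Star`) and the helper `show` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Empty:
    """The regular expression for the empty set."""


@dataclass(frozen=True)
class Epsilon:
    """The regular expression for the empty word."""


@dataclass(frozen=True)
class Symbol:
    char: str  # a single symbol of the alphabet


@dataclass(frozen=True)
class Concat:
    left: object
    right: object


@dataclass(frozen=True)
class Alt:
    left: object
    right: object


@dataclass(frozen=True)
class Star:
    inner: object


def show(regex):
    """Render a syntax tree in the fully parenthesized notation of the definition."""
    if isinstance(regex, Empty):
        return "∅"
    if isinstance(regex, Epsilon):
        return "ε"
    if isinstance(regex, Symbol):
        return regex.char
    if isinstance(regex, Concat):
        return "(" + show(regex.left) + show(regex.right) + ")"
    if isinstance(regex, Alt):
        return "(" + show(regex.left) + "|" + show(regex.right) + ")"
    if isinstance(regex, Star):
        return "(" + show(regex.inner) + "*)"
    raise TypeError(f"not a regular expression: {regex!r}")
```

For instance, `show(Concat(Symbol("a"), Star(Symbol("b"))))` produces `(a(b*))`.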

Regular Expressions: Omitting Parentheses
Parentheses can be omitted by convention:
◮ Kleene closure $\alpha^*$ binds more strongly than concatenation $\alpha\beta$.
◮ Concatenation binds more strongly than alternative $\alpha \mid \beta$.
◮ Parentheses for nested concatenations/alternatives are omitted (we can treat them as left-associative; it does not matter).
Example: $ab^*c \mid \varepsilon \mid abab^*$ abbreviates $((((a(b^*))c) \mid \varepsilon) \mid (((ab)a)(b^*)))$.
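These binding rules are exactly what a small recursive-descent parser encodes: one parsing level per operator, with the strongest-binding operator at the innermost level. The following Python sketch restores the omitted parentheses of an abbreviated expression. It assumes single-character symbols, `|` for alternative and `*` for Kleene closure; the function name `parenthesize` is hypothetical.

```python
def parenthesize(text):
    """Return the fully parenthesized form of an abbreviated regular expression."""
    tokens = [c for c in text if not c.isspace()]
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def take():
        nonlocal pos
        tok = tokens[pos]
        pos += 1
        return tok

    def parse_alt():
        # Weakest binding: alternatives, grouped to the left.
        left = parse_cat()
        while peek() == "|":
            take()
            left = "(" + left + "|" + parse_cat() + ")"
        return left

    def parse_cat():
        # Concatenation binds more strongly than alternative.
        left = parse_star()
        while peek() is not None and peek() not in "|)":
            left = "(" + left + parse_star() + ")"
        return left

    def parse_star():
        # Kleene closure binds most strongly.
        inner = parse_atom()
        while peek() == "*":
            take()
            inner = "(" + inner + "*)"
        return inner

    def parse_atom():
        tok = peek()
        if tok is None or tok in "|)*":
            raise ValueError(f"unexpected token at position {pos}")
        take()
        if tok == "(":
            inner = parse_alt()
            if peek() != ")":
                raise ValueError("missing closing parenthesis")
            take()
            return inner
        return tok

    result = parse_alt()
    if pos != len(tokens):
        raise ValueError(f"unexpected token at position {pos}")
    return result
```

Applied to the example above, `parenthesize("ab*c|ε|abab*")` returns `((((a(b*))c)|ε)|(((ab)a)(b*)))`.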

Regular Expressions: Examples
Some regular expressions for $\Sigma = \{0, 1\}$:
◮ $0^* 1 0^*$
◮ $(0 \mid 1)^* 1 (0 \mid 1)^*$
◮ $((0 \mid 1)(0 \mid 1))^*$
◮ $01 \mid 10$
◮ $0(0 \mid 1)^* 0 \mid 1(0 \mid 1)^* 1 \mid 0 \mid 1$
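The example expressions can be explored with Python's `re` module, whose pattern syntax is far richer than the formal definition but agrees with it on `|`, `*` and parentheses. This is only an experiment aid under that assumption; the helper `matches` is hypothetical. `re.fullmatch` is used because a word belongs to the described language only if the whole word matches.

```python
import re

# The five example expressions, written in Python's pattern syntax.
EXAMPLES = [
    "0*10*",
    "(0|1)*1(0|1)*",
    "((0|1)(0|1))*",
    "01|10",
    "0(0|1)*0|1(0|1)*1|0|1",
]


def matches(pattern, word):
    """True if the whole word is described by the pattern."""
    return re.fullmatch(pattern, word) is not None


if __name__ == "__main__":
    for pattern in EXAMPLES:
        accepted = [w for w in ["", "0", "1", "01", "010", "0110"]
                    if matches(pattern, w)]
        print(pattern, accepted)
```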

Regular Expressions: Language
Definition (Language Described by a Regular Expression)
The language described by a regular expression $\gamma$, written $L(\gamma)$, is inductively defined as follows:
◮ If $\gamma = \emptyset$, then $L(\gamma) = \emptyset$.
◮ If $\gamma = \varepsilon$, then $L(\gamma) = \{\varepsilon\}$.
◮ If $\gamma = a$ with $a \in \Sigma$, then $L(\gamma) = \{a\}$.
◮ If $\gamma = (\alpha\beta)$, where $\alpha$ and $\beta$ are regular expressions, then $L(\gamma) = L(\alpha) L(\beta)$.
◮ If $\gamma = (\alpha \mid \beta)$, where $\alpha$ and $\beta$ are regular expressions, then $L(\gamma) = L(\alpha) \cup L(\beta)$.
◮ If $\gamma = (\alpha^*)$, where $\alpha$ is a regular expression, then $L(\gamma) = L(\alpha)^*$.
Examples: blackboard
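The inductive definition can be turned almost literally into a recursive function, one branch per case. The sketch below is an illustration under two assumptions that are not in the slides: regular expressions are encoded as nested Python tuples such as `("cat", x, y)`, and only words up to a given length are produced, because the described language may be infinite.

```python
def language(regex, max_len):
    """Words of the language described by regex that have length <= max_len.

    Assumed tuple encoding: ("empty",), ("eps",), ("sym", a),
    ("cat", x, y), ("alt", x, y), ("star", x).
    """
    kind = regex[0]
    if kind == "empty":
        return set()
    if kind == "eps":
        return {""}
    if kind == "sym":
        return {regex[1]} if max_len >= 1 else set()
    if kind == "alt":
        return language(regex[1], max_len) | language(regex[2], max_len)
    if kind == "cat":
        left = language(regex[1], max_len)
        right = language(regex[2], max_len)
        return {u + v for u in left for v in right if len(u + v) <= max_len}
    if kind == "star":
        base = language(regex[1], max_len)
        result, frontier = {""}, {""}
        while frontier:
            # Append one more factor to the newly found words.
            frontier = {u + v for u in frontier for v in base
                        if len(u + v) <= max_len} - result
            result |= frontier
        return result
    raise ValueError(f"unknown node: {kind!r}")
```

With `zeros = ("star", ("sym", "0"))`, the expression `("cat", ("cat", zeros, ("sym", "1")), zeros)` encodes the first example of the previous slide, and `language(..., 2)` lists its words of length at most 2.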

Finite Languages Can Be Described By Regular Expressions
Theorem
Every finite language can be described by a regular expression.
Proof.
For every word $w \in \Sigma^*$, a regular expression describing the language $\{w\}$ can be built from regular expressions $a \in \Sigma$ by using concatenations. (Use $\varepsilon$ if $w = \varepsilon$.)
For every finite language $L = \{w_1, w_2, \dots, w_n\}$, a regular expression describing $L$ can be built from the regular expressions for $\{w_i\}$ by using alternatives. (Use $\emptyset$ if $L = \emptyset$.)
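The proof is constructive and can be mirrored in a few lines. The Python sketch below builds a pattern for Python's `re` module rather than a formal regular expression, which is an assumption made for the sake of a runnable example: the empty pattern plays the role of the expression for the empty word, and `(?!)` (a lookahead that never succeeds) plays the role of the expression for the empty set. The function names are hypothetical.

```python
import re


def word_regex(word):
    """Pattern for the one-word language: concatenate the word's symbols.

    For the empty word this is the empty pattern.
    """
    return "".join(re.escape(symbol) for symbol in word)


def finite_language_regex(words):
    """Pattern for a finite language: alternatives of the one-word patterns."""
    if not words:
        return "(?!)"  # matches nothing at all
    # sorted() only makes the result deterministic; the order is irrelevant.
    return "|".join(word_regex(w) for w in sorted(words))
```

For example, `finite_language_regex({"", "ab", "ba"})` gives `|ab|ba`, and `re.fullmatch` with that pattern succeeds exactly on the three given words.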

Regular Expressions Not More Powerful Than NFAs
Theorem
For every language that can be described by a regular expression, there is an NFA that accepts it.
Proof.
Let $\gamma$ be a regular expression. We show the statement by induction over the structure of regular expressions.
For $\gamma = \emptyset$, $\gamma = \varepsilon$ and $\gamma = a$, NFAs that accept $L(\gamma)$ are obvious.
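The three base cases can be written down explicitly. The Python sketch below assumes an NFA model with a set of start states and no ε-transitions, matching the five-tuples used later in the proof; the `NFA` record, the `accepts` function and the state names `q0`, `q1` are illustrative choices.

```python
from collections import namedtuple

# Assumed model: delta is a set of (state, symbol, successor) triples.
NFA = namedtuple("NFA", "states alphabet delta starts ends")


def accepts(nfa, word):
    """Simulate the NFA by tracking the set of currently reachable states."""
    current = set(nfa.starts)
    for symbol in word:
        current = {q2 for (q1, a, q2) in nfa.delta
                   if q1 in current and a == symbol}
    return bool(current & nfa.ends)


def nfa_empty(alphabet):
    """NFA for the empty language: no end state."""
    return NFA({"q0"}, alphabet, set(), {"q0"}, set())


def nfa_epsilon(alphabet):
    """NFA for the language containing only the empty word."""
    return NFA({"q0"}, alphabet, set(), {"q0"}, {"q0"})


def nfa_symbol(alphabet, a):
    """NFA for the language containing only the one-symbol word a."""
    return NFA({"q0", "q1"}, alphabet, {("q0", a, "q1")}, {"q0"}, {"q1"})
```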

Proof (continued).
For $\gamma = (\alpha\beta)$, let $M_\alpha$ and $M_\beta$ be NFAs that (by ind. hypothesis) accept $L(\alpha)$ and $L(\beta)$. W.l.o.g., their states are disjoint.
Construct NFA $M$ for $L(\gamma)$ by "daisy-chaining" $M_\alpha$ and $M_\beta$:
◮ states: union of states of $M_\alpha$ and $M_\beta$
◮ start states: those of $M_\alpha$; if $\varepsilon \in L(\alpha)$, also those of $M_\beta$
◮ end states: end states of $M_\beta$
◮ state transitions: all transitions of $M_\alpha$ and of $M_\beta$; additionally: for every transition to an end state of $M_\alpha$, an equally labeled transition to all start states of $M_\beta$
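The daisy-chaining construction can be sketched in Python as follows. The sketch assumes NFAs with a set of start states and no ε-transitions, so that the empty word is accepted exactly when some start state is also an end state; the `NFA` record and the function names are hypothetical.

```python
from collections import namedtuple

# Assumed model: delta is a set of (state, symbol, successor) triples.
NFA = namedtuple("NFA", "states alphabet delta starts ends")


def accepts(nfa, word):
    """Simulate the NFA by tracking the set of currently reachable states."""
    current = set(nfa.starts)
    for symbol in word:
        current = {q2 for (q1, a, q2) in nfa.delta
                   if q1 in current and a == symbol}
    return bool(current & nfa.ends)


def concat_nfa(m_a, m_b):
    """Daisy-chain m_a and m_b into an NFA for the concatenated language."""
    if m_a.states & m_b.states:
        raise ValueError("state sets must be disjoint")
    # For every transition into an end state of m_a, add an equally
    # labeled transition into every start state of m_b.
    bridge = {(q1, a, s)
              for (q1, a, q2) in m_a.delta if q2 in m_a.ends
              for s in m_b.starts}
    starts = set(m_a.starts)
    if m_a.starts & m_a.ends:  # m_a accepts the empty word
        starts |= m_b.starts
    return NFA(m_a.states | m_b.states,
               m_a.alphabet | m_b.alphabet,
               m_a.delta | m_b.delta | bridge,
               starts,
               set(m_b.ends))
```

The end states of `m_a` are deliberately not end states of the result: a word of the first language alone is accepted only if the second automaton accepts the empty word, in which case the bridge transitions already lead into a start state of `m_b` that is also an end state.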

Proof (continued).
For $\gamma = (\alpha \mid \beta)$, by the induction hypothesis let $M_\alpha = \langle Q_\alpha, \Sigma, \delta_\alpha, S_\alpha, E_\alpha \rangle$ and $M_\beta = \langle Q_\beta, \Sigma, \delta_\beta, S_\beta, E_\beta \rangle$ be NFAs that accept $L(\alpha)$ and $L(\beta)$. W.l.o.g., $Q_\alpha \cap Q_\beta = \emptyset$.
Then the "union automaton" $M = \langle Q_\alpha \cup Q_\beta, \Sigma, \delta_\alpha \cup \delta_\beta, S_\alpha \cup S_\beta, E_\alpha \cup E_\beta \rangle$ accepts the language $L(\gamma)$.
German: Vereinigungsautomat
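The union automaton is a componentwise union of the two five-tuples, which a short Python sketch makes concrete. It assumes the transition relation is stored as a set of (state, symbol, successor) triples; the `NFA` record and the function names are hypothetical.

```python
from collections import namedtuple

# Assumed model: a set of start states, no ε-transitions,
# delta as a set of (state, symbol, successor) triples.
NFA = namedtuple("NFA", "states alphabet delta starts ends")


def accepts(nfa, word):
    """Simulate the NFA by tracking the set of currently reachable states."""
    current = set(nfa.starts)
    for symbol in word:
        current = {q2 for (q1, a, q2) in nfa.delta
                   if q1 in current and a == symbol}
    return bool(current & nfa.ends)


def union_nfa(m_a, m_b):
    """Union automaton: run both automata side by side."""
    if m_a.states & m_b.states:
        raise ValueError("state sets must be disjoint")
    return NFA(m_a.states | m_b.states,
               m_a.alphabet | m_b.alphabet,
               m_a.delta | m_b.delta,
               m_a.starts | m_b.starts,
               m_a.ends | m_b.ends)
```

Disjoint state sets matter here: with shared states, a run could switch from one automaton to the other in the middle of a word.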

Proof (continued).
For $\gamma = (\alpha^*)$, by the induction hypothesis let $M_\alpha = \langle Q_\alpha, \Sigma, \delta_\alpha, S_\alpha, E_\alpha \rangle$ be an NFA that accepts $L(\alpha)$.
If $\varepsilon \notin L(\alpha)$, add an additional state to $M_\alpha$ that is a start and end state and not connected to other states. $M_\alpha$ now recognizes $L(\alpha) \cup \{\varepsilon\}$.
$M$ is constructed from $M_\alpha$ by adding the following new transitions: whenever $M_\alpha$ has a transition from $s$ to end state $s'$ with symbol $a$, add transitions from $s$ to every start state with symbol $a$.
Then $L(M) = L(\gamma)$.
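The two steps of this construction (make the empty word accepted, then add the loop-back transitions) can be sketched in Python. The sketch assumes NFAs with a set of start states and no ε-transitions, so the empty word is accepted exactly when a start state is also an end state; the `NFA` record, the function names and the integer naming of the fresh state are illustrative choices.

```python
from collections import namedtuple

# Assumed model: delta is a set of (state, symbol, successor) triples.
NFA = namedtuple("NFA", "states alphabet delta starts ends")


def accepts(nfa, word):
    """Simulate the NFA by tracking the set of currently reachable states."""
    current = set(nfa.starts)
    for symbol in word:
        current = {q2 for (q1, a, q2) in nfa.delta
                   if q1 in current and a == symbol}
    return bool(current & nfa.ends)


def star_nfa(m):
    """NFA for the Kleene star of the language accepted by m."""
    states, starts, ends = set(m.states), set(m.starts), set(m.ends)
    if not (starts & ends):  # the empty word is not accepted yet
        fresh = 0
        while fresh in states:
            fresh += 1
        # Isolated state that is both a start and an end state.
        states.add(fresh)
        starts.add(fresh)
        ends.add(fresh)
    # Whenever a transition leads into an end state, add equally
    # labeled transitions into every start state.
    loops = {(q1, a, s)
             for (q1, a, q2) in m.delta if q2 in ends
             for s in starts}
    return NFA(states, m.alphabet, m.delta | loops, starts, ends)
```

For an automaton accepting only the word `ab`, the result accepts the empty word, `ab`, `abab` and so on, but not `a` or `aba`.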

DFAs Not More Powerful Than Regular Expressions
Theorem
Every language accepted by a DFA can be described by a regular expression.
Without proof.

Regular Languages vs. Regular Expressions
Theorem (Kleene)
The set of languages that can be described by regular expressions is exactly the set of regular languages.
This follows directly from the previous two theorems, since DFAs and NFAs both describe exactly the regular languages.

C3.2 Summary
