Lecture 4 Regular Expressions 4-0 DFAs vs NFAs - PDF document

✬ ✩ The University of Melbourne Dept. of Computer Science and Software Eng. 433–330 Theory of Computation Harald Søndergaard Lecture 4 Regular Expressions ✫ ✪ 4-0

✬ ✩ DFAs vs NFAs Surprisingly, for finite automata, adding the non-determinism does not result in more computing power. The class of languages recognised by NFAs is exactly the class of regular languages. Theorem: Every NFA has an equivalent DFA. The proof rests on the so-called subset construction. Given NFA N , we construct DFA M , each of whose states is a set of N -states. If N has k states then M may have up to 2 k states (but it will often have far fewer than that). ✫ ✪ 4-1

� � � � ✬ ✩ DFAs vs NFAs (cont.) Consider the NFA �� 2 a � � b � � � � �� 1 a,b a �� ǫ 3 We can systematically construct an equivalent DFA. Its start state is { 1 , 3 } . From this state an a will take us back to { 1 , 3 } . From { 1 , 3 } , b can only take us to { 2 } . Continuing thus, gives the DFA. Any state S which contains an accept state from ✫ the NFA will be an accept state for the DFA. ✪ 4-2

✬ ✩ More Formally . . . Let N = ( Q, Σ , δ, q 0 , F ). Let E ( S ) be the “ ǫ closure” of S ⊆ Q , that is, S together with all states reachable from S using only ǫ steps: { s ′ ∈ Q | s � ∗ → ǫ s ′ } E ( S ) = s ∈ S We construct M = ( P ( Q ) , Σ , δ ′ , q ′ 0 , F ′ ) as follows. • q ′ 0 = E ( { q 0 } ). • F ′ = { S ⊆ Q | S ∩ F � = ∅} . • δ ′ ( S, a ) = � s ∈ S E ( δ ( s, a )). Note: This construction may include some unreachable states. ✫ ✪ 4-3

� � ✬ ✩ Closure Results Theorem: The class of regular languages is closed under union. Proof: Let A and B be regular languages. An NFA that recognises A ∪ B is easily constructed: machine for A � � ǫ � � � �� ǫ � � machine for B ✫ ✪ 4-4

✬ ✩ Closure Results (cont.) Theorem: The class of regular languages is closed under concatenation. Proof: Let A and B be regular languages with these recognisers, respectively: From these we can easily construct an NFA that recognises A ◦ B : ǫ ǫ ✫ ✪ 4-5

✬ ✩ Closure Results (cont.) Theorem: The class of regular languages is closed under Kleene star. Proof: Let A be a regular language with recogniser Here is how we construct an NFA to recognise A ∗ : ǫ ǫ ǫ ✫ ✪ 4-6

✬ ✩ Closure Results (cont.) Regular languages have several other closure properties. They are closed under • intersection, • complement, A • difference, as A \ B = A ∩ B , • reversal. ✫ ✪ 4-7

✬ ✩ Regular Expressions Regular expressions is a notation for languages. You are probably familiar with similar notation in Unix, Awk or Perl. Example: 0 ∪ 1 ∪ (0(0 ∪ 1) ∗ 0) ∪ (1(0 ∪ 1) ∗ 1) denotes the set of non-empty binary strings that begin and end with the same symbol. We can avoid excessive parentheses if we agree that the star binds tighter than concatenation, which in turn binds tighter than union. ✫ ✪ 4-8

✬ ✩ Regular Expressions (cont.) Syntax: The regular expressions over an alphabet Σ = { a 1 , . . . , a n } is given by the grammar → | · · · | | | ∅ re a 1 a n ǫ | re ∪ re | re ◦ re | re ∗ (Sometimes we leave out the ◦ .) Semantics: { a } L ( a ) = { ǫ } L ( ǫ ) = L ( ∅ ) ∅ = L ( R 1 ∪ R 2 ) = L ( R 1 ) ∪ L ( R 2 ) L ( R 1 ◦ R 2 ) = L ( R 1 ) ◦ L ( R 2 ) L ( R ∗ ) = L ( R ) ∗ ✫ ✪ 4-9

✬ ✩ Regular Expressions – Examples { 110 } 110 : (ΣΣ) ∗ : all strings of even length (0 ∪ ǫ )( ǫ ∪ 1) { ǫ, 0 , 1 , 01 } : 1 ∗ : all sequences of 1s ǫ ∪ 1 ∪ ( ǫ ∪ 1) ∗ ( ǫ ∪ 1) : all sequences of 1s ✫ ✪ 4-10

✬ ✩ Regular Expressions vs Automata Theorem: A language is regular iff it can be described by a regular expression. Let us first show the ‘if’ direction, by showing how to convert a regular expression R into an NFA that recognises L ( R ). The proof is by structural induction over the form of R . � �� a Case R = a : � �� Case R = ǫ : � �� Case R = ∅ : Case R = R 1 ∪ R 2 , R = R 1 ◦ R 2 , or R = R ∗ 1 : We already gave the constructions when we showed that regular languages were closed under the regular operations. ✫ ✪ 4-11

� � ✬ ✩ NFAs from Regular Expressions Let us construct an NFA for ( a ∪ b ) ∗ bc Start from innermost expressions and work out: � �� a � �� b So a ∪ b yields: �� a ǫ � �� ǫ �� b ✫ ✪ 4-12

� � � � � � � � ✬ ✩ Then ( a ∪ b ) ∗ yields: �� a � � � ǫ � � � �� ǫ � � � � ǫ � � � � ǫ � � ǫ � �� b Finally ( a ∪ b ) ∗ bc yields: �� a � � � � � ǫ � � � � ǫ � � � �� ǫ � � � � � � ǫ b c � � � � � � ǫ � � � ǫ � � � �� ǫ � � � � � �� b ǫ Of course there are simpler, equivalent automata. ✫ ✪ 4-13

� � � � ✬ ✩ Regular Expressions from NFAs We now show the ‘only if’ direction of the theorem. We sketch how an NFA can be turned into a regular expression in a systematic process of “state elimination”. In the process, arcs are labelled with regular expressions. Since we only eliminate states that are neither start nor accept states, the process produces R 1 R 3 R R 2 � �� either or R 4 ( R 1 ∪ R 2 R ∗ 3 R 4 ) ∗ R 2 R ∗ 3 in the first case. R ∗ in the second. Note that R s could well be ǫ or ∅ . ✫ ✪ 4-14

� � ✬ ✩ The State Elimination Process Consider a node R 2 R 1 � �� R 3 Any such pair of incoming/outgoing arcs get replaced by a single arc that bypasses the node. The new arc gets the label R 1 R ∗ 2 R 3 . If there are n accept states, we eliminate non-accept states first, then apply the process for each accepting state, giving n regular expressions. Then we form the union. Let us illustrate this process. ✫ ✪ 4-15

� � � � � ✬ ✩ State Elimination Example 0 , 1 � �� 0 , 1 � �� 0 , 1 � �� 1 A B C D First turn annotations into regular expressions: 0 ∪ 1 � �� 0 ∪ 1 � �� 0 ∪ 1 � �� 1 A B C D Then eliminate B : 0 ∪ 1 � �� 1(0 ∪ 1) � �� 0 ∪ 1 � �� A C D Here we branch, eliminating C and D separately. 0 ∪ 1 0 ∪ 1 � �� 1(0 ∪ 1) � �� 1(0 ∪ 1)(0 ∪ 1) � �� A C A D ✫ ✪ 4-16

✬ ✩ State Elimination Example (cont.) The resulting regular expression is (0 ∪ 1) ∗ 1(0 ∪ 1) (0 ∪ 1) ∗ 1(0 ∪ 1)(0 ∪ 1) ∪ That language could also be written (0 ∪ 1) ∗ 1(0 ∪ 1)( ǫ ∪ 0 ∪ 1) Sipser provides all the details of this kind of translation. ✫ ✪ 4-17

✬ ✩ Some Useful Laws for Regexps A ∪ A = A A ∪ B = B ∪ A ( A ∪ B ) ∪ C = A ∪ ( B ∪ C ) ( A ◦ B ) ◦ C = A ◦ ( B ◦ C ) ∅ ∪ A = A ǫ ◦ A = A ◦ ǫ = A ∅ ◦ A = A ◦ ∅ = ∅ ( A ∪ B ) ◦ C = A ◦ C ∪ B ◦ C A ◦ ( B ∪ C ) = A ◦ B ∪ A ◦ C ( A ∗ ) ∗ = A ∗ ∅ ∗ = ǫ ∗ = ǫ ( ǫ ∪ A ) ∗ = A ∗ ( A ∪ B ) ∗ = ( A ∗ B ∗ ) ∗ ✫ ✪ 4-18

Lecture 4 Regular Expressions 4-0 DFAs vs NFAs - PDF document

The University of Melbourne Dept. of Computer Science and Software Eng. 433330 Theory of Computation Harald Sndergaard Lecture 4 Regular Expressions 4-0 DFAs vs NFAs Surprisingly, for finite automata,

Malaysian Healthy Ageing Society Plenary Lecture Plenary Lecture Plenary Lecture Plenary

CEE 680 Lecture #2 1/22/2020 1 CEE 680 Lecture #2 1/22/2020 2 CEE 680 Lecture #2

Pocket Lecture Pocket Lecture Pocket Lecture Pocket Lecture Listen Audio Notes Progress

Multiphase Modelling in Cancer Helen Byrne Wolfson Centre for Mathematical Biology Mathematical

Previous Lecture Todays Lecture Slides for Lecture 5 ENEL 353: Digital Circuits Fall 2013

Previous Lecture Todays Lecture Slides for Lecture 30 ENEL 353: Digital Circuits Fall

Previous Lecture Todays Lecture Slides for Lecture 28 Completion of divide-by-3 counter

Previous Lecture Todays Lecture Slides for Lecture 12 ENEL 353: Digital Circuits Fall

Previous Lecture Todays Lecture Slides for Lecture 3 ENEL 353: Digital Circuits Fall 2013

Previous Lecture Todays Lecture Slides for Lecture 2 ENEL 353: Digital Circuits Fall 2013

Previous Lecture Todays Lecture Slides for Lecture 35 ENEL 353: Digital Circuits Fall

Lecture Capture Introduction to Lecture Capture Learning Outcomes What will lecture capture

Previous Lecture Todays Lecture Slides for Lecture 32 Completion of a timing analysis

Repetition Automatic Control, Basic Course, Lecture 11 Fredrik Bagge Carlson December 17, 2016

Previous Lecture Todays Lecture Slides for Lecture 26 ENEL 353: Digital Circuits Fall

Previous Lecture Todays Lecture Slides for Lecture 33 ENEL 353: Digital Circuits Fall

CSE443 Compilers Dr. Carl Alphonce alphonce@buffalo.edu 343 Davis Hall Announcements HW-01

Theory of Computer Science C6. Context-free Languages: Closure & Decidability Gabriele R

Regular Expressions Greg Plaxton Theory in Programming Practice, Spring 2004 Department of

Formal Languages 1 Discrete Mathematical Structures Formal Languages

91.304 Foundations of (Th (Theoretical) Computer Science ti l) C t S i Chapter 1 Lecture

Compiler Construction Lecture 3: Scanner Generators 2020-01-14 Michael Engel Includes material

CS 301 Lecture 07 Closure properties of regular languages Stephen Checkoway February 7, 2018

Applications in finite state automata Completeness of Regular Relations Kurt Eberle

Lecture 4 Regular Expressions 4-0 DFAs vs NFAs - PDF document

The University of Melbourne Dept. of Computer Science and Software Eng. 433330 Theory of Computation Harald Sndergaard Lecture 4 Regular Expressions 4-0 DFAs vs NFAs Surprisingly, for finite automata,

Malaysian Healthy Ageing Society Plenary Lecture Plenary Lecture Plenary Lecture Plenary

CEE 680 Lecture #2 1/22/2020 1 CEE 680 Lecture #2 1/22/2020 2 CEE 680 Lecture #2

Pocket Lecture Pocket Lecture Pocket Lecture Pocket Lecture Listen Audio Notes Progress

Multiphase Modelling in Cancer Helen Byrne Wolfson Centre for Mathematical Biology Mathematical

Previous Lecture Todays Lecture Slides for Lecture 5 ENEL 353: Digital Circuits Fall 2013

Previous Lecture Todays Lecture Slides for Lecture 30 ENEL 353: Digital Circuits Fall

Previous Lecture Todays Lecture Slides for Lecture 28 Completion of divide-by-3 counter

Previous Lecture Todays Lecture Slides for Lecture 12 ENEL 353: Digital Circuits Fall

Previous Lecture Todays Lecture Slides for Lecture 3 ENEL 353: Digital Circuits Fall 2013

Previous Lecture Todays Lecture Slides for Lecture 2 ENEL 353: Digital Circuits Fall 2013

Previous Lecture Todays Lecture Slides for Lecture 35 ENEL 353: Digital Circuits Fall

Lecture Capture Introduction to Lecture Capture Learning Outcomes What will lecture capture

Previous Lecture Todays Lecture Slides for Lecture 32 Completion of a timing analysis

Repetition Automatic Control, Basic Course, Lecture 11 Fredrik Bagge Carlson December 17, 2016

Previous Lecture Todays Lecture Slides for Lecture 26 ENEL 353: Digital Circuits Fall

Previous Lecture Todays Lecture Slides for Lecture 33 ENEL 353: Digital Circuits Fall

CSE443 Compilers Dr. Carl Alphonce alphonce@buffalo.edu 343 Davis Hall Announcements HW-01

Theory of Computer Science C6. Context-free Languages: Closure &amp; Decidability Gabriele R

Regular Expressions Greg Plaxton Theory in Programming Practice, Spring 2004 Department of

Formal Languages 1 Discrete Mathematical Structures Formal Languages

91.304 Foundations of (Th (Theoretical) Computer Science ti l) C t S i Chapter 1 Lecture

Compiler Construction Lecture 3: Scanner Generators 2020-01-14 Michael Engel Includes material

CS 301 Lecture 07 Closure properties of regular languages Stephen Checkoway February 7, 2018

Applications in finite state automata Completeness of Regular Relations Kurt Eberle

Theory of Computer Science C6. Context-free Languages: Closure & Decidability Gabriele R