REDUCING DEEP PUSHDOWN AUTOMATA ek K Ing. Zbyn RIVKA, Doctoral - PDF document

REDUCING DEEP PUSHDOWN AUTOMATA ek K ˇ Ing. Zbynˇ RIVKA, Doctoral Degree Programme (2) Dept. of Information Systems, FIT, BUT E-mail: krivka@fit.vutbr.cz Ing. Rudolf SCHÖNECKER, Doctoral Degree Programme (1) Dept. of Information Systems, FIT, BUT E-mail: schonec@fit.vutbr.cz Supervised by: Prof. Alexander Meduna ABSTRACT This contribution presents reducing variant of the deep pushdown automata. Deep pushdown automata is a new generalization of the classical pushdown automata. Basic idea of the modification consists of allowing these automata to access more deeper parts of pushdown and reducing strings to non-input symbols in the pushdown. It works simi- larly to bottom-up analysis simulation of context-free grammars in the classical pushdown automata. Further, this paper presents results of equivalence of reducing deep pushdown automata with n -limited state grammars and infinite hierarchy of language families based on that. 1 INTRODUCTION Consider the standard simulation of a context-free grammar by a classical pushdown automaton acting as a general bottom-up parser (see [4]). During every move, the parser either shifts or reduces its pushdown depending on the top pushdown symbol, current input symbol, and state. Shift operation takes one input symbol and moves it to the top of the pushdown. If a reversal string on the top of the pushdown equals to any right-handed side of a context-free production, this string is reduced to one non-input symbol. In this paper, we discuss one variant of a slight generalization of this automaton. Hereafter, the generalized bottom-up parser represented by pushdown automaton works exactly the same as the above automaton except that it can make reductions of depth m so it replaces the pushdown’s substring with m th topmost non-input symbol in the pushdown, for some m ≥ 1. We call it reducing deep pushdown automaton (abbrev. RDPDA ) and it is a modification of the recently published generalizations of pushdown automata (see [3, 5]). RDPDA has no input tape because the input string is immediately part of the pushdown in the start configuration of RDPDA . The pushdown bottom represented by bottom

symbol corresponds to endmarker of the input string (used in LL(k) translation, see [1]). This minor property can be also simulated by reading the input tape from the right to the left by shift operations. RDPDA also do not need start pushdown symbol. 2 PRELIMINARIES This paper assumes that the reader is familiar with the theory of automata, formal languages, and parsing (see [1, 4]). For a set, Q , card ( Q ) denotes the cardinality of Q . I denotes the set of all positive integers. For an alphabet, V , V ∗ represents the free monoid generated by V under the operation of concatenation. The identity of V ∗ is denoted by ε . Set V + = V ∗ −{ ε } ; algebraically, V + is thus the free semigroup generated by V under the operation of concatenation. For w ∈ V ∗ , | w | denotes the length of w and alph ( w ) denotes the set of symbols occurring in w . For W ⊆ V , occur ( w , W ) denotes the number of occurrences of symbols from W in w . For every i ≥ 0, prefix ( w , i ) is w ’s prefix of length i if | w | ≥ i , and prefix ( w , i ) = w if i ≥ | w | + 1. A state grammar (see [2]) is a quintuple, G = ( V , W , T , P , S ) , where V is a total alphabet , W is a finite set of states , T ⊆ V is an alphabet of terminals , S ∈ ( V − T ) is the start symbol , and P ⊆ ( W × ( V − T )) × ( W × V + ) is a finite relation. Instead of ( q , A , p , v ) ∈ P , we write ( q , A ) → ( p , v ) ∈ P throughout. For every z ∈ V ∗ , set G states ( z ) = { q | ( q , B ) → ( p , v ) ∈ P , where B ∈ ( V − T ) ∩ alph ( z ) , v ∈ V + , q , p ∈ W } . If ( q , A ) → ( p , v ) ∈ P , x , y ∈ V ∗ , G states ( x ) = / 0 , then G makes a derivation step from ( q , xAy ) to ( p , xvy ) , symbolically written as ( q , xAy ) ⇒ ( p , xvy ) [( q , A ) → ( p , v )] in G ; in addition, if n is a positive integer satisfying occur ( xA , V − T ) ≤ n , we say that ( q , xAy ) ⇒ ( p , xvy ) [( q , A ) → ( p , v )] is n-limited , symbolically written as ( q , xAy ) n ⇒ ( p , xvy ) [( q , A ) → ( p , v )] . Whenever there is no danger of confusion, we simplify ( q , xAy ) ⇒ ( p , xvy ) [( q , A ) → ( p , v )] and ( q , xAy ) n ⇒ ( p , xvy ) [( q , A ) → ( p , v )] to ( q , xAy ) ⇒ ( p , xvy ) and ( q , xAy ) n ⇒ ( p , xvy ) , respectively. In the standard manner, we extend ⇒ to ⇒ m , where m ≥ 0; then, based on ⇒ m , we define ⇒ + and ⇒ ∗ . Let n ∈ I and υ , ϖ ∈ ( W × V + ) . To express that every derivation step in υ ⇒ m ϖ , υ ⇒ + ϖ , and υ ⇒ ∗ ϖ is n -limited, we write υ n ⇒ m ϖ , υ n ⇒ + ϖ , and υ n ⇒ ∗ ϖ instead of υ ⇒ m ϖ , υ ⇒ + ϖ , and υ ⇒ ∗ ϖ , respectively. The language of G , L ( G ) , is defined as L ( G ) = { w ∈ T ∗ | ( q , S ) ⇒ ∗ ( p , w ) , q , p ∈ W } . Furthermore, we define for every n ≥ 1 , L ( G , n ) = { w ∈ T ∗ | ( q , S ) n ⇒ ∗ ( p , w ) , q , p ∈ W } , and L ( G , n ) is called n-limited language of G . A derivation of the form ( q , S ) n ⇒ ∗ ( p , w ) , where q , p ∈ W and w ∈ T ∗ , represents a successful n-limited generation of w in G . A state grammar G is of degree n for a positive integer n if and only if L ( G , n ) = L ( G ) . ST n denotes the family of languages containing ( n or less)-limited languages of arbitrary state grammar. More for- mally, for every n ≥ 1, set ST n = { L ( G , i ) | G is an arbitrary state grammar, 1 ≤ i ≤ n } . If L ( G , n ) � = L ( G ) for every positive integer n , then G is state grammar of infinite degree . Let ST ∞ = ❙ ∞ n = 1 ST n . Let ST ω be the entire family of state languages. CF and CS denote the families of context-free and context-sensitive languages, respectively. Kasai proved in his paper (see [2]) these crucial theorems concerning state grammars (reformulated in the terms of this paper): Theorem Kasai.2. ST ω = CS .

Corollary Kasai.1. ST ∞ ⊂ ST ω . Theorem Kasai.5. For every n ≥ 1, ST n ⊂ ST n + 1 . Observe that for each n ≥ 1, ST n ⊆ ST n + 1 follows from the definition of state languages. 3 DEFINITIONS A reducing deep pushdown automaton , a RDPDA for short, is a 6-tuple, M = ( Q , Σ , Γ , R , s , F ) , where Q is a finite set of states , Σ is an input alphabet , and Γ is a pushdown alphabet , I , Q , Γ are pairwise disjoint (see Section 2 for I ), Σ ⊆ Γ , Γ − Σ contains a special bottom symbol denoted by #, R ⊆ ( I × Q × ( Γ − { # } ) + × Q × ( Γ − ( Σ ∪ { # } ))) ∪ ( I × Q × ( Γ −{ # } ) ∗ { # }× Q ×{ # } ) is a finite relation , s ∈ Q is the start state , F ⊆ Q is a set of final states . Instead of ( m , q , v , p , A ) ∈ R , we write qv ⊢ mpA ∈ R and call qv ⊢ mpA a rule ; accordingly, R is referred to as the set of M’s rules . A configuration of M is a pair in Q × ( Γ −{ # } ) ∗ { # } . Let χ denote the set of all configurations of M . Let x , y ∈ χ be two configurations. M reduces its pushdown (or makes a move ) from x to y , symbolically written as x � y , if x = ( q , uvz ) , y = ( p , uAz ) , qv ⊢ mpA ∈ R , where A ∈ Γ − Σ , u , v , z ∈ Γ ∗ , q , p ∈ Q , and occur ( u , Γ − Σ ) = m − 1. To express that M makes x � y according to qv ⊢ mpA , we write x � y [ qv ⊢ mpA ] . We say that qv ⊢ mpA is a rule of depth m ; accordingly, x � y [ qv ⊢ mpA ] is a reduction of depth m . If n ∈ I is the minimal positive integer such that each of M ’s rules is of depth n or less, we say that M is of depth n , symbolically written as n M . In the standard manner, extend � to � m , respectively, for m ≥ 0; then, based on � m define � + , and � ∗ . Let M be of depth n , for some n ∈ I . We define the language reduced by n M , L ( n M ) , as L ( n M ) = { w ∈ Σ ∗ | ( s , w # ) � ∗ ( f , # ) in n M with f ∈ F } . For every every k ≥ 1, set RDPD k = { L ( i M ) | i M is a RDPDA , 1 ≤ i ≤ k } . Example 1 Consider a RDPDA, 2 M = ( { s , t , q , p , f } , { a , b , c } , { A , B , # } , R , s , { f } ) with 1 tA , R = { sab ⊢ 2 pB , ⊢ tc 1 qA , ⊢ paAb 2 pB , ⊢ qBc pAB # 1 f # } . ⊢ With aabbcc, M makes ( s , aabbcc # ) ( t , aAbcc # ) [ sab ⊢ 1 tA ] � ( p , aAbBc # ) [ tc ⊢ 2 pB ] � ( q , ABc # ) [ paAb ⊢ 1 qA ] � ( p , AB # ) [ qBc ⊢ 2 pB ] � ( f , # ) [ pAB # ⊢ 1 f # ] � We write ( s , aabbcc # ) � ∗ ( f , # ) , and we say that the string aabbcc is successfully reduced by RDPDA M. Observe that L ( M ) = { a n b n c n | n ≥ 1 } ∈ RDPD 2 , and L ( M ) ∈ CS − CF .

REDUCING DEEP PUSHDOWN AUTOMATA ek K Ing. Zbyn RIVKA, Doctoral - PDF document

REDUCING DEEP PUSHDOWN AUTOMATA ek K Ing. Zbyn RIVKA, Doctoral Degree Programme (2) Dept. of Information Systems, FIT, BUT E-mail: krivka@fit.vutbr.cz Ing. Rudolf SCHNECKER, Doctoral Degree Programme (1) Dept. of Information Systems,

Pushdown Automata Context Free Languages IV Input tape 1 2 Pushdown Automata 3 5 4 State

Pushdown Automata A pushdown automata (PDA) is essentially: Pushdown Automata An NFA with

Pushdown Automata Pushdown Automata p.1/25 Rationale CFG are specification

Pushdown Automata 7-0 Pushdown Automata The automata we saw so far were

Pushdown Automata Reading: Chapter 6 1 Pushdown Automata (PDA) Informally: A PDA is

Equivalence of Pushdown Automata with Context-Free Grammar Equivalence of Pushdown Automata with

Pushdown automata I Context-free languages are more general than regular languages For regular

Parity to Safety in Polynomial Time for Pushdown and Collapsible Pushdown Games Matthew Hague

Pushdown Automata Definition Moves of the PDA Languages of the PDA Deterministic PDAs 1

Pushdown Automata and Parser Sebastian Hack (based on slides by Reinhard Wilhelm and Mooly Sagiv)

COMP3630/6360: Theory of Computation Semester 1, 2020 The Australian National University

Pushdown Automata Definition Moves of the PDA Languages of the PDA Deterministic PDAs 1

Parikh Image of Pushdown Automata Elena Guti errez and Pierre Ganty Introduction Context-free

Pushdown Automata A PDA is an FA together with a stack. Stacks A stack stores information on the

Pushdown automata Z. Sawa (TU Ostrava) Theoretical Computer Science December 3, 2020 1 / 31

Reachability analysis of first-order definable pushdown systems (= pushdown systems in sets with

Training session on the new pharmaceutical legislation, European Medicines Agency 29 November

Demonstrating Acquired Distinctiveness Leveraging Declarations, Establishing Use and Evidence of

Universal Algebra Jesse E. Jenks University of Puget Sound May 2019 Jesse E. Jenks (University

Reducing Observation Time f for Reliable Cyclostationarity R li bl C l i i Feature

presentation.sty : An Infrastructure for EX Presenting Semantic Macros in S T Michael

Review: Intersecting Lines Coplanar lines that share one common point. E Symbol: A D

How complex are decisions savers make at retirement? The consequences Alan Higham Retirement

Variability Extraction and Analysis Toolkit (VEXA) VEXA Introduction The Variability Extraction