Top-Down Parsing 1 Parsing: Review of the Big Picture (1) - PowerPoint PPT Presentation

Top-Down Parsing 1

Parsing: Review of the Big Picture (1) • Context-free grammars (CFGs) • Generation: • Recognition: Given , is • Translation • Given , create a parse tree for • Given , create an AST for • The AST is passed to the next component of our compiler 2

Parsing: Review of the Big Picture (2) • Algorithms • CYK • Top-down (“recursive-descent”) for LL(1) grammars • How to parse, given the appropriate parse table for • How to construct the parse table for • Bottom-up for LALR(1) grammars • How to parse, given the appropriate parse table for • How to construct the parse table for 3

Last time CYK – Step 1: get a grammar in Chomsky Normal Form – Step 2: Build all possible parse trees bottom-up • Start with runs of 1 terminal • Connect 1-terminal runs into 2-terminal runs • Connect 1- and 2- terminal runs into 3-terminal runs • Connect 1- and 3- or 2- and 2- terminal runs into 4 terminal runs • … • If we can connect the entire tree, rooted at the start symbol, we’ve found a valid parse 4

Some Interesting Properties of CYK Very old algorithm – Already well known in early 70s No problems with ambiguous grammars: – Gives a solution for all possible parse tree simultaneously 5

CYK Example F 1,6 W In general, go up a column ⟶ F I W 1,5 2,6 and down a diagonal ⟶ F I Y X ⟶ W L X 1,4 2,5 3,6 ⟶ X N R ⟶ Y L R N ⟶ N id ⟶ 1,3 2,4 3,5 4,6 N I Z ⟶ Z C N ⟶ Z X I id ⟶ L ( 1,2 2,3 3,4 4,5 5,6 ⟶ R ) ⟶ I,N L I,N C I,N R C , id ( id , id ) 6

Thinking about Language Design Balanced considerations – Powerful enough to be useful – Simple enough to be parsable Syntax need not be complex for complex behaviors – Guy Steele’s “Growing a Language” Video: https://www.youtube.com/watch?v=_ahvzDzKdB0 Text: http://www.cs.virginia.edu/~evans/cs655/readings/steele.pdf 7

Restricting the Grammar By restricting our grammars we can – Detect ambiguity – Build linear-time, O(n) parsers LL(1) languages – Particularly amenable to parsing – Parsable by predictive (top-down) parsers • Sometimes called “recursive-descent parsers” 8

Top-Down Parsers Start at the Start symbol Repeatedly: “predict” what production to use – Example: if the current token to be parsed is an id, no need to try productions that start with intLiteral – This might seem simple, but keep in mind that a chain of productions may have to be used to get to the rule that handles, e.g., id 9

Predictive Parser Sketch Parser Scanner Column: terminal Token Stream a b a a EOF Row: nonterminal Selector table current “Work to do” Stack 10

Example S → ( S ) | { S } | ε Input: ( { } ) eof ( ) { } eof S ( S ) ε { S } ε ε current current current current current { ( S S } S ) eof “Work to do” Stack 11

A Snapshot of a Predictive Parser D S eof C t The B A structure already u u A A seen eof “Work to do” D C t Stack The structure that the parser expects to build Input: u t eof 12 current Not yet seen Already processed

Algorithm stack.push( eof ) Initial stack is “ Start eof ” stack.push( Start non-term) t = scanner.getToken() Repeat if stack.top is a terminal y match y with t pop y from the stack t = scanner.next_token() if stack.top is a nonterminal X get table[X,t] pop X from the stack push production’s RHS (each symbol from Right to Left) Until one of the following: accept stack is empty stack.top is a terminal that does not match t stack.top is a non-term and parse-table entry is empty reject 13

Example 2, bad input: You try S → ( S ) | { S } | ε ( ) { } eof S ( S ) ε { S } ε ε INPUT ( ( } eof 14

This Parser Works Great! Given a single token we always knew exactly what production it started ( ) { } eof S ( S ) ε { S } ε ε 15

Two Outstanding Issues 1. How do we know if the language is LL(1) – Easy to imagine a grammar where a single token is not enough to select a rule S → ( S ) | { S } | ε | ( ) 1. How do we build the selector table? – It turns out that there is one answer to both: If our selector table has 1 production per cell, then grammar is LL(1) 16

LL(1) Grammar Transformations Necessary (but not sufficient conditions) for LL(1) parsing: – Free of left recursion • “No left-recursive rules” • Why? Need to look past the list to know when to cap it – Left-factored • “No rules with a common prefix, for any nonterminal” • Why? We would need to look past the prefix to pick the production 17

Left-Recursion • Recall that a grammar for which is left recursive • A grammar is immediately left recursive if the repetition of the LHS nonterminal can happen in one step, e.g., A A α | β • Fortunately, it is always possible to change the grammar to remove left recursion without changing the language it recognizes 18

Why Left Recursion is a Problem (Blackbox View) XList XList x | x CFG snippet: x Current token: Current parse tree: XList How should we grow the tree top-down? XList XList (OR) x XList x Correct if there are no more x s Correct if there are more x s 19 We don’t know which to choose without more lookahead

Why Left Recursion is a Problem (Whitebox View) XList XList x | x CFG snippet: x Current token: Current parse tree: XList x eof XList XList x ε Parse table: XList x XList x XList x (Stack overflow) XList x eof Stack Current 20

Removing Left-Recursion (for a single immediately left-recursive rule) A → A α | β A → β A’ A’ → α A’ | ε Where β does not begin with A 21

Let’s check in on the parse tree… Exp → Factor Exp’ Exp → Exp – Factor Exp’ → - Factor Exp’ | Factor | ε Factor → intlit | ( Exp ) Factor → intlit | ( Exp ) E E E F - F E 2 F E - - 4 E F F 3 F E 3 - 2 grouping of 2 – 3 destroyed ε 4 23 2 – 3 grouped together

 … We’ll fix this issue later 24

General Rule for Removing Immediate Left-Recursion A → A α 1 | A α 2 | … | A α m | β 1 | β 2 | … | β n A → β 1 A’ | β 2 A’ | … | β n A’ A’ → α 1 A’ | α 2 A’ | … | α m A’ | ε 25

Left-Factored Grammars If a nonterminal has two productions whose right-hand sides have a common prefix, the grammar is not left-factored, and not LL(1) Exp → ( Exp ) | ( ) Not left-factored 26

Left Factoring Given productions of the form A → α β 1 | α β 2 A → α A’ A’ → β 1 | β 2 27

Where are we at? We’ve set ourselves up for success in building the selection table – Two things that prevent a grammar from being LL(1) were identified and avoided • Left-recursive grammars • Non left-factored grammars – Next time • Build two data structures that combine to yield a selector table: – FIRST sets – FOLLOW sets 29

Top-Down Parsing 1 Parsing: Review of the Big Picture (1) - PowerPoint PPT Presentation

Top-Down Parsing 1 Parsing: Review of the Big Picture (1) Context-free grammars (CFGs) Generation: Recognition: Given , is Translation Given , create a parse tree for Given , create an AST for The AST is

Introduction to Bottom-Up Parsing Shift-reduce parsing The LR parsing algorithm

CSC 4181 Compiler Construction Parsing 1 1 Outline Top-down v.s. Bottom-up Top-down parsing

Top-Down Parsing Slides modified from Louden Book and Dr. Scherger Top Down Parsing A

Top-Down Parsing Top-Down Parsing #1 Extra Credit Question Given this grammar G: E

* 07/16/96 Plan for Today Shift-reduce parsing The problem with predictive top down parsing

Compiler Design Spring 2018 3.3 Top-down parsing Thomas R. Gross Computer Science Department

Robust Incremental Neural Semantic Graph Parsing Jan Buys and Phil Blunsom Dependency Parsing vs

Basic Parsing Algorithms Chart Parsing Seminar Recent Advances in Parsing Technology WS

Bottom Up Parsing Also known as Shift-Reduce parsing More powerful than top down Dont

Compiler Construction Lecture 7: Bottom-up parsing 2020-01-28 Michael Engel Includes material

Programming Languages: Parsing Onur Tolga S ehito glu Computer Engineering,METU 27 May

Grundlegende Parsingalgorithmen Top-Down & Bottom-Up Parsing Kurt Eberle

Agenda What is Top-down Web services? Benefit of top-down Web services How to develop

Abstract Syntax Trees & Top-Down Parsing Review of Parsing Given a language L(G), a

Parsing Parsing involves: determining if a string belongs to a language, and

To TOP or NOT to TOP www.SAS.com To TOP or NOT to TOP Using the TOP command in Linux By Len van

Asian Cities: Fostering Growth and Inclusion ERNA WITOELAR Asian Cities: Fostering Growth and

SCP and Climate Change - JICAs Cases - DAC-UNEP Workshop on Sustainable Consumption and

Palladium Mine Waterberg Definitive Feasibility Study (DFS) Results S e p t e m b e r 3 0 , 2 0

Building Integrated Approaches to Social Equity Dra. Mara del Carmen Zabala FLACSO Cuba /

PRELIMINARY RESULTS Year ended 30 June 2016 CAUTIONARY STATEMENT This presentation contains

Free Parafermions Paul Fendley University of Virginia Free

Group 6 Team leader: Arnav Kumar Agrawal Assistant Team Leader: Gurniamat Kaur Introduction Aim

InterCriteria Decision Making using Intuitionistic Fuzzy Sets Vassia Atanassova IICT BAS,

Top-Down Parsing 1 Parsing: Review of the Big Picture (1) - PowerPoint PPT Presentation

Top-Down Parsing 1 Parsing: Review of the Big Picture (1) Context-free grammars (CFGs) Generation: Recognition: Given , is Translation Given , create a parse tree for Given , create an AST for The AST is

Introduction to Bottom-Up Parsing Shift-reduce parsing The LR parsing algorithm

CSC 4181 Compiler Construction Parsing 1 1 Outline Top-down v.s. Bottom-up Top-down parsing

Top-Down Parsing Slides modified from Louden Book and Dr. Scherger Top Down Parsing A

Top-Down Parsing Top-Down Parsing #1 Extra Credit Question Given this grammar G: E

* 07/16/96 Plan for Today Shift-reduce parsing The problem with predictive top down parsing

Compiler Design Spring 2018 3.3 Top-down parsing Thomas R. Gross Computer Science Department

Robust Incremental Neural Semantic Graph Parsing Jan Buys and Phil Blunsom Dependency Parsing vs

Basic Parsing Algorithms Chart Parsing Seminar Recent Advances in Parsing Technology WS

Bottom Up Parsing Also known as Shift-Reduce parsing More powerful than top down Dont

Compiler Construction Lecture 7: Bottom-up parsing 2020-01-28 Michael Engel Includes material

Programming Languages: Parsing Onur Tolga S ehito glu Computer Engineering,METU 27 May

Grundlegende Parsingalgorithmen Top-Down &amp; Bottom-Up Parsing Kurt Eberle

Agenda What is Top-down Web services? Benefit of top-down Web services How to develop

Abstract Syntax Trees &amp; Top-Down Parsing Review of Parsing Given a language L(G), a

Parsing Parsing involves: determining if a string belongs to a language, and

To TOP or NOT to TOP www.SAS.com To TOP or NOT to TOP Using the TOP command in Linux By Len van

Asian Cities: Fostering Growth and Inclusion ERNA WITOELAR Asian Cities: Fostering Growth and

SCP and Climate Change - JICAs Cases - DAC-UNEP Workshop on Sustainable Consumption and

Palladium Mine Waterberg Definitive Feasibility Study (DFS) Results S e p t e m b e r 3 0 , 2 0

Building Integrated Approaches to Social Equity Dra. Mara del Carmen Zabala FLACSO Cuba /

PRELIMINARY RESULTS Year ended 30 June 2016 CAUTIONARY STATEMENT This presentation contains

Free Parafermions Paul Fendley University of Virginia Free

Group 6 Team leader: Arnav Kumar Agrawal Assistant Team Leader: Gurniamat Kaur Introduction Aim

InterCriteria Decision Making using Intuitionistic Fuzzy Sets Vassia Atanassova IICT BAS,

Grundlegende Parsingalgorithmen Top-Down & Bottom-Up Parsing Kurt Eberle

Abstract Syntax Trees & Top-Down Parsing Review of Parsing Given a language L(G), a