t r Prsr t - PowerPoint PPT Presentation

❙②♥t❛① ❆♥❛❧②③❡r ✖ P❛rs❡r ❆❙❯ ❚❡①t❜♦♦❦ ❈❤❛♣t❡r ✹✳✷✕✹✳✺✱ ✹✳✼✱ ✹✳✽ Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1

Main tasks if it is a legal program, a program represented then output some ab- parser − → − → by a sequence of tokens stract representation of the program Abstract representations of the input program: • abstract-syntax tree + symbol table • intermediate code • object code Context free grammar (CFG) is used to specify the structure of legal programs. Compiler notes #3, 20060421, Tsan-sheng Hsu 2

Context free grammar (CFG) Definitions: G = ( T, N, P, S ) , where • T : a set of terminals (in lower case letters); • N : a set of nonterminals (in upper case letters); • P : productions of the form A → α 1 , α 2 , . . . , α m , where A ∈ N and α i ∈ T ∪ N ; • S : the starting nonterminal, S ∈ N . Notations: • terminals : lower case English strings, e.g., a , b , c, . . . • nonterminals: upper case English strings, e.g., A , B , C, . . . • α, β, γ ∈ ( T ∪ N ) ∗ ⊲ α , β , γ : alpha, beta and gamma. ⊲ ǫ : epsilon. • � → A α 1 ≡ A → α 1 | α 2 → A α 2 Compiler notes #3, 20060421, Tsan-sheng Hsu 3

How does a CFG define a language? The language defined by the grammar is the set of strings (sequence of terminals) that can be “derived” from the starting nonterminal. How to “derive” something? • Start with: “current sequence” = the starting nonterminal. • Repeat ⊲ find a nonterminal X in the current sequence ⊲ find a production in the grammar with X on the left of the form X → α , where α is ǫ or a sequence of terminals and/or nonterminals. ⊲ create a new “current sequence” in which α replaces X • Until “current sequence” contains no nonterminals. We derive either ǫ or a string of terminals. This is how we derive a string of the language. Compiler notes #3, 20060421, Tsan-sheng Hsu 4

Example E Grammar: ⇒ E − E = • E → int = ⇒ 1 − E • E → E − E = ⇒ 1 − E/E • E → E / E ⇒ 1 − E/ 2 = • E → ( E ) = ⇒ 1 − 4 / 2 Details: • The first step was done by choosing the second production. • The second step was done by choosing the first production. • · · · Conventions: • = ⇒ : means “derives in one step”; + ⇒ : means “derives in one or more steps”; = • ∗ = ⇒ : means “derives in zero or more steps”; • + • In the above example, we can write E = ⇒ 1 − 4 / 2 . Compiler notes #3, 20060421, Tsan-sheng Hsu 5

Language The language defined by a grammar G is + L ( G ) = { w | S = ⇒ ω } , where S is the starting nonterminal and ω is a sequence of terminals or ǫ . An element in a language is ǫ or a sequence of terminals in the set defined by the language. More terminology: ⇒ · · · = ⇒ 1 − 4 / 2 is a of 1 − 4 / 2 from E . • E = derivation • There are several kinds of derivations that are important: ⊲ The derivation is a leftmost one if the leftmost nonterminal always gets to be chosen (if we have a choice) to be replaced. ⊲ It is a rightmost one if the rightmost nonterminal is replaced all the times. Compiler notes #3, 20060421, Tsan-sheng Hsu 6

A way to describe derivations Construct a derivation or parse tree as follows: • start with the starting nonterminal as a single-node tree • REPEAT ⊲ choose a leaf nonterminal X ⊲ choose a production X → α ⊲ symbols in α become the children of X • UNTIL no more leaf nonterminal left Need to annotate the order of derivation on the nodes. E (1) E (2) − E (3) E = ⇒ E − E (5) 1 = ⇒ 1 − E (4) E / E = ⇒ 1 − E/E 4 2 ⇒ 1 − E/ 2 = ⇒ 1 − 4 / 2 = Compiler notes #3, 20060421, Tsan-sheng Hsu 7

� Parse tree examples Example: • Using 1 − 4 / 2 as the input, the left parse tree is E E Grammar: derived. E → int − E E E E / • A string is formed by reading the leaf nodes E → E − E E E − E E / from left to right, which 1 2 E → E/E gives 1 − 4 / 2 . 4 2 1 4 E → ( E ) • The string 1 − 4 / 2 has rightmost derivation leftmost derivation another parse tree on the right. Some standard notations: • Given a parse tree and a fixed order (for example leftmost or rightmost) we can derive the order of derivation. • For the “semantic” of the parse tree, we normally “interpret” the meaning in a bottom-up fashion. That is, the one that is derived last will be “serviced” first. Compiler notes #3, 20060421, Tsan-sheng Hsu 8

Ambiguous grammar If for grammar G and string α , there are • more than one leftmost derivation for α , or • more than one rightmost derivation for α , or • more than one parse tree for α , then G is called ambiguous . • Note: the above three conditions are equivalent in that if one is true, then all three are true. • Q: How to prove this? ⊲ Hint: Any unannotated tree can be annotated with a leftmost number- ing. Problems with an ambiguous grammar: • Ambiguity can make parsing difficult. • Underlying structure is ill-defined: in the example, the precedence is not uniquely defined, e.g., the leftmost parse tree groups 4 / 2 while the rightmost parse tree groups 1 − 4 , resulting in two different semantics. Compiler notes #3, 20060421, Tsan-sheng Hsu 9

Common grammar problems Lists: that is, zero or more id’s separated by commas: • Note it is easy to express one or more id’s: ⊲ NonEmptyIdList → NonEmptyIdList , id | id • For zero or more id’s, ⊲ IdList 1 → ǫ | id | IdList 1 , IdList 1 will not work due to ǫ ; it can generate: id, , id ⊲ IdList 2 → ǫ | IdList 2 , id | id will not work either because it can generate: , id, id • We should separate out the empty list from the general list of one or more id’s. ⊲ OptIdList → ǫ | NonEmptyIdList ⊲ NonEmptyIdList → NonEmptyIdList , id | id Expressions: precedence and associativity as discussed next. Useless terms: to be discussed. Compiler notes #3, 20060421, Tsan-sheng Hsu 10

Grammar that expresses precedence correctly Use one nonterminal for each precedence level Start with lower precedence (in our example “ − ”) Original grammar: Revised grammar: E → int E → E − E | T E → E − E T → T/T | F E → E/E F → int | ( E ) E → ( E ) E E T E E − / T T T T T F / T F ERROR F 1 F 2 2 4 rightmost derivation Compiler notes #3, 20060421, Tsan-sheng Hsu 11

Problems with associativity However, the above grammar is still ambiguous, and parse trees do not express the associative of “ − ” and “ / ” correctly. Example: 2 − 3 − 4 E E − E E − E E − − E E E E T T T T F T T F F 4 F F F 2 Revised grammar: 4 3 3 2 E → E − E | T rightmost derivation rightmost derivation value = 2 − (3−4) = 3 value = (2−3)−4 = −5 T → T/T | F F → int | ( E ) Problems with associativity: • The rule E → E − E has E on both sides of “ − ”. • Need to make the second E to some other nonterminal parsed earlier. • Similarly for the rule E → E/E . Compiler notes #3, 20060421, Tsan-sheng Hsu 12

Grammar considering associative rules Original grammar: Revised grammar: Final grammar: E → int E → E − E | T E → E − T | T E → E − E T → T/T | F T → T/F | F E → E/E F → int | ( E ) F → int | ( E ) E → ( E ) Example: 2 − 3 − 4 E − E T − E T F T F 4 F 3 2 leftmost/rightmost derivation value = (2−3)−4 = −5 Compiler notes #3, 20060421, Tsan-sheng Hsu 13

Rules for associativity Recursive productions: • E → E − T is called a left recursive production. + ⊲ A = ⇒ Aα . • E → T − E is called a right recursive production. + ⇒ αA . ⊲ A = • E → E − E is both left and right recursive. If one wants left associativity, use left recursion. If one wants right associativity, use right recursion. Compiler notes #3, 20060421, Tsan-sheng Hsu 14

Useless terms A non-terminal X is useless if either • a sequence includes X cannot be derived from the starting nonterminal, or • no string can be derived starting from X , where a string means ǫ or a sequence of terminals. Example 1: • S → A B • A → + | − | ǫ • B → digit | B digit • C → . B In Example 1: • C is useless and so is the last production. Any nonterminal not in the right-hand side of any production • is useless! Compiler notes #3, 20060421, Tsan-sheng Hsu 15

More examples for useless terms Example 2: • S → X | Y • X → ( ) • Y → ( Y Y ) Y derives more and more nonterminals and is useless. Any recursively defined nonterminal without a production of deriving ǫ or a string of all terminals is useless! • Direct useless. • Indirect useless: one can only derive direct useless terms. From now on, we assume a grammar contains no useless nonterminals. Compiler notes #3, 20060421, Tsan-sheng Hsu 16

t r Prsr t - PowerPoint PPT Presentation

t r Prsr t tr Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1 Main tasks

Best Position Algorithms for Top-k Queries* Reza Akbarinia, Esther Pacitti, Patrick Valduriez

Connected Living London Presentation 29 May 2020

International practices in agricultural spraying and fertilizer spreading Our speaker VITALII

E XAMPLE 1 SOL 1 SOL 2 Given: Given: 1. AEC = 90 1. AEC = 90 2. BDA = 90 2. BDA

Coupled 4D-variational physical and biological data assimilation in the California Current System

Chair of Computer Science 5 RWTH Aachen University Learning Layers Analysis of Overlapping

Evaluating Viability of Network Functions on Lambda Architecture By Arjun Singhvi, Anshul

Shinsuke Satoh, F. Isoda, T. Sano, H. Hanado (NICT), T. Ushio (Tokyo Metropolitan Univ), S. Otsuka

Welcome Data-Driven Discovery Teik C. Lim, PhD Provost and Vice President for Academic Affairs

Markov chain Monte Carlo Dr. Jarad Niemi STAT 544 - Iowa State University April 2, 2018 Jarad

A practical primal-dual interior-point algorithm for nonsymmetric conic optimization September 8,

The Significance of Food and Drink to the Indo-Europeans. Presented by Rev. Skip Ellison

From Bean to Cup: a history of coffee Michael Firmin UDLS - 19 Sep 2014 What this UDLS is not

Chocolates journey 600AD Mayans, Aztec, Incas Xocolatl Cocoa drink made of

AnimeHistory Tue 10 Mar 2020 11:17:07 AM CST

Idea: why not directly couple dark energy and dark matter? Ein eqn : G = 8 GT

Advanced Analytics in Business [D0S07a] Big Data Platforms & Technologies [D0S06a] Ensemble

BBN Constraints on Scaling Quintessence Models and Dark Energy Surveys Antonio Cardoso

Introduction to Number of Probability favorable outcomes Probability = of an event Total

Coffee Bean Sorter Orange Team Outline 2. Customer 3. Product 4. Distribution 4. Distribution

C ONCURRENCY C ONTROL Prasun Dewan Department of Computer Science University of North Carolina

INFO 1998: Introduction to Machine Learning Lecture 10: Real-World Applications of Data Science

SCET 2015 XII th Annual Workshop on Soft Collinear E fg ective Theory March 25-27, 2015 Santa Fe,

Living with Diabetic Kidney Disease S U E - E L L E N A N D E R S O N H A Y N E S N U T R I T I

t r Prsr t - PowerPoint PPT Presentation

t r Prsr t tr Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1 Main tasks

Best Position Algorithms for Top-k Queries* Reza Akbarinia, Esther Pacitti, Patrick Valduriez

Connected Living London Presentation 29 May 2020

International practices in agricultural spraying and fertilizer spreading Our speaker VITALII

E XAMPLE 1 SOL 1 SOL 2 Given: Given: 1. AEC = 90 1. AEC = 90 2. BDA = 90 2. BDA

Coupled 4D-variational physical and biological data assimilation in the California Current System

Chair of Computer Science 5 RWTH Aachen University Learning Layers Analysis of Overlapping

Evaluating Viability of Network Functions on Lambda Architecture By Arjun Singhvi, Anshul

Shinsuke Satoh, F. Isoda, T. Sano, H. Hanado (NICT), T. Ushio (Tokyo Metropolitan Univ), S. Otsuka

Welcome Data-Driven Discovery Teik C. Lim, PhD Provost and Vice President for Academic Affairs

Markov chain Monte Carlo Dr. Jarad Niemi STAT 544 - Iowa State University April 2, 2018 Jarad

A practical primal-dual interior-point algorithm for nonsymmetric conic optimization September 8,

The Significance of Food and Drink to the Indo-Europeans. Presented by Rev. Skip Ellison

From Bean to Cup: a history of coffee Michael Firmin UDLS - 19 Sep 2014 What this UDLS is not

Chocolates journey 600AD Mayans, Aztec, Incas Xocolatl Cocoa drink made of

AnimeHistory Tue 10 Mar 2020 11:17:07 AM CST

Idea: why not directly couple dark energy and dark matter? Ein eqn : G = 8 GT

Advanced Analytics in Business [D0S07a] Big Data Platforms &amp; Technologies [D0S06a] Ensemble

BBN Constraints on Scaling Quintessence Models and Dark Energy Surveys Antonio Cardoso

Introduction to Number of Probability favorable outcomes Probability = of an event Total

Coffee Bean Sorter Orange Team Outline 2. Customer 3. Product 4. Distribution 4. Distribution

C ONCURRENCY C ONTROL Prasun Dewan Department of Computer Science University of North Carolina

INFO 1998: Introduction to Machine Learning Lecture 10: Real-World Applications of Data Science

SCET 2015 XII th Annual Workshop on Soft Collinear E fg ective Theory March 25-27, 2015 Santa Fe,

Living with Diabetic Kidney Disease S U E - E L L E N A N D E R S O N H A Y N E S N U T R I T I

Advanced Analytics in Business [D0S07a] Big Data Platforms & Technologies [D0S06a] Ensemble