INF5110 Compiler Construction Symbol tables Spring 2016 1 / 43 - PowerPoint PPT Presentation

INF5110 – Compiler Construction Symbol tables Spring 2016 1 / 43

Outline 1. Symbol tables Introduction Symbol table design an interface Implementing symbol tables Block-structure, scoping, binding, name-space organization Symbol tables as attributes in an AG 2 / 43

Symbol tables, in general • central data structure • “data base” or repository associating properties with “names” (identifiers, symbols) • declarations • constants • type declarationss • variable declarations • procedure-declarations • class declarations • . . . • declaring occurrences vs. use occurrences of names (e.g. variables) 4 / 43

Does my compiler need a symbol table? • goal: associate attributes (properties) to syntactic elements (names/symbols) • storing once calculated: (costs memory) ↔ recalculating on demand (costs time) • most often: storing prefered • but: can’t one store it in the nodes of the AST ? • remember: attribute grammar • however, fancy attribute grammars with many rules and complex synthesized/inherited attribute (whose evaluation traverses up and down and across the tree): • might be intransparent • storing info in the tree: might not be efficient ⇒ central repository (= symbol table) better. So: do I need a symbol table? In theory, alternatives exists; in practice, yes, symbol tables needed; most compilers do use symbol tables. 5 / 43

Symbol table as abstract date type • separate interface from implementation • ST: basically nothing else than a lookup-table or dictionary , • associating “keys” with “values” • here: keys = names (id’s, symbols), values the attribute(s) Schematic interface: two core functions (+ more) • insert ; add new binding • lookup : retrieve besides the core functionality: • structure of (different?) name spaces in the implemented language, scoping rules • typically: not one single “flat” namespace ⇒ typically not one big flat look-up table 1 • ⇒ influence on the design/interface of the ST (and indirectly the choice of implementation) • necessary to “delete” or “hide” information ( delete ) 1 Neither conceptually nor the way it’s implemented. 7 / 43

Two main philosophies declarations in the AST nodes Traditional table(s) • to look-up ⇒ tree- search • central repository, separate • insert/delete: implicit, from AST depending on relative • interface positioning in the tree • lookup ( name ) , • look-up: • insert ( name , decl ) , • delete ( name ) • potential lack of efficiency • however: optimizations • last 2: update ST for exist, e.g. “redundant” declarations and when extra table (similar to the entering/exiting blocks traditional ST) Here, for concreteness, (declarations/ are the attributes stored in the ST. It’s not the only possible attribute stored. There may also be more than one ST. 8 / 43

Data structures to implement a symbol table • different ways to implement dictionaries (or look-up tables etc) • simple (association) lists • trees • balanced (AVL, B, red-black, binary-search trees) • hash tables, often method of choice • functional vs. imperative implementation • careful choice influences efficiency • influenced also by the language being implemented, • in particular, its scoping rules (or the structure of the name space in general) etc. 2 2 Also the language used for implementation (and the availability of libraries therein) may play a role (but remember “bootstrapping”) 10 / 43

Nested block / lexical scope for instance: C { i n t i ; . . . ; double d ; void p ( . . . ) ; { i n t i ; . . . } i n t j ; . . . more later 11 / 43

Blocks in other languages L A T EX T EX \ documentclass { a r t i c l e } \ def \x{a} \newcommand{\x}{a} { \ begin {document} \ def \x{b} \x \x {\renewcommand{\x}{b} } \x \x } \ bye \ end {document} But: static vs. dynamic binding (see later) 12 / 43

Hash tables • classical and common implementation for STs • “hash table”: • generic term itself, different general forms of HTs exists • e.g. separate chaining vs. open addressing 3 Code snippet { Separate chaining i n t temp ; i n t j ; r e a l i ; void s i z e ( . . . . ) { { . . . . } } } 3 There is alternative terminology (cf. INF2220), under which separate chaining is also known as open hashing . The open addressing methods are also called closed hashing . That’s how it is. 13 / 43

Block structures in programming languages • almost no language has one global namespace (at least not for variables) • pretty old concept, seriously started with ALGOL60. block • “region” in the program code • delimited often by { and } or BEGIN and END • used to organize the scope of declarations (i.e., the name space) • nested blocks 14 / 43

Block-structured scopes (in C) i , j ; i n t i n t f ( i n t s i z e ) { char i , temp ; . . . { double j ; . . } . . . { char ∗ j ; . . . } } 15 / 43

Nested procedures in Pascal program Ex ; var i , j : i n t e g e r f u n c t i o n f ( s i z e : i n t e g e r ) : i n t e g e r ; var i , temp : char ; procedure g ; var j ; r e a l ; begin . . . end ; procedure h ; var j : ^ char ; begin . . . end ; begin (∗ f ’ s body ∗) . . . end ; begin (∗ main program ∗) . . . end . 16 / 43

Block-strucured via stack-organized separate chaining “Evolution” of the hash table C code snippet i n t i , j ; i n t f ( i n t s i z e ) { char i , temp ; . . . { double j ; . . } . . . { char ∗ j ; . . . } } 17 / 43

Using the syntax tree for lookup lookup ( s t r i n g n ) { k = naavaerende blokk do // l e t e t t e r n i d e c l t i l blokk k ; k = k . s l u n t i l funnet e l l e r k == none } 18 / 43

Alternative representation: • arrangement different from 1 table with stack-organized external chaining • each block with one own hash table. 4 • standard hashing within each block • static links to link the block levels ⇒ “tree-of-hashtables” • AKA: sheaf-of-tables or chained symbol tables representation 4 One may say: one symbol table per block, because the form of organization can generally be done for symbol tables data structures (where hash tables is just one of possible implementing data structure). 19 / 43

Block-structured scoping with chained symbol tables • remember the interface • look-up: following the static link (as seen) 5 • Enter a block • create new (empty) symbol table • set static link from there to the “old” (= previously current) one • set the current block to the newly created one • at exit • move the current block one level up • note: no deletion of bindings, just made inaccessible 5 The notion of static links will be encountered later again when dealing with run-time environments (and for analogous purposes: identfying scopes in “block-stuctured” languages). 21 / 43

Lexical scoping & beyond • block-structured lexical scoping: central in programming languages (ever since ALGOL60 . . . ) • but: other scoping mechanism exists (and exist side-by-side) • example: C ++ • member functions declared inside a class • defined outside • still: method supposed to be able to access names defined in the scope of the class definition (i.e., other members, e.g. using this ) Java analogon C ++ class and member function c l a s s A { c l a s s A { . . . i n t f ( ) ; . . . // member f u n c t i o n i n t f () { . . . } ; } boolean b ; void h () { . . . } ; A : : f () {} // def . of f ‘ ‘ i n ’ ’ A } 22 / 43

Scope resolution in C ++ • class name introduces a name for the scope 6 (not only in C ++ ) • scope resolution operator :: • allows to explicitly refer to a “scope”’ • to implement • such flexibility, • also for remote access like a.f() • declarations must be kept separatly for each block (e.g. one hash table per class, record, etc., appropriately chained up) 6 Besides that, class names are subject to scoping themselves, of course. 23 / 43

Same-level declarations Same level • often forbidden (for instance typedef i n t i in C) i n t i ; • insert : requires check (= lookup ) first Sequential vs. “collaterals declarations i n t i = 1 ; i = 1 ; ; l e t void f ( void ) l e t i = 2 and y = i +1;; { i n t i = 2 , j = i +1, . . . p r i n t _ i n t ( y ) ; ; } 24 / 43

INF5110 Compiler Construction Symbol tables Spring 2016 1 / 43 - PowerPoint PPT Presentation

INF5110 Compiler Construction Symbol tables Spring 2016 1 / 43 Outline 1. Symbol tables Introduction Symbol table design an interface Implementing symbol tables Block-structure, scoping, binding, name-space organization Symbol tables

Compiler Construction Chapter 11 1 Compiler Construction Compiler Construction A New Compiler

INF5110 Compiler Construction Spring 2017 1 / 93 Outline 1. Grammars Introduction

INF5110 Compiler Construction Introduction Spring 2016 1 / 33 Outline 1. Introduction

INF5110 Compiler Construction Scanning Spring 2016 1 / 102 Outline 1. Scanning Intro

INF5110 Compiler Construction Spring 2016 1 / 98 Outline 1. Intermediate code generation

INF5110 Compiler Construction Run-time environments Spring 2016 1 / 92 Outline 1. Run-time

INF5110 Compiler Construction Spring 2017 1 / 97 Outline 1. Intermediate code generation

INF5110 Compiler Construction Spring 2017 1 / 95 Outline 1. Run-time environments Intro

INF5110 Compiler Construction Spring 2017 1 / 45 Outline 1. Symbol tables Introduction

INF5110 Compiler Construction Code generation Spring 2016 1 / 123 Outline 1. Code

INF5110 Compiler Construction Types and type checking Spring 2016 1 / 43 Outline 1. Types

INF5110 Compiler Construction Semantic analysis Spring 2016 1 / 60 Outline 1. Semantic

Compiler Construction Compiler Construction 1 / 111 Mayer Goldberg \ Ben-Gurion University

Compiler Construction November 21, 2018 Compiler Construction November 21, 2018 1 / 102 Mayer

Compiler Construction Compiler Construction 1 / 54 Mayer Goldberg \ Ben-Gurion University Tuesday

Compiler Construction Compiler Construction 1 / 193 Mayer Goldberg \ Ben-Gurion University Friday

IMPROVING THE ROOT INTERPRETER Javier Lpez-Gmez <jalopezg@inf.uc3m.es> A CS PhD. student

Lecture 8: Dictionaries and Hash Tables Instructor: Saravanan Thirumuruganathan CSE 5311

14. Hashing Hash Tables, Pre-Hashing, Hashing, Resolving Collisions using Chaining, Simple

Hash tables Most data structures that were going to see are about storing and manipulating data

Basic External Memory Data Structures Zorieh Soltani Yazd University Fall-1389 Zorieh Soltani

A look inside the Windows Kernel CVE-2011-1237 Evolution from XP to 8 Bruno Pujos

Anchors Page 53 Transform the World So When Do You Use Chains? Use Chaining Anchors when the

Linked List Implementation Data-structure-palooza Introduction to Markov Chaining Checkout

INF5110 Compiler Construction Symbol tables Spring 2016 1 / 43 - PowerPoint PPT Presentation

INF5110 Compiler Construction Symbol tables Spring 2016 1 / 43 Outline 1. Symbol tables Introduction Symbol table design an interface Implementing symbol tables Block-structure, scoping, binding, name-space organization Symbol tables

Compiler Construction Chapter 11 1 Compiler Construction Compiler Construction A New Compiler

INF5110 Compiler Construction Spring 2017 1 / 93 Outline 1. Grammars Introduction

INF5110 Compiler Construction Introduction Spring 2016 1 / 33 Outline 1. Introduction

INF5110 Compiler Construction Scanning Spring 2016 1 / 102 Outline 1. Scanning Intro

INF5110 Compiler Construction Spring 2016 1 / 98 Outline 1. Intermediate code generation

INF5110 Compiler Construction Run-time environments Spring 2016 1 / 92 Outline 1. Run-time

INF5110 Compiler Construction Spring 2017 1 / 97 Outline 1. Intermediate code generation

INF5110 Compiler Construction Spring 2017 1 / 95 Outline 1. Run-time environments Intro

INF5110 Compiler Construction Spring 2017 1 / 45 Outline 1. Symbol tables Introduction

INF5110 Compiler Construction Code generation Spring 2016 1 / 123 Outline 1. Code

INF5110 Compiler Construction Types and type checking Spring 2016 1 / 43 Outline 1. Types

INF5110 Compiler Construction Semantic analysis Spring 2016 1 / 60 Outline 1. Semantic

Compiler Construction Compiler Construction 1 / 111 Mayer Goldberg \ Ben-Gurion University

Compiler Construction November 21, 2018 Compiler Construction November 21, 2018 1 / 102 Mayer

Compiler Construction Compiler Construction 1 / 54 Mayer Goldberg \ Ben-Gurion University Tuesday

Compiler Construction Compiler Construction 1 / 193 Mayer Goldberg \ Ben-Gurion University Friday

IMPROVING THE ROOT INTERPRETER Javier Lpez-Gmez &lt;jalopezg@inf.uc3m.es&gt; A CS PhD. student

Lecture 8: Dictionaries and Hash Tables Instructor: Saravanan Thirumuruganathan CSE 5311

14. Hashing Hash Tables, Pre-Hashing, Hashing, Resolving Collisions using Chaining, Simple

Hash tables Most data structures that were going to see are about storing and manipulating data

Basic External Memory Data Structures Zorieh Soltani Yazd University Fall-1389 Zorieh Soltani

A look inside the Windows Kernel CVE-2011-1237 Evolution from XP to 8 Bruno Pujos

Anchors Page 53 Transform the World So When Do You Use Chains? Use Chaining Anchors when the

Linked List Implementation Data-structure-palooza Introduction to Markov Chaining Checkout

IMPROVING THE ROOT INTERPRETER Javier Lpez-Gmez <jalopezg@inf.uc3m.es> A CS PhD. student