Intermediate Code Generation
ALSU Textbook Chapter 6.1–6.4, 6.5.1–6.5.3, 6.6–6.8 Tsan-sheng Hsu
tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu
1
Intermediate Code Generation ALSU Textbook Chapter 6.16.4, - - PowerPoint PPT Presentation
Intermediate Code Generation ALSU Textbook Chapter 6.16.4, 6.5.16.5.3, 6.66.8 Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1 Intermediate code generation Compiler usually generate intermediate codes.
tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu
1
⊲ A sequence of statements of the general form: x := y op z, where “op” is an operator, x is the result, and y and z are operands. ⊲ Consists of at most 3 addresses for each statement. ⊲ A linearized representation of a binary syntax tree.
Compiler notes #6, 20070608, Tsan-sheng Hsu 2
⊲ x := &y ⊲ x := ∗y ⊲ ∗x := y
Compiler notes #6, 20070608, Tsan-sheng Hsu 3
⊲ Static data area: for global data. ⊲ Allocated when the program starts and remains to be so for the entire execution.
⊲ So called activation record (A.R.) when a procedure is invoked. ⊲ This area holds all data that are local to this procedure. ⊲ This area is active only when the associated procedure is called. ⊲ May have multiple copies when recursive calls are allowed.
Compiler notes #6, 20070608, Tsan-sheng Hsu 4
Compiler notes #6, 20070608, Tsan-sheng Hsu 5
⊲ create a new symbol table. ⊲ link it to the symbol table previous.
⊲ insert a new identifier name with type type and offset into table; ⊲ check for possible duplication.
⊲ increase the size of the symbol table table by width.
⊲ insert a procedure name into table; ⊲ the symbol table of name is newtable.
⊲ check whether name is declared in symbol table table, ⊲ return the entry if it is in table.
Compiler notes #6, 20070608, Tsan-sheng Hsu 6
Compiler notes #6, 20070608, Tsan-sheng Hsu 7
⊲ {top(offset) := 0;}
⊲ {enter(top(tblptr),id.name,T.type,top(offset)); ⊲ top(offset) := top(offset) + T.width; }
⊲ { T.type := integer; ⊲ T.width := 4; }
⊲ { T.type := double; ⊲ T.width := 8; }
⊲ { T.type := pointer(T1.type); ⊲ T.width := 4; }
Compiler notes #6, 20070608, Tsan-sheng Hsu 8
⊲ { pop(tblptr); ⊲ pop(offset); }
⊲ { t := mktable(top(tblptr)); ⊲ push(t,tblptr); ⊲ push(top(offset),offset);}
Compiler notes #6, 20070608, Tsan-sheng Hsu 9
⊲ { T.type := record(top(tblptr)); ⊲ T.width := top(offset); ⊲ pop(tblptr); ⊲ pop(offset); }
⊲ { t := mktable(null); ⊲ push(t,tblptr); ⊲ push(0,offset);}
Compiler notes #6, 20070608, Tsan-sheng Hsu 10
⊲ {t := top(tblptr); /∗ symbol table for this procedure ∗/ ⊲ addwidth(t,top(offset)); ⊲ generate code for de-allocating A.R.; ⊲ pop(tblptr); pop(offset); ⊲ enterproc(top(tblptr),id.name,t);}
⊲ { /∗ enter a new scope ∗/ ⊲ t := mktable(top(tblptr)); ⊲ push(t,tblptr); push(0,offset); }
⊲ {generate code for allocating A.R.; }
⊲ ǫ-productions easily trigger conflicts.
Compiler notes #6, 20070608, Tsan-sheng Hsu 11
⊲ {t := top(tblptr); /∗ symbol table for this procedure ∗/ ⊲ addwidth(t,top(offset)); ⊲ generate code for de-allocating A.R.; ⊲ pop(tblptr); pop(offset); ⊲ enterproc(top(tblptr),id.name,t);}
⊲ { /∗ enter a new scope ∗/ ⊲ t := mktable(top(tblptr)); ⊲ push(t,tblptr); push(0,offset); }
⊲ {generate code for allocating A.R.; }
Compiler notes #6, 20070608, Tsan-sheng Hsu 12
⊲ Use switch statement to actually print out the target code; ⊲ Can have different gen() for different target codes;
⊲ Local temp space. ⊲ Parameter. ⊲ Local variable. ⊲ Non-local variable. ⊲ Global variable. ⊲ Registers, constants, . . .
Compiler notes #6, 20070608, Tsan-sheng Hsu 13
⊲ Using a bit array to indicate the usage of temp space. ⊲ Usually use a circular array data structure.
Compiler notes #6, 20070608, Tsan-sheng Hsu 14
⊲ { p := lookup(id.name,top(tblptr)); ⊲ if p is not null then gen(p, “:=”,E.place); else error(“var undefined”,id.name); }
⊲ {E.place := newtemp(); ⊲ gen(E.place, “:=”,E1.place,”+”,E2.place); ⊲ freetemp(E1.place);freetemp(E2.place);}
⊲ {E.place := newtemp(); ⊲ gen(E.place, “:=”,“uminus”,E1.place); ⊲ freetemp(E1.place);}
⊲ {E.place := E1.place;}
⊲ {p := lookup(id.name,top(tblptr)); ⊲ if p = null then E.place := p.place else error(“var undefined”,id.name);}
Compiler notes #6, 20070608, Tsan-sheng Hsu 15
⊲ generate no conversion code ⊲ E.type = E1.type
⊲ E.type = float ⊲ temp1 = newtemp(); ⊲ if E1.type = integer then gen(temp1,“:=”, int-to-float,E1.place); gen(E,“:=”,temp1,“+”,E2.place); ⊲ else gen(temp1,“:=”, int-to-float,E2.place); gen(E,“:=”, temp1,“+”,E1.place); ⊲ freetemp(temp1);
Compiler notes #6, 20070608, Tsan-sheng Hsu 16
⊲ lower bound in address = low ⊲ element data width = w ⊲ starting address = start addr
⊲ = start addr + (i − low) ∗ w ⊲ = i ∗ w + (start addr − low ∗ w) ⊲ The value, called base, (start addr − low ∗ w) can be computed at compile time, and then stored at the symbol table.
Compiler notes #6, 20070608, Tsan-sheng Hsu 17
⊲ A[1, 1], A[1, 2], A[1, 3], A[2, 1], A[2, 2], . . . ⊲ A[i] means the ith row. ⊲ Advantage: A[i,j] = A[i][j].
⊲ A[1, 1], A[2, 1], A[1, 2], A[2, 2], A[1, 3], . . .
⊲ n2 is the number of elements in a row. ⊲ low1 is the lower bound of the first coordinate. ⊲ low2 is the lower bound of the second coordinate.
Compiler notes #6, 20070608, Tsan-sheng Hsu 18
i=2ni +i2 ∗Πk i=3ni +· · ·+ik)∗w +(start addr −low1 ∗w ∗Πk i=2ni −
i=3ni − · · · − lowk ∗ w)
⊲ ni is the number of elements in the ith coordinate. ⊲ lowi is the lower of the ith coordinate.
i=2ni+i2∗Πk i=3ni+· · ·+ik) can be computed incrementally
⊲ f(1) = i1; ⊲ f(j) = f(j − 1) ∗ nj + ij; ⊲ f(k) is the value we want;
i=2ni−low2∗w∗Πk i=3ni−
Compiler notes #6, 20070608, Tsan-sheng Hsu 19
⊲ type of variables ⊲ offset
Compiler notes #6, 20070608, Tsan-sheng Hsu 20
⊲ {if L.offset = null then /* L is a simple id */ ⊲ E.place := L.place; ⊲ else begin ⊲ E.place := newtemp(); ⊲ gen(E.place,”:=”,L.place,”[“,L.offset,”]”); ⊲ end}
⊲ {p := lookup(id.name,top(tblptr)); ⊲ if p = null then begin ⊲ L.place := p.place; ⊲ L.offset := null; ⊲ end ⊲ else error(“var undefined”,id.name);}
Compiler notes #6, 20070608, Tsan-sheng Hsu 21
⊲ {L.offset := newtemp(); ⊲ gen(L.offset,“:=”,Elist.elesize,“∗”,Elist.place); freetemp(Elist.place); ⊲ L.place := Elist.base;}
⊲ { t := newtemp(); m := Elist1.ndim + 1; ⊲ gen(t, “:=”,Elist1.place,“∗”,limit(Elist1.array,m)); ⊲ gen(t,“:=”,t,“+”,E.place); freetemp(E.place); ⊲ Elist.array := Elist1.array; Elist.place := t; Elist.ndim := m; }
⊲ {Elist.place := E.place; Elist.ndim := 1; ⊲ p := lookup(id.name,top(tblptr)); check for id errors; ⊲ Elist.elesize := p.size; Elist.base := p.base; ⊲ Elist.array := p.place;}
⊲ {p := lookup(id.name,top(tblptr)); check for id errors; ⊲ E.place := p.place;}
Compiler notes #6, 20070608, Tsan-sheng Hsu 22
⊲ 1: true; 0: false. ⊲ = 0: true; 0: false.
⊲ a1 is true, then a2 is not evaluated.
⊲ Example: (a > 1) and (p function(· · · ) > 100) ⊲ if the calling of p function() creates some side effects, then this side effect is not carried out in the case of (a > 1) being false.
Compiler notes #6, 20070608, Tsan-sheng Hsu 23
Compiler notes #6, 20070608, Tsan-sheng Hsu 24
Compiler notes #6, 20070608, Tsan-sheng Hsu 25
1
Compiler notes #6, 20070608, Tsan-sheng Hsu 26
Compiler notes #6, 20070608, Tsan-sheng Hsu 27
1
2
2
2
Compiler notes #6, 20070608, Tsan-sheng Hsu 28
⊲ goto label1; ... label1: goto label2; ⊲ goto label2; ... label1: goto label2;
Compiler notes #6, 20070608, Tsan-sheng Hsu 29
Compiler notes #6, 20070608, Tsan-sheng Hsu 30
1
Compiler notes #6, 20070608, Tsan-sheng Hsu 31
Compiler notes #6, 20070608, Tsan-sheng Hsu 32
code to evaluate E into t goto test L[1]: code for S[1] goto next ... L[k]: code for S[k] goto next L[d]: code for S[d] goto next test: if t = V[1] goto L[1] ... if t = V[k] goto L[k] goto L[d] next: ... Can easily be converted into a lookup table! code to evaluate E into t if t <> V[1] goto L[1] code for S[1] goto next L[1]: if t <> V[2] goto L[2] code for S[2] goto next ... L[k-1]: if t <> V[k] goto L[k] code for S[k] goto next L[k]: code for S[d] next:
Compiler notes #6, 20070608, Tsan-sheng Hsu 33
... V[1] V[2] V[3] L[1] L[2] L[3] L[1]: L[2]: S[1] S[2]
Compiler notes #6, 20070608, Tsan-sheng Hsu 34
Compiler notes #6, 20070608, Tsan-sheng Hsu 35
⊲ {for each item p on the queue Elist.queue do ⊲ gen(“PARAM”, q); ⊲ gen(“call”, id.place);}
⊲ {append E.place to the end of Elist.queue}
⊲ {initialize Elist.queue to contain only E.place}
⊲ code for E1, store in t1 ⊲ · · · ⊲ code for Ek, store in tk ⊲ PARAM t1 ⊲ · · · ⊲ PARAM tk ⊲ call p
Compiler notes #6, 20070608, Tsan-sheng Hsu 36
⊲ parameters, formal parameters, or formals.
⊲ arguments, actual parameters, or actuals.
⊲ right value ⊲ on the right side of assignment
⊲ left value ⊲ on the left side of assignment
Compiler notes #6, 20070608, Tsan-sheng Hsu 37
Compiler notes #6, 20070608, Tsan-sheng Hsu 38
⊲ If an argument has an address then that is what is passed. ⊲ If an argument is an expression that does not have an l-value (e.g., a + 6), then evaluate the argument and store the value in a temporary address and pass that address.
Compiler notes #6, 20070608, Tsan-sheng Hsu 39
Compiler notes #6, 20070608, Tsan-sheng Hsu 40
⊲ call-by-reference with an argument that can also be accessed by the called procedure directly, e.g., global variables. ⊲ call-by-reference with the same expression as an argument twice; e.g., test(x, y, x).
Compiler notes #6, 20070608, Tsan-sheng Hsu 41
Compiler notes #6, 20070608, Tsan-sheng Hsu 42
Compiler notes #6, 20070608, Tsan-sheng Hsu 43
Compiler notes #6, 20070608, Tsan-sheng Hsu 44
Compiler notes #6, 20070608, Tsan-sheng Hsu 45
Compiler notes #6, 20070608, Tsan-sheng Hsu 46
Compiler notes #6, 20070608, Tsan-sheng Hsu 47
Compiler notes #6, 20070608, Tsan-sheng Hsu 48