More on stack architecture - Understanding Programming Languages

stack overflow. A program might completely ignore the possibility and accept the fact that in extreme circumstances the program will crash. Another possibility is to check the stack size before each procedure call, but this might be too inefficient. A compromise solution would be to check the stack size periodically and take some action if it fell below some threshold, say 1000 words.

7.7 More on stack architecture

Accessing variables at intermediate levels

We have discussed how local variables are efficiently accessed at fixed offsets from the bottom pointer of an activation record. Global data, that is data declared in the main program, can also be accessed efficiently. The easiest way to see this is to imagine that global data is considered “local” to the main procedure; thus memory for the global data is allocated at the main procedure entry, that is, at the beginning of the program. Since this location is known at compile-time, or more exactly when the program is linked, the actual address of each element is known either directly or as an offset from a fixed location. In practice, global data is usually allocated separately (see Section 8.5), but in any case the addresses are fixed.

Variables at intermediate levels of nesting are more difficult to access:

Ada procedure Main is G: Integer; procedure Proc 1 is L1: Integer; procedure Proc 2 is L2: Integer;

begin L2 := L1 + G; end Proc 2; procedure Proc 3 is

L3: Integer;

begin L3 := L1 + G; Proc 2; end Proc 3; begin - - Proc 1 Proc 3; end Proc 1; begin - - Main Proc 1; end Main;

7.7. MORE ON STACK ARCHITECTURE 131 but how can L1 be accessed in Proc 3? The answer is that the value of the bottom pointer is stored at procedure entry and is used as a pointer to the activation record of the enclosing procedure Proc 1. The bottom pointer is stored at a known location and can be immediately loaded, so the overhead is an additional indirection.

If deeper nesting is used, each activation record contains a pointer to the previous one. These pointers to activation records form the dynamic chain (Figure 7.9). To access a shallow variable (one that is less deeply nested), instructions have to be executed to “climb” the dynamic chain. This potential inefficiency is the reason that accessing intermediate variables in deeply nested procedures is discouraged. Accessing the immediately previous level requires just one indirection and an occasional deep access should not cause any trouble, but a loop statement should not contain statements that reach far back in the chain.

Dynamic Chain Main Proc 1 Proc 3 . . .

? ?

Figure 7.9: Dynamic chain

Calling shallow procedures

Accessing intermediate variables is actually more complicated than the discussion above would indicate because a procedure is allowed to call other procedures that are at the same level of nesting or lower. In the example, Proc 3 calls Proc 2. The activation record for Proc 2 will store the bottom pointer of Proc 3 so that it can be restored, but the variables of Proc 3 are not accessible in Proc 2 by the rules of scope.

Somehow the program must be able to identify the static chain, the link of activation records that defines the static context of the procedure according to the rules of scope, as opposed to the dynamic chain of procedure calls at execution time. As an extreme example, consider a recursive procedure: there may be dozens of activation records in the dynamic chain (one for each recursive call), but the static chain will consist only of the current record and the record for the main procedure.

One solution is to store the static level of nesting of each procedure in the activation record, because the compiler knows what level is needed for each access. In the example, if the main program is level 0, Proc 2 and Proc 3 are both at level 2. When searching up the dynamic chain, the level of nesting must decrease by one to be considered part of the static chain, thus the record for Proc 3 is skipped over and the next record, the one for Proc 1 at level 1, is used to obtain a bottom index.

Another solution is to explicitly include the static chain on the stack. Figure 7.10 shows the static chain just after Proc 3 calls Proc 2. Before the call, the static chain is the same as the dynamic chain while after the call, the static chain is shorter and contains just the main procedure and Proc 2.

7.8. * IMPLEMENTATION ON THE 8086 132

Static Chain Dynamic Chain Main Proc 1 Proc 3 Proc 2 . . .

? ? ?

6 6

Display

6 6

Figure 7.10: Variables at intermediate levels

The advantage of using an explicit static chain is that a static chain is often shorter than a dynamic chain (again think of a recursive procedure as an extreme case). However, we still have to do the search for each access of an intermediate variable. A solution that can be more efficient is to use a display which is an array that holds the current static chain, indexed by the nesting level (Figure 7.10). Thus to access a variable at an intermediate level, the nesting level is used as an index to obtain a pointer to the correct activation record, then the bottom pointer is obtained from the record and finally the offset is added to obtain the variable address. The disadvantage of a display is that additional overhead is needed to update the display at procedure entry and exit. The potential inefficiencies of accessing intermediate variables should not deter the use of nested procedures, but programmers should carefully consider such factors as the depth of nesting and the trade-offs between using parameters as opposed to direct access to variables.

7.8 * Implementation on the 8086

To give a more concrete idea of how a stack architecture is implemented, we describe the actual machine code for procedure entry and exit on the Intel 8086 series of processors. The example program is:

Ada procedure Main is

Global: Integer;

procedure Proc(Parm: in Integer) is Local1, Local2: Integer;

begin

Local2 := Global + Parm + Local1; end Proc;

begin

Proc(15); end Main;

7.8. * IMPLEMENTATION ON THE 8086 133 The 8086 has built-in push and pop instructions which assume that the stack grows from higher to lower addresses. Two registers are dedicated to stack operations: the sp register which points to the “top” element in the stack, and the bp register which is the bottom pointer that identifies the location of the start of the activation record.

To call the procedure the parameter is pushed onto the stack and the call instruction executed:11 mov ax,#15 Load value of parameter

push ax Store parameter on stack call Proc Call the procedure

Figure 7.11 shows the stack after executing these instructions—the parameter and the return address have been pushed onto the stack.

194 196 198 200 202 204 206 208 210 RA 15 ? sp ? bp

Figure 7.11: Stack before procedure entry

The next instructions are part of the code of the procedure and are executed at procedure entry; they store the old bottom pointer (the dynamic link), set up the new bottom pointer and allocate memory for the local variable by decrementing the stack pointer:

push bp Save the old dynamic pointer mov bp,sp Set the new dynamic pointer sub sp,#4 Allocate the local variables The stack now appears as shown in Figure 7.12.

194 196 198 200 202 204 206 208 210 RA 15 L 1 L 2 210 ? sp ? bp ? old bp

Figure 7.12: Stack after procedure entry Now the body of the procedure can be executed:

7.9. EXERCISES 134 mov ax,ds:[38] Load the variable Global

add ax,[bp+06] Add the parameter Parm add ax,[bp-02] Add the variable Local1 mov ax,[bp] Store in the variable Local2

Global variables are accessed as offsets from a special area of memory pointed to by the ds (data segment) register. The parameter Parm which is “lower” down in the stack than the start of the ac- tivation record is accessed at a positive offset from bp. The local variables which are “higher” up in the stack are accessed at a negative offset from bp. What is important to note is that since the 8086 processor has registers and addressing modes designed for common stack-oriented computations, all these variables can be accessed in a single instruction.

Procedure exit must reverse the effects of the procedure call: mov sp,bp Release all local variables pop bp Restore the old dynamic pointer ret 2 Return and release parameters

The stack pointer is reset to the value of the bottom pointer thus effectively releasing memory allocated for the local variables. Then the old dynamic pointer is popped from the stack so that bp now points at the previous activation record. The only remaining tasks are to return from the procedure using the return address, and to release the memory allocated for the parameters. The ret instruction performs both of these tasks; the operand of the instruction indicates how many bytes of parameter memory must be popped from the stack. To summarize: procedure exit and entry require just three short instructions each, and access to local and global variables and to parameters is efficient.

7.9 Exercises

1. Does your Ada compiler use value or reference semantics to pass arrays and records? 2. Show how last-call optimization is implemented. Can last-call optimization be done on the

factorial function?

3. McCarthy’s function is defined by the following recursive function:

Ada function M(I: Integer) return Integer is

begin

if I > 100 then return I-10; else return M(M(I+11)); end M;

(a) Write a program for McCarthy’s function and compute M(I) for 80 ≤ I ≤ 110. (b) Simulate by hand the computation for M(91) showing the growth of the stack. (c) Write an iterative program for McCarthy’s function.

7.9. EXERCISES 135

Ada function A(M, N: Natural) return Natural is

begin

if M = 0 then return N + 1;

elsif N = 0 then return A(M – 1, 1); else return A(M – 1, A(M, N – 1)); end A;

(a) Write a program for Ackermann’s function and check that A(0,0)=1, A(1,1)=3, A(2,2)=7, A(3,3)=61.

(b) Simulate by hand the computation for A(2,2)=7 showing the growth of the stack. (c) Try to compute A(4,4) and describe what happens. Try the computation using several

compilers. Make sure you save your files before doing this! (d) Write a non-recursive program for Ackermann’s function.12

5. How are variables of intermediate scope accessed on an 8086?

6. There is a parameter passing mechanism called call-by-name in which each access of a formal parameter causes the actual parameter to be re-evaluated. This mechanism was first used in Algol but does not exist in most ordinary programming languages. What was the motivation for call-by-name in Algol and how was it implemented?

12_{Solutions can be found in: Z. Manna. Mathematical Theory of Computation. McGraw-Hill, 1974, p. 201 and p.}

Part III

Advanced

Concepts

8 Pointers

8.1 Pointer types

A variable is nothing more than a convenient notation for referring to the address of a memory location. A variable name is static and determined at compile-time: distinct names refer to distinct locations and there is no way to “compute” a name except in limited contexts such as array indexing. A value of pointer type is an address; a pointer variable contains the address of another variable or constant. The object pointed to is called the designated object. Pointers are used for computing with the address of a memory location rather than with the contents of the location. The following example:

C int i = 4;

int *ptr = &i;

produces the result shown in Figure 8.1. ptr is itself a variable with its own memory location (284), but its contents are the address (320) of another variable i.

320 4

284 320

ptr i

Figure 8.1: Pointer variable and designated variable

The syntax of the declaration is rather confusing because the asterisk “*” refers to the type int and not to the variable ptr. The correct way of reading the declaration is: “ptr is of type pointer to int”. The unary operator “&” returns the address of its following operand.1

The value of the variable i can of course be accessed by using its name in an expression such as i+1, but it can also be accessed by dereferencing the pointer, using the syntax *ptr. When you dereference a pointer you are asking not for the contents of the pointer variable ptr, but the contents of the memory location pointed to by the address in ptr, that is for the designated object.

1_{Do not confuse this operator with the use of “&” to indicate a reference parameter in C++.}

8.1. POINTER TYPES 138

Typed pointers

The addresses in the example are written like integers, but an address is not an integer. The form of an address will depend on the computer architecture. For example, the Intel 8086 computer uses two 16-bit words which are combined to form a 20-bit address. Nevertheless, it is a reasonable assumption that all pointers are represented the same way.

In programming, it is more useful and safer to use typed pointers that are declared to point to a specific type, such as int in the example above. The designated object *ptr is required to be an integer, and after dereferencing it can be used in any context in which an integer is required such as:

C int a[10];

a[*ptr] = a[(*ptr)+5]; /* Dereference and index */ a[i] = 2 * *ptr; /* Dereference and multiply */

It is important to distinguish between a pointer variable and the designated object, and to be very careful when assigning or comparing pointers:

C int i1 = 10;

int i2 = 20;

int *ptr1 = &i1; /* ptr1 points to i1 */ int *ptr2 = &i2; /* ptr2 points to i2 */

*ptr1 = *ptr2; /* Both variables have same value */ if (ptr1 == ptr2) . . . /* False, different pointers */

if (*ptr1 == *ptr2) . . . /* True, designated objects equal */ ptr1 = ptr2; /* Both point to i2 */

Figure 8.2(a) shows the variables after the first assignment statement: dereferencing is used so designated objects are assigned and i1 receives the value 20. After executing the second assignment statement which assigns pointers, not designated objects, the variable i1 is no longer accessible through a pointer as shown in Figure 8.2(b).

20 20 ptr1 i1 ptr2 i2 - - (a) 20 20 ptr1 i1 ptr2 i2 - ¡¡ ¡¡¡ µ (b) Figure 8.2: Pointer assignment

8.1. POINTER TYPES 139 It is also important to be aware of the difference between a constant pointer and a pointer to a constant designated object. Making a pointer constant does not protect the designated object from modification:

C int i1, i2;

int * const p1 = &i1; /* Constant pointer */ const int * p2 = &i1; /* Pointer to constant */

const int * const p3 = &i1;/* Constant pointer to constant */ p1 = &i2; /* Error, pointer constant */

*p1 = 5; /* OK, designated obj. not constant */ p2 = &i2; /* OK, pointer not constant */

*p2 = 5; /* Error, designated obj. constant */ p3 = &i2; /* Error, pointer constant */

*p3 = 5; /* Error, designated obj. constant */

In C, a pointer to void is an untyped pointer. Any pointer can be implicitly converted to a pointer to void and conversely, though mixing typed pointer assignments will usually get you a warning message. Fortunately, C++ _{is much more careful about type checking. Typed pointers can be}

implicitly converted to void pointers but not conversely:

C void *void ptr; /* Untyped pointer */

int *int ptr; /* Typed pointer */ char *char ptr; /* Typed pointer */ void ptr = int ptr; /* OK */

char ptr = void ptr; /* OK in C, error in C++ */ char ptr = int ptr; /* Warning in C, error in C++ */

Since no type checking in done in C, an arbitrary expression may be assigned to a pointer. There is no guarantee that the designated object is of the expected type; in fact, the value of a pointer might not even be an address assigned to the program. At best, this will cause the program to crash because of an addressing fault and you will get an appropriate message from the operating system. At worst, this can cause the operating system data to become corrupted. Mistakes with pointers are very difficult to debug because the absolute addresses that the debugger displays are hard to understand. The solution is stronger type checking on pointers as in Ada and C++_.

Syntax

We depart from our usual custom of glossing over syntax because the syntax for dereferencing can be confusing and must be clearly understood. Dereferencing of pointers, indexing of arrays and selection within records are the means by which data within a data structure are accessed. Pascal has the clearest syntax because each of these three operations is denoted by a separate symbol which is written after the variable. In this example, Ptr is declared as a pointer to an array of records with an integer field:

8.1. POINTER TYPES 140

Pascal type Rec Type =

record

Field: Integer; end;

type Array Type = array[1..100] of Rec Type; type Ptr Type = ↑Array Type;

Ptr: Ptr Type;

Then using the symbol (↑) which denotes dereferencing, each addition of a symbol goes one step deeper in the decomposition of the data structure:

Ptr (* Pointer to array of records with integer field *) Ptr↑ (* Array of records with integer field *)

Ptr↑[78] (* Record with integer field *) Ptr↑[78].Field (* Integer *)

In C, the dereferencing symbol (*) is a prefix operator so that the above example would be written: C typedef struct {

int field;

} Rec Type;

typedef Rec Type Array Type[100]; Array Type *ptr;

ptr /* Pointer to array of records with integer field */ *ptr /* Array of records with integer field */

(*ptr)[78] /* Record with integer field */ (*ptr)[78].field /* Integer */

where the parentheses are needed because array indexing has higher precedence than pointer dereferencing. In a complicated data structure it can be confusing to decipher a decomposition that uses dereferencing as a prefix, and indexing and selection as a suffix. Fortunately, the most common sequence of operations, dereferencing followed by selection, has a special elegant syntax. If ptr points to a record, then ptr− >field is a shorthand for (*ptr).field.

The Ada syntax is based on the assumption that dereferencing is almost always followed by selection, so there is no need for a separate notation for dereferencing. You cannot tell if R.Field is simply the selection of a field of an ordinary record called R, or if R is a pointer to a record which is dereferenced before selection. While this can be confusing, it does have the advantage that a data structure can be changed from using records to using pointers to records, without otherwise

8.1. POINTER TYPES 141 modifying the program. On those occasions when pure dereferencing is needed, a rather awkward syntax is used, as shown by repeating the above example in Ada:

Ada type Rec Type is

record

Field: Integer; end record;

type Array Type is array(1..100) of Rec Type; type Ptr Type is access Array Type;

Ptr: Ptr Type;

Ptr - - Pointer to array of records with integer field Ptr.all - - Array of records with integer field

Ptr.all[78] - - Record with integer field Ptr.all[78].Field - - Integer

Note that Ada uses a keyword access, rather than a symbol, to denote pointers. The keyword all is used on those few occasions when deferencing without selection is required.

Implementation

The indirect access to data via pointers requires an additional instruction in the machine code. Let us compare a direct assignment statement with an indirect assignment; for example:

C int i,j; int *p = &i; int *q = &j; i = j; /* Direct assignment */ *p = *q; /* Indirect assignment */ The machine instructions for the direct assignment are:

load R1,j store R1,i

while the instructions for the indirect assignment are:

load R1,&q Addr(designated object) load R2,(R1) Load designated object load R3,&p Addr(designated object) store R2,(R3) Store to designated object

8.1. POINTER TYPES 142 Some overhead is inevitable in indirection, but is usually not serious, because if a designated object is accessed repeatedly, the optimizer can ensure that the pointer is loaded only once. Given:

p− >right = p− >left;

once the address of p is loaded into a register, subsequent accesses can use the register: load R1,&p Addr(designated object)

load R2,left(R1) Offset from start of record store R2,right(R1) Offset from start of record

A potential source of inefficiency in indirect access to data via pointers is the size of the pointers

In document Understanding Programming Languages (Page 143-156)