Type Inference

(1)

Arvind

Computer Science and Artificial Intelligence Laboratory M.I.T.

September 26, 2006 http://www.csg.csail.mit.edu/6.827

Type Inference

September 26, 2006

(2)

L06-2

Type Inference

• Type inference is typically presented in two different forms:

– Type inference rules: Rules define the type of each expression

• Clean and concise; needed to study the semantic properties, i.e., soundeness, of the type system – Type inference algorithm: Needed by the compiler

writer to deduce the type of each subexpression or to deduce that the expression is ill typed.

• Often it is nontrivial to derive an inference

algorithm for a given set of rules. There can be

many different algorithms for a set of typing rules.

(3)

L06-3

first ...

A Simple Type System

(no polymorphism)

(4)

L06-4

-calculus with Constants & Letrec

E ::= x | x.E | E E

| Cond (E, E, E)

| PF

_k

(E

₁

,...,E

_k

)

| CN

₀

| CN

_k

(E

₁

,...,E

_k

) | CN

_k

(SE

₁

,...,SE

_k

)

| let S in E

PF

₁

::= negate | not | ... | Prj

₁

| Prj

₂

| ...

PF

₂

::= + | ...

CN

₀

::= Number | Boolean CN

₂

::= cons | ...

Statements

S ::=  | x = E | S; S

not in initial terms

There are no types in the syntax of the language!

(5)

L06-5

A Simple Type System

Types

::=  base types

| t type variables

| 

_

-> 

₂

Function types

Type Environments

TE ::= Identifiers  Types

int, bool, ...

(6)

L06-6

Simple Type Inference Rules

Typing: TE ├ ^{e : }

(App) TE ├ ^e

1

:  -> 

^’

TE ├ ^e

2

: 

 TE ├ ^(e

1

e

₂

) : 

^’

(Abs)

TE ├ x.e : -> 

^’

(Var)

TE ├ ^{x : }

(Const)

TE ├ ^{c : }

(Let)

TE ├ (let x = e

₁

in e

₂

) : 

^’

TE + {x : } ├ ^{e : }

^’

(x : )TE

typeof(c) = 

TE+{x : } ├ ^e

1

: TE+{x : } ├ ^e

2

: 

^’

(7)

L06-7

Simple Type Inference Rules ^cont

(Cond)

TE ├ ^Cond(e

B

, e

₁

, e

₂

) :  (PF)

TE ├ ^(e

1

+ e

₂

) : int (CN)

TE ├ ^CN(e

1

, e

₂

) : CN(

_



_



(Pro)

TE ├ ^Prj

1

(e) : 

_

TE ├ ^e

B

: bool TE ├ ^e

1

: TE ├ ^e

2

: 

TE ├ ^e

1

: int TE ├ ^e

2

: int

TE ├ ^e

1

: 

_

TE ├ ^e

2

: 

_

TE ├ ^e ^{: CN(}





_

)

one such

rule for

each PF

one set

of such

rules for

each data

structure

type

(8)

L06-8

Simple Inference Algorithm

W(TE, e) returns (S,) such that S (TE) ├ e : 

The type environment TE records the most general type of each identifier while the substitution S

records the changes in the type variables

Def W(TE, e) = Case e of

x = ...

x.e = ...

(e

₁

e

₂

) = ...

let x = e

₁

in e

₂

= ...

(9)

L06-9

Simple Inference Algorithm ^(cont-1)

Def W(TE, e) = Case e of

c = ({}, Typeof(c))

x = if (x  Dom(TE))  then Fail else let = TE(x);

in ({},  )

x.e = let (S

₁

, 

₁

) = W(TE + { x : u }, e);

in (S

₁

, S

₁

(u) -> 

₁

) (e

₁

e

₂

) =

let x = e

₁

in e

₂

=

u’s

represent new type variables

let (S

₁

, 

₁

) = W(TE, e

₁

);

(S

₂

, 

₂

) = W(S

₁

(TE), e

₂

);

S

₃

= Unify(S

₂

(

₁

), 

₂

-> u);

in (S

₃

S

₂

S

₁

, S

₃

(u))

let (S

₁

, 

₁

) = W(TE + {x : u}, e

₁

);

S

₂

= Unify(S

₁

(u), 

₁

);

(S

₃

, 

₂

) = W(S

₂

S

₁

(TE) + {x : 

_

}, e

₂

);

in (S

₃

S

₂

S

₁

, 

₂

)

(10)

September 26, 2006 http://www.csg.csail.mit.edu/6.827 L06-10

Simple Inference Algorithm ^(cont-1)

Cond(e

_B

,e

₁

, e

₂

) =

(e

₁

+ e

₂

) =

CN(e

₁

, e

₂

) =

Prj

₁

(e) =

let (S

₁

, 

₁

) = W(TE, e

₁

);

(S

₂

, 

₂

) = W(S

₁

(TE), e

₂

);

S

₃

= Unify(

₁

, int); S

₄

= Unify(

₂

, int);

in (S

₄

S

₃

S

₂

S

₁

, int)

let (S

₁

, 

₁

) = W(TE, e

₁

);

(S

₂

, 

₂

) = W(S

₁

(TE), e

₂

);

in (S

₂

S

₁

, CN(

₁

, 

₂

)) let (S, ) = W(TE, e);

S

₁

= Unify(, CN(u1,u2)));

in (S S, S (u1))

These should probably be type

constants let (S

_B

, 

_B

) = W(TE, e

_B

);

(S

₁

, 

₁

) = W(S

_B

(TE), e

₁

);

(S

₂

, 

₂

) = W(S

₁

S

_B

(TE), e

₂

);

S

₃

= Unify(

_B

, bool); S

₄

= Unify(

₁

,

₂

);

in (S

₄

S

₃

S

₂

S

₁

S

_B

, S

₄

S

₃



₂

)

(11)

L06-11

Type Inference : Example

**let f = n.cond ((eq0 n), 1, n*(f (pred n)) in f**

W(  , A) =

W({f : u

₁

}, n.B) =

( [Int / u

₂

] , Bool ) W({f : u

₁

, n : u

₂

}, (eq0 n) ) =

W({f : u

₁

, n : u

₂

}, B) = ( [ ] , Int -> Int )

Unify(u

₂

, Int) =

A B

[ Int / u

₂

]

W({f : u

₁

, n : Int}, 1) = ( [ ] , Int) W({f : u

₁

, n : Int}, n*(f (pred n))) =

W({f : u

₁

, n : Int}, n) = ( [ ] , Int)

( [Int -> Int / u

₁

] , Int)

W({f : Int -> Int}, f ) = ( [ ] , Int -> Int)

([Int -> Int / u

₁

, Int / u

₂

] , Int) ([Int -> Int / u

₁

] , Int -> Int)

W({f : u

₁

, n : Int}, f (pred n)) = ( [Int -> u

₃

/ u

₁

] , u

₃

) Unify(u

₁

, Int -> u

₃

) = [ Int -> u

₃

/ u

₁

]

Unify(Int -> Int -> Int , Int -> u

₃

-> Int) = [ Int / u

₃

]

(12)

L06-12

Soundness

• The proposed type system is said to be sound if e :  then e indeed evaluates to a value in 

• A method of proving soundness:

– The semantics of the language is defined in terms of a value space that has integer

values, Boolean values etc. as subspaces.

– Any expression that may evaluate to a special value “wrong” will not type check.

– There is no type expression that denotes the subspace “wrong”.

Such proofs tend to be difficult ...

(13)

L06-13

Some observations

• A type system restricts the class of programs that are considered “legal”

• It is possible a term in the untyped –

calculus may not cause any errors but may not be typeable in a particular type system

let

id = x. x

in ... (id True) ... (id 1) ...

This term is not typeable in the simple type

system we have discussed so far.

(14)

L06-14

Polymorphic Types

Constraints:

let

id = x. x

in ... (id True) ... (id 1) ...

id :: t

₁

--> t

₁

id :: Int --> t

₂

id :: Bool --> t

₃

Does not unify!!

id :: t

₁

. t

₁

--> t

₁

Different uses of a generalized type variable may be instantiated differently

id

₂

: Bool --> Bool id

₁

: Int --> Int

Solution: Generalize the type variable

When can we

generalize?

(15)

L06-15

now...

The Hindley-Milner Type System

(16)

L06-16

A mini Language

to study Hindley-Milner Types

• There are no types in the syntax of the language!

• The type of each subexpression is derived by the Hindley-Milner type inference algorithm.

Expressions

E ::= c constant

| x variable

| x. E abstraction

| (E

₁

E

₂

) application

| let x = E

₁

in E

₂

let-block

(17)

L06-17

A Formal Type System

Note, all the ’s occur in the beginning of a type scheme, i.e., a type  cannot contain a type scheme 

Types

::=  base types

| t type variables

| 

_

-> 

₂

Function types

Type Schemes

::= 

|t. 

_

Type Environments

TE ::= Identifiers  Type Schemes

(18)

L06-18

Instantiations

• Type scheme can be instantiated into a type ^’ by substituting types for the bound variables of , i.e.,

^’= S  for some S s.t. Dom(S)  BV() - ^’ is said to be an instance of (>^’)

- ^’ is said to be a generic instance of when S maps variables to new variables.

=t

₁

...t

_n

. 

Example:

=t

₁

. t

₁

-> t

₂

t

₃

-> t

₂

is a generic instance of

Int -> t

₂

is a non generic instance of 

(19)

L06-19

Generalization aka Closing

• Generalization introduces polymorphism

• Quantify type variables that are free in  but not free in the type environment (TE)

• Captures the notion of new type variables of 

Gen(TE,) = t

₁

...t

_n

. 

where { t

₁

...t

_n

} = FV() - FV(TE)

(20)

L06-20

HM Type Inference Rules

Typing: TE ├ ^{e : }

(App) TE ├ ^e

1

:  -> 

^’

TE ├ ^e

2

: 

 TE ├ ^(e

1

e

₂

) : 

^’

(Abs)

TE ├ x.e : -> 

^’

(Var)

TE ├ ^{x : }

(Const)

TE ├ ^{c : }

(Let)

TE ├ (let x = e

₁

in e

₂

) : 

^’

TE + {x : } ├ ^{e : }

^’

(x : )TE   

typeof(c)  

TE+{x : } ├ ^e

1

: TE+{x:Gen(TE,)} ├ ^e

2

: 

^’

(21)

L06-21

HM Inference Algorithm

Def W(TE, e) = Case e of

c = ({}, Typeof(c)) x =

x.e =

(e

₁

e

₂

) =

let x = e

₁

in e

₂

=

u’s

represent new type variables

let (S

₁

, 

₁

) = W(TE, e

₁

);

(S

₂

, 

₂

) = W(S

₁

(TE), e

₂

);

S

₃

= Unify(S

₂

(

₁

), 

₂

-> u);

in (S

₃

S

₂

S

₁

, S

₃

(u))

if (x  Dom(TE))  then Fail else let t

₁

...t

_n

.= TE(x);

in ( { }, [u

_i

/ t

_i

] )

let (S

₁

, 

₁

) = W(TE + { x : u }, e);

in (S

₁

, S

₁

(u) -> 

₁

)

let (S

₁

, 

₁

) = W(TE + {x : u}, e

₁

);

S

₂

= Unify(S

₁

(u), 

₁

);

 = Gen(S

₂

S

₁

(TE), S

₂

(

₁

) );

(S

₃

, 

₂

) = W(S

₂

S

₁

(TE) + {x : }, e

₂

);

in (S

₃

S

₂

S

₁

, 

₂

)

(22)

L06-22

Hindley-Milner: Example

x. let f = y.x

in (f 1, f True)

W(  , A) =

W({x : u

₁

}, B) =

( [ ] , u

₁

) ( [ ] , u

₃

-> u

₁

)

u

₃

.u

₃

-> u

₁

TE = {x : u

₁

, f : u

₃

.u

₃

-> u

₁

}

( [ ] , u

₄

-> u

₁

) W(TE, 1 ) = ( [ ] , Int )

[ Int / u

₄

, u

₁

/ u

₅

] ( [ ] , u

₁

)

( [ ] , (u

₁

,u

₁

) )

Unify(u

₄

-> u

₁

, Int -> u

₅

) = W({x : u

₁

, f : u

₂

, y : u

₃

}, x ) = W({x : u

₁

, f : u

₂

}, y.x ) =

Gen({x : u

₁

}, u

₃

-> u

₁

) = W(TE, (f 1) ) =

( [ ] , u

₁

-> (u

₁

,u

₁

) )

W(TE, f ) =

Unify(u

₂

, u

₃

-> u

₁

) =

B A

[ (u

₃

-> u

₁

) / u

₂

]

...

(23)

L06-23

Important Observations

• Do not generalize over type variables used elsewhere

• Let is the only way of defining polymorphic constructs

• Generalize the types of let-bound identifiers

only after processing their definitions

(24)

L06-24

Properties of HM Type Inference

• It is sound with respect to the type system.

An inferred type is verifiable.

• It generates most general types of expressions.

Any verifiable type is inferred.

• Complexity

PSPACE-Hard

DEXPTIME-Complete

Nested let blocks

(25)

L06-25

Extensions

• Type Declarations

Sanity check; can relax restrictions

• Incremental Type checking

The whole program is not given at the same time, sound inferencing when types of some functions are not known

• Typing references to mutable objects

Hindley-Milner system is unsound for a language with refs (mutable locations)

• Overloading Resolution

(26)

L06-26

HM Limitations:

-bound vs Let-bound Variables

Only let-bound identifiers can be instantiated differently.

let

twice f x = f (f x) in twice twice succ 4

vs.

let

twice f x = f (f x) foo g = (g g succ) 4 in foo twice

foo is not type correct !

Generic vs. Non-generic type variables

(27)

L06-27

Puzzle: Another set of Inference rules

(Gen) TE ├ e : t  FV(TE) TE ├ e : t.

Type Inference

Arvind

Computer Science and Artificial Intelligence Laboratory M.I.T.

Type Inference

September 26, 2006

Type Inference

• Type inference is typically presented in two different forms:

• Often it is nontrivial to derive an inference

algorithm for a given set of rules. There can be

many different algorithms for a set of typing rules.

first ...

A Simple Type System

(no polymorphism)

-calculus with Constants & Letrec

E ::= x | x.E | E E

| Cond (E, E, E)

| PF

(E

,...,E

)

| CN

| CN

(E

,...,E

) | CN

(SE

,...,SE

)

| let S in E

PF

::= negate | not | ... | Prj

| Prj

| ...

PF

::= + | ...

CN

::= Number | Boolean CN

::= cons | ...

Statements

S ::=  | x = E | S; S

not in initial terms

There are no types in the syntax of the language!

A Simple Type System

Types

::=  base types

| t type variables

| 

-> 

Function types

Type Environments

TE ::= Identifiers  Types

int, bool, ...

Simple Type Inference Rules

Typing: TE ├ e : 

(App) TE ├ e

:  -> 

TE ├ e

: 

 TE ├ (e

e

) : 

(Abs)

TE ├ x.e : -> 

(Var)

TE ├ x : 

(Const)

TE ├ c : 

(Let)

TE ├ (let x = e

in e

) : 

TE + {x : } ├ e : 

(x : )TE

typeof(c) = 

TE+{x : } ├ e

: TE+{x : } ├ e

: 

Simple Type Inference Rules cont

(Cond)

TE ├ Cond(e

Typing: TE ├ ^{e : }

(App) TE ├ ^e

TE ├ ^e

 TE ├ ^(e

TE ├ ^{x : }

TE ├ ^{c : }

TE + {x : } ├ ^{e : }

TE+{x : } ├ ^e

: TE+{x : } ├ ^e

Simple Type Inference Rules ^cont

TE ├ ^Cond(e

TE ├ ^(e

TE ├ ^CN(e

TE ├ ^Prj

TE ├ ^e

: bool TE ├ ^e

: TE ├ ^e

TE ├ ^e

: int TE ├ ^e

TE ├ ^e

TE ├ ^e

TE ├ ^e ^{: CN(}

Simple Inference Algorithm ^(cont-1)