Fibrational induction rules for initial algebras

(1)

Ghani, Neil and Johann, Patricia and Fumex, Clement (2010) Fibrational

induction rules for initial algebras. In: Computer Science Logic. Lecture

Notes In Computer Science, 6247 . Springer, pp. 336-350. ISBN

978-3-642-15204-7 , http://dx.doi.org/10.1007/978-3-642-15205-4_27

This version is available at https://strathprints.strath.ac.uk/33723/

Strathprints is designed to allow users to access the research output of the University of Strathclyde. Unless otherwise explicitly stated on the manuscript, Copyright © and Moral Rights for the papers on this site are retained by the individual authors and/or other copyright owners. Please check the manuscript for details of any other licences that may have been applied. You may not engage in further distribution of the material for any profitmaking activities or any commercial gain. You may freely distribute both the url (https://strathprints.strath.ac.uk/) and the content of this paper for research or private study, educational, or not-for-profit purposes without prior permission or charge.

Any correspondence concerning this service should be sent to the Strathprints administrator: [email protected]

The Strathprints institutional repository (https://strathprints.strath.ac.uk) is a digital archive of University of Strathclyde research outputs. It has been developed to disseminate open access research outputs, expose data about those outputs, and enable the

(2)

Fibrational Induction Rules for Initial Algebras

�

Neil Ghani, Patricia Johann, and Cl´ement Fumex

University of Strathclyde, Glasgow G1 1XH, UK

{neil.ghani, patricia.johann, clement.fumex}@cis.strath.ac.uk

Abstract. _{This paper provides an induction rule that can be used to}

prove properties of data structures whose types are inductive, i.e., are carriers of initial algebras of functors. Our results are semantic in nature and are inspired by Hermida and Jacobs’ elegant algebraic formulation of induction for polynomial data types. Our contribution is to derive, under slightly different assumptions, an induction rule that is generic overall inductive types, polynomial or not. Our induction rule is generic over the kinds of properties to be proved as well: like Hermida and Jacobs, we work in a general fibrational setting and so can accommodate very general notions of properties on inductive types rather than just those of particular syntactic forms. We establish the correctness of our generic induction rule by reducing induction to iteration. We show how our rule can be instantiated to give induction rules for the data types of rose trees, finite hereditary sets, and hyperfunctions. The former lies outside the scope of Hermida and Jacobs’ work because it is not polynomial; as far as we are aware, no induction rules have been known to exist for the latter two in a general fibrational framework. Our instantiation for hyperfunctions underscores the value of working in the general fibrational setting since this data type cannot be interpreted as a set.

1 Introduction

Iteration operators provide a uniform way to express common and naturally occurring patterns of recursion over inductive data types. Expressing recursion via iteration operators makes code easier to read, write, and understand; facili-tates code reuse; guarantees properties of programs such as totality and termi-nation; and supports optimising program transformations such as fold _fusion and short cut fusion. Categorically, iteration operators arise from initial algebra semantics of data types, in which each data type is regarded as the carrier of the initial algebra of a functorF. Lambek’s Lemma ensures that this carrier is the least fixed point µF of F, and initiality ensures that, given anyF-algebra

h:F A→A, there is a uniqueF-algebra homomorphism, denotedf old h, from the initial algebra in : F(µF) →µF to that algebra. For each functor F, the mapfold: (F A→A)→µF→Ais the iteration operator for the data typeµF. Initial algebra semantics thus provides a well-developed theory of iteration which is i) principled, and so helps ensure that programs have rigorous mathematical foundations that can be used to ascertain their meaning and correctness; ii) ex-pressive, and so is applicable to all inductive types, rather than just syntactically defined classes of data types such as polynomial data types; and iii)correct, and so is valid in any model — set-theoretic, domain-theoretic, realisability, etc. — in which data types are interpreted as carriers of initial algebras.

(3)

Because induction and iteration are closely linked we may reasonably expect that initial algebra semantics can be used to derive a principled, expressive, and correct theory of induction for data types we well. In most treatments of induction, given a functorF together with a propertyP to be proved about data of type µF, the premises of the induction rule forµF constitute an F-algebra with carrier Σx : µF. P x. The conclusion of the rule is obtained by supplying such an F-algebra as input to the fold forµF. This yields a function fromµF

toΣx:µF. P xfrom which function of type ∀x:µF. P xcan be obtained. It has not, however, been possible to characteriseF-algebras with carrierΣx:µF. P x

without additional assumptions onF. Induction rules are thus typically derived under the assumption that the functors involved have a certain structure, e.g., that they are polynomial. Moreover, taking the carriers of the algebras to be

Σ-types assumes that properties are represented as type-valued functions. So while induction rules derived as described are both principled and correct, their expressiveness is limited along two dimensions: with respect to the data types for which they can be derived and the nature of the properties they can verify.

One principled and correct approach to induction is given by Hermida and Jacobs [6]. They lift each functor F on a base category of types to a functor

ˆ

F on a category of properties over those types, and take the premises of the induction rule for the typeµF to be an ˆF-algebra. Hermida and Jacobs work in a fibrational setting and the notion of property they consider is, accordingly, very general. Indeed, they accommodate any notion of property that can be fibred over the category of types, and so overcome one of the two limitations mentioned above. On the other hand, their approach is only applicable to polynomial data types, so the limitation on the class of data types treated remains in their work. This paper shows how to remove the restriction on the class of data types treated.Our main result is a derivation of a generic induction rule that can be instantiated to every inductive type— i.e., toeverytype which is the carrier of the initial algebra of a functor — regardless of whether it is polynomial. We take Hermida and Jacobs’ approach as our point of departure and show that, under slightly different assumptions on the fibration involved, we can liftany functor on its base category that has an initial algebra to a functor on its properties. This is clearly an important theoretical result, but it also has practical consequences:

– We show in Example 2 how our generic induction rule can be instantiated to the codomain fibration to derive the rule for rose trees that one would intuitively expect. The data type of rose trees lies outside the scope of Her-mida and Jacobs’ results because it is not polynomial. On the other hand, an induction rule for rose trees is available in the proof assistant Coq, although it is neither the one we intuitively expect nor expressive enough to prove properties that ought to be amenable to inductive proof. The rule we derive for rose trees is indeed the expected one, which suggests that our deriva-tion may enable automatic generaderiva-tion of more useful inducderiva-tion rules in Coq, rather than requiring the user to hand code them as is currently necessary.

(4)

finite hereditary sets. This data type is defined in terms of quotients and so lies outside current theories of induction.

– Finally, we show in Example 4 how our generic induction rule can be instan-tiated to the subobject fibration overωcpo to derive a rule for the data type of hyperfunctions. Because this data type cannot be interpreted as a set, a fibration other than the codomain fibration overSet_{is required; in this case,} use of the subobject fibration allows us to derive an induction rule for admis-sible subsets of hyperfunctions. Moreover, the functor underlying the data type of hyperfunctions is not strictly positive, and this fact again underscores the advantage of being able to handle a very general class of functors. As far as we know, induction rules for finite hereditary sets and hyperfunctions have not previously existed in the general fibrational framework.

Although our theory of induction is applicable to all functors having initial algebras, including higher-order ones, our examples show that working in the general fibrational setting is beneficial even if attention is restricted to first-order functors. Note also that our induction rules coincide with those of Hermida and Jacobs when specialised to polynomial functors in the codomain fibration. But the structure we require of fibrations generally is slightly different from that required by Hermida and Jacobs, so while our theory is in essence a generalisation of theirs, the two are, strictly speaking, incomparable. The structure we require is, however, still minimal and certainly present in all standard fibrational models of type theory (see Section 4). Like Hermida and Jacobs, we prove our generic induction rule correct by reducing induction to iteration.

We take a purely categorical approach to induction in this paper, and derive our generic induction rule from only the initial algebra semantics of data types. As a result, our work is inherently extensional. While translating our construc-tions into intensional settings may therefore require additional effort, we expect the guidance offered by the categorical viewpoint to support the derivation of induction rules for functors that are not treatable at present. Since we do not use any form of impredicativity in our constructions, and instead use only the weaker assumption that initial algebras exist, this guidance will be widely applicable.

The remainder of this paper is structured as follows. To make our results as accessible as possible, we illustrate them in Section 2 with a categorical derivation of the familiar induction rule for the natural numbers. In Section 3 we derive an induction rule for the special case of the codomain fibration over Set_{, i.e.,} for functors on the category of sets and properties representable as set-valued predicates. We also show how this rule can be instantiated to derive the one from Section 2, and the ones for rose trees and finite hereditary sets mentioned above. Then, in Section 4 we sketch the general fibrational form of our derivation (space constraints prevent a full treatment) and illustrate the resulting generic induction rule with the aforementioned application to hyperfunctions. Section 5 concludes and offers some additional directions for future research.

(5)

2 A Familiar Induction Rule

Consider the inductive data type Nat, which defines the natural numbers and can be specified in a programming language with Haskell-like syntax by

data Nat = Zero | Succ Nat

The observation thatNatis the least fixed point of the functorN onSet_defined byN X = 1 + X can be used to define the following iteration operator for it:

foldNat = X →(X →X)→Nat→X

foldNatz sZero = z

foldNatz s(Succn) = s(foldNatz s n)

Categorically, iteration operators such asfoldNat arise from the initial alge-bra semantics of data types. IfBis a category andF is a functor onB, then an

F-algebrais a morphismh:F X →Xfor some objectX ofB. We callXthe car-rierofh. For any functorF, the collection ofF-algebras itself forms a category AlgF which we call thecategory of F-algebras. InAlgF, anF-algebra morphism

between F-algebras h: F X →X andg : F Y →Y is a map f :X →Y such that f ◦h=g◦F f. When it exists, the initial F-algebra in : F(µF)→µF is unique up to isomorphism and has the least fixed point µF ofF as its carrier. Initiality ensures that there is a uniqueF-algebra morphismfold h:µF →X

from in to any other F-algebra h: F X →X. This gives rise to the following iteration operatorfold forF or, equivalently, for the inductive typeµF:

fold : (F X→X)→µF →X

foldh(int) = h(F(foldh)t)

Since fold is derived from initial algebra semantics it is principled and correct. It is also expressive, since it can be defined foreveryinductive type. In fact,fold is a single iteration operator parameterised over inductive types rather than a family of iteration operators, one for each such type, and the iteration operator foldNat above is the instantiation toNat of the generic iteration operatorfold. The iteration operatorfoldNat can be used to derive the standard induction rule forNat. This rule says that if a propertyP holds for 0, and ifPholds forn+1 whenever it holds for a natural numbern, thenP holds for all natural numbers. Representing each property of natural numbers as a predicate P : Nat → Set mapping eachn:Natto the set of proofs thatPholds forn, we wish to represent this rule at the object level as a functionindNat with type

∀(P :Nat →Set₎_{. P}_Zero _→₍_∀_n_:_Nat_{. P n}_→_P₍_Succ_n₎₎_→₍_∀_n_:_Nat_{. P n}₎

(6)

motivations for working in the general fibrational setting of Section 4. Of course, the use of category theory to suggest computational constructions has long been accepted within the functional programming community (see, e.g., [1, 2, 13]).

A functionindNat with the above type takes as input the propertyP to be proved, a proofφthatP holds forZero, and a functionψmapping eachn:Nat and each proof thatPholds fornto a proof thatP holds forSuccn, and returns a function mapping each n: Nat to a proof that P holds for n. We can write indNat in terms offoldNat — and thus reduce induction forNatto iteration for Nat— as follows. First note thatindNat cannot be obtained by instantiating the typeX in the type offoldNat to a type of the formP nfor a specific nbecause indNat returns elements of the typesP nfor different valuesn and these types are, in general, distinct from one another. We therefore need a type containing all of the elements of P nfor everyn. Such a type can be formally given by the dependent typeΣn:Nat. P ncomprising pairs (n, p) wheren:Nat andp:P n. The standard approach to defining indNat is thus to apply foldNat to an

N-algebra with carrier Σn : Nat. P n. Such an algebra has components α :

Σn : N at. P n and β : Σn : N at. P n → Σn : N at. P n. Given φ : PZero and ψ : ∀n. P n → P(Succn), we choose α = (Zero, φ) and β(n, p) = (Succn, ψ n p) and note thatf oldN at α β:N at→Σn:N at. P n. We tentatively take indNat P φ ψ nto be p, wheref oldN at α β n = (m, p). But in order to know that p actually gives a proof for n itself, we must show that m = n. Fortunately, this follows easily from the uniqueness of foldNatα β. Lettingπ�P

be the second projection on dependent pairs, the induction rule forNat is

indNat : ∀(P :Nat →Set₎_{. P}_Zero _→₍_∀_n_:_Nat_{. P n}_→_P₍_Succ_n₎₎

→(∀n:Nat. P n)

indNat P φ ψ = πP� ◦(foldNat (Zero, φ) (λ(n, p).Succn, ψ n p))

The use of dependent types is fundamental to this formalization of the in-duction rule forNat, but this is only possible because properties are taken to be set-valued functions. The remainder of this paper uses fibrations to generalise the above treatment of induction to arbitrary functors which have initial algebras and arbitrary properties which are fibred over the category whose objects inter-pret types. In the general fibrational setting, properties are given axiomatically via the fibrational structure rather than assumed to be (set-valued) functions.

3 Induction rules over

Set

(7)

The derivation for Nat in Section 2 suggests that an induction rule for an inductive data type µF should, in general, look something like this:

ind : ∀P :µ F →Set_. _??? _{→ ∀}_x_:_{µF. P x}

But what should the premises — denoted ??? here — of the generic induction rule ind be? Since we want to construct, for any term x:µF, a proof term of typeP x from proof terms forx’s substructures, and since the functionality of the fold operator for µF is precisely to compute a value for x : µF from the values for x’s substructures, it is natural to try to equip P with an F-algebra structure that can be input to fold to yield a mapping of each x : µF to an element of P x. But this approach quickly hits a snag. Since the codomain of every predicateP :µF→Set_isSet_{itself, rather than an object of}Set_,_F _cannot be applied toP as is needed to equipP with anF-algebra structure. Moreover, an induction rule forµF cannot be obtained by applyingfold to anF-algebra with carrierP xfor any specificx.

Such considerations led Hermida and Jacobs [6] to define a category of pred-icatesP and a lifting ˆF for every polynomial functor F on Set _{to a functor ˆ}_F onP that respects the structure ofF. They then constructed ˆF-algebras with carrierP to serve as the premises of their induction rules. Their construction is very general: they consider functors on bicartesian categories rather than just on Set_{, and represent properties by bicartesian fibrations over such categories} instead of using the specific notion of predicate from Definition 2 below. On the other hand, they define liftings for polynomial functors only. In this section we focus exclusively on functors onSet_{and a particular category of predicates, and} show how to define a lifting for all functors on Set_{, including non-polynomial} ones. In this setting our results properly extend those of Hermida and Jacobs, thus catering for a variety of data types that they cannot treat.

Definition 1 Let X be a set. Apredicate onX is a pair(X, P)whereP :X →

Set_{maps each}_x_∈_X _{to a set}_{P x}_{. We call}_X _the_domain_{of the predicate}₍_{X, P}₎_.

Definition 2 Thecategory of predicatesP has predicates as its objects. A mor-phism from a predicate(X, P)to a predicate(X�_{, P}�₎_{is a pair}₍_{f, f}∼_{) : (}_{X, P}₎_→

(X�_{, P}�₎_{of functions, where} _f _:_X _→_X� _and_f∼_:_∀_x_:_{X. P x}_→_P�₍_{f x}₎_.

The notion of a morphism from (X, P) to (X�_{, P}�_{) does not require the sets of}

proofs P x and P�₍_{f x}_{), for any} _x _∈ _X_{, to be} _equal_{. Instead, it requires only}

the existence of a function f∼ _{which maps, for each} _x_{, each proof in} _{P x}_{to a}

proof inP�₍_{f x}_{). We denote by}_U _:_{P →}_Set_the_{forgetful functor}_{mapping each}

predicate (X, P) to its domainX and each predicate morphism (f, f∼_{) to}_f_.

(8)

to use arrow categories to model properties is thus dictated by our desire to lift functors in such a way that it can be extended to the general fibrational setting.

Definition 3 Let F be a functor on Set_{. A} _lifting _of _F _from Set _to _P _{is a} functorFˆ on P such thatF U =UFˆ.

Note that ifP is a predicate onX, then ˆF P is a predicate onF X. We can now derive the standard induction rule from Section 2 forNat as follows.

Example 1 The data type of natural numbers isµN whereN is the functor on Set_{defined by}_{N X} _{= 1+}_X_{. If}_P_{is a predicate on}_X_{, then a lifting}_{N P}ˆ _{: 1+}_X _→ Set _of_N _from Set_to_P _{is given by}_{N P}ˆ ₍_inl_·_{) = 1}_and_{N P}ˆ ₍_inr_n_{) =} _{P n}_{. An}

ˆ

N-algebra with carrier P :Nat →Set _{can be given by in} _{: 1 +}_Nat _→_{Nat and}

in∼ _: _∀_t_{: 1+}_Nat_._{N P t}_ˆ _→_P₍_in_t₎_{. Since in}₍_inl_·_{) = 0}_{and in}₍_inr_n_{) =}_n₊₁_{, we}

see thatin∼_{is an element}_h

1:P0and a functionh2:∀n:Nat. P n→P(n+1).

Thus, the second component in∼of theNˆ-algebra with carrierP :Nat→Set_and first component in gives the premises of the familiar induction rule in Example 1.

The notion of predicate comprehension is a key ingredient of our lifting.

Definition 4 LetP be a predicate onX. ThecomprehensionofP, denoted{P}, is the typeΣx:X. P xcomprising pairs(x, p)wherex:X andp:P x. The map taking each predicate P to {P}, and taking each predicate morphism (f, f∼_{) :} P → P� _to _{₍_{f, f}∼₎_} _: _{_P_{} → {}_P�_} _{defined by} _{₍_{f, f}∼₎_}₍_{x, p}_{) = (}_{f x, f}∼_{x p}₎_,

defines the comprehension functor{−}from P toSet_.

Definition 5 If F is a functor on Set_{, then the lifting} _Fˆ _{is the functor on}_P given as follows. For every predicate P on X, F Pˆ : F X →Set _{is defined by}

ˆ

F P = (F πP)−1, where the natural transformation π : {−} → U is given by

πP(x, p) =x. For every predicate morphism f : P →P�, F fˆ = (k, k∼) where

k=F(U f), andk∼_:_∀_y_:_{F X.}_{F P y}_ˆ _→_{F P}_ˆ �₍_{k y}₎_{is given by} _k∼_{y z}₌_F_{_f_}_z_.

The inverse imagef−1 _of_f _:_X _→_Y _{is a predicate} _P _:_Y _→_Set_{. Thus if}_P _{is a}

predicate onX, then ˆF P is a predicate onF X, so ˆF is a lifting ofF fromSet_to

P. The lifting ˆF captures an “all” modality generalising Haskell’sall_function on lists to arbitrary data types. A similar modality is given in [12] for indexed containers.

The lifting in Example 1 is the instantiation of the construction in Definition 5 toN X= 1 +X onSet_{. Indeed, if}_P _{is any predicate, then ˆ}_{N P} _{= (}_{N π}_P₎−1_{, i.e.,}

ˆ

N P = (id+πP)−1, by Definition 5. Since the inverse image of the coproduct

of functions is the coproduct of their inverse images, sinceid−11 = 1, and since

π−P1n={(n, p)|p:P n}for alln, we have ˆN P(inl·) = 1 and ˆN P(inr n) =P n.

The rest of this section shows that F-algebras with carrier {P} are inter-derivable with ˆF-algebras with carrierP, and then uses this result to derive our induction rule.

Definition 6 The functor K1 : Set → P maps each set X to the predicate

(9)

The predicateK1X is called thetruth predicate on X. For everyx:X, the set

K1Xxof proofs thatK1X holds forxis a singleton, and thus is non-empty. For

any functorF, the lifting ˆF maps the truth predicate on a setXto that onF X.

Lemma 1 For any functor F onSet _{and any} _X_:Set_,_Fˆ₍_K₁_X_{) =}_K₁₍_{F X}₎_.

Proof: By Definition 5, ˆF(K1X) = (F πK₁X)−1. We have that πK₁X is a

iso-morphism since there is only one proof of K1X for each x: X, and thus that

F πK₁Xis an isomorphism as well. As a result, (F πK₁X)−1maps everyy:F Xto

a singleton set, and therefore ˆF(K1X) = (F πK₁X)−1=λy:F X.1 =K1(F X).

The fact thatK1� {−}is critical to the constructions below. This is proved

in [6]; we include its proof here for completeness and to establish notation. The description of comprehension as a right adjoint can be traced back to Law-vere [10].

Lemma 2 K1 is left adjoint to{−}.

Proof:We must show that, for any predicatePand any setY, the setP(K1Y, P)

of morphisms from K1Y to P in P is in bijective correspondence with the set

Set₍_Y,_{_P_}_{) of morphisms from} _Y _to _{_P_}_{. Define maps (}₋₎† _: Set₍_Y,_{_P_}₎ _→

P(K1Y, P) and (−)# :P(K1Y, P)→Set(Y,{P}) by h† = (h1, h2) wherehy=

(v, p), h1y =v and h2y =p, and (k, k∼)# =λ(y : Y).(ky, k∼y). These give a

natural isomorphism between Set₍_Y,_{_P_}_{) and}_P₍_K₁_{Y, P}_).

Naturality of (−)† _{ensures that (}_g_◦_f₎† ₌ _g†_◦_K

1f for allf :Y�→Y and

g :Y → {P}. Similarly for (−)#. Moreover,id† is the counit of the adjunction betweenK1and{−}. These observations are used in the proof of Lemma 4.

Lemma 3 There is a functor Φ:AlgF →AlgFˆ such that if k:F X→X, then

Φk: ˆF(K1X)→K1X.

Proof: For anF-algebrak:F X →X defineΦk=K1k, and for twoF-algebras

k : F X → X and k� _: _{F X}� _→ _X� _{and an} _F_{-algebra morphism} _h _: _X _→ _X�

between them define the ˆF-algebra morphism Φh : Φk → Φk� by Φh = K1h.

ThenK1(F X) = ˆF(K1X) by Lemma 1, so thatΦkis an ˆF-algebra andK1his an

ˆ

F-algebra morphism. It is easy to see thatΦpreserves identities and composition.

Lemma 4 The functor Φhas a right adjoint Ψ such that if j : ˆF P →P, then

Ψ j:F{P} → {P}.

Proof: We constructΨ :AlgFˆ→AlgF as follows. Given an ˆF-algebraj: ˆF P →

P, we use the fact that ˆF(K1{P}) = K1(F{P}) by Lemma 1 to define Ψ j :

F{P} → {P} byΨ j= (j◦F idˆ †)†_{. To specify the action of}_Ψ _{on an ˆ}_F_-algebra

morphismh, defineΨ h={h}. ClearlyΨ preserves identity and composition. Next we showΦ� Ψ, i.e., for every F-algebra k :F X →X and ˆF-algebra

j : ˆF P →P withP a predicate onX, there is a natural isomorphism between

(10)

observe that anF-algebra morphism fromktoΨ jis a map fromX to{P}, and an ˆF-algebra morphism fromΦktojis a map fromK1X toP. An isomorphism

between such maps is given by the adjunctionK1� {−}from Lemma 2, and so

is natural. We must check thatf :X → {P} is anF-algebra morphism fromk

toΨ j ifff†:K1X →P is an ˆF-algebra morphism fromΦk toj.

So assume f : X → {P} is an F-algebra morphism from k to Ψ j, i.e.,

f ◦k =Ψ j◦F f. We must prove that f†_◦ _Φk ₌_j_◦_{F f}_ˆ †_{. By the definition of} Φ in Lemma 3, thus amounts to showingf†_◦_K

1k=j◦F fˆ †. Since (−)† is an

isomorphism,f is an F-algebra morphism iff (f ◦k)† _{= (}_{Ψ j}_◦_{F f}₎†_{. Naturality}

of (−)† ensures that (f◦k)† =f†◦K1kand that (Ψ j◦F f)† = (Ψ j)†◦K1(F f),

so the previous equality holds iff f†◦K1k= (Ψ j)†◦K1(F f). But

j◦F fˆ †

=j◦Fˆ(id†_◦_K

1f) by naturality of (−)† andf =id◦f

= (j◦F idˆ †)◦Fˆ(K1f) by the functoriality of ˆF

= (Ψ j)†_◦_K

1(F f) by the definition ofΨ, the fact that (−)† and (−)#

are inverses, and Lemma 1

=f∗◦K1k by the observation immediately preceding this proof

=f†◦Φk by the definition ofΦ

Sof†◦K1k= (Ψ j)†◦K1(F f), andf† is an ˆF-algebra morphism fromΦk toj.

Lemma 4 ensures thatF-algebras with carrier{P}are interderivable with ˆF -algebras with carrierP. For example, theN-algebra [α, β] with carrier{P}from Section 2 can be derived from the ˆN-algebra with carrierP given in Example 1. Since we define a lifting ˆF for any functor F, Lemma 4 thus shows how to constructF-algebras with carrierΣx:µF. P xfor anyF.

We can now derive our generic induction rule. For every predicateP onX

and every ˆF-algebra (k, k∼_{) : ˆ}_{F P} _→_P_{, Lemma 4 ensures that}_Ψ_{constructs from}

(k, k∼_{) an} _F_{-algebra with carrier} _{_P_}_{. Thus,}_fold ₍_Ψ₍_{k, k}∼_{)) :}_µF _{→ {}_P_} _and

this map decomposes into two parts: φ = πP ◦fold (Ψ(k, k∼)) :µF →X and

ψ : ∀(t:µF). P(φ t). Initiality ofinensuresφ=foldk. This gives the following generic induction rule for the typeX, which reduces induction to iteration:

genind : ∀(F :Set_→Set_{) (}_P _:_X _→Set_{) ((}_{k, k}∼_{) : ( ˆ}_{F P} _→_P_{)) (}_x_:_µF₎_.

P(foldk x) genindF P = π�P◦fold◦Ψ

WhenX =µFandk=in, initiality ofinfurther ensures thatφ = foldin = id, and thus thatgenind specialises to the expected induction rule for an inductive data typeµF:

ind : ∀(F :Set_→Set_{) (}_P _:_µF _→Set_{) ((}_{k, k}∼_{) : ( ˆ}_{F P} _→_P₎₎_. (k=in)→ ∀(x:µF). P x

(11)

This rule can be instantiated to familiar rules for polynomial data types, as well as to ones we would expect for data types such as rose trees and finite hereditary sets, both of which lie outside the scope of standard methods.

Example 2 The data type of rose trees is given in Haskell-like syntax by data Rose =Node(Int,List Rose)

The functor underlyingRoseisF X =Int×ListX and its induction rule is

indRose : ∀(P :Rose→Set_{) ((}_{k, k}∼_{) : ( ˆ}_{F P} _→_P₎₎_.

(k=in)→ ∀(x:Rose). P x

indRoseF P = πP� ◦fold◦Ψ

Calculating F Pˆ = (F πP)−1 : FRose → Set, and writing xs!!k for the kth

component of a list xs, we have that

ˆ

F P(i, rs)

={z:F{P} |F πpz= (i, rs)}

={(j, cps) :Int×List{P} |F πP(j, cps) = (i, rs)}

={(j, cps) :Int×List{P} |(id×ListπP)(j, cps) = (i, rs)}

={(j, cps) :Int×List{P} |j=iand List πP cps=rs}

={(j, cps) :Int×List{P} |j=iand∀k <lengthcps. πP(cps!!k) =rs!!k}

An Fˆ-algebra whose underlying F-algebra is in :FRose →Rose is thus a pair of functions (in, k∼), where k∼ has type

∀i:Int.∀rs:List Rose.

{(j, cps) :Int×List{P} |j=iand∀k <lengthcps. πP(cps!!k) =rs!!k}

→ P(Node(i, rs)) =∀i:Int.∀rs:List Rose.

{cps:List{P} | ∀k <lengthcps. πP(cps!!k) =rs!!k} → P(Node(i, rs))

=∀i:Int.∀rs:List Rose.(∀k <lengthrs. P(rs!!k))→P(Node(i, rs))

The last equality is due to surjective pairing for dependent products and the fact that lengthcps=lengthrs. The type ofk∼ _{gives the hypotheses of the induction}

rule for rose trees.

Example 3 Hereditary sets are sets whose elements are themselves sets, and so are the core data structures within set theory. The data type HS of finite hereditary sets is µPf for the finite powerset functor Pf. If P :X →Set, then

PfπP :Pf(Σx:X.P x)→PfX maps each set {(x1, p1), . . . ,(xn, pn)} to the set

{x1, . . . , xn}, so that (PfπP)−1 maps a set {x1, . . . , xn} to the setP x1×. . .×

P xn. A Pˆf-algebra with carrier P:HS→Set and first component intherefore

has as its second component a function of type

∀({s1, . . . , sn}:Pf(HS)). P s1×. . .×P sn→P(in{s1, . . . , sn})

The induction rule for finite hereditary sets is thus

indHS:: (∀({s1, . . . , sn}:Pf(HS)). P s1×. . .×P sn→P(in{s1, . . . , sn}))

(12)

4 Induction rules in the fibrational setting

We can treat a more general notion of predicate using fibrations. We motivate the move from the codomain fibration to arbitrary fibrations by observing that i) the semantics of data types in languages involving recursion and other effects usually involves categories other thanSet_{; ii) in such circumstances, the notion} of a predicate can no longer be taken as a function with codomain Set_{; iii)} even when working inSet_{there are reasonable notions of “predicate” other than} that in Section 3 (for example, a predicate on a set X could be a subobject of X); and iv) when, in future work, we come to consider induction rules for data types such as nested types, GADTs, indexed containers, and dependent types (see Section 5), we will want to appropriately instantiate a general theory of induction rather than having to invent a new one. Thus, although we could develop anad hoctheory for each choice of category, functor, and predicate, it is far preferable to develop a uniform, axiomatic approach that is widely applicable. Fibrations support precisely this kind of axiomatic approach, so this sec-tion generalises the construcsec-tions of the previous one to the general fibrasec-tional setting. The standard model of type theory based on locally cartesian closed categories does arise as a specific fibration — namely, the codomain fibration — but the general fibrational setting is far more flexible. In locally cartesian closed models of type theory, predicates and types coexist in the same category, so a functor can be taken to be its own lifting. In the general setting, predicates are not simply functions or morphisms, properties and types do not coexist in the same category, and a functor cannot be taken to be its own lifting. There is no choice but to construct a lifting. Details about fibrations can be found in standard references such as [8, 14].

Working in the general fibrational setting also facilitates a direct comparison of our work with that of Hermida and Jacobs [6]. The main difference is that they use fibred products and coproducts in defining their liftings, whereas we use left adjoints to reindexing functors instead. The codomain fibration overSet has both, so their derivation gives exactly the same induction rule as ours in the setting of Section 3.

LetU :E → Bbe a fibration. Objects of the total categoryEcan be thought of as properties, objects of the base categoryBcan be thought of as types, and

U can be thought of as mapping each propertyEin E to the typeU Eof which

E is a property. One fibrationU can capture many different properties of the same type, so U is not injective on objects. For any objectB ofB, we write EB

for thefibre above B, i.e., for the subcategory ofE consisting of objectsE such thatU E=B and morphismskbetween objects ofEB such thatU k=idB. Let

f_E§ be the cartesian morphism determined byf andE. Thenf_E§ is unique up to isomorphism for every choice of objectE and morphism f with codomainU E. If f : B → B� _{is a morphism in} _B_{, then the} _{reindexing functor induced by} _f

is the functor f∗ : EB� → EB defined on objects byf∗E =dom(fE§) and, for a

morphismk:E→E� _in_E

B�,f∗kis the morphism such thatk◦f_E§ =f_E§�◦f∗k. The universal property offE§� ensures the existence and uniqueness off

(13)

Proceeding by analogy with the situation forSet_{-based predicates — where} Set _{plays the role of}_B _and _P _{plays the role of}_E _{— we now define, for every} functorF onB, a lifting ˆF of F toE such that UFˆ =F U. We construct ˆF by generalising each aspect of the construction of Section 3 to the general setting.

•The Truth Functor To construct the truth functor in the general setting, we assume B and E have terminal objects 1B and 1E, respectively, such that U(1E) = 1B. Writing !B for the unique map from an object B of B to 1B, we define K1 :B → E by settingK1B= (!B)∗1E and, for a morphism k:B →B�, taking K1k to be the unique morphism guaranteed to exist by the universal

property of k§. Then, for every B in B, K1B is the terminal object of EB, so

U(K1B) =B. In fact,U �K1, and the unit of this adjunction isid, soU K1=id.

•Comprehension Recall from Lemma 2 that the comprehension functor of Section 3 is right adjoint to the truth functor. Since right adjoints are defined up to isomorphism, in the general fibrational setting we can define the compre-hension functor{−}to be the right adjoint to the truth functorK1. A fibration

U :E → B which has a right adjointK1 which itself has a right adjoint{−}is

called acomprehension category[8]. We henceforth restrict attention to compre-hension categories. We write � for the counit of the adjunction{−} � K1 and

so, for any objectE ofE, we have that�E:K1{E} →E.

• Projection Recall from Section 3 that the first step of the construction of our lifting is to define the projection πP mapping the comprehension of a

predicateP toP’s domainU P. As in Section 3, we also want comprehension to be a natural transformation, so we actually seek to constructπ:{−} → U for an arbitrary comprehension category. Since�E:K1{E} →E for every objectE

of E, we have that U �E : U K1{E} → U E. Because U K1 =id, defining U �by

(U �)E=U �E gives a natural transformation from{−}to U. We may therefore

define the projection in an arbitrary comprehension category byπ=U �. • Inverses The final step in defining ˆF is to turn each component F πE

of the natural transformation F π : F{−} → F U defined by (F π)E = F πE

into a predicate over F U E. In Section 3, this was done via an inverse image construction. To generalise it, first note that we can construct a predicateinvf in EB� for any mapf :B→B� in Bif we assume a small amount of additional

standard fibrational structure, namely that for each such f the functor f∗ _: EB� → EB has a left adjoint. As in [6], no Beck-Chevalley condition is required

on this adjoint, which we denoteΣf :EB→ EB�. We defineinvf to beΣf(K1B).

•The Lifting Putting this all together, we now define the lifting ˆF :E → E by ˆF E = ΣF π_E(K1(F{E})) for every object E of E. For completeness we

also give the action of ˆF on morphisms. For each k : E → E�_{, define ˆ}_{F k} ₌ (F U k)§αK₁F{E�_}ΣF πEγ(K1F{E}). Here, i)αK1F{E�}:ΣF πE(F{k})

∗_K

1F{E�}

→(F U{k})∗_Σ

F π_E�K1F{E

�_}_{is the component for}_K

1F{E�}of the natural

trans-formation from ΣF πE(F{k})∗ to (F U{k})∗ΣF π_E� arising from the facts that

ΣF π_E is the left adjoint of (FπE)∗ and that Fπ is a natural transformation,

and ii)γ:ΣF π_EK1F{E} →ΣF πE(F{k})∗K1F{E�}is the isomorphism arising

from the fact that (F{k})∗_{is a right adjoint by the existence of}_Σ

(14)

•Generalising Lemma 1 As in Section 3, we ultimately want to show thatF -algebras with carrier{P}are interderivable with ˆF-algebras with carrierP. We first show that, as in Lemma 1, ˆF(K1B) =K1(F B) for any functorF onBand B in B. Recall thatU K1=idand define πK1:{−}K1→Id to be the natural

transformation with components (πK1)B = πK₁B. Note that ((πK1)B)−1 = U K1ηB, whereη :Id → {−}K1 is the unit of the adjunction{−} �K1, so that πK1is a natural isomorphism. If we further defineF πK1:F{−}K1→F U K1to

be the natural transformation with components (F πK1)B =F((πK1)B), then F πK1is also a natural isomorphism. We will use this observation below to show

that, for every objectBofB,Σ(F πK1)Bis not only left adjoint by definition, but

also right adjoint, to ((F πK1)B)∗. Then, observing that right adjoints preserve

terminal objects and thatK1(F B) is the terminal object ofEF B (sinceK1B is

the terminal object ofEB for anyB), we will have shown that ˆF(K1B) — i.e., Σ(F πK1)B(K1(F{K1B})) — must be the terminal object of EF U(K1B), i.e., of EF B. In other words, we will have shown that ˆF(K1B) =K1(F B).

So, fix an object B of B. To see that Σ(F πK1)B � ((F πK1)B)

∗_{, first note}

that, for any isomorphismf :B→B� _in_B_,_f∗ _{and (}_f−1₎∗ _{both exist and both}

f∗ _�₍_f−1₎∗ _{and (}_f−1₎∗ _�_f∗ _{hold. Then, since}_f∗ _�_Σ

f by definition, we have Σf is (f−1)∗, and thus thatΣf �f∗. Instantiatingf to (F πK1)B and recalling

that (F πK1)B is an isomorphism, we have thatΣ(F πK1)B �((F πK1)B) ∗_. •A Generic Fibrational Induction Rule Analogues of Lemma 3 and Lemma 4 hold in the general fibrational setting provided all occurrences ofSet_{are replaced} byBand all occurrences ofPare replaced byEand, in the analogue of Lemma 2, (−)† _:_B₍_B,_{_E_}₎_{→ E}₍_K

1B, E) and (−)#:E(K1B, E)→ B(B,{E}) are defined

by the adjunction{−} �K1.

The above construction thus yields the following generalised induction rule:

genfibind : ∀(F :B → B) (E:EX) (k: ˆF E→E). µF → {E}

genfibindF E = fold◦Ψ

This induction rule looks slightly different from the one for set-valued predicates. In Section 3, we were able to use the specific structure of comprehensions for set-valued predicates to extract proofs for particular data elements from them. But in the general fibrational setting, predicates, and hence comprehensions, are left abstract, so we take the return type of the general induction schemegenfibind to be a comprehension. We expect that, when genfibind is instantiated to a fibration of interest, we should be able to use knowledge about that fibration to extract from the comprehension it constructs further proof information relevant to the application at hand. This expectation is justified, as in [6], by{−} �K1.

We now give an induction rule for a data type and properties that cannot be modelled inSet_.

Example 4 The fixed pointHyp=µF of the functorF X= (X →Int)→Int

is the data type of hyperfunctions. SinceF has no fixed point inSet_{, we interpret} it in the category ωCP O⊥ of ω-cpos with ⊥ and strict continuous monotone

(15)

sub-ωCP O⊥ A of X. Admissibility means that the bottom element of X is in

A and A is closed under the least upper bounds of X. This structure forms a Lawvere category [7, 8]. The truth functor maps X to X, and comprehension maps a sub-ωCP O⊥ A ofX toA. The liftingFˆ maps a sub-ωCP O⊥ A ofX to

the sub-ωCP O⊥ F Aof F X. Finally, the derived induction rule states that if A

is an admissible sub-ωCP O⊥ ofHyp, and if Fˆ(A)⊆A, thenA=Hyp.

5 Conclusion and future work

We give an induction rule that can be used to prove properties of data structures of inductive types. Like Hermida and Jacobs, we give a fibrational account of induction, but we derive, under slightly different assumptions on fibrations, a generic induction rule that can be instantiated toanyinductive type rather than just to polynomial ones. This rule is based on initial algebra semantics of data types, and is parameterised over both the data types and the properties involved. It is also principled, expressive, and correct. Our derivation yields the same induction rules as Hermida and Jacobs’ when specialised to polynomial functors in the codomain fibration, but it also gives induction rules for non-polynomial data types such as rose trees, and for data types such as finite hereditary sets and hyperfunctions, for which no induction rules have previously been known.

There are several directions for future work. The most immediate is to instan-tiate our theory to give induction rules for nested types. These are exemplified by the data type of perfect trees given in Haskell-like syntax as follows:

data PTree a:Set_where PLeaf :a→PTree a

PNode:PTree(a, a)→PTreea

Nested types arise as least fixed points of rank-2 functors; for example, the type of perfect trees isµH for the functorH given byHF =λX. X+F(X×X). An appropriate fibration for induction rules for nested types thus takesBto be the category of functors onSet_,_E _{to be the category of functors from}Set_to_P_{, and}

U to be postcomposition with the forgetful functor from Section 3. A lifting ˆH

ofH is given by ˆH P X(inla) = 1 and ˆH P X(inrn) =P(X×X)n. Taking the premise to be an ˆH-algebra gives the following induction rule for perfect trees:

indP T ree:∀(P :Set_{→ P}₎_.

(U P =PTree)→(∀(X :Set₎₍_x_:_X₎_{. P}₍_PLeaf _x₎₎_→

(∀(X:Set₎₍_t_:_PTree₍_X_×_X₎_{. P}₍_X_×_X₎_t_→_P₍_PNode_t₎₎₎_→

∀(X:Set₎₍_t_:_PTree_X₎_{. P X t}

This rule can be used to show, for example, thatP T reeis a functor.

(16)

give useful induction rules, as well as how to translate the rules to which these fibrations give rise to an intensional setting.

Matthes [11] gives induction rules for nested types (including truly nested ones) in an intensional type theory. He handles only rank-2 functors that underlie nested types (while we handle any functor of any rank with an initial algebra), but his insights may help guide choices of fibrations for truly nested types. These may in turn inform choices for GADTs, indexed containers, and dependent types. Induction rules can automatically be generated in many type theories. Within the Calculus of Constructions [3] an induction rule for a data type can be gen-erated solely from the inductive structure of that type. Such generation is also a key idea in the Coq proof assistant [4]. But as far as we know, generation can currently be done only for syntactic classes of functors rather than for all functors with initial algebras. In some type theories induction schemes are added as axioms rather than generated. For example, attempts to generate induction schemes based on Church encodings in the Calculus of Constructions proved un-successful and so initiality was added to the system, thus giving the Calculus of Inductive Constructions. Whereas Matthes’ work is based on concepts such as impredicativity and induction recursion rather than initial algebras, ours reduces induction to initiality, and may therefore help lay the groundwork for extending implementations of induction to more sophisticated data types.

References

1. R. S. Bird and O. De Moor. Algebra of Programming. International Series in Com-puting Science, volume 100. Prentice Hall, 1997.

2. R. Bird and L. Meertens. Nested Datatypes. Proceedings, Mathematics of Program Construction, pp. 52–67, 1998.

3. T. Coquand and G. Huet. The Calculus of Constructions.Information and Compu-tation76 (2-3), pp. 95–120, 1988.

4. The Coq Proof Assistant. Available atcoq.inria.fr

5. N. Ghani and P. Johann. Foundations for Structured Programming with GADTs. Proceedings, Principles of Programming Languages, pp. 297–308, 2008.

6. C. Hermida and B. Jacobs. Structural Induction and Coinduction in a Fibrational Setting.Information and Computation145 (2), pp. 107–152, 1998.

7. B. Jacobs. Comprehension Categories and the Semantics of Type Dependency. The-oretical Computer Science107, pp. 169–207, 1993.

8. B. Jacobs.Categorical Logic and Type Theory. North Holland, 1999.

9. P. Johann and N. Ghani. Initial Algebra Semantics is Enough! Proceedings, Typed Lambda Calculus and Applications, pp. 207–222, 2007.

10. F. W. Lawvere. Equality in Hyperdoctrines and Comprehension Scheme as an Adjoint Functor.Applications of Categorical Algebra, pp. 1–14, 1970.

11. R. Matthes. An Induction Principle for Nested Datatypes in Intensional Type Theory.Journal of Functional Programming19 (3&4), pp. 439–468, 2009.

12. P. Morris.Constructing Universes for Generic Programming. Dissertation, Univer-sity of Nottingham, 2007.

13. E. Moggi. Notations of Computation and Monads.Information and Computation 93 (1), pp. 55–92, 1991.