Morita Equivalence


Thomas William Barrett

Hans Halvorson

Abstract

Logicians and philosophers of science have proposed various formal criteria for theoretical equivalence. In this paper, we examine two such proposals: definitional equivalence and categorical equivalence. In order to show precisely how these two well-known criteria are related to one another, we investigate an intermediate criterion called Morita equivalence.

1 Introduction

Many theories admit different formulations, and these formulations often bear interesting relationships to one another. One relationship that has received significant attention from logicians and philosophers of science is theoretical equivalence.¹ In this paper we will examine two formal criteria for theoretical equivalence. The first criterion, called definitional equivalence, has been known to logicians since the middle of the twentieth century.² It was introduced into philosophy of science by Glymour (1970, 1977, 1980). The second criterion is called categorical equivalence. It was first described by Eilenberg and Mac Lane (1942, 1945), but was only recently introduced into philosophy of science by Halvorson (2012, 2015) and Weatherall (2015a).

In order to illustrate the relationship between these two criteria, we will consider a third criterion for theoretical equivalence called Morita equivalence. We will show that these three criteria form the following hierarchy, where the arrows in the figure mean “implies.”

Definitional equivalence ⇒ Morita equivalence ⇒ Categorical equivalence

∗ The authors can be reached at [email protected] and [email protected]. We would like to thank Dimitris Tsementzis, Jim Weatherall, and JB Manchak for helpful comments and discussion.

¹ See Quine (1975), Sklar (1982), Halvorson (2012, 2013, 2015), Glymour (2013), van Fraassen (2014), and Coffey (2014) for discussion of theoretical equivalence in philosophy of science.

² Artigue et al. (1978) and de Bouvère (1965) attribute the concept of definitional

Our discussion will allow us to evaluate definitional equivalence against categorical equivalence. Indeed, it will demonstrate a precise sense in which definitional equivalence is too strict a criterion for theoretical equivalence, while categorical equivalence is too liberal. There are theories that are not definitionally equivalent that one nonetheless has good reason to consider equivalent. And on the other hand, there are theories that are categorically equivalent that one has good reason to consider inequivalent.

2 Many-sorted logic

All of these criteria for theoretical equivalence are most naturally understood in the framework of first-order many-sorted logic. We begin with some preliminaries about this framework.³

2.1 Syntax

A signature $\Sigma$ is a set of sort symbols, predicate symbols, function symbols, and constant symbols. $\Sigma$ must have at least one sort symbol. Each predicate symbol $p \in \Sigma$ has an arity $\sigma_1 \times \ldots \times \sigma_n$, where $\sigma_1, \ldots, \sigma_n \in \Sigma$ are (not necessarily distinct) sort symbols. Likewise, each function symbol $f \in \Sigma$ has an arity $\sigma_1 \times \ldots \times \sigma_n \to \sigma$, where $\sigma_1, \ldots, \sigma_n, \sigma \in \Sigma$ are again (not necessarily distinct) sort symbols. Lastly, each constant symbol $c \in \Sigma$ is assigned a sort $\sigma \in \Sigma$. In addition to the elements of $\Sigma$ we also have a stock of variables. We use the letters $x$, $y$, and $z$ to denote these variables, adding subscripts when necessary. Each variable has a sort $\sigma \in \Sigma$.

A $\Sigma$-term can be thought of as a “naming expression” in the signature $\Sigma$. Each $\Sigma$-term has a sort $\sigma \in \Sigma$. The $\Sigma$-terms of sort $\sigma$ are recursively defined as follows. Every variable of sort $\sigma$ is a $\Sigma$-term of sort $\sigma$, and every constant symbol $c \in \Sigma$ of sort $\sigma$ is also a $\Sigma$-term of sort $\sigma$. Furthermore, if $f \in \Sigma$ is a function symbol with arity $\sigma_1 \times \ldots \times \sigma_n \to \sigma$ and $t_1, \ldots, t_n$ are $\Sigma$-terms of sorts $\sigma_1, \ldots, \sigma_n$, then $f(t_1, \ldots, t_n)$ is a $\Sigma$-term of sort $\sigma$. We will use the notation $t(x_1, \ldots, x_n)$ to denote a $\Sigma$-term in which all of the variables that appear in $t$ are in the sequence $x_1, \ldots, x_n$, but we leave open the possibility that some of the $x_i$ do not appear in the term $t$.

A $\Sigma$-atom is an expression either of the form $s(x_1, \ldots, x_n) = t(x_1, \ldots, x_n)$, where $s$ and $t$ are $\Sigma$-terms of the same sort $\sigma \in \Sigma$, or of the form $p(t_1, \ldots, t_n)$, where $t_1, \ldots, t_n$ are $\Sigma$-terms of sorts $\sigma_1, \ldots, \sigma_n$ and $p \in \Sigma$ is a predicate of arity $\sigma_1 \times \ldots \times \sigma_n$. The $\Sigma$-formulas are then defined recursively as follows.

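• Every $\Sigma$-atom is a $\Sigma$-formula.

• If $\phi$ is a $\Sigma$-formula, then $\neg\phi$ is a $\Sigma$-formula.

• If $\phi$ and $\psi$ are $\Sigma$-formulas, then $\phi \to \psi$, $\phi \wedge \psi$, $\phi \vee \psi$, and $\phi \leftrightarrow \psi$ are $\Sigma$-formulas.

• If $\phi$ is a $\Sigma$-formula and $x$ is a variable of sort $\sigma \in \Sigma$, then $\forall_\sigma x\, \phi$ and $\exists_\sigma x\, \phi$ are $\Sigma$-formulas.

³ Our notation follows Hodges (2008). We present the more general case of many-sorted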

In addition to the above formulas, we will use the notation $\exists_{\sigma=1} y\, \phi(x_1, \ldots, x_n, y)$ to abbreviate the formula $\exists_\sigma y\big(\phi(x_1, \ldots, x_n, y) \wedge \forall_\sigma z(\phi(x_1, \ldots, x_n, z) \to y = z)\big)$. As above, the notation $\phi(x_1, \ldots, x_n)$ will denote a $\Sigma$-formula $\phi$ in which all of the free variables appearing in $\phi$ are in the sequence $x_1, \ldots, x_n$, but we again leave open the possibility that some of the $x_i$ do not appear as free variables in $\phi$. A $\Sigma$-sentence is a $\Sigma$-formula that has no free variables.
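The recursive definition of $\Sigma$-terms can be sketched in code. The following is a minimal illustration, not part of the formal development: the two-sorted signature (sorts `point` and `line`, a function symbol `join`, a constant `origin`) and all Python names are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Var:
    name: str
    sort: str          # every variable carries a sort

@dataclass(frozen=True)
class Const:
    name: str

@dataclass(frozen=True)
class App:
    func: str
    args: tuple        # the argument terms t1, ..., tn

# Hypothetical signature (an assumption made for illustration only).
SORTS = {"point", "line"}
FUNCS = {"join": (("point", "point"), "line")}   # join : point x point -> line
CONSTS = {"origin": "point"}

def sort_of(term):
    """Return the sort of a well-formed term; raise TypeError otherwise."""
    if isinstance(term, Var):
        if term.sort not in SORTS:
            raise TypeError(f"unknown sort {term.sort!r}")
        return term.sort
    if isinstance(term, Const):
        return CONSTS[term.name]
    if isinstance(term, App):
        arg_sorts, result_sort = FUNCS[term.func]
        actual = tuple(sort_of(a) for a in term.args)
        if actual != arg_sorts:
            raise TypeError(f"{term.func} expects {arg_sorts}, got {actual}")
        return result_sort
    raise TypeError("not a term")

x = Var("x", "point")
t = App("join", (x, Const("origin")))   # the term join(x, origin)
```

Here `sort_of(t)` follows the three clauses of the definition: variables, constants, and function applications whose arguments have the sorts required by the arity.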

2.2 Semantics

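A $\Sigma$-structure $A$ is an “interpretation” of the symbols in $\Sigma$. In particular, $A$ satisfies the following conditions.

• Every sort symbol $\sigma \in \Sigma$ is assigned a nonempty set $A_\sigma$. The sets $A_\sigma$ are required to be pairwise disjoint.

• Every predicate symbol $p \in \Sigma$ of arity $\sigma_1 \times \ldots \times \sigma_n$ is interpreted as a subset $p^A \subset A_{\sigma_1} \times \ldots \times A_{\sigma_n}$.

• Every function symbol $f \in \Sigma$ of arity $\sigma_1 \times \ldots \times \sigma_n \to \sigma$ is interpreted as a function $f^A : A_{\sigma_1} \times \ldots \times A_{\sigma_n} \to A_\sigma$.

• Every constant symbol $c \in \Sigma$ of sort $\sigma \in \Sigma$ is interpreted as an element $c^A \in A_\sigma$.

Given a $\Sigma$-structure $A$, we will often refer to an element $a \in A_\sigma$ as “an element of sort $\sigma$.”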

Let $A$ be a $\Sigma$-structure with $a_1, \ldots, a_n \in A$ elements of sorts $\sigma_1, \ldots, \sigma_n$. We let $t(x_1, \ldots, x_n)$ be a $\Sigma$-term of sort $\sigma$, with $x_1, \ldots, x_n$ variables of sorts $\sigma_1, \ldots, \sigma_n$, and we recursively define the element $t^A[a_1, \ldots, a_n] \in A_\sigma$. If $t$ is the variable $x_i$, then $t^A[a_1, \ldots, a_n] = a_i$, and if $t$ is the constant symbol $c \in \Sigma$, then $t^A[a_1, \ldots, a_n] = c^A$. Furthermore, if $t$ is of the form $f(t_1, \ldots, t_m)$ where each $t_i$ is a $\Sigma$-term of sort $\tau_i \in \Sigma$ and $f \in \Sigma$ is a function symbol of arity $\tau_1 \times \ldots \times \tau_m \to \sigma$, then

$$t^A[a_1, \ldots, a_n] = f^A\big(t_1^A[a_1, \ldots, a_n], \ldots, t_m^A[a_1, \ldots, a_n]\big)$$

One can think of the element $t^A[a_1, \ldots, a_n] \in A_\sigma$ as the element of the $\Sigma$-structure $A$ that is denoted by the $\Sigma$-term $t(x_1, \ldots, x_n)$ when $a_1, \ldots, a_n$ are substituted for the variables $x_1, \ldots, x_n$.
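The recursive clauses for $t^A[a_1, \ldots, a_n]$ translate directly into a small evaluator. The sketch below is illustrative only: the tuple encoding of terms, the dictionary encoding of a structure, and the example structure (addition modulo 5 with a constant `e`) are assumptions of this example rather than anything fixed by the text.

```python
# Terms are nested tuples (a hypothetical encoding):
#   ("var", i)                 the variable x_i, 0-based
#   ("const", c)               the constant symbol c
#   ("app", f, t1, ..., tm)    the term f(t1, ..., tm)

def evaluate(term, structure, assignment):
    """Compute t^A[a_1, ..., a_n] by recursion on the term."""
    kind = term[0]
    if kind == "var":
        return assignment[term[1]]
    if kind == "const":
        return structure["consts"][term[1]]
    if kind == "app":
        args = tuple(evaluate(t, structure, assignment) for t in term[2:])
        return structure["funcs"][term[1]](*args)
    raise ValueError(f"unknown term kind {kind!r}")

# Hypothetical one-sorted structure: integers modulo 5 under addition.
Z5 = {
    "consts": {"e": 0},
    "funcs": {"add": lambda a, b: (a + b) % 5},
}

# The term add(x_0, add(x_1, e)) evaluated at x_0 = 3, x_1 = 4.
t = ("app", "add", ("var", 0), ("app", "add", ("var", 1), ("const", "e")))
value = evaluate(t, Z5, [3, 4])
```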

Our next aim is to define when a sequence of elements $a_1, \ldots, a_n \in A$ satisfy a $\Sigma$-formula $\phi(x_1, \ldots, x_n)$ in the $\Sigma$-structure $A$. When this is the case we write $A \vDash \phi[a_1, \ldots, a_n]$. We begin by considering $\Sigma$-atoms. Let $\phi(x_1, \ldots, x_n)$ be a $\Sigma$-atom with $x_1, \ldots, x_n$ variables of sorts $\sigma_1, \ldots, \sigma_n$ and let $a_1, \ldots, a_n \in A$ be elements of sorts $\sigma_1, \ldots, \sigma_n$. First, if $\phi(x_1, \ldots, x_n)$ is the formula $s(x_1, \ldots, x_n) = t(x_1, \ldots, x_n)$, where $s$ and $t$ are $\Sigma$-terms of sort $\sigma$, then $A \vDash \phi[a_1, \ldots, a_n]$ if and only if

$$s^A[a_1, \ldots, a_n] = t^A[a_1, \ldots, a_n]$$

Second, if $\phi(x_1, \ldots, x_n)$ is the formula $p(t_1, \ldots, t_m)$, where each $t_i$ is a $\Sigma$-term of sort $\tau_i$ and $p \in \Sigma$ is a predicate symbol of arity $\tau_1 \times \ldots \times \tau_m$, then $A \vDash \phi[a_1, \ldots, a_n]$ if and only if

$$\big(t_1^A[a_1, \ldots, a_n], \ldots, t_m^A[a_1, \ldots, a_n]\big) \in p^A$$

This definition is extended to all $\Sigma$-formulas in the following standard way.

• $A \vDash \neg\phi[a_1, \ldots, a_n]$ if and only if it is not the case that $A \vDash \phi[a_1, \ldots, a_n]$.

• $A \vDash \phi \wedge \psi[a_1, \ldots, a_n]$ if and only if $A \vDash \phi[a_1, \ldots, a_n]$ and $A \vDash \psi[a_1, \ldots, a_n]$. The cases of $\vee$, $\to$, and $\leftrightarrow$ are defined analogously.

• Suppose that $\phi(x_1, \ldots, x_n)$ is $\forall_\sigma y\, \psi(x_1, \ldots, x_n, y)$, where $\sigma \in \Sigma$ is a sort symbol. Then $A \vDash \phi[a_1, \ldots, a_n]$ if and only if $A \vDash \psi[a_1, \ldots, a_n, b]$ for every element $b \in A_\sigma$. The case of $\exists_\sigma$ is defined analogously.

If $\phi$ is a $\Sigma$-sentence, then $A \vDash \phi$ just in case $A \vDash \phi[\,]$, i.e. the empty sequence satisfies $\phi$ in $A$.
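For finite structures, the satisfaction clauses above can be checked mechanically. The following sketch assumes a hypothetical tuple encoding of formulas and a dictionary encoding of structures; it uses named variables with an environment in place of the positional sequence $a_1, \ldots, a_n$, which is a presentational choice of this example.

```python
def ev(term, A, env):
    """Evaluate a term: ("var", x), ("const", c), or ("app", f, t1, ...)."""
    kind = term[0]
    if kind == "var":
        return env[term[1]]
    if kind == "const":
        return A["consts"][term[1]]
    return A["funcs"][term[1]](*(ev(t, A, env) for t in term[2:]))

def sat(phi, A, env):
    """Decide whether A satisfies phi under env (finite sorts only)."""
    op = phi[0]
    if op == "eq":
        return ev(phi[1], A, env) == ev(phi[2], A, env)
    if op == "pred":
        return tuple(ev(t, A, env) for t in phi[2:]) in A["preds"][phi[1]]
    if op == "not":
        return not sat(phi[1], A, env)
    if op == "and":
        return sat(phi[1], A, env) and sat(phi[2], A, env)
    if op == "or":
        return sat(phi[1], A, env) or sat(phi[2], A, env)
    if op == "implies":
        return (not sat(phi[1], A, env)) or sat(phi[2], A, env)
    if op in ("forall", "exists"):
        _, sort, var, body = phi
        # The quantifier ranges only over the elements of the named sort.
        results = (sat(body, A, {**env, var: b}) for b in A["sorts"][sort])
        return all(results) if op == "forall" else any(results)
    raise ValueError(f"unknown connective {op!r}")

# Hypothetical structure with one sort "s" and two unary predicates.
A = {
    "sorts": {"s": {0, 1, 2}},
    "preds": {"p": {(0,), (1,)}, "q": {(2,)}},
    "funcs": {},
    "consts": {},
}
px = ("pred", "p", ("var", "x"))
qx = ("pred", "q", ("var", "x"))
cover = ("forall", "s", "x", ("or", px, qx))      # every element is p or q
overlap = ("exists", "s", "x", ("and", px, qx))   # some element is both
```

A sentence is checked with the empty environment, mirroring the clause $A \vDash \phi[\,]$.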

2.3 Relationships between structures

There are different relationships that $\Sigma$-structures can bear to one another. An isomorphism $h : A \to B$ between $\Sigma$-structures $A$ and $B$ is a family of bijections $h_\sigma : A_\sigma \to B_\sigma$ for each sort symbol $\sigma \in \Sigma$ that satisfies the following conditions.

• For every predicate symbol $p \in \Sigma$ of arity $\sigma_1 \times \ldots \times \sigma_n$ and all elements $a_1, \ldots, a_n \in A$ of sorts $\sigma_1, \ldots, \sigma_n$, $(a_1, \ldots, a_n) \in p^A$ if and only if $(h_{\sigma_1}(a_1), \ldots, h_{\sigma_n}(a_n)) \in p^B$.

• For every function symbol $f \in \Sigma$ of arity $\sigma_1 \times \ldots \times \sigma_n \to \sigma$ and all elements $a_1, \ldots, a_n \in A$ of sorts $\sigma_1, \ldots, \sigma_n$,

$$h_\sigma\big(f^A(a_1, \ldots, a_n)\big) = f^B\big(h_{\sigma_1}(a_1), \ldots, h_{\sigma_n}(a_n)\big)$$

• For every constant symbol $c \in \Sigma$ of sort $\sigma$, $h_\sigma(c^A) = c^B$.

When there is an isomorphism $h : A \to B$ one says that $A$ and $B$ are isomorphic and writes $A \cong B$.
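For finite structures, the isomorphism conditions can be verified exhaustively. The sketch below is a rough illustration under assumed encodings: signatures and structures are plain dictionaries, and the example signature (one sort `s`, a predicate `p`, a unary function `f`, a constant `c`) is hypothetical.

```python
from itertools import product

def is_isomorphism(h, A, B, sig):
    """Check that the family h = {sort: {a: b}} is an isomorphism A -> B."""
    for s in sig["sorts"]:
        images = list(h[s].values())
        bijective = (set(h[s]) == set(A["sorts"][s])
                     and set(images) == set(B["sorts"][s])
                     and len(set(images)) == len(images))
        if not bijective:
            return False
    for p, arity in sig["preds"].items():
        for tup in product(*(A["sorts"][s] for s in arity)):
            image = tuple(h[s][a] for s, a in zip(arity, tup))
            if (tup in A["preds"][p]) != (image in B["preds"][p]):
                return False
    for f, (arity, out) in sig["funcs"].items():
        for tup in product(*(A["sorts"][s] for s in arity)):
            image = tuple(h[s][a] for s, a in zip(arity, tup))
            if h[out][A["funcs"][f](*tup)] != B["funcs"][f](*image):
                return False
    for c, s in sig["consts"].items():
        if h[s][A["consts"][c]] != B["consts"][c]:
            return False
    return True

# Hypothetical signature and two structures for it.
sig = {"sorts": {"s"}, "preds": {"p": ("s",)},
       "funcs": {"f": (("s",), "s")}, "consts": {"c": "s"}}
A = {"sorts": {"s": {0, 1}}, "preds": {"p": {(0,)}},
     "funcs": {"f": lambda a: 1 - a}, "consts": {"c": 0}}
B = {"sorts": {"s": {"u", "v"}}, "preds": {"p": {("v",)}},
     "funcs": {"f": lambda b: {"u": "v", "v": "u"}[b]}, "consts": {"c": "v"}}

good = {"s": {0: "v", 1: "u"}}   # respects p, f, and c
bad = {"s": {0: "u", 1: "v"}}    # a bijection that does not respect p
```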

There is another important relationship that $\Sigma$-structures can bear to one another. An elementary embedding $h : A \to B$ between $\Sigma$-structures $A$ and $B$ is a family of maps $h_\sigma : A_\sigma \to B_\sigma$ for each sort symbol $\sigma \in \Sigma$ that satisfies

$$A \vDash \phi[a_1, \ldots, a_n] \text{ if and only if } B \vDash \phi[h_{\sigma_1}(a_1), \ldots, h_{\sigma_n}(a_n)]$$

for all $\Sigma$-formulas $\phi(x_1, \ldots, x_n)$ and elements $a_1, \ldots, a_n \in A$ of sorts $\sigma_1, \ldots, \sigma_n$.

Given an isomorphism or elementary embedding $h : A \to B$, we will often use the notation $h(a_1, \ldots, a_n)$ to denote the sequence of elements $h_{\sigma_1}(a_1), \ldots, h_{\sigma_n}(a_n)$.

Every isomorphism is an elementary embedding, but in general the converse does not hold.

There is an important relationship that can hold between structures of different signatures. Let $\Sigma \subset \Sigma^+$ be signatures and suppose that $A$ is a $\Sigma^+$-structure. One obtains a $\Sigma$-structure $A|_\Sigma$ by “forgetting” the interpretations of symbols in $\Sigma^+ - \Sigma$. We call $A|_\Sigma$ the reduct of $A$ to the signature $\Sigma$, and we call $A$ an expansion of $A|_\Sigma$ to the signature $\Sigma^+$. Note that in general a $\Sigma$-structure will have more than one expansion to the signature $\Sigma^+$.

We can now discuss first-order theories in many-sorted logic. A $\Sigma$-theory $T$ is a set of $\Sigma$-sentences. The sentences $\phi \in T$ are called the axioms of $T$. A $\Sigma$-structure $M$ is a model of a $\Sigma$-theory $T$ if $M \vDash \phi$ for all $\phi \in T$. A theory $T$ entails a sentence $\phi$, written $T \vDash \phi$, if $M \vDash \phi$ for every model $M$ of $T$.

We begin our discussion of theoretical equivalence with the following preliminary criterion.

Definition. Theories $T_1$ and $T_2$ are logically equivalent if they have the same class of models.

One can easily verify that $T_1$ and $T_2$ are logically equivalent if and only if $\{\phi : T_1 \vDash \phi\} = \{\psi : T_2 \vDash \psi\}$.

3 Definitional equivalence

Logical equivalence is a particularly strict criterion for theoretical equivalence. Indeed, theories can only be logically equivalent if they are formulated in the same signature. There are many cases, however, of theories in different signatures that are nonetheless intuitively equivalent. For example, the theory of groups can be formulated in a signature with a binary operation $\cdot$ and a constant symbol $e$, or it can be formulated in a signature with a binary operation $\cdot$ and a unary function $^{-1}$ (Barrett and Halvorson, 2015). Similarly, the theory of linear orders can be formulated in a signature with the binary relation $<$, or it can be formulated in a signature with the binary relation $\leq$. Since logical equivalence does not capture any sense in which these theories are equivalent, logicians and philosophers of science have proposed more general criteria for theoretical equivalence.

One such criterion is definitional equivalence. This criterion is well known among logicians, and many results about it have been proven.⁴ The basic idea behind definitional equivalence is simple. Theories $T_1$ and $T_2$ are definitionally equivalent if $T_1$ can define all of the symbols that $T_2$ uses, and in a compatible way, $T_2$ can define all of the symbols that $T_1$ uses. In order to state this criterion precisely, we need to do some work.

⁴ For example, see de Bouvère (1965), Kanger (1968), Pinter (1978), Pelletier and Urquhart

3.1 Definitional extensions

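We first need to formalize the concept of a definition. Let $\Sigma \subset \Sigma^+$ be signatures and let $p \in \Sigma^+ - \Sigma$ be a predicate symbol of arity $\sigma_1 \times \ldots \times \sigma_n$. An explicit definition of $p$ in terms of $\Sigma$ is a $\Sigma^+$-sentence of the form

$$\forall_{\sigma_1} x_1 \ldots \forall_{\sigma_n} x_n \big(p(x_1, \ldots, x_n) \leftrightarrow \phi(x_1, \ldots, x_n)\big)$$

where $\phi(x_1, \ldots, x_n)$ is a $\Sigma$-formula. Note that an explicit definition of $p$ in terms of $\Sigma$ can only exist if $\sigma_1, \ldots, \sigma_n \in \Sigma$. An explicit definition of a function symbol $f \in \Sigma^+ - \Sigma$ of arity $\sigma_1 \times \ldots \times \sigma_n \to \sigma$ is a $\Sigma^+$-sentence of the form

$$\forall_{\sigma_1} x_1 \ldots \forall_{\sigma_n} x_n \forall_\sigma y \big(f(x_1, \ldots, x_n) = y \leftrightarrow \phi(x_1, \ldots, x_n, y)\big) \tag{1}$$

and an explicit definition of a constant symbol $c \in \Sigma^+ - \Sigma$ of sort $\sigma$ is a $\Sigma^+$-sentence of the form

$$\forall_\sigma y \big(y = c \leftrightarrow \psi(y)\big) \tag{2}$$

where $\phi(x_1, \ldots, x_n, y)$ and $\psi(y)$ are both $\Sigma$-formulas. Note again that these explicit definitions of $f$ and $c$ can only exist if $\sigma_1, \ldots, \sigma_n, \sigma \in \Sigma$.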

Although they are $\Sigma^+$-sentences, (1) and (2) have consequences in the signature $\Sigma$. In particular, (1) and (2) imply the following sentences, respectively:

$$\forall_{\sigma_1} x_1 \ldots \forall_{\sigma_n} x_n \exists_{\sigma=1} y\, \phi(x_1, \ldots, x_n, y)$$

$$\exists_{\sigma=1} y\, \psi(y)$$

These two sentences are called the admissibility conditions for the explicit definitions (1) and (2).

A definitional extension of a $\Sigma$-theory $T$ to the signature $\Sigma^+$ is a theory

$$T^+ = T \cup \{\delta_s : s \in \Sigma^+ - \Sigma\}$$

that satisfies the following two conditions. First, for each symbol $s \in \Sigma^+ - \Sigma$ the sentence $\delta_s$ is an explicit definition of $s$ in terms of $\Sigma$, and second, if $s$ is a constant symbol or a function symbol and $\alpha_s$ is the admissibility condition for $\delta_s$, then $T \vDash \alpha_s$.
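As a worked illustration of these definitions, consider the two one-sorted examples mentioned earlier (sort subscripts are omitted). The particular defining formulas below are one natural choice and are an assumption of this illustration, not necessarily the ones used in the cited literature.

```latex
% Linear orders: defining \leq in terms of <.
% A predicate definition carries no admissibility condition.
\delta_{\leq} :\quad \forall x\, \forall y\, \big(x \leq y \leftrightarrow (x < y \vee x = y)\big)

% Groups: defining the constant e in terms of \cdot,
% using the fact that the identity is the unique idempotent element.
\delta_{e} :\quad \forall y\, \big(y = e \leftrightarrow y \cdot y = y\big)

% Admissibility condition for \delta_e, a sentence in the old signature:
\alpha_{e} :\quad \exists_{=1} y\, (y \cdot y = y)
```

The definition $\delta_e$ yields a definitional extension only if the original theory entails $\alpha_e$, which holds for any theory whose models are exactly the groups.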

3.2 Three results

A definitional extension of a theory “says no more” than the original theory. There are a number of ways to make this idea precise. Of particular interest to us will be the following three. The reader is encouraged to consult Hodges (2008, p. 58–62) for proofs of these results.

Theorem 3.1. Let $\Sigma \subset \Sigma^+$ be signatures and $T$ a $\Sigma$-theory. If $T^+$ is a definitional extension of $T$ to $\Sigma^+$, then every model $M$ of $T$ has a unique expansion $M^+$ that is a model of $T^+$.

Theorem 3.1 provides a semantic sense in which a definitional extension $T^+$ “says no more” than the original theory $T$. The models of $T^+$ are completely determined by the models of $T$.
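The idea behind Theorem 3.1 can be seen concretely for a predicate definition on a finite model: the defining formula forces the interpretation of the new symbol. In the sketch below, the $\Sigma$-formula $\phi$ is represented by a Python callable, which is a simplifying assumption of this example, and the three-element linear order is hypothetical.

```python
from itertools import product

def expand_with_predicate(domain, arity, phi):
    """Interpretation forced by forall x (p(x) <-> phi(x)):
    p holds of exactly the tuples that satisfy phi."""
    return {tup for tup in product(domain, repeat=arity) if phi(*tup)}

# Hypothetical model of the theory of linear orders in the signature {<}.
domain = {0, 1, 2}
less = {(a, b) for a in domain for b in domain if a < b}

# Explicit definition: forall x forall y (x <= y <-> (x < y or x = y)).
leq = expand_with_predicate(domain, 2,
                            lambda x, y: (x, y) in less or x == y)
```

Any other choice for the extension of the new predicate would falsify the explicit definition, which is why the expansion is unique.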

In order to state the second result, we need to introduce some terminology. Let $\Sigma \subset \Sigma^+$ be signatures. A $\Sigma^+$-theory $T^+$ is an extension of a $\Sigma$-theory $T$ if $T \vDash \phi$ implies that $T^+ \vDash \phi$ for every $\Sigma$-sentence $\phi$. A $\Sigma^+$-theory $T^+$ is a conservative extension of a $\Sigma$-theory $T$ if $T \vDash \phi$ if and only if $T^+ \vDash \phi$ for every $\Sigma$-sentence $\phi$. All conservative extensions are extensions, but in general the converse does not hold. We have the following simple result about definitional extensions.

Theorem 3.2. If $T^+$ is a definitional extension of $T$, then $T^+$ is a conservative extension of $T$.

If $T^+$ is a conservative extension of $T$, then $T^+$ entails precisely the same $\Sigma$-sentences as $T$. Theorem 3.2 therefore shows that a definitional extension $T^+$ “says no more” in the signature $\Sigma$ than the original theory $T$ does.

The third result shows something stronger. If $T^+$ is a definitional extension of $T$, then every $\Sigma^+$-formula $\phi(x_1, \ldots, x_n)$ can be “translated” into an equivalent $\Sigma$-formula $\phi^*(x_1, \ldots, x_n)$. The theory $T^+$ might use some new language that $T$ did not use, but everything that $T^+$ says with this new language can be “translated” back into the old language of $T$. This result captures another robust sense in which the theory $T^+$ “says no more” than the theory $T$.

Theorem 3.3. Let $\Sigma \subset \Sigma^+$ be signatures and $T$ a $\Sigma$-theory. If $T^+$ is a definitional extension of $T$ to $\Sigma^+$ then for every $\Sigma^+$-formula $\phi(x_1, \ldots, x_n)$ there is a $\Sigma$-formula $\phi^*(x_1, \ldots, x_n)$ such that $T^+ \vDash \forall_{\sigma_1} x_1 \ldots \forall_{\sigma_n} x_n \big(\phi(x_1, \ldots, x_n) \leftrightarrow \phi^*(x_1, \ldots, x_n)\big)$.

These results capture three different senses in which a definitional extension has the same expressive power as the original theory. With this in mind, we have the resources necessary to state definitional equivalence.

Definition. Let $T_1$ be a $\Sigma_1$-theory and $T_2$ be a $\Sigma_2$-theory. $T_1$ and $T_2$ are definitionally equivalent if there are theories $T_1^+$ and $T_2^+$ that satisfy the following three conditions:

• $T_1^+$ is a definitional extension of $T_1$,

• $T_2^+$ is a definitional extension of $T_2$,

• $T_1^+$ and $T_2^+$ are logically equivalent $\Sigma_1 \cup \Sigma_2$-theories.

robust sense in which theories with a common definitional extension “say the same thing,” even though they might be formulated in different signatures.

One trivially sees that if two theories are logically equivalent, then they are definitionally equivalent. But there are many examples of theories that are definitionally equivalent and not logically equivalent. The theory of groups formulated in the signature $\{\cdot, e\}$ is definitionally equivalent to the theory of groups formulated in the signature $\{\cdot, {}^{-1}\}$. And likewise, the theory of linear orders formulated in the signature $\{<\}$ is definitionally equivalent to the theory of linear orders formulated in the signature $\{\leq\}$. Definitional equivalence is therefore a weaker criterion for theoretical equivalence than logical equivalence. It is capable of capturing a sense in which theories formulated in different signatures might nonetheless be equivalent.

4 Morita equivalence

Definitional equivalence, however, is incapable of capturing any sense in which theories formulated with different sorts might be equivalent. We have provided no way of defining new sort symbols. One can therefore easily verify that if $T_1$ and $T_2$ are definitionally equivalent, then it must be that $\Sigma_1$ and $\Sigma_2$ have the same sort symbols. There are many theories with different sort symbols, however, that one has good reason to consider equivalent.

One particularly famous example of this is Euclidean geometry. It can be formulated with only a sort of “points” (Tarski, 1959), with only a sort of “lines” (Schwabhäuser and Szczerba, 1975), or with both a sort of “points” and a sort of “lines” (Hilbert, 1930).⁵ Category theory can also be formulated using different sorts. The standard formulation uses both a sort of “objects” and a sort of “arrows” (Eilenberg and Mac Lane, 1942, 1945). But it is well known that category theory can instead be formulated using only a sort of “arrows” (Mac Lane, 1948).⁶ Since these formulations use different sort symbols, definitional equivalence does not capture any sense in which they are equivalent.

In addition to these two famous examples, we have the following simple example.

Example 1. Let $\Sigma_1 = \{\sigma_1, p, q\}$ and $\Sigma_2 = \{\sigma_2, \sigma_3\}$ be signatures with $\sigma_1$, $\sigma_2$, and $\sigma_3$ sort symbols and $p$ and $q$ predicate symbols of arity $\sigma_1$. Consider the $\Sigma_1$-theory

$$T_1 = \big\{\exists_{\sigma_1} x_1\, p(x_1),\ \exists_{\sigma_1} x_2\, q(x_2),\ \forall_{\sigma_1} x_1 (p(x_1) \vee q(x_1)),\ \forall_{\sigma_1} x_1 \neg(p(x_1) \wedge q(x_1))\big\}$$

and the $\Sigma_2$-theory $T_2 = \emptyset$. Since the signatures $\Sigma_1$ and $\Sigma_2$ have different sort symbols, $T_1$ and $T_2$ are not definitionally equivalent.

⁵ Szczerba (1977) and Schwabhäuser et al. (1983, Proposition 4.59, Proposition 4.89) discuss the relationships between these formulations.

Even though $T_1$ and $T_2$ are not definitionally equivalent, one still has good reason to consider them equivalent. The theory $T_1$ partitions everything into the things that are $p$ and the things that are $q$. Similarly, the theory $T_2$ partitions everything into the things of sort $\sigma_2$ and the things of sort $\sigma_3$. Both $T_1$ and $T_2$ say “there are two kinds of things.” The only difference between them is that $T_1$ uses predicates to say this, while $T_2$ uses sorts.

These examples all show that definitional equivalence does not capture the sense in which some theories are equivalent. If one wants to capture this sense, one needs a more general criterion for theoretical equivalence than definitional equivalence. Our aim here is to introduce one such criterion. We will call it Morita equivalence.⁷ This criterion is a natural generalization of definitional equivalence. In fact, Morita equivalence is essentially the same as definitional equivalence, except that it allows one to define new sort symbols in addition to new predicate symbols, function symbols, and constant symbols. In order to state the criterion precisely, we again need to do some work. We begin by defining the concept of a Morita extension. We then make precise the sense in which Morita equivalence is a natural generalization of definitional equivalence by proving analogues of Theorems 3.1, 3.2 and 3.3.

4.1 Morita extensions

As we did for predicates, functions, and constants, we need to say how to define new sorts. Let $\Sigma \subset \Sigma^+$ be signatures and consider a sort symbol $\sigma \in \Sigma^+ - \Sigma$. One can define the sort $\sigma$ as a product sort, a coproduct sort, a subsort, or a quotient sort. In each case, one defines $\sigma$ using old sorts in $\Sigma$ and new function symbols in $\Sigma^+ - \Sigma$. These new function symbols specify how the new sort $\sigma$ is related to the old sorts in $\Sigma$. We describe these four cases in detail.

In order to define $\sigma$ as a product sort, one needs two function symbols $\pi_1, \pi_2 \in \Sigma^+ - \Sigma$ with $\pi_1$ of arity $\sigma \to \sigma_1$, $\pi_2$ of arity $\sigma \to \sigma_2$, and $\sigma_1, \sigma_2 \in \Sigma$. The function symbols $\pi_1$ and $\pi_2$ serve as the “canonical projections” associated with the product sort $\sigma$. An explicit definition of the symbols $\sigma$, $\pi_1$, and $\pi_2$ as a product sort in terms of $\Sigma$ is a $\Sigma^+$-sentence of the form

$$\forall_{\sigma_1} x \forall_{\sigma_2} y \exists_{\sigma=1} z \big(\pi_1(z) = x \wedge \pi_2(z) = y\big)$$

One should think of a product sort $\sigma$ as the sort whose elements are ordered pairs, where the first element of each pair is of sort $\sigma_1$ and the second is of sort $\sigma_2$.
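On finite sets, the product-sort definition can be checked directly. The following is a minimal sketch in which the sets `M1` and `M2` stand in for the interpretations of $\sigma_1$ and $\sigma_2$; these sets and the function names are hypothetical.

```python
from itertools import product

def product_sort(M1, M2):
    """Interpret the new sort as ordered pairs, with the two projections."""
    elems = set(product(M1, M2))
    pi1 = {z: z[0] for z in elems}
    pi2 = {z: z[1] for z in elems}
    return elems, pi1, pi2

def satisfies_product_definition(M1, M2, elems, pi1, pi2):
    """Check: for all x, y there is exactly one z
    with pi1(z) = x and pi2(z) = y."""
    return all(
        sum(1 for z in elems if pi1[z] == x and pi2[z] == y) == 1
        for x in M1 for y in M2
    )

M1 = {"a", "b"}
M2 = {0, 1, 2}
elems, pi1, pi2 = product_sort(M1, M2)
```

Removing a pair from `elems` or adding a duplicate with the same projections would make the check fail, which reflects the unique-existence quantifier in the defining sentence.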

One can also define $\sigma$ as a coproduct sort. One again needs two function symbols $\rho_1, \rho_2 \in \Sigma^+ - \Sigma$ with $\rho_1$ of arity $\sigma_1 \to \sigma$, $\rho_2$ of arity $\sigma_2 \to \sigma$, and $\sigma_1, \sigma_2 \in \Sigma$. The function symbols $\rho_1$ and $\rho_2$ are the “canonical injections” associated with the coproduct sort $\sigma$. An explicit definition of the symbols $\sigma$, $\rho_1$, and $\rho_2$ as a coproduct sort in terms of $\Sigma$ is a $\Sigma^+$-sentence of the form

$$\forall_\sigma z \big(\exists_{\sigma_1 = 1} x (\rho_1(x) = z) \vee \exists_{\sigma_2 = 1} y (\rho_2(y) = z)\big) \wedge \forall_{\sigma_1} x \forall_{\sigma_2} y\, \neg\big(\rho_1(x) = \rho_2(y)\big)$$

One should think of a coproduct sort $\sigma$ as the disjoint union of the elements of sorts $\sigma_1$ and $\sigma_2$.

⁷ This criterion is already familiar in certain circles of logicians. See Andréka et al. (2008).
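The coproduct-sort definition can likewise be checked on finite sets. In this sketch the disjoint union is built with tags `1` and `2`; the tagging scheme, the sets `M1` and `M2`, and the function names are assumptions made for illustration.

```python
def coproduct_sort(M1, M2):
    """Interpret the new sort as a tagged disjoint union, with injections."""
    rho1 = {x: (1, x) for x in M1}
    rho2 = {y: (2, y) for y in M2}
    elems = set(rho1.values()) | set(rho2.values())
    return elems, rho1, rho2

def satisfies_coproduct_definition(M1, M2, elems, rho1, rho2):
    """Check both conjuncts of the defining sentence."""
    def unique_preimage(rho, domain, z):
        return sum(1 for a in domain if rho[a] == z) == 1

    # Every z has a unique preimage under rho1 or a unique one under rho2.
    covered = all(
        unique_preimage(rho1, M1, z) or unique_preimage(rho2, M2, z)
        for z in elems
    )
    # The images of rho1 and rho2 do not overlap.
    disjoint = all(rho1[x] != rho2[y] for x in M1 for y in M2)
    return covered and disjoint

M1 = {"a", "b"}
M2 = {0, 1}
elems, rho1, rho2 = coproduct_sort(M1, M2)
```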

When defining a new sort $\sigma$ as a product sort or a coproduct sort, one uses two sort symbols in $\Sigma$ and two function symbols in $\Sigma^+ - \Sigma$. The next two ways of defining a new sort $\sigma$ only require one sort symbol in $\Sigma$ and one function symbol in $\Sigma^+ - \Sigma$.

In order to define $\sigma$ as a subsort, one needs a function symbol $i \in \Sigma^+ - \Sigma$ of arity $\sigma \to \sigma_1$ with $\sigma_1 \in \Sigma$. The function symbol $i$ is the “canonical inclusion” associated with the subsort $\sigma$. An explicit definition of the symbols $\sigma$ and $i$ as a subsort in terms of $\Sigma$ is a $\Sigma^+$-sentence of the form

$$\forall_{\sigma_1} x \big(\phi(x) \leftrightarrow \exists_\sigma z (i(z) = x)\big) \wedge \forall_\sigma z_1 \forall_\sigma z_2 \big(i(z_1) = i(z_2) \to z_1 = z_2\big) \tag{3}$$

where $\phi(x)$ is a $\Sigma$-formula. One can think of the subsort $\sigma$ as consisting of “the elements of sort $\sigma_1$ that are $\phi$.” The sentence (3) entails the $\Sigma$-sentence $\exists_{\sigma_1} x\, \phi(x)$. As before, we will call this $\Sigma$-sentence the admissibility condition for the definition (3).

Lastly, in order to define $\sigma$ as a quotient sort one needs a function symbol $\epsilon \in \Sigma^+ - \Sigma$ of arity $\sigma_1 \to \sigma$ with $\sigma_1 \in \Sigma$. An explicit definition of the symbols $\sigma$ and $\epsilon$ as a quotient sort in terms of $\Sigma$ is a $\Sigma^+$-sentence of the form

$$\forall_{\sigma_1} x_1 \forall_{\sigma_1} x_2 \big(\epsilon(x_1) = \epsilon(x_2) \leftrightarrow \phi(x_1, x_2)\big) \wedge \forall_\sigma z \exists_{\sigma_1} x \big(\epsilon(x) = z\big) \tag{4}$$

where $\phi(x_1, x_2)$ is a $\Sigma$-formula. This sentence defines $\sigma$ as a quotient sort that is obtained by “quotienting out” the sort $\sigma_1$ with respect to the formula $\phi(x_1, x_2)$. The sort $\sigma$ should be thought of as the set of “equivalence classes of elements of $\sigma_1$ with respect to the relation $\phi(x_1, x_2)$.” The function symbol $\epsilon$ is the “canonical projection” that maps an element to its equivalence class. One can verify that the sentence (4) implies that $\phi(x_1, x_2)$ is an equivalence relation. In particular, it entails the following $\Sigma$-sentences:

$$\forall_{\sigma_1} x\, \phi(x, x)$$

$$\forall_{\sigma_1} x_1 \forall_{\sigma_1} x_2 \big(\phi(x_1, x_2) \to \phi(x_2, x_1)\big)$$

$$\forall_{\sigma_1} x_1 \forall_{\sigma_1} x_2 \forall_{\sigma_1} x_3 \big((\phi(x_1, x_2) \wedge \phi(x_2, x_3)) \to \phi(x_1, x_3)\big)$$

These $\Sigma$-sentences are the admissibility conditions for the definition (4).

Now that we have presented the four ways of defining new sort symbols, we can define the concept of a Morita extension. A Morita extension is a natural generalization of a definitional extension. The only difference is that now one is allowed to define new sort symbols. Let $\Sigma \subset \Sigma^+$ be signatures and $T$ a $\Sigma$-theory. A Morita extension of $T$ to the signature $\Sigma^+$ is a $\Sigma^+$-theory

$$T^+ = T \cup \{\delta_s : s \in \Sigma^+ - \Sigma\}$$

that satisfies the following conditions. First, for each symbol $s \in \Sigma^+ - \Sigma$ the sentence $\delta_s$ is an explicit definition of $s$ in terms of $\Sigma$. Second, if $\sigma \in \Sigma^+ - \Sigma$ is a sort symbol and $f \in \Sigma^+ - \Sigma$ is a function symbol that is used in the explicit definition of $\sigma$, then $\delta_f = \delta_\sigma$. (For example, if $\sigma$ is defined as a product sort with projections $\pi_1$ and $\pi_2$, then $\delta_\sigma = \delta_{\pi_1} = \delta_{\pi_2}$.) And third, if $\alpha_s$ is an admissibility condition for a definition $\delta_s$, then $T \vDash \alpha_s$.
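The role of admissibility conditions can be illustrated with the quotient-sort construction on a finite set. In the sketch below the $\Sigma$-formula $\phi(x_1, x_2)$ is represented by a Python callable and the domain is a small set of integers; both are assumptions of this example. The construction is refused unless the three admissibility conditions (reflexivity, symmetry, transitivity) hold.

```python
def is_equivalence(domain, phi):
    """Check the three admissibility conditions for a quotient sort."""
    reflexive = all(phi(x, x) for x in domain)
    symmetric = all(phi(y, x) for x in domain for y in domain if phi(x, y))
    transitive = all(
        phi(x, z)
        for x in domain for y in domain for z in domain
        if phi(x, y) and phi(y, z)
    )
    return reflexive and symmetric and transitive

def quotient_sort(domain, phi):
    """Interpret the new sort as equivalence classes, with the projection."""
    domain = list(domain)
    if not is_equivalence(domain, phi):
        raise ValueError("admissibility conditions fail")
    # Each element is mapped to its equivalence class.
    proj = {x: frozenset(y for y in domain if phi(x, y)) for x in domain}
    return set(proj.values()), proj

def same_mod3(x, y):
    return x % 3 == y % 3

classes, proj = quotient_sort(range(6), same_mod3)
```

By construction every class is the image of some element, and two elements have the same image exactly when `phi` relates them, which are the two conjuncts of the defining sentence.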

Note that unlike a definitional extension of a theory, a Morita extension can have more sort symbols than the original theory.⁸ The following is a particularly simple example of a Morita extension.

Example 2. Let $\Sigma = \{\sigma, p\}$ and $\Sigma^+ = \{\sigma, \sigma^+, p, i\}$ be signatures with $\sigma$ and $\sigma^+$ sort symbols, $p$ a predicate symbol of arity $\sigma$, and $i$ a function symbol of arity $\sigma^+ \to \sigma$. Consider the $\Sigma$-theory $T = \{\exists_\sigma x\, p(x)\}$. The following $\Sigma^+$-sentence defines the sort symbol $\sigma^+$ as the subsort consisting of “the elements that are $p$.”

$$\forall_\sigma x \big(p(x) \leftrightarrow \exists_{\sigma^+} z (i(z) = x)\big) \wedge \forall_{\sigma^+} z_1 \forall_{\sigma^+} z_2 \big(i(z_1) = i(z_2) \to z_1 = z_2\big) \tag{$\delta_{\sigma^+}$}$$

The $\Sigma^+$-theory $T^+ = T \cup \{\delta_{\sigma^+}\}$ is a Morita extension of $T$ to the signature $\Sigma^+$. The theory $T^+$ adds to the theory $T$ the ability to quantify over the set of “things that are $p$.”
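Example 2 can be made concrete on a small finite model. The sketch below builds an interpretation of the new sort and its inclusion for a hypothetical four-element model of $T$; the element names, the `"sub"` tag used to keep the two sorts disjoint, and the function names are all assumptions of this illustration.

```python
def subsort(M_sigma, p):
    """Interpret the new sort as tagged copies of the elements that are p."""
    new_sort = {("sub", a) for a in M_sigma if a in p}
    inclusion = {z: z[1] for z in new_sort}
    return new_sort, inclusion

def satisfies_subsort_definition(M_sigma, p, new_sort, inclusion):
    """Check both conjuncts of the defining sentence for the subsort."""
    # p(x) holds exactly when x is in the image of the inclusion.
    image_is_p = all(
        (x in p) == any(inclusion[z] == x for z in new_sort)
        for x in M_sigma
    )
    # The inclusion is injective.
    injective = len(set(inclusion.values())) == len(inclusion)
    return image_is_p and injective

# Hypothetical model of T = { exists x p(x) }: p must be nonempty.
M_sigma = {0, 1, 2, 3}
p = {0, 2}
new_sort, inclusion = subsort(M_sigma, p)
```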

4.2 Three results

As with a definitional extension, a Morita extension “says no more” than the original theory. We will make this idea precise by proving analogues of Theorems 3.1, 3.2, and 3.3. These three results also demonstrate how closely related the concept of a Morita extension is to that of a definitional extension.

Theorem 3.1 generalizes in a perfectly natural way. When $T^+$ is a Morita extension of $T$, the models of $T^+$ are “determined” by the models of $T$.

Theorem 4.1. Let $\Sigma \subset \Sigma^+$ be signatures and $T$ a $\Sigma$-theory. If $T^+$ is a Morita extension of $T$ to $\Sigma^+$, then every model $M$ of $T$ has a unique expansion (up to isomorphism) $M^+$ that is a model of $T^+$.

Before proving Theorem 4.1, we introduce some notation and prove a lemma. Suppose that a $\Sigma^+$-theory $T^+$ is a Morita extension of a $\Sigma$-theory $T$. Let $M$ and $N$ be models of $T^+$ with $h : M|_\Sigma \to N|_\Sigma$ an elementary embedding between the $\Sigma$-structures $M|_\Sigma$ and $N|_\Sigma$. The elementary embedding $h$ naturally induces a map $h^+ : M \to N$ between the $\Sigma^+$-structures $M$ and $N$.

⁸ Also note that if $T^+$ is a Morita extension of $T$ to $\Sigma^+$, then there are restrictions on the arities of predicates, functions, and constants in $\Sigma^+ - \Sigma$. If $p \in \Sigma^+ - \Sigma$ is a predicate symbol of arity $\sigma_1 \times \ldots \times \sigma_n$, we immediately see that $\sigma_1, \ldots, \sigma_n \in \Sigma$. Taking a single Morita extension does not allow one to define predicate symbols that apply to sorts that are not in $\Sigma$. One must take multiple Morita extensions to do this. Likewise, any constant symbol $c \in \Sigma^+ - \Sigma$ must be of sort $\sigma \in \Sigma$. And a function symbol $f \in \Sigma^+ - \Sigma$ must either have arity $\sigma_1 \times \ldots \times \sigma_n \to \sigma$ with $\sigma_1, \ldots, \sigma_n, \sigma \in \Sigma$, or $f$ must be one of the function symbols that

We know that $h$ is a family of maps $h_\sigma : M_\sigma \to N_\sigma$ for each sort $\sigma \in \Sigma$. In order to describe $h^+$ we need to describe the map $h^+_\sigma : M_\sigma \to N_\sigma$ for each sort $\sigma \in \Sigma^+$. If $\sigma \in \Sigma$, we simply let $h^+_\sigma = h_\sigma$. On the other hand, when $\sigma \in \Sigma^+ - \Sigma$, there are four cases to consider. We describe $h^+_\sigma$ in the cases where the theory $T^+$ defines $\sigma$ as a product sort or a subsort. The coproduct and quotient sort cases are described analogously.

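First, suppose that $T^+$ defines $\sigma$ as a product sort. Let $\pi_1, \pi_2 \in \Sigma^+$ be the projections of arity $\sigma \to \sigma_1$ and $\sigma \to \sigma_2$ with $\sigma_1, \sigma_2 \in \Sigma$. The definition of the function $h^+_\sigma$ is suggested by the following diagram.

[Diagram: for $i = 1, 2$, the square formed by $h^+_\sigma : M_\sigma \to N_\sigma$, the projections $\pi_i^M : M_\sigma \to M_{\sigma_i}$ and $\pi_i^N : N_\sigma \to N_{\sigma_i}$, and the map $h^+_{\sigma_i} : M_{\sigma_i} \to N_{\sigma_i}$.]

Let $m \in M_\sigma$. We define $h^+_\sigma(m)$ to be the unique $n \in N_\sigma$ that satisfies both $\pi_1^N(n) = h^+_{\sigma_1} \circ \pi_1^M(m)$ and $\pi_2^N(n) = h^+_{\sigma_2} \circ \pi_2^M(m)$. We know that such an $n$ exists and is unique because $N$ is a model of $T^+$ and $T^+$ defines the symbols $\sigma$, $\pi_1$, and $\pi_2$ to be a product sort. One can verify that this definition of $h^+_\sigma$ makes the above diagram commute.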
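The definition of $h^+_\sigma$ in the product-sort case can be computed directly for finite models: for each $m$, search for the unique $n$ whose projections agree with the images of the projections of $m$. The sketch below uses hypothetical finite data; in particular, the elements of the product sort in `N` are deliberately named by strings rather than literal pairs, since a model of $T^+$ need not interpret the product sort by actual ordered pairs.

```python
def induced_map(M_sigma, pi1_M, pi2_M, N_sigma, pi1_N, pi2_N, h1, h2):
    """Compute h+ on a product sort from the maps h1, h2 on the factors."""
    h_plus = {}
    for m in M_sigma:
        targets = [
            n for n in N_sigma
            if pi1_N[n] == h1[pi1_M[m]] and pi2_N[n] == h2[pi2_M[m]]
        ]
        if len(targets) != 1:
            # This cannot happen when N satisfies the product-sort definition.
            raise ValueError("N does not interpret the sort as a product")
        h_plus[m] = targets[0]
    return h_plus

# Hypothetical data: factor maps and product sorts in M and N.
h1 = {0: 10, 1: 11}
h2 = {"a": "A"}
M_sigma = {(0, "a"), (1, "a")}
pi1_M = {m: m[0] for m in M_sigma}
pi2_M = {m: m[1] for m in M_sigma}
N_sigma = {"10A", "11A"}
pi1_N = {"10A": 10, "11A": 11}
pi2_N = {"10A": "A", "11A": "A"}

h_plus = induced_map(M_sigma, pi1_M, pi2_M, N_sigma, pi1_N, pi2_N, h1, h2)
```

The commutativity of the two squares amounts to `pi1_N[h_plus[m]] == h1[pi1_M[m]]` and the analogous equation for the second projection, for every `m`.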

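Suppose, on the other hand, that $T^+$ defines $\sigma$ as the subsort of “elements of sort $\sigma_1$ that are $\phi$.” Let $i \in \Sigma^+$ be the inclusion map of arity $\sigma \to \sigma_1$ with $\sigma_1 \in \Sigma$. As above, the definition of $h^+_\sigma$ is suggested by the following diagram.

[Diagram: the square formed by $h^+_\sigma : M_\sigma \to N_\sigma$, the inclusions $i^M : M_\sigma \to M_{\sigma_1}$ and $i^N : N_\sigma \to N_{\sigma_1}$, and the map $h^+_{\sigma_1} : M_{\sigma_1} \to N_{\sigma_1}$.]

Let $m \in M_\sigma$. We see that the following implications hold:

$$M \vDash \phi[i^M(m)] \Rightarrow M|_\Sigma \vDash \phi[i^M(m)] \Rightarrow N|_\Sigma \vDash \phi[h^+_{\sigma_1}(i^M(m))] \Rightarrow N \vDash \phi[h^+_{\sigma_1}(i^M(m))]$$

The first and third implications hold since $\phi(x)$ is a $\Sigma$-formula, and the second holds because $h_{\sigma_1} = h^+_{\sigma_1}$ and $h$ is an elementary embedding. $T^+$ defines the symbols $i$ and $\sigma$ as a subsort and $M$ is a model of $T^+$, so it must be that $M \vDash \phi[i^M(m)]$. By the above implications, we see that $N \vDash \phi[h^+_{\sigma_1}(i^M(m))]$. Since $N$ is also a model of $T^+$, there is a unique $n \in N_\sigma$ that satisfies $i^N(n) = h^+_{\sigma_1}(i^M(m))$. We define $h^+_\sigma(m) = n$. This definition of $h^+_\sigma$ again makes the above diagram commute.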

When $T^+$ defines $\sigma$ as a coproduct sort or a quotient sort one describes the map $h^+_\sigma$ analogously. For the purposes of proving Theorem 4.1, we need the following simple lemma about this map $h^+$.

Lemma 4.1. If $h : M|_\Sigma \to N|_\Sigma$ is an isomorphism, then $h^+ : M \to N$ is an isomorphism.

Proof. We know that $h_\sigma : M_\sigma \to N_\sigma$ is a bijection for each $\sigma \in \Sigma$. Using this fact and the definition of $h^+$, one can verify that $h^+_\sigma : M_\sigma \to N_\sigma$ is a bijection for each sort $\sigma \in \Sigma^+$. So $h^+$ is a family of bijections. And furthermore, the commutativity of the above diagrams implies that $h^+$ preserves any function symbols that are used to define new sorts.

It only remains to check that $h^+$ preserves predicates, functions, and constants that have arities and sorts in $\Sigma$. Since $h : M|_\Sigma \to N|_\Sigma$ is an isomorphism, we know that $h^+$ preserves the symbols in $\Sigma$. So let $p \in \Sigma^+ - \Sigma$ be a predicate symbol of arity $\sigma_1 \times \ldots \times \sigma_n$ with $\sigma_1, \ldots, \sigma_n \in \Sigma$. There must be a $\Sigma$-formula $\phi(x_1, \ldots, x_n)$ such that $T^+ \vDash \forall_{\sigma_1} x_1 \ldots \forall_{\sigma_n} x_n \big(p(x_1, \ldots, x_n) \leftrightarrow \phi(x_1, \ldots, x_n)\big)$. We know that $h : M|_\Sigma \to N|_\Sigma$ is an elementary embedding, so in particular it preserves the formula $\phi(x_1, \ldots, x_n)$. This implies that $(m_1, \ldots, m_n) \in p^M$ if and only if $(h_{\sigma_1}(m_1), \ldots, h_{\sigma_n}(m_n)) \in p^N$. Since $h^+_{\sigma_i} = h_{\sigma_i}$ for each $i = 1, \ldots, n$, it must be that $h^+$ also preserves the predicate $p$. An analogous argument demonstrates that $h^+$ preserves functions and constants.

We now turn to the proof of Theorem 4.1.

Proof of Theorem 4.1. Let $M$ be a model of $T$. First note that if $M^+$ exists, then it is unique up to isomorphism. For if $N$ is a model of $T^+$ with $N|_\Sigma = M$, then by letting $h$ be the identity map (which is an isomorphism) Lemma 4.1 implies that $M^+ \cong N$. We need only define the $\Sigma^+$-structure $M^+$. To guarantee that $M^+$ is an expansion of $M$ we interpret every symbol in $\Sigma$ the same way that $M$ does. We need to say how the symbols in $\Sigma^+ - \Sigma$ are interpreted. There are a number of cases to consider.

Suppose that $p \in \Sigma^+ - \Sigma$ is a predicate symbol of arity $\sigma_1 \times \ldots \times \sigma_n$ with $\sigma_1, \ldots, \sigma_n \in \Sigma$. There must be a $\Sigma$-formula $\phi(x_1, \ldots, x_n)$ such that $T^+ \vDash \forall_{\sigma_1} x_1 \ldots \forall_{\sigma_n} x_n \big(p(x_1, \ldots, x_n) \leftrightarrow \phi(x_1, \ldots, x_n)\big)$. We define the interpretation of the symbol $p$ in $M^+$ by letting $(a_1, \ldots, a_n) \in p^{M^+}$ if and only if $M \vDash \phi[a_1, \ldots, a_n]$. It is easy to see that this definition of $p^{M^+}$ implies that $M^+ \vDash \delta_p$. The cases of function and constant symbols are handled similarly.

Let $\sigma \in \Sigma^+ - \Sigma$ be a sort symbol. We describe the cases where $T^+$ defines $\sigma$ as a product sort or a subsort. The coproduct and quotient sort cases follow analogously. Suppose first that $\sigma$ is defined as a product sort with $\pi_1$ and $\pi_2$ the projections of arity $\sigma \to \sigma_1$ and $\sigma \to \sigma_2$, respectively. We define $M^+_\sigma = M^+_{\sigma_1} \times M^+_{\sigma_2}$ with $\pi_1^{M^+} : M^+_\sigma \to M^+_{\sigma_1}$ and $\pi_2^{M^+} : M^+_\sigma \to M^+_{\sigma_2}$ the canonical projections. One can easily verify that $M^+ \vDash \delta_\sigma$. On the other hand, suppose that $\sigma$ is defined as a subsort with defining $\Sigma$-formula $\phi(x)$ and inclusion $i$ of arity $\sigma \to \sigma_1$. We define $M^+_\sigma = \{a \in M_{\sigma_1} : M \vDash \phi[a]\}$ with $i^{M^+} : M^+_\sigma \to M^+_{\sigma_1}$ the inclusion map. One can again verify that $M^+ \vDash \delta_\sigma$.

We have shown that the exact analogue of Theorem 3.1 holds for Morita extensions. Theorem 3.2 also generalizes in a perfectly natural way. Indeed, the generalization follows as a simple corollary to Theorem 4.1.

Theorem 4.2. If $T^+$ is a Morita extension of $T$, then $T^+$ is a conservative extension of $T$.

Proof. Suppose that $T^+$ is not a conservative extension of $T$. One can easily see that $T \vDash \phi$ implies that $T^+ \vDash \phi$ for every $\Sigma$-sentence $\phi$. So there must be some $\Sigma$-sentence $\phi$ such that $T^+ \vDash \phi$, but $T \nvDash \phi$. This implies that there is a model $M$ of $T$ such that $M \vDash \neg\phi$. This model $M$ has no expansion that is a model of $T^+$ since $T^+ \vDash \phi$, contradicting Theorem 4.1.

Theorems 3.1 and 3.2 therefore generalize naturally from definitional extensions to Morita extensions. In order to generalize Theorem 3.3, however, we need to do some work. Theorem 3.3 said that if $T^+$ is a definitional extension of $T$ to $\Sigma^+$, then for every $\Sigma^+$-formula $\phi(x_1, \ldots, x_n)$ there is a corresponding $\Sigma$-formula $\phi^*(x_1, \ldots, x_n)$ that is equivalent to $\phi(x_1, \ldots, x_n)$ according to the theory $T^+$. The following example demonstrates that this result does not generalize to the case of Morita extensions in a perfectly straightforward manner.

Example 3. Recall the theories $T$ and $T^+$ from Example 2 and consider the $\Sigma^+$-formula $\phi(x, z)$ defined by $i(z) = x$. One can easily see that there is no $\Sigma$-formula $\phi^*(x, z)$ that is equivalent to $\phi(x, z)$ according to the theory $T^+$. Indeed, the variable $z$ cannot appear in any $\Sigma$-formula since it is of sort $\sigma^+ \in \Sigma^+ - \Sigma$. A $\Sigma$-formula simply cannot say how variables with sorts in $\Sigma$ relate to variables with sorts in $\Sigma^+$.

In order to generalize Theorem 3.3, therefore, we need a way of specifying how variables with sorts in $\Sigma^+ - \Sigma$ relate to variables with sorts in $\Sigma$. We do this by defining the concept of a “code.”⁹ Let $\Sigma \subset \Sigma^+$ be signatures with $T$ a $\Sigma$-theory and $T^+$ a Morita extension of $T$ to $\Sigma^+$. A code for the variables $x_1, \ldots, x_n$ of sorts $\sigma_1, \ldots, \sigma_n \in \Sigma^+ - \Sigma$ is a $\Sigma^+$-formula

$$\xi_1(x_1, y_{11}, y_{12}) \land \ldots \land \xi_n(x_n, y_{n1}, y_{n2})$$

where the conjuncts $\xi_i$ are defined as follows. Suppose that $T^+$ defines $\sigma_i$ as a product sort with $\pi_1$ and $\pi_2$ the projections of arity $\sigma_i \to \sigma_{i1}$ and $\sigma_i \to \sigma_{i2}$. The conjunct $\xi_i(x_i, y_{i1}, y_{i2})$ is then the $\Sigma^+$-formula $\pi_1(x_i) = y_{i1} \land \pi_2(x_i) = y_{i2}$, where $y_{i1}$ and $y_{i2}$ are variables of sorts $\sigma_{i1}, \sigma_{i2} \in \Sigma$. On the other hand, suppose that $T^+$ defines $\sigma_i$ as a coproduct sort with injections $\rho_1$ and $\rho_2$ of arity $\sigma_{i1} \to \sigma_i$ and $\sigma_{i2} \to \sigma_i$. Then the conjunct $\xi_i$ is either the $\Sigma^+$-formula $\rho_1(y_{i1}) = x_i$ or the $\Sigma^+$-formula $\rho_2(y_{i2}) = x_i$, where $y_{i1}$ and $y_{i2}$ are again variables of sorts $\sigma_{i1}, \sigma_{i2} \in \Sigma$.

The subsort and quotient sort cases are handled analogously. Suppose that $T^+$ defines $\sigma_i$ as a subsort with $i$ the inclusion map of arity $\sigma_i \to \sigma_{i1}$. Then the conjunct $\xi_i$ is the $\Sigma^+$-formula $i(x_i) = y_{i1}$, where $y_{i1}$ is a variable of sort $\sigma_{i1} \in \Sigma$. And finally, suppose that $T^+$ defines $\sigma_i$ as a quotient sort with $\epsilon$ the projection of arity $\sigma_{i1} \to \sigma_i$. The conjunct $\xi_i$ is then the $\Sigma^+$-formula $\epsilon(y_{i1}) = x_i$, where $y_{i1}$ is again a variable of sort $\sigma_{i1} \in \Sigma$. Given the empty sequence of variables, we let the empty code be the tautology $\exists_\sigma x\,(x = x)$, where $\sigma \in \Sigma$ is a sort symbol.

We will use the notation $\xi(x_1, \ldots, y_{n2})$ to denote the code $\xi_1(x_1, y_{11}, y_{12}) \land \ldots \land \xi_n(x_n, y_{n1}, y_{n2})$ for the variables $x_1, \ldots, x_n$. Note that the variables $y_{i1}$ and $y_{i2}$ have sorts in $\Sigma$ for each $i = 1, \ldots, n$. One should think of a code $\xi(x_1, \ldots, y_{n2})$ for $x_1, \ldots, x_n$ as encoding one way that the variables $x_1, \ldots, x_n$ with sorts in $\Sigma^+ - \Sigma$ might be related to variables $y_{11}, \ldots, y_{n2}$ that have sorts in $\Sigma$. One additional piece of notation will be useful in what follows. Given a $\Sigma^+$-formula $\phi$, we will write $\phi(x_1, \ldots, x_n, \bar{x}_1, \ldots, \bar{x}_m)$ to indicate that the variables $x_1, \ldots, x_n$ have sorts $\sigma_1, \ldots, \sigma_n \in \Sigma^+ - \Sigma$ and that the variables $\bar{x}_1, \ldots, \bar{x}_m$ have sorts $\bar{\sigma}_1, \ldots, \bar{\sigma}_m \in \Sigma$.
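The definition of a code can be sketched as a small enumeration over the four ways a new sort may be defined. This is a hedged illustration only: formulas are plain strings, `&` stands for conjunction, and the function names and the symbol names `pi1`, `pi2`, `rho1`, `rho2`, `i`, and `eps` (a placeholder for the quotient projection) are assumptions made for the example.

```python
from itertools import product

def conjunct_options(kind, i):
    """List the possible conjuncts xi_i for the variable x_i, as plain strings."""
    x, y1, y2 = f"x{i}", f"y{i}1", f"y{i}2"
    if kind == "product":
        return [f"pi1({x}) = {y1} & pi2({x}) = {y2}"]
    if kind == "coproduct":
        # The only case with a genuine choice: one conjunct per injection.
        return [f"rho1({y1}) = {x}", f"rho2({y2}) = {x}"]
    if kind == "subsort":
        return [f"i({x}) = {y1}"]
    if kind == "quotient":
        return [f"eps({y1}) = {x}"]
    raise ValueError(f"unknown sort definition: {kind}")

def codes(kinds):
    """Enumerate every code for x1..xn, where kinds[i-1] says how the sort of x_i is defined."""
    if not kinds:
        return ["exists x (x = x)"]  # the empty code is a tautology
    options = [conjunct_options(k, i) for i, k in enumerate(kinds, start=1)]
    return [" & ".join(f"({c})" for c in choice) for choice in product(*options)]
```

For example, `codes(["product"])` yields a single code, while every variable of coproduct sort doubles the number of codes, since its conjunct may use either injection.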

We can now state our generalization of Theorem 3.3. One proves this result by induction on the complexity of $\phi(x_1, \ldots, x_n)$. The proof has been placed in an appendix.

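Theorem 4.3. Let $\Sigma \subset \Sigma^+$ be signatures and $T$ a $\Sigma$-theory. Suppose that $T^+$ is a Morita extension of $T$ to $\Sigma^+$ and that $\phi(x_1, \ldots, x_n, \bar{x}_1, \ldots, \bar{x}_m)$ is a $\Sigma^+$-formula. Then for every code $\xi(x_1, \ldots, y_{n2})$ for the variables $x_1, \ldots, x_n$ there is a $\Sigma$-formula $\phi^*(\bar{x}_1, \ldots, \bar{x}_m, y_{11}, \ldots, y_{n2})$ such that

$$T^+ \vDash \forall_{\sigma_1} x_1 \ldots \forall_{\sigma_n} x_n\, \forall_{\bar{\sigma}_1} \bar{x}_1 \ldots \forall_{\bar{\sigma}_m} \bar{x}_m\, \forall_{\sigma_{11}} y_{11} \ldots \forall_{\sigma_{n2}} y_{n2} \Bigl(\xi(x_1, \ldots, y_{n2}) \rightarrow \bigl(\phi(x_1, \ldots, x_n, \bar{x}_1, \ldots, \bar{x}_m) \leftrightarrow \phi^*(\bar{x}_1, \ldots, \bar{x}_m, y_{11}, \ldots, y_{n2})\bigr)\Bigr)$$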

The idea behind Theorem 4.3 is simple. Although one might not initially be able to translate a $\Sigma^+$-formula $\phi$ into an equivalent $\Sigma$-formula $\phi^*$, such a translation is possible after one specifies how the variables in $\phi$ with sorts in $\Sigma^+ - \Sigma$ are related to variables with sorts in $\Sigma$. Theorem 4.3 has the following immediate corollary.

Corollary 4.1. Let $\Sigma \subset \Sigma^+$ be signatures and $T$ a $\Sigma$-theory. If $T^+$ is a Morita extension of $T$ to $\Sigma^+$, then for every $\Sigma^+$-sentence $\phi$ there is a $\Sigma$-sentence $\phi^*$ such that $T^+ \vDash \phi \leftrightarrow \phi^*$.

Proof. Let $\phi$ be a $\Sigma^+$-sentence and consider the empty code $\xi$. Theorem 4.3 implies that there is a $\Sigma$-sentence $\phi^*$ such that $T^+ \vDash \xi \rightarrow (\phi \leftrightarrow \phi^*)$. Since $\xi$ is a tautology we trivially have that $T^+ \vDash \phi \leftrightarrow \phi^*$.

Theorems 4.1, 4.2, and 4.3 capture different senses in which a Morita extension of a theory “says no more” than the original theory. The definition of Morita equivalence is exactly analogous to definitional equivalence.

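Definition. Let $T_1$ be a $\Sigma_1$-theory and $T_2$ a $\Sigma_2$-theory. $T_1$ and $T_2$ are Morita equivalent if there are theories $T_1^1, \ldots, T_1^n$ and $T_2^1, \ldots, T_2^m$ that satisfy the following three conditions:

• Each theory $T_1^{i+1}$ is a Morita extension of $T_1^i$,

• Each theory $T_2^{i+1}$ is a Morita extension of $T_2^i$,

• $T_1^n$ and $T_2^m$ are logically equivalent $\Sigma$-theories with $\Sigma_1 \cup \Sigma_2 \subset \Sigma$.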

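Two theories are Morita equivalent if they have a “common Morita extension.” The situation can be pictured as follows, where each arrow in the figure indicates a Morita extension.

[Figure: two chains of Morita extensions, $T_1 \to T_1^1 \to \cdots \to T_1^n$ and $T_2 \to T_2^1 \to \cdots \to T_2^m$, with $T_1^n \cong T_2^m$.]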

At first glance, Morita equivalence might strike one as different from definitional equivalence in an important way. To show that theories are Morita equivalent, one is allowed to take any finite number of Morita extensions of the theories. On the other hand, to show that two theories are definitionally equivalent, it appears that one is only allowed to take one definitional extension of each theory. One might worry that Morita equivalence is therefore not perfectly analogous to definitional equivalence.

Fortunately, this is not the case. Theorem 3.3 implies that if theories $T_1, \ldots, T_n$ are such that each $T_{i+1}$ is a definitional extension of $T_i$, then $T_n$ is in fact a definitional extension of $T_1$. (One can easily verify that this is not true of Morita extensions.) To show that two theories are definitionally equivalent, therefore, one actually is allowed to take any finite number of definitional extensions of each theory.

If two theories are definitionally equivalent, then they are trivially Morita equivalent. Unlike definitional equivalence, however, Morita equivalence is capable of capturing a sense in which theories with different sort symbols are equivalent. The following example demonstrates that Morita equivalence is a more liberal criterion for theoretical equivalence.

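arity $\sigma_2 \to \sigma_1$ and $\sigma_3 \to \sigma_1$. Consider the following $\Sigma$-sentences.

$$\forall_{\sigma_1} x\,\bigl(p(x) \leftrightarrow \exists_{\sigma_2} y\,(i_2(y) = x)\bigr) \land \forall_{\sigma_2} y_1 \forall_{\sigma_2} y_2\,\bigl(i_2(y_1) = i_2(y_2) \rightarrow y_1 = y_2\bigr) \tag{$\delta_{\sigma_2}$}$$

$$\forall_{\sigma_1} x\,\bigl(q(x) \leftrightarrow \exists_{\sigma_3} z\,(i_3(z) = x)\bigr) \land \forall_{\sigma_3} z_1 \forall_{\sigma_3} z_2\,\bigl(i_3(z_1) = i_3(z_2) \rightarrow z_1 = z_2\bigr) \tag{$\delta_{\sigma_3}$}$$

$$\forall_{\sigma_1} x\,\bigl(\exists_{\sigma_2 = 1} y\,(i_2(y) = x) \lor \exists_{\sigma_3 = 1} z\,(i_3(z) = x)\bigr) \land \forall_{\sigma_2} y \forall_{\sigma_3} z\,\neg\bigl(i_2(y) = i_3(z)\bigr) \tag{$\delta_{\sigma_1}$}$$

$$\forall_{\sigma_1} x\,\bigl(p(x) \leftrightarrow \exists_{\sigma_2} y\,(i_2(y) = x)\bigr) \tag{$\delta_p$}$$

$$\forall_{\sigma_1} x\,\bigl(q(x) \leftrightarrow \exists_{\sigma_3} z\,(i_3(z) = x)\bigr) \tag{$\delta_q$}$$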

The $\Sigma$-theory $T_1^1 = T_1 \cup \{\delta_{\sigma_2}, \delta_{\sigma_3}\}$ is a Morita extension of $T_1$ to the signature $\Sigma$. It defines $\sigma_2$ and $i_2$ to be the subsort of “elements that are $p$” and $\sigma_3$ and $i_3$ to be the subsort of “elements that are $q$.” The theory $T_2^1 = T_2 \cup \{\delta_{\sigma_1}\}$ is a Morita extension of $T_2$ to the signature $\Sigma_2 \cup \{\sigma_1, i_2, i_3\}$. It defines $\sigma_1$ to be the coproduct sort of $\sigma_2$ and $\sigma_3$. Lastly, the $\Sigma$-theory $T_2^2 = T_2^1 \cup \{\delta_p, \delta_q\}$ is a Morita extension of $T_2^1$ to the signature $\Sigma$. It defines the predicates $p$ and $q$ to apply to elements in the “images” of $i_2$ and $i_3$, respectively. One can verify that $T_1^1$ and $T_2^2$ are logically equivalent, so $T_1$ and $T_2$ are Morita equivalent.

5 Categorical Equivalence

Morita equivalence captures a clear and robust sense in which theories might be equivalent, but it is a difficult criterion to apply outside of the framework of first-order logic. Indeed, without a formal language one does not have the resources to say what an explicit definition is. Questions of equivalence and inequivalence of theories, however, still come up outside of this framework. It is well known, for example, that there are different ways of formulating the theory of smooth manifolds (Nestruev, 2002). There are also different formulations of the theory of topological spaces (Kuratowski, 1966). None of these formulations are first-order theories. Physical theories too are rarely formulated in first-order logic, and there are many pairs of physical theories that are often considered equivalent.¹⁰

Morita equivalence is incapable of capturing any sense in which these theories are equivalent. We need a criterion for theoretical equivalence that is applicable outside the framework of first-order logic. Categorical equivalence is one such criterion.¹¹ It was first described by Eilenberg and Mac Lane (1942, 1945), but was only recently introduced into philosophy of science by Halvorson (2012, 2015) and Weatherall (2015a). In this section, we describe categorical equivalence and then show how it is related to Morita equivalence.

¹⁰ For example, see Glymour (1977), Knox (2013), and Weatherall (2015a) for discussion of whether or not Newtonian gravitation and geometrized Newtonian gravitation are equivalent. See North (2009), Halvorson (2011), Swanson and Halvorson (2012), Curiel (2014), and Barrett (2014) for discussion of whether or not Hamiltonian and Lagrangian mechanics are equivalent. See Rosenstock et al. (2015) for a discussion of general relativity and the theory of Einstein algebras and Weatherall (2015b) for a summary of many of these results.

¹¹ The reader is encouraged to consult Mac Lane (1971), Borceux (1994), or Awodey (2010)

Categorical equivalence is motivated by the following simple observation: First-order theories have categories of models. A category $C$ is a collection of objects with arrows between the objects that satisfy two basic properties. First, there is an associative composition operation $\circ$ defined on the arrows of $C$, and second, every object $c$ in $C$ has an identity arrow $1_c : c \to c$. If $T$ is a $\Sigma$-theory, we will use the notation $\mathrm{Mod}(T)$ to denote the category of models of $T$. An object in $\mathrm{Mod}(T)$ is a model $M$ of $T$, and an arrow $f : M \to N$ between objects in $\mathrm{Mod}(T)$ is an elementary embedding $f : M \to N$ between the models $M$ and $N$. One can easily verify that $\mathrm{Mod}(T)$ is a category.

Before describing categorical equivalence, we need some additional terminology. Let $C$ and $D$ be categories. A functor $F : C \to D$ is a map from objects and arrows of $C$ to objects and arrows of $D$ that satisfies

$$F(f : a \to b) = Ff : Fa \to Fb \qquad F(1_c) = 1_{Fc} \qquad F(g \circ h) = Fg \circ Fh$$

for every arrow $f : a \to b$ in $C$, every object $c$ in $C$, and every composable pair of arrows $g$ and $h$ in $C$. Functors are the “structure-preserving maps” between categories; they preserve domains, codomains, identity arrows, and the composition operation. A functor $F : C \to D$ is full if for all objects $c_1, c_2$ in $C$ and arrows $g : Fc_1 \to Fc_2$ in $D$ there exists an arrow $f : c_1 \to c_2$ in $C$ with $Ff = g$. $F$ is faithful if $Ff = Fg$ implies that $f = g$ for all arrows $f : c_1 \to c_2$ and $g : c_1 \to c_2$ in $C$. $F$ is essentially surjective if for every object $d$ in $D$ there exists an object $c$ in $C$ such that $Fc \cong d$. A functor $F : C \to D$ that is full, faithful, and essentially surjective is called an equivalence of categories. The categories $C$ and $D$ are equivalent if there exists an equivalence between them.¹²
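The three conditions on a functor can be checked mechanically when the categories are finite. The sketch below is an illustration under assumptions: the class `FiniteCategory`, its encoding of arrows and composition, and the example categories `C` and `D` are all hypothetical, and the code takes for granted that the given maps really form a functor.

```python
class FiniteCategory:
    def __init__(self, arrows, identities, table):
        self.arrows = arrows          # arrow name -> (domain, codomain)
        self.identities = identities  # object -> name of its identity arrow
        self.table = table            # (g, f) -> name of g after f, non-identity arrows only

    def hom(self, a, b):
        return {f for f, ends in self.arrows.items() if ends == (a, b)}

    def compose(self, g, f):
        """Return g after f, assuming the pair is composable."""
        if f in self.identities.values():
            return g
        if g in self.identities.values():
            return f
        return self.table[(g, f)]

    def isomorphic(self, a, b):
        return any(
            self.compose(g, f) == self.identities[a]
            and self.compose(f, g) == self.identities[b]
            for f in self.hom(a, b)
            for g in self.hom(b, a)
        )

def is_full(C, D, F_obj, F_arr):
    return all(
        D.hom(F_obj[a], F_obj[b]) <= {F_arr[f] for f in C.hom(a, b)}
        for a in C.identities
        for b in C.identities
    )

def is_faithful(C, F_arr):
    return all(
        f == g
        for f in C.arrows
        for g in C.arrows
        if C.arrows[f] == C.arrows[g] and F_arr[f] == F_arr[g]
    )

def is_essentially_surjective(C, D, F_obj):
    return all(
        any(D.isomorphic(F_obj[c], d) for c in C.identities) for d in D.identities
    )

# C has one object; D has two objects a, b joined by mutually inverse arrows u, v.
C = FiniteCategory({"1*": ("*", "*")}, {"*": "1*"}, {})
D = FiniteCategory(
    {"1a": ("a", "a"), "1b": ("b", "b"), "u": ("a", "b"), "v": ("b", "a")},
    {"a": "1a", "b": "1b"},
    {("v", "u"): "1a", ("u", "v"): "1b"},
)
F_obj, F_arr = {"*": "a"}, {"1*": "1a"}
is_equivalence = (
    is_full(C, D, F_obj, F_arr)
    and is_faithful(C, F_arr)
    and is_essentially_surjective(C, D, F_obj)
)
```

In this toy case the functor is not surjective on objects, since nothing is sent to `b`, but it is essentially surjective because `a` and `b` are isomorphic in `D`.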

A first-order theory $T$ has a category of models $\mathrm{Mod}(T)$. This categorical structure, however, is not particular to first-order theories. Indeed, one can easily define categories of models for the different formulations of the theory of smooth manifolds and for the different formulations of the theory of topological spaces. The arrows in these categories are simply the structure-preserving maps between the objects in the categories. One can also define categories of models for physical theories.¹³ This means that the following criterion for theoretical equivalence is applicable in a more general setting than definitional equivalence and Morita equivalence. In particular, it can be applied outside of the framework of first-order logic.

Definition. Theories $T_1$ and $T_2$ are categorically equivalent if their categories of models $\mathrm{Mod}(T_1)$ and $\mathrm{Mod}(T_2)$ are equivalent.

¹² The concept of a “natural transformation” is often used to define when two categories are equivalent. $C$ and $D$ are equivalent if there are functors $F : C \to D$ and $G : D \to C$ such that $FG$ is naturally isomorphic to the identity functor $1_D$ and $GF$ is naturally isomorphic to $1_C$. See Mac Lane (1971) for the definition of a natural transformation and for proof that these two characterizations of equivalence are the same.

Categorical equivalence captures a sense in which theories have “isomorphic semantic structure.” If $T_1$ and $T_2$ are categorically equivalent, then the relationships that models of $T_1$ bear to one another are “isomorphic” to the relationships that models of $T_2$ bear to one another.

In order to show how categorical equivalence relates to Morita equivalence, we focus on first-order theories. We will show that categorical equivalence is a strictly weaker criterion for theoretical equivalence than Morita equivalence is. We first need some preliminaries about the category of models $\mathrm{Mod}(T)$ for a first-order theory $T$. Suppose that $\Sigma \subset \Sigma^+$ are signatures and that the $\Sigma^+$-theory $T^+$ is an extension of the $\Sigma$-theory $T$. There is a natural “projection” functor $\Pi : \mathrm{Mod}(T^+) \to \mathrm{Mod}(T)$ from the category of models of $T^+$ to the category of models of $T$. The functor $\Pi$ is defined as follows.

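• $\Pi(M) = M|_\Sigma$ for every object $M$ in $\mathrm{Mod}(T^+)$.

• $\Pi(h) = h|_\Sigma$ for every arrow $h : M \to N$ in $\mathrm{Mod}(T^+)$, where the family of maps $h|_\Sigma$ is defined to be $h|_\Sigma = \{h_\sigma : M_\sigma \to N_\sigma \text{ such that } \sigma \in \Sigma\}$.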

Since $T^+$ is an extension of $T$, the $\Sigma$-structure $\Pi(M)$ is guaranteed to be a model of $T$. Likewise, the map $\Pi(h) : M|_\Sigma \to N|_\Sigma$ is guaranteed to be an elementary embedding. One can easily verify that $\Pi : \mathrm{Mod}(T^+) \to \mathrm{Mod}(T)$ is a functor.
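The action of $\Pi$ on objects and arrows amounts to forgetting everything outside $\Sigma$, which can be sketched for finite structures as follows. The dictionary layout, the names `reduct` and `restrict`, and the toy data are assumptions made for illustration and are not taken from the paper.

```python
def reduct(M, sigma):
    """Pi on objects: forget every sort and symbol of M that is not in sigma."""
    return {
        "sorts": {s: c for s, c in M["sorts"].items() if s in sigma},
        "symbols": {k: v for k, v in M["symbols"].items() if k in sigma},
    }

def restrict(h, sigma):
    """Pi on arrows: keep only the components h_s whose sort s lies in sigma."""
    return {s: h_s for s, h_s in h.items() if s in sigma}

# Hypothetical signature: one sort "s1" and one predicate "p".
SIGMA = {"s1", "p"}

# A structure for a larger signature that adds a product sort "s" with projection "pi1".
pairs = {(0, 0), (0, 1), (1, 0), (1, 1)}
M_plus = {
    "sorts": {"s1": {0, 1}, "s": pairs},
    "symbols": {"p": {0}, "pi1": {z: z[0] for z in pairs}},
}

# The identity map on M_plus, given sort by sort.
h = {"s1": {0: 0, 1: 1}, "s": {z: z for z in pairs}}

M_reduct = reduct(M_plus, SIGMA)
h_reduct = restrict(h, SIGMA)
```

Because `restrict` only discards components, it sends identity maps to identity maps and commutes with sortwise composition, which is the informal reason $\Pi$ is a functor.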

The following three propositions will together establish the relationship between $\mathrm{Mod}(T^+)$ and $\mathrm{Mod}(T)$ when $T^+$ is a Morita extension of $T$. They imply that when $T^+$ is a Morita extension of $T$, the functor $\Pi : \mathrm{Mod}(T^+) \to \mathrm{Mod}(T)$ is full, faithful, and essentially surjective. The categories $\mathrm{Mod}(T^+)$ and $\mathrm{Mod}(T)$ are therefore equivalent.

Proposition 5.1. Let $\Sigma \subset \Sigma^+$ be signatures and $T$ a $\Sigma$-theory. If $T^+$ is a Morita extension of $T$ to $\Sigma^+$, then $\Pi$ is essentially surjective.

Proof. If $M$ is a model of $T$, then Theorem 4.1 implies that there is a model $M^+$ of $T^+$ that is an expansion of $M$. Since $\Pi(M^+) = M^+|_\Sigma = M$ the functor $\Pi$ is essentially surjective.

Proposition 5.2. Let $\Sigma \subset \Sigma^+$ be signatures and $T$ a $\Sigma$-theory. If $T^+$ is a Morita extension of $T$ to $\Sigma^+$, then $\Pi$ is faithful.

Proof. Let $h : M \to N$ and $g : M \to N$ be arrows in $\mathrm{Mod}(T^+)$ and suppose that $\Pi(h) = \Pi(g)$. We show that $h = g$. By assumption $h_\sigma = g_\sigma$ for every sort symbol $\sigma \in \Sigma$. We show that $h_\sigma = g_\sigma$ also for $\sigma \in \Sigma^+ - \Sigma$. We consider the cases where $T^+$ defines $\sigma$ as a product sort or a subsort. The coproduct and quotient sort cases follow analogously.

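Suppose that $T^+$ defines $\sigma$ as a product sort with projections $\pi_1$ and $\pi_2$ of arity $\sigma \to \sigma_1$ and $\sigma \to \sigma_2$. Then the following equalities hold.

$$\pi_1^N \circ h_\sigma = h_{\sigma_1} \circ \pi_1^M = g_{\sigma_1} \circ \pi_1^M = \pi_1^N \circ g_\sigma$$

The first and third equalities hold because $h$ and $g$ are elementary embeddings and therefore commute with the interpretations of $\pi_1$; the second holds because $h_{\sigma_1} = g_{\sigma_1}$. The same argument shows that $\pi_2^N \circ h_\sigma = \pi_2^N \circ g_\sigma$. Since $N$ is a model of $T^+$ and $T^+$ defines $\sigma$ as a product sort, we know that $N \vDash \forall_{\sigma_1} x \forall_{\sigma_2} y \exists_{\sigma = 1} z\,(\pi_1(z) = x \land \pi_2(z) = y)$. This implies that $h_\sigma = g_\sigma$.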

On the other hand, if $T^+$ defines $\sigma$ as a subsort with injection $i$ of arity $\sigma \to \sigma_1$, then the following equalities hold.

$$i^N \circ h_\sigma = h_{\sigma_1} \circ i^M = g_{\sigma_1} \circ i^M = i^N \circ g_\sigma$$

These equalities follow in the same manner as above. Since $i^N$ is an injection it must be that $h_\sigma = g_\sigma$.

Before proving that Π is full, we need the following simple lemma.

Lemma 5.1. Let $M$ be a model of $T^+$ with $a_1, \ldots, a_n$ elements of $M$ of sorts $\sigma_1, \ldots, \sigma_n \in \Sigma^+ - \Sigma$. If $x_1, \ldots, x_n$ are variables of sorts $\sigma_1, \ldots, \sigma_n$, then there is a code $\xi(x_1, \ldots, x_n, y_{11}, \ldots, y_{n2})$ and elements $b_{11}, \ldots, b_{n2}$ of $M$ such that $M \vDash \xi[a_1, \ldots, a_n, b_{11}, \ldots, b_{n2}]$.

Proof. We define the code $\xi(x_1, \ldots, y_{n2})$. If $T^+$ defines $\sigma_i$ as a product sort, quotient sort, or subsort then we have no choice about what the conjunct $\xi_i(x_i, y_{i1}, y_{i2})$ is. If $T^+$ defines $\sigma_i$ as a coproduct sort, then we know that either there is an element $b_{i1}$ of $M$ such that $\rho_1(b_{i1}) = a_i$ or there is an element $b_{i2}$ of $M$ such that $\rho_2(b_{i2}) = a_i$. If the former, we let $\xi_i$ be $\rho_1(y_{i1}) = x_i$ and if the latter, we let $\xi_i$ be $\rho_2(y_{i2}) = x_i$. One defines the elements $b_{11}, \ldots, b_{n2}$ in the obvious way. For example, if $\sigma_i$ is a product sort, then we let $b_{i1} = \pi_1^M(a_i)$ and $b_{i2} = \pi_2^M(a_i)$. By construction, we have that $M \vDash \xi[a_1, \ldots, a_n, b_{11}, \ldots, b_{n2}]$.

We now use this lemma to show that Π is full.

Proposition 5.3. Let $\Sigma \subset \Sigma^+$ be signatures and $T$ a $\Sigma$-theory. If $T^+$ is a Morita extension of $T$ to $\Sigma^+$, then $\Pi$ is full.

Proof. Let $M$ and $N$ be models of $T^+$ with $h : \Pi(M) \to \Pi(N)$ an arrow in $\mathrm{Mod}(T)$. This means that $h : M|_\Sigma \to N|_\Sigma$ is an elementary embedding. We show that the map $h^+ : M \to N$ is an elementary embedding and therefore an arrow in $\mathrm{Mod}(T^+)$. Since $\Pi(h^+) = h$ this will imply that $\Pi$ is full.

Let $\phi(x_1, \ldots, x_n, \bar{x}_1, \ldots, \bar{x}_m)$ be a $\Sigma^+$-formula and let $a_1, \ldots, a_n, \bar{a}_1, \ldots, \bar{a}_m$ be elements of $M$ of the same sorts as the variables $x_1, \ldots, x_n, \bar{x}_1, \ldots, \bar{x}_m$. Lemma 5.1 implies that there is a code $\xi(x_1, \ldots, x_n, y_{11}, \ldots, y_{n2})$ and elements $b_{11}, \ldots, b_{n2}$ of $M$ such that $M \vDash \xi[a_1, \ldots, a_n, b_{11}, \ldots, b_{n2}]$. The definition of the map $h^+$ implies that $N \vDash \xi[h^+(a_1, \ldots, a_n, b_{11}, \ldots, b_{n2})]$. We now show that $M \vDash \phi[a_1, \ldots, a_n, \bar{a}_1, \ldots, \bar{a}_m]$ if and only if $N \vDash \phi[h^+(a_1, \ldots, a_n, \bar{a}_1, \ldots, \bar{a}_m)]$.

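By Theorem 4.3 there is a $\Sigma$-formula $\phi^*(\bar{x}_1, \ldots, \bar{x}_m, y_{11}, \ldots, y_{n2})$ such that

$$T^+ \vDash \forall_{\sigma_1} x_1 \ldots \forall_{\sigma_n} x_n\, \forall_{\bar{\sigma}_1} \bar{x}_1 \ldots \forall_{\bar{\sigma}_m} \bar{x}_m\, \forall_{\sigma_{11}} y_{11} \ldots \forall_{\sigma_{n2}} y_{n2} \Bigl(\xi(x_1, \ldots, y_{n2}) \rightarrow \bigl(\phi(x_1, \ldots, x_n, \bar{x}_1, \ldots, \bar{x}_m) \leftrightarrow \phi^*(\bar{x}_1, \ldots, \bar{x}_m, y_{11}, \ldots, y_{n2})\bigr)\Bigr)$$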

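We then see that the following string of equivalences holds.

$$\begin{aligned}
M \vDash \phi[a_1, \ldots, a_n, \bar{a}_1, \ldots, \bar{a}_m] &\iff M \vDash \phi^*[\bar{a}_1, \ldots, \bar{a}_m, b_{11}, \ldots, b_{n2}] \\
&\iff M|_\Sigma \vDash \phi^*[\bar{a}_1, \ldots, \bar{a}_m, b_{11}, \ldots, b_{n2}] \\
&\iff N|_\Sigma \vDash \phi^*[h(\bar{a}_1, \ldots, \bar{a}_m, b_{11}, \ldots, b_{n2})] \\
&\iff N \vDash \phi^*[h(\bar{a}_1, \ldots, \bar{a}_m, b_{11}, \ldots, b_{n2})] \\
&\iff N \vDash \phi^*[h^+(\bar{a}_1, \ldots, \bar{a}_m, b_{11}, \ldots, b_{n2})] \\
&\iff N \vDash \phi[h^+(a_1, \ldots, a_n, \bar{a}_1, \ldots, \bar{a}_m)]
\end{aligned}$$

The first and sixth equivalences hold by (5) and the fact that $M$ and $N$ are models of $T^+$, the second and fourth hold since $\phi^*$ is a $\Sigma$-formula, the third since $h : M|_\Sigma \to N|_\Sigma$ is an elementary embedding, and the fifth by the definition of $h^+$ and the fact that the elements $\bar{a}_1, \ldots, \bar{a}_m, b_{11}, \ldots, b_{n2}$ have sorts in $\Sigma$.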

These three propositions provide us with the resources to show how categorical equivalence is related to Morita equivalence. Our first result follows as an immediate corollary.

Theorem 5.1. Morita equivalence entails categorical equivalence.

Proof. Suppose that $T_1$ and $T_2$ are Morita equivalent. Then there are theories $T_1^1, \ldots, T_1^n$ and $T_2^1, \ldots, T_2^m$ that satisfy the three conditions in the definition of Morita equivalence. Propositions 5.1, 5.2, and 5.3 imply that the $\Pi$ functors between these theories, represented by the arrows in the following figure, are all equivalences.

[Figure: the chains $\mathrm{Mod}(T_1), \mathrm{Mod}(T_1^1), \ldots, \mathrm{Mod}(T_1^n)$ and $\mathrm{Mod}(T_2), \mathrm{Mod}(T_2^1), \ldots, \mathrm{Mod}(T_2^m)$, linked by $\Pi$ functors, with $\mathrm{Mod}(T_1^n) = \mathrm{Mod}(T_2^m)$.]

This implies that $\mathrm{Mod}(T_1)$ is equivalent to $\mathrm{Mod}(T_2)$, and so $T_1$ and $T_2$ are categorically equivalent.

The converse to Theorem 5.1, however, does not hold. There are theories that are categorically equivalent but not Morita equivalent.¹⁴ In order to show this, we need one piece of terminology. A category $C$ is discrete if it is equivalent to a category whose only arrows are identity arrows.

Theorem 5.2. Categorical equivalence does not entail Morita equivalence.

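Proof. Let $\Sigma_1 = \{\sigma_1, p_0, p_1, p_2, \ldots\}$ be a signature with a single sort symbol $\sigma_1$ and a countable infinity of predicate symbols $p_i$ of arity $\sigma_1$. Let $\Sigma_2 = \{\sigma_2, q_0, q_1, q_2, \ldots\}$ be a signature with a single sort symbol $\sigma_2$ and a countable infinity of predicate symbols $q_i$ of arity $\sigma_2$. Define the $\Sigma_1$-theory $T_1$ and $\Sigma_2$-theory $T_2$ as follows.

$$T_1 = \{\exists_{\sigma_1 = 1} x\,(x = x)\}$$

$$T_2 = \{\exists_{\sigma_2 = 1} y\,(y = y),\ \forall_{\sigma_2} y\,(q_0(y) \rightarrow q_1(y)),\ \forall_{\sigma_2} y\,(q_0(y) \rightarrow q_2(y)),\ \ldots\}$$

The theory $T_2$ has the sentence $\forall_{\sigma_2} y\,(q_0(y) \rightarrow q_i(y))$ as an axiom for each $i \in \mathbb{N}$.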

We first show that $T_1$ and $T_2$ are categorically equivalent. It is easy to see that $\mathrm{Mod}(T_1)$ and $\mathrm{Mod}(T_2)$ both have $2^{\aleph_0}$ (non-isomorphic) objects. Furthermore, $\mathrm{Mod}(T_1)$ and $\mathrm{Mod}(T_2)$ are both discrete categories. We show here that $\mathrm{Mod}(T_1)$ is discrete. Suppose that there is an elementary embedding $f : M \to N$ between models $M$ and $N$ of $T_1$. It must be that $f$ maps the unique element $m \in M$ to the unique element $n \in N$. Furthermore, since $f$ is an elementary embedding, $M \vDash p_i[m]$ if and only if $N \vDash p_i[n]$ for every predicate $p_i \in \Sigma_1$. This implies that $f : M \to N$ is actually an isomorphism. Every arrow $f : M \to N$ in $\mathrm{Mod}(T_1)$ is therefore an isomorphism, and there is at most one arrow between any two objects of $\mathrm{Mod}(T_1)$. This immediately implies that $\mathrm{Mod}(T_1)$ is discrete. An analogous argument demonstrates that $\mathrm{Mod}(T_2)$ is discrete. Any bijection between the objects of $\mathrm{Mod}(T_1)$ and $\mathrm{Mod}(T_2)$ is therefore an equivalence of categories.
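To see how the models of these two theories are indexed, one can enumerate them for a finite truncation of the signatures. The sketch below is an illustration only, and the truncation to `k` predicates is an assumption that departs from the theorem: each model has a single element, so up to isomorphism it is determined by which predicates hold of that element. For finite `k` the two counts differ, whereas with countably many predicates both families have cardinality $2^{\aleph_0}$, which is what the proof uses.

```python
from itertools import product

def models_T1(k):
    """Truth-value assignments to p_0..p_{k-1} on the unique element."""
    return list(product([False, True], repeat=k))

def models_T2(k):
    """Assignments to q_0..q_{k-1} that satisfy q_0 -> q_i for every i < k."""
    return [v for v in product([False, True], repeat=k) if not v[0] or all(v)]

counts = {k: (len(models_T1(k)), len(models_T2(k))) for k in (1, 2, 3)}
```

The filter in `models_T2` records the special role of $q_0$: once $q_0$ holds, every other predicate is forced to hold as well.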

But $T_1$ and $T_2$ are not Morita equivalent. Suppose for contradiction that $T$ is a “common Morita extension” of $T_1$ and $T_2$. Corollary 4.1 implies that there is a $\Sigma_1$-sentence $\phi$ such that $T \vDash \forall_{\sigma_2} y\, q_0(y) \leftrightarrow \phi$. One can verify using Theorem 4.2 and Corollary 4.1 that the sentence $\phi$ has the following property: If $\psi$ is a $\Sigma_1$-sentence and $T_1 \vDash \psi \rightarrow \phi$, then either (i) $T_1 \vDash \neg\psi$ or (ii) $T_1 \vDash \phi \rightarrow \psi$. But $\phi$ cannot have this property. Consider the $\Sigma_1$-sentence

$$\psi := \phi \land \forall_{\sigma_1} x\, p_i(x)$$

where $p_i$ is a predicate symbol that does not occur in $\phi$. We trivially see that $T_1 \vDash \psi \rightarrow \phi$, but neither (i) nor (ii) holds of $\psi$. This implies that $T_1$ and $T_2$ are not Morita equivalent.

6 Conclusion

We have discussed three formal criteria for theoretical equivalence, and we have shown that they form the following hierarchy.

Definitional equivalence ⇒ Morita equivalence ⇒ Categorical equivalence

theories with different sort symbols equivalent. But definitional equivalence does not allow one to do this. Morita equivalence, on the other hand, does allow one to capture a sense in which such theories might be equivalent.

The hierarchy also yields a precise sense in which categorical equivalence is too liberal a criterion for theoretical equivalence. The example from Theorem 5.2 is quite general. Any two theories with discrete categories of models will be categorically equivalent, as long as they have the same number of models. But one often has good reason to consider two such theories inequivalent. For example, there is a sense in which the two theories from Theorem 5.2 do not “say the same thing.” According to the theory $T_2$, there is a special predicate $q_0$. If the predicate $q_0$ holds, that completely determines what else is true according to $T_2$. The theory $T_1$, however, singles out no such predicate. If one takes categorical equivalence as the standard for theoretical equivalence, then one is forced to consider $T_1$ and $T_2$ equivalent. Morita equivalence, on the other hand, allows one to consider them inequivalent.

Even though there is a sense in which it is too liberal, categorical equivalence is currently our most promising formal criterion for theoretical equivalence outside the framework of first-order logic. We have seen that it is a weaker criterion than Morita equivalence, but one nonetheless hopes that it is not “too much weaker.” One could substantiate this hope by proving a result of the following form.

If $T_1$ and $T_2$ are categorically equivalent and $P$, then $T_1$ and $T_2$ are Morita equivalent.

$P$ is some additional constraint that $T_1$ and $T_2$ might be required to satisfy. For example, one might hope that the result could be proven when $P$ is “$T_1$ and $T_2$ have finite signatures.” If a result of this form holds for a general property $P$, that would show that categorical equivalence is “almost as strong” as Morita equivalence.

Promising work in this direction has been done by Makkai (1991) and Awodey and Forssell (2010). Makkai shows that if the ultracategories $\mathrm{Mod}(T_1)$ and $\mathrm{Mod}(T_2)$ are equivalent, then $T_1$ and $T_2$ are Morita equivalent. Awodey and Forssell show that if the topological groupoids $\mathrm{Mod}(T_1)$ and $\mathrm{Mod}(T_2)$ are equivalent, then $T_1$ and $T_2$ are Morita equivalent. But there is still more work to be done before we completely understand the relationship between Morita equivalence and categorical equivalence.⋆

References

Adámek, J., Sobral, M., and Sousa, L. (2006). Morita equivalence of many-sorted algebraic theories. Journal of Algebra.

Andréka, H., Madarász, J., and Németi, I. (2008). Defining new universes in many-sorted logic. Manuscript.

⋆ This material is based upon work supported by the National Science Foundation under

Andréka, H., Madarász, J. X., and Németi, I. (2005). Mutual definability does not imply definitional equivalence, a simple example. Mathematical Logic Quarterly.

Artigue, M., Isambert, E., Perrin, M., and Zalc, A. (1978). Some remarks on bicommutability. Fundamenta Mathematicae.

Awodey, S. (2010). Category Theory. Oxford.

Awodey, S. and Forssell, H. (2010). First-order logical duality. Manuscript.

Barrett, T. W. (2014). On the structure of classical mechanics. The British Journal for the Philosophy of Science (forthcoming).

Barrett, T. W. and Halvorson, H. (2015). Glymour and Quine on theoretical equivalence. Manuscript.

Borceux, F. (1994). Handbook of Categorical Algebra, volume 1. Cambridge University Press.

Coffey, K. (2014). Theoretical equivalence as interpretive equivalence. The British Journal for the Philosophy of Science.

Curiel, E. (2014). Classical mechanics is Lagrangian; it is not Hamiltonian. The British Journal for the Philosophy of Science.

de Bouvère, K. L. (1965). Synonymous theories. In Symposium on the Theory of Models. North-Holland Publishing Company.

Dukarm, J. J. (1988). Morita equivalence of algebraic theories. Colloquium Mathematicae.

Eilenberg, S. and Mac Lane, S. (1942). Group extensions and homology. Annals of Mathematics.

Eilenberg, S. and Mac Lane, S. (1945). General theory of natural equivalences. Transactions of the American Mathematical Society.

Freyd, P. (1964). Abelian categories. Harper and Row.

Friedman, H. and Visser, A. (2014). When bi-interpretability implies synonymy. Manuscript.

Glymour, C. (1970). Theoretical realism and theoretical equivalence. In PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association.

Glymour, C. (1977). The epistemology of geometry. Noûs.

Glymour, C. (1980). Theory and Evidence. Princeton.


Halvorson, H. (2011). Natural structures on state space. Manuscript.

Halvorson, H. (2012). What scientific theories could not be. Philosophy of Science.

Halvorson, H. (2013). The semantic view, if plausible, is syntactic. Philosophy of Science.

Halvorson, H. (2015). Scientific theories. Forthcoming in Oxford Handbooks Online.

Hilbert, D. (1930). Grundlagen der Geometrie. Teubner.

Hodges, W. (2008). Model Theory. Cambridge University Press.

Johnstone, P. T. (2003). Sketches of an Elephant. Oxford.

Kanger, S. (1968). Equivalent theories. Theoria.

Knox, E. (2013). Newtonian spacetime structure in light of the equivalence principle. The British Journal for the Philosophy of Science.

Kuratowski, K. (1966). Topology, volume 1. Academic Press.

Mac Lane, S. (1948). Groups, categories, and duality. Proceedings of the National Academy of Sciences.

Mac Lane, S. (1971). Categories for the working mathematician. Springer.

Makkai, M. (1991). Duality and definability in first order logic, volume 503. American Mathematical Society.

Montague, R. (1957). Contributions to the Axiomatic Foundations of Set Theory. PhD thesis, University of California, Berkeley.

Nestruev, J. (2002). Smooth Manifolds and Observables. Springer.

North, J. (2009). The ‘structure’ of physics: A case study. Journal of Philosophy.

Pelletier, F. J. and Urquhart, A. (2003). Synonymous logics. Journal of Philosophical Logic.

Pinter, C. C. (1978). Properties preserved under definitional equivalence and interpretations. Mathematical Logic Quarterly.

Quine, W. V. O. (1975). On empirically equivalent systems of the world. Erkenntnis.

Rosenstock, S., Barrett, T. W., and Weatherall, J. O. (2015). On Einstein algebras and relativistic spacetimes. Manuscript.

Schwabhäuser, W., Szmielew, W., and Tarski, A. (1983). Metamathematische Methoden in der Geometrie. Springer.

Shoenfield, J. (1967). Mathematical Logic. Addison-Wesley.

Sklar, L. (1982). Saving the noumena. Philosophical Topics.

Swanson, N. and Halvorson, H. (2012). On North’s ‘The structure of physics’. Manuscript.

Szczerba, L. (1977). Interpretability of elementary theories. In Logic, Foundations of Mathematics, and Computability Theory. Springer.

Tarski, A. (1959). What is elementary geometry? In The Axiomatic Method With Special Reference to Geometry and Physics. North-Holland.

Tsementzis, D. (2015). A syntactic characterization of Morita equivalence. Manuscript.

van Fraassen, B. C. (2014). One or two gentle remarks about Hans Halvorson’s critique of the semantic view. Manuscript.

Weatherall, J. O. (2015a). Are Newtonian gravitation and geometrized Newtonian gravitation theoretically equivalent? Manuscript.

Weatherall, J. O. (2015b). Categories and the foundations of classical field theories. Manuscript.

Weatherall, J. O. (2015c). Understanding gauge. Manuscript.

Appendix

This appendix contains a proof of Theorem 4.3, which we restate here for convenience.

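Theorem 4.3. Let $\Sigma \subset \Sigma^+$ be signatures and $T$ a $\Sigma$-theory. Suppose that $T^+$ is a Morita extension of $T$ to $\Sigma^+$ and that $\phi(x_1, \ldots, x_n, \bar{x}_1, \ldots, \bar{x}_m)$ is a $\Sigma^+$-formula. Then for every code $\xi(x_1, \ldots, y_{n2})$ for the variables $x_1, \ldots, x_n$ there is a $\Sigma$-formula $\phi^*(\bar{x}_1, \ldots, \bar{x}_m, y_{11}, \ldots, y_{n2})$ such that

$$T^+ \vDash \forall_{\sigma_1} x_1 \ldots \forall_{\sigma_n} x_n\, \forall_{\bar{\sigma}_1} \bar{x}_1 \ldots \forall_{\bar{\sigma}_m} \bar{x}_m\, \forall_{\sigma_{11}} y_{11} \ldots \forall_{\sigma_{n2}} y_{n2} \Bigl(\xi(x_1, \ldots, y_{n2}) \rightarrow \bigl(\phi(x_1, \ldots, x_n, \bar{x}_1, \ldots, \bar{x}_m) \leftrightarrow \phi^*(\bar{x}_1, \ldots, \bar{x}_m, y_{11}, \ldots, y_{n2})\bigr)\Bigr)$$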

We first prove the following lemma. Given a $\Sigma^+$-term $t$, we will again write