• No results found

Competing Conventions

N/A
N/A
Protected

Academic year: 2020

Share "Competing Conventions"

Copied!
54
0
0

Loading.... (view fulltext now)

Full text

(1)

Competing Conventions

Philip R. Neary

April 30, 2012

Abstract

This paper introduces a new coordination problem for a large but finite popu-lation - The Language Game. The population is partitioned into two groups of identical agents. Each player shares a common two-action strategy set and in-teracts pairwise with everyone else. Both symmetric profiles are Pareto-efficient strict equilibria, but the groups rank them differently. The profile where success-ful coordination occurs only within-group, with each group adopting their most preferred equilibrium action, is also an equilibrium provided the smaller group’s preferences are sufficiently strong. In all dynamically stable long-run outcomes, players in the same group adopt the same action. Three properties, that do not matter for equilibrium selection in standard homogeneous agent models, do matter in the Language Game. These are: group size, strength of group prefer-ences, and rates of group adaptiveness (“group dynamism”). A relative increase in group size and group dynamism is always weakly beneficial.

I would like to thank Sarada, Nageeb Ali, Prashant Bharadwaj, Larry Blume, Tiffany Chou, Vince Crawford, Silke Januszewski Forbes, Dan Friedman, Dalia Ghanem, Roger Gordon, Youjin Hahn, Oana Hirakawa, Matt Jackson, Michihiro Kandori, Chulyoung Kim, David Levine, Daniel Lima, Stephen Morris, Peter Neary, Paul Niehaus, Frances Ruane, Bill Sandholm, Lucas Siga, James Slevin, Bernhard von Stengel, John Sutton, Joel Watson, Hee-Seung Yang, Peyton Young, and seminar participants at UCSD, University of Colorado Boulder, The Brattle Group, LSE, and Royal Holloway for helpful comments. Two anonymous referees and an associate editor made many good suggestions that improved the paper considerably. I would especially like to thank Joel Sobel for all his help. Remaining errors are mine.

(2)

“Nobody will ever win the battle of the sexes.

There’s just too much fraternizing with the enemy.”

- Henry Kissinger

1

Introduction

A large population coordination problem is one wherein all parties can realize mutual

gains, but only by making mutually consistent decisions. Often, such mutually

consis-tent decisions require that everybody behave identically, but on occasion they do not.

For example, while writing academic papers in English is a must since English is the

language of science, and driving on the opposite side of the road as the oncoming traffic

hardly seems wise, it may still make sense to buy a Mac even if it is only your friends

who use them.

The emergence of coordinated outcomes in large societies, often referred to as

con-ventions (Lewis, 1969; Young, 1993, 1996, 2001), is conventionally modelled using the framework of evolutionary game theory. In the simplest model, due toKandori, Mailath, and Rob (1993) (hereafter KMR), a large population of homogeneous players are ran-domly matched to play a symmetriclocal-interaction 2×2 game of pure coordination.

This situation is then repeated indefinitely with players assumed to be myopic

best-responders. Coordination is guaranteed in the long run, the only question is which

strict equilibrium (convention) the population will coordinate on.1 However, by

con-struction, KMR’s model can only be used for studying the emergence of conventions in

societies where all agents have identical preferences. Furthermore, since everyone is the

same, all players long for the same coordinated outcome, and so the only level at which

competition occurs is at that of the strategy. This is limiting, since in many situations

of interest people have different tastes, and so what might be best for some may not be

best for all.

In this paper, I introduce “The Language Game”, that allows the study of

conven-tions in a particular heterogeneous environment.2 The population is composed of not

one, but two homogeneous groups with differing preferences.3 Each player has the same

1Young(1993) describes a broad class of games for which such “lock in” will certainly occur. 2A similar model is presented in Quilter(2007).

3Another setting with two homogeneous groups with (potentially) differing preferences are the

(3)

two-action strategy set and interacts pairwise with everyone else. Utilities are the sum

of payoffs earned from playing the field, where the same action must be used with each

opponent. Uniform adoption of either action is always a strict equilibrium, but the

groups rank them differently. If the smaller group has sufficiently strong preferences,

then the members of each group adopting their preferred action, and thus forfeiting

coordination with those of the other group, can also be supported in equilibrium.

Next, the Language Game becomes the stage game of a repeated interaction. Time

is discrete, begins at t = 0, and goes forever. Payoffs are received every period and actions for tomorrow must be chosen at the end of today. The same assumptions on

individual behaviour used in KMR generate population dynamics that will lock in on

some convention with probability one. I then turn to the issue ofequilibrium selection,

employing the concept of stochastic stability, introduced in the seminal work of

(Fos-ter and Young, 1990). The basic idea is that players occasionally, and independently, choose non-optimal responses. Such “mutations” perturb the dynamics, enabling crisp

predictions to be made about long-run behaviour. Equilibrium selection in the

Lan-guage Game depends on three factors that have no bearing in KMR’s model. They

are: group size, group payoffs, and group dynamism (how regularly members of a given

group react).4 These properties can be traded off against each other. Increased group

dynamism and greater group numbers are always desirable. However, perhaps

surpris-ingly, stronger preferences need not be.

Welfare issues are interesting. In the homogeneous agent model, for any convention

each player’s behaviour and payoff are identical. As such, the literature has focused

primarily on the tension that arises when the “good” Pareto-dominant equilibrium

action does not coincide with the “safe” risk-dominant one. Foster and Young (1990), KMR and Young (1993) were the first to show that evolutionary forces coupled with mutations will propel population behaviour towards risk-dominance.5 Only

Pareto-players are matched against a population of “column” Pareto-players to play an asymmetric game, e.g. Battle of the Sexes. However, and this is a big difference, it is still the case that players from each group only interact with one other type of player (those from the other group). Thus, despite adding heterogeneity, there is still only one common local interaction.

4“Group dynamism” permits many interpretations. It can be thought of as how sensitive, on

average, a particular group’s members are to their surroundings; as differences in inertia - life in the city is more hectic than life in the countryside; as the frequency of death and replacement; as adjustment costs varying across group since some find change less burdensome; or even as variable information access which would enable greater transparency in decision making.

(4)

Pareto-efficient equilibria are selected in the Language Game.

The “Language Game” moniker is due to the model’s applicability to issues of

compatibility: successful coordination is good, while all types of failed coordination

are equally bad. An example is the adoption of technological standards (Katz and

Shapiro, 1985; Farrell and Saloner, 1985; Arthur, 1989). The key observation in this literature is one of network effects, whereby the usefulness of a technology is increasing

in the number of its users. The homogeneity of existing models means the population

collectively agrees on the optimal standard, and whatever standard does emerge is

adopted by one and all. The Language Game helps to organize observations because

often subsets of the population have inherent preferences for different standards, and

often multiple standards coexist. Section 2illustrates this with an example.

It has been traditional in evolutionary models to view the players as being drawn

from a single population. This assumption constrains all local interactions to be the

same and limits the scope of the analysis. The main contribution of this paper is

to provide an uncomplicated model, with immediate economic applications, with this

assumption relaxed. However, while the Language Game is the first detailed setting

with multiple populations interacting together, what I believe to be the major insight

(that of multiple populations all interacting together) is not as novel as I’d hoped, and I

would be remiss not to mention some other work.6 Friedman(1998) discusses a general

setting with two populations where, in addition to asymmetric interactions, there are

also “own-population effects”. Cressman, Garay, and Hofbauer (2001) characterize

Evolutionary Stable Strategies (Smith and Price, 1973) for what they call N-Species Frequency-Dependent Interactions; the textbookCressman (2003) explores these ideas further. Quilter (2007) analyses a similar model, but focuses on how the specifics of the perturbations can affect selection.7

The plan of the paper is as follows. In the next section, I present a simple example

dominant equilibrium action can emerge in similar settings. Ely(2002),Oechssler(1999) andOechssler

(1997) are models with endogenous local interactions, Canals and Vega-Redondo (1998) and Robson and Vega-Redondo(1996) vary the frequency with which players may interact, while Kim and Sobel

(1995) add a round of costless communication, “cheap talk”, before actions are taken.

6Thanks to David Levine, Bill Sandholm, and an anonymous referee for drawing my attention to

these papers.

7Other papers with heterogeneity include Carvalho(2010), where ‘taboos’ are imposed on certain

(5)

that demonstrates how homogeneous groups with heterogeneous preferences can be

a more natural way to think about certain large population situations, in particular

technology adoption. The Language Game is formally defined in Section 3, where I

also characterize the set of pure strategy equilibria. Section 4 shows how decisions

at the individual level are aggregated to yield group dynamics, and illustrates via an

example how path dependence may be influenced by the details of the dynamics. This

analysis is carried forward to Section5 which contains the main results on equilibrium

selection. Section 6 looks at welfare properties of selected equilibria, while Section 7

examines how the set of selected equilibria varies as Language Game parameters change.

Section8 concludes and discusses some potential avenues for future work.

2

A Story

The story is an extension of one from KMR. I begin by reminding the reader of theirs

and then build on it.

There is a university dormitory of 10 identical students, referred to as GroupA. Each Group A student chooses a computer system from the set {a, b}. Each evening, the students assemble in the study hall, where everybody encounters everyone else. When

two students interact, they can collaborate by playing games, sharing files, swapping

add-ons, etc. But - and this is key - meetings are fruitful if and only if both students use

the same system. Assuming that system a is inherently superior to system b induces the following local-interaction pure coordination game,GAA,

AI

AII

a b a 3,3 0,0

b 0,0 2,2

whereAI andAII are any pair of GroupA students. The local interactionGAA has two

(6)

Following KMR, assume that students occasionally have the opportunity to change

their computer, and that students are myopic in that decisions taken are best-responses

to the current distribution of computers. This generates Darwinian dynamics. Initial

conditions are key: if more than two fifths of the population begins using system a

(5 or more since the population is of size 10), then outcome a will be reached; while

if 6 or more students start out using b, then b will be the final resting point. The reasoning is simple, all players collectively agree on what action is a best-response, so

the best-response today must be at least as good a response tomorrow as the number

of players taking that action can only (weakly) increase. Thus population behaviour is

always drifting towards either aor b.

However, when noise in the form of individual trembles/mistakes/experimentation

is incorporated into the dynamics, it is possible to select between strict equilibrium

outcomes. Suppose that the probability that a student mistakenly chooses the computer

that is not an optimal response is given byε. It takes 5 or more simultaneously occurring mistakes to dislodge the system from b, and 6 or more to get away from a. The most

likely events of this form occur with probability of orders ε5 and ε6 respectively. For small values of ε, ε6 ε5, and so KMR conclude that when agents are myopic best-responders, who occasionally make mistakes, that outcome a is far more likely to be

observed in the long run. Equilibriuma is said to be selected.

A key component of the above story was systema’sinherent superiority to systemb. While in many coordination problems it is plausible to believe that coordination on one

particular strategy is better (by any metric) than another, words like “better” derive

from primitive preferences, and preferences are individual by nature. In a population

with heterogeneous agents, what might be best for some may not be best for all.

To illustrate the impact of adding heterogeneity, consider the following extension to

the above story. Suppose Group A are “slackers” - they must also do coursework on their machines, but their main use for computers is playing games. Instead of assuming

that computer system a is flat out better than b, suppose that system a more readily supports gaming platforms, which justifies GroupA’s preference for coordinating on it. Suppose further that there is another dorm of 5 more students, referred to as Group

B, in the next building. Every evening, these 5 Group B students meet in a separate study hall and exchange software, etc. Again, interactions are beneficial if and only if

(7)

“serious” scholars, and that platform b suits these needs better. The local interaction between two GroupB students is thus given by the following pure coordination game,

GBB,

BI

BII

a b a 1,1 0,0

b 0,0 2,2

whereBIand BII are any two GroupB students. By an analysis identical to that given

above for GroupA, left to their own devices GroupB will adopt computer system b in the long run.8 Now, consider the 15 person population as a whole. Writing a

group-symmetric profile9 as a 2-dimensional vector, with the Group A profile written first,

there are 4 strict equilibria: (a,a),(a,b),(b,a),(b,b). The selected equilibrium is (a,b).

Now suppose that in an effort to free up space, the university stipulates that both

groups should study Group A’s larger study room which can accommodate 5 extra bodies (this frees up the smaller study room for other activities). So now, all 15 students

meet in the same room every evening. Everybody interacts with everybody else, and

within-group local interactions are as before. It remains to specify the local interaction

occurring between students from opposite groups. This is described by the following

coordination game,GAB, in which the row player,Ai, is from GroupA, and the column

player, Bj, is from GroupB,

Ai

Bj

a b a 3,1 0,0

b 0,0 2,2

The local interaction GAB is not symmetric. It has two Pareto-efficient pure

strat-egy equilibria, (a, a) and (b, b), over which players’ preferences disagree. Considering the population coordination problem, interactions are now occurring both within- and

8The probability of Group B transitioning from ato boccurs with probability of order ε2, while

that of transitioning frombtoa occurs with probability of orderε4.

9A group-symmetric profile is one in which those in the same group take the same action. A

(8)

across-group. Each Group A student interacts with 9 fellow Group A students and 5 GroupBstudents, while each GroupB student interacts with 4 other GroupBstudents and 10 Group A students. The only group-symmetric equilibria to this new situation are (a,a) and (b,b).10 While both are Pareto-efficient, the 10 GroupA students prefer (a,a), while the 5 Group B students view (b,b) as most desirable.

I now pause and ask the reader to predict what they think the selected outcome

will be (recalling that (a,a) and (b,b) are the only viable candidates). One conjecture might be the following. Even though behaviour evolves in a decentralised manner via

individual best-responses, the greater Group A numbers should somehow collectively force its preferred outcome, (a,a), onto the population at large. However, this goes against the equally simple theory that (relative) strength of preferences should be the

determining factor. If preferences were the key ingredient, the selection prediction in

this example would be tilted towards Group B’s preferred outcome, (b,b).

Clearly group sizes and group preferences are important. However, it turns out that

selection also depends on a variety of additional factors aside from these. The first of

these is noise (individual mistakes). In the original story with only GroupA, all agents were identical so assuming they all make time- and profile-independent mistakes with

equal probability seemed not completely unreasonable. However, with a population

composed of two “types” of agents, if Group A students tremble with probability εA

while Group B students tremble with probability εB, there is no obvious reason to

conclude that εA=εB.

The second complicating factor is a property I refer to as group dynamism. Group

dynamism can be thought of as the rate at which a group responds. It may be that

GroupAstudents are more lethargic than Group B students. Perhaps on average only one GroupAstudent updates his action in a given period, whereas all GroupBstudents update their action every period. In the real world, there is no reason to suppose that

different groups respond at identical rates.

However, with these caveats in mind, let’s begin by assuming that: (1) payoffs are

as given in GAA, GAB and GBB; (2) the group sizes are 10 and 5 for Groups A and

B respectively; (3) mistakes are such that εA = εB = ε; and (4) both groups evolve

according to the best-reply dynamic in which all students react every period.

10While (a,b), and (b,a) were equilibria when the two groups were unconnected, they are no longer

(9)

Suppose first that the population profile is (b,b). Any Group B student needs to see a minimum of 10 others taking action a in order to switch his/her action, while a Group A student needs to see a minimum of 7. Let’s say between 7 and 9 students accidentally chose actiona(it is not important how these students are distributed across the two groups). In the following period, all GroupB students choose action b, but all Group A students choose action a, so that the new profile is (a,b). With no further mistakes, all students take action a the following period. The conclusion is that 7 or more simultaneous mistakes are sufficient to shift the population from (b,b) to (a,a). Now assume that the current profile is (a,a). If between 6 and 8 mistakes occur whereby students accidentally choose action b, it is enough to induce the 5 Group B

students to take actionb, but not enough to induce the 10 Group Astudents to do so. Thus, next period’s profile is (a,b), and with no further mistakes, the system reverts to (a,a). It can be computed that a minimum of 9 simultaneous mistakes are required to shift the system from (b,b) to (a,a).

Transitioning from (a,a) to (b,b) requires an event that occurs with probability of order ε9, while transitioning from (b,b) to (a,a) one with probability of order ε7. For small values of ε, the second transition is far more likely. So provided that: (i) everybody interacts with everybody else; (ii) both groups respond according to best-reply dynamics; (iii) the probability of a student making a mistake is small, equal for all students, and independent of the current population profile; and (iv) payoffs and group-sizes are as specified above. Thus, the informal analysis above concludes that

the selected equilibrium will be (a,a).

Now suppose that the groups adapt at different rates. The situation is as before

except for the dynamics. Now, every period, three Group B students best-respond, while only one Group A student best-responds. Again let us calculate how easy it is to dislodge the population from each of the equilibria. The bounds on player numbers

from before are derived from preferences and group sizes, not dynamics, so those have

not changed. The difference in this analysis is that it will matter exactly who is

mak-ing mistakes. For a pictorial explanation of why it matters who makes mistakes, the

interested reader should look ahead to Figures1 and 2in Section 4.2.

(10)

actiona, so that the total number taking action a is increased to 8. The following pe-riod 8 GroupA students are using action a. And so on. Once all 10 Group Astudents are taking action a, action a becomes optimal for Group B students. After two more periods, all of Group B adopts it. Thus, 7 of the “right kind” of mistakes are enough to transition from (b,b) to (a,a).

Now let the current profile be (a,a), and suppose that 6 GroupA students acciden-tally choose action b (again, it matters that these 6 students are from Group A). At this new profile, Group preferences disagree. The following period, 3 GroupB students adopt action b, while one of the 6 Group A students who mistakenly chose action b

reverts to action a. Thus, there are 8 = 3 + (6−1) students taking action b. Next period, all GroupB students adopt actionb while 4 GroupA players do, and so there are now 9 = 5 + (5−1) students taking action b. This is enough for all for Group

A students to prefer action b. From this new situation, by an analysis similar to the previous paragraph, population behaviour moves incrementally to (b,b).

Under these different dynamics, transitioning from (b,b) to (a,a) still occurs with probability of orderε7. More importantly however, the likelihood of transitioning from (a,a) to (b,b) has been lowered to ε6. So in this case, the analysis concludes that equilibrium (b,b) will be selected.

So what, the reader might ask, is the point of this section? There is certainly no

hint of a crisp result like the risk-dominance predictions of KMR and Young (1993). But in fact, the lack of a clean result is precisely the point. That group size and

strength of payoffs affect long-run behaviour is intuitive but not predicted by KMR’s

homogeneous model. In that framework, once an equilibrium is risk-dominant it is

selected (even though it may be Pareto-dominated to an arbitrary extent), and this is

of course independent of population size.

3

The Language Game

3.1

The Model

The Language Game, G, is defined as the tuple {N,Π, S,G}, where N :={1, . . . , N}

(11)

the binary action set common to all players; G := GAA, GAB, GBB is the collection

oflocal interactions, where GAA is the pairwise exchange between a player from Group

A and a player from Group A, etc. The local interactions are given as follows,

GAA GBB

A1

A2

a b

a p, p 0,0

b 0,0 1−p,1−p B1

B2

a b

a 1−q,1−q 0,0

b 0,0 q, q GAB

A

B

a b

a p,1−q 0,0

b 0,0 1−p, q

where p, q ∈ (1/2,1), so that Group A members always prefer to coordinate on a, and GroupB members always prefer to coordinate onb.

The Language Game is a simultaneous move game and players do not randomize.

Utilities are the sum of payoffs earned from playing the field, where the same action

must be used with one and all. Local interactions are opponent-independent in that a

player’s payoff depends only on the actions chosen and not the other player’s identity.

Thus, a player cares only about the number of others who choose the same action, but

not who those others are.

With only two “types” of agent, population behaviour can be written concisely. Let

[ω]Kdenote the number of players in groupK ∈Π using actiona. Callω = [ω]A,[ω]B

thestate of the play, where thestate space is Ω :=0, . . . , NA ×

(12)

K ∈Π receives from taking actions ∈ {a, b} in state ω, written UK(s;ω), is given by

UA(a;ω) :=

na−1

p (1)

UA(b;ω) :=N −na−1

(1−p) (2)

UB(a;ω) :=na−1

(1−q) (3)

UB(b;ω) :=N −na−1

q (4)

Before discussing individual behaviour, I will mention genericity. Letting N :=

{1,2, . . .}, the set of Language Games can be parameterized by Θ = {(NA, NB, p, q) :

NA, NB

N\ {1};p, q ∈(1/2,1)}. For a given gameG ∈Θ and a given group K ∈Π,

the statement “if there does not exist a state ω ∈ Ω such that UK(a;ω) = UK(b;ω)”,

will be abbreviated to “genK”. If there exists such a state, the shorthand is “ngenK”.

The subset of the parameter space for which any indifference occurs can be shown to

have a closure of measure zero specifically, for a givenNA,NB, andp, there is only one value ofq such that UB(a;ω) = UB(b;ω)

and so when both genA and genB, following

standard terminology I sayG is generic. Otherwise,G isnongeneric.

3.2

Individual Behaviour

Let R denote the real line, and R+ its positive part. For any x ∈ R, let dxe :=

min{n∈N|x≤n} and bxc:= max{n ∈N|x≥n}. Define the following,11

nAa := minna

UA(a;ω)> UA(b;ω) =d(1−p)N + (2p−1)e, (5)

nBa := minna

UB(a;ω)> UB(b;ω) =dq(N −2) + 1e, (6)

nAb := minnb

UA(a;ω)< UA(b;ω) =dp(N −2) + 1e, (7)

nBb := minnb

UB(a;ω)< UB(b;ω) =d(1−q)N + (2q−1)e (8)

nA

a is the number of players currently taking action a for a player from Group A to

strictly prefer action a, etc. When p = q, by symmetry nA

a = nBb and nAb = nBa. This

appears to suggest that the strategic situation is mirrored for groups A and B. While this is true at the individual level, it need not be true at the level of the group.

(13)

Let ΩA,ab and ΩB,ab denote the set of states such that A players and B players respectively strictly prefer action a to actionb. Similarly for ΩA,ba and ΩB,ba. Thus we can write

ΩA,ab :=nω ∈Ω [ω]A+ [ω]B ≥nAa o

(9)

ΩB,ab :=nω ∈Ω [ω]A+ [ω]B ≥nBa o

(10)

ΩA,ba:=nω ∈Ω [ω]A+ [ω]B ≤N −nAb o

(11)

ΩB,ba:=

n

ω ∈Ω [ω]A+ [ω]B ≤N −nBb o

(12)

Sets ΩA,ab, ΩB,ab, ΩA,ba, and ΩB,ba are defined likewise but for weak preference.

Generically, these sets of weak- and strict-preference coincide. Letting ⊆ (⊂) denote

weak (strict) inclusion, we have the following lemma.

Lemma 1. When NA2 and NB 2,

1. ΩB,ab ⊆ΩA,ab,

2. ΩA,ba ⊆ΩB,ba.

The proof is straightforward and so omitted. The interpretation is as follows. Fix

a state ω. If all members of Group B (A) weakly prefer action a (b) at this particular state, then this same action is strictly preferred by all members of Group A (B), and hence is preferred by the population as a whole. Lemma1 does not say that if action

a (b) is preferred by Group A (B), it must simultaneously be preferred by Group B

(A). That is, there may exist a state such that group preferences differ. The following provides mild sufficient conditions for the existence of such a state.

Lemma 2. If either

• N is even, or

• N is odd and N >2 + p+1q−1. then

ΩA,ab∩ΩB,ba6=∅.

(14)

Lemmas 1 and 2 become important once the Language Game gets repeated and

best-response based dynamics are introduced. An immediate implication of Lemma

1 is that there is never a state such that members of both groups are simultaneously

indifferent. This means that the set of non-equilibrium states at which the dynamics can

“get stuck” is not large. Either condition of Lemma2 holding generates an immediate

tension, since there then exists a nonempty set of states at which the groups prefer

different actions. How the dynamics behave in these states is important.

3.3

Equilibria

Behaviour at states (0,0),(0, NB),(NA,0), and (NA, NB), is referred to as

group-symmetric, and denoted by ωbb, ωba,ωab, and ωaa respectively. For a given game, G, let

E(G) denote the set of group-symmetric equilibria. With an abuse of terminology, I refer toE(G) as theequilibrium set, since it turns out (Section4) that group-symmetric equilibria are the only serious candidates for long-run behaviour. The following

theo-rem, stated without proof, classifiesE(G) for various parameters.

Theorem 1. For a given Language Game, G,

1. State ωaa is always a strict equilibrium (a convention).

2. State ωbb is always a strict equilibrium (a convention).

3. State ωba is never an equilibrium.

4. State ωab is an equilibrium (a convention) if and only if

p≥ N

B

N −1 and q≥

NA

N −1

Parts 1 - 2 of Theorem 1 are easily understood since deviating from a symmetric

profile means failing to coordinate with everyone in the population. Part3is also clear.

One of the groups must be (weakly) smaller, and at stateωba, members of this (weakly)

smaller group observe strictly more than half the population adopting their preferred

action. Hence they wish to deviate.

The intuition for part 4 is as follows. To sustain ωab as an equilibrium, the high

(15)

from successful coordination with the members of the other group on the less preferred

action. This requires the product of “own group size” and “preferred local-interaction

payoff” to be sufficiently large. The inequality relevant to the larger group always holds,

and so one must only check that of the smaller group.

While Theorem 1 is simple, it is also intuitive. The following example, which was

looked at informally in Section2 and is carried throughout the paper, illustrates this.

Example 1. LetG1 = (10,5,3/5,2/3).

For these parametersωab is not an equilibrium, because from this state GroupB players

have an incentive to deviate to action a. Precisely, the second inequality of part 4 in Theorem 1(corresponding to the smaller group, Group B) does not hold.

Forωab to be an equilibrium, either NB must be increased to 6, or qmust be increased

to 10/14. If this happens, the Group B members’ relative reward for coordinating on action b over action a will have increased sufficiently that even coordinating with less than half the population on their most preferred action can compensate for the larger

number offailed coordinations.

4

Evolutionary Dynamics

4.1

Specification

Now suppose the Language Game becomes the stage game of a repeated interaction.

Time is discrete, begins at t = 0, and goes forever. Utilities are received every period and actions for tomorrow are chosen at the end of today. I avail of precisely the

assumptions on individual behaviour from KMR.

Assumption 1. Inertia: At the end of each period, a nonempty subset of players are

provided with the opportunity to revise their strategy for the following period.

Assumption 2. Myopia: When a player reacts, he/she best-responds to the current

environment.

Let ωt denote the state at time t. Aggregating responses from the individual level,

(16)

Ψ, where

ωt+1 := Ψ(ωt)

= ΨA(ωt),ΨB(ωt)

The mappings ΨA : Ω →

0, . . . , NA and ΨB : Ω →

0, . . . , NB are the respective group dynamics. By Assumptions 1 and 2, each of ΨA and ΨB possesses the “

Dar-winian” property of KMR, so that Ψ satisfies the following,

Definition 1(Group-Darwinian Adjustment Process). Say that Ψ = (ΨA,ΨB) has the

Group-Darwinian Property if, for any K ∈ {A, B}, 1. for all ω6∈

ω0 [ω0]K = 0, NK ,

sign ΨK(ω)−[ω]K= sign UK(a;ω)−UK(b;ω) (13)

2. • ΨK(ω) = 0, if [ω]K = 0 and UK(a;ω)≤UK(b;ω)

• ΨK(ω) = NK, if [ω] K =N

K and UK(a;ω)UK(b;ω)

Group-Darwinianism naturally extends the Darwinian property of KMR to a

situ-ation with multiple groups. It is similar, in that it makes the standard evolutionary

assumption that better strategies are no worse represented next period. It isdifferent,

in that the extent to which strategies are better represented may differ greatly across

group.

As time proceeds the population reacts every period, and so interest lies in repeated

applications of Ψ. For all ω, let Ψ0(ω) = ω, and for m ∈ N\ {0} define the m-fold repetition of Ψ, Ψm, inductively as Ψm(ω) = Ψ Ψm−1(ω). Define the set ofrest points, Ω0 := {ω|Ψ(ω) =ω} ⊂ Ω. It can be shown that for each ω ∈ Ω, there exists ˆmˆ(ω),

such that for allm ≥mˆˆ(ω), Ψm(ω)

0. Finally, let ˆm:= maxωmˆˆ(ω).

Definition 2. For each ω ∈ Ω0, define the basin of attraction of ω by V(ω) := {ω0| ∀m ≥m,ˆ Ψm(ω0) =ω}.

The basin of attraction of a rest point is the set of states such that if the process

(17)

so the state space can be partitioned into {V(ω)}ω∈E(G). When ngenA, it is possible that (nA

a −1,0)∈Ω0, and when ngenB, it may be that (NA, NB−nBb + 1)∈Ω0.

When Lemma 2 holds, the basins of attraction depend on Ψ. Keeping track of the

basins of attraction will be of prime concern for issues of equilibrium selection. To

assist in this, define a partial ordering on Ω as follows. If ω and ω0 are elements of Ω, write ω >a ω0, if [ω]A ≥ [ω0]A and [ω]B ≥ [ω0]B. That is, ω >a ω0 if, in state ω, there are (weakly) more players in both groups taking action a than in state ω0. The pair (Ω,>a) is a complete lattice with bottom element ωbb and top element ωaa. (Lattices

are briefly discussed in AppendixA.)

The class of dynamics satisfying Definition 1is very broad. Including the following

additional constraint will make classifying the basins of attraction far easier.

Definition 3 (Monotonic Adjustment Process). Say that Ψ : Ω → Ω is monotonic, if

for any pair ω0, ω00∈Ω,

ω0 >aω00⇒Ψ(ω0)>a Ψ(ω00). (14)

One can construct adaptive processes satisfying Definition 1, but not Definition 3,

and vice versa. I limit attention to the class of dynamics satisfying both definitions, and

refer to these to asmonotone Group-Darwinian processes. Soon I place further

restric-tions on this class, but first, a useful result implying that knowledge of theboundaries

of a basin of attraction is sufficient to fully classify it.12

Lemma 3. For any monotonic Group-Darwinian process, and any state ω ∈ Ω0, the set V(ω) is convex.

Proof. The proof is contained in Appendix C.

The subclass of monotonic Group-Darwinian dynamics on which I focus, is one

where each group responds at a constant rate.

Definition 4. Say that GroupK ∈ {A, B}responds atconstant rate ξK, if there exists

ξK ∈

1, . . . , NK , such that for allω ∈Ω, and all t,

ΨK(ωt) = [ωt+1]K =    

  

max{0,[ωt]K −ξK}, if UK(a;ωt)< UK(b;ωt),

[ωt]K, if UK(a;ωt) = UK(b;ωt),

min

[ωt]K+ξK, NK , if UK(a;ωt)> UK(b;ωt),

(18)

Definition 5. Say that Ψ is a constant rate dynamic if both groups adapt at constant

rates. If Groups A and B respond at constant rates ξA and ξB respectively, write

Ψ = (ΨA ξA,Ψ

B ξB).

Constant rate dynamics have a simple interpretation. Next period, a fixed number

of new agents from each group adopt that group’s best-response (provided this new

number, when added to the original number of agents who were already taking that

action, does not exceed the size of the group). This is natural since it means that

better strategies are better represented, but is arguably artificial since it implies that

exactly the same number of “unhappy” (not currently best-responding) agents change

their action each and every period.

The best-reply dynamic, B := (BA,BB), is a constant rate dynamic with ξ

A =NA

and ξB =NB. I now define what it means for one group to bemore dynamic than the

other. Best-replying will be the most reactive a group can be.

Definition 6. If Ψ is a constant rate dynamic, say that GroupK ∈ {A, B}is(weakly) more-dynamic than GroupK0 6=K, written ΨK

d(d)ΨK

0

, if

ξK0 <(≤) min

n

ξK, NK

0o

Groups adapt at the same rate, written ΨA

d ΨB, if neither is more dynamic.

Formally, ΨA

dΨB if either (i)ξA=ξB=ξ ∈

1, . . . ,min

NA1, NB1

, or (ii)

one of the following holds: ξA = NA ≤ ξB ≤ NB

or ξB = NB ≤ ξA ≤ NA

. The

first condition says that neither group best-replies, and, whenever possible, next period

equal numbers of new agents from each group adopt that group’s best-response. The

second says that if the smaller of the two groups is best-replying, then even if, whenever

possible, the larger group has more agents reacting each period, both groups are still

said to be evolving at the same rate.

Armed with Definitions 4-6, it will now be possible to make positive statements

about varying dynamics. Normative statements are more problematic. While increased

adaptiveness is desirable in the Language Game, that is because locally risk-dominant

actions coincide with most-preferred equilibrium actions.13 If these actions did not

13It is not even clear that the population profile where each player adopts his/her preferred action

(19)

accord, then greater group dynamism could be detrimental. See Section7 for more on

this.

4.2

Path Dependence and Basins of Attraction

In KMR’s homogeneous agent model, thepath dependenceof population behaviour rests

crucially on the initial conditions, but the final outcome is “independent ... to all but

the coarsest features of the dynamics”. The reasoning is simple: all players collectively

agree on what action is a best-response, so the optimal action today must be at least as

good tomorrow as the number of players taking it can only (weakly) increase. However,

in the Language Game, the final outcome may depend not only on the initial state,ω0,

but also on the exact specification of the dynamics.

To illustrate this, recall Example 1 from Section 3, where G1 =

(NA, NB),(p, q)

={(10,5),(3/5,2/3)}. (This is also the example covered in Section2.) Figure1 shows the state space Ω as an 11×6 lattice, with [ω]A ∈ {0, . . . ,10} on the horizontal-axis, and [ω]B ∈ {0, . . . ,5} on the vertical-axis. Each state is depicted by a circle.

Figure 1 is also a preference map, with ΩA,ba,ΩB,ab,ΩA,ab ∩ΩB,ba as the colour-coded partition of Ω. The set of blue circles is ΩA,ba, while the red circles

denote ΩB,ab. At the states depicted by hollow circles, the set ΩA,ab∩ΩB,ba, group

preferences disagree (by Lemma 1, GroupA prefers a, Group B prefers b). These sets are defined by (nAa, nAb, nBa, nBb ) = (7,9,10,6), calculated using equations (5)-(8).

To understand Group-Darwinianism, consider the state (5,3) in Figure 1. Since (5,3) ∈ ΩA,ab ∩ ΩB,ba, it must be that Ψ (5,3) ∈ {6, . . . ,10} × {0,1,2}. Mono-tonicity is also easily understood. Consider the pair of states, (5,2) and (5,3). Clearly (5,3)>a(5,2), and so Ψ (5,3)>aΨ (5,2). Since (5,2)∈ΩA,ab∩ΩB,ba, by Group-Darwinianism, Ψ (5,2)

∈ {6, . . .10} × {0,1}. If Ψ (5,3)

= (7,1), then monotonicity further restricts Ψ (5,2) to lie in the set {(6,1),(7,0),(7,1)}.

A large circle denotes a rest point of the dynamics. By Theorem 1, corner states

ωbb and ωaa are always rest points, while corner state ωab is also if the conditions of

part4 are satisfied. While states (nA

a −1,0) and (NA, NB−nBb + 1) can be rest points

nongenerically, no other state can be. Example 1 is both generic and the conditions

of Theorem 1 part 4 are not satisfied. So in Figure 1, the only two rest points are

(20)

~ s s s s s s s s s s s s s s s s s s s s s s s s s s c c c c c c c c c c c c c c c c c c s s s s s s s s s s s s s s s s s s s s ~ 0 1 2 3 4 5

0 1 2 3 4 5 6 7 8 9 10

Figure 1: ΩA,ba - ΩA,abB,ba - B,ab.

and ΩB,ba, the colouring scheme for these states also connotes the rest point to which

they will lead.

Figure1 is not a complete description since it does not depict basins of attraction.

To partition Ω into {V(ω)}ω∈

0, further information on the details of the dynamics is needed. To see how these details can matter, consider again state (5,3) in Figure 1. When Ψ = (BA,BB), the dynamics terminate at (10,5) via the path {(5,3) (10,0)

→ (10,5)}. When Ψ = (ΨA

1,ΨB3), state (5,3) leads to (0,0), via the path {(5,3) →

(6,0)→(5,0)→(4,0)→(3,0)→(2,0)→(1,0)→(0,0)}.

Appendix B shows in detail how to construct basins of attraction for arbitrary

parameters when the groups respond at constant rates. While this requires some

nota-tion, the analysis has an intuitive graphical interpretation. Essentially, the procedure

amounts to conducting the analysis done on state (5,3) in the previous paragraph, for all states in ΩA,ab ∩ΩB,ba. Generically it is true that ΩA,ba ⊆ V(ωbb) and ΩB,ab

⊆ V(ωaa), so only states in ΩA,ab ∩ΩB,ba need be considered. Examples 2 and 3

below compute the basins of attraction for two separate dynamics for the parameters

of Example1.

Example 2(Basins of attraction when Ψ = (BA,BB)). From Figure 1, the situation is

clear: no matter what the current state, the following state must beωbb, ωab, orωaa. Blue

states jump immediately toωbb, black states toωab, and red states toωaa. Formally, for

(21)

anyω∈ΩA,ab∩ΩB,ba,B(ω) =ωab andB2(ω) =ωaa. That is, states in ΩA,ab∩ΩB,ba

transition first to ωab, and from there to ωaa, and so with Ψ = B, are part of V(ωaa).

ThusV(ωbb) = ΩB,ba and V(ωaa) = ΩA,ab∪(ΩA,ab∩ΩB,ba) = ΩB,ab.

Example 3 (Basins of attraction when Ψ = (ΨA

1,ΨB3)). Figure 2 is not a preference

map. It colour-codes the basins of attraction, but also contains additional detail

illus-trating the recursive nature of the procedure used to construct them.

I show how to constructV(ωbb) from the ground up while glossing over some tedious

technical details. (AppendixB deals with these details.) Since ΨB

d ΨA, the

proce-dure says to begin by considering ΩA,ba ⊆ V(ωbb). The recursive part is as follows:

any state that maps to ΩA,ba must be part of V(ωbb); but then so must any state that

maps to any state that maps to ΩA,ba; but then so must ..., etc.

Begin by considering the upper boundary (ΩA,ba)+={ω| [ω]A+ [ω]B = 6}.

Con-sider those statesω∈ΩB,ba∩ΩA,absuch that Ψ(ω)∈(ΩA,ba)+. Call this set of states,

X, and consider the down set generated by it, X↓. Now define B1 := ΩA,ba X.

This appends the set{(2,5),(3,4),(3,5),(4,3), (4,4),(5,2),(5,3)}to ΩA,ba. In Figure 2these states are denoted by a hollow blue circle. But now, any stateω∈ΩB,ba∩ΩA,ab such that Ψ(ω) ∈ (B1)

+ must be part of V(ωbb).14 State (4,5) is the only state that

satisfies the requirement. This state is denoted by a hollow blue circle containing a

smaller solid blue circle in order to differentiate it. Since no states map to (4,5) under Ψ, it must be that all remaining states in ΩB,ba∩ΩA,ab are part ofV(ωaa) and so we

are done. These remaining states are denoted by a hollow red circle.

We can also trace how the boundaries of a basin of attraction vary. When ΨA dΨB,

a basin of attraction’s boundaries and total boundaries will coincide.15 By Lemma 3,

for any monotone dynamics the basins will be convex. Extrapolating from Figure1, for

any Ψ such that ΨA dΨB,

• V(ωbb)

+= V(ωbb)

++={(1,5),(2,4),(3,3),(4,2),(5,1),(6,0)} • V(ωaa)

− = V(ωaa)

−− ={(2,5),(3,4),(4,3),(5,2),(6,1),(7,0)}

while in Figure2, with Ψ = (ΨA

1,ΨA3), • V(ωbb)

+={(4,5),(5,3),(6,0)}

14The procedure is simplified by noting that actually only states inX\ΩA,ab need be considered.

15In fact, this statement also holds true under the condition thatξAnA

(22)

~ s s s s s s s s s s s s s s s s s s s s s s s s s s q c c c c c c c c c c c c c c c c c c s s s s s s s s s s s s s s s s s s s s ~ 0 1 2 3 4 5

0 1 2 3 4 5 6 7 8 9 10

Figure 2: V(ωbb) and V(ωaa) when Ψ = (ΨA1,BB).

• V(ωbb)

++={(4,5),(4,4),(5,3),(5,2),(5,1),(6,0)} • V(ωaa)

− ={(5,4),(6,1),(7,0)}

• V(ωaa)

−− ={(5,5),(5,4),(6,3),(6,2),(6,1),(7,0)}

5

Equilibrium Selection

Any deterministic Group-Darwinian dynamic, Ψ, induces a time-homogeneous Markov

process on the finite state space Ω. LetP be the associated transition matrix.16 That is,

for every pair of statesω0, ω00 ∈Ω,P(ω0, ω00)≥0 denotes the probability of transitioning fromω0 toω00, and for each ω∈Ω,P

ω0P(ω, ω0) = 1.

Let 4(Ω) denote the set of distributions on Ω. A stationary distribution of P is a row-vectorµ∈ 4(Ω), such thatµP =µ. The set of stationary distributions is denoted

40(Ω). Writing Pt for the t-fold application of P, and supp(µ) for the support of µ,

say that D ⊆ Ω is a recurrent class of P, if for all ω ∈ Ω, and all µ ∈ 4(Ω) with supp(µ) ⊆ D, limt→∞µPt

(ω) > 0 ⇐⇒ ω ∈ D. A state is recurrent if it is an element of a recurrent class, and transient otherwise. A singleton recurrent class is

anabsorbing state. For the Language Game, all recurrent classes are absorbing states,

corresponding with the rest points Ω0, as defined in Section 4.

(23)

All Markov processes possess at least one stationary distribution, while ergodic

Markov processes possess only one. The third assumption of KMR, perturbs the

deter-ministic dynamics in such a way as to induce a new Markov process that is ergodic.17

Assumption 3. Behavioural mutation: There is a small probability that an agent may

choose an action at random.

Specifically, after afforded decisions have been taken, but before payoffs are made,

with probability εA > 0 (εB > 0) each Group A (B) player switches his/her current

action, and with probability 1−εA (1εB) maintains it.18

For constant rate dynamics, I claimed that there were no grounds for always

as-suming ΨA

d ΨB. Similarly, there is no reason to suppose that behavioural mutations

occur with equal likelihood for the different groups.19 But, while ultimately interest

will lie in the case where (εA, εB) ↓ (0,0), I will make the strong assumption that

εA = εB = ε, for all states and all time periods. It is tempting to insist on a milder

condition likeεA =O(εB) andεB=O(εA),20 but the equilibrium selection results may

differ.21

Mutations of size ε > 0 induce a new, ergodic Markov process with associated transition matrixPεand unique stationary distributionµε. Consider a sequence of such transition matrices {Pε}ε>0, each with corresponding stationary distribution {µε}ε>0. By continuity, the accumulation point of {µε}

ε>0, µ?, is a stationary distribution of

P := limε↓0Pε. Interest lies in the states to which µ? assigns positive probability. Definition 7. Stateω isstochastically stable, orselected, ifµ?(ω)>0, and uniquely so

if µ?(ω) = 1. Let Ω? denote the set of stochastically stable states.

17Actually this assumption makes the processirreducible andaperiodic, but for finite state Markov

processes this is sufficient for ergodicity. SeeKarlin and Taylor(1975).

18This interpretation is qualitatively different, though technically identical, to KMR’s interpretation. 19Nor is there any reason to suppose that mutations are both state- and time-independent. The

effects that subtle differences in mutation rates can have on equilibrium selection in general settings are discusses in detail inBergin and Lipman (1996).

20Letting both εA andεB be functions of ε, εA(ε) =O εB(ε)

as ε↓0, if and only if there exists positive numbersM andδ, such thatεA(ε)≤ |M εB(ε)|for allε < δ.

(24)

Calculating µ? is the objective. This is done using tree-surgery techniques from

Freidlin and Wentzell (1998), first introduced to game theory in Foster and Young (1990). To do so, consider states in Ω as the vertices of a fully connected directed

graph, Γ?. An edge in Γ? from ω0 to ω00 is denoted (ω0 → ω00). A path from ω0 to ω00

is a sequence of edges{(ωi →ωi+1)}

m−1

i=0 where ω0 =ω0, and ωm =ω00, and all vertices

are distinct. A typical path from ω0 to ω00 is denoted by h(ω0, ω00), and the set of all paths from ω0 to ω00 byH(ω0, ω00). Using this, the set of all paths from a state ω to a setQ63ω, H(ω, Q), can be defined as follows, H(ω, Q) :=∪ω0∈QH(ω, ω0).

Let k,k denote the taxicab metric (L1 distance) on Ω. For any pair ω0, ω00, define

cΨ(ω0, ω00) := kΨ(ω0), ω00k. The mapping cΨ : Ω×Ω → {0, . . . , N} is known as a cost function. The value it takes for any pair (ω0, ω00) is the minimum number of simultaneous mutations needed to transition directly fromω0 toω00 when the dynamics are described by Ψ. In graph theoretic terms, this is the cost of edge (ω0 →ω00) in Γ?.22

For any functionτ : Ω→Ω, a path from ω0 toω00 inτ, is a path{(ω0 →ω1),(ω1 →

ω2), . . . ,(ωm−1 → ωm)}, where ω0 = ω0 and ωm = ω00, such that τ(ωi) = ωi+1 for all

i = 0, . . . , m−1. An ω-tree, τω, is a mapping τω : Ω → Ω such that: (i) τω(ω) = ω;

(ii) for every ω0 ∈ Ω\ {ω}, there is a unique path in τω fromω0 to ω. Say that ω00 is a

successor of ω0 in τω if τωm(ω

0) = ω00 for some m 1, and the immediate successor if

m= 1.

For each ω, let Tω denote the set of all ω-trees. The cost of τω ∈ Tω is the sum of

costs of its edges,

cΨ(τω) = X

ω06=ω

cΨ ω0 →τω(ω0)

Define the set of states that achieve minimum cost ω-trees as Ξ(G, cΨ) :=

n

ω? ∈Ω

for any ω ∈Ω,τmin ω?∈Tω?

cΨ(τω?)≤ min τω∈Tω

cΨ(τω) o

The following is the result of Freidlin and Wentzell (1998) as applied to the Language Game. Note it’s relation to Definition 7above.

Lemma 4. Stateω is stochastically stable, or selected, if ω ∈Ξ(G, cΨ), and uniquely

so if {ω}= Ξ(G, cΨ). That is, Ξ(G, cΨ) = Ω?.

22In more general settings,c

Ψ can take any value inR+∪ {∞}, where∞is the value if a transition

(25)

By Theorem 4 of Young (1993), the stochastically stable states are contained in a recurrent class, which in this case are the absorbing states, Ω0. The key to computing

minimum costω-trees of the absorbing states, is to find paths of minimum cost between all pairs of them. Since, V(ω0)∩ V(ω00) = ∅ for all distinct ω0, ω00 ∈ Ω0, this amounts

to finding paths of minimum cost from each absorbing state to the convex basin of

attraction of the others. Before doing this, we need to know the precise structure of

the basins of attraction. Appendix B does this formally, though the procedure was

sketched in Examples 2and 3.

The two main results of the paper, Theorems2and3, concern equilibrium selection.

They compute the stochastically stable equilibria for the case whereE(G) = {ωbb, ωaa}

and E(G) = {ωbb, ωab, ωaa} respectively. Lemma 8 in Appendix C is needed for both

results. Lemma8says that a path of minimum cost out of a region of the state space in

which the dynamics are unambiguous, involves a direct transition out. An immediate,

and, for my analysis, the important consequence, is that it holds for the symmetric

profilesωbb and ωaa.

The analysis of each case is quite different, but the case whereE(G) ={ωbb, ωab, ωaa}

uses intuition that is more easily described in the first case and so I begin with it.

5.1

Equilibrium set is

{

ω

bb

, ω

aa

}

The main result in this section, Theorem 2, has a nice geometric interpretation that I

will explain before the formal statement. Refer back to Figures 1 and 2 which regard

the parameters from Example 1. From equation (5), nA

a = 7 ≤ NA, and therefore

ωab = (10,0)∈ V(ωaa). Figure1, with the hollow states assumed to be red, represents

the basins of attraction when ΨA

d ΨB. In this case, a minimum cost ωaa-tree has

cost of 7 - one candidate costly edge being (0,0)→ (7,0)

. A minimum cost ωbb-tree

has cost of 9 - one candidate costly edge of the path being (10,5) → (1,5).23 Thus when ΨA∼d ΨB, clearly ωaa is the stochastically stable outcome.

Now look at Figure 2 representing the basins of attraction when Ψ = (ΨA

1,ΨB3).

As described in Section 4.2, the greater rate of adapting for Group B means that

V(ωbb) “eats” in to the region ΩA,ab ∩ΩB,ba. The cost of a minimum cost ωaa-tree

23Some work is required to show this since the path “jumps” through part of the region where

(26)

is unchanged, since the transition (0,0) → (7,0), is still available as an edge on a path from ωbb to V(ωaa). However, the minimum cost ωbb-tree is now different. Now

the transition (10,5)→(4,5)

is the unique costly edge on the path of minimum cost

fromωaa toV(ωbb). This has a cost of 6. The stochastically stable equilibrium changed

fromωaa to ωbb.

The following theorem performs this computation for any parameters.

Theorem 2. Suppose Condition 4 of Theorem 1 does not hold, so E = {ωbb, ωaa},

and that the monotonic Group-Darwinian adjustment process is such that both groups

evolve at constant rates. Let τ?

ωbb and τ ?

ωaa, denote minimum cost ω-trees for ωbb and

ωaa respectively. Then,

1. if (NA,0) = ω

ab∈ V(ωaa), then

cΨ(τω?aa) =n A

a, (15)

cΨ(τω?bb) =n A b −

(V(ωbb))+

N W , (Ω A,ba

)+

N W

; (16)

2. if (NA,0) = ωab∈ V(ωbb), then

cΨ(τω?bb) = n B

b , (17)

cΨ(τω?aa) = n B a −

(V(ωaa))−

N W , (Ω A,ba

)−

N W

. (18)

The set of stochastically stable states are those with the ω-tree of minimum cost. Proof. The proof is found in AppendixC.

Example 1 falls into case1 of the theorem. Some general statements can be made.

First, the state ωab plays an important role. When it is not an equilibrium, it always

lies in the basin of attraction of the larger group’s preferred convention. If the larger

group is weakly more reactive, then the region where preferences disagree will be part

of that group’s basin of attraction, and the larger group’s preferred outcome will be

stochastically stable. Furthermore, the feature whereby the minimum cost ωaa-tree is

unchanged from the example is robust, as it is always unchanged by varying dynamics

(27)

So when Group A is the larger group, the minimum cost ωaa-tree is nAa (since

(nA

a,0)∈ V(ωaa) always). The upper bound for a minimum cost ωbb-tree is nAb is given

in Equation (16). The second term in Equation (16) can be thought of as having an

indicator function in front, such that it only “kicks in” when GroupB is more reactive. This term computes the cost by which the minimum cost ωbb-tree is reduced from nAb.

A lower bound for the minimum cost ωbb-tree is given by nBb . This means that

Group B, the smaller group, must not only react quicker, but also have stronger rela-tive preferences for its preferred equilibrium to be stochastically stable. Conversely, if

GroupA, the larger group, has equally strong preferences, then (a,a) will always be a stochastically stable state.

5.2

Equilibrium set is

{

ω

bb

, ω

ab

, ω

aa

}

Again, I will work through an example and then state the theorem. The example is a

variant ofG1 from Example1, where the size of the groups have been altered.

Example 4. LetG2 = (12,9,3/5,2/3). For these parameters E(G2) ={ω

bb, ωab, ωaa}.

Using equations (5) - (8), we get that (nAa, nAb, nBa, nBb ) = (9,13,14,8).

The state space forG2is a 13×10 grid, as depicted in Figure3. The first thing to note

is that regardless of rates of reaction, there exist some states at which the dynamics are

unambiguous. These states are represented by solid circles in Figure3, and are colour

coded by the convention to which they lead. The blue states represent ΩA,ba, the red

states comprise ΩB,ab, and the black states define the set Ωab ⊂(ΩA,ab ∩ΩB,ba), as

defined at the beginning of the proof of Theorem3. The remaining states are depicted

as hollow circles but they are also colour coded. State (7,2) is hollow and blue, because, depending on the dynamics, it can be an element ofV(ωbb) or V(ωab), but not V(ωaa).

Similarly, the hollow red circles, states (8,3),(8,4),(8,5),(9,3),(9,4), and (10,3), are so coloured since no matter what the dynamics, they cannot be elements ofV(ωbb). The

black hollow circles can be elements of any basin depending on the dynamics.

To solve for the stochastically stable states, we need to compute paths of

min-imum cost between the six pairs, (ωbb, ωab), (ωbb, ωaa), (ωab, ωbb),(ωab, ωaa),(ωaa, ωbb),

and (ωaa, ωab). Using Lemma 8, a minimum cost path over all h0 ∈ H(ωbb, ωab), has

edge of positive cost ωbb→(nAa,0)

(28)

w r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r b b b b b b b b b b b b b b b b b b b b b b b b b b b b b b r r r r r r r r r r r r r w b b b b b b r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r r rw 0 1 2 3 4 5 6 7 8 9

0 1 2 3 4 5 6 7 8 9 10 11 12

Figure 3: ΩA,ba=•. Ωab =•. ΩB,ab =•.

The cost of this path isnA

a (= 9). A similar statement holds for a path of minimum cost

fromωaa to ωbb, which has cost nBb (= 8), with costly edge ωaa →(NA, NB−nBb )

.

Lemma 10, in the Appendix, is essentially the analog of Lemma 8 for states in Ωab

(the solid black states in Figure3). Lemma 10is used to show that the unique path of

minimum cost fromωab toωbb has costly edge ωab →(NA−(nAb −NB),0)

with total

cost nA

b −NB (= 4). Similarly, the path of minimum cost from ωab to ωaa has costly

edge ωab →(NA, nBa −NA)

with total cost nB

a −NA (= 2).

The last two pairs to be considered are (ωbb, ωaa) and (ωaa, ωbb). An upper bound

for the paths of minimum costs between these can found by combining paths computed

above. For example, the path of minimum cost fromωbbtoωaa, must have cost at most

equal to nA

a + (nBa −NA) = 11. Lower cost paths between symmetric conventions can

be found when the dynamics are varied, but not when the groups react at equal rates.

This is stated formally in the following Theorem, whose proof is in AppendixC.

Theorem 3. Suppose Condition 4 of Theorem 1 holds, so E(G) ={ωbb, ωab, ωaa}, and

that the monotonic Group-Darwinian adjustment process is such that both groups evolve

at constant and equal rates. Let τω?bb, τω?ab, and τω?aa, denote minimum cost ω-trees for

ωbb, ωab, and ωaa respectively. Then,

cΨ(τω?bb) =n B b + (n

A b −N

B) (19)

cΨ(τω?ab) =n B

b +nAa (20)

cΨ(τω?aa) =n A a + (n

B a −N

A

(29)

The set of stochastically stable states are those equilibria withω-tree of minimum cost.

When the groups react at equal rates all hollow states in Figure 3 are part of

V(ωab). Let’s now apply Theorem 3. Plugging in to equations (19) - (21), we get that

cΨ(τω?bb), cΨ(τ ?

ωab), and cΨ(τ ?

ωaa) have costs 12 (= 8 + 4), 17 (= 9 + 8), and 11 (= 9 + 2) respectively. So when ΨA∼dΨB, conventionωaa is stochastically stable.

This is a good time to mention a few features of the set up. Due to the 2-dimensional

nature of the state space, there need not be a connection between the size of the basins

of attraction and stochastic stability.24 In fact, it is possible that the equilibrium with

the smallest basin of attraction is stochastically stable. To illustrate this refer back

to Figure 3 once again. When ΨA ∼d ΨB, simple counting yields |V(ωbb)| = 45,

|V(ωab)|= 44, and |V(ωaa)|= 36. We saw already thatωaa is stochastically stable, and

this is despite it having the maximal basin of attraction.

Another observation concerns the notions of the radius and modified coradius of a

recurrent class, introduced by Ellison (2000). The radius is defined as the minimum

number of mutations necessary to escape the basin of attraction, while the modified

coradius is defined as the maximum (over all states) of the minimum cost of a path

necessary to reach it, where one subtracts from the cost the radii of the intermediate

recurrent classes through which the path passes. Ellison (2000) shows that when the radius of a recurrent class is strictly greater than its modified coradius, this recurrent

class contains a stochastically stable state. Though the result is not universally powerful

as it only provides sufficient conditions, it does hold for many of the most frequently

studied games. In the Language Game, however, it need not be applicable whenE(G) =

{ωbb, ωab, ωaa}. Example4with ΨA∼dΨBdisplays this. The [radius, modified coradius]

pair for each absorbing state is given as follows, ωbb 7→[9,10], ωab 7→[2,9], and ωaa 7→

[8,9]. Note that all absorbing states have a greater modified coradius than radius. I now show (Theorem 4) that if ωab is stochastically stable when ΨA ∼d ΨB, then

the set of stochastically stable equilibria is independent of the specifics of the dynamics.

What happens is this. First, varying rates of adjustment will never lower the cost τω?

ab. Showing this is straightforward. Second, while it may lower the cost ofτ?

ωbb orτ ? ωaa, it will not lower the cost enough to alter selection. That is, Theorem4 does not say that

the minimum cost ω-tree of each convention is necessarily unchanged and as given by equations (19) - (21). Rather it just says that if ωab is ever stochastically stable for

(30)

some constant rate dynamic, it will always be for any constant rate dynamic and it will

haveω-tree with cost given by that in equation20.

Theorem 4. Suppose Condition4of Theorem 1holds, so E ={ωbb, ωab, ωaa}, and that

the monotonic Group-Darwinian adjustment process is such that both groups evolve at

constant rates. Ifωab ∈Ξ(G, cΨ) whenΨA∼dΨB, thenωab ∈Ξ(G, cΨ) for any constant rate Group-Darwinian adjustment processΨ.

Proof. The proof is found in AppendixC.

To reiterate, Theorem 4 does not say that equilibrium selection is unaffected by

varying rates. To see this, again consider Example4 and suppose that Ψ = (ΨA1,BB).

In Figure 3, all the hollow blue and black states are then elements of V(ωbb), and all

the hollow red states are elements ofV(ωab). The minimum cost ω-trees forωab andωaa

are unchanged. However, a new minimum costωbb-tree is attained with costly branches

ωab → (12,2)

and ωaa → (4,9)

, for a total cost of 10. Thus the stochastically

stable equilibrium is now ωbb. A formal result of this form would be similar to that of

Theorem3. The difference would appear in the analogs to equations (19) and (21). The

minimum cost ω-tree for ωbb, for example, would be the minimum of that in equation

(19), and the sum of (nB

a −NA) and a term similar to k (V(ωbb))+

N W, ωaak. While

this second term’s value must be at least nB

b , slightly more care must be taken since it

need not be the case that ωba ∈ V(ωbb). And a statement like that of Lemma 9 need

not hold true in this case.

6

Stability versus Welfare

When the population is homogeneous, players collectively agree on what action they

“would like to have taken” earlier today, and hence what action they will choose for

tomorrow if afforded a revision opportunity. The main issue in existing large population

coordination problems is the tension between efficiency and risk-dominance. Both KMR

and Young (1993) show that the risk-dominant action will emerge under perturbed best-response based dynamics. This result isnegative in the sense that the locally

risk-dominant equilibrium action need not coincide with the Pareto-risk-dominant one, and may

(31)

In the Language Game, both symmetric profiles are Pareto-efficient equilibria, and

there is never uniform preference over these. Thus Pareto-efficiency is useless as a

selection device. The purpose of this section is to rank profiles in E(G) according to various welfare criteria, and then compare this ranking to the outcome(s) selected by

decentralised dynamics. Since all profiles in E(G) are group-symmetric, to infer how members of the population rank them, it suffices to analyse the situation from the

perspective of any one agent from each Group.

Theorem 5. Within the set of group-symmetric profiles:

1. ωaa and ωbb are always Pareto-efficient.

2. ωab is Pareto-efficient if and only if

p≥ N −1

2N −NB2 and q≥

N−1

2N −NA2 (22)

Proof. The proof is straightforward and is omitted.

A natural question to ask is the relationship between Pareto-efficiency and

equilib-rium. The following shows that a Pareto-efficient profile must be an equilibrium, but

that not all equilibria are Pareto-efficient.

Theorem 6. If ωab is Pareto-efficient, then it must be an equilibrium. Butωab may be

an equilibrium without being Pareto-efficient.

Proof. Follows from conditions in part4of Theorem1, and those in part 2of Theorem

5. The second imply the first, while the first need not imply the second.

Consider another variant of G1 from Example 1, in which Group B’s preference

for coordinating on action b are stronger. Specifically, consider the new game G3 =

(10,5,3/5,5/6). In this case, E(G3) = {ω

bb, ωab, ωaa}. Evaluating utilities at each

equi-librium givesUA(ω

aa), UB(ωaa) ={42/5,28/6},

UA(ω

ab), UB(ωab) ={27/5,50/6},

and {UA(ω

bb), UB(ωbb)} = {28/5,140/6}. Note that ωab is Pareto-dominated by ωbb,

and so provides an example of Theorem6 at work.

It can be checked thatωab is not a stochastically stable inG3. One might then

won-der whether or not stochastic stability will ever select a Pareto-inefficient convention.

(32)

Theorem 7. Suppose E(G) = {ωbb, ωab,

Figure

Figure 1: ΩA,b⪰a - ΩA,a≻b ∩ ΩB,b≻a - ΩB,a⪰b.
Figure 2: V(ωbb) and V(ωaa) when Ψ = (ΨA1
Figure 3: ΩA,b⪰a = •. Ωab = •. ΩB,a⪰b = •.

References

Related documents