spatialviolence.pdf

(1)

Ethnic Violence Across Space

Hannes Mueller Dominic Rohner David Sch¨onholzer*

June 3, 2020

Abstract

Ethnic violence requires spatial proximity between ethnic groups, but the role of distance between groups for conflict remains understudied. We develop a structural model of spatial vio-lence in which ethnic groups recruit fighters strategically across space to maximize their impact. Spatial decay of violence determines the equilibrium placement of fighters and implies specific spatial patterns of conflict. We then estimate the structural parameters of the model using fine-grained data on ethnic groups and violence from 24 ethnically divided countries. We find that in more than half of these, spatial decay is substantial: half of all ethnic violence dissipates after 350km. Consequently, violence is higher near ethnic borders, although a substantial share of violence originates from distant locations. Using the estimated parameters of the model, we also find that ethnic conflict is strongly asymmetric, especially in states with higher state capac-ity. Counterfactual estimates suggest that setting up barriers can reduce violence but pacifying groups with more grievances is often more effective.

Keywords: Conflict, Ethnic Violence, Spatial Data, Distance Costs, Polarization, Fraction-alizaton, Segregation, Insurgency.

JEL Classification: D74, K42, Z10.

*_{Mueller: Institut d’Analisi Economica (CSIC), Barcelona GSE and CEPR (email: [email protected]).}

Rohner: Department of Economics, University of Lausanne and CEPR (email: [email protected]). Sch¨onholzer: Institute for International Economic Studies, Stockholm University (email: [email protected]).

Acknowl-edgements: We thank Nezih Guner and three anonymous referees for very helpful suggestions. Further, we are grateful to Quentin Gallea, Yihuan Hu, Dong Ook Eun, Augustin Tapsoba and Nghia-Piotr Trong Le for excellent research assistance, and to Joan Maria Esteban, Mathias Thoenig, and Debraj Ray for useful comments. We also

thank seminar participants at Institut d’Analisi Economica (CSIC), UC Berkeley, NYU Abu Dhabi, and CEMFI Madrid, and participants to the IEA World Congress, Barcelona GSE Summer Forum, EEA annual congress in Geneva, ENCoRe Bonn, ”Social Interactions, Norms and Development” conference in Moscow, Swiss Development

Network workshop in Fribourg, and the ”Conflict Prediction for Prevention” conference in Barcelona. All errors are of course ours. Hannes Mueller acknowledges financial support from Grant number ECO2015-66883-P, the Ramon y Cajal programme and the Severo Ochoa Programme and Dominic Rohner is grateful for financial support from the

(2)

1 Introduction

For several decades, the role of ethnic diversity in conflict has been a major focal point for

aca-demics, NGOs, politicians and journalists alike. Strikingly, however, the emphasis has been almost

exclusively on nationwide ethnic population composition, ignoring the location of groups. This

omission weighs heavily, as ethnic demography on its own can only explain a mere tenth of violence

eruptions.1 _{Indeed, the heterogeneity of outcomes even for the most ethnically polarized countries}

is impressive (see Montalvo and Reynal-Querol 2005 for ethnic polarization definition and data):

For every high-polarization country with recent conflict (say, Guatemala, Afghanistan, Northern

Ireland, Sri Lanka) there are others that are equally ethnically polarized but have been peaceful

for decades or even centuries (Jordan, Mauritius, Switzerland, Malaysia).

In order to capture this heterogeneity in the fate of ethnically diverse countries, we move beyond

ethnic population shares by addingspace to the equation. The motivating rationale for the focus on

spatial interaction is that interactions between people are typically easier, and hence more intense

and frequent, when they are geographically close. This decay of interaction with increasing distance

has been found in various fields of economics. For example, trade economists refer to “iceberg trade

costs” increasing in distance and use “gravity models of trade” to account for the fact that trade

between more distant places is more costly, and drawing on this spatial decay has proven fruitful

to explain the distribution of economic activity across space (see e.g. Fajgelbaum et al. 2017 and

Donaldson 2018).2

To integrate space into the study of ethnic conflict, we build a novel game-theoretic model of

ethnic interactions across space and estimate the structural parameters of the model. In our setting,

each ethnic group strategically recruits fighters across heterogeneous locations to maximize their

spoils of conflict, taking into account the spatial composition of opposing groups in the vicinity

and local support in the homelands. This leads to specific predictions how the interaction of

spatial distance and population distributions matters for conflict intensity. We then estimate the

parameters of our structural model for 24 countries with substantial ethnic conflict (drawing on

the classification by Kalyvas and Balcells 2010, as discussed in detail below). For each country, we

estimate the extent to which spatial distance is a key feature of strategic conflict interactions, as

well as a series of further parameters related to fighting motivation, reliance on local support and

the spatial origins of attacks. It turns out that for 13 out of 24 countries space is a key element of

conflict interaction and for the median spatial decay we find that attack effectiveness is reduced by

half over a distance of 350 km.

1

See e.g. column 1 in Table 1 of Esteban et al. (2012) where nationwide ethnic diversity measures and total population together account for 13 percent of variation in conflict onsets.

2

(3)

Our setting also allows us to estimate the shares of violence crossing district borders, the

group-level stakes of conflict and the relative aggressiveness of a given group (i.e. the difference between

the attacks perpetuated versus suffered). These parameters and measures endow our framework

with substantial additional explanatory power relative to widely used country-level diversity

mea-sures such as ethnic polarization. We also demonstrate that by separating targets and perpetrators

of violence our model provide several meaningful statistics like the share of violence inflicted across

locations, the aggressiveness of groups or the asymmetry of the ethnic conflict. To interpret our

results, we also relate our estimated parameters to topographic, geological and demographic

covari-ates. Finally, the estimates of our structural model allow us to perform counterfactual estimations,

highlighting the country-specific violence potential linked to changing spatial decay.

Our findings provide several new insights into the nature of ethnic conflict. First, we document

in simple reduced-form regressions that conflict is more intense in locations near ethnic boundaries:

if a location is 10% closer to an ethnic boundary, it suffers 1.3% more conflict casualties than

comparable locations. Second, this reduced-form finding masks substantial heterogeneity in the

extent of spatial decay in our structural model: some countries show little evidence of distance

decay, while for others almost all violence may dissipate even after 100km. Third, as a result of

this decay of violence across space, the location of strategically placed fighters and the resulting

violence in a location may depend on the spatial composition of ethnic groups across the entire

country. Thus, conflict in a given location may originate almost entirely from other places: across

the 24 countries in our sample, we associate more than three quarters of ethnic violence in a district

to the ethnic composition of surrounding districts as opposed to its own. Fourth, the extent to

which violence emanates from outside a location is strongly driven by group-specific motivations

to engage in violence with others. We find that this engagement exhibits substantial asymmetry:

some groups instigate almost all ethnic violence in a country, while others seem to be mostly on

the receiving end of aggression.

Two caveats apply to our argument. First, we study how geographical proximity affects the

risk of attacks in ongoing conflicts. This focus makes perfect sense in the short-run when fighting

is acute. However, in the long-run, positive interactions between ethnic or religious groups could

build trust between them and the actual motives for attacks may be reduced (see Rohner et al.

(2013b). Hence, while bigger geographical distances can indeed reduce the number of attacks

during a conflict (as emphasized by the current paper), in post-conflict reconstruction ”building

bridges” and reducing inter-group distance may be important policies to re-enforce peace. This

subtle point has important policy implications: While physically separating groups (e.g. through

the construction of barriers) may indeed be justifiable while fighting is still virulent, it may be

(4)

to measure the support bases of the groups engaged in conflict. This is important, as these bases

produce the support needed for violence becoming visible elsewhere. In this study we rely on the

location of ethnic homelands to capture support bases and show that, even with this crude measure,

our model is able to provide several new insights. However, our work should still be seen as only a

first step in bringing the notion of space and the separation of attackers and targets to the empirical

study of conflict.

Our paper is linked to several strands of the literature. In particular, it is related to the

theoretical literature on ethnic and religious conflict (e.g. Horowitz 2000; Varshney 2001; Esteban

and Ray 2008; Rohner 2011; Caselli and Coleman 2013; Rohner et al. 2013b), and the empirical

studies linking ethnic diversity to civil war at the country-year level (see Fearon and Laitin 2003;

Collier and Hoeffler 2004; Montalvo and Reynal-Querol 2005; Cederman and Girardin 2007; Collier

et al. 2009; Esteban et al. 2012; Michalopoulos and Papaioannou 2016). These articles generally

find that ethnic heterogeneity (and in particular ethnic polarization) increases the risk of conflict,

but – contrary to our current contribution – they do not study the spatial patterns of ethnic

violence.3 Our main contribution here is to show that, holding the overall size of different ethnic groups (i.e. ethnic polarization or fractionalization measures) fixed, overall levels of violence will

differ dramatically depending on how the populations are distributed in space. In this regard our

work directly relates to studies on the impact of segregation, with some scholars finding that it

increases the risk of ethnic conflict (Medrano, 1994; Olzak et al., 1996), while others argue that

partition could be a solution to ethnic conflict (Horowitz, 2000).4

In terms of further related literatures, in recent years there has also been an increasing number

of papers studying violence at a disaggregate, local level (e.g. Harari and Ferrara 2018; Rohner et al.

2013a; Dube and Vargas 2013; Berman et al. 2017; K¨onig et al. 2017; Huber and Mayoral 2019).

Also the micro-level literature on insurgency and counter-insurgency is relevant (see Kalyvas 2006;

Lyall 2010; Bhavnani et al. 2011; Kocher et al. 2011; Berman et al. 2011). Some of these studies

stress the importance of territorial control for understanding how violence evolves in civil war. This

provides a key stepping stone for our model as it provides the idea of bases of support which come

with territorial control over population or resources. Importantly, however, none of these studies

provide a formal model of conflict in which the location of attackers and targets differs. As this

literature moves into more and more disaggregated data, the confounding of motivation, origin and

destination of attacks will become problematic, as the drivers of conflict like economic deprivation,

ideology, ethnic hatred or the presence of natural resources may be increasingly spatially separate

from where attacks are observed. Our model is able to account for this, by allowing attackers to

3_{The results in Spolaore and Wacziarg (2016) suggest that this may not hold for international wars where}

geneti-cally similar populations are engaged more in conflict.

4

(5)

operate or draw support from other locations than the targets of violence.

Maybe closest to our contribution is the literature focusing on spatial patterns of violence. There

is a small literature in political science studying –inspired by the epidemiological literature on the

spread of diseases– diffusion and clustering patterns of violence over space and time (Townsley

et al., 2008; Schutte and Weidmann, 2011). Further, Novta (2016) builds a simulation-based model

of how conflict spreads dynamically in space. She models the armed groups in 109 municipalities

as separate players who can only attack in their home village while the focus of our framework lies

precisely in providing a model of attacks across regions.5 Bhavnani et al. (2014) link segregation to urban conflict, using a simulated agent-based model, calibrated for Jerusalem. Finally, the empirical

contribution of Balcells et al. (2016) studies post-conflict sectarian clashes in Northern Ireland from

2005-2012. A main difference between our current paper and the existing work on spatial violence

patterns is that –contrary to the existing literature– our empirical analysis estimates a structural

model of conflict in space. We are able to derive such a model by assuming that conflict is motivated

by aggregate factors - exactly as the ”macro” contest literature does (see Konrad 2009). In this way

we build a link from the newer studies using micro data to the classic theoretical models studying

the intensity of conflict at the country level.

The predictive power of our framework is useful when it comes to forecasting the impact of

conflict on economic outcomes. Besley and Mueller (2012) show for Northern Ireland that compared

to peaceful areas, housing in the most violent areas sold for between 2 and 17 percent less

-depending on the level of violence, and Mueller (2016) shows that changes in the distribution of

violence within a country can have a substantial impact on aggregate growth. Thus, predicting

well the location of attacks does not only help in forecasting local economic outcomes, but also

countrywide performance.

The paper is organized as follows: Section 2 sets up a simple model of spatial interaction. In

Section 3 we introduce the data used and display reduced-form stylized facts. Next, while Section

4 is devoted to a detailed account of our structural estimation, in Section 5 we carry out the

econometric analysis and present the main results and robustness checks. Section 6 provides a

counter-factual policy analysis and Section 7 concludes. Appendices provide further details and

results.

2 Model

We begin the model exposition of how space matters for armed conflict by setting up an environment

with multiple ethnic groups spread across many districts. Groups may endogenously recruit fighters

(6)

from their own ethnic groups at some cost, who can conduct attacks on other groups in their district

as well as all other districts depending on how difficult it is for them to project violence across space.

After laying out the foundations of the model, we derive the equilibrium placement of fighters,

which gives us an expression for predicted violence in a district as a function of the spatial

distri-bution of groups across all districts, the extent of spatial decay of violence, local recruitment cost,

and group-specific features driving the level of their engagement in the conflict. Once we

character-ize the equilibrium, we derive various novel country-level and group-level indices capturing salient

features of spatial violence and shedding new light on the nature of ethnic conflict. Finally, we

discuss the assumptions underlying the model in the context of existing models of conflict.

2.1 Setup

We model conflict between groups g, k = 1, ..., G which are residing in regions i, j = 1, ..., J. The

population of group g in region i is given by Nig. We denote group population aggregates as

Ng =PJi=1Nig. Our model integrates two ideas. First, attacks happen across space. In particular,

attacks by fighters of groupgin regionion groupkin regionjare governed by a distance parameter

wij. Attacks on this dyad are then given by

Aijgk =Fig·wij ·Njk,

which makes clear that our model has a spatial dimension. Theoretically, the distance weightswij

could take many forms. In the empirical model we will assume a simple functional form

wij = exp (−κ·distance(i, j))

so that for κ >0 we will see fewer attacks between locations that are further apart.

The second key assumption is that the recruitment of fighters Fig is endogenously determined

by the struggle of all groups at the macro level. Specifically, we assume that the expected benefits

of conflict to group g are given by a conflict prizeRg times the probability of winning the contest

given by

Ag PG

k=1Ak

Rg,

where Ag is the total number of attacksAijgk of groupg across all districts and opposing groups.

This is the standard contest success function and is widely used in the conflict literature (see the

literature survey in Konrad 2009). This implies that we are able to describe aggregate violence as a

(7)

allows us to estimate the model despite the fact that the recruitment of fighters is endogenous and

determined locally.6

Groupg recruits local fighters, Fig in regions i= 1, ..., J, to maximize the objective function

πg=

AgRg PG

k=1Ak

−1

2

J X

i=1

F_ig2 N_igµ.

The first term are the expected benefits of attacks, while the second term captures the fact that

recruiting fighters is costly. We make this specific cost function assumption only for simplicity.

Two aspects are critical: First, we model the cost function as convex in the number of fighters to

ensure an equilibrium.7 Typically, salary costs of fighters should be thought of as convex, as the first few hirings will be cheap given that it will be feasible to target exclusively individuals with low

wages in the regular economy (and hence low opportunity costs of fighting) and/or with low moral

costs of killing. When extending the pool of fighters in a region, the group also needs to recruit

individuals with better outside options and higher moral costs who will require higher monetary

compensation.

The second key aspect of the payoff function is that recruitment costs of fighters are a function

of the local population. More specifically, the larger the population of locals of a given group

in a given region, the cheaper the hiring costs, i.e. µ ≥ 0. This assumption makes sense in a

context where fighters face lower risks of identification and being arrested and can benefit from

local support through information and safehouses. This assumption finds considerable backing in

military guides like the US counter-insurgency manual (Army and Corps, 2006) and the existing

literature in political science (Kalyvas, 2006) which both stress the role of local informants in civil

wars. This means that even professional armies will require the support of local population when

fighting in asymmetric wars. We discuss these assumptions in greater detail in Section 2.6.

To close the model we will use matrix notation as this simplifies the description of optimal

fighting effort. Denote the group-specific vector of district populations as a J ×1 vector Ng =

(N1g, ..., NJ g)0 and theJ×J spatial weights matrixW = (W1, ...,WJ) withWi = (wi1, ..., wiJ)0.

Total attacks projected by groupg from region iare then

Aig =Fig J X

j=1

wijNj,−g =FigWi0N−g (1)

6

We assume throughout thatRg >0, which implies that all groups in a country are engaged in conflict. This

ensures the existence of an interior solution in our model. In the empirical implementation we will also assume that

Rg >0, but will allow the group-specific value ofRg to be arbitrarily close to zero to capture groups that inflict very

little violence.

7

(8)

where Nj,−g = Pk=6 gNjk denotes the population of all groups that are not g at location j and N−g = Pk6=gNk is the corresponding vector of total population in all groups that are not g in

locationsj = 1, ..., J. Similarly, instead of examining attacks by groupgemanating fromi, we can

write violence against groupg in regionifrom all other regions as

Vig =Nig J X

j=1

wijFj,−g =NigWi0F−g, (2)

where F−g is a J ×1 vector with elements Fj,−g = Pk6=gFjk. This expression shows that the

amount of violence against g in i is the product of its local population size Nig and the spatial

projection of all opposing fighters onto region igiven byW0_iF−g.

2.2 Equilibrium Spatial Violence

The equilibrium solution to this model is surprisingly simple. Equilibrium fighters of group g in

regioniare given by

F_ig∗ =αgN_igµWi0N−g (3)

where αg = RgA−g

(PG k=1Ak)

2 only varies at the group level. In other words, we can express the fighting

effort at each location simply through the interaction of aggregate, strategic incentives (αg) with

the local interaction of population in space (N_igµW0_iN−g). Empirically it will be important thatαg

is merely a scaling factor that only varies across groups but not locations - this will allow us to

identify the parameters of the model through local variation in population and violence. Inserting

(3) into (2) and aggregating across all groups, we get all equilibrium attacks into regioni:

V_i∗=

G X

g=1

NigW0i X

k6=g

αkdiag Nµk

WN−k, (4)

where diag(·) turns the vector Nµ_k into a J ×J diagonal matrix with elements N_jkµ on the diagonal and zeros on the off-diagonals. Existence and uniqueness are shown below in Appendix

A.1. The expression in equation (4) fully captures the mechanics of conflict posited in our model:

the last term in equation (4), the J ×1 vector WN−k, captures the incentives for group k to

recruit fighters in each district, depending how easy it is to reach targets. If many opposing groups

are clustered in a location, it is more appealing to place fighters there. The middle term, the

1×J vectorW0_iP

k6=gαkdiag Nµ_k

, captures the average distance to the support bases of fighters

to conduct violence in location i. This depends on the engagement of groups in the conflict, αk,

(9)

location i. Finally, the third expression, the scalar PG

g=1Nig, is the sum of all potential victims

across all groups ini. The product of these three terms is the amount of violence inipredicted by

the model.

Distance enters in this formula through the weights of the weighting matricesWi andW. Low

values in Wi shield locationi from outside attacks as fighters from other locations cannot attack

it easily. The weighting matrix W appears also at the end of the equation because it determines how many fighters are recruited in other locations j.

2.3 Illustration

Despite its simplicity, the model generates complex spatial interactions represented in equations

(1) and (2). To illustrate these interactions we show a simple example with two groups at three

locations in Figure 1. The locations are called 1, 2 and 3 and the groups are called R (red) and B

(blue).

In the top panel A, we show the incoming attacks by group B into location 1 on group R. This

is the kind of data that a typical violence dataset would provide - the violence suffered by targets

at a location. Group R at location 1 is attacked by blue fighters stationed in the same location,

F1,B. These fighters have N1,R targets at location 1 and a distance weight w1,1 so that violence

from blue fighters at location 1 is given by F1,Bw1,1N1,R. Blue fighters at locations 2 and 3 also

attack the red population at location 1, only now the distance weights are given by w1,2 and w1,3.

If distance plays a role we would expect thatw1,1> w1,2, w1,3 so that the blue fighters at location

1 are more effective in attacking red population at location 1.

A key advantage of the model is, however, that we can use the estimated parameters to provide

estimates of outgoing violence as well. In the bottom panel B, we show the outgoing attacks by

red fighters at location 1 into other regions. With the estimated parameters of the model this

violence can be easily calculated. These fighters have N1,B targets at location 1 and a distance

weight w1,1 so that violence from red fighters at location 1 is given by F1,Rw1,1N1,B. Panel B also

illustrates nicely that the recruitment of fighters at a location will be a function not only of the

potential target population at that location but also of targets nearby. In the example, recruitment

of fighters at location 1 is partly driven by the blue population at locations 2 and 3. This feature

makes the spatial dynamics of the model non-trivial as the incoming attacks in panel A will be a

function of the respective neighbourhoods of location 2 and 3, given that fighters at those locations

are recruited with targets in mind.

With higher spatial decay, i.e. quickly fallingwij with longer distance, recruitment at a location

will mostly be driven by the population that lives in that region. In the example, this would imply

(10)

two groups do not mix then violence is reduced. The opposite case is where there is very little

spatial decay. In the example this would imply, for example, that the number of red fighters

is only a function of the population from which the fighters can be recruited from, i.e. the red

population. The recruitment of red fighters would then be motivated by total blue population

(N1,B +N2,B +N3,B) and, assuming µ > 0, be stronger in location 2 as it has a larger red

population.

2.4 External Violence, Aggressive Groups and Asymmetry

The model of spatial interactions offers the opportunity to understand conflict data in a new light

by separating perpetrators from targets. We use the model to calculate three statistics, which have

an immediate interpretation and can help understand the conflict dynamics of a country.

The first summary statistic that we can calculate is the share of all violence at a given location

that travelled from outside that location. This statistic separates the violence coming from fighters

at the same location from the violence originating from other locations. To calculate the share we

first calculate the total sum of violence at locationias:

V_total∗ =

J X

i=1

V_i∗ =

G X

g=1

X

k6=g

N0_gWF∗_k,

whereF∗_k is the vector of equilibrium fighters for group kgiven by:

F∗_k=αkdiag Nµk

WN−k.

We can contrast this with the violence emanating only from within a given region. In the matrix

notation this can be derived with a simple change in the spatial weights, by only letting these

fighters attack who are in the same region. This is done by changing the attack weighting matrix

toW= diag (w11, ..., wJ J). We call this local violence:

V_local∗ =

G X

g=1

X

k6=g

N0_gdiag (w11, ..., wJ J)F∗k.

This implies that we can write the share of violence from outside as:

vexternal=

V_total∗ −V_local∗ V_total∗ =

PG g=1

P

k6=gN0gWFf ∗_k PG

g=1

P

k6=gN0gWF∗k

, (5)

(11)

share of violence coming from outside locations. We regard vexternal as an important statistic to

capture the extent of violence being “mis-attributed” to a target location despite the fact that

it originated from fighters at other locations. For studies using disaggregated data to assess the

drivers of conflict, expression 5 is an important object to study.

Incoming and outgoing violence are also useful to understand how “aggressive” groups are by

comparing the number of casualties that they suffer to the attacks that they inflict. Note that the

sum of attacks inflicted by group gacross all districts is simply

A∗_g,_total≡

J X

i=1

A∗_ig =F0∗_gWN−g,

while the sum of violence against groupg across all districts

V_g,∗_total ≡

J X

i=1

V_ig∗ =N0_gWF∗−g.

Hence, at the country level we can write:

V_total∗ =A∗_total =

G X

g=1

X

k6=g

N0_gWF∗−g

and at the group level we calculate an index ag,aggressive given by

ag,aggressive=

A∗_g,_total−V_g,∗_total

V_total∗ , (6)

which is positive if a group is inflicting more casualties than it is suffering and negative if it is

suffering more casualties. The index ranges from −1 to 1, where −1 captures that a group is

suffering all violence in the country and 1 that it is generating all violence in the country. If groups

inflict as many casualties as they suffer, ag,aggressive will be close to 0.

From this index, we can easily calculate an index of asymmetry of violence at the country level:

aasymmetric=

1 G

G X

g=1

|ag,aggressive|=

1 G G X g=1   PJ i=1 A ∗

ig−Vig∗ PG k=1 PJ i=1Vik∗



, (7)

which captures the average, absolute aggressiveness score. Thus, if all groups inflict the same

number of casualties as they suffer, then aasymmetric = 0. In contrast, if some groups conduct all

(12)

2.5 Ethnic Interaction versus Nationalized Violence

As we are interested in analyzing the extent of spatial interactions, we will focus on the case of

µ > 0 for most parts of the paper. This assumption implies that fighters cannot be recruited ad

hoc in regions without local support, where local support should be seen as a broad concept which

includes situations where the local population will provide information to stationed troops.

The central empirical question is then how much spatial decay there is.8 If spatial weights are decreasing, our model generates violence patterns where violence clusters in ethnically mixed,

densely populated areas. This is because average distance between targets and fighters will be

smallest (wij will be highest) where ethnic groups mix, leading to more intense recruiting of fighters

in equilibrium. We call such a type of conflict ethnic interaction. The illustration in Section 2.3

presents a simple example of such a case.

If µ >0 but all weights are the same (wij = wi0_j0 for all i, j, i0, j0) then the distance between

fighters and targets is not important and the attacks on Nig are simply a function of conflict

motivationRg and the total population Nbut not how concentrated population is in space. If we

normalize wij = 1 we can re-write the optimal recruitment of fighters

F_ig∗ =αgN_igµ 

 X

j X

k6=g

Njk 

 (8)

so that the local recruitment of fighters at locationiis now only a function of the total number of

targets in the country, the motivationαg and the ability to recruit locallyN_igµ. Military technology

will obviously play a role here as the use of mobile units, for example, would mean that it is better

to build strong military bases and start attacks from there. Violence in locationiis then given by

V_i∗ =

G X g=1 Nig   X

k6=g X

i

F_ik∗ 

 (9)

which is simply the number of all fighters in all other regions attacking the population at locationi.

This will generate violence patterns in which violence follows target population without the spatial

patterns of ethnic interaction violence. We call such violence patternsnationalized violence.

2.6 Discussion of Model Assumptions Relative to Alternative Settings

Key assumptions of the modeling strategy concern the payoff functions. In particular, we assume

that the groups’ winning chances depend on the number of successful attacks rather than e.g. on

8

(13)

the number of fighters they field. This reflects the view that realized attacks matter more than

potential ones, and is also in line with the available micro-level data that contains information on

fighting events but not on troop strength (see Section 3 below). Further, in our model recruitment

and fighting efforts of a groupg in a regionionly depend on i) the aggregate nationwide strategic

interaction and ii) the demographic spatial distribution of the opponent group (i.e. an exogenous

spatial weights matrix), but not on the (endogenously selected) number of adversary troops in a

given area. This is in line with empirical findings that in recent decades civilians are often heavily

(and deliberately) targeted in wars (Esteban et al., 2015).

These building blocks keep our setting tractable, allow us to have a unique interior equilibrium

and estimate our structural model with empirical data.9 This setting also has the advantage of generating predictions on fighting intensity that can be compared to existing country-level measures,

such as ethnic fractionalization and polarization, but augmented by the impact of distance at the

local level.

Our approach can be compared to a series of related frameworks, such as, for example, Colonel

Blotto games which study situations where two groups strategically decide to allocate fighters to

different battlefields (see e.g. Shubik and Weber 1981; Roberson 2006). This usually leads to very

complex equilibrium structures that involve mixed equilibria and that are very difficult to test

empirically, unless some strong restrictions are made, such as e.g. assuming symmetry (Hart, 2008)

or having sequential moves (Powell, 2007, 2009).

Our setting is also complementary to models that focus on situations of anarchy and the dynamic

spread of violence. For example, in Franke and ¨Ozt¨urk (2015) there is a multitude of fighting groups

which are all one node and engage in bilateral conflicts over local fiefs. This captures well situations

of anarchy with local landlords fighting each other in a disaggregate way. Related to this, the model

of De Jong et al. (2014) studies the surge of empires, starting from a series of nodes where each node

can conquer other nodes until one eventually gains control over the whole territory. In contrast to

this complementary strand of the literature, our model captures other types of contexts (typically

with a stronger state and nationwide fighting goals rather than a multitude of local warlords fighting

each other) and focuses on a different research question (i.e. the spatial decay of ethnic violence,

rather than the dynamics of empire construction). Finally, in such models of the spread of dynamic

violence, every bilateral conflict is stochastic and hence there is an infinite number of potential

paths taking place, all with some positive probability. Given that we only observe ”one history”

among this infinite number of counter-factual histories, our data would not allow us to back out

empirically the underlying parameters of such as setting (at least not with forward-looking actors).

9_{While our framework keeps, for the sake of tractability, the spatial weights matrix exogenous, note that we}

(14)

3 Data and Stylized Facts

In order to confront our theoretical framework with the data, we assemble a novel cross-sectional

dataset combining a series of raw data sources on ethnic groups and conflict across 24 countries.

It contains information on the location of a total of 3,014 regions, 140 ethnic groups, and 570,023

casualties of conflict from 84,711 conflict events. We begin by describing the data sources, after

which we turn to our sample inclusion criteria. Finally, we describe how we combine the data from

these sources into a single consistent dataset across our country sample.

3.1 Data Sources and Sample Construction

To construct consistent subnational units across countries, we use data from the Global

Adminis-trative District Maps (GADM, 2020) database. In addition to country boundaries, it contains data

on every existing level of administrative divisions from every country in the world. Using

admin-istrative units as opposed to, say, artificial cells has the advantage of being politically meaningful

and often being in line with relevant physical or social boundaries. We focus on the second level of

administrative divisions, typically corresponding to counties in the United States or d´epartements

in France, which we refer to as districts. For countries that do not have a second administrative

level or for whom the number of resulting units is too large for the structural estimation procedure,

we aggregate to the first administrative level, corresponding to the state or province.10

We combine these administrative divisions with data on geocoded conflict from across the globe.

Specifically, we rely on conflict events data from the Uppsala Conflict Data Program (UCDP)

Georeferenced Event Dataset (GED, Stina 2019). UCDP is among the best sources to consistently

document conflict across a large number of countries and periods. We focus on all fatalities over

the years 2000-2018 in all countries included in UCDP.

To consistently identify the spatial distribution of ethnic groups within a country, we draw upon

data assembled in the Geo-referencing Ethnic Power Relations (GeoEPR, see Cederman et al. 2016)

database. The data contains potentially overlapping territories of ethnic group boundaries for 128

countries. Compared to alternative datasets, GeoEPR has the advantage of being recent and

relatively precise, as well as compatible with information on ethnic power relations. We

comple-ment these data with compatible data from the Geographical Research On War, Unified Platform

(GrowUP, see Cederman et al. 2016) project, which we use to measure various country-level and

ethnic group-level characteristics.

10_{Computational time of our estimator is convex in the number of districts. Hence, we aggregate districts to the}

(15)

While we are able to observe the extensive margin of the presence of an ethnic group in any

location using the shapefiles from GeoEPR, the data lacks information on the population

frequen-cies, which are a crucial element of our model. To construct population frequencies by district and

ethnic group, we use data from the Gridded Population of the World (GPW, see SEDAC 2018)

database. The data contains estimated population counts for every 2.5-by-2.5 minute land cell on

earth, which corresponds to about 5-by-5 kilometers at the equator.

3.1.1 Sample Inclusion Criteria

To arrive at our 24 included countries, we combine several criteria to restrict the sample of countries

to settings in which our model plausibly applies. In particular, for a country to be included, it

needs to fulfill three conditions: First, we only include countries that have suffered from a relatively

large number of conflict instances, as we need to have enough identifying variation to estimate the

parameters of our model. In particular, we require that a country has experienced at least 200

conflict events according to UCDP. This criterion is fulfilled by 41 countries worldwide.

As a second inclusion criterion, we focus on settings that are characterized by ethnic or religious

conflict – given that we model inter-group strategic interaction rather than, say, class conflict in

a homogenous society. To distinguish ethnic versus non-ethnic conflicts, we draw on the conflict

classification of Kalyvas and Balcells (2010), a widely-used reference on cross-country classification

of conflict. Combining this restriction with the one on the minimal number of conflict events, we

end up with 26 countries in our sample.

Our third and final inclusion criterion concerns the availability of data on the spatial distribution

of ethnic groups within and across countries. Specifically, using the GeoEPR data, we require that a

country has at least two politically relevant ethnic groups in the year 2000. Applying this criterion

further reduces our sample to the final 24 countries.

This sample of countries included in the analysis is depicted in Figure 3. Darker colors (red)

represent a higher death toll from conflict over the period 2000-2018, while lighter colors (yellow)

correspond to fewer fatalities. Countries in gray are not part of the sample. We see that our sample

contains various countries from Africa, Asia and the middle East.

3.1.2 Construction of Ethnic Group Population per District

The next challenge is to estimate the population of each ethnic groups residing in a given district,

corresponding to the quantityNig in our model. To the best of our knowledge, there does not exist

census data at the district level that covers in a consistent way a wide number of countries. Hence,

we rely on the combination of GeoEPR and GPW to proxy for the ethnic composition of district

(16)

We first estimate district population by summing up population counts of all GPW cells in the

district. We then identify all groups that at least partially overlap with a district and compute for

each group the sum of GPW cells weighted by the share of the district overlapping with the ethnic

group. In case more than one ethnic group is present in a given cell, we divide the population in

the cell equally among them. Since GeoEPR deems some ethnic groups not politically relevant, we

do not count GPW population cells that do not overlap with any ethnic group.

3.2 Example: Iraq

In Figure 4 we show an example country to demonstrate the different types of data we are using.

The map shows the different districts in Iraq. Population is shown as a grayscale layer where darker

colours mean lower density. This is overlayed with the EPR data on the homelands of ethnic groups

which we show as colored areas. The third ingredient to our model is the UCDP violence data

which we depict as red rings, where larger rings represent larger fatality counts. The figure also

illustrates spatial patterns of violence that ethnic interactions will imply. In the North of Iraq most

violence is generated in areas where two homelands of ethnic groups meet. Districts deeper inside

homelands suffer less from violence.

Obviously, our estimation exercise heavily relies on the data that is put into the model. There

are, for example, parts of the country which are not coded as belonging to one group or another.

Furthermore, there are obviously local areas within the EPR homelands where other groups live

which we will not be able to capture with this relatively aggregated data. What we offer is a new

look at existing data through the lens of our model.

3.3 Stylized Facts in the Reduced Form

Drawing on the aforementioned datasets, we start the empirical investigation with a display of

reduced-form stylized facts of ethnic geography and conflict. The rough overall patterns of how

ethnic borders relate to conflict is displayed graphically in Figure 2: Places closer to an ethnic

group boarder are exposed to much larger casualty levels.11

To refine the analysis, we study the role of distance to boundaries between ethnic groups on both

the intensive margin and the extensive margin of conflict in our cross-sectional data. In Table 2, we

regress a binary measure of conflict exposure at the district level (taking a value of 1 if ever between

2000 and 2018 a given district has experienced conflict) as well as the log number of casualties and

conflict events on the distance of the district centroid to the nearest ethnic boundary and district

11_{Since these reduced-form estimates are not computationally demanding, we show results with a district}

(17)

population. We gradually introduce country fixed effects, a battery of controls, and dominant group

fixed effects (see table notes).12 This means we are comparing the presence and amount of violence within ethnic groups of each country between districts that are nearer or further from an ethnic

boundary. It turns out that indeed within a given country the most conflict-prone districts are those

located close to ethnic group boundaries and having a large population. Specifically, controlling for

various district-specific features, being 10% closer to an ethnic boundary is associated with a 0.3%

higher probability of any conflict, 1.1% more conflict events, and 1.3% more conflict casualties.

While these associations illustrate the nature of our data and stress well the motivating stylized

facts, we want, in what follows, more specifically confront our novel model to the data, estimating

the structural parameters of our model. In this way, we can not only provide evidence for the

mechanisms proposed in the model – that violence is conducted across space, recruiting fighters to

attack other ethnic groups in the vicinity – but also quantify the structural indices of external and

asymmetric violence discussed in Section 2.4.

4 Estimation

The central expression of our theory is predicted violenceV_i∗ =Vi(N,D, θ) in locationias a

func-tion of three components: (a) the spatial distribufunc-tion of groups across locafunc-tionsN= (N1, ...,NG);

(b) the bilateral distance between locations D with elements Dij = distance (i, j); and

parame-ters θ = (α1, ..., αG, µ, κ), where κ captures exponential spatial decay between locations: wij =

exp (−κDij). Using equation (4), we can write this as

Vi(N,D, θ) = G X

g=1

NigW0i(κ) X

k6=g

αkdiag Nµk

W(κ)N−k

We estimate these parameters via maximum likelihood. To this end, denote Yi as the observed

violence in location i, which we assume is an independently and identically distributed draw from

a Poisson distribution with meanVi(N, θ). Under this assumption, the log likelihood function ofθ

given Vi(N, θ) is

L(θ|Y1, ..., YJ,N,D) = J X

i=1

[YilogVi(N,D, θ)−Vi(N,D, θ)−log (Yi!)]

(18)

and the estimator is given by

b

θ= arg max

θ∈ΘL(θ|Y1, ..., YJ,N,D)

where Θ⊂_RG+2_{. To ensure that parameter estimates are consistent with our model, we restrict}

the parameter space over which we search for a maximum to the positive orthant.13 In this sense, having established the existence of spatial decay in the reduced form, we think of this estimation

exercise as quantifying the degree of spatial decay for a given country. Appendix B.1 shows that

the estimator is able to precisely recover the speed of spatial decay in Monte Carlo simulations.

Since our primary interest lies in the existence and speed of spatial decay of violence, we estimate

κ in two steps. First, we estimate all parameters including µ flexibly. Second, we fix µ to some

value, using the estimates from the first step as initial values. In this way, we can examine how

κ varies across countries in a ceteris paribus fashion: holding constant the marginal cost of local

support, how far does violence travel in a country? We fix µ to various values according to the

results from the first step, including its mean, the mean among those with sufficiently large κ, as

well as at discrete intervals from the mean. We show in Table A.3 that estimates of κ are similar

for various fixed values ofµ.

This two-step procedure also ensures that the parameter vector remains well-identified from an

econometric perspective even whenκ becomes small. As κ approaches zero, group-specific factors

captured inαg become collinear with the cost of local support because they both rely on variation

in country-wide group-specific population, a feature we discuss in Appendix A.3. By fixing µ to

a positive value, we ensure that αg and µ remain separately identified. Despite this potential

collinearity issue in the first step, we show empirically that the estimator converges robustly in the

first step in Appendix B.2.

5 Estimation Results

In this section we describe the results of our estimation. We first provide a brief overview of the

estimated parameters and then discuss in detail how our structural model can open up new avenues

on understanding patterns of violence.

13

Numerically, we restrict the search to an optimum in the positive orthant by taking the logarithm of positive initial values and then exponentiating the parameter values in each iteration. Since the positive orthant is a convex subset ofRG+2, uniqueness of the equilibrium carries through to the estimation, and we show in Appendix B.2 that

(19)

5.1 Overview

In Table 3 we provide an overview of the estimates from our model. We find telling evidence for

spatial spillovers. In a bit more than half of the countries we estimate a κ > 0, and the overall

sample mean ofκ >0 is 0.015.14 This means that in many countries violence does not simply take place where target population lives but rather where different ethnic groups are located close to

each other. In contrast, the population living deep inside ethnic homelands (and far from ethnic

borders) is less exposed to violence. These kinds of patterns cannot be simply accounted for by

spatial spillovers of violence – they are driven by the interaction of groups across as space highlighted

by our model.

How important is the spatial decay implied by these estimates? To answer this question, we

display the spatial weights corresponding to the κ estimates in Figure 5. The red line represents

the spatial weights of the medianκ and the blue line depicts the meanκ. The other lines illustrate

all individual country estimates with κ > 0. At the mean κ fighters at a location become about

a quarter as effective after 100 km distance. Fighters at the median κ of 0.002 lose effectiveness

much less rapidly and still maintain about half of effectiveness after 350 km.

We also find that our estimates for the average group motivation for conflict, Rg, varies

sub-stantially across groups. These estimates are hard to interpret, however, and we will therefore use

two alternative measures at the country level, vexternal and aasymmetric (see the formal definitions

above). The share of violence coming from outside a district, vexternal, varies significantly across

countries, but even in the lowest decile it is more than 50 percent (see Appendix Figure A.10).

This is an important novel result, because it highlights the importance of separating the origin and

target of violence when analyzing disaggregated violence data. It becomes clear from our estimates

that inter-group interactions across districts play typically a much bigger role than stressed by the

literature on subnational conflict (see literature review above).

Our second summary statistic at the country level is aasymmetric, which lies between 0 and

1, where 0 corresponds to full symmetry, while 1 implies that some groups only inflict fatalities

and other groups only suffer violence. The mean asymmetry is 0.348 which means that in many

countries some groups are estimated to be substantially more aggressive than others. We find a

large amount of heterogeneity in our measure of asymmetry, with some countries being close to

symmetry whereas others are completely asymmetric.

At the group level we provide estimates for Rg, as well as the violence suffered by a group, Vg,

and the violence inflicted by a group, Ag. The three are obviously related, as groups with a high

Rg will inflict more violence. As Vg and Ag are two sides of the same coin, their mean value is

14_{We round values of}_κ_{to zero if they are below 1}_/₁₀_,_{000. While this is arbitrary, it coincides with the distinction}

(20)

identical. Interestingly, estimates ofVg are slightly more dispersed than the estimates of Ag: Most

groups suffer some violence and some few groups suffer huge amounts.

5.2 Individual Country Estimates

In Table 4 we show our estimates country by country. It is immediately clear from this list that

spatial interactions are an empirically relevant phenomenon, with 13 out of our 24 countries

dis-playing spatial dynamics with a positive estimate of κ. In several countries, estimates of κ are

fairly large and suggest extreme spatial decay. For example in densely-populated Sri Lanka it is

hard to project violence over wide distances. In contrast, in slightly less than half of the countries

we do not detect a spatial decay of violence, and one can categorize them as having nationalized

violence patterns. These countries include e.g. Afghanistan – where the United States combatted

the Taliban nationwide – and Turkey, Russia and Pakistan which all have a strong national army

engaged in fighting opposition or rebel groups. Hence, given that in these cases national military

campaigns are more prominent than spatial interaction between ethnic groups, we unsurprisingly

find aκ= 0.

In the next two columns of Table 4 we show two measures introduced in section 2.4. The

measure of vexternal captures what share of violence in a district comes from outside the district.

The shares we find here are very large with many countries featuring values close to 1. This provides

an easy benchmark for how important spatial interactions are. In the third column we display our

estimate ofaasymmetric. This statistic is between 0 and 1 where 1 captures a situation where groups

that inflict most casualties do not suffer from any casualties. An interesting feature of the results

here is that countries known for having weak state and military capacities, like e.g. the Democratic

Republic of Congo, have the most symmetric violence. We return to the role of military capacity

in the following section.

Finally, also the estimates for the medianRg at the country level vary considerably, as depicted

in the last column of Table 4. According to our estimates in some countries the overall stakes of

fighting are very high (e.g. in Sri Lanka), while in others they are small (e.g. in India).

5.3 Interpretation

After having displayed above the main estimates, we will discuss in the current subsection the fit

of the model and the various ways in which our model-driven approach allows for new insights to

be derived from the violence data. We start by discussing the fit of our model to the violence data.

Our framework is essentially modeling violence as an interaction between districts. Can such a

(21)

As an example, figure 7 shows the (log of) the district estimates for Iraq together with (log of)

the original violence data, revealing a very strong association with a correlation coefficient of 0.853.

It is important to keep in mind that our estimates suggest that 75 percent of this violence came

from outside the respective district, i.e. the majority of violence experienced by ethnic groups in

a district is driven by an interaction with population in other districts – which means that this

strong correlation is not just mechanically driven by district characteristics.

Moving beyond examples, we can aggregate the estimates of local violence to understand

aggre-gate violence. To give an impression of how well our model fits the violence data at the aggreaggre-gate

level, we plot the model fit at the cross country level in Figure 6. On the y-axis we depict observed

violence and on the x-axis we show the numbers coming from our model of spatial violence. The

fit is striking, and implies that our model is able to capture large differences of violence across

countries by modeling local interaction.

Our model provides novel insights beyond the existing macro literature and well-known diversity

indices. To illustrate this, we start by depicting in the left panel of Figure 8 the fit of the log

violence across countries and the ethnic polarization index, as provided by Montalvo and

Reynal-Querol (2005). In line with the findings in this literature (see e.g. Montalvo and Reynal-Reynal-Querol

(2005); Esteban et al. (2012)), there is a remarkably strong positive and significant relationship

between log violence and ethnic polarisation. To highlight what our framework can add to this,

in the right panel of Figure 8 we display the relationship between overall violence and our spatial

model fit, controlling for polarization. What we find is a very robust positive and significant

(p-value=0.008) relationship, showing that the distribution of ethnic groups across space provides

considerable additional explanatory power beyond what is captured by ethnic polarization.

As an additional check for whether our spatial model provides interesting predictive power,

we produce a comparison with a simple model of violence at the district level which only takes

into consideration the population in that district. To make this comparison we focus on the set of

countries for which we estimate κ >0 as we only expect our model of spatial interactions to add

beyond a model based on target population when there are spatial interactions. Figure 9 compares

the error of our spatial model on the x-axis with the error from the population based model on the

y-axis. In Panel A we aggregate the predictions to the country level and show the error at that level.

Countries below the red 45 degree line have a higher error in our model whereas countries above

the line have a larger error in the simple population model. For the large majority of countries

our spatial model provides much better explanatory power. In Panel B we show the example of

Iraq at the district level. Note that errors in the population based model occur for two reasons.

First, a district has a large population but this population is not interacting with the population

(22)

relatively small population but is located very close to population from other ethnic groups and

therefore suffers heavily from violence. The low error rates of the spatial model highlights again

the importance of studying interactions between different groups across districts when modelling

violence in some countries.

But how does the model generate this additional explanatory power? Unsurprisingly, a first

factor is the variation in κ across countries. There are obviously many potential drivers of κ.

As shown in the appendix (see Figure A.11), we find reasonable associations with geographic

features like land area of the country, ruggedness and the presence of deserts. This suggests that in

larger countries with smooth terrain spatial distance inhibits fighting less. Obviously, these simple

associations need to be interpreted with great caution, as they are based on only 24 data points.

15 _{But the approach we propose here suggests the possibility of an alternative way to study the}

effect of factors like curfews, barriers, infrastructure or peacekeeping forces which would all be

testable with more fine-grained data on the location of ethnic groups and specific hypotheses on

determinants ofκ in the weighting matrixW.

A key aspect of our model is that it combines the deep parameter estimates of κ, µ and the

group-specificRg, together with the population data, to produce the predictions shown in Figures

6 and 7. One implication of having such a rich set of estimates is that they allow us to separate the

origin and target of attacks. In particular, we are able to back out the group-specific aggressiveness.

In Table 5 we correlate the group-specific violence share and aggressiveness, respectively, with

two key covariates, i.e. group size and group-level oil holdings (from GrowUP (Cederman et al.

2016), see Table notes), that have attracted attention in the literature on group-level conflict (see

e.g. Mayoral and Ray (2017); Morelli and Rohner (2015)). In the first part of the table we have as

dependent variable the share of country-level violence involving a given ethnic group. In column 1

we run a simple regression as one would run it without the estimated model. This simply correlates

the share of violence in the country observed in the homelands of ethnic groups with the group

size and group-level oil holdings. We find that larger groups are on average more often involved

in fighting, and that oil holdings are not statistically significant. These findings carry over to the

columns 2 and 3 where we include country fixed effects and controls.

Do these findings mean that larger groups are more often entangled in fighting because they

are (inherently) more aggressive? Our framework, and in particular columns 4-6 of Table 5, answer

this question, drawing on the aggressiveness statistic provided in equation (6). This variable ranges

between -1 and 1 and captures whether a group attacks more than it is the target of attacks. When

we correlate this variable with group size, the sign switches, and we find a negative relationship, i.e.

15_{It also needs to be kept in mind that our analysis here should only be regarded as a first step towards unwrapping}

(23)

that larger groups are on average less aggressive. We also find that oil-rich groups are on average

more aggressive.

Asymmetry of violence at the country level should relate to how strong the different ethnic

groups are and whether incumbents can control a strong military force – a group which is in control

of a stronger state can more easily project violence across the country and targets and origins of

violence will coincide less geographically. We would then, conditional on observing violence, expect

a larger degree of asymmetry in countries with stronger militaries as these are directed against

non-incumbent groups. And, indeed, we find a positive association between military expenditure

per capita and asymmetry at the country level.16 _{We also find a strong positive link between GDP}

per capita and asymmetry which is in line with broader ideas on state capacity proposed in Besley

and Persson (2010). Both associations are shown in Appendix Figure A.12. Note, that this holds

despite the fact that our proxies of state capacity and overall levels of violence are not significantly

associated in our dataset.

These results demonstrate the potential of our model-based approach to provide new insights

into the origins of violence and on which countries and regions can be expected to become most

violent if conflict were to break out. This being said, it is of course crucial to interpret these

correlations with much caution, as omitted variables remain a valid concern.

5.4 Robustness

We run several robustness checks in the appendix. Most importantly, we check whether our

esti-mates of the role of spatial decay are a function of whether we fixµand whether the level matters.

In Appendix Table A.3 we show that estimates ofκare remarkably robust across different estimates

of µ. This further justifies fixingµ to be able to make ceteris paribus comparisons.

Given that casualties are measured noisily in UCDP, we also use the number of conflict events

itself as our outcome of interest. This yields qualitatively similar estimates ofκin Appendix Table

A.3, both with fixed and flexible µ. We conclude that patterns of spatial decay are indeed mostly

driven by structural features relating the distribution of ethnic groups to the frequency of conflict

across locations, as opposed to the exact choice of proxy for conflict intensity in a location.

6 Counterfactuals

One of the key advantages of estimating our structural model over reduced form estimates is that we

can use it to run counterfactual analyses in which we change key parameters of the model. In Table

6 we run four counterfactual experiments. First, we half κ: such a policy could be thought of as

16

(24)

increasing infrastructure projects or facilitating ethnic targeting through information technologies.

While such measures may have large economic benefits, they may also under some conditions

facilitate ethnic violence. Strikingly, this experiment produces dramatically different effects across

countries. In Algeria or Kenya violence may increase more than sevenfold, whereas in Bangladesh

or Chad it would not even have doubled, and in Angola the simulation suggests that violence would

have increased by just over 20 percent. Our take on this is not that such infrastructure projects

should necessarily be put on hold in high-risk countries, but rather that the government may want

to accompany them with pacification efforts (see discussion of columns 3 and 4 below).

In column 2 we simulate a policy which doubles the spatial decay parameterκ. This is a policy

which can be thought of as – during peaks of violence – severing or controlling transport links

or imposing curfews. We find a dramatic fall of violence in most countries. Interestingly, this is

the sort of reaction that has been found by scholars who have analyzed the role of the Covid-19

pandemic on armed conflict (see e.g. Berman et al. 2020). The reduced mobility of armed actors

seems to have indeed reduced violence.

In column 3 we show an experiment in which the ethnic group with maximum Rg is pacified,

i.e. where the group’sRg is set to 0. This will have the biggest impact in countries where one group

was engaging in a lot of violence. Reducing the stakes of fighting could be achieved for example

through power-sharing agreements that result in reducing the edge between being inside versus

outside the government (see Mueller and Rohner 2018). Column 4 performs a related experiment –

setting the conflict prize of all groups with an above-median estimated conflict prize to the median

– and again finds a strong pacification potential of such measures.

7 Conclusion

The aim of this article has been to introduce the concepts of space and distance into a canonical

model of conflict, in order to understand the drivers of conflict from a distribution of violence and

population across space. This approach has been extremely successful in other areas like trade

(Fajgelbaum et al., 2017; Donaldson, 2018) by integrating trade models with empirical measures

of trade costs to understand the distribution of economic activity across space. However, coercion

follows a very different logic than the exchange of goods and the conflict literature has up to now

typically followed the approach of using contest functions as a way to model conflict effort (see e.g.

Konrad 2009). In the current contribution we have brought the idea of spatial interactions into this

class of models with the goal of studying situations in which the drivers of violence are separated

spatially from the violence.

(25)

coun-tries – highlighting a substantial between-country heterogeneity that is associated among others

to topological factors. In more than half of the countries in our sample inter-group distance plays

a key role for violence, and in the median country having 350 km larger distances between given

groups cuts fighting by half. Our estimates allow for tracing back the origin of violent attacks and

carrying out a series of counterfactual experiments. Beyond estimating the role of space separately

for each country in our sample, our model also permits to forecast potential implications of specific

shocks to the model parameters. For example, the lockdowns put in place in the face of the current

covid-19 pandemic make it typically harder for anyone (fighters included) to travel across space,

resulting in higher κ, which in turn would typically reduce the scope for ethnic violence. This is

consistent with recent empirical evidence (Berman et al., 2020).

Several avenues seem promising for future research: First, our setting entails the possibility of

building joint models of coercion and voluntary contract in space which may foster our

understand-ing of the distribution of economic activity in the least developed countries that are often plagued

by armed conflict.17 Further research in this direction is very much encouraged. Second, it would be interesting to extend the model to allow for beneficial effects of inter-group interaction (e.g. with

trust-building `a la Rohner et al. 2013b). Third, we warmly encourage studies that build on the

current framework to study other phenomena than conflict where spatial heterogeneity of intensity

and local interactions play an important role. Migration, for example, is most attractive where

rich areas are close to poor areas. Other examples are topics in electoral politics, such as the study

of local campaigning in national elections, or public health policies such as anti-AIDS campaigns.

Finally, for specific cases with more fine-grained population and violence data it would be

possi-ble to study specific policies to promote or hinder movement such as barriers or the presence of

peacekeepers. A model which integrates such policies into the spatial decay matrix and tests their

impact inside the model would provide a completely new avenue of research into the effectiveness

of such measures.

17_{See Dal B´}_{o and Dal B´}_{o (2011) and Gonzalez et al. (2012) for existing joint models of coercion and voluntary}