DKV_2016_FinalOct26WithoutRed.pdf

(1)

Non-performance Pay and Relational Contracting:

Evidence from CEO Compensation

Jed DeVaro

∗

Jin-Hyuk Kim

†

Nick Vikander

‡

October 2016

Abstract

CEOs are routinely compensated for aspects of firm performance that are beyond their control. This is puzzling from an agency perspective, which assumes performance pay should be efficient. Working within an agency framework, we provide a rational for this seemingly inefficient feature of CEO compensation by invoking the idea of informal agreements, specifically the theory of relational contracting. We derive observable implications to distinguish relational from formal contracting, and using ExecuComp data, find that CEOs’ annual cash and equity incentive payments positively correlate with the cyclical component of sales and respond to measures of persistence as relational contracting theory predicts.

JEL Classification: D86, J33, M12

Keywords: relational contracts; CEO compensation; pay-for-luck; skimming view

(2)

1 Introduction

Weak ties between CEO pay and firm performance provoke outcries from shareholders and

the general public, particularly when firm performance is poor. But when firm performance

depends largely on factors that are beyond the CEO’s control, pay-for-performance amounts

to “pay-for-luck”. Moreover, empirical evidence suggests that such pay-for-luck regularly

occurs.1

The pay-for-luck phenomenon is at odds with the contracting view of executive

compen-sation, which predicts that luck should not affect incentive pay. The contracting view says

that CEOs should be offered efficient performance pay, for the purpose of reducing agency

problems between the CEO and shareholders. According to the Informativeness Principle,

only performance measures that provide the shareholders with information about the CEO’s

desired action should be used in the incentive contract (Shavell, 1979; Holmstr¨om, 1979),

whereas luck is unrelated to CEO actions and should therefore not affect compensation.

Why do firms pay CEOs for luck when the contracting view says they should not?

Bertrand and Mullainathan (2000, 2001) argue that CEOs influence how their own pay

is set. This “skimming view” of compensation is similar to Bebchuk and Fried’s (2003,

2004) “managerial power hypothesis”, which postulates that pay-for-luck is driven by a lack

of arm’s-length relationship between the CEO and the board. These arguments suggest

that observed pay-for-luck is inefficient and reflects the CEO’s ability to extract rents from

shareholders.

We offer an alternative explanation for pay-for-luck that does not rely on inefficiency and

that is consistent with the empirical evidence we present from ExecuComp and CompuStat

data. The novel element of our analysis is that we consider relational contracts, where the

firm retains the option to renege on paying a promised bonus, in contrast to formal contracts, 1_{For example, measuring luck as the portion of firm performance that can be predicted by exogenous price}

(3)

which are directly enforceable in court.2 _{We show that when luck is persistent, relational}

contracts can explain pay-for-luck in a manner consistent with the Informativeness Principle.3

Compensation is efficient, given the constraints imposed under relational contracting, in

particular that the firm can always renege on its promises.

The idea behind our mechanism is that today’s realization of luck often provides useful

information for predicting luck tomorrow, which in turn has implications for the future value

of the employment relationship. Such information is irrelevant under formal contracting,

in which the firm is contractually obligated to pay a promised bonus. But it is relevant

under relational contracting, when the firm will only pay the promised bonus if the future

value of the employment relationship is sufficiently high. Moreover, this difference between

formal and relational contracts is amplified when luck is more persistent.4 _{The greater the}

persistence of luck, the more informative today’s luck realization is about the likely state of

the employment relationship tomorrow, and the more today’s luck will influence CEO effort

and contract design under relational contracting.

We start by developing a theoretical framework for analyzing the effect of the “state”

(i.e., a persistent measure of luck) on performance pay under both formal and relational

contracting. Specifically, we consider a simple Markovian environment where the state can

take on two possible values (high or low). The CEO exerts effort to influence the probability 2_{Examples of formal contracts are piecewise-linear bonus contracts called 80/120 plans (Murphy, 1999)}

and severance payments received in the event of an involuntary separation. Unlike with formal contracting, under relational contracting the principal lacks the ability to commit to the terms of performance pay, which leaves open the possibility of reneging on a promised payment.

3_{Persistence in luck has not featured prominently in the literature on executive compensation; however,}

luck may be persistent within firms for many reasons. For instance, an initial success in advertising, word-of-mouth and innovation can have a lasting effect on a firm’s sales. Similarly, negative publicity, consumer lawsuits and natural disasters can undermine the firm’s reputation as well as sales while inadvertently boosting rival firms’ sales. These examples are at least partially firm specific and are mostly beyond the control of the CEO.

4_{There is empirical evidence that the measures of luck used in the literature are persistent. For example,}

(4)

of successful production, expecting that success will result in a larger bonus. The principal

is contractually obligated to pay any promised bonus under formal contracting, whereas

reneging is possible under relational contracting.

We show that under formal contracting, the principal offers the same bonus for success in

both states. In contrast, under relational contracting, when the discount factor is sufficiently

low, the principal offers a bonus that is smaller than under formal contracting and that is

conditional on the state: a larger bonus is paid in the high state than in the low state,

because a high state increases the principal’s credibility not to renege on his promise. Put

another way, optimal performance pay for the CEO can exhibit pay-for-luck if contracting

is relational. Since the optimal formal contract does not involve pay-for-luck, this result

provides a way to empirically detect relational contracting in CEO compensation.

Empirically, we find that a measure of the state is positively and robustly correlated

with the CEO’s cash and equity incentives, consistent with our theoretical predictions under

relational contracting. Borrowing from the literature on the propagation of business cycle

shocks, we construct our state proxy by applying the Hodrick-Prescott (HP) filter to the

time series of a firm’s sales to obtain the cyclical component of these sales. Applying the

filter to our sample of individual firms’ time-series data yields a firm-specific, autocorrelated

proxy for luck. We find that the effect of the state on CEO pay is stronger for firms with

a more persistent state variable. Hence, our results suggest that conditioning CEO pay on

(persistent) luck may be efficient. We also investigate the skimming view of compensation

using the corporate governance index that has been widely adopted since Gompers et al.

(2003), interacted with our state proxy. The skimming view suggests that pay-for-luck is

inefficient and driven by managerial power, and should therefore be more prevalent in poorly

governed firms. However, we find little evidence that CEO compensation is more positively

correlated with our measure of state when shareholder rights are weaker.5

5_{Cronqvist and Fahlenbrach (2013) also cast doubt on the skimming view, by showing that CEO contracts}

(5)

A key contribution to the pay-for-luck literature is Bertrand and Mullainathan (2001),

and our study differs from theirs in several respects. First, we provide a formal theoretical

explanation for pay-for-luck. Second, the persistence of luck is not recognized in their study

but is central to ours. Many measures of luck (including the oil price movements and

industry performance used in that study) are autocorrelated, which suggests a dynamic

setting in which today’s luck realizations convey useful information about the future value

of an employment relationship. Third, their proxies for luck are not firm specific.

Economy-wide shocks, such as swings in oil prices or exchange rates, or industry average performance,

can only capture the components of luck that vary at the country- or industry-level. But

firms experience variation in luck even within narrowly-defined industries, and the luck can

be firm-specific and persistent. Applying the HP filter to firm sales and using the cyclical

component provides a novel firm-specific empirical measure of the state, and we consider the

implications of varying the degree of persistence in the measure.6

Alternative explanations to ours for the efficiency of pay-for-luck have been proposed.

One is that pay-for-luck may be optimal if the value of CEOs’ outside options varies with their

firms’ performance, and re-contracting is costly, so that firms adjust CEO pay for retention

purposes (Oyer, 2004).7 _{To allow for this possibility, our theoretical analysis assumes that}

the agent’s outside option may vary with the state, and we show that such correlation can

indeed affect compensation. However, this effect operates entirely through the base salary

offered in each period and has no effect on performance pay. We show that even if outside

options vary, a positive correlation between the state (i.e., luck) and bonus payments can only

be optimal under relational contracting. Our approach also squares well with the argument

of Holmstr¨om (2005) that increased demand (or the CEO’s outside option) is not likely to

fully explain changes in CEO pay, and it is important to examine dynamic models where 6_{A potential limitation of virtually any firm-specific state measure (including ours) is that there is the}

potential for CEO effort to be embedded within it. To mitigate such concerns, we include firm accounting performance as a covariate in our regressions, which controls for the direct impact of CEO effort.

7_{Gabaix and Landier (2008) and Tervio (2008) explore related assignment models of managerial talent}

(6)

commitment problems and implicit incentives play a role.8

Although formal contracts are used in executive compensation, there are strong reasons

to expect that relational contracts are also important. In a cross-sectional study, Gillan et al.

(2009) argue that roughly half of the S&P 500 CEOs work without any explicit employment

contracts and that even explicit contracts often leave details about future compensation

lev-els to the discretion of the board. Moreover, an emerging body of empirical work provides

support for relational contracting, but this work has focused on inter-firm (supply)

rela-tionships rather than intra-firm (employment) relarela-tionships.9 Our study contributes to this

literature by exploring how relational contracts can affect CEO compensation, focusing on

the key role played by luck’s persistence, and highlighting a connection between relational

contracting and pay-for-luck.

Another factor suggesting the relevance of relational contracting is that even formal CEO

contracts can have relational elements. For example, in the case of an “80/120” bonus plan,

the board might be able to exercise discretion either in the performance metrics used or in

judgments about whether certain performance standards have been achieved. Murphy and

Oyer (2004) find that nearly two thirds of surveyed companies based executive incentives in

part on subjective assessments of individual performance and “alternatively, the boards of

directors may make discretionary adjustments.”10 _{Murphy and Jensen (2012) further}

artic-ulate the scope for discretion in executive compensation by noting that “sometimes these

shadow plans have little or nothing to do with the performance criteria specified in the

shareholder approved plans.”

We focus on CEOs because bonus payments are particularly important and prevalent for

executives. Our goal, theoretically and empirically, is to highlight how an analysis of the 8_{Prior work on dynamic CEO contracting and/or pay-for-luck includes Axelson and Baliga (2009),}

Gopalan et al. (2010), and Noe and Rebello (2011). For more details, see Edmans and Gabaix’s (2009) survey of recent optimal contracting approaches to CEO pay.

9_{See, e.g., McMillan and Woodruff (1999), Banerjee and Duflo (2000), Johnson et al. (2002), Gil (2013),}

Gil and Marion (2013), Zanarone (2013), Barron et al. (2015), Calzolari et al. (2015), Gil and Zanarone (2015), and Macchiavello and Morjaria (2015).

10_{This survey also documents how certain firms claim not to use discretion but nonetheless report having}

(7)

determinants of bonus payments can shed light on the presence of relational contracting,

so an occupational setting in which bonuses are widespread and an important component

of compensation is appealing. Focusing on a particular job title also eliminates much of

the heterogeneity in tasks and job assignments that would pose econometric problems in

broader samples of workers. Further, our theoretical model offers a new explanation for

pay-for-luck, which is a phenomenon the literature discusses in the context of CEOs. However,

our theoretical arguments should also be applicable to other worker groups where there is

discretion in compensation (e.g., bankers). Future empirical work based on non-executives

would be helpful to clarify the extent to which our results generalize.11

The study is organized as follows. Section 2 describes the theoretical model, and Section

3 presents its predictions and a discussion of their robustness. Section 4 describes the data

set construction, and Section 5 contains estimation results. Section 6 concludes. All proofs

are in the Appendix.

2 The Model

The purpose of the model is twofold. First, it offers an explanation for pay-for-luck that does

not rely on inefficiency. Second, it contrasts the observable implications of formal contracting

versus relational contracting, which sets the stage for an empirical investigation of whether

evidence of relational contracting can be found in executive compensation.

We consider a repeated relationship between a principal and an agent, both of whom are

risk neutral. Time has an infinite horizon with discrete periods indexed by t = 1,2, . . .. In

each period t, the agent chooses effort, which influences whether his work succeeds or fails.

Specifically, the agent’s contribution to output is xSfor a success andxF for a failure, where

xS > xF >0. Output in each period, xt, depends on the agent’s contribution and also on a 11_{Our mechanism is based on contractual incompleteness, and such incompleteness is relatively}

(8)

stochastic component, ∆t, which we refer to as the state.12

The period-t state, ∆t, is a measure of luck, capturing factors that influence output but

that are beyond the agent’s control. The state can take either of two values and evolves

according to a Markov process: ∆t ∈ {−∆,∆}, where 0 < ∆ < xF, and P(∆t+1 = ∆t) =

θ ∈(1/2,1) is the stationary transition probability. We say that the period-t state is high if

∆t= ∆ and that it is low if ∆t =−∆. Both states are a priori equally likely, i.e., P(∆1 =

∆) =P(∆1 =−∆) = 1/2. Thus, the per-period (total) output is xt∈ {xF + ∆t, xS+ ∆t}.

The fact that θ exceeds 1/2 implies that luck is persistent.

In each period t, the agent chooses effort et ∈ [0,1] subject to moral hazard. The cost

of effort is C(et), where C(0) = 0, C0(0) = 0, C0(e) ≥ 0, C00(e) > 0, C000(e) ≥ 0, and

lime→1C0(e) = ∞. A technology maps the agent’s effort into the probability of success,

for which we assume first-order stochastic dominance. That is, the probability of success,

p(et)≡p(xt =xS|et), is increasing in effort, where p(0) = 0,p0(e)>0, p00(e)<0, p000(e)≤0.

The principal observes whether the agent’s work succeeds or fails. For now, we leave open

the possibility that the principal also directly observes the agent’s effort choice. In Section 3

we describe how the assumption about whether or not the principal observes agent effort is

unimportant for our analysis, in the sense that it has no impact on the qualitative, observable

implications we derive for how relational contracting differs from formal contracting.

At the beginning of the game, the principal offers the agent a contract, (S, B), where S

is a set of base salaries, and B is a set of bonuses. For each period t, the contract specifies a

salary s∈S to be paid at the start of that period, and a bonus b∈B to be paid at the end

of that period. The principal can commit to bonus payments when output xt is verifiable

(formal contracting) but not when it is unverifiable.13 _{In the latter case, the principal must}

rely on relational contracting, where the incentive to pay the agent a promised bonus depends 12_{The canonical models of stationary relational contracts are well known (see, e.g., Bull, 1987; MacLeod}

and Malcomson, 1989; Baker et al., 1994; Levin, 2003; Kvaløy and Olsen, 2009; Malcomson, 2012, 2015). The innovation in our framework is the presence of a stochastic state that influences profits, which allows us to address the issue of pay-for-luck.

(9)

on the future value of continuing the employment relationship. Both the principal and agent

discount future payoffs with factor δ∈(0,1).

At the start of every period t, the state is publicly revealed, the principal offers the base

salary specified under S, and the agent chooses whether to participate in that period. If the

agent does not participate, then he takes the outside option, which yields payoff vH ≥ 0 if

the state is high andvL≥0 if the state is low, and the principal earns a payoff of zero. If the

agent participates in periodt, then he receives the base salary and chooses effortet. Output

xt is then realized and observed by both parties. The principal makes the bonus payment

specified under B, or possibly reneges on this payment under relational contracting, and

the period ends. The agent then chooses whether or not to continue the relationship. The

timing of play within each period is summarized in Figure 1 for the case where the agent

participates.

Figure 1: Timing of play within each period

State Δt is

realized

Agent chooses effort et

Output xt is

realized

Principal pays bonus or reneges Start of

period t

Start of period t+1 Agent continues

or ends the relationship Principal

offers base salary

We focus on stationary perfect public equilibria where, on the equilibrium path, period-t

payments and actions depend only on (i) the period-t state, ∆t, and (ii) either period-t

effort, et, if effort is observable, or period-t output, xt, if it is not.14 We can therefore

write S = (sH, sL), denoting the base salary in the high and the low state. If effort is

observable, we can write bonus schedules B(e|H) and B(e|L), which map effort e ∈ [0,1]

to payments in the high and low state respectively. If effort is unobservable, we can write 14_{Note that period-}_t _{output is a noisy signal of period-}_t _{effort. Hence, the principal will not condition}

(10)

B = (bSH, bF H, bSL, bF L), where bSH and bF H denote the bonuses for success and failure,

respectively, when the state is high (andbSL andbF L are defined analogously when the state

is low). Without serious loss, we assume limited liability: S ≥ 0, B ≥ 0, B(e|I) ≥ 0 for

all e∈[0,1],I ∈ {H, L}. We also assume trigger strategies specifying the harshest credible

punishment if the principal deviates from the equilibrium by reneging on a bonus specified

in the relational contract; that is, the agent then ends the employment relationship, takes

his outside option, and the principal earns zero. If the relationship continues to periodt+ 1,

then the players face the same choices as in period t.

We adopt the convention that all compensation not related to agent performance is

cap-tured in the base salary, rather than the bonuses. With observable effort, this amounts to

im-posing mine∈[0,1]B(e|I) = 0,I ∈ {H, L}. Pay-for-luck is then equivalent toB(e|H)6=B(e|L)

for at least some value e ∈ [0,1]; that is, the bonus schedule depends on the realized

state. With unobservable effort, the convention amounts to imposing min(bSH, bF H) =

min(bSL, bF L) = 0. Since offering a higher bonus for failure than for success can never

induce positive effort, it follows that the principal will set bF H =bF L = 0, so we can write

B = (bSH,0, bSL,0) ≡ (bH, bL).15 Pay-for-luck is then equivalent to bH 6= bL; that is, the

bonus for success depends on the realized state. Finally, we assume throughout the analysis

that the agent’s outside options, vH ≥0 and vL ≥0, are sufficiently small for the principal

to always induce participation in each period under the optimal contract; that is, in any

given period, the compensation required to induce the agent to work does not exceed the

expected output produced by working.16

15_{This convention will pin down the optimal bonuses under formal contracting when the agent’s}

partici-pation constraint binds, so where the agent requires a positive base salary. Under relational contracting, the principal would have a strict incentive to abide by this convention to reduce his own temptation to renege on paying the bonuses.

16_{This assumption rules out the possibility that the bonus varies trivially with the state simply because}

(11)

3 Theoretical Analysis

3.1 Optimal contracts

We derive the optimal contracts under formal and relational contracting. These contracts

are optimal from the principal’s perspective, in the sense of maximizing expected profits.

Our interest is in generating testable predictions, to ascertain whether there is evidence

of relational contracting in CEO compensation. The testable predictions will follow from

Propositions 1 and 2, derived later in this section. These results show that under formal

con-tracting, bonus payments are independent of the state, whereas under relational concon-tracting,

bonuses may depend on the state and, moreover, the sensitivity of bonuses to the state is

increasing in state persistence.

Deriving these formal results requires taking a stand on whether the principal can observe

agent effort. However, it turns out that Propositions 1 and 2 hold regardless of whether or

not effort is observable. There is therefore no loss in assuming, as we do, that the principal

does not observe effort (see Section 3.2 on robustness for further discussion). We also view

unobservable effort as a natural assumption in the context of CEOs. It implies that agents

will be rewarded for good outcomes more than for bad ones, and that agents acquire strictly

positive surplus, where both implications appear reasonable for CEOs (see Section 3.2 for

further discussion).

We first provide expressions for the expected payoffs of the principal and the agent.

Henceforth, let eH (eL) denote the agent’s optimal effort choice in the high (low) state,

conditional on participation. Suppose that the period-t state is high (i.e., ∆t = ∆). Then

given contract (S, B) and effort choice eH, the principal’s expected period-t profits are

(12)

and the agent’s expected payoff is

uH((S, B), eH) = p(eH)bH +sH −C(eH). (2)

Similarly, the principal’s expected profits when the state is low are

πL((S, B), eL) = p(eL)(xS −∆−bL) + (1−p(eL))(xF −∆)−sL, (3)

and the agent’s expected payoff is

uL((S, B), eL) =p(eL)bL+sL−C(eL). (4)

For any t0 ≥ t, define Pt0_,t ≡ P(∆_t0 = ∆_t), which is the probability that the state in

period t0 is the same as the state in period t. Given that P(∆t+1 = ∆t) = θ, we can define

Pt0_,t recursively by

Pt0_,t =θP_t0₋₁_,t+ (1−θ)(1−P_t0₋₁_,t), (5)

fort0 ≥t+ 1, withPt,t= 1. Givenθ >1/2, the correlation between states in any two periods

is positive and increasing in θ.17

The present discounted value of expected profits as of a period when the state is high is

ΠH((S, B), eH, eL) =

∞

X

t=1

δt−1

Pt,1πH((S, B), eH) + (1−Pt,1)πL((S, B), eL)

, (6)

with πH((S, B), eH) given by (1) and πL((S, B), eL) given by (3). Similarly, the present

discounted value of expected profits as of a period when the state is low is

ΠL((S, B), eH, eL) =

∞

X

t=1

δt−1

Pt,1πL((S, B), eL) + (1−Pt,1)πH((S, B), eH)

. (7)

(13)

Given a stationary contract and unobservable effort, the agent’s optimal effort choice in

any period t depends only on the incentives offered in that period. Both states are a priori

equally likely, so the principal’s program under formal contracting is

max (S,B) Π =

1

2ΠH((S, B), eH, eL) + 1

2ΠL((S, B), eH, eL), subject to (8)

eH = arg max e∈[0,1]

uH((S, B), e), (9)

eL= arg max e∈[0,1]

uL((S, B), e), (10)

(S, B) = (sH, sL, bH, bL)≥0. (11)

Notice from (2) and (9), and from (4) and (10), that the optimal effort level in a given

state depends only on the bonus offered for success in that state. We can therefore write

eH = e(bH) and eL =e(bL). Recall our assumption that vH and vL are relatively small, so

the principal finds it optimal to induce the agent’s participation in each period, regardless

of the state. That is, the principal will offer a contract that meets the agent’s participation

constraints,

uH((S, B), eH)≥vH, (12)

uL((S, B), eH)≥vL. (13)

Under relational contracting, the principal must have an incentive to pay each bonus

specified under B as promised. Given trigger strategies, the optimal way for the principal

to renege on a bonus is to withhold it completely, so the benefit of reneging equals the

size of the bonus. The cost of reneging is the expected future profits lost when the agent

(14)

contracting includes additional, dynamic enforcement constraints,

bH ≤δ

θΠH(sH, sL, bH, bL, eH, eL) + (1−θ)ΠL(sH, sL, bH, bL, eH, eL)

, (14)

bL≤δ

θΠL(sH, sL, bH, bL, eH, eL) + (1−θ)ΠH(sH, sL, bH, bL, eH, eL)

, (15)

where both the cost and benefit from reneging may depend on the current state.

The following lemma shows that the optimal effort level is increasing in the size of the

bonus but at a decreasing rate. As a result, expected profits in any given period will be a

concave function of the promised bonus.

Lemma 1 Let I ∈ {H, L}, and consider the agent’s optimal effort e(bI) in state I,

condi-tional on participating. Then 0 ≤ e(bI) < 1, where the inequality is strict for all bI > 0.

Moreover, e(bI) is unique, with e0(bI)>0 and e00(bI)<0.

We now state our first main result, where superscripts f and r denote formal and

rela-tional contracting, respectively.

Proposition 1 If output is verifiable, the principal will choose formal contracting and offer

the same bonus in both states: bf _≡_bf

H =b

f

L >0. If output is nonverifiable so that contracting

is relational, and the agent’s outside options vH and vL are sufficiently small, then the

principal will offer a different bonus in each state when the discount factor is sufficiently

low: for any θ ∈ (1/2,1), there exists δ0 ∈ (0,1) such that 0 < brL < brH ≤ bf for all

δ∈[0, δ0) and brH =bLr =bf for all δ∈[δ0,1].

Proposition 1 says that under relational contracting, the agent may be paid for luck, in

the sense that incentive payments can be positively correlated with the state. In contrast,

pay-for-luck never occurs under formal contracting. The intuition behind the result is as

follows. A large bonus will generate high effort, which increases expected output, but it will

(15)

takes into account these two opposing effects by setting the optimal bonus that gives profits

of zero on the margin. Marginal profits are independent of the state, since the state does

not affect the relationship between bonus and effort or the relationship between effort and

success. It follows that the principal offers the same bonus, bf_{, in both states under formal}

contracting.

The difference under relational contracting is that the principal faces a commitment

prob-lem. He would like to offer the same profit-maximizing bonus as under formal contracting,

but he needs an incentive to actually pay this bonus when the agent succeeds. Put another

way, the principal cannot credibly offer a bonus that violates his dynamic enforcement

con-straint. The bonus bf _{will not be credible if it exceeds the discounted value of expected}

future profits, which is what occurs if the discount factor is sufficiently low. With a low

discount factor, the principal can credibly promise a larger bonus in the high state than in

the low state, since the high state offers a looser dynamic enforcement constraint; intuitively,

this is because profits are higher in the high state and states are positively correlated over

time. Total profits are lower than under formal contracting, since marginal profits are strictly

positive at the constrained optimum.18

As Proposition 1 states, correlation between the state and the optimal bonus is a

dis-tinguishing feature of relational contracting compared to formal contracting. This is not

necessarily true for correlation between the state and the optimal base salary. The proof of

Proposition 1 shows that the optimal base salary under formal contracting depends on the

agent’s outside option whenever the participation constraint binds. It follows that a

corre-lation between outside options and state can generate correcorre-lation between state and base

salary. Thus, with formal contracting, the principal may well adjust the base salary over

time for retention purposes, to take into account changes in outside options, but it would 18_{Notice that neither contract specified in Proposition 1 is first best. This follows from our assumption}

(16)

always set the same bonus bf _>_{0. This means that variations in the agent’s outside option}

cannot explain state-contingent bonuses in our framework.

In the empirical section, we will describe the measures of the components of executive

compensation and our choice of state proxy. The bonus in our theoretical model is any

compensation given after output is realized. Hence, its empirical counterpart can comprise

either cash bonuses or equity grants but not fixed salary. There are differences between cash

bonuses and equity grants; for instance, an equity award need not be immediately cashed

out. However, the data capture the grant-date value of stocks and options, and as long as

the executive perceives the grant-date value as the reward for that period’s outcome, then

our model should apply to equity as well as to cash bonuses.

The following proposition describes how the optimal bonus depends on the degree of

persistence of the state over time.

Proposition 2 Consider the optimal bonuses bf_, _br

H, and brL given by Proposition 1. Then

bf _{is independent of} _θ_{, whereas} _br

H −brL is increasing in θ whenever bLr < bf, brH < bf.

Proposition 2 states that the sensitivity of bonus payments to the state with relational

contracting should be higher for firms facing a more persistent state, whereas under formal

contracting the optimal bonus remains the same regardless of the state’s persistence (i.e.,

there is no “pay for luck”). This result accords well with the Informativeness Principle.

When persistence is low, the current state provides little information about future profits.

The principal then faces similar credibility issues across states and offers similar bonuses.

As the state becomes more persistent, the difference in expected future profits between the

high and low states increases. This persistence differentially affects the principal’s credibility

across states and leads to increasingly different bonuses being offered.

The preceding results have the following testable implications. If the state has a strong

positive correlation over time, then bonus payments under relational contracting are more

(17)

employment relationship. This means that the difference between the bonus in the high state

and the low state is large. In contrast, in the extreme case where the state is independent

over time, the bonus is insensitive to the state, as it would be under formal contracting.

Thus, persistence of the state is key for empirically detecting relational contracting.

3.2 Alternative Modeling Assumptions

We now briefly describe how our theoretical results on relational versus formal contracting

are robust to relaxing various assumptions. Specifically, we consider the impact of (i) agent

effort being observable; (ii) the presence of more than two states and non-Markov state

transitions; and (iii) agent risk aversion. The overall conclusion in all these cases is that our

main qualitative results continue to hold.19

First, we have derived results showing that Propositions 1 and 2 continue to hold if effort

is observable. That is, the optimal bonus is still independent of the state under formal

contracting but not under relational contracting, and the sensitivity of bonus payments to

the state under relational contracting is increasing in state persistence. Observability of effort

does have some impact on the structure of compensation, as it affects how much information

the principal possesses. When effort is observable, the principal offers the agent a positive

bonus, equal to the cost of effort, for the desired effort level, and offers zero bonus for all

other choices of effort. Doing so allows the principal to capture all surplus under both formal

and relational contracting. But it has no impact on the distinguishing features of relational

contracting versus formal contracting, i.e., the correlation between state and bonus size, and

the relationship between bonus payments and persistence.

Second, our assumptions of a binary state variable and of Markov state transitions help

keep the analysis tractable but are not important for the economic mechanism driving our

results. The positive relationship between state and bonus size under relational

(18)

Proposition 1, where we impose the auxiliary assumptions of binary effort and zero outside

options but where we also allow forN ≥2 states and non-Markov transitions.20 _{This result}

shows a positive relationship between state and bonus under relational contracting, but not

under formal contracting, regardless of whether we relax only the assumption of binary state,

only the assumption of Markov transitions, or both assumptions. The crucial feature about

state transitions is not their (non-)Markov character but rather that there be some notion

of persistence. That is, for any particular threshold value of the state, a higher state today

makes it more likely that the state exceeds that particular threshold in the future, captured

in our original modeling framework by the assumption θ >1/2.

Third, another question is whether Proposition 1, which shows that bonus payments are

both smaller and more variable under relational contracting compared to formal contracting,

continues to hold under risk aversion. A risk-averse agent would dislike this variability in

bonuses payments under relational contracting. All else being equal, it might seem plausible

that risk aversion would push up the size of bonus payments under relational contracting

above their level under formal contracting, to compensate the agent for the higher variability.

It turns out that this is not the case. In fact, Proposition 1 continues to hold with agent

risk aversion, so relational contracting continues to yield both smaller and more variable

bonus payments. The economic intuition for this result is that the dynamic enforcement

constraint under relational contracting pushes the principal to implement lower effort than

under formal contracting. This is the case if the agent is risk averse, just as it is if the agent

is risk neutral. Since the agent exerts lower effort under relational contracting, he requires

lower compensation, and hence receives smaller bonuses.21

20_{The auxiliary assumptions allow us to maintain tractability in a setting where the state variable can}

assume arbitrarily many values and where transitions may depend on the entire history of earlier state realizations.

21_{Moreover, in our setting, the principal has no interest in offering high compensation in both states as a}

(19)

4 Data and Measures

We extract CEO compensation data from the ExecuComp database from 1992 to 2014.22

Specifically, our sample includes all individuals who have (or have had) the CEO title from the

historical S&P 1500 Index constituent firms, which covers 90% of the market capitalization

in the U.S. stock market. Compensation items include salary, cash bonus, equity-based

payments (i.e., stock and option awards) as well as total compensation (see Table 1 for

summary statistics and data definitions). All compensation variables are deflated to 2009

dollars, and matched to the firm-level financial data to be described shortly. We also include

the CEO’s age, tenure (measured in years from the date the individual became CEO), and

an indicator for the CEO’s first year, because there might be signing bonuses when a CEO

first joins a firm (in which case first-year compensation would be overestimated relative to

what the CEO would normally be paid).

The company financial data are drawn from the CompuStat Annual database from 1987

to 2014. These include a company’s sales, deflated to 2009 dollars, which we use to extract

a firm-specific state proxy. The reason for the lead time prior to 1992 is to increase the data

length for the filtering mechanism to be described shortly.23 _{Consistent with Bertrand and}

Mullainathan (2001), we include the logarithm of shareholder wealth (Mkval) and income

before extraordinary items divided by total assets (IB/AT) to control for firm size and CEO

effort, respectively. In addition, we include market-to-book (MtB) and the leverage ratio

(Lev) to control for growth potential and the free cash flow effect, respectively. We also

include the Gompers–Ishii–Metrick index of corporate governance (G index), where a higher

score indicates more restrictions on shareholder rights or a greater number of anti-takeover

measures, to test the skimming view of pay-for-luck.24

22_{Comprehensive data on executive compensation in publicly-traded companies are available from 1992,}

when the SEC revised its proxy and reporting rules, which required greater disclosure of top executive compensation.

23_{We drop firms with fewer than ten observations (}_N _≈ _{1800), to achieve reliable filtering results, and}

firms with a gap in the sales data (N ≈1300) because gaps are disallowed by the filtering method we use to construct a measure of the state.

(20)

The idea behind the construction of our measure of the state variable borrows from the

theory of business cycles, which is concerned with extracting and understanding the cyclical

component of a country’s GDP fluctuations in time series data. The point is to separate

the frequency of business cycles (which typically range from one to several years) from the

nonstationary, long-run trend or high-frequency noise. Although a company’s output is

much smaller than (and different from) a country’s, these time series filtering methods can

be fruitfully applied to companies, and a key point is that there is variation in the nature of

cycles across companies within the same country. For example, even if a company’s business

cycle wavelength is similar in duration to a country’s business cycle, each firm can have a

different amplitude in the time series as well as a different timing of crest and trough. One

considerable advantage of this approach is that the filtering can be applied to each firm’s

time series data, and thus persistent luck need not be common for all firms in a sector or

economy, which allows us to empirically address across-firm comparative statics results such

as Proposition 2.

Specifically, to construct a measure of the state proxy, we apply the Hodrick-Prescott

(HP) filter to the time series of a firm’s annual sales to obtain its cyclical component.25

Numerous studies have shown that the HP filter does a good job of isolating the business

cycle component from long-run trends (e.g., Kydland and Prescott, 1990; Backus and Kehoe,

1992). In considering the relative variance between the cyclical and trend components, we

set the HP smoothing parameter equal to 6.25, following the recent treatment by Ravn and

Uhlig (2002), but also to 100 in a robustness check, following the original suggestion by

Backus and Kehoe (1992). Furthermore, to ensure the robustness of our results, we also

apply a new band-pass filter due to Christiano and Fitzgerald (2003), which is a natural

when we use this variable in our regressions. The G index was published in July 1993, July 1995, February 1998, November 1999, January 2002, January 2004, and January 2006. We imputed the 1992 value with the 1993 value, 1994 with 1995, 1996 and 1997 with 1998, 1999 with 2000, 2001 with 2002, 2003 with 2004, and 2005 with 2006.

25_{The HP filter smooths a data series} _y _{= (}_y

1, ..., yT)0 by minimizing, over all τ ∈ RT, the function

PT

t=1(yt−τt)2+λP T−1

t=2 (τt+1−2τt+τt−1)2, whereλis a smoothing parameter that for our annual data is

(21)

extension of the HP filter.

Figure 2(a) illustrates the HP filtering result (with a smoothing parameter of 6.25) for

the first firm in our sample (i.e., AAR Corporation). The solid line is the original (deflated)

sales data (Sale), and the dotted line nearby is the long-run trend (Sale tr). The cyclical

component (Sale cy) is the deviation of the original data from the trend and is represented

by the dashed line around the horizontal axis. That is, the HP filter leaves the mean of the

original data and the long-run trends almost the same and creates the cyclical component,

which is our state proxy, around mean zero with a sizeable standard deviation. In Table 1,

the HP-filtered state proxy is denoted byStateRU when the HP smoothing parameter is 6.25,

and by StateBK when it is 100. The band-pass filtered state proxy is denoted by StateCF.

Figure 2(b) shows the three different state proxies for AAR Corp. The series tend to comove

but they nonetheless also exhibit some distinct variation.

Finally, we measure the degree of state persistence in two ways. The first measure

is the correlation coefficient between the firm’s state proxy and its once-lagged variable.

Table 1 shows that this correlation (Corr) is positive in the three measurement cases. The

correlation coefficient does not explicitly consider the structure of state dependence assumed

in our model. Hence, we construct an alternative measure of state persistence by fitting

the AR(1) time-series model to each firm’s state proxy and use the coefficient on the

first-lagged dependent variable as another measure of state persistence. Table 1 shows that the

autoregressive coefficient (AR1) is positive, which ensures that the autocorrelation at any

lag is also positive. We estimate this regression individually for every firm, so bothCorr and

AR1 are constant within firms and vary across firms. Thus, when empirically evaluating the

comparative statics result in Proposition 2, our interpretation is that varying θ represents

(22)

Figure 2

(a) HP filtering of a firm’s sales

(b) Comparison of three filters

5 Empirical Evidence

5.1 Main results

Our goal in the empirical analysis is to explain the variation in CEO “bonus” compensation

over time and across firms as a function of the state proxy and the persistence of the state

proxy, to evaluate whether the evidence is consistent with the optimal relational contracting

results in Propositions 1 and 2.26 _{We then test an implication of the skimming view, which} 26_{Our test for relational contracting, which is based on the empirical relationship between the bonus and}

(23)

implies that pay-for-luck should be stronger in the more poorly governed firms. Because the

G index (a measure of corporate governance) varies relatively little over time, both

predic-tions on state persistence and corporate governance, which are the key effects of interests,

are essentially based on across-firm variation.

Since our theoretical model does not distinguish between cash and equity-based bonuses,

we use both CEO cash bonuses (Bonus) and stock and option awards (Equity) as primary

dependent variables. We also use CEO total compensation (which includes, e.g., deferred

compensation), net of base salary, as a dependent variable (Non Salary=Total−Salary),

which we take as the total measure of CEO compensation that is subject to the board’s

discretion. Our reason for focusing on the above measures of compensation, rather than on

base salary, is that our theoretical model does not provide a clear distinguishing test between

relational and formal contracting when salary is the dependent variable.27

Firstly, Proposition 1 posits that the state has positive direct effects on CEO bonus

compensation under relational contracting. Using the empirical proxies discussed above, we

estimate the following regression to test this prediction:

Yit =β0+β1Stateit+Xitβ+φi+ξt+εit,

where for each CEO-firm combination,Yitis CEOi’s year-tsalary, cash bonus, equity awards,

and total compensation net of salary; Stateit is the cyclical component of firm sales; Xit

contains the aforementioned firm and CEO characteristics and also industry-specific (2-digit

NAICS) linear time trends; φi is a CEO-firm fixed effect; ξt is a year effect; and εit is a

disturbance term that may be correlated within CEO-firm pairs.28

absence of formal contracts does not necessarily imply the existence of relational contracts.

27_{See the discussion following Proposition 1, describing how under formal contracting, the state has no}

impact on bonus size but may have an impact on base salary.

28_{Because our unit of analysis is a CEO-firm pair, and there is only one CEO per firm per year, the subscript} irefers to both the CEO and the firm. Therefore, there are no CEO (or firm) fixed effects identified separately from the CEO-firm fixed effect,φi. The linear trend terms contained inXitareP

22

s=1Indsi×T imet, where Indsi is an indicator for the industry of firmi, andT imetis a linear index measured in increments of single

(24)

Specifically, Proposition 1 predicts β1 = 0 under formal contracting, and β1 > 0 under

relational contracting when the dependent variable is Bonus, Equity, and Non Salary, if at

least some firms in the sample have a sufficiently low discount factor. As alluded to above, our

model does not have an unambiguous prediction for the sign ofβ1under formal and relational

contracting when the dependent variable is Salary, but we also present these regression

results for the sake of completeness. Estimation results are presented in Table 2, which

reveal that β1 is positive and significant in all columns, except in the first column (Salary),

and in all panels (using different filtering). This is consistent with our basic prediction on

relational contracting. Each measure of CEO “bonus” payments (i.e., cash bonus, equity

awards and total net of salary) is, on average, positively correlated with the firm-specific

state proxy.

Secondly, if the data-generating process is characterized at least in part by relational

contracting, then Proposition 2 posits that firms with a more persistent state should be

driving this result. We test this prediction by using the following interactive specification:

Yit =β0+β1Stateit+β2(Stateit×Corrit) +Xitβ+φi+ξt+εit.

Results from this interactive model are reported in Table 3. The analogous specification

using AR1it, instead of Corrit, as a measure of state persistence is estimated and also

reported in Table 4. In columns (2)-(4), and in all panels (a)-(c), across the two tables,

the coefficient estimates on the interaction term, β2, is positive and statistically significant.

Thus, the results appear supportive of the hypothesis that relational contracting plays a role

in generating the “pay-for-luck” phenomenon in CEO compensation.

The above results are qualitatively and quantitatively robust to the inclusion and

exclu-sion of industry-specific time trends. We also used a log specification where theYit variables

are in logs, and where sales in logs are subjected to the HP filter to derive the state proxy.

(25)

sample sizes are roughly two thirds of those reported here, due to zero or negative entries

and the gaps in the time series data that such entries create for the filtering.

5.2 Alternative explanations for pay-for-luck

We now briefly discuss three possible alternative explanations for pay-for-luck and relate

them to our empirical findings. One alternative focuses on changes to CEO outside options.

This view, described in Section 1, is that firms may compensate CEOs for factors beyond

their control not for incentive reasons but rather for retention purposes. If a CEO’s outside

option varies with the state, then compensation might also need to vary with the state so

as to meet the CEO’s participation constraint. The theoretical analysis in Section 3 shows

that this mechanism may generate correlation between the state and the optimal base salary,

but it cannot generate correlation between the state and the optimal bonus under formal

contracting. Given our empirical results, this suggests that varying outside options is unlikely

to explain pay-for-luck in performance pay.

Another alternative explanation relates to liquidity constraints. The idea here is that a

liquidity-constrained firm might use part of any unexpected cash flow (i.e., due to “luck”)

to award CEO bonuses. However, if pay-for-luck were due only to liquidity constraints, then

the impact of the state on CEO compensation should not depend on the state’s persistence;

all that would matter would be current unexpected cash flow. In addition, our empirical

specifications include company financials such as leverage, which plays an important role in

the free cash flow and moral hazard theory (Jensen, 1986), but our results still show that

persistence matters. While these results do not rule out a possible link between liquidity

constraints and CEO compensation, they suggest that such constraints will not be sufficient

to explain the pay-for-luck observed in our data.

We now turn to the skimming view of executive compensation, which concerns the

(26)

forms of pay. We use the G index to estimate the following interactive specification:

Yit=β0+β1Stateit+β2(Stateit×G indexit) +β3G indexit+Xitβ+φi+ξt+εit.

The coefficient of interest is β2. According to the skimming view, β2 is expected to be

positive, because the higher the G index the weaker are shareholder rights. Table 5 displays

the results, where there is no evidence that β2 is positive and significant. If anything, in

some specifications in panel (b), the coefficient estimates on β2 are negative and significant.

Hence, these results are inconsistent with the hypothesis that managerial entrenchment leads

to the observed pay-for-luck phenomenon.

6 Conclusion

We have presented a theoretical contracting model in which formal and relational contracts

have different implications for how bonus payments respond to persistent “luck.” Drawing

on a large sample of publicly-traded companies, and using empirical measures of the

firm-specific state and its persistence, we found evidence consistent with the relational contracting

view of CEO bonus compensation. A reason why firms seem to reward CEOs for luck is that

the expected future value of the employment relationship is larger in good states of the world

than in bad ones, so that the firm’s credibility to pay bonuses (as well as the CEO’s incentive

to exert effort) is higher in good states. Thus, pay-for-luck can be rationalized within the

standard agency framework. In addition to offering an explanation for pay-for-luck that

does not rely on inefficiency, we have provided empirical evidence consistent with the use of

relational contracts in executive compensation.

One policy implication concerns the efficiency losses caused by caps on executive pay.

During the global financial crisis of 2008, when failing firms were getting public bailouts and

(27)

of 2009 imposed caps on CEO pay.29 _{Our analysis implies that under relational contracting,}

if a cap on executive bonuses is imposed that binds in some states of the world, then the

distortion the cap causes on pay contracts should vary with state persistence. To the extent

that persistence differs across industries and production environments, this has implications

for how the damage (in terms of distortions in pay contracts) varies across industries. The

caps may be more damaging in industries/settings where the persistence is high, and

pay-for-luck is relatively important, than in settings where the opposite is true. Thus, our analysis

offers an explanation for why the distortions of pay caps may vary across industries and

suggests in which settings these regulations can be expected to have the most bite.

Appendix

Proof of Lemma 1. We prove the case for I = H, where the proof is entirely analogous

for I = L. From (2), the optimal effort eH = arg max e∈[0,1]

uH(bH, e) is uniquely defined by the

first-order condition

p0(eH)bH −C0(eH) = 0, (16)

with the second-order condition, p00(eH)bH −C00(eH) < 0, is satisfied by C00(e) > 0 and

p00(e) <0. Neither condition depends on ∆, meaning that effort only depends indirectly on

the state, through the bonus offered in that state.

IfbH = 0, then (16) reduces toC0(eH) = 0, so thatC0(0) = 0 impliese(0) = 0. If instead

bH >0, then (16) will be violated at eH = 0, since p0(0) > 0 and C0(0) = 0. (16) will also

be violated at eH = 1, since p0(1) is finite and lime→1C0(e) =∞. It follows that eH ∈[0,1)

with eH ∈(0,1) for bH >0.

Implicitly differentiating (16) with respect to bH yields

(28)

where we drop the arguments for p0, p00 and C00. Rearranging gives

e0(bH) =

p0 C00₋_p00_b

H .

The second-order condition and p0 >0 then imply e0(bH)>0. Now implicitly differentiating

(17) with respect to bH yields

p000(e0(bH))2bH +p00e00(bH)bH + 2p00e0(bH)−C000(e0H(bH))2−C00e00(bH) = 0,

and rearranging gives

e00(bH) =

(p000bH −C000)(e0(bH))2+ 2p00e0(bH) C00₋_p00_b

H

.

The numerator is strictly negative by p000 ≤ 0, C000 ≥ 0, p00 < 0 and e0(bH) > 0. The

denominator is strictly positive by the second-order condition. It follows thate00(bH)<0.

Proof of Proposition 1. Relational contracting only differs from formal contracting

through additional constraints (14) and (15). It follows immediately that the principal will

use formal contracting whenever it is feasible, so whenever output is verifiable.

Suppose for now that the optimal contract always induces the agent to participate,

re-gardless of the realization of the state. By (6) and (7), Π = 1₂ΠH + 1₂ΠL is increasing in πH

and πL, where specifically

ΠH(sH, sL, bH, bL) =

∞

X

t=1

δt−1Pt,1

πH(sH, bH) +

∞

X

t=1

δt−1(1−Pt,1)

πL(sL, bL), (18)

and

ΠL(sH, sL, bH, bL) =

∞

X

t=1

δt−1Pt,1

πL(sL, bL) +

∞

X

t=1

δt−1(1−Pt,1)

(29)

Write (1) as

πH(sH, bH) = p(e(bH))(xS−xF −bH)−sH +xF + ∆, (20)

and write (3) as

πL(sL, bL) =p(e(bL))(xS−xF −bL)−sL+xF −∆. (21)

The first- and second-order conditions for profits with respect to bonus are independent

of base salary and are the same in both states. Therefore, we write these conditions as

π0(b) = 0, π00(b)<0, with

p0(e)e0(b)(xS−xF −b)−p(e) = 0, (22)

p00(e)e02+p0(e)e00(b)

(xS−xF −b)−2p0(e)e0(b)<0.

The first-order condition is violated at b = 0, where π0(0) > 0, since e(0) = 0 and

e0(0) >0 by Lemma 1, and since p(0) = 0 andp0(0) >0. It is also violated for b≥xS−xF,

whereπ0(b)<0, since e(b)>0 ande0(b)>0 by Lemma 1, and since p(e)>0 andp0(e)>0.

The second-order condition is satisfied for all b < xS−xF, since e00(b)<0 by Lemma 1,

and sincep00(e)<0. It follows thatbf ≡bf_H =bf_L= arg max b≥0

πH(sH, b) = arg max b≥0

πL(sL, b)∈

(0, xS −xF) is uniquely defined by (22).

The optimal bonus bf is independent of the state and the base salary. Thus, the only

reason for the principal to offer a positive base salary in either state is to induce agent

participation. This means that the optimal contract under formal contracting isS∗ = (0,0)

and B∗ = (bf_{, b}f_{), if this contract satisfies participation constraints (12) and (13). More}

generally the optimal contract is S= (max(vH −u(S∗, B∗),0), max(vL−u(S∗, B∗),0)) and

B = (bf_{, b}f_{). That is, the principal only offers a positive base salary} _s

(30)

the participation constraint bind.

The principal will induce participation in both states if doing so yields non-negative

profits under the optimal contract: πH(sH = 0, bf) −max(vH − u(S∗, B∗),0) ≥ 0 and

πL(sL = 0, bf)−max(vL−u(S∗, B∗),0)≥0, with πH(sH = 0, bf) given by (20) and πL(sL=

0, bf_{) given by (21). These inequalities reduce to} _π

H(sH = 0, bf)≥0 and πL(sL = 0, bf)≥0 when vH =vL = 0, sinceu(S∗, B∗)≥0 follows from the fact that the agent can always earn

zero payoff by exerting zero effort. Note that the principal could always earn positive profits

by offering zero bonus: ∆< xF impliesπL(sL= 0, bL= 0) >0 andπH(sH = 0, bH = 0) >0;

hence πH(sH = 0, bf)> 0 and πL(sL = 0, bf)> 0 follow by the optimality of bf. Thus, the

principal finds it optimal to always induce participation if and only ifvH ∈[0, vH] andvL∈

[0, vL], wherevH ≡πH(sH = 0, bf) +u(S∗, B∗)>0 and vL≡πL(sL= 0, bf) +u(S∗, B∗)>0.

We now turn to the optimal contract under relational contracting. We will consider

contracts with zero base salary, sH = sL = 0, and derive the optimal bonuses brH and b r L, supposing that the participation constraints (12) and (13) are satisfied. We will then argue

that the contract thus derived is in fact optimal and satisfies the participation constraints

when outside options vH ≥0 and vL≥0 are sufficiently small.

Denoting b =bL, and substituting (6) and (7) into (14) gives

bH ≤δ

∞

X

t=1

δt−1[θPt,1+ (1−θ)(1−Pt,1)]πH(bH) +

∞

X

t=1

δt−1[(1−θ)Pt,1+θ(1−Pt,1)]πL(b) !

,

(23)

where πH(bH) is evaluated at e(bH) and πL(b) is evaluated at e(b).

Similarly, denoting b=bH, and substituting (6) and (7) into (15) gives

bL ≤δ

∞

X

t=1

δt−1[θPt,1+ (1−θ)(1−Pt,1)]πL(bL) +

∞

X

t=1

δt−1[(1−θ)Pt,1 +θ(1−Pt,1)]πH(b) !

,

(24)

where πL(bL) is evaluated at e(bL) and πH(b) is evaluated at e(b).

(31)

br

L ≤bf. If either inequality were violated, then setting (bf, bf) instead of (brH, brL) would yield strictly higher profits Π = 1₂ΠH +1₂ΠL by (18) and (19), where πH0 (b)<0 andπ

0

L(b)<0 for b > bf. Moreover, setting (bf, bf) instead of (brH, brL) would decrease the left-hand side and

increase the right-hand side of both (23) and (24). Hence, if (23) and (24) are satisfied by

(br

H, brL), they must also be satisfied by (bf, bf).

The constraint (23) holds strictly at bH = 0, since πH(0) = xF + ∆ > 0 by (20) and

πL(0) = xF −∆ > 0 by (21). The right-hand side of (23) is concave in bH on [0, bf] while

the left-hand side is linear. It follows that for any given b, there is at most one value of

bH ≤bf for which (23) binds, and this value (if it exists) will be strictly positive. Define the

function bH(b) on domain [0, bf] as the minimum of this value andbf. Since π0H(bH)>0 for

bH ∈[0, bf), the optimal bonus in the high state givenbL=b is precisely bH(b).

By the same reasoning, for any b, there is at most one value of bL ≤ bf for which (24)

binds, and this value (if it exists) will be strictly positive. Define the function bL(b) on

domain [0, bf] as the minimum of this value and bf. Since π_L0 (bL) > 0 for bL ∈ [0, bf), the

optimal bonus in the low state given bH =b is preciselybL(b).

The optimal bonus pair (br

H, brL) must satisfybH(bL(brH)) = brH, withbrL=bL(brH). Define the correspondence f(b) by f(bH(b))≡b, so that f(b) is the inverse of bH(b). Then (brH, brL)

must also satisfy bL(brH) = f(brH), again with brL = bL(brH). Geometrically, f(b) is the

reflection of bH(b) about the 45 degree line, and (bHr , brL) lies at an intersection of f(b) and

bL(b).

The function bL(b) is continuous, with domain [0, bf] and range [bL(0), bL(bf)], where

bL(0) >0 and bL(bf)≤bf. The function bH(b) is continuous, with domain [0, bf] and range

[bH(0), bH(bf)], where bH(0) > 0 and bH(bf) ≤ bf. It follows that its inverse f(b) is also

continuous, with domain [bH(0), bH(bf)] and range [0, bf], withf(bH(0)) = 0.

We now show that b0_H(b)>0 andb00_H(b)<0 whenever bH(b)< bf, and thatb0L(b)>0 and

b00_L(b)<0 wheneverbL(b)< bf. The fact thatb0H(b)>0 implies that the correspondencef(b)

is single valued for all b < bf_{, with} _f0₍_b₎ _> _{0, where} _b00

H(b) < 0 implies f

(32)

that bH(b) < bf, so that (23) binds at bH < bf, and fix the value of bH. The right-hand

side of (23) is increasing in b, since π0_L(b) > 0 for all b < bf_{. It follows that (23) no longer}

binds after a marginal increase in b. Since the left-hand-side of (23) is convex in bH and the

right-hand-side is concave, it follows that b0_H(e)>0, and hence f0(b)>0.

Similarly, suppose that bL(b) < bf, so that (24) binds at bL < bf, and fix the value

of bL. The right-hand side of (24) is increasing in b, since πH0 (b) > 0 for all b < bf. It

follows that (24) no longer binds after a marginal increase inb. Hence,b0_L(b)>0. Implicitly

differentiating (23) with respect to b and rearranging gives

b0_H(b) =

δ

P∞ t=1δ

t−1

(1−θ)Pt,1+θ(1−Pt,1)

π0_L(b)

1−δ

P∞ t=1δt−1

θ Pt,1+ (1−θ)(1−Pt,1)

π0

H(bH)

. (25)

The numerator and the denominator of (25) are both positive byπ_L0 (b)>0 andb0_H(b)>0.

An increase in b will decrease the numerator since π00_L(b)<0, and increase the denominator

sinceb0_H(b)>0 andπ_H00(bH)<0. This means thatb00H(b)<0 wheneverbH(b)< bf. Implicitly

differentiating (24) with respect to b and rearranging yields

b0_L(b) =

δ

P∞ t=1δ

t−1

(1−θ)Pt,1+θ(1−Pt,1)

π_H0 (b)

1−δ

P∞ t=1δt−1

θPt,1+ (1−θ)(1−Pt,1)

π0

L(bL)

. (26)

The numerator and the denominator of (26) are both positive byπ_H0 (b)>0 andb0_L(b)>0.

An increase in b will decrease the numerator since π00_H(b)<0 and increase the denominator

since b0_L(b) > 0 and π00_L(bL) < 0. This means that b00L(b) < 0 whenever bL(b) < bf, so that

f00(b)>0, as required.

Given continuity on their respective domain and range, there exists some br

H ∈(0, bf] for which bL(brH) = f(brH), so where the curves bL(b) and f(b) intersect. Since bL(b) is concave

whilef(b) is strictly convex, this point of intersection is unique. It defines the optimal bonus

pair, br

(33)

To compare the size of the two bonuses, note that br

L = brH holds if and only if bL(b) and f(b) intersect on the 45 degree line. This is the case if bL(b) = b implies f(b) = b or,

equivalently, bH(b) = b. In contrast, bLr < brH holds if and only if bL(b) intersects the 45

degree line before f(b) does. This is the case if bL(b) = b implies f(b) < b or, equivalently,

bH(b)> b.

We now prove that bH(b) = bf whenever bL(b) = bf and that bH(b) > bL(b) whenever

bL(b)< bf. It is sufficient to show that (23) holds strictly whenever (24) is satisfied.

Substi-tuting (20) and (21) into (23) and (24) shows the latter two constraints are symmetric with

respect to bH and bL except for terms involving ∆.

Specifically, the expression δ(2θ−1)P∞ t=1δ

t−1₍₂_P

t,1−1)∆ enters with a positive sign on the right-hand side of (23) and a negative sign on the right-hand side of (24). We claim that

Pt,1 >1/2 for allt≥1 which, together with θ >1/2, implies this expression is itself strictly

positive. Hence, (23) holds strictly whenever (24) is satisfied.

We prove the claim Pt,1 > 1/2 by induction. Recall from (5) the recursive definition

P1,1 = 1 and Pt,1 = θPt−1,1 + (1 −θ)(1 − Pt−1,1) for all t ≥ 2. Clearly Pt,1 > 1/2 for

t = 1. Now fix t0 ≥ 2, and suppose that Pt,1 > 1/2 for all t = 1, . . . t0 −1 and write

Pt0_,₁ = θP_t0₋₁_,₁ + (1−θ)(1−P_t0₋₁_,₁). The induction hypothesis implies P_t0₋₁_,₁ > 1/2 and

(1−Pt0₋₁_,₁)<1/2. Combined with θ >1/2, this yields P_t0_,₁ >1/2.

Note that the right-hand side of (24) increases without bound as δtends to 1. Fixb =bf

and defineδ0 ∈(0,1) as the value ofδfor which (24) binds. Then (24) holds for all b∈[0, bf]

if and only if δ ∈[δ0,1]. Therefore, we havebrL< brH for all δ ∈(0, δ0) and brL=brH =bf for

allδ ∈[δ0,1].

We now conclude by showing that the contract with zero base salary, sH = sL = 0,

along with bonus (br

H, brL) is in fact optimal if it satisfies participation constraints (12) and (13), and that these constraints are satisfied when outside options vH ≥ 0 and vL ≥ 0 are

sufficiently small.

(34)

de-creasing in sH and sL, throughπH(sH, bH) andπL(sL, bL). This means that holding bonuses

and agent effort constant, both constraints are more difficult to satisfy when sH > 0 or

sL >0 than when sH =sL = 0. Recall from (9) and (10) that the agent’s optimal effort was

independent of sH and sL. Hence, the contract in question is optimal if it induces the agent

to participate.

Lemma 1 showed that the agent’s payoff is strictly concave in effort, with optimal effort

e(b) > 0 for any b > 0, so in particular e(b_Hr ) > 0 and e(br_L) > 0. Hence, expected payoffs

u(0, br_H) and u(0, br_L) are strictly greater than zero, which is what the agent could earn in

each state by exerting zero effort. It then follows from (12) and (13) that the contract in

question satisfies both participation constraints for vH and vL sufficiently small.

Proof of Proposition 2. The bonus bf _{under formal contracting is defined by the}

first-order condition (22), which is independent ofθ. As in the proof of Proposition 1, define

the optimal bonus pair (br_H, br_L) by bH(bL(brH)) = brH or, equivalently, bL(brH) = f(brH), with

br_L =bL(brH).

We first show that ∂bH

∂b < ∂bL

∂b for any b such that bL(b)< b

f _and _b

H(b)< bf. Recall that ∂bH

∂b and

∂bL

∂b are given by expressions (25) and (26). Since π

0

H(b) = π

0

L(b), these expression only differ by the terms π0_H(bH) and π0L(bL) in their respective denominators. Combining

bH(b) > bL(b) from the proof of Proposition 1 with πH00(b) = π

00

L(b) < 0 for all b < bf gives π_H0 (bH)< πL0(bL), so that (25) and (26) imply ∂b_∂bH < ∂b_∂bL.

We now show that ∂bH

∂θ >0 whenever b < bH(b)< b

f _{and that} ∂bL

∂θ <0 wheneverbL(b)< b < bf. Suppose that (23) binds at bH ∈(b, bf), and fix the value of bH. Differentiating the

right-hand side of (23) with respect to θ gives

δ

∞

X

t=1

δt−1 2Pt,1−1 +Pt,01(2θ−1)

(πH(bH)−πL(b))

, (27)

where P_t,0₁ denotes the derivative of Pt,1 with respect toθ.