Pricing Bermudan options using regression: optimal rates of convergence for lower estimates

(1)

SFB 649 Discussion Paper 2009-023

Pricing Bermudan options

using regression: optimal

rates of convergence for

lower estimates

Denis Belomestny*

*Weierstrass Institute Berlin, Germany

This research was supported by the Deutsche

Forschungsgemeinschaft through the SFB 649 "Economic Risk". http://sfb649.wiwi.hu-berlin.de ISSN 1860-5664 SFB 649, Humboldt-Universität zu Berlin

S

FB

6

4

9 E

C

O

N

O

M

I

C

R

I

S

K

B

E

R

L

I

N

(2)

Pricing Bermudan options using regression: optimal

rates of convergence for lower estimates

Denis Belomestny1,∗ April 21, 2009

Abstract

The problem of pricing Bermudan options using Monte Carlo and a nonparametric regression is considered. We derive optimal non-asymptotic bounds for a lower biased estimate based on the subop-timal stopping rule constructed using some estimates of continuation values. These estimates may be of different nature, they may be local or global, with the only requirement being that the deviations of these estimates from the true continuation values can be uniformly bounded in probability.

Keywords: Bermudan options; Regression; Boundary condition.

1 Introduction

An American option grants the holder the right to select the time at which to exercise the option, and in this differs from a European option which may be exercised only at a fixed date. A general class of American option pricing problems can be formulated through an_Rd _{Markov process} _{_X₍_t₎_, ₀_≤_t_≤ T_}defined on a filtered probability space (Ω,F_,₍F_t₎₀_≤_t_≤_T_,_{P). It is assumed}

thatX(t) is adapted to (F_t₎₀_≤_t_≤_T _{in the sense that each}_X_t_isF_t_measurable.

Recall that each F_t _{is a} _σ_{-algebra of subsets of Ω such that} F_s _⊆ F_t _⊆ F

fors_≤t. We interpretF_t _{as all relevant financial information available up}

to timet. We restrict attention to options admitting a finite set of exercise opportunities 0 =t0 < t1 < t2 < . . . < tL =T, sometimes called Bermudan

options. If exercised at timetl, l= 1, . . . , L, the option pays fl(X(tl)), for

some known functionsf0, f1, . . . , fL mappingRd into [0,∞). LetTndenote

the set of stopping times taking values in _{n, n+ 1, . . . , L_}. A standard result in the theory of contingent claims states that the equilibrium price

1_{Weierstrass Institute for Applied Analysis and Stochastics, Mohrenstr. 39, 10117} Berlin, Germany. [email protected].

2_{JEL Subject Classification: G14; C15.}

∗

(3)

Vn(x) of the American option at timetnin statexgiven that the option was

not exercised prior totn is its value under an optimal exercise policy:

Vn(x) = sup τ∈T_n

E[fτ(X(tτ))|X(tn) =x), x∈Rd.

Pricing an American option thus reduces to solving an optimal stopping problem. Solving this optimal stopping problem and pricing an American option are straightforward in low dimensions. However, many problems arising in practice (see e.g. Glasserman (2004)) have high dimensions, and these applications have motivated the development of Monte Carlo meth-ods for pricing American option. Pricing American style derivatives with Monte Carlo is a challenging task because the determination of optimal ex-ercise strategies requires a backwards dynamic programming algorithm that appears to be incompatible with the forward nature of Monte Carlo sim-ulation. Much research was focused on the development of fast methods to compute approximations to the optimal exercise policy. Notable exam-ples include the functional optimization approach in Andersen (2000), mesh method of Broadie and Glasserman (1997), the regression-based approaches of Carriere (1996), Longstaff and Schwartz (2001), Tsitsiklis and Van Roy (1999) and Egloff (2005). A common feature of all above mentioned algo-rithms is that they deliver estimates Cb0(x), . . . ,CbL−1(x) for the so called

continuation values:

Ck(x) := E[Vk+1(X(tk+1))|X(tk) =x], k= 0, . . . , L−1.

(1.1)

An estimate forV0, the price of the option at timet0 can then be defined as

e

V0(x) := max{f0(x),Cb0(x)}, x∈Rd.

This estimate basically inherits all properties of Cb0(x). In particular, it is

usually impossible to determine the sign of the bias of Ve0 since the bias of

b

C0 may change its sign. One way to get a lower bound (low biased estimate)

forV0 is to construct a (generally suboptimal) stopping rule

b

τ = min_{0_≤k_≤L:Cbk(X(tk))≤fk(X(tk))}

withCbL≡0 by definition. Simulating a new independent set of trajectories

and averaging the pay-offs stopped according to_bτ on these trajectories gives us a lower bound Vb0 forV0. As was observed by practitioners, the so

con-structed estimateVb0has rather stable behavior with respect to the estimates

of continuation valuesCb0(x), . . . ,CbL−1(x), that is even rather poor estimates

of continuation values may lead to a good estimateVb0.The aim of this paper

is to find a theoretical explanation of this observation and to investigate the properties ofVb0. In particular, we derive optimal non-asymptotic bounds for

(4)

It is shown that the bounds forV0−EVb0are usually much tighter than ones

forV0−EVe0implying a better quality ofVb0 as compared to the quality ofVe0

constructed using one and the same set of estimates for continuation values. The issues of convergence for regression algorithms have been already studied in several papers. Cl´ement, Lamberton and Protter (2002) were first who proved the convergence of the Longstaff-Schwartz algorithm. Glasser-man and Yu (2005) have shown that the number of Monte Carlo paths has to be in general exponential in the number of basis functions used for regres-sion in order to ensure convergence. Recently, Egloff, Kohler and Todorovic (2007) (see also Kohler (2008)) have derived the rates of convergence for continuation values estimates obtained by the so called dynamic look-ahead algorithm (see Egloff (2004)) that “interpolates” between Longstaff-Schwartz and Tsitsiklis-Roy algorithms. They presented the convergence rates forVe0

which coincide with the rates of Cb0 and are determined by the smoothness

properties of the true continuation valuesC0, . . . , CL−1. It turns out that the

convergence rates forVb0 depend not only on the smoothness of continuation

values (as opposite to Ve0), but also on the behavior of the underlying

pro-cess near the exercise boundary. Interestingly enough, there are cases where these rates become almost independent either of the smoothness properties of_{Ck} or of the dimension ofXand the bias ofVb0 decreases exponentially

in the number of Monte Carlo paths used to construct_{Cbk}.

The paper is organized as follows. In Section 2.1 we introduce and dis-cuss the so called boundary assumption which describes the behavior of the underlying processX near the exercise boundary and heavily influences the properties of Vb0. In Section 2.2 we derive non-asymptotic bounds for the

biasV0−EVb0and prove that these bounds are optimal in the minimax sense.

Finally, we illustrate our results by a numerical example.

2 Main results

2.1 Boundary assumption

For the considered Bermudan option let us introduce a continuation region

C_{and an exercise (stopping) region}E _:

C _:= _{₍_{i, x}_{) :}_f_i₍_x₎_{< C}_i₍_x₎_}_,

(2.2)

E _:= _{₍_{i, x}_{) :}_f_i₍_x₎_≥_C_i₍_x₎_}_.

Furthermore, let us assume that there exist constantsB0,k >0,k= 0, . . . , L−

1 andα >0 such that the inequality

(2.3) Ptk|t0(0<|Ck(X(tk))−fk(X(tk))| ≤δ)≤B0,kδ

α_, _{δ >}₀_,

holds for all k = 0, . . . , L₋1, where P_tk_|_t₀ is the conditional distribution of X(tk) given X(t0). Assumption (2.3) provides a useful characterization

(5)

of the behavior of the continuation values _{Ck} and payoffs {fk} near the

exercise boundary ∂E_{. Although this assumption seems quite natural to}

look at, we make in this paper, to the best of our knowledge, a first attempt to investigate its influence on the convergence rates of lower bounds based on suboptimal stopping rules.

In the situation when all functionsCk−fk, k= 0, . . . , L−1 are smooth

and have non-vanishing derivatives in the vicinity of the exercise boundary, we haveα= 1. Other values of α are possible as well. We illustrate this by two simple examples.

Example 1 Fix someα >0 and consider a two period (L= 1) Bermudan

power put option with the payoffs

f0(x) =f1(x) = (K1/α−x1/α)+, x∈R+, K >0.

(2.4)

Denote by ∆ the length of the exercise period, i.e. ∆ =t1−t0.If the process

X follows the Black-Scholes model with volatilityσ and zero interest rate, then one can show that

C0(x) := E[f1(X(t1))|X(t0) =x] =K1/αΦ(−d2)

−x1/αe∆(α−1−1)(σ2/2α)Φ(₋d1)

with Φ being the cumulative distribution function of the standard normal distribution, d1 = log(x/K) + _α1 ₋1₂σ2_∆ σ√∆ and d2 = d1−σ √

∆/α. As can be easily seen, the function C0(x)−f0(x)

satisfies_|C0(x)−f0(x)| ≍x1/α forx→+0 andC0(x)> f0(x) for all x >0

ifα_≥1. Hence

P(0<_|C0(X(t0))−f0(X(t0))| ≤δ).δα, δ →0, α≥1.

Taking differentαin the definition of the payoffs (2.4), we get (2.3) satisfied forα ranging from 1 to_∞.

In fact, even the extreme case “α=_∞” may take place as shown in the next example.

Example 2 Let us consider again a two period Bermudan option such

that the corresponding continuation valueC0(x) = E[f1(X(t1))|X(t0) = x]

is positive and monotone increasing function ofx on any compact set inR. Fix some x0 ∈ R and choose δ0 satisfying δ0 < C0(x0). Define the payoff

functionf0(x) in the following way

f0(x) =

(

C0(x0) +δ0, x < x0,

(6)

1 2 3 4 5 6 7 8 0.0 0.5 1.0 1.5 2.0 2.5 3.0 x f0(x) C0(x)

Figure 1: Illustration to Example 2.

So,f0(x) has a “digital” structure. Figure 1 shows the plots ofC0 and f0 in

the case whereX follows the Black-Scholes model andf1(x) = (x−K)+. It

is easy to see that

Pt0(0<|C0(X(t0))−f0(X(t0))| ≤δ0) = 0.

On the other hand

C ₌ _{_x_∈_R_:_C₀₍_x₎_≥_f₀₍_x₎_}₌_{_x_∈_R_:_x_≥_x₀_}_, E ₌ _{_x_∈_R_:_C₀₍_x₎_{< f}₀₍_x₎_}₌_{_x_∈_R_:_{x < x}₀_}_.

So, both continuation and exercise regions are not trivial in this case. The last example is of particular interest because as will be shown in the next sections the bias of Vb0 decreases in this case exponentially in the

number of Monte Carlo paths used to estimate the continuation values, a lower boundVb0 was constructed from.

2.2 Non-asymptotic bounds for V₀₋EVb₀

LetCbk,M, k= 1, . . . , L−1,be some estimates of continuation values obtained

usingM paths of the underlying processX starting fromx0 at timet0. We

may think of (X(1)₍_t₎_{, . . . , X}(M)₍_t_{)) as being a vector process on the product}

probability space withσ-algebraF⊗M _{and the product measure P}⊗M

x0 defined

onF⊗M _via

(7)

with Am ∈ F, m = 1, . . . , M. Thus, each Cbk,M, k = 0, . . . , L −1, is

measurable with respect to F⊗M_{. The following proposition provides}

non-asymptotic bounds for the bias V0−EVb0,M of a lower bound Vb0,M given

uniform probabilistic bounds for _{Cbk,M}.

Proposition 2.1. Suppose that there exist constants B1, B2 and a positive

sequenceγM such that for any δ > δ0 >0 it holds

P⊗_x₀M_|Cbk,M(x)−Ck(x)| ≥δγ_M−1/2

≤B1exp(−B2δ)

(2.5)

for almost all x with respect to P_tk_|_t₀, the conditional distribution of X(tk)

givenX(t0), k= 0, . . . , L−1. Define V0,M := E f_τM_b (X(t_τM_b ))_|X(t0) =x0 (2.6) with b τM := min n 0_≤k_≤L:Cbk,M(X(tk))≤fk(X(tk)) o . (2.7)

If the boundary condition (2.3) is fulfilled, then

0_≤V0−EP⊗M x₀ [V0,M]≤B "_L₋₁ X l=0 B0,l # γ_M−(1+α)/2

with some constant B depending only on α, B1 and B2.

The above convergence rates are, in fact, optimal in the following sense.

Proposition 2.2. Fix a set of non-zero payoff functions f0, . . . , fL and let

P_α _{be a class of pricing measures such that the boundary condition} _(2.3) _is

fulfilled with someα >0. For any positive sequenceγM satisfying

γ_M−1=o(1), γM =O(M), M _{→ ∞},

there exist a subset P_α,γ _of P_α _{and a constant} _{B >} ₀ _{such that for any}

M _≥1, any stopping rule τM_b and any set of estimates _{Cbk,M} measurable

w.r.t. F⊗M_{, we have for some}_{δ >}₀ _and _k_{= 0}_{, . . . , L}₋₁_,

sup

P∈P_α,γ

P⊗M_|Cbk,M(x)−Ck(x)| ≥δγM−1/2

>0

for almost allx w.r.t. any P_∈P_α,γ _and

sup P∈P_α,γ sup τ∈T₀ EFt0 P [fτ(X(tτ))]−EP⊗M[E F_t 0 P fbτM(X(tbτM))] ≥Bγ_M−(1+α)/2.

(8)

Finally, we discuss the case when “α = _∞”, meaning that there exists

δ0 >0 such that

P_tk_|_t₀(0<_|Ck(X(tk))−fk(X(tk))| ≤δ0) = 0

(2.8)

for k = 0, . . . , L₋1. This is very favorable situation for pricing. It turns out that if the continuation values estimates_{Cbk,M}satisfy a kind of

expo-nential inequality and (2.8) holds, then the bias of Vb0,M converges to zero

exponentially fast inγM.

Proposition 2.3. Suppose that for any δ >0 there exist constants B1, B2

possibly depending onδ and a sequence of positive numbersγM not depending

on δ such that

P⊗_x₀M_|Cbk,M(x)−Ck(x)| ≥δ

≤B1exp(−B2γM)

(2.9)

for almost all x with respect to Ptk|t0, k = 0, . . . , L−1. Assume also that

there exist a constant Bf >0 such that

(2.10) E max k=0,...,Lf 2 k(X(tk)) ≤Bf, k= 0, . . . , L.

If the condition (2.8) is fulfilled with some δ0 >0, then

0_≤V0−E_P⊗M

x₀ [V0,M]≤B3exp(−B4γM)

with some constant B3 andB4 depending only on B1, B2 and Bf.

Discussion Let us make a few remarks on the results of this section. First,

Proposition 2.1 implies that the convergence rates ofVb0,M are always faster

than the convergence rates of_{Cbk,M}provided thatα >0. Indeed, while the

convergence rates of_{Cbk,M}are of orderγM−1/2, the bias ofVb0,M converges to

zero as fast asγ_M−(1+α)/2.As to the variance ofVb0,M, it can be made arbitrary

small by averagingVb0,M over a large number of sets, each consisting of M

trajectories, and by taking a large number of new Monte Carlo paths used to average the payoffs stopped according toτ_bM.

Second, if the condition (2.8) holds true, then the bias of Vb0,M decreases

exponentially in γM, indicating that even very unprecise estimates of the

continuation values would lead to the estimate Vb0,M of acceptable quality.

Finally, let us stress that the results obtained in this section are quite general and do not depend on the particular form of the estimates _{Cbk,M},

only the inequality (2.5) being crucial for the result to hold. This inequality holds for various types of estimators. These may be global least squares estimators or local polynomial estimators. In particular, it can be shown that if all continuation values _{Ck} belong to the H¨older class Σ(β, H,Rd)

and the conditional law of X satisfies some regularity assumptions, then the local polynomial estimates of continuation values satisfy inequality (2.5) withγM =M2β/(2β+d)log−1(M).

(9)

3 Numerical example: Bermudan max call

This is a benchmark example studied in Broadie and Glasserman (1997) and Glasserman (2004) among others. Specifically, the model withdidentically distributed assets is considered, where each underlying has dividend yieldδ. The risk-neutral dynamic of assets is given by

dXk(t) Xk(t)

= (r₋δ)dt+σdWk(t), k= 1, ..., d,

where Wk(t), k = 1, ..., d, are independent one-dimensional Brownian

mo-tions andr, δ, σ are constants. At any time t_{∈ {}t0, ..., tL} the holder of the

option may exercise it and receive the payoff

f(X(t)) = (max(X1(t), ..., Xd(t))−K)+.

We take d= 2, r = 5%, δ = 10%, σ = 0.2 andti =iT /L, i= 0, ..., L, with T = 3, L = 9 as in Glasserman (2004, Chapter 8). First, we estimate all continuation values via the dynamic programming algorithm using the so called Nadaraya-Watson regression estimator

(3.11) Cbr,M(x) = PM m=1K((x−X(m)(tr))/h)Y (m) r+1 PM m=1K((x−X(m)(tr))/h) withY_r(₊₁m) = max(fr+1(X(m)(tr+1)),Cbr+1,M(X(m)(tr+1))), r= 0, . . . , L−1.

Here K is a kernel, h > 0 is a bandwidth and (X(m)(t1), . . . , X(m)(tL)),

m = 1, . . . , M, is the set of paths of the process X, all starting from the point x0 = (90,90) at t0 = 0. As can be easily seen the estimator (3.11) is

a local polynomial estimator of degree 0. Upon estimating Cb1,M, we define

an estimate for the price of the option at timet0= 0 as

e V0 := 1 M M X m=1 Y₁(m).

Next, using the so constructed estimates of continuation values we construct a stopping policy _bτ which is defined pathwise as

b

τ(n) := minn1_≤k_≤L:Cbk,M(Xe(n)(tk))≤fk(Xe(n)(tk))

o

, n= 1, . . . , N,

where (Xe(n)₍_t

1), . . . ,Xe(n)(tL)), n = 1, . . . , N, is a new independent set of

trajectories of the process X, all starting fromx0= (90,90) att0 = 0. The

stopping policyτbyields a lower bound

b V0= 1 N N X n=1 f_τ_b(n)(Xe(n)(t_τ_b(n))).

(10)

hat V_0 tilde V_0 6 7 8 9 10 h=5 hat V_0 tilde V_0 7 8 9 10 h=40 hat V_0 tilde V_0 7 8 9 10 11 12 13 h=60 hat V_0 tilde V_0 8 10 12 14 16 18 h=100

Figure 2: Boxplots of the estimates Vb0 and Ve0 for different values of the

(11)

In Figure 2 we show the boxplots of Ve0 and Vb0 based on 100 sets of

tra-jectories each of the size M = 1000 for different values of the bandwidthh, where the triangle kernel K(x) = (1_{− k}x_k2)+ is used to construct (3.11). Also the true value V0 of the option (8.08 in this case), computed using a

two-dimensional binomial lattice, is shown as a red base line. Several obser-vations can be made by an examination of Figure 2. First, while the bias of

b

V0 is always smaller then the bias ofVe0, the largest difference takes place for

largeh. This can be explained by the fact that for largehmore observations

Y_r(₊₁m) with X(m)(tr) lying far away from the given pointx become involved

in the construction of Cbr,M(x). This has a consequence of increasing the

bias of the estimate (3.11). The most interesting phenomenon is, however, the behavior of Vb0 which turns out to be quite stable with respect toh. So

even in the case of rather poor estimates of continuation values (whenh is large)Vb0 still looks reasonable.

We stress that the aim of this example is not to show the strength of the local polynomial estimation algorithms (for this we would take largerM and higher order kernels) but rather to illustrate the main claim of this paper, namely the claim about the efficiency of Vb0 as compared to the estimates

based on the direct use of continuation values estimates.

4 Conclusion

In this paper we have derived optimal rates of convergence for lower biased estimates of the price of a Bermudan option based on suboptimal exercise policies obtained from some estimates of the optimal continuation values. We have shown that these rates are usually much faster than the convergence rates of the corresponding continuation values estimates. This may explain the efficiency of these lower bounds observed in practice. Moreover, it turns out that there are some cases where the expected values of the lower bounds based on suboptimal stopping rules achieve very fast convergence rates which are exponential in the number of paths used to estimate the corresponding continuation values. This suggests that the algorithms based on suboptimal stopping rules (e.g. Longstaff-Schwartz algorithm) rather than on the direct use of the continuation values estimates might be preferable.

5 Proofs

5.1 Proof of Proposition 2.1 Define τj := min{j≤k < L:Ck(X(tk))≤fk(X(tk))}, j= 0, . . . , L, b τj,M := min{j≤k < L:Cbk(X(tk))≤fk(X(tk))}, j= 0, . . . , L

(12)

and

Vk,M(x) := E[fbτk,M(X(tbτk,M))|X(tk) =x], x∈Rd.

The so called Snell envelope process Vk is related to τk via Vk(x) = E[fτk(X(tτk))|X(tk) =x], x∈Rd.

The following lemma provides a useful inequality which will be repeatedly used in our analysis.

Lemma 5.1. For any k= 0, . . . , L₋1, it holds with probability one

(5.12) 0_≤Vk(X(tk))−Vk,M(X(tk)) ≤EFtk "_L₋₁ X l=k |fl(X(tl))−Cl(X(tl))| ×1_{b_τl,M_{>l, τl}₌_l_}+1_{b_τl,M₌_{l, τl>l}_}i.

Proof. We shall use induction to prove (5.12). Fork=L₋1 we have

VL−1(X(tL−1))−VL−1,M(X(tL−1)) = = EFtL−1 h (fL−1(X(tL−1))−fL(X(tL)))1{τL−1=L−1,τLb −1,M=L} i + EFtL−1 h (fL(X(tL))₋fL−1(X(tL−1)))1{τL−1=L,τbL−1,M=L−1} i =_|fL−1(X(tL−1))−CL−1(X(tL−1))|1{bτL−1,M6=τL−1}

since events _{τL−1 = L} and {bτL−1,M = L} are measurable w.r.t. FtL−1.

Thus, (5.12) holds withk=L₋1. Suppose that (5.12) holds withk=L′+1. Let us prove it fork=L′_{. Consider a decomposition}

fτL′(X(tτ_L′))−fbτ_L′_,M(X(tbτ_L′_,M)) = S1+S2+S3 with S1 := fτ_L′(X(tτ_L′))−fτb_L′_,M(X(tbτ_L′_,M)) 1_{_τ L′>L′,bτ_L′_,M>L′} S2 := fτL′(X(tτ_L′))−fτb_L′_,M(X(tbτ_L′_,M)) 1{τ_L′>L′,bτ_L′_,M=L′} S3 := fτ_L′(X(tτ_L′))−fτb_L′_,M(X(tbτ_L′_,M)) 1_{_τ_L′=L′,bτ_L′_,M>L′}. Since EFt_L′ _[_S 1] = E F_t L′ _V L′₊₁(X(t_L′₊₁))−V_L′₊₁_,M(X(t_L′₊₁)) 1{τ_L′>L′,bτ_L′_,M>L′}, EFt_L′ _[_S 2] = EFt_L′ h fτ_L′₊₁(X(tτ_L′₊₁)) i −fL′(X(t_L′)) 1_{_τ_L′>L′,bτ_L′_,M=L′} = (CL′(X(t_L′))−f_L′(X(t_L′)))1_{_τ L′>L′,τb_L′_,M=L′}

(13)

and EFt_L′ _[_S 3] = fL′(X(t_L′))−E F_t L′ h f_bτ_L′₊₁_,M(X(tbτ_L′₊₁_,M)) i 1{τ_L′=L′,bτ_L′_,M>L′} = (fL′(X(t_L′))−C_L′(X(t_L′)))1_{_τ L′=L′,τb_L′_,M>L′} + EFt_L′ h VL′₊₁(X(t_L′₊₁))−V_L′₊₁_,M(X(t_L′₊₁)) 1_{_τ_L′=L′,τb_L′_,M>L′} i ,

we get with probability one

VL′(X(t_L′))−V_L′_,M(X(t_L′) ≤ |f_L′(X(t_L′))−C_L′(X(t_L′))| ×1_{b_τ_L′_,M>L′, τ_L′=L′}+1{bτ_L′_,M=L′, τ_L′>L′} + EFt_L′ _V L′₊₁(X(t_L′₊₁))−V_L′₊₁_,M(X(t_L′₊₁)) .

Our induction assumption implies now that

VL′(X(t_L′))−V_L′_,M(X(t_L′))≤ EFt_L′ "_L₋₁ X l=L′ |fl(Xl)−Cl(Xl)| 1_{b_τl,M_{>l, τl}₌_l_}+1_{b_τl,M₌_{l, τl>l}_} #

and hence (5.12) holds fork=L′_.

Let us continue with the proof of Proposition 2.1. Consider the sets

E_l_,A_l,j _⊂_Rd_{, l}_{= 0}_{, . . . , L}₋₁_{, j}_{= 1}_,₂_{, . . . ,}_{defined as} E_l _:= n_x_∈_Rd_:_Cb_l,M₍_x₎_≤_f_l₍_x₎_{, C}_l₍_x₎_{> f}_l₍_x₎o ∪nx_∈Rd:Cbl,M(x)> fl(x), Cl(x)≤fl(x) o , A_l,₀ _:= n_x_∈_Rd_{: 0}_<_|_C_l₍_x₎₋_f_l₍_x₎_{| ≤}_γ−1/2 M o , A_l,j _:= n_x_∈_Rd_{: 2}j−1_γ−1/2 M <|Cl(x)−fl(x)| ≤2jγM−1/2 o , j >0. We may write V0(X(t0))−V0,M(X(t0)) ≤ EFt0 "_L₋₁ X l=0 |fl(X(tl))−Cl(X(tl))|1{X(tl)∈E_l_} # = ∞ X j=0 EFt₀ "_L₋₁ X l=0 |fl(X(tl))−Cl(X(tl))|1{X(tl)∈A_l,j_∩E_l_} # ≤ γ_M−1/2 L_X−1 l=0 Ptl|t0 0<_|Cl(X(tl))−fl(X(tl))| ≤γM−1/2 + ∞ X j=1 EFt₀ "_L₋₁ X l=0 |fl(X(tl))−Cl(X(tl))|1{X(tl)∈Al,j∩El} # .

(14)

Using the fact that

|fl(X(tl))−Cl(X(tl))| ≤ |Cbl,M(X(tl)−Cl(X(tl))|, l= 0, . . . , L−1,

onE_l_{, we get for any}_j _≥_{1 and}_l_≥₀

EFt0 E P⊗M x0 h |fl(X(tl))−Cl(X(tl))|1{X(tl)∈A_l,j_∩E_l_} i ≤2jγ_M−1/2EFt₀_E P⊗M x₀ h 1 {|Cl,Mb (X(tl)−Cl(X(tl))|≥2j−1_γ−1/2 M } ×1 {0<|fl(X(tl))−Cl(X(tl))|≤2j_γ−1/2 M } i ≤2jγ_M−1/2EFt₀h_P⊗M x0 (|Cbl,M(X(tl))−Cl(X(tl))| ≥2 j−1_γ−1/2 M ) ×1 {0<|fl(X(tl))−Cl(X(tl))|≤2j_γ−1/2 M } i ≤B12jγ_M−1/2exp −B22j−1 P_tl_|_t₀(0 <_|fl(X(tl))−Cl(X(tl))| ≤2jγ_M−1/2) ≤B1B0,l2j(1+α)γ_M−(1+α)/2exp −B22j−1 ,

where Assumption 2.3 is used to get the last inequality. Finally, we get

V0(X(t0))−E_P⊗M x₀ [V0,M(X(t0))] ≤ "_L₋₁ X l=0 B0,l # γ_M−(1+α)/2+B′ "_L₋₁ X l=0 B0,l # γ_M−(1+α)/2X j≥1 2j(1+α)exp(₋B22j−1) ≤B "_L₋₁ X l=0 B0,l # γ_M−(1+α)/2

with some constantB depending on B1,B2 and α.

5.2 Proof of Proposition 2.2

For the sake of simplicity we consider the case of a three period Bermu-dan option with two possible exercise datest1 and t2 (exercise at t0 is not

possible). We also assume that the payoff functionf2 has a “digital”

struc-ture, i.e. it takes two values 0 and 1. The extension to a general case is straightforward but somewhat cumbersome.

We have (5.13) V0(X(t0))−Vb0,M(X(t0)) = = EFt0 [(f₁(X(t₁))−f₂(X(t₂)))1(τ₁ = 1,_bτ₁_,M = 2)] + EFt₀ _[(_f 2(X(t2))−f1(X(t1)))1(τ1 = 2,bτ1,M = 1)] = EFt0 h |f1(X(t1))−C1(X(t1))|1{bτ1,M6=τ1} i .

(15)

For an integerq_≥1 consider a regular grid on [0,1]d defined as Gq = 2k1+ 1 2q , . . . , 2kd+ 1 2q : ki ∈ {0, . . . , q−1}, i= 1, . . . , d .

Letnq(x)∈Gqbe the closest point tox∈Rdamong points inGq. Consider

the partition X′

1, . . . ,X′qd of [0,1]d canonically defined using the grid Gq (x

andybelong to the same subset if and only ifnq(x) =nq(y)). Fix an integer m_≤qd. For any i_{∈ {}1, . . . , m_}, defineX_i ₌X′

i and X0 =Rd\Smi=1Xi, so

that X₀_{, . . . ,}X_m _{form a partition of} _Rd_{. Denote by} B_q,j _{the ball with the}

center innq(Xj) and radius 1/2q.

Define a hypercubeH₌_{_P_¯_σ _{: ¯}_σ_{= (}_σ₁_{, . . . , σ}_m₎_{∈ {−}₁_,₁_}m_}_of

probabil-ity distributions Pσ¯ of the r.v. (X(t1), f2(X(t2))) valued inRd×{0,1}as

fol-lows. For any Pσ¯ ∈Hthe marginal distribution ofX(t1) (givenX(t0) =x0)

does not depend on ¯σand has a bounded densityµw.r.t. the Lebesgue mea-sure on_Rd _{such that P}

µ(X0) = 0 and

Pµ(Xj) = Pµ(Bq,j) =

Z

B_q,j

µ(x)dx=ω, j= 1, . . . , m

for someω >0. In order to ensure that the densityµ remains bounded we assume that qd_ω₌_O_(1).

The distribution of f2(X(t2)) given X(t1) is determined by the

proba-bility P¯σ(f2(X(t2)) = 1|X(t1) =x) which is equal toC1,¯σ(x). Define C1,σ¯(x) =f1(x) +σjφ(x), x∈Xj, j= 1, . . . , m,

and C1,σ¯(x) = f1(x) on X0, where φ(x) = γ_M−1/2ϕ(q[x−nq(x)]), ϕ(x) = Aϕθ(kxk) with some constant Aϕ >0 and with θ:R+ →R+ being a

non-increasing infinitely differentiable function such thatθ(x)_≡1 on [0,1/2] and

θ(x)_≡0 on [1,_∞). Without loss of generality we may assume thatf1(x) is

strictly positive on [0,1]d_{, i.e. there exist two real numbers 0}_{< f}

−< f+<1

such that f− ≤f1(x) ≤ f+. Taking Aϕ small enough, we can then ensure

that 0 _≤ C1,σ¯(x) ≤ 1 on Rd. Obviously, it holds φ(x) = Aϕγ_M−1/2 for

x_∈B_q,j_{. As to the boundary assumption (2.3), we have}

Pµ(0<|f1(X(t1))−C1,σ¯(X(t1))| ≤δ) = m X j=1 Pµ(0<|f1(X(t1))−C1,¯σ(X(t1))| ≤δ, X(t1)∈Bq,j) = m X j=1 Z B_q,j 1_{₀_<φ₍_x₎_≤_δ_}µ(x)dx=mω1 {Aϕγ_M−1/2≤δ}

(16)

and (2.3) holds provided that mω=O(γ_M−α/2). Let τ_bM be a stopping time

measurable w.r.t. F⊗M_{, then the identity (5.13) leads to}

EFt0 Pσ¯ [fτ(X(τ))]−EP⊗σ¯M[E F_t 0f b τM(X(bτM))] = E_P⊗M ¯ σ E F_t 0 Pµ h |∆σ¯(X(t1))|1{bτ1,M6=τ1} i , with ∆σ¯(X(t1)) = f1(X(t1))−C1,σ¯(X(t1)). By conditioning on X(t1), we get E_P⊗M ¯ σ E F_t 0 Pµ h |∆¯σ(X(t1))|1{τb1,M6=τ1} i =ω m X j=1 E_P⊗M ¯ σ E F_t 0 Pµ h φ(X(t1))1{bτ1,M=6 τ1}|X(t1)∈Bq,j i =Aϕmωγ_M−1/2E F_t 0 Pµ P ⊗M ¯ σ (bτ1,M 6=τ1).

Using now a well known Birg´e’s or Huber’s lemma (see, e.g. Devroye, Gy¨orfi and Lugosi, 1996, p. 243), we get

sup ¯ σ∈{−1;+1}m P⊗_¯_σM(bτ1,M 6=τ1)≥ 0.36_∧ 1₋ M KH log(_|H_|₎ ,

whereKH:= sup_P,Q_∈HK(P, Q) andK(P, Q) is a Kullback-Leibler distance

between two measuresP andQ. Since for any two measures P andQfrom

H _with_Q₆₌_P _{it holds} K(P, Q) _≤ sup ¯ σ1,σ¯2∈{−1;+1}m ¯ σ16=¯σ2 EFt0 Pµ C1,σ¯2(X(t1)) log C1,σ¯1(X(t1)) C1,σ¯2(X(t1)) +(1₋C1,¯σ2(X(t1))) log 1₋C1,σ¯1(X(t1)) 1₋C1,σ¯2(X(t1)) ≤ (1₋f+−Aϕ)−1(f−−Aϕ)−1E F_t 0 Pµ φ2(X(t1))1{X(t1)6∈X0}

for small enoughAϕ, and log(|H|) =mlog(2), we get

sup ¯ σ∈{−1;+1}m n EFt0 Pσ¯ [fτ,σ¯(X(τ))]−EP⊗¯σM[E F_t 0 f_b_τM_,_σ_¯(X(bτM))] o ≥ AϕmωγM−1/2(1−AM γM−1ω)&γ −(1+α)/2 M ,

provided thatmω > Bγ_M−α/2 for someB > 0 and AM ω < γM, whereA is

a positive constant depending on f−, f+ and Aϕ.Using similar arguments,

we derive

sup

¯

σ∈{−1;+1}m

P_σ⊗_¯M(_|C1,¯σ(x)−Cb1,M(x)|> δγ−_M1/2)>0

for almost x w.r.t. Pµ, some δ > 0 and any estimator Cb1,M measurable

(17)

5.3 Proof of Proposition 2.3

Using the arguments similar to ones in the proof of Proposition 2.1, we get

(5.14) V0(X(t0))−EP⊗M x0 [V0,M(X(t0))]≤ δ0 L_X−1 l=0 P_tl_|_t₀(0<_|Cl(X(tl))−fl(X(tl))| ≤δ0) + L−1 X l=0 EFt₀ _E P⊗M x₀ [|Cl(X(tl))−fl(X(tl))| ×1_{_X₍_tl₎_∈E_l_}1_{|_Cl₍_X₍_tl₎₎₋_fl₍_X₍_tl₎₎_|_>δ₀_}

with E_l _{defined as in the proof of Proposition 2.1. The first summand on}

the right-hand side of (5.14) is equal to zero due to (2.8). Hence, Cauchy-Schwarz and Minkowski inequalities imply

Now the application of (2.9) finishes the proof.

References

L. Andersen (2000). A simple approach to the pricing of Bermudan swap-tions in the multi-factor Libor Market Model. Journal of Computational

Finance,3, 5-32.

D. Belomestny, G.N. Milstein and V. Spokoiny (2006). Regression methods in pricing American and Bermudan options using consumption processes, to appear inQuantitative Finance.

D. Belomestny, Ch. Bender and J. Schoenmakers (2007). True upper bounds for Bermudan products via non-nested Monte Carlo, to appear in

Math-ematical Finance.

M. Broadie and P. Glasserman (1997). Pricing American-style securities us-ing simulation.J. of Economic Dynamics and Control, 21, 1323-1352. J. Carriere (1996). Valuation of early-exercise price of options using

simu-lations and nonparametric regression. Insuarance: Mathematics and

(18)

E. Cl´ement, D. Lamberton and P. Protter (2002). An analysis of a least squares regression algorithm for American option pricing. Finance and

Stochastics,6, 449-471.

L. Devroye, L. Gy¨orfi and G. Lugosi (1996). A probabilistic theory of pattern recognition. Application of Mathematics (New York), 31, Springer. R. M. Dudley (1999). Uniform Central Limit Theorems, Cambridge

Univer-sity Press, Cambridge, UK.

D. Egloff (2005). Monte Carlo algorithms for optimal stopping and statistical learning.Ann. Appl. Probab.,15, 1396-1432.

D. Egloff, M. Kohler and N. Todorovic (2007). A dynamic look-ahead Monte Carlo algorithm for pricing Bermudan options, Ann. Appl. Probab., 17, 1138-1171.

E. Gin´e and A. Guillou (2002). Rates of strong uniform consistency for multivariate kernel density estimators.Ann. I. H. Poincar´e,6, 907-921. P. Glasserman (2004). Monte Carlo Methods in Financial Engineering.

Springer.

P. Glasserman and B. Yu (2005). Pricing American Options by Simula-tion: Regression Now or Regression Later?, Monte Carlo and Quasi-Monte Carlo Methods, (H. Niederreiter, ed.), Springer, Berlin.

M. Kohler (2008). Universally consistent upper bounds for Bermudan op-tions based on Monte Carlo and nonparametric regression. Working paper. D. Lamberton and B. Lapeyre (1996). Introduction to Stochastic Calculus

Applied to Finance. Chapman & Hall.

F. Longstaff and E. Schwartz (2001). Valuing American options by simula-tion: a simple least-squares approach. Review of Financial Studies, 14, 113-147.

M. Talagrand (1994). Sharper bounds for Gaussian and empirical processes.

Ann. Probab.,22, 28-76.

J. Tsitsiklis and B. Van Roy (1999). Regression methods for pricing complex American style options.IEEE Trans. Neural. Net.,12, 694-703.

(19)

SFB 649 Discussion Paper Series 2009

For a complete list of Discussion Papers published by the SFB 649,

please visit http://sfb649.wiwi.hu-berlin.de.

001 "Implied Market Price of Weather Risk" by Wolfgang Härdle and Brenda López Cabrera, January 2009.

002 "On the Systemic Nature of Weather Risk" by Guenther Filler, Martin Odening, Ostap Okhrin and Wei Xu, January 2009.

003 "Localized Realized Volatility Modelling" by Ying Chen, Wolfgang Karl Härdle and Uta Pigorsch, January 2009.

004 "New recipes for estimating default intensities" by Alexander Baranovski, Carsten von Lieres and André Wilch, January 2009.

005 "Panel Cointegration Testing in the Presence of a Time Trend" by Bernd Droge and Deniz Dilan Karaman Örsal, January 2009.

006 "Regulatory Risk under Optimal Incentive Regulation" by Roland Strausz, January 2009.

007 "Combination of multivariate volatility forecasts" by Alessandra Amendola and Giuseppe Storti, January 2009.

008 "Mortality modeling: Lee-Carter and the macroeconomy" by Katja Hanewald, January 2009.

009 "Stochastic Population Forecast for Germany and its Consequence for the German Pension System" by Wolfgang Härdle and Alena Mysickova, February 2009.

010 "A Microeconomic Explanation of the EPK Paradox" by Wolfgang Härdle, Volker Krätschmer and Rouslan Moro, February 2009.

011 "Defending Against Speculative Attacks" by Tijmen Daniëls, Henk Jager and Franc Klaassen, February 2009.

012 "On the Existence of the Moments of the Asymptotic Trace Statistic" by Deniz Dilan Karaman Örsal and Bernd Droge, February 2009.

013 "CDO Pricing with Copulae" by Barbara Choros, Wolfgang Härdle and Ostap Okhrin, March 2009.

014 "Properties of Hierarchical Archimedean Copulas" by Ostap Okhrin, Yarema Okhrin and Wolfgang Schmid, March 2009.

015 "Stochastic Mortality, Macroeconomic Risks, and Life Insurer Solvency" by Katja Hanewald, Thomas Post and Helmut Gründl, March 2009.

016 "Men, Women, and the Ballot Woman Suffrage in the United States" by Sebastian Braun and Michael Kvasnicka, March 2009.

017 "The Importance of Two-Sided Heterogeneity for the Cyclicality of Labour Market Dynamics" by Ronald Bachmann and Peggy David, March 2009.

018 "Transparency through Financial Claims with Fingerprints – A Free Market Mechanism for Preventing Mortgage Securitization Induced Financial Crises" by Helmut Gründl and Thomas Post, March 2009.

019 "A Joint Analysis of the KOSPI 200 Option and ODAX Option Markets Dynamics" by Ji Cao, Wolfgang Härdle and Julius Mungo, March 2009. 020 "Putting Up a Good Fight: The Galí-Monacelli Model versus ‘The Six Major

Puzzles in International Macroeconomics’", by Stefan Ried, April 2009. 021 "Spectral estimation of the fractional order of a Lévy process" by Denis

Belomestny, April 2009.

022 "Individual Welfare Gains from Deferred Life-Annuities under Stochastic Lee-Carter Mortality" by Thomas Post, April 2009.

SFB 649, Spandauer Straße 1, D-10178 Berlin http://sfb649.wiwi.hu-berlin.de

(20)

SFB 649 Discussion Paper Series 2009

For a complete list of Discussion Papers published by the SFB 649,

please visit http://sfb649.wiwi.hu-berlin.de.

SFB 649, Spandauer Straße 1, D-10178 Berlin http://sfb649.wiwi.hu-berlin.de

This research was supported by the Deutsche

023 "Pricing Bermudan options using regression: optimal rates of conver-gence for lower estimates" by Denis Belomestny, April 2009.