University of Warwick institutional repository: http://go.warwick.ac.uk/wrap
A Thesis Submitted for the Degree of PhD at the University of Warwick
http://go.warwick.ac.uk/wrap/35249
This thesis is made available online and is protected by original copyright.
Please scroll down to view the document itself.
AUTHOR:
Nastasiya GrinbergDEGREE:
Ph.D.TITLE:
Some problems in stochastic analysis: Itô’s formula for convex functions, in-teracting particle systems and Dyson’s Brownian motionDATE OF DEPOSIT:
. . . .I agree that this thesis shall be available in accordance with the regulations govern-ing the University of Warwick theses.
I agree that the summary of this thesis may be submitted for publication.
Iagreethat the thesis may be photocopied (single copies for study purposes only). Theses with no restriction on photocopying will also be made available to the British Library for microfilming. The British Library may supply copies to individuals or libraries. subject to a statement from them that the copy is supplied for non-publishing purposes. All copies supplied by the British Library will carry the following statement:
“Attention is drawn to the fact that the copyright of this thesis rests with its author. This copy of the thesis has been supplied on the condition that anyone who consults it is understood to recognise that its copyright rests with its author and that no quotation from the thesis and no information derived from it may be published without the author’s written consent.”
AUTHOR'S SIGNATURE:
. . . .USER’S DECLARATION
1. I undertake not to quote or make use of any information from this thesis without making acknowledgement to the author.
2. I further undertake to allow no-one else to use this thesis while it is in my care.
DATE SIGNATURE ADDRESS
. . . .
. . . . . . . .
. . . .
M A
E
G NS
I T A T MOLEM
U N
IV ER
SITAS WARWICEN
SIS
Some problems in stochastic analysis: Itô’s formula for
convex functions, interacting particle systems and
Dyson’s Brownian motion
by
Nastasiya Grinberg
Thesis
Submitted to the University of Warwick
for the degree of
Doctor of Philosophy
Department of Statistics
Contents
Acknowledgments iv
Declarations v
Abstract vi
Chapter 1 Itô’s formula for convex functions of continuous semimartingales:
Brownian perturbation approach. 1
1.1 Classical Itô’s formula and its extensions . . . 2
1.2 Convergence of semimartingales and convex functions . . . 5
1.3 Elements of convex analysis . . . 7
1.3.1 Convex functions: some notation and results . . . 7
1.3.2 Differential theory of convex functions . . . 9
1.4 Piecewise linear convex functions and Meyer-Tanaka formula . . . 12
1.5 Semimartingale decomposition of f(Xet) . . . 15
1.6 Proof of Theorem 1.2 . . . 22
Chapter 2 Elements of representation theory. 26 2.1 Basics . . . 26
2.2 Representations ofsl(2)andU(sl(2)). . . 33
2.3 Quantum enveloping algebraUq(sl(2)). . . 36
2.4 Some combinatorics . . . 38
Chapter 3 Pitman’s theorem and radial part of the 3-d Brownian Motion I:
some discrete Markov chains 50
3.1 Two classical constructions . . . 51
3.1.1 Random matrix theory . . . 55
3.2 Markov functions of Markov processes . . . 59
3.2.1 Dynkin criterion and the Chapman-Kolmogorov equation . . . 59
3.2.2 Intertwining . . . 61
3.3 Discrete Pitman’s theorem via randomly growing Young tableaux (q=0) . 64 3.4 2-dimensional Markov chain from quantum probability theory (q=1) . . 71
3.4.1 Elements of quantum probability . . . 71
3.4.2 Quantum Bernoulli random variables and quantum Bernoulli ran-dom walks . . . 75
3.4.3 The Spin process . . . 80
3.4.4 2-dimensional Markov chain associated to QBRW . . . 81
3.5 q-generalisation . . . 85
3.6 Analysing the Markov chains . . . 87
3.6.1 Marginal processes . . . 88
3.6.2 Intertwining . . . 91
3.6.3 Counterexample . . . 94
3.7 Diffusion approximation . . . 99
Chapter 4 Pitman’s theorem and radial part of the 3-d Brownian Motion II: bi-variate diffusions in a wedge 111 4.1 A martingale problem: starting away from the origin . . . 113
4.1.1 Symmetric case,µ=0 . . . 115
4.1.2 Drifting case,µ >0 . . . 122
4.1.3 Martingale problem with any initial distribution . . . 124
4.2 Forward equation . . . 125
4.3 Intertwining . . . 131
4.4 A martingale problem: starting at the origin . . . 133
Chapter 5 Process in the Gelfand-Cetlin cone 143
5.1 Dyson’s Brownian motions and the eigenvalues of random matrices . . . . 144
5.1.1 H-transformed Brownian motions and non-colliding Brownian mo-tions. . . 144
5.1.2 Dyson’s Brownian motions on the spectra of Hermitian Brownian motion . . . 146
5.1.3 A Markov function of the GUE minor process . . . 150
5.2 Multidimensional Pitman’s theorem and non-colliding processes . . . 159
5.2.1 A representation for conditioned random walks . . . 160
5.3 A Markov chain in the Gelfand-Cetlin cone . . . 164
Acknowledgments
First of all I would like to thank my supervisors Prof. Wilfrid Kendall and Dr. Jon Warren, both of whom have been infinitely patient with me all this time, and whose invaluable help allowed me to finish this exciting but tough journey. I will especially hold dear the pre-submission e-mail frenzy that I have subjected Jon to, in what seems like, in vain.
I am also grateful to all of the friends I have made in the Statistics Department and especially to my housemates, Peter and Martin, all of whom supported and encour-aged me when I most needed it.
Needless to say that I would not have been where I am now if not for my parents and my sister and their unconditional love and support. Doing most things in life would have been pointless if not for my family.
Declarations
Abstract
This thesis consists of two main parts: Chapter 1 is concerned with studying an ex-tension of the Itô lemma to the convex functions. We prove that the local martingale part of the decomposition of a convex function f of a continuous semimartingale can be expressed in a similar way to the classical formula with the gradient of f replaced with itssubgradient. The result itself is not new, however, our approach via Brownian perturbation is.
The second, and the largest, part of the thesis focusses on the study of a certain family of bivariate diffusionsZ(θ,µ)= (X,R)in a wedgeW ={(x,r)∈R×R+:|x| ≤r},
Chapter 1
Itô’s formula for convex functions
of continuous semimartingales:
Brownian perturbation approach.
The first chapter of this Thesis concentrates on the study of convex functions in semi-martingale theory; it is self-contained and covers a topic different from the rest of the Thesis.
We consider an extension of the celebrated Itô’s formula for the continuous semimartingales to the class of convex functions. In particular we prove that the local martingale part of a convex function f of a d-dimensional semimartingaleX =M+A can be written in terms of an Itô stochastic integralRH(X)d M, where H(x) is some particular measurable choice of subgradient∇f(x)of f at x, andM is the martingale part of X. This result was first proved by Bouleau in[15]. Here we present a new treatment of the problem. We first prove the result forXe=X +εB,ε >0, whereB is a
1.1
Classical Itô’s formula and its extensions
LetX be a continuousRd-valued semimartingale with decompositionXt =X0+Mt+At,
where M is a local martingale and Ais a process of finite variation, and let f be a twice continuously differentiable function. Then Itô’s lemma states that f(X)is also a continuous semimartingale and that, moreover,
f(Xt) = f(X0) +
X
i
Z t
0
∂f
∂xi
(Xs)d Xs+1 2
X
i,j
Z t
0
∂2f
∂xi∂xj
(Xs)d〈Xs,Xs〉.
In particular, the martingale part of f(X)is given by
Nt:=X
i
Z t
0
∂f
∂xi(Xs)d Ms=
Z t
0
∇f(Xs)d Ms ,
where ∇f(x) is the gradient of f at x (see section 1.3.1 for definition and more de-tails).
One of the most well-known, and also one of the simplest, extensions of the Itô’s formula is theTanaka’s formula(see, for example,[64, p. 169]), which states that for a standard Brownian motionBwe have
|Bt|=|B0|+
Z t
0
sgn(Bs)d Bs+L0t ,
where sgn(x) =−1 for x ≤0 and sgn(x) =1 for x >0, and Lt is the so calledlocal timeofB at the origin, given by
L0t =lim ε↓0
1
2ε|{s≤t:Bs∈(−ε,+ε)}|.
Note that Tanaka’s formula is also the easiest extension of Itô’s result to convex functions. Meyer proves a more general result
Theorem 1.1. (Meyer[57, p. 361]) Let X be anRd-valued semimartingale and let f be
Moreover, for the cased=1 he derives the change of variables formula
f(Xt) = f(X0) +
Z t
0
∇−f(Xs)d Xs+
Z
R
Latµ(d a),
where∇−f is the left-hand side derivative of f, andµ= 12∂∂2xf2 is viewed as a measure. The increasing processLat is thelocal time process of X given by
Lat(X) =lim ε↓0
1
ε Z t
0
1[a,a+ε)(Xs)d〈Xs,Xs〉,
and which can be viewed as the time X spends at a. We refer to the above formula as theMeyer-Itô formula. The so-calledMeyer-Tanaka formulais its special case: for a continuous semimartingaleX and a function f(x) =|x|we have
|Xt|=|X0|+
Z t
0
sgn(Xs)d Xs+L0t .
WhenX is standard Brownian motion, we recover Tanaka’s formula.
Now consider a general convex functionf :Rd→R, not necessarily everywhere
differentiable. Every differentiable point x ∈Rd has a unique tangential hyperplane,
while at non-differentiable points there is a whole set of supporting hyperplanes. For a continuous semimartingaleX with decompositionX =M+Awe prove that the (local) martingale part of f(X)can be expressed in terms of a stochastic integral of a measur-able selection of subgradient∇f(X)againstM. For piecewise linear 1-dimensional con-vex functions this follows from the Meyer-Tanaka formula. For example, for f(x) =|x| we have∇f(x) =sgn(x). So at the origin, which is the only point where derivative is not defined, we can take the supporting line to be y =−x. Moreover, because as we know Brownian motion spends zero time in Lebesgue-null sets, we can in fact choose
For a continuous semimartingaleX with decompositionX =M+Awe define
kX kHp=k 〈M,M〉1/2
∞ +
Z ∞
0
|dAs| kLp
for 1≤p<∞. TheHp-space consists of all semimartingalesX such thatkX kHp<∞.
The main result of this chapter is the following
Theorem 1.2. Let f : Rd → R be convex and let X be a continuous Rd-valued
semi-martingale defined on filtered probability space(Ω,F,{F }t≥0,P)with Meyer decomposi-tion Xt=X0+Mt+At. Then f(Xt)is again a continuous semimartingale; in particular, its local martingale part is given by
Z t
0
∇f(Xs)d Ms, locally inH1 ,
where∇f(x)is some choice of subgradient of f at x, such that∇f(Xt)isFt-adapted.
The above theorem was first stated and proved by Bouleau ([15, Lemma 2]). In the follow-up paper[16] he proves the conjecture stated in [15] that in fact any measurable choiceof∇f(x)can be used. In this chapter we prove the first of the two results using an approach different to that in[15].
There are many other papers on extending the Itô’s formula by considering dif-ferent classes of functions f or stochastic processes, or both. In [72], for example, Russo and Vallois derive Itô’s formula forC1(Rd)-functions of continuous
semimartin-gales whose time-reversals are also continuous semimartinsemimartin-gales. They also extend the formula to the case of C1(Rd)-functions with first order derivatives being
Hölder-continuous with any parameter and the process given by a stochastic flow generated by a so-called C0(Rd,Rd)-semimartingale. In both cases the quadratic variation
pro-cess is expressed in terms of the generalised quadratic covariation propro-cess〈f0(X),X〉t
with a locally square integrable derivative andX a 1-dimensional Brownian motion, for which a version of Itô’s formula is derived with the finite variation part expressed again in terms of the quadratic covariation〈f0(B),B〉t. The multidimensional case (where f
belongs to the Sobolev spaceW1,2) is treated in[32]. In[51]Kendall discusses
semi-martingale decomposition ofr(B), whereris the distance function of Brownian motion on a manifold. The problem tackled in[51]is similar to ours as r fails to be differen-tiable on a set of measure zero, called the cut-locus. It is proved in[51]that r(B)is a semimartingale and its canonical decomposition is found explicitly in the sequel[23].
In the proof of our main Theorem 1.2 we need the results of Carlen and Protter
[19]and Barlow and Protter[4], which we discuss in the next section.
1.2
Convergence of semimartingales and convex functions
Paper of Barlow and Protter concerns convergence of semimartingales together with their semimartingale decomposition. We will present their results in the continuous setting, even though the original paper considered a wider class of semimartingales with jumps.
Suppose{Xn}n≥1 is a sequence of continuous semimartingales with the decom-positionXn = X0n+Mn+An, such that limn→∞E[(Xn−X)∗] = 0. Here and in what
follows we denoteX∗=supt|Xt|andX∗t =sups≤t|Xs|. Barlow and Protter prove that
under some regularity conditions imposed onMnandAnnot only that the limiting pro-cessX is again a continuous semimartingale, but that there is also convergence of the corresponding martingale and finite variation process parts of the decompositions.
Theorem 1.3. ([4, Thm. 1]) Let{Xn}n≥1 be a sequence of continuous semimartingales
inH1with canonical decomposition Xn=X0n+Mn+An, satisfying for some constant K
E
hZ ∞
0
|dAns|i≤K, (1.1a)
and let X be a continuous process such that
lim
n→∞E[(X
n
−X)∗] =0 . (1.2)
Then X is a continuous semimartingale in H1 and, moreover, if X = X0+M +A is its decomposition, then
E[M∗]≤K, E[
Z ∞
0
|dAs|]≤K
and
lim
n→∞kM
n
−MkH1=0 , (1.3a)
lim
n→∞E[(A
n
−A)∗] =0 . (1.3b)
In[19]Carlen and Protter prove that the assumptions of[4, Thm. 1]are sat-isfied in case when the sequence ofC2 convex functions{fn}n≥1 of (a not necessarily continuous) semimartingaleX=M+Aconverges to a convex f, thus making the result important in our situation.
We equip the set of convex functions onRd with uniform convergence on
com-pact sets with the corresponding metric ρ, defined by ρ(f,g) = P∞k=12−kρk(f,g), where
ρk(f,g) =
sup|x|≤k|f(x)−g(x)|
1+sup|x|≤k|f(x)−g(x)| .
Theorem 1.4. ([19, Thm. 1]) Let X be a continuous Rd-valued semimartingale in H1
with X0=0and the decomposition X=M+A. For eachα >0, define
Tα=inf{t>0 :|Xt|> α}.
Let{fn}n≥1 be a sequence of C2 convex functions on Rd and let f be a convex
and, moreover, if its decomposition is given by f(Xt) = f(X0) +Nt+St, then for allα >0
lim
n→∞k(N
n
−N)Tαk
H1=0 , lim
n→∞E[(S
n
−S)Tα] =0 ,
where
Ntn=
Z t
0
∇fn(Xs)d Ms
and
Snt =
Z t
0
∇fn(Xs)dAs+
1 2
X
i,j
Z t
0
∂2
∂xi∂xjf(Xs)d〈X
i s,X
j s〉
are the martingale and finite variation parts of the decomposition of fn(X)respectively.
1.3
Elements of convex analysis
In order to prove the main result of this chapter, we require some notation and re-sults from convex analysis. Proofs of the rere-sults stated in this section and more details on convex functions are given in[66]. See also[36]. We start by introducing some elementary notation.
1.3.1
Convex functions: some notation and results
Letf be any function living onRdand taking values in[−∞,+∞]. At any pointx ∈Rd
we define theone-directional derivative of f with respect to a vector y ∈Rd, if it exists,
as follows
D f(x)[y]:=lim λ↓0
f(x+λy)−f(x)
λ .
The two sided derivative atx in direction y exists if and only if−D f(x)[−y], given by,
−D f(x)[−y]:=lim λ↑0
f(x+λy)− f(x)
λ
is also well-defined and
Now, if the function f is convex, then the one-directional derivative always exists and, moreover, we may write
D f(x)[y] = inf λ>0
f(x+λy)−f(x)
λ . (1.5)
FurthermoreD f(x)[y]is positively homogeneous (i.e. D f(x)[λy] =λD f(x)[y]for
λ∈(0,∞)), convex in y withD f(x)[0] =0[66, Thm. 23.1]and
D f(x)[y]≥ −D f(x)[−y]. (1.6)
If for a convex f defined onRd and finite at some x∈Rd all directional
deriva-tives atx exist, are two-sided and finite then we have ([66, Thm. 25.2])
D f(x)[y] =〈∇f(x),y〉, ∀y∈Rd ,
where
∇f(x):=
∂f
∂x1(x), ...,
∂f
∂xd(x)
is thegradient of f at x = (x1, ...,xd). Note that ∂ f
∂xi(x) =D f(x)[ei], ei being thei
th
canonical basis vector ofRd.
Of course a general convex function f is not necessarily everywhere differen-tiable, a simple example being f(x) =|x|which is not differentiable at x =0. We can however define a set ofsubgradientsat each “troublesome” point like this.
Definition 1.5. Let f :Rd→Rbe a convex function. A subgradient∇f(x)of f at x∈Rd
is a gradient of an affine hyperplane h(x) =α+βTx, forα,β∈Rd, passing through the
point(x,f(x))and satisfying
h(x0)≤ f(x0)
for all x06=x. We denote any subgradient at x in the direction of y ∈Rnby〈∇f(x),y〉.
set of all subgradients atx is called thesubdifferentialoff atx, denoted∂f(x). A con-vex function with finite values issubdifferentiableeverywhere. In subsequent sections we will need the following result
Theorem 1.6. ([66, Thm. 23.2]) Let f be a convex function and x a point at which f is
finite. Then∇f(x)is a subgradient of f at x if and only if
D f(x)[y]≥ 〈∇f(x),y〉 ∀y ∈Rd\{0}. (1.7)
The above says that a subgradient at x in the direction of y will always be less or equal to the one-sided directional derivative at x with respect to y. Relation (1.7) is called the subgradient inequality and can be used as an alternative definition of a subgradient.
Finally we mention the Lipschitz continuity property of convex functions.
Theorem 1.7. ([36, Ch. 3.1, Thm. 10]) Let f be a convex function onRd and let U be
an open convex subset ofRd. Then f is continuous on U if and only if f is locally Lipschitz
on U, i.e. for all u∈U there exist constants K>0andε >0such that
|f(x)−f(y)| ≤Kkx−yk, ∀x,y∈Bu(ε),
where Bu(ε)is an open ball of radiusεcentered at u andk · kis the usual Euclidean norm.
1.3.2
Differential theory of convex functions
This section is devoted to studying the set D of points in the domain of f at which the supporting hyperplane is unique. In what follows we assume that f isproper, i.e. f(x) < +∞ for at least one x and f(x) > −∞ for all x. By domf we denote the effective domainof f, that is domf ={x ∈Rd : f(x)<∞}. We denote by int(domf)
the interior of domf.
Theorem 1.8. ([66, Thm. 25.2], [66, Thm. 25.4]) Let f be a proper convex function
D f(x)[·]is a linear function onRd iff x∈ D. Moreover,Dis dense in int(dom f), and its
complement in int(dom f)is a set of measure zero.
In fact, in order for D f(x)[·]to be linear, and hence for f to be differentiable at x, it suffices that the partial derivatives with respect to the basis vectors ofRd exist
atx.
So, we see that setDc of points at which a convex function f does not have a
unique supporting hyperplane has measure zero. Consequently any process which has a probability density at each time t spends time of measure zero inDc, an important
fact we will use in the sequel.
To prove Theorem 1.2 for a general (continuous and proper but not necessarily differentiable) convex f we will approximate it by a sequence of twice continuously differentiable convex functions fn :Rd → R, to which we know Itô’s formula can be
applied. On top of this, working with convex functions gives us an advantage of being able to deduce from the pointwise convergence of the functions something about the convergence of their corresponding gradients.
Theorem 1.9. (variation of[66, Thm. 25.7]) Let f be a convex function defined onRd
and {fn} a sequence of smooth convex functions onRd such that limn→∞fn(x) = f(x)
∀x ∈Rd. LetD ⊆int(dom f)be the set of points where f is differentiable. Then
lim
n→∞∇fn(x) =∇f(x) ∀x ∈ D. (1.8)
Proof. See proof of[66, Thm. 25.7].
This result will be used several times in Sections 5 and 6.
We conclude this section with a result concerning convergence of subgradients. Consider a sequence {xn}n≤1 with xn ∈ int(domf), n≥ 1, and x ∈ int(domf) such that limn→∞xn = x. Of course in general limn→∞∇f(xn) need not exist. However
the situation when xn = x+εny for some y ∈Rd and εn →0 as n→ ∞, i.e. when
xn approaches x from a single direction y, is special. In this case it is known that
Proposition 1.10. Let f :Rd →Rbe a convex function. For any x ∈Rd, for almost all
y∈Sd−1, where Sd−1is the unit sphere inRd,
lim
ε↓0∇f(x+εy)
exists, belongs to ∂f(x) and is unique for any selection∇f(x +εy) ∈∂f(x +εy) we may make from the subdifferential of f at x+εy for anyε >0.
Proof. First of all recall that D f(x)[y] = limε↓0(f(x+εy)− f(x))/ε is a positively homogeneous function, convex in y withD f(x)[0] =0. Letg(y):=D f(x)[y]. Hence
∇g(λy) exists and is unique for allλ > 0 for almost all y ∈Rd. Fix x,y ∈Rd and
without loss of generality, by adding a suitable affine function to f, assume that
f(x) =g(y) =∇g(y) =0 .
We argue by contradiction. If theorem fails then we can find a subsequence
εn→0 and a selection∇f(x+εny)∈∂f(x+εny)such that
lim
n→∞∇f(x+εny) =h6=0 , (1.9)
and also a vectoru∈Rd with〈h,u〉>0. For suchuconsider
f(x+εny+εnλu)− f(x+εny)
εn
=λf(x+εny+εnλu)−f(x+εny)
εnλ
.
Using (1.5) and homogeneity of g(y)the above is greater or equal to
λ εn
D f(x+εny)[εnu] =λD f(x+εny)[u]≥λ〈∇f(x+εny),u〉=λ〈h,u〉+o(1)
where the last two inequality signs come from expressions (1.7) and (1.9) respectively, ando(1)→0 asn→ ∞. Thus we obtain
f(x+εny+εnλu)−f(x+εny)
εn
On the other hand, since f(x) =g(y) =0, we have
f(x+εny)−f(x)
εn
= f(x+εny)
εn
=o(1). (1.11)
Hence combining (1.10) and (1.11) one obtains
f(x+εny+εnλu)−f(x)
εn
= f(x+εny+εnλu)−f(x+εny)
εn
+ f(x+εny)−f(x)
εn
≥λ〈h,u〉+o(1).
Lettingn→ ∞, i.e.εn→0, the above inequality becomes
D f(x)[y+λu] =g(y+λu)≥λ〈h,u〉>0
⇒ g(y+λu)
λ =
g(y+λu)−g(y)
λ ≥ 〈h,u〉>0 .
And so lettingλ→0 one obtains
〈∇g(y),u〉 ≥ 〈h,u〉>0 .
But this contradicts the assumption that∇g(y) =0.
1.4
Piecewise linear convex functions and Meyer-Tanaka
for-mula
In this section we start our analysis of the martingale part of f(X). However, instead of treating the case of a general convex f we first prove Theorem 1.2 in a special case when f is piecewise linear. Using the Meyer-Tanaka formula, we will verify that any piecewise linear convex function of a continuous semimartingale is itself a continuous semimartingale and find the martingale part of the decomposition explicitly.
for a detailed discussion of classical Tanaka and Itô-Tanaka formulas (ford =1). One might also find a discussion of convex functions in[65, Appendix.§3]useful.
Proposition 1.11. Let X= (X1, ...,Xd)be a continuous semimartingale living onRd, with
ithcomponent having decomposition Xti=X0i+Mti+Ait, i∈ {1, ...,d}. Let f :Rd→Rbe a
function defined by f(x) =l1(x)∨...∨lk(x), x∈Rd, where li(x) =αi+
Pd
j=1βi jxj=αi+
βT
i x, forαi,βi∈Rd, i∈ {1, ...,k}, and x∨y :=sup{x,y}. Then f(X)is a semimartingale
with decomposition
f(Xt) = f(X0) +
k
X
i=1
Z t
0
1B i(Xs)β
T i d Xs+
1
2Lt , (1.12)
where Bi = {x : min{k : supj{lj(x)} = lk(x)} = i} and Lt is an increasing process, constant on the complement of{t:li(Xt) =lj(Xt)for any i6= j}. In particular the local martingale part of f(X)is given by
k
X
i=1
Z t
0
1B i(Xs)β
T
i d Ms . (1.13)
Proof. We prove the proposition for the case when k =2 and any d ≥1 and general case follows by induction. Consider f(x) = l1(x)∨l2(x). Denote l1(Xt) = Yt and
l2(Xt) =Zt. Since Xt is a continuous semimartingale so are affine functionals, Yt and Zt, ofXt. Let the corresponding decompositions beY =M+AandZ=N+S. Consider
f(x) =l1(x)∨l2(x) = y∨z. We can rewrite y∨zas follows
y∨z= 1
2 |y−z|+y+z
.
Hence, using the differential notation for simplicity, we obtain
d(Yt∨Zt) =
1
2d |Yt−Zt|+Yt+Zt
= 1
2 d(|Wt|) +d Yt+d Zt
,
above becomes
1 2
sgn(Wt)dWt+d L0t +d Yt+d Zt
,
where L0t is the local time ofW at 0. Next
1 2
sgn(Wt)d(Mt−Nt) +sgn(Wt)d(At−St) +d(Mt+At) +d(Nt+St) +d L0t=
= 1
2
sgn(Wt) +1
d Mt− sgn(Wt)−1
d Nt+
+ sgn(Wt) +1
dAt− sgn(Wt)−1
dSt+d L0t
.
Now sgn(Wt) =sgn(Yt−Zt) =1[Yt>Zt]−1[Yt≤Zt]and so sgn(Wt) +1=21[Yt>Zt] and sgn(Wt)−1=−21[Yt≤Zt]. Hence we obtain
d Yt∨Zt
=1[Yt>Zt]d Mt+1[Yt≤Zt]d Nt+1[Yt>Zt]dAt+1[Yt≤Zt]dSt+ 1 2d Lt=
=1[Y
t>Zt]d Yt+1[Yt≤Zt]d Zt+
1 2d Lt
or
Yt∨Zt=Y0∨Z0+
Z t
0
1[Y
s>Zs]d Ys+ Z t
0
1[Y
s≤Zs]d Zs+
1 2d Lt,
whereLtis a continuous increasing process, constant on the complement of{t:l1(Xt) =
l2(Xt)}. The above expression is exactly (1.12) for n=2. Noticing that x∨ y∨z =
(x∨y)∨z, the general case follows by induction.
Clearly the integrand in (1.13) is a measurable selection of the multivalued map∂f(x) and so Theorem 1.2 holds in the special case of convex piecewise linear functions. To illustrate this result we consider our simple example again: forf(x) =|x|
1.5
Semimartingale decomposition of
f
(
X
e
t)
We are now ready to start the analysis of the general case of a convex f defined over the whole of Euclidean space Rd. Let X be a continuous semimartingale in
Rd with decomposition X = M +A and defined on some filtered probability space
(Ω,{Ft}t≥0,F,P). Let(Ωe,{Fet}t≥0,Fe,Pe)be some enlargement of this space and letB
be an(Fet)-standard Brownian motion independent ofX. Define theperturbed process
e
X on(Ωe,{Fet}t≥0,Fe,Pe)by
e
X(tε):=Xet:=Xt+εBt, ε >0, t≥0 .
For simplicity of notation we shall suppress the superscript(ε) wherever possible. For simplicity also but without loss of generality we can assume thatX0=Xe0=0.
In this section we find the martingale part of f(Xe(ε))explicitly in order to take
the limit asε→0 in the next section and hence prove Theorem 1.2. The reasoning be-hind adding a small amount of Brownian motion toXtis as follows: we know very little about the behaviour ofXtas it is a general semimartingale. For instance, it can at some
times be trivial, i.e. constant. Hence it might spend positive amount of time in those points where f is not differentiable, that is, where it has more than one supporting hyperplane. To avoid this happening we perturbXt by addingεBt. Then
Lemma 1.12. Xet has a probability density at each t>0and, in particular, spends zero
time in any null set.
Proof. It suffices to prove that eP(Xet ∈ N) = 0 for any t > 0 and N ⊂ Rd with
Le b(N) =0. Then it will follow that for allt>0 the law ofXetunderePis absolutely
con-tinuous with respect to the Lebesgue measure, and the corresponding Radon-Nikodym derivative is the probability density ofXet. For any Lebesgue-null setN we have
e
P(Xet∈N) =E
e
P(Xet∈N|Ft)
,
whereFt=σ({Xs; 0≤s≤t}), and we use the tower property of conditional
ofXt, and hence ofFt, to obtain
EeP(Xt+εBt∈N|Ft)
=
Z
e
P(x+εBt∈N)dµt(x),
whereµtis the law ofXt. Observe thatBˆt :=x+εBtis a Brownian motion started at x with〈Bˆt,Bˆt〉=ε2t. But we know that Brownian motion hits null-sets with probability zero. Hence, the above integral is equal to zero and the lemma is proved.
In Section 1.3.2 we have seen thatDc, the set of points at whichf is not
differen-tiable, is Lebesgue-null. Consequently, by the above lemma,Xespends zero time at those
“ambiguous” points. Hence,∇f(Xe)is almost surely everywhere defined. Moreover,
be-cause of Lemma 1.12 a particular measurable choice of∇f(x)∈∂f(x) at x ∈ Dc is unimportant as it does not change the value of the stochastic integralRt
0∇f(Xes)dMes, which we will show is the martingale part of f(Xe). To do that we approximate f by a
sequence of convextwice continuously differentiablefunctions.
Let {fn}n≥1 be a sequence of twice continuously differentiable convex func-tions on Rd converging to f with respect to ρ, i.e. such that limn→∞ρ(fn,f) =
0, where metric ρ is as defined in Section 1.2. We need to prove that stochastic integral R0t∇fn(Xes)dMes, the martingale part of fn(Xe), converges in some sense to Rt
0∇f(Xes)dMes for some measurable choice of ∇f(x) ∈∂f(x), and that it is indeed the martingale part of f(Xe).
We need to note the following two inequalities
Lemma 1.13. (Adapted from[19, Lemma, p. 2])
sup
n
sup |x|≤r
|∇fn(x)| ≤Cr<∞, ∀r>0 , (1.14)
and
sup |x|≤r
|∇f(x)| ≤Cr<∞, ∀r>0 , (1.15)
∂f(x).
Proof. To see why inequality (1.14) is true, first notice that, since limn→∞ρ(fn,f) =0, the variation of the convex functions fnis uniformly bounded in non{|x| ≤r+1}for
anyr >0. Denote this bound byCr. Letxnbe such that
∇fn(xn) = sup |x|≤r
|∇fn(x)|
and letun:=∇fn(xn)/|∇fn(xn)|. Then
|∇fn(xn)|=〈∇fn(xn),
∇fn(xn)
|∇fn(xn)|〉=〈∇fn(x),un〉=D fn(x)[un]
= inf λ>0
fn(xn+λun)−fn(xn)
λ ≤ fn(xn+un)−fn(xn). (1.16)
But, since|xn+un| ≤r+1, the above is less thanCr for allnand (1.14) follows. Now, since fn converges to f uniformly on compact sets, we also have fn → f pointwise. Therefore, for anyx,ywith|x|,|y|<r+1 the inequality fn(x)−fn(y)≤Cr,
∀n≥1, (which follows sinceCr bounds the variation of fn’s) implies f(x)−f(y)≤Cr by virtue of taking the limitn→ ∞. So, by a calculation similar to (1.16), we have for any∇f(x)∈∂f(x)
|∇f(x∗)|=〈∇f(x∗),u∗〉 ≤D f(x∗)[u∗]≤ f(x∗+u∗)− f(x∗)≤Cr ,
where x∗ is such that ∇f(x∗) =sup|x|≤r|∇f(x)| andu∗:=∇f(x∗)/|∇f(x∗)|for any r>0.
We are now ready to prove the following
Lemma 1.14. The local martingale part of f(Xet)is given by the limit
lim
n→∞
Z t
0
∇fn(Xes)dMes= Z t
0
∇f(Xes)dMes (1.17)
Proof. Since for each n ≥ 1 fn is in C2, the martingale part of f
n(Xe) is given by R
∇fn(Xe)dM, wheree Me = M +εB. Theorem 1.4, applied to our sequence {fn}n≥1
and the semimartingaleXe, then ensures that the martingale part of the limiting process
f(Xet)is given by the limit of R
∇fn(Xe)dMe asntends to infinity, locally inH1. Our aim
is to prove that this limit is indeed equal toR∇f(Xe)dMe for some measurable choice of
subgradient∇f ∈∂f.
Let B(r) be an open ball of radius r and B(r0) an open ball of radius r0 with r0 > r > 0, both centred at the origin. For all r,r0 > 0 define stopping times Tr :=
inf{t:Xt∈/B(r)}andTer0:=inf{t:Xet∈/B(r0)}and takeTe=Tr∧Ter0. Assume also that
e
Xt∧Te,Xt∧Te ∈ H1 for all t ≥0; we know that continuous semimartingales are at least locally inH1. We consider the stopped processXet∧
e
T. Note thatXt∧Te ∈B(r)⊂B(r
0) andXet∧
e
T ∈B(r0) for all t ≥0. By Lemma 1.12 the law of Xet∧
e
T has a density for all
t<Te.
Note that for proving Lemma 1.14 it would have sufficed to stopXeatTer0.
How-ever, in order to be consistent with localisation we will be using to prove Theorem 1.2, we useTe=Tr∧Ter0 instead.
Notice that convergence of a continuous (local) martingale M inHp is
equiv-alent to convergence of〈M,M〉1/2 in Lp. So, in this case convergence inHp implies convergence inHlfor 1≤l<p. In our case it is easier to prove convergence (1.17) in
H2 and then deduce convergence inH1. For any measurable selection∇f ∈∂f and t>0 we have
lim
n→∞
Z t∧Te
0
∇fn(Xes)− ∇f(Xes)
dMes
H2
= lim
n→∞E
h Z t∧eT
0
∇fn(Xes)− ∇f(Xes) 2
d〈Me,Me〉s i1/2
.
expectation sign above as follows
Z t∧Te
0
∇fn(Xes)− ∇f(Xes) 2
d〈Me,Me〉s≤4Cr20 Z t∧Te
0
d〈M,e Me〉s
≤4C2r0〈Me,Me〉t∧
e
T <∞,
where the quadratic variation〈M,e Me〉t∧
e
T is finite, because it is the bracket of a bounded
continuous semimartingaleXet∧
e
T (see[65, Ch. IV, Thm. 1.3]). Using dominated
con-vergence theorem we can now take the limit inside the expectation sign and, since the integrand is bounded above by 4Cr20, we can also pull the limit inside the integral sign. We can then use almost sure convergence of ∇fn(Xet) to ∇f(Xet) for all Xet ∈ D and
the fact that particular choices∇f(Xet) ∈∂ f(Xet)for Xet ∈ Dc are not charged by the
integral to conclude that the limit in question is equal to
E
hZ t∧Te
0
lim
n→∞
∇fn(Xes)− ∇f(Xes) 2
d〈M,e Me〉s i1/2
=0 .
It follows that Rt∧Te
0 ∇fn(Xes)dMes converges to
Rt∧Te
0 ∇f(Xes)dMes in H 2, and
hence in H1. This is true for any radiuses r0 > r > 0 of localisation, and so (1.17) follows.
We also prove the following lemma concerning the semimartingale decomposi-tion of f(Xe)that we will require for the proof Theorem 1.2.
Lemma 1.15. Let Net(ε) = Rt
0∇f(Xes(ε))dMes(ε) and Se(ε) be the martingale and the finite
variation parts of the decomposition of f(Xe(ε))respectively. Then for allε≤1
E
h
sup
t≤Te
|Ne
(ε) t |
i
≤Kr,r0 , (1.18a)
E
h Z Te
0
|dSe(sε)| i
≤Kr,r0, (1.18b)
Proof. The proof largely follows proof of Theorem 1.4 in[19]: we prove that the se-quence of continuous semimartingales{fn(Xe)}n≥1 satisfies the conditions of Theorem
1.3 of Barlow and Protter, i.e.
lim
n→∞E
h
sup
t≤eT
|fn(Xet)−f(Xet)| i
=0 (1.19a)
sup
n≥1E
h
sup
t≤eT
|Netn| i
≤Kr,r0 , (1.19b)
sup
n≥1E
hZ eT
0
|deSsn| i
≤Kr,r0 , (1.19c)
where for eachn≥1Nen andSen are the martingale and the finite variation part of the
decomposition of fn(Xe)respectively. Then (1.18a) and (1.18b) will follow by Theorem
1.3. The difference is only in the fact that we need to ensure that for small enoughε the constantKr,r0 above can be taken to be independent ofε.
First of all notice that (1.19a) follows from the fact that limn→∞ρ(fn,f) =0.
Next consider (1.19b). By the Burkholder-Davis-Gundy inequality we have for some constantp<∞
E
h
sup
t≤eT
|Netn| i
≤pE
h
〈Nen,Nen〉
1/2
e
T
i
=pE
hZ Te
0
|∇fn(Xet)|
2d
〈Me,Me〉t 1/2i
≤pCr0E
h 〈Me,Me〉
1/2
e
T
i
=pCr0E
h
(〈M,M〉 e
T +ε
2
e
T)1/2i.
To finish we need to bound〈Me,Me〉
e
T by some constant independent of ε. We
have〈M,e Me〉
e
T =〈M,M〉eT+ε 2
e
T which for allε≤1 is less than〈M,M〉Te+Te which is in turn bounded above by〈M,M〉T
r+Tr, sinceTr≥Te=Tr∧Ter0. Hence, eventually for
allε
E
h
sup
t≤Te
|Netn| i
≤pCr0E
h
(〈M,M〉T
r +Tr)
1/2i,
We now turn to (1.19c). The finite variation part of fn(Xet∧
e
T)is given by
e
Sn
t∧eT =
Z t∧Te
0
∇fn(Xes)dAs+
1 2
Z t∧Te
0
X
i,j
∂2f
n
∂xi∂xj(Xes)d〈Xes,Xes〉. (1.20)
WritingVars≤t(Ys)forRt
0|d Ys|for a continuous processYt, we have
Z eT
0
|deSsn|=Vart≤
e
T(eSnt)
≤Vart≤ e
T
Z t
0
∇fn(Xes)dAs
+Vart≤ e T 1 2 Z t 0 X
i,j
∂2f
n
∂xi∂xj(Xes)d〈Xes,Xes〉
.
Now
Vart≤ e
T
Z t
0
∇fn(Xes)dAs
≤Cr0Vart≤
e
T
Z t
0
dAs≤Cr0
Z t∧Tr
0
|dAs|, (1.21)
where we have usedTr≥Tr∧Ter0=Teand the fact thatVartis an increasing process of
t. Hence, the bound on the expectation of the RHS above depends onr andr0but not onε.
Next we observe that, since fn is convex and the bracket〈Xes,Xes〉is an
increas-ing process of t, the second summand of (1.20), which we denote by Ftn, is also an increasing function oft. Thus,
Vart≤eT
1
2
Z t
0
X
i,j
∂2f
n
∂xi∂xj(Xes)d〈Xes,Xes〉
≤Fn
e
T ≤F n Tr .
But, because fnis twice continuously differentiable, using Itô’s formula we can
also write
E[FTn
r] =E[fn(XeTr)−fn(Xe0)]−E hZ Tr
0
∇fn(Xes)dAs i
.
corre-sponding Lipschitz constant) we have
E[FTnr]≤Kr0E[|XeTr|] +Cr0E[|ATr|]
=Kr0E[|XT
r+εBTr|] +Cr0E[|ATr|] ≤Kr0(r+E[BT
r]) +Cr0E[|ATr|]
for allε ≤1, i.e. the bound on E[FTn
r]is independent of both nand ε for allε ≤1.
Combining this with (the expectation of) the bound (1.21) we obtain (1.19c). The assertion of the lemma now follows by Theorem 1.3.
1.6
Proof of Theorem 1.2
Finally we need to derive the analogous result for our original object of interest, a continuous semimartingaleX.
Proof of Theorem 1.2. We have limε↓0Xe(ε)=Xalmost surely and, thus, for a continuous
convex f, limε→0 f(Xe(ε)) = f(X) almost surely. Note that the limit of the process
e
X(ε) as ε tends to zero lives in the enlarged probability space (Ωe,Fe,Pe), even though
the original process X is defined on (Ω,F,P). Crucially by Itô’s lemma f(Xe(tε)) is a
continuous semimartingale for everyε >0. Hence, we can apply Theorem 1.3 to the sequence of semimartingales{f(Xe(ε))}ε>0 if we can show that the conditions (1.1a),
(1.1b) and (1.2) of the theorem are satisfied in our case.
We use the same localisation as in the proof of Lemma 1.14, i.e. we consider
e
Xt∧ e
T foreT=Tr∧Ter0=inf{t:Xt∈/B(r)} ∧inf{t:Xet∈/B(r0)}, with r0>r>0.
In view of Lemma 1.15 the only thing we need to prove in order to apply results of Theorem 1.3 is
lim ε↓0E
h
sup
t≤eT
|f(Xet(ε))−f(Xt)| i
=0 ,
Since by Theorem 1.7 f is Lipschitz in the ballB(r0), we have
E
h
sup
t≤Te
|f(Xet)− f(Xt)| i
≤Kr0E
h
sup
t≤Te
|Xet−Xt| i
=εKr0E
h
sup
t≤eT
|Bt|
i
where Kr0 <∞is a Lipschitz constant depending on r0. Taking the limit ε→0 gives
the desired result.
Together with expressions (1.18a) and (1.18b) of Lemma 1.15 this ensures that the conditions of Theorem 1.3 are satisfied in our case. It follows immediately that the martingale part of f(X) is given by the limit as ε → 0 of R∇f(Xe(ε))dMe(ε), the
martingale part of f(Xe(ε)), locally in H1. All is left to prove now is that this limit is
given byR∇f(X)d M for some choice of∇f(x)∈∂f(x), i.e. that for all t>0
lim ε↓0
Z t
0
∇f(Xe(ε)
s∧Te
)dMe(ε)
s∧eT
=
Z t
0
∇f(Xs∧Tr)d Ms∧Tr (1.22)
inH1 for allr0>r>0.
Proving the above convergence will require us to consider the limit of∇f(Xe
(ε) t∧eT) asε tends to 0. From Proposition 1.10 we know that for all t≥0 for almost all values ofBt the limit limε↓0∇f(Xt+εBt)exists and belongs to∂f(Xt). Denote this limit by
∇f(Xt). Also for any path of X andBfor small enoughε, i.e. eventually for allε, we
haveTr<Ter0. That isTe→Trasε→0 a.s. and so
lim
ε↓0∇f(Xt∧Te+εBt∧Te) =∇f(Xt∧Tr) a.s. . (1.23)
Again we consider convergence inH2 first, and convergence in H1 follows. We have, using the fact that limε↓0Met∧
e
T =limε↓0(Mt∧Te+εBt∧Te) =limε↓0Mt∧Te a.s.
lim ε↓0
Z t 0
∇f(Xes∧
e
T)dMes∧
e
T −
Z t
0
∇f(Xs∧Tr)d Ms∧Tr H2 =lim ε↓0E
Z eT
0
∇f(Xes)2d〈Ms,Ms〉+ Z Tr
0
∇f(Xs)2d〈Ms,Ms〉
−2
Z ∞
0
∇f(Xes∧
e
T)∇f(Xs∧Tr)d〈Ms∧Te,Ms∧Tr〉 1/2
. (1.24)
Additionally we have
Z eT
0
∇f(Xes)2d〈Ms,Ms〉 ≤Cr20〈M
e
T,MTe〉<∞
and
Z ∞
0
∇f(Xes∧
e
T)∇f(Xs∧Tr)d〈Ms∧eT,Ms∧Tr〉 ≤CrCr0〈MTe,MTe〉<∞,
since〈Mt,Mt〉, is finite for anyt≤Te. Appealing to the dominated and bounded
conver-gence theorems we can interchange the limit with the expectation and the integration signs respectively. Convergence (1.23) and the fact thatTe→Tr a.s. then finally yield
(1.22). Noticing that the above is true for allr0>r>0 concludes the proof.
Example. As was mentioned before, in[16]Bouleau has proved thatanymeasurable choice of subgradient ∇f(Xt) works for the stochastic integral of Theorem 1.2. A
function
∇ef(x) =lim
θ↓0E[∇f(x+θN)], (1.25) whereN is a standardd-dimensional Gaussian random variable, is a particular exam-ple.∇ef(x)can be regarded as a sort of an average of (sub)gradients within the vicinity ofx. To verify that it does indeed define a subgradient of f at each x ∈Rd we check
the subgradient inequality (1.7) of Theorem 1.6. For any y∈Rd\{0}we have
〈∇ef(x),y〉=〈lim
θ↓0E[∇f(x+θN)],y〉=limθ↓0E[〈∇f(x+θN),y〉]. (1.26)
〈∇f(x+θN),y〉 ≤D(x+θN)[y] = inf λ>0
f(x+θN+λy)− f(x+θN)
λ ≤ f(x+θN+ y)−f(x+θN)≤K|y|
for some Lipshitz constantK < ∞depending on x andN. Appealing to the bounded convergence theorem now allows us to take the limit inside the expectation in equation (1.26) above
〈∇ef(x),y〉=E[〈lim
θ↓0∇f(x+θN),y〉]. (1.27) But by Proposition 1.10 limθ↓0∇f(x +θN) exists, is unique and belongs to
∂f(x)for almost allN. Denote this limit by∇∗f(x). Then (1.27) is equal to
E[〈∇∗f(x),y〉]≤E[D f(x)[y]] =D f(x)[y].
Hence we have〈∇ef(x),y〉 ≤D f(x)[y]for any y ∈Rd\{0}for all x, and so∇ef(x)
Chapter 2
Elements of representation theory.
Starting with the present chapter the thesis is taking up a study of a different subject to that of Chapter 1. Chapters 3, 4 and 5 are concerned with a certain family of bivariate Markov chains, diffusions and repelling particle systems, respectively.
In this chapter we outline some basic results from representation theory which we will employ in later chapters. In particular we will describe finite-dimensional repre-sentations of the universal enveloping algebraU(gl(n))and its quantisationUq(gl(n)),
treating separately the case ofU(sl(2))andUq(sl(2)), and explain how they are related to combinatorics. We start with some basic definitions and notation.
2.1
Basics
Given a groupG, Representation theory is concerned with finding all finite- and infinite-dimensional vector spaces on which elements of the group act as automorphisms and describing these actions. Thus homomorphisms, structure preserving maps between two algebraic objects, play a fundamental role in Representation theory. Agroup ho-momorphism between two groups (G,·) and (H,×) is a map ϕ : G → H such that
anautomorphismis an endomorphism which is a bijection. Now
Definition 2.1. (Group representation) A representation of a group G on a complex vector
space V is a homomorphismρ
ρ:G→Aut(V),
where Aut(V)is a family of automorphisms of V .
In other words, a representation of G on V is a mapρsuch that for all g ∈G the mapρ(g):V →V is linear and invertible and such that for allg,g0∈G we have
ρ(g g0)(v) =ρ(g)ρ(g0)(v), v ∈ V. We say that V is a G-module. The dimension of V is called thedimension of the representation. It is customary (if slightly inaccurate) to call the vector spaceV itself a representation of G and write g vas a shorthand for
ρ(g)(v). IfV is endowed with an inner product〈·,·〉then it is aunitary representation if the group acts on members ofV unitarily, i.e. for anyg∈G andv,u∈V
〈g v,gu〉=〈v,u〉.
Next we define thedirect sumand thetensor productof two vector spacesV and W. The direct sum V ⊕W is a Cartesian product V ×W with element-wise addition
(v1,w1) + (v2,w2) = (v1+v2,w1+w2) for v1,v2 ∈ V, w1,w2 ∈W. If V and W are inner product spaces, so is its direct sum via〈(v1,w1),(v2,w2)〉=〈v1,v2〉+〈w1,w2〉. Of course we have dim(V ⊕W) =dim(V) +dim(W).
Suppose now V and W are finite-dimensional; to construct a direct product V⊗W we associate to each pair(u,v)an elementu⊗vvia a bi-linear mapφ:V×W →
V ⊗W. The map φ is unique up to an isomorphism. Consider a space of all linear combinationsF :={P
nαnun⊗vn,(un,vn)∈U×V,αn∈C}and a subspaceSgenerated
by all the elements ofF of the form
u⊗w+v⊗w−(u+v)⊗w, u⊗w+u⊗v−u⊗(w+v),
(αu)⊗v−α(u⊗v), u⊗(αv)−α(u⊗v)