für Angewandte Analysis und Stochastik
Leibniz-Institut im Forschungsverbund Berlin e. V.
Preprint
ISSN 2198-5855
Hydrodynamic limit and large deviations of
reaction-diffusion master equations
Markus Mittnenzweig
submitted: July 16, 2018 Weierstrass Institute Mohrenstr. 39 10117 Berlin Germany E-Mail: [email protected] No. 2521 Berlin 20182010Mathematics Subject Classification. 60F10, 35K57, 82C35, 82C22.
Key words and phrases. Reaction-diffusion master equation, hydrodynamic limit, large deviations, interacting particle systems.
Leibniz-Institut im Forschungsverbund Berlin e. V. Mohrenstraße 39 10117 Berlin Germany Fax: +49 30 20372-303 E-Mail:
[email protected]
reaction-diffusion master equations
Markus MittnenzweigAbstract
We derive the hydrodynamic limit of a reaction-diffusion master equation, that combines an exclusion process with a reversible chemical master equation expression for the reaction rates. The crucial assumption is that the associated macroscopic reaction network has a detailed bal-ance equilibrium. The hydrodynamic limit is given by a system of reaction-diffusion equations with a modified mass action law for the reaction rates. We provide the upper bound for large deviations of the empirical measure from the hydrodynamic limit.
1
Introduction
Stochastic reaction-diffusion processes are model systems of interacting particles in statistical physics, for which various aspects, such as hydrodynamic limits [DFL86, DFL85], phase transitions [Täu17], tur-bulence [CC∗08] or stationary non-equilibrium states [BD∗07] have been studied. Moreover, stochastic reaction - diffusion models are widely used for the simulation of biochemical reaction networks in cel-lular biology, where copy numbers of molecules can be small and concentrations can vary across different regions of a cell [AnB04, ErC09].
In this work, we consider the reaction-diffusion master equation for multiple species in the diffusive scaling limit, where particles can react with each other when they find themselves at the same lattice site. The stochastic reaction rates follow a modified chemical master equation expression [GM∗76, Isa08]. Furthermore, we will only allow a maximal number of
M
particles per species per lattice point. Particle jumps between different sites are therefore modeled by an exclusion process. We show that if the associated reaction network satisfies detailed balance, the hydrodynamic limit of the reaction -diffusion master equation is given by a reaction--diffusion partial differential equation with mass action law kinetics. In the second part of this work, we study large deviations from the hydrodynamic limit and prove rigorously the large-deviation upper bound. Our work is a generalization of [JLLV93], in which the authors established a dynamic large deviations principle for a scalar reaction-diffusion master equation. Further results for two species models were obtained in [Per00, Bo ˇC07].The crucial observation in this work is, that if the macroscopic reaction network satisfies detailed balance with respect to an equilibrium concentration
w
= (
w
1, . . . , w
I)
, then the generatorL
of the reaction - diffusion master equation is as well in detailed balance with respect to a product binomial distribution. ForM
=
∞
, i.e. no restriction on the number of particles per lattice site, the analogous result for the Poisson distribution is known to hold for chemical reaction networks satisfying a complex balance condition (which is a weaker condition than the detailed balance condition). As a result of the detailed balance property for finiteM
, we can apply the well-developed entropy method of Guo, Papanicalou and Varadhan [GPV88] to derive the macroscopic hydrodynamic limit of the reaction-exclusion process. More precisely, we considerI
different species withR
different reactionsα
r1X
1+
· · ·
+
α
rIβ
rbetween them, where
α
r,
β
r∈
N
Iare the stoichiometric coefficients of ther
th reaction. This reaction network satisfies the (macroscopic) detailed balance condition if∃
w
∈
R
I +:
κ
f w rw
αr=
κ
bw rw
βr∀
r
= 1
, . . . , R,
(2) withw
α=
Q
I i=1w
αii being the mass action law rate associated to the stoichiometric vector
α
. The microscopic model on the discrete torusT
dN=
Z
d/N
Z
dis a stochastic reaction-exclusion processη
t(
·
)
of particle configurationsη
t:
T
dN→
N
I with0
≤
η
it
(
x
)
≤
M
for alli
. The generator of the process is given byLf
(
η
) =
X
x,|e|=1N
2D
iη
i(
x
) 1
−
ηi(xM+e)f
η
x,xi +e)
−
f
(
η
)
+
M
X
x,rg
rf w(
η
(
x
))
f
η
γr x−
f
(
η
)
+
g
rbw(
η
(
x
))
f
η
−γr x−
f
(
η
)
(3)where
η
x,xi +eis the new configuration where one particle of typei
jumped fromx
tox
+
e
andη
γrx is the configuration where reaction
r
took place at lattice sitex
, i.e.η
γrx
(
x
) =
η
(
x
) +
γ
r. The reaction ratesg
f wr(
n
)
have the formg
rf w(
n
) =
κ
f wrC
rM IY
i=1 ni! (ni−αir)!·
(M−ni)! (M−ni−βir)!,
with the equivalent expression for
g
rbw(
n
)
with exchanged indicesα
r andβ
r. The constantC
rM is some normalization constant, see (16). The ratesg
f w/bwr are modified versions of the chemical master equation (CME). The standard CME rate for a general reaction of the form (1) would bet
αr(
n
) =
Q
I i=1ni!
(ni−αir)! times a constant prefactor. However, the rate
t
αr(
n
)
can lead to more thanM
particlesper lattice site and therefore it needs to be adapted accordingly. The modified rate
g
f wr(
n
)
not only takes into account the numbern
i of particles available, but also the number of holesM
−
n
iavailable for each speciesi
.The central object in the study of the hydrodynamic limit is the empirical measure
π
Nt associated to a stochastic trajectory
η
t(
·
)
and defined asπ
Nt=
1
M N
dX
x∈Td Nη
t(
x
)
δ
x/N.
We show that if the chemical reaction network satisfies the detailed balance condition (2), then the empirical measure
π
Nt converges for
N
→ ∞
in a weak sense to a reaction-diffusion system. More precisely, we show that for allδ >
0
andG
∈
C
1,2([0
, T
]
×
T
d,
R
I)
,
lim sup
N→∞P
µ NZ
T 0h
π
Nt,
G
(
t,
·
)
i − h
c
(
t,
·
)
,
G
(
t,
·
)
i
d
t
> δ
= 0
.
where
c
(
t, x
)
is the solution of the reaction-diffusion system˙
c
=
D
∆
c
−
RX
r=1(
α
r−
β
r)(
κ
f w rc
αr(
1
−
c
)
βr−
κ
bw rc
βr(
1
−
c
)
αr)
(4)with
D
=
diag(
D
1, . . . , D
I)
and1
= (1
, . . . ,
1)
. Because of the prefactorsc
αr(
1
−
c
)
βr, the solu-tionsc
(
t, x
)
of (4) are uniformly bounded by0
≤
c
(
t, x
)
≤
1
, which is the macroscopic equivalent of the microscopic constraint0
≤
η
t(
x
)
≤
M.
In the last part, we study dynamic large deviations from the hydrodynamic limit. Because the reaction-exclusion process is reversible with respect to the equilibrium product binomial distribution, we can derive a superexponential estimate and obtain a large-deviation upper bound. Let
M
I+be the space of positive vector-valued measures onT
d, i.e.π
Nt
∈ M
I+and letD
([0
, T
];
M
I+)
be the correspondingSkorohod path space for trajectories
t
7→
π
Nt . Then we show thatlim sup
N→∞
N
−dlog
P
N(
π
N·∈ C
)
≤ −
inf
c∈C
I
(
c
)
∀C
closed. The rate functionalI
consists of a static and dynamic partI
(
c
) =
I
0(
c
(0
,
·
)) +
I
1(
c
)
.
I
0 characterizes large deviations of the initial distributionµ
0 ofη
0(
·
)
from the stationary productbinomial distribution. It is given by
I
0(
c
) =
R
Td
F
(
c
(
x
)
|
w
)d
x
with the Fermi-Dirac-type entropyF
(
c
|
w
) =
IX
i=1c
ilog
c
iw
i+ (1
−
c
i) log
1
−
c
i1
−
w
ithat is typical for simple exclusion processes. The dynamic rate functional
I
1(
c
)
has the implicitrep-resentation
I
1(
c
) =
sup
H∈C1,2([0,T];Td)
J
lin(
c
,
H
)
− J
re(
c
,
H
)
− J
ex(
c
,
H
)
where
J
lin(
c
,
H
)
is linear in the functionH
andJ
ex(
c
,
H
)
andJ
re(
c
,
H
)
are convex inH
:J
ex(
c
,
H
) =
IX
i=1Z
T 0Z
Td|∇
H
i(
t, x
)
|
2c
i(
t, x
) 1
−
c
i(
t, x
)
d
x
d
t
J
re(
c
,
H
) =
X
rZ
T 0Z
TdR
fr(
c
(
t, x
)) e
γr·H(t,x)−
1
+
R
f r(
c
(
t, x
)) e
−γr·H(t,x)−
1
d
x
d
t.
J
ex(
c
,
H
)
is the typical contribution from the exclusion process [KOV89]. The exponential form of thereaction part
J
re(
c
,
H
)
agrees with the large deviations functional for the chemical master equationwithout spatial degrees of freedom [MP∗17]. We do not give a rigorous proof of a large deviations lower bound here. However, it seems reasonable that the lower bound of [JLLV93] for
d
= 1
andI
= 1
can be generalized to arbitraryd
andI
. Ford
= 1
andI
= 1
, density large deviations where also extended to joint large deviation principles for the empirical density and empirical current [BoL12] as well as to static large deviations from non-equilibrium stationary states [LaT18].It remains an open, if one can relax the detailed balance condition (2). For
M
=
∞
, the central Propo-sition 2.4 holds also true for chemical reaction networks satisfying only the weaker complex balanced condition [ACK10, HoJ72, Fei72]. This work leaves also open the convergence to hydrodynamic limit in the case ofM
=
∞
, for which particle numbers per lattice site can become arbitrarily large. For the macroscopic reaction-diffusion PDE, weak solutions do globally exist in time in the case of detailed balance and arbitrary quadratic reactions [DF∗07].The paper is structured as follows: In Section 2, we introduce the reaction-diffusion models in detail. In Section 3, we prove the hydrodynamic limit and Section 4 establishes the large deviations upper bound.
Notation
I
is the number of species,R
the number of reactions.
M
∈
N
is the maximal number of particles per lattice site for each species.Bold letters always represent vectors with
I
components. The bold letters1
= (1
, . . . ,
1)
,M
= (
M, . . . , M
)
as well asc
= (
c
1, . . . , c
I)
for concentrations andh
=
1
−
c
for hole concentrations will be used throughout the paper.
T
d=
R
d/
[0
,
1]
dis thed
-dimensional torus.
T
dN=
Z
d/N
Z
dis the discrete,N
-periodic,d
-dimensional torus.
π
Nt
=
M
−1
N
−dP
x∈Td
N
η
t(
x
)
δ
x/N is the empirical measure associated to the particlecon-figuration
η
t(
x
)
at timet
.
N
M=
{
n
∈
N
I|
n
≤
M
}
is the space of allowed lattice site occupation numbers.
X
N=
{
η
(
·
)
|
η
:
T
dN→ N
M}
space of allowed particle configurationsη
(
·
)
.
M
I+is the space of vector-valued positive probability measures on
T
d, i.e. forπ
∈ M
I+ withπ
= (
π
1, . . . , π
I)
, eachπ
iis a positive measure onT
d.
D
([0
, T
];
M
+)
is the Skorohod path space for empirical measuresπ
Nt
η
x,yi is the configuration where a particle of speciesi
moved from pointx
to pointy
, see (11).
η
γx is the configuration whereη
(
x
)
is replaced byη
(
x
) +
γ
, see (10).
H
(
µ
|
ν
) =
P
η
µ
(
η
) log
µ(η)ν(η) is the relative entropy.
P
µN is the probability measure for empirical measuresπ
N·∈ D
([0
, T
];
M
+)
as well asparti-cle configurations
η
t(
·
)
.
2
Microscopic and macroscopic reaction-diffusion models
2.1
Chemical reaction networks
We consider
I
different particle speciesX
1, . . . , X
I with concentrationsc
= (
c
1, . . . , c
I)
∈
R
I+thatcan react with each other according to
r
= 1
, . . . , R
different possible reactionsα
1rX
1+
· · ·
+
α
rIβ
r1
X
1+
· · ·
+
β
IrX
I.
(5) whereβ
r,
α
r∈
N
I are the corresponding stoichiometric vectors. In the followingγ
r
=
β
r−
α
rwill always denote the difference between the two. The prototypical example is a bimolecular reaction of the formA
+
B
C
+
D,
(6)for instance
H
2+ Cl
22HCl
.
When particle numbers are large enough and the whole system iswell-mixed, rate equations of the form
˙
are typically used to model the dynamics of the system. For elementary reactions such as (6), the mass-action law is a good approximation to the kinetics of the reaction. When every reaction (5) follows the mass action law, the reaction rate
T
(
c
)
of the reaction network is given byT
(
c
) =
RX
r=1(
α
r−
β
r)(
κ
f wrc
αr−
κ
bwrc
βr)
withc
α=
IY
i=1c
αi i.
(7)Here
κ
f wrc
α is the mass action rate associated to the stoichiometric vectorα
of ther
th forward re-action. Typically, the number of reactionsR
is smaller than the number of speciesI
and the stoichio-metric vectors(
β
r−
α
r)
Rr=1do not span the full spaceR
I. In that case, there exist globally conservedquantities
S
=
span{
β
r−
α
r|
r
= 1
, . . . , R
}
S
⊥=
{
ξ
|
ξ
·
c
= 0
∀
c
∈
S
}
that correspond to atomic species in applications. In the following, we will consider a variation of the mass action kinetics (7), that incorporates an exclusuion rule and that guarantees that the concentra-tions
c
i are uniformly bounded by1
, i.e.c
i≤
1
for everyi.
One possibility is to introduce for every particle typeX
ia hole speciesH
i.α
r·
X
+
β
r·
H
β
r·
X
+
α
r·
H
(8) since for every removed particleX
i a corresponding holeH
i is created and vice versa. At any time, the particle and the hole concentrationsc
i(
t
)
andh
i(
t
)
satisfy the constraintc
i(
t
) +
h
i(
t
) = 1
.
The mass action law reaction rate associated to (8) is then given byR
(
c
) =
RX
r=1(
α
r−
β
r)(
κ
f w rc
αrh
βr−
κ
bw rc
βrh
αr)
,
h
=
1
−
c
.
This reaction rate clearly preserves the positivity of both
c
andh
. Next, we will introduce the central notion of a detailed balanced equilibrium of a chemical reaction network.Definition 2.1. (Detailed balance condition) A chemical reaction network (5) with mass action law reaction rates (7) satisfies the detailed balance condition if
∃
w
∈
R
I+
:
κ
f wrw
αr=
κ
bwrw
βr∀
r.
(9)Proposition 2.2. If the reaction network (5) without hole species
H
iis in detailed balance forw
>
0
, then the reaction network with holes (8) is in detailed balance forw
∗withw
∗,i=
1+wwii and vice versa.
Proof. The chemical reaction network with holes is in detailed balance if
∃
w
∗∈
R
I+:
κ
rf ww
∗αr(
1
−
w
∗)
βr=
κ
bwrw
∗βr(
1
−
w
∗)
αr∀
r.
By dividing by(
1
−
w
∗)
βr and(
1
−
w
∗)
αr, we see that this expression is equivalent to∃
w
∗∈
R
I+:
κ
f w rw
αr=
κ
bw rw
βr∀
r
withw
i=
w∗,i 1−w∗,i.
Proposition 2.2 shows that the existence of a detailed balance equilibrium of the chemical reaction network with holes is equivalent to the existence of a detailed balance equilibrium of the chemical reaction network without holes. An equivalent characterization of detailed balance uses the matrix
W
= (
β
r−
α
r)
r=1,...,R Twhich is called Wegscheider matrix due its first use in [Weg02], see also [GlM12, MHM15]. The de-tailed balance condition (9) is then equivalent to
W
log
w
=
log
κ
f w rκ
bw r r=1,...,Rwith
log
w
= (log
w
i)
i=1,...,I.
By Fredholm’s alternative, this equation is solvable if and only if
ξ
·
log
κ
f w rκ
bw r r=1,...,R= 0
∀
ξ
∈
S
⊥ whereS
=
RanW
TS
⊥= ker
W
.
The last conditions are called Wegscheider conditions [VlR09, ScS89, Weg02] and give polynomial relations on the rates
κ
f wr andκ
bwr if the stoichiometric vectors(
β
r−
α
r)
are not linearly independent. See [Fei89] for more details on necessary and sufficient conditions of detailed balancing.2.2
The reaction-diffusion master equation
The reaction-diffusion master equation is a continuous-time Markov chain, where particles can hop on a given lattice and two or more particles can react with each other, when they are at the same lattice position [Isa09, Isa08, WiS16]. In this article, the lattice will always be the discrete torus
T
dN=
Z
d/N
Z
d. A particle configurationη
(
·
)
is a functionη
:
T
dN→
N
I,
where the value
η
i(
x
)
∈
N
describes the number of particles of speciesi
at the lattice pointx
. We will distinguish between the two cases that either particle numbers per lattice site can be arbitrarily large or that they are uniformly bounded by an integerM
, i.e.η
i(
x
)
≤
M
. As explained in the previous section, a reaction is characterized by its stoichiometric vectorsα
r,
β
r∈
N
I.
If the reactionr
takes place at the lattice pointx
, then the particle configurationη
(
x
)
will change toη
(
x
)
→
η
(
x
)+
γ
rwhere
γ
r=
β
r−
α
r.
We will writeη
γrx for this new configuration, i.e.
η
γrx
(
y
) =
(
η
(
x
) +
γ
r ify
=
x
η
(
y
)
else. (10)If a particle of species
i
jumps fromx
toy
, the new configuration will be denoted byη
x,yi(
z
) =
η
(
y
) +
e
i ifz
=
y
η
(
x
)
−
e
i ifz
=
x
η
(
z
)
else (11)where
e
i= (0
, . . . ,
1
, . . . ,
0)
is thei
th unit vector. The generator of the reaction-diffusion master equation is given byLf
(
η
) =
X
x,|e|=1N
2D
iη
i(
x
)
f
(
η
x,xi +e)
−
f
(
η
)
+
X
x,rκ
f wrt
αr(
η
)
f
(
η
γr x)
−
f
(
η
)
+
κ
bwrt
βr(
η
)
f
(
η
−γr x)
−
f
(
η
)
(12)with
D
ibeing the diffusion constant of speciesi
,κ
f w/bwr the forward and backward reaction rates and
t
α(
n
) =
(
n! (n−α)! forn
≥
α
0
forn
α
wheren
! :=
IY
i=1n
i!
.
(13)The above generator contains a diffusion and a reaction part
L
=
N
2L
diff+
L
react.
(14)Note that because
t
α(
n
) = 0
forn
α
, particle numbersη
(
x
)
will always remain non-negative. Without spatial diffusion, the Kolmogorov forward equation of the above generator is commonly re-ferred to as chemical master equation in the physical literature [Gil07, Gil92]. The specific form oft
α(
n
)
relies on the following combinatorial argument. The idea is, that each lattice point represents a well-mixed volume, in which these particles move rapidly around. Then, the probability that a reaction occurs must be proportional to the number of possibilities to choose a tupleα
out of then
particles. But this is exactlyI
Y
i=1n
i(
n
i−
1)
·
. . .
(
n
i−
α
i+ 1) =
n
!
(
n
−
α
)!
for distinguishable particles. Next, we introduce a variant of the reaction-diffusion process (12), for which only a maximum of
M
particles is allowed per species per lattice site. LetN
M=
n
∈
N
I|
0
≤
n
i≤
M
∀
i
= 1
, . . . , I
denote the state of allowed lattice occupationsn
∈
N
Iand letX
N=
η
(
·
)
|
η
(
x
)
∈ N
M∀
x
∈
T
dN be the space of all allowed particle configurationsη
:
T
dN
→ N
M.
For eachη
∈ X
N, we define then the generatorL
of the reaction-exclusion process throughLf
(
η
) =
X
x,|e|=1N
2D
iη
i(
x
) 1
−
ηi(x+e) Mf
(
η
x,xi +e)
−
f
(
η
)
+
M
X
x,rg
rf w(
η
(
x
))
f
(
η
γr x)
−
f
(
η
)
+
g
rbw(
η
(
x
))
f
(
η
−γr x)
−
f
(
η
)
(15)with the reaction rates
g
rf w(
n
) =
κ
f wrC
rMt
αr(
n
)
t
βr(
M
−
n
)
g
rbw(
n
) =
κ
bwrC
rMt
βr(
η
)
t
αr(
M
−
n
)
(16) whereM
= (
M, . . . , M
)
andC
M r=
t
αr+βr(
M
)
−1. As before, we can write
L
=
N
2L
excl+
L
react.
(17)Up to a prefactor, the rates
g
rf w/bware the chemical master equation rate for reactions with holes (8). The prefactorC
Mr is chosen in such a way that the associated macroscopic reaction rates will be of the form
κ
f wr
c
αr(
1
−
c
)
βr, see (4) and Proposition 2.7. The hopping of a particle of speciesi
from lattice pointx
tox
+
e
can also be written as a reactionNote that particle jumps between neighboring lattice sites are by a factor
N
2 faster than the otherreactions. Depending on the size of
M
, the reaction-exclusion master equation (12) can either be viewed as a microscopic description chemical reactions, where only a few particles are allowed on each site, or as a mesoscopic description, where each lattice site is a well-mixed container with several particles in it.Next, we introduce on
N
M for somec
∈
[0
,
1]
Ithe binomial distributionν
c(
n
) :=
IY
i=1M
n
ic
ni i(1
−
c
i)
M−ni forn
∈ N
M as well as the Poisson distributionχ
c(
n
) := e
− PI i=1ci·
c
nn
!
=
IY
i=1χ
ci(
n
i)
(19) for somec
∈
R
I+. We will also write
ν
c(
n
)
andχ
c(
n
)
for the one-dimensional binomial and Poisson distributions. Moreover, for particle configurationsη
(
·
)
∈ X
N, we define the product distributionsν
cN(
η
(
·
)) :=
Y
x∈TdNν
c(
η
(
x
))
χ
Nc(
η
(
·
)) :=
Y
x∈TdNχ
c(
η
(
x
))
.
(20)They play an important role, because they are the equilibrium distributions of the pure exclusion and the diffusion processes.
Lemma 2.3. For every
c
∈
[0
,
1]
I the product binomial distributionν
Nc is an equilibrium distribution
of
L
excland for every˜
c
∈
R
I+, the product Poisson distributionχ
N˜c is an equilibrium of the diffusionprocess, i.e.
L
∗exclν
cN= 0
L
∗diffχ
Nc= 0
.
The next proposition shows that if the chemical reaction network satisfies the detailed balance condi-tion (9) for some
w
∗∈
R
I+,
then the distributionsν
wN(
η
(
·
))
(withw
i=
1+ww∗,i∗,i) and
χ
N
w∗
(
η
(
·
))
areequilibria of the reaction-exclusion (15) and the reaction-diffusion master equation (12) respectively. For the Poisson distribution, this result is contained inTheorem 4.1 of [ACK10].
Proposition 2.4. Suppose the reaction network (5) satisfies the detailed balance condition (9) for the concentration
w
∗,
i.e.κ
f wrw
αr=
κ
bwr
w
βr
∀
r.
Then the reaction-diffusion process (12) is reversible with respect to the product Poisson distribution
χ
w∗(
η
(
·
))
and the reaction-exclusion process (15) is reversible with respect to the product binomialdistribution
ν
w∗(
η
(
·
))
withw
i=
w∗,i
1+w∗,i. In particular, both distributions
χ
N
w∗
(
η
(
·
))
andν
N
w
(
η
(
·
))
areequilibrium distributions of the respective processes.
Proof. Let us start with the binomial distribution. For a general jump process on a finite state space of the form
Lf
(
η
) =
X
η06=η
r
(
η
,
η
0)
f
(
η
0)
−
f
(
η
)
,
reversibility of
L
with respect to a distributionµ
(
η
)
is defined asThus we need to show the two conditions
ν
wN(
η
)
·
η
i(
x
)(
M
−
η
i(
y
)) =
ν
wN(
η
x,y i)(
η
i(
y
)+1)(
M
−
(
η
i(
x
)
−
1))
ν
wN(
η
)
·
g
rf w(
η
(
x
)) =
ν
wN(
η
γr x)
·
g
bw r(
η
(
x
) +
γ
r)
.
For the first line, we use thatν
wN(
η
(
·
))
ν
N w(
η
x,y i(
·
))
=
ν
wi(
η
i(
x
))
·
ν
wi(
η
i(
y
))
ν
wi(
η
i(
x
)
−
1)
·
ν
wi(
η
i(
y
)+1)
where
ν
wi(
n
)
is the scalar binomial distributionν
wi(
n
) =
M n
w
ni(1
−
w
i)
M−n.
An explicit calculation for the binomial distribution shows thatν
w(
n
)
ν
w(
n
−
1)
=
M
−
(
n
−
1)
n
·
w
1
−
w
which proves the first line. For the reaction part we note first that
ν
N w(
η
)
ν
N w(
η
γr x)
=
ν
w(
η
(
x
))
ν
w(
η
(
x
)+
γ
r)
.
Thus, using (1−wwγ)γ=
w
γ ∗, we obtainν
w(
n
)
ν
w(
n
+
γ
r)
=
M nw
n(
1
−
w
)
M−n M n+γrw
n+γr(
1
−
w
)
M−n−γr=
(
n
+
γ
r)!(
M
−
n
−
γ
r)!
n
!(
M
−
n
)!
·
w
−γr ∗.
On the other hand, using the definition (16) of the reaction rates
g
rf w/bw, which contain the functiont
α(
n
) =
(n−n!α)!, it follows thatκ
f wrκ
bw r·
g
bw r(
n
+
γ
r)
g
rf w(
n
)
=
t
βr(
n
+
γ
r)
·
t
αr(
M
−
n
−
γ
r)
t
αr(
n
)
·
t
βr(
M
−
n
)
=
(n+γr)! (n−αr)!·
(M−γr−n)! (M−βr−n)! n! (n−αr)!·
(M−n)! (M−n−βr)!=
(
n
+
γ
r)!(
M
−
n
−
γ
r)!
n
!(
M
−
n
)!
.
Finally, by using the detailed balance condition κbwr
κf wr
=
w
−γr ∗,
we thus showedg bw r (n+γr) gf wr (n)=
νw(n) νw(n+γr),which concludes the proof of reversibility for the binomial distribution. The analogous computation for the Poisson distribution is
κ
f w r·
χ
w∗(
n
)
κ
bw r·
χ
w∗(
n
+
γ
r)
=
κ
f w rw
∗n·
(
n
+
γ
r)!
κ
bw rw
n+γr ∗·
n
!
=
t
βr(
n
+
γ
r)
t
αr(
n
)
which shows reversibility of the reaction part with respect to
χ
w∗.
Reversibility of the diffusion partfollows in the same way.
Remark 2.5. For chemical reaction networks, there also exists the notion of a complex balanced equi-librium. Every detailed balance equilibrium is also in complex balance, but not vice versa. The authors of [ACK10] show for the chemical master equation without diffusion, that the Poisson distribution
χ
w∗is an equilibrium of the process if
w
∗is a complex balanced equilibrium of the chemical reaction net-work. It is unclear if a similar statement holds true for the binomial distribution. The notion of complex balanced equilibria might also be more natural if one introduces only one hole species, see AppendixC. For chemical reaction networks with mass action kinetics, there exists a well-developed theory for complex balanced equilibria [Fei87]. We do not know if a similar theory, such as the deficiency zero theorem, can be developed for reaction rate equations of the form
˙
c
=
X
r,l
(
α
l−
α
r)
κ
rlc
αr(
1
−
c
)
αlfor reactions
α
r·
X
→
α
l·
X
.
Remark 2.6. A variant of the exclusion process considered here puts a bound
M
on the total number of particles, i.e.P
Ii=1η
i(
x
)
≤
M.
Particles jump between different lattice sites with the rater
(
η
,
η
x,xi +e) =
η
i(
x
) 1
−
PI i=1ηi(x) M.
If reaction rates are also adapted accordingly, then the whole reaction-diffusion process is reversible with respect to a multinomial distribution for some parameter
w
.
See Appendix C for further details.Proposition 2.4 makes it possible to derive the hydrodynamic limit as well as a superexponential es-timate [KiL98]. The macroscopic reaction rate as a function of
c
will be given by an average of the microscopic reaction ratesg
rf w/bw(
n
)
with respect to the binomial distributionν
c(
n
)
with parameterc
. The binomial distribution for generalc
appears here, because the binomial product distributionsν
Nc
(
η
(
·
))
are the equilibrium measures of the exclusion process partL
excl. The heuristics is that lo-cally, in the neighborhood of a macroscopic concentrationc
(
x
)
, the system is ergodic with respect to the fast process on small spatial scales, which is the exclusion partL
excl. Therefore, the macroscopic rates are the ergodic averages of the microscopic rates with respect to equilibrium measures of the fast process. For later usage, we calculate the macroscopic reaction ratesE
νc[
g
f w
r
(
n
)]
in the next proposition.Proposition 2.7. The average reaction rates follow the mass-action law form
E
χct
α(
n
)
=
c
αE
νct
α(
n
)
t
β(
M
−
n
)
=
t
α+β(
M
)
·
c
α(
1
−
c
)
β.
ThusE
νc[
g
f w r(
n
)] =
κ
f w rc
αr(
1
−
c
)
βrE
νc[
g
bw r(
n
)] =
κ
bw rc
βr(
1
−
c
)
αr.
Proof. The calculation in the Poisson case reads
E
χct
α(
η
)
=
c
αe
−|c|X
n≥αc
n−α(
n
−
α
)!
=
c
α.
For the binomial distribution, we find
E
νct
α(
n
)
t
β(
M
−
n
)
=
KX
n>α,M−n>βc
n(
1
−
c
)
M−nM
!
(
M
−
n
)!
n
!
·
n
!
(
n
−
α
)!
·
(
M
−
n
)!
(
M
−
n
−
β
)!
=
c
α(
1
−
c
)
βt
α+β(
M
)
M−βX
n≥αc
n−α(
1
−
c
)
M−n−βM
−
α
−
β
n
−
α
=
c
α(
1
−
c
)
βt
α+β(
M
)
.
2.3
Entropy bounds
We next introduce the relative entropy and the Dirichlet form of a probability distribution as well as an entropy-dissipation inequality relating the two. This inequality will give an a priori bound on both quan-tities, which will be needed in the derivation of the hydrodynamic limit. For two probability measures
µ
N, ν
N onX
N, the space of all particle configurations
η
(
·
)
, we define the relative entropyH
(
µ
N|
ν
N) =
X
η∈XNµ
N(
η
(
·
)) log
µ
N(
η
(
·
))
ν
N(
η
(
·
))
.
We will writeµ
N0 for the probability distribution of
η
0(
·
)
, i.e. the initial distribution of the Markov processη
t(
·
)
andµ
Nt for the distribution ofη
t(
·
)
, i.e.µ
Nt=
P
t∗µ
N0where
P
t∗ is the generated semigroup ofL
∗. The relative entropy between two states is monotonically decreasing under the action ofP
t∗,
i.e.H
(
µ
Nt|
ν
tN)
≤
H
(
µ
N0|
ν
0N)
.
In particular, the relative entropy
H
(
µ
Nt
|
ν
wN)
to the equilibrium distributionν
wN is monotonicallyde-creasing and therefore the entropy production is always positive:
−
d
d
t
H
(
µ
N
t
|
ν
w)
≥
0
.
A related object is the Dirichlet form
D
(
µ
N)
of a probability distribution defined throughD
(
µ
) :=
−
X
η∈XNf
(
η
)(
Lf
)(
η
)
ν
wN(
η
(
·
))
f
(
η
) =
s
µ
N(
η
)
ν
N w(
η
)
.
(21)Because
L
is reversible with respect toν
Nw
,
D
(
µ
)
is a symmetric quadratic form as a function off
(
η
) =
q
µN(η)
νN
w(η). We define
D
here as a function of the probability measureµ
N, because we will only need it in this form. Furthermore, we will write
D
(
µ
) =
N
2Dexcl
(
µ
) +
Dreact
(
µ
)
for the exclusion and reaction process parts. The Dirichlet form of any probability probability distribution is always bounded from above by the entropy production:
D
(
µ
N)
≤ −
d
d
t
t=0H
(
µ
N t|
ν
w)
.
Thus, the following entropy-dissipation inequality holds true.
Proposition 2.8. The relative entropy and the Dirichlet form satisfy the inequality
H
(
µ
Nt|
ν
wN) +
Z
t 0D
(
µ
Ns)d
s
≤
H
(
µ
N0|
ν
wN)
.
In particular, ifH
(
µ
N 0|
ν
w)
≤
CN
d, thenH
(
µ
Nt|
ν
w)
≤
CN
dZ
t 0D
excl(
µ
Ns)d
s
≤
CN
d−2.
(22)3
The Hydrodynamic limit
In this section, we show convergence of the reaction-exclusion process
Lf
(
η
) =
X
x,|e|=1N
2D
iη
i(
x
) 1
−
ηi(xM+e)f
η
x,xi +e)
−
f
(
η
)
+
M
X
x,rg
rf w(
η
(
x
))
f
η
γr x−
f
(
η
)
+
g
rbw(
η
(
x
))
f
η
−γr x−
f
(
η
)
(23)to the reaction-diffusion equation
˙
c
= div(
D
∇
c
)
−
RX
r=1(
α
r−β
r)
κ
f wrc
αr(
1
−
c
)
βr−
κ
bw rc
βr(
1
−
c
)
αr (24)with
D
= diag(
D
1, . . . , D
I)
and1
= (1
, . . . ,
1)
using the entropy method of Guo, Papanicolaou and Varadhan [GPV88]. The central object in this approach is the empirical measureπ
N=
1
M N
dX
x∈Td Nη
(
x
)
δ
x/N (25)that can be associated to any particle configuration
η
(
·
)
∈ XN
. We will denote byM
I+
=
π
= (
π
1, . . . , π
I|
π
iis a positive measure onT
d,
the space of vector-valued positive measures onT
dand writeh
π
N,
G
i
=
1
M N
dX
x∈TdN IX
i=1η
i(
x
)
G
i(
x/N
)
for the pairing of a measure
π
N with a functionG
∈
C
2(
T
d,
R
I)
.
The central theorem of this section is the following.Theorem 3.1. Let
G
∈
C
1,2([0
, T
]
×
T
d;
R
I)
and let the initial distributionµ
N0 satisfy for all
δ >
0
that
lim sup
N→∞P
µN 0h
π
N0,
G
(0
,
·
)
i − h
c
(0
,
·
)
,
G
(0
,
·
)
i
> δ
= 0
for somec
(0
,
·
)
∈
C
(
T
d,
R
I+)
. Furthermore, let the entropy of the initial distributionµ
N0 be boundedby
H
(
µ
N0|
ν
w)
≤
CN
d.
Then, for everyδ >
0
,lim sup
N→∞P
µNZ
T 0h
π
Nt,
G
(
t,
·
)
i − h
c
(
t,
·
)
,
G
(
t,
·
)
i
d
t
> δ
= 0
.
wherec
(
t, x
)
solves the reaction-diffusion equation (24).The theorem states that if the initial distribution
µ
N0 satisfies an entropic bound and the empirical
measure
π
N0 converges to an initial profile
c
0(
x
)
, thenπ
Nt converges in a weak sense toc
(
t, x
)
.
For a scalar reaction diffusion model, such a hydrodynamic limit was first derived in [DFL85, DFL86] using the BBGKY hierarchy for correlation functions. Below, we will introduce the main steps of the entropy method for the derivation of the hydrodynamic limit in the vector-valued case. The proof of Theorem (3.1) will be given at the end of this section.The first step consists in showing that the sequence
π
Nt of empirical measures is compact in a suitable space. As explained in [KiL98], Chapters 4 and 5, the Skorohod space
D
([0
, T
]
,
M
I+
)
of cadlag trajectories
π
·: [0
, T
]
→ M
I+, equipped with the Skorohod topology, is a possible choice. Itturns
D
([0
, T
]
,
M
I+
)
into a complete separable metric space. LetP
N be the sequence of probabilitymeasures on
D
([0
, T
]
,
M
I+
)
induced by the empirical measuresπ
Nt , i.e.P
N(
O
) =
P
(
π
N·∈ O
)
.
On that space, the following compactness result holds true.Proposition 3.2. The sequence of probability measures
P
N is tight and therefore relatively compact in the space of probability measures onD
([0
, T
];
M
I+
)
(endowed with the weak topology for probabilitymeasures).
We refer to [KiL98, Chapter 4] for a detailed exposition of the proof. Given the existence of a limit probability measure
P
∗ onD
([0
, T
];
M
I+)
, the entropy method proceeds in showing that the proba-bility measureP
∗is concentrated on pointsπ
t∈ D
([0
, T
]
,
M
I+)
that are weak solutions of (24). Theprincipal idea is to decompose the pairing
h
π
Nt,
G
(
t,
·
)
i
into a predictable part and a martingale part and to show that the martingale part converges to zero, while the predictable part must converge to a weak solution of (24).More precisely, to every time-dependent real-valued function
f
(
t,
η
t)
of a Markov processη
t on a finite state space, one can associate the two martingalesM
tf andN
tf defined throughM
tf=
f
(
t,
η
t)
−
f
(0
,
η
0)
−
Z
t 0(
∂
s+
L
)
f
(
s,
η
s)d
s
N
tf=
M
tf2
−
Z
t 0Lf
2(
s,
η
s)
−
2
f
(
η
s)
Lf
(
s,
η
s)d
s
(26) We will callM
f t:=
Z
t 0Lf
2(
s,
η
s)
−
2
f
(
s,
η
s)
Lf
(
s,
η
s)d
s
(27) the (predictable) quadratic variation ofM
tf. We prove in Appendix B that the two functionsM
tf andN
tf are indeed martingales, see also [Pro05] for more details on quadratic variations of cadlag mar-tingales. We us the martingale decomposition for the linear functionf
(
t,
η
) =
h
π
,
G
(
t,
·
)
i
withG
∈
C
1,2([0
, T
]
×
T
d;
R
I)
. Let∆
N denote the discrete Laplacian(∆
NG
)(
x
N
) :=
N
2X
e:|e|=1G
(
x
+
e
N
)
−
G
(
x
N
)
.
Lemma 3.3. The martingale decomposition ofh
π
Nt
,
G
(
t,
·
)
i
is given byh
π
Nt,
G
(
t,
·
)
i
=
h
π
N0,
G
(0
,
·
)
i
+
Z
t 0h
π
Ns, ∂
sG
i
+
L
h
π
Ns,
G
i
d
s
+
M
G t.
withL
=
N
2Lexcl
+
Lreact
andN
2L
exclh
π
Ns,
G
(
s,
·
)
i
=
M
−1N
−d IX
i=1X
xD
iη
s,i(
x
)∆
NG
i(
s,
x
N
)
L
reacth
π
Ns,
G
(
s,
·
)
i
=
N
−dX
x,r(
G
(
s,
Nx)
·
γ
r)
g
rf w(
η
)
−
g
rbw(
η
)
.
(28)The quadratic variation of both terms equals
M
tGexcl=
1
M
2N
2d−2Z
t 0X
x,e,iD
iE
[
η
s,i(
x
)]
G
i(
s,
xN+e)
−
G
i(
s,
Nx)
2
d
s
M
tGreact=
1
M N
2dZ
t 0X
x,rE
h
g
rf w(
η
s(
x
))+
g
rbw(
η
s(
x
))
i
G
(
s,
Nx)
·
γ
r2
d
s.
(29)Proof. We need to calculate
(
Lf
)(
η
)
forf
(
t,
η
) =
h
π
,
G
(
t,
·
)
i
. Sincef
(
t,
η
x,xi +e)
−
f
(
t,
η
) =
1
M N
d(
G
it,
x+e N)
−
G
it,
x N))
one obtainsL
exclh
π
Ns,
G
(
s,
·
)
i
=
1
M N
dX
x,e,iD
iη
s,i(
x
) 1
−
ηs,i(x+e) M(
G
i(
t,
xN+e)
−
G
it,
Nx))
=
1
M N
d−2 IX
i=1X
xD
iη
s,i(
x
)∆
NG
i(
s,
x
N
)
.
In going from the first to second line, it was used that the terms with prefactor
η
s,i(
x
)(1
−
ηs,i(Mx+e))
cancel each other. The form of the reaction partL
reactπ
N s,
G
(
s,
·
)
, follows fromf
(
t,
η
γr x)
−
f
(
t,
η
) =
G
t,
Nx·
γ
r=
IX
i=1G
it,
Nxγ
ri.
It remains to derive the expressions for the quadratic variation. A general Markov jump process of the form
Lf
(
η
) =
X
η0
r
(
η, η
0)(
f
(
η
0)
−
f
(
η
))
always satisfies the identity
2
f
(
η
)(
Lf
)(
η
)
−
(
Lf
2)(
η
) =
X
η0
r
(
η, η
0)
f
(
η
0)
−
f
(
η
)
2
.
This proves the form (29) for the quadratic variation.
The important point about the expression for the quadratic variation is that it vanishes for
N
→ ∞
. SinceG
i(
s,
xN+e)
−
G
i(
s,
Nx)
2
≤
C
(
G
)
N
−2for
G
∈
C
1,2([0
, T
]
×
T
d;
R
I)
and since all the other terms inside the sums of (29) are of orderO
(1)
, the quadratic variation decays on the orderM
G t,react+
M
G t,diff≤
C
·
N
−d.
As a result, by Doob’s martingale inequality, one can conclude that
P
µNsup
t≤T|
M
tG|
> δ
≤
4
δ
2E
[
M
GT]
≤
4
C
δ
2N
d.
In summary, the above reasoning shows the following proposition.Proposition 3.4. For every
δ >
0
andG
∈
C
1,2([0
, T
]
×
T
d;
R
I)
lim
N→∞P
µNsup
t≤Th
π
Nt,
G
(
t,
·
)
i − h
π
N0,
G
(0
,
·
)
i −
Z
t 0h
π
Ns, ∂
sG
i
−
L
reacth
π
Ns,
G
(
s,
·
)
i −
L
exclh
π
Ns,
G
(
s,
·
)
i
d
s
> δ
= 0
.
(30) The next natural step would be to pass to the limitπ
Nt→
π
tinside the supremum. However, the partL
reacth
π
Ns,
G
(
s,
·
)
i
cannot be written as a function of the empirical measureπ
Ns , since it contains the functionsg
f w/bwr(
η
)
which are nonlinear inη
(
x
)
.
The remedy to this problem is the replacement lemma. which is the crucial part of the entropy method. In order to state it we need to introduce some notation. For a microscopic functionψ
:
N
M→
R
,
letΨ(
c
)
denote the average ofψ
(
n
)
with respect to the binomial distributionν
c:Ψ(
c
) :=
E
νc[
ψ
(
n
)]
.
Furthermore, let
η
l(
x
)
be the block averageη
l(
x
) =
1
M
(2
l
+ 1)
dX
|y−x|≤l
η
(
y
)
(31)of a configuration
η
(
·
)
over a cube of lengthl
. Letτ
xdenote the shift operator onX
N defined by(
τ
xη
)(
y
) :=
η
(
y
+
x
)
.
Theorem 3.5. (Replacement lemma) For every
δ >
0
and every functionψ
:
N
M→
R
lim sup
ε→0lim sup
NP
µNZ
T 01
N
dX
x∈TdNV
εN(
τ
xη
s)d
s
≥
δ
= 0
(32) whereV
l(
η
) =
1
(2
l
+ 1)
dX
|y|≤lψ
(
η
(
y
))
−
Ψ(
η
l(0))
.
The replacement lemma says, that one can replace a term of the form
Z
T 0N
−dX
x∈Td NF
(
s,
Nx)
ψ
(
η
s(
x
))d
s
by the Block average
Z
T0
N
−dX
x∈TdN
F
(
s,
Nx)Ψ(
η
ls(
x
))d
s
for large
N
andl
without making a large error. The replacement lemma states in fact, that this error is arbitrarily small forl
=
εN
andN
→ ∞
forε
small enough. We refer to [Spo91, Lemma II.3.4] for a proof of the replacement that also applies to our model or alternatively to [KiL98, Lemma 5.1.10] where the replacement lemma is shown under more general assumptions. The proof of the replacement lemma is entirely based on the following entropy bounds that were derived for the reaction-exclusion -process in Proposition 2.8:H
(
µ
Nt|
ν
wN)
≤
CN
d andZ
t0