für Angewandte Analysis und Stochastik
Leibniz-Institut im Forschungsverbund Berlin e. V.
Preprint
ISSN 2198-5855
A large-deviations approach to gelation
Luisa Andreis
1, Wolfgang König
1,2, Robert I. A. Patterson
1submitted: January 7, 2019 1 Weierstrass Institute Mohrenstr. 39 10117 Berlin Germany E-Mail: [email protected] [email protected] [email protected]
2 Technische Universität Berlin
Institut für Mathematik Straße des 17. Juni 136 16023 Berlin
Germany
No. 2568 Berlin 2019
2010 Mathematics Subject Classification. 05C80, 60F10, 60K35, 82B26.
Key words and phrases. Coagulation process, multiplicative coalescent, gelation, phase transition, large deviations, Erd ˝os-Rényi random graph.
This research has been partially supported by the Deutsche Forschungsgemeinschaft (DFG) through grant CRC 1114 “Scaling Cascades in Complex Systems”, Project C08.
Weierstraß-Institut für Angewandte Analysis und Stochastik (WIAS) Leibniz-Institut im Forschungsverbund Berlin e. V.
Mohrenstraße 39 10117 Berlin Germany
Fax: +49 30 20372-303
E-Mail:
[email protected]
Luisa Andreis, Wolfgang König, Robert I. A. Patterson
Abstract
A large-deviations principle (LDP) is derived for the state, at fixed time, of the multiplicative coalescent in the large particle number limit. The rate function is explicit and describes each of the three parts of the state: microscopic, mesoscopic and macroscopic. In particular, it clearly captures the well known gelation phase transition given by the formation of a particle containing a positive fraction of the system mass at timet = 1. Via a standard map of the multiplicative coalescent onto a time-dependent version of the Erd ˝os-Rényi random graph, our results can also be rephrased as an LDP for the component sizes in that graph. Our proofs rely on estimates and asymptotics for the probability that smaller Erd ˝os-Rényi graphs are connected.
1
Introduction
Smoluchowski introduced a (deterministic) ODE model for the concentrations of coagulating particles in the course of his work on Brownian motion [vS16]. This was motivated by an underlying concept of microscopic dynamics similar to the mean-field limit. Flory [Flo41a] investigated the distributions of polymer sizes and connectivity structures in order to understand the physical properties of these (at that time relatively new) materials. As is now well known the particle and random graph models are almost equivalent, but unlike Smoluchowski, Flory was interested in the phase transition that occurs when a giant particle/connected component of the graph forms, which he called a gel, terminology which we shall adopt.
In this paper, we study one of the simplest stochastic coagulation models and give a complete description by means of a powerful mathematical theory, the large-deviations theory. This turns out to be equivalent to the description, in the large-deviations framework of the statistics of connected components in the sparse regime for the Erd ˝os-Rényi random graph, i.e.
G(N, p)
whenp
∼
N1.
We provide a new analysis to the gelation phase transition via the large-deviations rate function. This then provides an alternative proof of the uniqueness of the giant component in sparse Erd ˝os-Rényi random graphs.1.1
A non-spatial mean-field coagulation model
In this paper, we study a stochastic coagulation process, called the Marcus–Lushnikov process, see [Mar68, Gil72, Lus78]. This is a continuous-time Markov process of vectors of particle masses
M
(N )i
(t)
∈ N
at timet
∈ [0, ∞)
, arranged in descending order:M
(N ) 1(t)
≥ M
(N ) 2(t)
≥ M
(N ) 3(t)
≥ · · · ≥ M
(N ) n(t)(t)
≥ 1,
n(t)X
i=1M
(N ) i(t) = N,
(1.1)for some parameter
N
∈ N
. This process is specified by the initial configuration, which we take in the monodisperse case, i.e.M
(N )i
(0) = 1
for alli = 1, . . . , N = n(0)
, and by the transition mechanism, which is given in terms of a symmetric, non-negative coagulation kernelK
N: N
× N → [0, ∞)
. That is, we start withN
particles of unit mass at time 0, and in the course of the process, each (unordered) pair of particles with respective massesm, e
m
∈ N
coagulate to a particle of massm + e
m
with rateK
N(m, e
m)
, independently of all the other pairs of particles.If the dependence of the coagulation kernel
K
N onN
is chosen so thatK
N(m, e
m)
≍ 1/N
for fixedm, e
m
asN
→ ∞
, then any given pair of finitely sized particles coagulates after a time≍ N
and so each particle undergoes a coagulation with some other particle after an order 1 time, since it is in contact with≍ N
other particles. This is a mean field interaction in which every particle in a large population has contact with every other on an equal basis. In this limit it is reasonable to writel
k(t) = lim
N→∞ N1#
{
particles of sizek
at timet
}
, then under suitable conditions these limits satisfyd
dt
l
k(t) =
1
2
X
m, em : m+ em=kl
m(t)l
me(t)K(m, e
m)
− l
k(t)
X
ml
m(t)K(k, m)
∀k ∈ N
(1.2)where
K(m, e
m) = lim
N→∞N K
N(m, e
m)
. This is the Smoluchowski equation [vS16] referred to above. In this paper, we exclusively study the case of the multiplicative kernel,K
N(m, e
m) = m e
m/N
. This choice has the two interesting features: (1) it can be mapped onto a natural time-dependent version of the Erd ˝os-Rényi random graph [ER61], and (2) it exhibits an interesting phase transition in the limitN
→ ∞
at timet = 1
, because a gel, i.e., a particle of macroscopic size, appears. Our main goal in this paper is to recover the gelation phenomenon in rather explicit terms through a large-deviations principle (LDP). Moreover, we will be able to describe the large-deviations of all parts of the particle model, the microscopic, mesoscopic and macroscopic parts. As a consequence, the above mentioned phase transition as well as the solution of the Smoluchowski ODE will be clear from our formulas and will be given a new interpretation in terms of combinatorial structures. Indeed, we will analyse the joint distribution of the microscopic and the macroscopic empirical measure of the particle sizes, and we will also deduce interesting information about the mesoscopic part of the configuration. We will keep the timet
∈ (0, ∞)
fixed and consider only the limit asN
→ ∞
.There is a well-known description of the distribution of the coagulation process that we study in the present work at time
t
in terms of the well-known Erd ˝os-Rényi random graph onN
nodes with edge probability1
− e
−t/N, see the review [Ald99]. More precisely, the joint distribution of all the cluster sizes in this graph is equal to the distribution of the particle sizes,(M
(N )i
(t))
i. Hence, the formation of a gel in the coagulation process is equivalent to occurrence of percolation in the graph, i.e., formation of a giant component, see the classic reference [Bél01]. The connection between the two models will be the starting point of our analysis and will be recalled at the beginning of Section 2.1. Because of this connection, our results give a new contribution to the theory of the Erd ˝os-Rényi graph in terms of an LDP for connected component statistics in the sparse regime. These kind of asymptotic results were not previously available in random graph theory, even though there are large-deviations results of various types, see Section 1.5 for an overview.1.2
Our results: Large-deviations principles
In this section, we present all our results on the exponential behaviour of distributions of the main charac-teristics of the Marcus–Lushnikov model. In Section 1.3 we will draw conclusions about the gelation phase transition from that.
For
N
∈ N
we consider the state spaceS
N=
n
(m
i)
i=1,...,n∈ N
n: n
∈ N, n ≤ N, m
1≥ m
2≥ m
3≥ · · · ≥ m
n≥ 1,
nX
i=1m
i= N
o
(1.3)of tuples of positive integers summing to
N
, ordered in a decreasing way. Starting from the initial configura-tionM
(N )(0) = (1, . . . , 1)
∈ S
N, the Markov process
(M
(N )(t))
t∈[0,∞) specified in the previous section has the generatorL
Nf (m
i)
i=1,...,n=
1
2
nX
i,j=1 i6=jK
N(m
i, m
j)
h
f ( e
m
(i,j))
− f (m
i)
i=1,...,ni
,
where
m
e
(i,j)is the collection of then
− 1
numbersm
kwith
k
6= i, j
andm
i+ m
j, properly ordered. Here we restrict to the multiplicative kernel, i.e.K
N(m, e
m) = m e
m/N
. That is, each pair of distinct particles coagulates after an exponential time with meanN/m e
m
wherem
andm
e
are the particle masses. Coagula-tion means that the two particles are replaced by one particle of massm + e
m
. All the exponential waiting times are independent. Then the numbern(t)
of particles at timet
is a decreasing function oft
, and with probability one, it reaches the value one in finite time.We denote the probability and expectation by
P
N andE
N, respectively. We fix a time horizont
∈ (0, ∞)
and describe the distribution of the particle masses
M
(N )(t)
in (1.1) in the limitN
→ ∞
in terms of a large-deviations principle. It will be convenient to work with the empirical measures of the particle masses in the microscopic and macroscopic size ranges:Mi
(N )(t) =
1
N
n(t)X
i=1δ
M(N ) i (t) andMa
(N )(t) =
n(t)X
i=1δ
1 NM (N ) i (t).
(1.4) Intuitively, whileMi
(N )(t)
registers the numbers of particles of “microscopic” sizes 1,2,3,... on the scaleN
, in contrastMa
(N )(t)
registers the numbers of particles of “macroscopic” sizes of orderN
. Even though each of the two measures admits a one-to-one map onto the vector(M
(N )i
(t))
i for fixedN
∈ N
, in the limitN
→ ∞
, for topological reasons, they will be able to describe only the statistics of the microscopic, respectively macroscopic, part of the particle configuration. For a full description, a kind of mesoscopic part has to be considered, but this is a more complicated issue, which we defer.Mi
(N )(t)
is a random element of the set
N =
S
c∈[0,1]N (c)
of measures onN
that have an integralagainst the identity not larger than one, where
N (c) =
n
λ
∈ [0, ∞)
N:
X
k∈Nkλ
k= c
o
,
c > 0.
(1.5)We equip
N
with the topology of coordinate-wise convergence, which is compact by the Bolzano-Weierstrass theorem combined with Fatou’s lemma.Ma
(N )(t)
is a random element of the setM
N0=
S
c∈[0,1]M
N0(c)
, whereM
N0(c) =
n
α
∈ M
N0((0, 1]) :
Z
(0,1]x α(dx) = c
o
,
(1.6)and
M
N0((0, 1])
is the set of all measures on(0, 1]
with values inN
0=
{0} ∪ N
. We equipM
N0 with the topology that is induced by functionals of the formµ
7→
R
(0,1]f (x) µ(dx)
wheref : (0, 1]
→ R
is continuous and compactly supported. We always write the elements ofM
N0(c)
asα =
P
jδ
αj with1
≥ α
1≥ α
2≥ · · · > 0
andP
jα
j= c
, wherej
extends over a finite subset ofN
or overN
. Then convergence is equivalent with the pointwise convergence of each of the atoms. By similar arguments as forN
, alsoM
N0 is compact.Note that the microscopic and the macroscopic total masses
P
kkMi
(N )k(t)
andR
(0,1]x Ma
(N )(t)(dx)
are each equal to one, hence indeedMi
(N )(t)
∈ N (1)
andMa
(N )(t)
∈ M
N0(1)
. However the functionsλ
7→ c
λ:=
X
k∈Nkλ
k andα
7→ c
α:=
Z
(0,1]x α(dx)
are not continuous, but only lower semicontinuous in the respective topologies.
We equip the product of
N
andM
N0 with the product topology, so that it is also compact.Our main result is the following description of the two empirical measures in terms of a joint large-deviations principle (LDP).
Theorem 1.1 (LDP for the empirical measures). Fix
t
∈ [0, ∞)
. Then, asN
→ ∞
, the pair(Mi
(N )(t), Ma
(N )(t))
satisfies a large-deviations principle with speed
N
and rate function(λ, α)
7→ I(λ, α; t) =
(
I
Mi(λ; t) + I
Ma(α; t) + (1
− c
λ− c
α)
t 2− log t
,
ifc
λ+ c
α≤ 1,
∞
otherwise, whereI
Mi(λ; t) =
∞X
k=1λ
klog
k!tλ
ke k
k−2+ c
λ1 +
t
2
− log t
,
c
λ=
∞X
k=1kλ
k,
(1.7)I
Ma(α; t) =
Z
1 0h
x log
x
1
− e
−tx+
t
2
x(1
− x)
i
α(dx),
c
α=
Z
(0,1]x α(dx).
(1.8)The proof of this theorem is in Section 3. Let us recall the notion of an LDP: Theorem 1.1 says that, for any open set
G
⊂ N × M
N0 respectively closed setF
⊂ N × M
N0,lim inf
N→∞1
N
log P
N((Mi
(N )(t), Ma
(N )(t))
∈ G) ≥ − inf
GI(
·; t),
lim sup
N→∞1
N
log P
N((Mi
(N )(t), Ma
(N )(t))
∈ F ) ≤ − inf
FI(
·; t).
For the theory of large-deviations, see e.g. [DZ10]. It is not difficult to see that the rate function
I(
·, ·; t)
is lower semicontinuous. SinceN × M
N0 is compact, it is even a good rate function, i.e., its level sets{(λ, α): I(λ, α; t) ≤ r}
are compact for anyr
.From our main result, the LDP in Theorem 1.1, a number of other LDPs follow via the contraction principle (which says that if a random variable satisfies an LDP, so does its image under a continuous transformation). Let us begin with the particle size distribution of the microscopic part.
Corollary 1.2 (LDP for particle size statistics). Fix
t
∈ [0, ∞)
. Then, asN
→ ∞
,Mi
(N )(t)
satisfies an LDP with rate functionI
Mi(
·; t): N → [0, ∞]
, given byI
Mi(λ; t) = inf
α∈MNI(λ, α; t) = I
Mi(λ; t)
− (1 − c
λ)
log
1
− e
(cλ−1)t1
− c
λ−
c
λt
2
.
(1.9)The first equality is the contraction principle [DZ10]; the second equality is checked in Lemma 4.1. In the same way one can investigate the macroscopic part of the system.
Corollary 1.3 (LDP for macroscopic particles). Fix
t
∈ [0, ∞)
. Then, asN
→ ∞
,Ma
(N )(t)
satisfies an LDP with rate functionI
Ma(
·; t): M
N0→ [0, ∞]
, given byI
Ma(α; t) = inf
λ∈NI(λ, α; t)
= I
Ma(α; t) + (1
− c
α)
t
2
− log t
+ C
α,tlog(tC
α,t)
−
t
2
C
α,t,
(1.10) whereC
α,t= (1
− c
α)
∧
1t (recallc
α=
R
01x α(dx)
).Only the second equality has to be checked; this is done in Lemma 4.2.
Hence, we can hope to derive a phase transition from non-existence to existence of a gel, i.e., to a non-trivial macroscopic part, at
t = 1
, from the rate functions. However, even though it seems as if this phenomenon is present only in the macroscopic rate functionI
Ma, actually it has its origin in the discussion of the existence of minimizersλ
for the microscopic rate functionI
Mi in Corollary 1.2; this is the content of Theorem 1.5.Now we come to the mesoscopic part of the particle configuration. Since this part comprises particle sizes on all the scales between finite and
O(N )
, it makes no sense to consider an empirical measure. Instead, we consider only the total mass of the mesoscopic part. Letε > 0
andR
∈ N
, we define the(R, ε)
-mesoscopic total mass asMe
(N )R,ε(t) =
1
N
X
i : R<Mi(N )(t)<εNM
(N ) i(t).
(1.11)The mesoscopic total mass in a strict sense arises after taking the limits
N
→ ∞
, followed byε
↓ 0
andR
→ ∞
, but this does not define a random variable. However, it is possible to calculate an LDP in theN
→ ∞
limit and then to study the rate function,J
Me(R,ε), asε
↓ 0
andR
→ ∞
. On the other hand, the proof of Theorem 1.1 shows that it is possible to define a coupled mesoscopic total massMe
(N )RN,εN(t)
, for any diverging sequenceR
N and vanishing sequenceε
N. This is a well-defined random variable and it satisfies an LDP.Corollary 1.4 (LDP for mesoscopic mass). Fix
t
∈ [0, ∞)
.1 Then, for any
R
∈ N
andε
∈ (0, 1)
, asN
→ ∞
,Me
(N )R,ε(t)
satisfies an LDP with rate functionc
7→ J
(R,ε) Me(c; t)
, whereJ
(R,ε) Me(c; t) = inf
n
I(λ, α; t) :
RX
k=1kλ
k+ c +
Z
1 εx α(dx) = 1
o
.
2 For any
R
N∈ N
andε
N∈ (0, 1)
such that1
≪ R
N< ε
NN
≪ N
, the coupled mesoscopic total massMe
(N )RN,εN(t)
satisfies an LDP with rate functionJ
Me(c; t) = (1
− c)
log(1
− c)t −
(1
− c)t
2
+
t
2
− log t.
(1.12)The function
J
Me(c; t)
is strictly increasing inc
, its minimum over[0, 1]
isJ
Me(0; t) = 0
.Hence,
J
Me(
·; t)
can rightfully be called the rate function for the mesoscopic total mass. The probability to have a non-trivial mesoscopic part decays exponentially towards zero. Interestingly, takingR
N+ 1 =
ε
nN
∈ N
, we see that already just one mesoscopic particle alone satisfies the same LDP as the entire(R, ε)
-mesoscopic total mass in the limitR
→ ∞
,ε
↓ 0
.Corollary 1.4 part (1) is a simple consequence of the contraction principle, as the maps
λ
7→
P
Rk=1kλ
k andα
7→
R
ε1x α(dx)
are continuous. Assertion (2) follows as a byproduct of our proof of Theorem 1.1 in Section 3.1.3
Our results: Phase transitions
Now we proceed with the main phenomenon in the Marcus–Lushnikov model: the gelation phase transition. We will deduce it from our large-deviations rate functions from Section 1.2. The LDPs and the identification of their strict minimiser(s) lead to laws of large numbers for a number of random quantities.
Consider the following functions of the total masses of the microscopic and macroscopic particles respec-tively:
J
Mi(c; t) = inf
λ∈N (c)
I
Mi(λ; t)
andJ
Ma(c; t) =
α∈Minf
N0(c)I
Ma
(α; t),
where
c
∈ [0, 1]
. Clearly,J
Mi(c; t) =
J
Ma(1
− c; t)
. These two functions are not entirely analogous toJ
Me(c; t)
as rate functions for the total masses of the micro and the macro part, because the total masses both ofMi
(N )(t)
andMa
(N )(t)
are equal to one. This is consistent with the fact that the contraction principle cannot be applied to total masses, as they are not continuous functions of the measures. However, they contain rather interesting information about the gelation phase transition.Theorem 1.5 (Microscopic total mass phase transition). Fix
t
∈ [0, ∞)
. 1 For anyc
∈ [0, 1]
,J
Mi(c; t) = tc + (1
− c) log
1
− c
1
− e
t(c−1)+
(
c log c
− tc
2 forc <
1 t,
−
1 2t−
t 2c
2− c log t
forc
≥
1 t.
(1.13)2 For
c
∈ (0, 1]
, the minimum ofN (c) ∋ λ 7→ I
Mi(λ; t)
is attained precisely atλ
∗(c; t)
∈ N (c)
given byλ
∗k(c; t) =
k
k−2
c
kt
k−1e
−ctkk!
,
k
∈ N,
(1.14)and the minimum of the function
c
7→ J
Mi(c; t)
is attained precisely atc = 1
with valueJ
Mi(1; t) =
0
. Therefore the infimuminf
(λ,α)∈N ×MN0
I(λ, α; t)
(1.15)3 For
t
∈ (1, ∞)
, the minimum of the functionc
7→ J
Mi(c; t)
is attained atc = β
twhereβ
t∈ (0, t)
is the smallest positive solution tolog β
t= tβ
t− t.
(1.16)The infimum in (1.15) is attained precisely at
(λ, α) = (λ
∗(β
t; t), (1
− β
t, 0, 0, . . . ))
.The proof is found in Section 4.2.
Theorem 1.5 implies the well-known phase transition at
t = 1
because the derivative of the minimiser jumps at this point. Combining Theorem 1.5 with the LDP in Theorem 1.1 one has the following law of large numbers:Mi
(N )(t), Ma
(N )(t)
N=
→∞⇒
(
(λ
∗(1; t), 0)
ift
≤ 1,
(λ
∗(β
t; t), (1
− β
t, 0, . . . ))
ift
≥ 1,
and can check for
t
≤ 1
thatλ
∗(1; t)
is the exact solution of (1.2), the Smoluchowski equation, also given in [Ald99, Table 2]. One also sees that the cut-off versions of the total masses,P
Rk=1kMi
(N )k
(t)
andR
[ε,1]x Ma
(N )(t)(dx)
, converge towards the respective cut-off versions of the limits, and their limits asR
→ ∞
andε
↓ 0
are(1, 0)
fort
≤ 1
and(β
t, 1
− β
t)
fort
≥ 1
.1.4
Literature remarks
It was expected for a long time that the empirical measure of the masses from the Marcus–Lushnikov pro-cess converge in a weak sense to a solution of the Smoluchowski ODEs. The first rigorous convergence result of this kind is due to Lang and Nguyen [LN80], but Lushnikov [Lus78] provided a more informal justifi-cation. In the case of a multiplicative kernel,
K(m, e
m) = cm e
m
for allm, e
m
and a proportionality factorc
, Smoluchowski’s ODEs exhibit mass loss after a critical time (which is equal to1/c
); a feature that cannot be reproduced by any Marcus–Lushnikov process for finiteN
. Lushnikov however realised that at large enough timesM
1(N )(t)
≍ N
, that is: a macroscopic particle or gel forms and the Smoluchowski ODEs are not able to describe it. On the other hand the Smouchowski ODEs can be augmented by an equation for the size of the gel, this takes into account the gelation phase transition and convergence has been later proved for the empirical measure plus the rescaled gel mass by Norris [Nor00].In [HSES85], it is noted that the representation of the solutions of the master equation1 from Lush-nikov [Lus78] can be written in product form as
P
M
(N ) 1(t) = m
1, M
2(N )(t) = m
2, . . .
=
1
Z
NY
iϕ
N(m
i),
(m
i)
i∈ S
N,
(1.17)for some
Z
N> 0
and positive functionϕ
N. Actually, this is true for any coagulation kernel that can be written asK(m, e
m) = mf ( e
m) + f (m) e
m
for anym, e
m
, for some positive functionf
; see [BP90]. Necessary conditions for (1.17) to hold are given by Granovsky and Kryvoshaev [GK12].Building on the product structure in (1.17), Buffet and Pulé [BP90, BP91] make precise and rigorous the insight of Lushnikov regarding the formation of a gel in the limit
N
→ ∞
. Their main result is the existence of the gelation phase transition, for the kernelK
N(m, e
m) = m e
m/N
, somewhere in the time interval1The master equation is the Kolmogorov forward equation, that is, the ODE for the time marginalsν(N)
t of the law ofM(N). It is given bydtdν(N ) t =L † Nν (N ) t , whereL † Nis the adjoint ofLN.
[log 2, 1]
, by exclusively looking at the macroscopic particles and deriving estimate for the expected values of their sizes. In arriving at these estimates they use bounds on the solution of the master equation derived from the recursive representation going back to Lushnikov [Lus78], which are equivalent to our limits derived via random graph arguments in (2.5) below. They do not prove large-deviations upper bounds for the state of the Marcus–Lushnikov process. In a later work, providing more precise characterisation of the gel, but still at a formal level, Lushnikov [Lus04] picks up on this idea talking about a particular quantity playing the role of a “free energy”. We also exploit this product structure in the present work along with connections to random graph theory.1.5
Large-deviations for Erd ˝os-Rényi random graphs
As noted in the introduction, the distribution at time
t
of the Markus–Lushnikov process with multiplica-tive kernel is closely related to that of the connected component sizes for the Erd ˝os-Rényi random graphG(N, 1 − e
−t/N)
. This correspondence was not mentioned in [BP90], but was discussed one year later in [BP91], which highlights the connection between gelation in the coagulation process and the phase transition given by the formation of a giant connected component in the Erd ˝os-Rényi random graph [ER60].Our analysis and results are both closely connected to the study of Erd ˝os-Rényi random graph
G(N, 1 −
e
−t/N)
, more precisely with the large-deviation properties of the sizes of all the components asN
→ ∞
. The literature does not contain many results in this respect for sparse graphs, that isG(N, p)
withp
∼
1/N
. An LDP for the size of the largest component has been found [O’C98], some results dealing with the macroscopic components [Puh05] and the degree distribution are available [BC15]. Under assumptions that imply at a minimump
≫ N
−12 recent progress has been made on the upper tails of sub-graph counts[CD16, BCCL18, Aug18, CD18]. In the case of dense graphs (fixed
p
∈ (0, 1)
) there is a complete treatment thanks to Chatterjee and Varadhan [CV11], see [Cha16] for an overview.1.6
Pathwise large-deviations
In this work we focus on large-deviations for the coagulation process at a single time
t
starting with an initial state composed entirely of monomers, that is, with no prior coagulation. One could seek to derive an LDP for the paths of the coagulation process on a compact time interval. For this, one would have to extend the combinatorial and asymptotical work of the present paper to general starting configurations, and use the Markov property to prove LDPs for the finite dimensional distributions. One could then use a projective limit argument augmented by a path space exponential tightness result (see for example [FK06, Chapter 4]). We consider this programme doable, but cumbersome, and therefore decided to defer it to future work.Such an LDP would be in the spirit of the well-known Wentsel–Freidlin theory, and there are already a number of results of this type in the literature. For the coalescent process an LDP has been derived formally [MPPR17, Thm 3.4] following the non-linear semi-group approach of [FK06]. This type of results yield a formula for the rate function that is much less explicit and rather different from those that an extension of the present work would yield. On the other hand the less explicit approach is applicable to a wide range of Markov processes.
function for the entire path of the microscopic part of the system
[0, T ]
∋ t 7→ λ(t)
that can be written asI
Mi(λ) =
Z
T 0L λ(t), ˙λ(t)
dt,
where
L
is the solution of variational problem. In order to derive, via the contraction principle, a formula for the rate function forλ(t)
at a fixed timet
, one observes that all paths where the rate function is finite are continuous (in fact a.e. differentiable), and obtains, for configurationsλ
,I
Mi(λ; t) =
inf
c : c(t)=λZ
T 0L c(s), ˙c(s)
ds.
Solving this multilayered optimisation problem seems to require major work, and identifying the right-hand side with our formula in (1.9) remains an intriguing problem.
1.7
Comparison to Bose-Einstein condensation without interaction
Our large-deviations approach to the Marcus–Lushnikov models shows remarkable similarities to another well-known phase transition in a non-spatial model, the non-interacting Bose gas. Here the situation is sim-ilar in that the gas can be conceived as a joint distribution of
N
particles that are randomly grouped into smaller units, called cycles, which can become arbitrarily large. The natural question is then, under what circumstances do macroscopic cycles arise. An explicit answer in terms of a large-deviations analysis has been given in [Ada08], where the transition, the famous Bose-Einstein Condensation (BEC) in dimensionsd
≥ 3
, is derived from the minimization of the rate function, in a way analogous to that in our Theorem 1.5. The two phase transitions differ in that the BEC transition is of saturation type, while the gelation transition is not.For the non-interacting Bose gas in the thermodynamic limit at temperature
1/β
∈ (0, ∞)
with particle densityρ
∈ (0, ∞)
the partition function is given byZ
(β) ΛN=
X
(ℓk)k∈N∈NN0: P kkℓk=NY
kN
ℓkℓ
k! k
ℓk[ρ(4πβk)
d2]
−ℓk,
where
Λ
N is the centred box inR
dwith volumeN/ρ
. The free energy per particle is thenf (β, ρ) = lim
N→∞1
N
log Z
(β) ΛN=
− inf
λ∈N (ρ)
I(λ),
whereI(λ) =
X
kλ
klog
λ
kk
(4πβk)
d2e
.
For the Marcus–Lushnikov model the equivalent quantity is the rate function
I
Mi from (1.9). The key differ-ence between the rate functions is that onlyI
Mi contains terms in the total mass of microscopic particles,c
λ. This reflects the fact that the giant particle makes a significant contribution to the rate function in the Marcus–Lushnikov model, but the condensate in the non-interacting Bose gas does not.The respective minimisers of
I
MiandI
arekλ
(ML) k(c; t) =
1
t
(cte
−ct)
kk
1−kk!
∼
1
√
2πt
(cte
−ct+1)
kk
3/2 andkλ
(BEC) k(α; β) =
1
ρ(4πβ)
d2e
−αkk
d2,
where
c
andα
control the values ofP
kkλ
k.The crucial parameters are the time
t
for the Marcus–Lushnikov model and the inverse temperatureβ
for the Bose gas. Both models have a trivial upper bound for the total microscopic mass,P
kkλ
k, namely one. One additional upper bound arises in each model from the optimisation of the rate function with respect to theλ
k, but these are not relevant, untilt
respectivelyβ
rises to its critical value. For the Marcus–Lushnikov model this bound is1/t
, becauseP
k (ctek1−k−ctk!)k≤ 1
for allct
∈ (0, ∞)
, and the summands take their maxima atct = 1
, when they correspond to the Borel probability distribution with parameter 1. Forλ
(BEC) this boundis
ρ
−1(4πβ)
−d/2P
kk
−d2. At this point we see a difference between the two models, because the totalmicroscopic mass in the Bose gas remains on this bound as
β
rises further, while for the Marcus–Lushnikov model it immediately drops strictly below the bound. This explains why BEC is known as a saturation phase transition, but this description cannot be applied to gelation.2
The distribution of the particle sizes
We fix the parameter
t
∈ (0, ∞)
. In Section 2.1, we derive, for fixedN
∈ N
, an explicit formula for the distribution of the empirical measure of the particle sizesM
(N )i
(t)
in terms of connectivity probabilities for Erd ˝os-Rényi random graphs. Furthermore, we prepare in Section 2.2 for the asymptotic analysis by giving some estimates and asymptotics for the most crucial object, the probability that a graph is connected.2.1
The connection with random graphs
Let us explain the connection between the Marcus–Lushnikov coagulation model with multiplicative kernel and a time-dependent version of the well-known Erd ˝os-Rényi graph, see [Ald99]. This will be our starting point for the identification of the joint distribution of
Mi
(N )(t)
andMa
(N )(t)
.Equip each unordered pair
{i, j}
of distinct numbers in{1, . . . , N}
with an exponentially distributed random timee
i,j with parameterN
, i.e., with expected value1/N
. All theseN (N
− 1)/2
random times are assumed to be independent. At timee
i,j a bond is created betweeni
andj
so the probability of a bond betweeni
andj
forming by timet
is1
− e
−t/N. By independence, an Erd ˝os-Rényi graphG(N, p)
with parameterp
N(t) = 1
− e
−t/N arises. If nowm
(N )i(t)
denotes the size of thei
-th largest connected component (cluster) of this graph, then we have that(m
(N )i(t))
iand(M
i(N )(t))
i are identical in distribution. (This equality is even true process-wise int
, but we are here not interested in that.) In this way, we can see the coagulation process as a function ofG(N, p
N(t))
. The fact that this description of the distribution is correct comes from the two characteristic properties of the exponential distribution: (1) it has no memory, and (2) the minimum of two independent exponential times is an exponential random time with parameter equal to the sum of the two parameters. Two particles in the coagulation process of cardinalitiesm
andm
e
at a given time have preciselym e
m
independent exponential times with parameterN
that have not yet elapsed; any elapsure of any of them would connect the two particles. Altogether, this means that these two particles coagulate with ratem e
m/N = K
N(m, e
m)
.Hence, an important quantity is
µ
(N )t
(k) = P
G(k, 1 − e
−t
N
)
is connected,
(2.1)We denote by
P
Nthe set of all partitions of{1, . . . , N}
. We writeB
i(π)
for the number of sets inπ
∈ P
N with cardinalityi
. Then we can describe the distribution of the coagulation process at timet
as follows.Lemma 2.1. For any
N
∈ N
and every(m
i)
i∈ S
N,P
N(M
(N ) i(t))
i= (m
i)
i= #
{π ∈ P
N: B
i(π) = m
i∀i} ×
Y
iµ
(N ) t(m
i)
×
Y
i6=je
−2Nt mimj.
(2.2)Proof. A set
A
⊂ {1, . . . , N}
of indices is a connected component in the graphG(N, p
N(t))
if and only if (1) no bond between any index inA
and any index outside has been connected by timet
, and (2) the subgraph formed out of the vertices inA
and all the bonds between any two vertices inA
that have been created by timet
is connected. This is the case precisely if any only ife
i,j> t
for alli
∈ A
andj
∈ A
c=
{1, . . . , N} \ A
, andA
is connected. This has probabilitye
−|A| |Ac|t/N× µ
(N )t
(
|A|)
. Applying this reasoning toA
c and describing the next cluster, and iterating this argument, shows that the product of the two products on the right-hand side of (2.2) is equal to the probability, for a given partitionπ
withm
i sets of sizei
for anyi
, that the clusters ofG(N, p
N(t))
are precisely the sets ofπ
. Since this probability depends only on the cardinalities, the counting term completes the formula.Now we rewrite the right-hand side of (2.2) in terms of the empirical measure of
(m
i)
i, i.e., of the numbersℓ
kof indicesi
such thatm
i= k
. Introduce the eventA
N,t(ℓ) =
\
k∈N{#{i: M
(N ) i(t) = k
} = ℓ
k},
ℓ = (ℓ
k)
k∈N∈ N
N 0,
(2.3)Corollary 2.2. For any
N
and anyℓ = (ℓ
k)
k∈ N
N0 satisfyingP
kkℓ
k= N
,P
N(A
N,t(ℓ)) = N !
Y
kµ
(N ) t(k)
ℓke
− t 2Nk(N−k)ℓkk!
ℓkℓ
k!
.
(2.4)Proof. Note that the last product on the right-hand side of (2.2) can also be written as
Q
ie
−t2mi(N−mi).
Hence, if
ℓ
kis equal to the number ofi
such thatm
i= k
for anyk
, then the product of the last two product can be written asY
kµ
(N ) t(k)
ℓke
− t 2Nk(N−k)ℓk.
The counting term is easily identified as
#
{π ∈ P
N: #
{A ∈ π : |A| = k} = ℓ
k∀k} =
N !
Q
k
k!
ℓkℓ
k!
.
Substituting ends the proof.
(To avoid confusion, we note that there is a typographical error in Section 4.5 of [Ald99], where the formula (2.4) appears with
e
−t2.2
The probability of being connected
Our analysis of (2.4) will depend crucially on an analysis of
µ
(N )t
(k)
. The next two lemmas collect results from [Ste70, Lemma1&2, Theorem 1].Lemma 2.3 (Bounds and asymptotics for
µ
(N )t , [Ste70]). For any
N
∈ N
and anyk
≤ N
,e
−2Nt(k
− 1)(k − 2) ≤
µ
(N ) t(k)
k
k−2(1
− e
−t N)
k−1≤ 1.
(2.5) In particular, ifk = o(
√
N )
,µ
(N ) t(k) = k
k−2t
N
k−1(1 + o(1)),
N
→ ∞.
The expression for the upper bound in (2.5) appears to be present (using somewhat applied chemical language) in [Flo41b, equation (5)]. The following is an alternative upper bound to
µ
(N )t
(k)
, which will be useful in the macroscopic setting, together with an asymptotic result for the connection probability in the so-called sparse case, where the bond probability is proportional to the inverse of the size of the graph.Lemma 2.4 ([Ste70]). For all
t > 0
andk
∈ N
µ
(N )t(k)
≤ 1 − e
−kt N
k−1.
Fix
α
∈ (0, 1)
. Then, forN
→ ∞
,µ
(N ) t(
⌊αN⌋) =
1
−
αt
e
αt− 1
1
− e
−tααN(1 + o(1)).
The assertion remains true if the bond probability
1
−e
−t/N is replaced by any sequencet
N
=
Nt(1+o(1))
.3
Proof of the LDP
In this section we prove the main result of this paper, the large-deviations principle in Theorem 1.1.
Recall the topological remarks on the two state spaces
N
andM
N0 at the beginning of Section 1.2. The metricsd
onN
andD
onM
N((0, 1])
, defined byd(λ, e
λ) =
∞X
k=12
−k|λ
k− eλ
k|
andD(α, e
α) =
∞X
i=12
−i|α
i− ˜α
i|,
(3.1)induce the respective topologies of pointwise and vague convergence. We write
B
δ(λ)
respectivelyB
ρ(α)
for theδ
-ball aroundλ
respectively for theρ
-ball aroundα
. Our main result, the LDP in Theorem 1.1, follows from the following.Proposition 3.1. Fix
t
∈ [0, ∞)
. Then, for anyλ
∈ N
andα
∈ M
N((0, 1])
,lim
δ,ρ↓0Nlim
→∞1
N
log P
NMi
(N )(t)
∈ B
δ(λ), Ma
(N )(t)
∈ B
ρ(α)
=
−I(λ, α; t).
(3.2)Proof. To each element
(m
i)
i of the state spaceS
N defined in (1.3), we associate a unique element of the spaceN
N=
n
ℓ = (ℓ
k)
k∈ N
N 0:
X
kkℓ
k= N
o
,
(3.3)where for each
k
,ℓ
kis the number of indicesi
such thatm
i= k
. The map(m
i)
i7→ ℓ
is a bijection and in the following we refer to configurations equally in terms of(m
i)
iorℓ
.Fix
δ, ρ > 0
andN
∈ N
and recall the definition ofA
N,t(ℓ)
in (2.3), then we see thatP
NMi
(N )(t)
∈ B
δ(λ), Ma
(N )(t)
∈ B
ρ(α)
=
X
ℓ∈NN1l
{d(
1 Nℓ, λ) < δ
} 1l{D(ℓ
⌊·N⌋, α) < ρ
} P
N(A
N,t(ℓ)).
(3.4)Step 1: Cardinality of
N
N: First we note that|N
N| = e
o(N )because the following argument (which is due to an argument in [Ada08]). For anyℓ
∈ N
N, the setH(ℓ) =
{k ∈ N: ℓ
k> 0
}
has no more than2
√
N
elements, sinceN =
X
k∈H(ℓ)kℓ
k≥
X
k∈H(ℓ)k
≥
|H(ℓ)|X
k=1k =
|H(ℓ)|
1
2
(
|H(ℓ)| − 1).
Hence, |NN| ≤ n(ℓk)k ∈ N N 0: X k kℓk= N,|H(ℓ)| ≤ 2 √ No ≤ X H⊂N : |H|≤2√N {(ℓk)k∈H ∈ NH: X k∈H kℓk= No ≤ X H⊂N : |H|≤2√N {(Lk)k∈H ∈ NH: X k∈H Lk= No ≤ N ⌊2√N⌋ N + ⌊2√N⌋ ⌊2√N⌋ = eo(N ).Hence, we only have to give asymptotic estimates on the single summands on the right-hand side of (3.4). Step 2: Splitting
P
N(A
N,t(ℓ))
. The strategy is to divide the terms in the product representation fromCorol-lary 2.2 into three groups, which we call micro-, meso-, and macroscopic. We fix two increasing sequences
R
N andε
NN
inN
such thatR
Nր ∞
,ε
N↓ 0
andR
N< ε
NN
. We writeP
N(A
N,t(ℓ)) = N !
× F
Mi(ℓ)
× F
Me(ℓ)
× F
Ma(ℓ),
(3.5) whereF
Mi(ℓ) =
RNY
k=1z
k(ℓ),
F
Me(ℓ) =
Y
RN<k≤εNNz
k(ℓ),
F
Ma(ℓ) =
Y
εNN <k≤Nz
k(ℓ),
andz
k(ℓ) =
µ
N t(k)
ℓke
− t 2Nk(N−k)ℓkk!
ℓkℓ
k!
.
Let us setc
Mi(ℓ/N ) =
1
N
RNX
k=1kℓ
k,
c
Me(ℓ/N ) =
1
N
X
RN<k≤εNNkℓ
k,
c
Ma(ℓ/N ) =
1
N
X
εNN <k≤Nkℓ
k.
(3.6)Note that the sum of these three terms is equal to one. For the factor
N !
, we use Stirling’s formulaN ! =
(
Ne)
Ne
o(N )so that uniformly inℓ
∈ N
N
N ! =
N
e
N cMi(ℓ/N )N
e
N cMe(ℓ/N )N
e
N cMa(ℓ/N )e
o(N ),
N
→ ∞.
(3.7)Step 3: Upper bound in the case
c
λ+ c
α≤ 1
. We start by looking at the first term on the right-hand side, i.e., the ‘microscopic’ term. We use the upper bound in (2.5), to obtainz
k(ℓ)
≤
k
(k−2)ℓkt
(k−1)ℓke
−2Nt k(N−k)ℓkk!
ℓkN
(k−1)ℓk(
1e
ℓ
k)
ℓk.
Using this in the first term of (3.5) (together with the first term in (3.7)), we obtain, uniformly for
ℓ
∈ N
N, also using thatP
RN k=1 2Ntk
2ℓ
k≤
2tR
Nc
Mi(ℓ/N )
,N
e
N cMi(ℓ/N )F
Mi(ℓ)
≤
RNY
k=1hN
e
kℓkk
(k−2)ℓkt
(k−1)ℓke
ℓke
−2tkℓke
2Nt k2ℓkk!
ℓkN
(k−1)ℓk(ℓ
k)
ℓki
= exp
− N
RNX
k=11
N
ℓ
klog
k!e
k 1 Nℓ
kk
k−2t
k−1e
1−t 2ke
o(N )= exp
−NI
(RN ) Mi(
N1ℓ; t)
e
o(N ),
whereI
Mi(RN )(e
λ; t) = f
(RN )(e
λ; t) +
RNX
k=1ke
λ
kt
2
− log t
withf
(RN )(e
λ; t) : =
RNX
k=1eλ
klog
k!te
k−1eλ
kk
k−2,
is the cut-off version of the rate function defined in (1.7). Recall thatd(
N1ℓ, λ) < δ
and thatc
λ=
P
k∈N
kλ
k∈ [0, 1]
and observe thatlim
R→∞I
Mi(R)(λ; t) = I
Mi(λ; t)
. Therefore we see that, for anyR
∈ N
,N
e
N cMi(ℓ/N )F
Mi(ℓ)
≤ exp (−NI
Mi(λ; t)) e
N (CR(δ)+γR)+o(N )e
−N(t
2−log t)(cMi(ℓ/N )−cλ)
,
N
→ ∞,
(3.8) where
lim
R→∞γ
R= 0
andlim
δ↓0C
R(δ) = 0
. Indeed, sincef
(R)(
·; t)
is continuous, it is clear thatsup
ℓ : d(1Nℓ,λ)<δ
|f
(R)(
1N
ℓ; t)
− f
(R)
(λ; t)
|
vanishes asδ
↓ 0
and can therefore be estimated against sucha
C
R(δ)
. Moreover, we estimate (substituting N1ℓ
byeλ
), for anyN
such thatR
N> R
, with the help of the Stirling boundk!e
kk
−k≥ 1
and Jensen’s inequality forϕ(x) = x log x
, as follows:f
(RN )(e
λ; t)
− f
(R)(e
λ; t) =
RNX
k=R+1eλ
klog
k!te
k−1eλ
kk
k−2≥
RNX
k=R+1eλ
klog
k
2te
λ
ke
≥
RNX
k=R+1e
tk
2ϕ
X
RN k=R+1eλ
k.
X
RN k=R+1e
tk
2=
RNX
k=R+1eλ
klog
X
RN k=R+1eλ
k.
X
RN k=R+1e
tk
2≥
RNX
k=R+1eλ
klog
cR
RNX
k=R+1eλ
k≥ −γ
R,
(3.9)for some
c > 0
, where we used that the remainder sumP
k>R k12 is of order1/R
asR
→ ∞
and thatP
RNk=R+1
eλ
k≤ 1/R
sinceP
k
ke
λ
k≤ 1
and that the mapx
7→ x log(cRx)
is decreasing in(0, 1/eRc)
, introducing some−γ
Rthat vanishes asR
→ ∞
. In this way, we arrived at the estimate in (3.8). Notice that the last term on the right-hand side cannot be further estimated with the help of continuity (sinceλ
7→ c
λ is not continuous), but will be jointly handled together with the correspondent macroscopic and mesoscopic terms.For handling the third term in (3.5), we proceed analogously, but use the upper bound in Lemma 2.4, to obtain, for
k
∈ {ε
NN, . . . , N
}
,z
k(ℓ)
≤
(1
− e
−tk N)
(k−1)ℓke
−2Nt k(N−k)ℓkk!
ℓk(
1 eℓ
k)
ℓk and consequentlyN
e
N cMa(ℓ/N )F
Ma(ℓ)
≤
Y
εNN≤k≤NhN
e
kℓk(1
− e
−t k N)
kℓke
−t2kℓke
2tk2ℓk/Nk!
ℓkℓ
k!
i
≤
Y
εNN≤k≤NhN
k
kℓk(1
− e
−tNk)
kℓke
− t 2kℓke
2tk2ℓk/Ni
= exp
− NI
(εN ) Ma(ℓ
⌊·N⌋; t)
(3.10) whereI
(εN ) Ma(e
α; t) = g
(εN )(e
α; t) +
Z
[εN,1]x e
α(dx)
t
2
− log t
withg
(εN )(e
α; t) =
Z
[εN,1]h
x log
tx
1
− e
−tx−
t
2
x
2i
e
α(dx),
denotes the cut-off version of the rate function
I
Ma defined in (1.7). Recall thatD(ℓ
⌊·N⌋, α) < ρ
andc
α=
R
(0,1]x α(dx)
≤ 1
and observe thatlim
ε↓0I
Ma(ε)(α; t) = I
Ma(α; t)
, then we see that, for anyε > 0
,N
e
N cMa(ℓ/N )F
Ma(ℓ)
≤ exp − NI
Ma(α; t)
e
N (Cε(ρ)+γε+ t 2ε)+o(N )e
−N( t 2−log t)(cMa(ℓ/N )−cα),
N
→ ∞,
(3.11) for someC
ε(ρ)
andγ
εthat satisfylim
ε↓0γ
ε= 0
andlim
ρ↓0C
ε(ρ) = 0
. Indeed, first observe thatg
(ε)(
·; t)
is continuous and hence|g
(ε)(ℓ
⌈·N⌉
; t)
− g
(ε)(α; t)
|
can be estimated against such aC
ε(ρ)
, uniformly inN
∈ N
andℓ
such thatD(ℓ
⌊·N⌋, α) < ρ
. Furthermore, for anyε > 0
and anyN
∈ N
such thatε
N< ε
,g
(εN )(ℓ
⌈·N⌉; t)
− g
(ε)(ℓ
⌈·N⌉; t) =
εNX
k=εNNℓ
kk
N
log
k Nt
1
− e
−tk N−
t
2
k
N
!
≥ −
t
2
ε,
since
log
1−ex−x≥ 0
for allx > 0
. Hence, we arrived at the bound in (3.11). Notice that again we refrainfrom estimating the term
e
−N(2t−log t)(cMa(ℓ/N )−cα), which needs to be coupled with the microscopic and theThen we are left to handle the middle term in (3.5), for which we use again the upper bound in (2.5) and Stirling’s formula, to see that
N
e
N cMe(ℓ/N )F
Me(ℓ)
≤
⌊εY
NN⌋ k=RN+1hN
e
kℓkk
(k−2)ℓk(1
− e
−t/N)
(k−1)ℓke
− t 2Nk(N−k)ℓkk!
ℓkℓ
k!
i
≤
⌊εY
NN⌋ k=RN+1h Ne
k
2ℓ
kt
i
ℓk ⌊εN N⌋Y
k=RN+1e
2Nt k2ℓkte
−t/2N cMe(ℓ/N ).
We claim that the right-hand side is equal to
(te
−t/2)
N cMe(ℓ/N )e
N LN(ℓ) for someL
N
(ℓ)
that vanishes, uni-formly inℓ
, asN
→ ∞
. First note that the one-but-last term is such a term, since 2NtP
⌊εk=RNN⌋N+1
k
2ℓ
k≤
t 2ε
NN c
Me(ℓ/N )
. Furthermore,P
⌊εNN⌋k=RN+1
ℓ
k≤ N/R
N, which shows that the terms containingt
ande
inthe first product are as small. With the same approach as in (3.9), we see the lower bound
lim inf
N→∞ ⌊εX
NN⌋ k=RN+1ℓ
kN
log
k
2ℓ
kN
≥ 0.
Therefore, uniformly in
ℓ
such thatD(ℓ
⌈·N⌉, α) < ρ
, we have arrived at the estimateN e N cMe(ℓ/N ) FMe(ℓ)≤ te−t/2N cMe(ℓ/N ))+o(N )= exp −N t 2−log t cMe(ℓ/N ) eo(N ), N → ∞. (3.12)
Now we collect (3.8), (3.11) and (3.12) and substitute them in (3.5), also using (3.7), then we obtain, uniformly in
ℓ
such thatd(
N1ℓ, λ) < δ)
andD(ℓ
⌊·N⌋, α) < ρ
, for anyR
∈ N
and anyε > 0
, asN
→ ∞
,1
N
log P
N(A
N,t(ℓ))
≤ −I
Mi(λ; t)
− I
Ma(α; t) + C
R(δ) + γ
R+ C
ε(ρ) + γ
ε+
t
2
ε
−
t
2
− log t
(1
− c
λ− c
α) + o(1)
=
−I(λ, α; t) + K
R,ε(δ, ρ) + o(1),
where
K
R,ε(δ, ρ)
vanishes asδ
↓ 0
andρ
↓ 0
, followed byR
→ ∞
andε
↓ 0
, and we recall thatc
Me(ℓ/N ) = 1
−c
Mi(ℓ/N )
−c
Ma(ℓ/N )
. This implies the upper bound in (3.2) in the case wherec
λ+ c
α≤
1
.Step 4: Upper bound in the case
c
λ+ c
α> 1
. In this case, we implicitly use the lower semicontinuity of the mapsλ
7→ c
λandα
7→ c
α to show that the eventA
N,t(ℓ)
is empty for anyℓ
such thatd(
N1ℓ, λ) < δ
andD(ℓ
⌊·N⌋, α) < ρ
, ifδ
andρ
are small enough. This will give the right super-exponential upper bound forP
N(A
N,t(ℓ))
, sinceI(λ, α; t) =
∞
.Indeed, first pick
R
∈ N
so large andε
∈ (0, 1)
so small thatP
Rk=1kλ
k+
R
[ε,1]
x α(dx)
are larger than one, say equal to1 + η
for someη > 0
. Then chooseδ
andρ
in(0, 1)
so small that, for anyℓ
such thatd(
N1ℓ, λ) < δ)
andD(ℓ
⌊·N⌋, α) < ρ
, we haveN
1
RX
k=1kℓ
k+
1
N
X
εN≤k≤Nkℓ
k−
RX
k=1kλ
k+
Z
[ε,1]x α(dx)
<
η
2
.
For such
ℓ
, we then have that, using the notation in (3.6),c
Mi(ℓ/N ) + c
Ma(ℓ/N )
≥ 1 +
η
2
,
which contradicts the fact that
c
Mi(ℓ/N )
,c
Me(ℓ/N )
andc
Ma(ℓ/N )
sum up to one.Step 5: Logarithmic asymptotics of the lower bound. For the lower bound we only need to consider the case
I(λ, α; t) <
∞
, and here we construct a “recovery sequence”, that is, a sequence(ℓ
(N ))
N∈N such that
lim
N→∞d(
1 Nℓ
(N ), λ) = 0,
(3.13)lim
N→∞D(ℓ
(N ) ⌊·N⌋, α) = 0,
(3.14)lim inf
N→∞1
N
log P
N(A
N,t(ℓ
(N )))
≥ −I(λ, α; t).
(3.15)For
N
large enough defineℓ
(N )∈ N
N by
ℓ
(N ) k=
⌊λ
kN
⌋
fork = 2, . . . , R
N;
⌊
(1−cλ−cα)N RN+1⌋
fork = R
N+ 1;
α
k−1N,
k N fork = R
N+ 2, . . . , N ;
N
−
P
Nj≥2jℓ
(N ) j fork = 1,
(3.16)where
R
N is an arbitrary diverging sequence inN
such thatlog N
≪ R
N. It is clear by construction that (3.13) and (3.14) hold.Using Lemma 2.3, a calculation similar to that for the upper bound shows that
N
e
N cMi(ℓ(N )/N )F
Mi(ℓ
(N ))
≥ exp −NI
Mi(RN )(λ; t)
e
o(N ),
and in the same way, one checks that for
k = R
N+ 1
lim inf
N→∞1
N
log z
kℓ
(N )k+ kℓ
(N )klog N !
N
≥ (1 − c
λ− c
α)
log t
−
t
2
.
Now, fix
δ
∈ (0, 1)
. One can see thatα
RN+2 N, δ
≤
N
R
N+ 2
Z
δ RN +2 Nx α(dx)
≤
N
R
N+ 2
c
α(δ),
(3.17)where
c
α(δ)
vanishes asδ
↓ 0
wheneverα
∈ M
N0. Then we use Lemma 2.3 and Stirling’s upper bound to see that, forN
large1
N
⌊δN⌋X
k=RN+2log z
k(ℓ
(N )) + kℓ
(N )klog N !
N
≥ log t −
t 2P
δN k=RN+2 k Nα
k N,
k+1 N]
!
−
1 NP
δN k=RN+2log α (
k N,
k+1 N]
−
δNX
k=RN+22 log k + log
√
2πk
N
α (
k N,
k+1 N]
+ o(1).
By using (3.17),
log k
≤ k
and that δNY
k=RN+2α (
k N,
k+1 N]
!
≤
P
δNk=RN+2α
k N,
k+1 N!
≤
N RN+2!
we see that, whenever
log N
≪ R
N, there exist a finiteε
δ, vanishing asδ
ց 0
such thatlim inf
N→∞1
N
⌊δN⌋X
k=RN+2log z
k(ℓ
(N )) + kℓ
(N )klog N !
N
≥ ε
δ.
For the remaining terms one calculates that1
N
NX
⌊δN⌋+1log z
kℓ
(N )k+ kℓ
(N ) klog N !
N
=
1
N
NX
⌊δN⌋+1kℓ
(N ) k"
log
µ
(N ) t(k)
1 kk/N
!
−
t
2N
(N
− k)
#
+ o(1)
=
Z
(δ,1]x
"
log
µ
(N ) t(
⌊Nx⌋)
1 ⌊N x⌋x
!
−
t
2
(1
− x)
#
α(dx) + o(1).
(3.18)Now by Lemma 2.4 the integrand converges pointwise to
x
log
1
− e
−txx
−
t
2
(1
− x)
.
Since, for
N
large enough,x log
x
µ
(N ) t(
⌊Nx⌋)
1 ⌊N x⌋!
≤ x
log
x
1
− e
−tx+ 1
,
which is clearly integrable over
x
∈ (δ, 1]
with respect toα
, we can apply the dominated convergence theorem.Combining the above estimates we see that for any
δ
∈ (0, 1)
we havea
δwithlim
δց0a
δ= 0
andlim inf
N→∞1
N
log P
N(A
N,t(ℓ
(N )))
≥ −I(λ, α; t) − a
δ (3.19)so (3.15) follows on taking the limit
δ
ց 0
.4
Corollaries and study of the rate functions
In this section we analyse, for fixed
t
∈ [0, ∞)
, the minima of the rate function,I(λ, α; t)
, over the config-urationsλ
respectivelyα
, and afterwards the minimima of the rate functions for the total masses,J
Mi,J
Me andJ
Ma. In particular, we find analytical characterisations for the gelation phase transition at1/t
ift > 1
.4.1
Rate functions for the microscopic part
We start by minimizing
I(λ, α; t)
, for a fixedλ
∈ N
over all compatibleα
∈ M
N0. We will obtain the rate function for the microscopic part, and we will see that this minimum is attained forα
of the formα = δ
cα.Informally speaking, the following in particular implies that, with probability tending to one, there is at most one macroscopic particle.
Lemma 4.1 (Analysis of the microscopic rate function). Fix
λ
∈ N
and recall thatc
λ=
P
k∈Nkλ
k∈ [0, 1]
, theninf
α∈MNI(λ, α; t) =
∞X
k=1λ
klog
k!e
k−1tλ
kk
k−2− (1 − c
λ)
log
1
− e
(cλ−1)t(1
− c
λ)
−
c
λt
2
+ c
λt
2
− log t
.
Moreover, the minimum is attained precisely at
α = δ
1−cλ.Proof. Clearly
inf
α∈MNI(λ, α; t) =
inf
c∈[0,1−cλ]inf
α∈MN0(c)I(λ, α; t)
= I
Mi(λ; t) +
inf
c∈[0,1−cλ]inf
α∈MN0(c)I
Ma(α; t) + (1
− c
λ− c)
t
2
− log t
.
Fix
c
∈ [0, 1]
andα
∈ M
N0(c)
. Note thatα((c, 1]) = 0
sinceα
is a point measure withR
(0,1]
x α(dx) = c
. We have, denotingf
t(x) = log
1−ex−tx+
2t(1
− x)
,I
Ma(α; t) =
Z
(0,c]xf
t(x) α(dx)
≥
Z
xf
t(c) α(dx) = cf
t(c) = c log
c
1
− e
−tc+
t
2
c(1
−c) = I
Ma(δ
c; t),
(4.1) sincef
tis strictly decreasing in[0,
∞)
. Indeed,f
t′(x) =
1
x
−
te
−tx1
− e
−tx−
t
2
=
t(1 + y)
2y(1
− e
−2y)
h1 − y
1 + y
− e
−2yi
,
y =
tx
2
.
We want to prove that
f
′(x) < 0
forx
∈ [0, ∞)
. Fory
≥ 1
, this is obvious from above, and fory
∈ [0, 1)
, this is easily seen as follows.e
2y= 1 +
∞X
k=1(2y)
kk!
< 1 +
∞X
k=12y
k= (1 + y)
∞X
k=0y
k=
1 + y
1
− y
,
since2k!k
< 2
for allk
≥ 3
. Hence, we see thatf
t′(x)
≤ 0
forx
∈ [0, ∞)
, and (4.1) follows. Furthermore, it is immediate thatc log
c1−e−tc
+
2tc(1
− c) + (1 − c
λ− c)(
2t− log t)
is decreasing inc
,and hence the optimal value of
c
isc = 1
− c
λ.Now the proof of Corollary 1.2 directly follows from Theorem 1.1, Lemma 4.1 and the contraction principle since the projection