A Note on the Replica Symmetric Formula for the SK Model
Christian Brennecke, Horng-Tzer Yau
Department of Mathematics, Harvard University,
One Oxford Street, Cambridge MA 02138, USA
September 16, 2021
Abstract
We provide a simple extension of Bolthausen's Morita type proof [6] of the replica symmetric formula for the Sherrington-Kirkpatrick (SK) model and prove the replica symmetry for all $(\beta, h)$ that satisfy $\beta^2 E \operatorname{sech}^2(\beta\sqrt{q}Z + h) \le 1$, where $q = E \tanh^2(\beta\sqrt{q}Z + h)$. Compared to [6], the key of the argument is to apply the conditional second moment method to a suitably reduced partition function.
1 Introduction
We study systems of $N$ spins $\sigma_i$, $i \in \{1, \dots, N\}$, with values in $\{-1, 1\}$ and with the Hamiltonian $H_N : \{-1, 1\}^N \to \mathbb{R}$ defined by
$$H_N(\sigma) = \frac{\beta}{\sqrt{2}} \sum_{1 \le i,j \le N} g_{ij}\sigma_i\sigma_j + h \sum_{i=1}^N \sigma_i. \tag{1.1}$$
The interactions $\{g_{ij}\}$ are i.i.d. centered Gaussians of variance $1/N$ for $i \ne j$ and we set $g_{ii} \equiv 0$. Here $\beta \ge 0$ denotes the inverse temperature and $h > 0$ the external field strength.
Eq. (1.1) corresponds to the Sherrington-Kirkpatrick spin glass model [21] and we are interested in its free energy $f_N$ at high temperature, where
$$f_N = \frac{1}{N} \log Z_N, \qquad Z_N = \sum_{\sigma \in \{-1,1\}^N} e^{H_N(\sigma)}. \tag{1.2}$$
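For very small $N$ the definitions (1.1) and (1.2) can be evaluated by direct enumeration. The following is a minimal sketch, assuming NumPy is available; the function names `sample_disorder` and `log_partition_function` are illustrative and not part of the paper, and the approach is only feasible for $N$ of order 20 or less.

```python
import itertools
import math

import numpy as np


def sample_disorder(n, rng):
    """Sample i.i.d. N(0, 1/n) couplings g_ij with g_ii = 0, as in (1.1)."""
    g = rng.normal(0.0, 1.0 / math.sqrt(n), size=(n, n))
    np.fill_diagonal(g, 0.0)
    return g


def log_partition_function(g, beta, h):
    """Return log Z_N from (1.2) by enumerating all 2^N spin configurations."""
    n = g.shape[0]
    energies = []
    for spins in itertools.product((-1.0, 1.0), repeat=n):
        sigma = np.array(spins)
        # H_N(sigma) = beta/sqrt(2) * sum_{i,j} g_ij s_i s_j + h * sum_i s_i
        energies.append(beta / math.sqrt(2.0) * (sigma @ g @ sigma) + h * sigma.sum())
    energies = np.array(energies)
    shift = energies.max()  # log-sum-exp shift for numerical stability
    return float(shift + np.log(np.exp(energies - shift).sum()))


rng = np.random.default_rng(0)
N = 10
g = sample_disorder(N, rng)
f_N = log_partition_function(g, beta=0.5, h=0.3) / N
print(f"f_N for one disorder sample: {f_N:.4f}")
```

At $\beta = 0$ the spins decouple and the sketch should reproduce $\log Z_N = N \log(2\cosh h)$, which is a convenient sanity check.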
The mathematical understanding of the SK model required substantial effort before the famous Parisi formula [18, 19] was rigorously established by Guerra [10] and Talagrand [22]. Later, Panchenko [15, 17] gave another proof based on the ultrametricity of generic models. For a thorough introduction to the topic, we refer to [14, 23, 24, 16].
arXiv:2109.07354v1 [math-ph] 15 Sep 2021
Despite the validity of the Parisi formula, it is an interesting question to prove the replica symmetry of the SK model at high temperature, as predicted by de Almeida and Thouless [3]. Replica symmetry is expected for all $(\beta, h)$ that satisfy
$$\beta^2 E \operatorname{sech}^4(\beta\sqrt{q}Z + h) < 1, \tag{1.3}$$
where $q$ denotes the unique solution of the self-consistent equation
$$q = E \tanh^2(\beta\sqrt{q}Z + h). \tag{1.4}$$
In both cases $Z \sim \mathcal{N}(0, 1)$ denotes a standard Gaussian and $E$ the expectation over $Z$.
In the special case that the external field in direction of $\sigma_i$ is a centered Gaussian random variable, $h_i = h g_i$ for i.i.d. $g_i \sim \mathcal{N}(0, 1)$ (independent of the $g_{ij}$), replica symmetry has recently been shown in [9] for all $(\beta, h)$ that satisfy (1.3) (in which case $h$ in (1.3) and (1.4) is replaced by $hZ'$ for some $Z' \sim \mathcal{N}(0, 1)$ independent of $Z$); see also [20] for previous results in this case. For Hamiltonians $H_N$ as in (1.1) (or, more generally, Hamiltonians with non-centered random external field), however, replica symmetry is to date only known above the AT line up to a bounded region in the $(\beta, h)$-phase diagram.
This has been analyzed in [13]. Like [9], this analysis is based on the Parisi variational problem and we refer to [13] for the details. For previously obtained results based on the Parisi formula, see also [24, Chapter 13].
In this note, instead of analyzing the high temperature regime in view of the Parisi variational problem, we give a simple extension of Bolthausen's argument [6] and prove the replica symmetric formula for all $(\beta, h)$ that satisfy
$$\beta^2 E \operatorname{sech}^2(\beta\sqrt{q}Z + h) \le 1. \tag{1.5}$$
Although (1.5) is clearly stronger than the condition (1.3), it already covers a fairly large region of the high temperature regime, see Fig. 1 for a schematic. It improves upon the inverse temperature range from [6], where β was assumed to be sufficiently small.
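The conditions (1.3) and (1.5) can be checked numerically for a given pair $(\beta, h)$. The sketch below is one possible way to do so, assuming Gauss-Hermite quadrature for the expectation over $Z$ and a plain fixed-point iteration for (1.4); the function names, the starting value and the number of quadrature nodes are illustrative choices, not taken from the paper.

```python
import numpy as np

# Gauss-Hermite rule for Z ~ N(0, 1): E f(Z) ~ sum_i WEIGHTS[i] * f(NODES[i]).
NODES, WEIGHTS = np.polynomial.hermite_e.hermegauss(80)
WEIGHTS = WEIGHTS / WEIGHTS.sum()


def gauss_mean(f):
    """Approximate E f(Z) for a standard Gaussian Z."""
    return float(WEIGHTS @ f(NODES))


def solve_q(beta, h, tol=1e-13, max_iter=10_000):
    """Fixed-point iteration for q = E tanh^2(beta*sqrt(q)*Z + h), eq. (1.4)."""
    q = 0.5  # arbitrary starting value (an assumption of this sketch)
    for _ in range(max_iter):
        q_new = gauss_mean(lambda z: np.tanh(beta * np.sqrt(q) * z + h) ** 2)
        if abs(q_new - q) < tol:
            return q_new
        q = q_new
    return q


def at_value(beta, h):
    """Left-hand side of the AT condition (1.3)."""
    q = solve_q(beta, h)
    return beta**2 * gauss_mean(lambda z: np.cosh(beta * np.sqrt(q) * z + h) ** -4.0)


def strong_value(beta, h):
    """Left-hand side of condition (1.5)."""
    q = solve_q(beta, h)
    return beta**2 * gauss_mean(lambda z: np.cosh(beta * np.sqrt(q) * z + h) ** -2.0)


for beta, h in [(0.5, 0.5), (1.2, 0.5), (2.0, 0.5)]:
    print(beta, h, "AT:", at_value(beta, h) < 1.0, "(1.5):", strong_value(beta, h) <= 1.0)
```

Since $\operatorname{sech}^2 = 1 - \tanh^2$, the left-hand side of (1.5) equals $\beta^2(1 - q)$, so `strong_value` can be cross-checked against `solve_q` directly.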
Theorem 1.1. Assume that $(\beta, h)$ satisfies (1.5). Then
$$\lim_{N \to \infty} \mathbb{E}\, \frac{1}{N} \log Z_N = \log 2 + E \log\cosh(\beta\sqrt{q}Z + h) + \frac{\beta^2}{4}(1 - q)^2. \tag{1.6}$$
Remarks:
1) From [11], it is well-known that $\lim_{N\to\infty} \mathbb{E}\, \frac{1}{N} \log Z_N$ exists and that almost surely $\lim_{N\to\infty} \frac{1}{N} \log Z_N = \lim_{N\to\infty} \mathbb{E}\, \frac{1}{N} \log Z_N$.
2) It follows from the results of [10] that the right hand side in (1.6) provides an upper bound to the free energy $\lim_{N\to\infty} \mathbb{E}\, \frac{1}{N} \log Z_N$, for all inverse temperatures and external fields $(\beta, h)$. To establish Theorem 1.1, it is therefore sufficient to prove that the right hand side in (1.6) provides a lower bound to $\lim_{N\to\infty} \mathbb{E}\, \frac{1}{N} \log Z_N$.
We conclude this introduction with a quick heuristic outline of the main argument.
To this end, consider first the case $h = 0$ where the critical temperature corresponds to $\beta = 1$. In this case, it is straightforward (see e.g. [7, Chapter 1, Section 3]) to see that
$$\lim_{N\to\infty} \frac{1}{N} \log \mathbb{E} Z_N = \log 2 + \frac{\beta^2}{4}, \qquad \lim_{N\to\infty} \frac{1}{N} \log \mathbb{E} Z_N^2 = 2\log 2 + \frac{\beta^2}{2}$$
for all $\beta < 1$. The replica symmetric formula thus follows from the second moment method, using the Gaussian concentration of the free energy. In fact, for $h = 0$, also the fluctuations of $Z_N / \mathbb{E} Z_N$ have been known for a long time [2].
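The two limits for $h = 0$ can be illustrated at finite $N$. The sketch below rests on a short computation from (1.1) that is not spelled out in the text and should be read as an assumption of the example: for $h = 0$, $H_N(\sigma)$ is a centered Gaussian of variance $\beta^2(N-1)/2$, and $H_N(\sigma) + H_N(\tau)$ has variance $\beta^2(N R^2 + N - 2)$, where $R = \langle \sigma, \tau \rangle$ is the overlap. Summing over the number $m$ of sites where $\sigma$ and $\tau$ differ then gives both moments exactly.

```python
import math


def log_first_moment_rate(n, beta):
    """(1/N) log E Z_N at h = 0, using Var H_N(sigma) = beta^2 (N - 1) / 2."""
    return math.log(2.0) + beta**2 * (n - 1) / (4.0 * n)


def log_second_moment_rate(n, beta):
    """(1/N) log E Z_N^2 at h = 0, summing over the overlap R = 1 - 2m/N."""
    terms = []
    for m in range(n + 1):  # m = number of sites where sigma and tau differ
        overlap = 1.0 - 2.0 * m / n
        log_binom = math.lgamma(n + 1) - math.lgamma(m + 1) - math.lgamma(n - m + 1)
        # E exp(H(sigma) + H(tau)) = exp(Var / 2), Var = beta^2 (N R^2 + N - 2)
        terms.append(log_binom + 0.5 * beta**2 * (n * overlap**2 + n - 2))
    shift = max(terms)  # log-sum-exp shift
    log_sum = shift + math.log(sum(math.exp(t - shift) for t in terms))
    return (n * math.log(2.0) + log_sum) / n


for beta in (0.5, 0.9, 1.5):
    first = log_first_moment_rate(2000, beta)
    second = log_second_moment_rate(2000, beta)
    print(f"beta={beta}: 2*first={2 * first:.4f}, second={second:.4f}")
```

For $\beta < 1$ the second rate should be close to twice the first, whereas for $\beta > 1$ the pairs with $|R|$ close to 1 dominate and the second moment method breaks down.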
Clearly, it would be desirable to extend this simple argument to the case $h > 0$, but a direct application of the second moment method does not work here. However, as suggested in [6], one may hope to obtain a model similar to the case $h = 0$ by centering the spins around suitable magnetizations and viewing $Z_N$, up to normalization, as an average over the corresponding coin-tossing measure. To center the spins correctly, recall that at high temperature one expects the TAP equations [25] to hold, that is
$$m_i \approx \tanh\Big( h + \beta \sum_{j \ne i} \bar{g}_{ij} m_j - \beta^2 (1 - q) m_i \Big) \quad (\text{for } i = 1, \dots, N), \tag{1.7}$$
where $\bar{g}_{ij} = (g_{ij} + g_{ji})/\sqrt{2}$ and $m_i = Z_N^{-1} \sum_{\sigma} \sigma_i e^{H_N(\sigma)}$. The validity of (1.7) is known for sufficiently small $\beta$ (see [23, 8] and, more recently, [1]; see also [12] on the TAP equations for generic models, valid at all temperatures) and expected to be true under (1.3). In [5, 6], Bolthausen has provided an iterative construction $(m^{(s)})_{s \in \mathbb{N}}$ of the solution to (1.7) that converges (in a suitable sense) in the full high temperature regime (1.3). The main result of [6] is a novel proof of (1.6) for $\beta$ small enough, based on a conditional second moment argument, given the approximate solutions $(m^{(s)})_{s \in \mathbb{N}}$. It has remained an open question, however, if the approach can be extended to the region (1.3).
In this note, while we are not able to resolve this question for all $(\beta, h)$ satisfying (1.3), we improve the range of $(\beta, h)$ to (1.5) as follows. [5, 6] show, roughly speaking, that $m^{(k+1)} \approx \sum_{s=1}^{k} \gamma_s \phi^{(s)}$ for certain orthonormal vectors $\phi^{(s)} \in \mathbb{R}^N$ and deterministic numbers $\gamma_s$ ($\approx \langle m^{(k+1)}, \phi^{(s)} \rangle$ with high probability), where $\langle x, y \rangle = N^{-1} \sum_{i=1}^{N} x_i y_i$ for $x, y \in \mathbb{R}^N$. One also has $g = g^{(k+1)} + \sum_{s=1}^{k} \rho^{(s)}$ for the interaction $g = (g_{ij})_{1 \le i,j \le N}$, where the $\rho^{(s)} \in \mathbb{R}^{N \times N}$ are measurable w.r.t. $(m^{(s)})_{s \le k+1}$ and where the modified interaction $g^{(k+1)}$ is Gaussian, conditionally on the $(m^{(s)})_{s \le k+1}$, with the property that $g^{(k+1)} \sum_{s=1}^{k} \gamma_s \phi^{(s)} = 0$. Up to negligible errors, one obtains with $\hat{\sigma} = \sigma - m^{(k+1)}$ that
$$\frac{1}{N} \log Z_N \approx \log 2 + E \log\cosh(\beta\sqrt{q}Z + h) + \frac{1}{N} \log \sum_{\sigma \in \{-1,1\}^N} p_{\mathrm{free}}(\sigma) \exp\Big( N \frac{\beta}{\sqrt{2}} \langle \hat{\sigma}, g^{(k)} \hat{\sigma} \rangle + O\big( \max_s |\gamma_s - \langle \sigma, \phi^{(s)} \rangle| \big) \Big), \tag{1.8}$$
where $p_{\mathrm{free}}$ denotes the product measure for which $\sum_{\sigma} p_{\mathrm{free}}(\sigma)\, \sigma = m^{(k+1)}$. A simple observation is now that we can ignore the error $O(\max_s |\gamma_s - \langle \sigma, \phi^{(s)} \rangle|)$ in (1.8) by restricting the modified partition function to those $\sigma$ with $\max_s |\gamma_s - \langle \sigma, \phi^{(s)} \rangle| \approx 0$. Note that the probability of the complement of this set is small under $p_{\mathrm{free}}$, because $\gamma_s \approx \langle m^{(k+1)}, \phi^{(s)} \rangle$. This yields a simple lower bound on $\frac{1}{N} \log Z_N$ and we can apply the conditional second moment argument to the restricted partition function. We show that its first conditional moment equals $\beta^2 (1 - q)^2 / 4$ (up to negligible errors) in the full high temperature regime (1.3). To dominate its second moment by the square of the first, on the other hand, we need to impose the stronger condition (1.5).
Figure 1: Schematic of the $(T, h)$ phase diagram, where $T = 1/\beta$ denotes the temperature. In the blue region, whose boundary corresponds to the AT line (1.3), the SK model is known to be replica symmetry breaking. The boundary of the green region corresponds to the condition (1.5). Theorem 1.1 proves the replica symmetry in the green region.
Notice that imposing similar orthogonality restrictions on the partition function has proved useful before for obtaining lower bounds on the free energy, as in the TAP analysis of the spherical SK model [4].
Although (1.5) already covers a comparably large region of the high temperature phase, as schematically shown in Fig. 1, we do not expect that our argument can be extended to the full high temperature regime (1.3), for essentially the same reason as already pointed out by Bolthausen in [6]. In regimes in which $h \gg 1$ so that we have $m_i \approx m$ ($\approx 1$), the orthogonality relation $\langle \sigma - m, m \rangle \approx 0$ restricts the model to those $\sigma$ with $|\{ i \in \{1, \dots, N\} : \sigma_i = 1 \}| \approx \frac{N}{2}(1 + m)$. As pointed out in [6], in this toy case the AT condition (1.3) corresponds, at least on a heuristic level, to the condition $\beta^2 (1 - m^2)^2 < 1$ and it can be verified explicitly that the second moment method does not work for $\beta^2 \approx (1 - m^2)^{-2}$ and $m \approx 1$.
The paper is structured as follows. In the next two sections we set up the notation and recall Bolthausen’s iterative construction of the magnetization [5, 6]. In Section 4, we define the reduced partition function and compute its first and second moments. In the last section, we apply the conditional second moment method to prove Theorem 1.1.
Acknowledgements. The work of H.-T. Y. is partially supported by NSF grant DMS-1855509 and a Simons Investigator award.
2 Notation
In this section, we introduce basic notation and conventions. We follow closely [6].
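We usually denote vectors in $\mathbb{R}^N$ by boldface or greek letters. If $x \in \mathbb{R}^N$ and $g : \mathbb{R} \to \mathbb{R}$, we define $g(x)$ in the componentwise sense. By $\langle \cdot, \cdot \rangle : \mathbb{R}^N \times \mathbb{R}^N \to \mathbb{R}$, we denote the normalized inner product
$$\langle x, y \rangle = \frac{1}{N} \sum_{i=1}^{N} x_i y_i$$
and by $\| \cdot \| = \sqrt{\langle \cdot, \cdot \rangle}$ the induced norm. We also normalize the tensor product $x \otimes y : \mathbb{R}^N \to \mathbb{R}^N$ of two vectors $x, y \in \mathbb{R}^N$ so that for all $z \in \mathbb{R}^N$
$$(x \otimes y)(z) = \langle y, z \rangle\, x.$$
Given a matrix $A \in \mathbb{R}^{N \times N}$, $A^T \in \mathbb{R}^{N \times N}$ denotes its transpose and $\bar{A} \in \mathbb{R}^{N \times N}$ denotes its symmetrization
$$\bar{A} = \frac{A + A^T}{\sqrt{2}}.$$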
We mostly use the letters $Z, Z', Z_1, Z_2$, etc. to denote standard Gaussian random variables independent of the disorder $\{g_{ij}\}$ and independent from one another. When we average over such Gaussians, we denote the corresponding expectation by $E$, to distinguish it from the expectation $\mathbb{E}$ with respect to the disorder $\{g_{ij}\}$. Unless specified otherwise, we consider all Gaussians to be centered.
Finally, given two sequences of random variables $(X_N)_{N \ge 1}$, $(Y_N)_{N \ge 1}$ that may depend on parameters like $\beta$, $h$, etc. we say that
$$X_N \simeq Y_N$$
if and only if there exist positive constants $c, C > 0$, which may depend on the parameters, but which are independent of $N$, such that for every $t > 0$ we have
$$\mathbb{P}(|X_N - Y_N| > t) \le C e^{-c t^2 N}.$$
3 Bolthausen’s Construction of the Local Magnetizations
In this section, we recall Bolthausen’s iterative construction of the solution to the TAP equations [5, 6] and list the properties that we will need for the proof of Theorem 1.1.
We follow here the conventions of [6] and we refer to [6, Sections 2, 4 & 5] for the proofs of the following statements.
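First of all, we define three sequences $(\alpha_k)_{k \in \mathbb{N}}$, $(\gamma_k)_{k \in \mathbb{N}}$ and $(\Gamma_k)_{k \in \mathbb{N}}$. Set
$$\alpha_1 = \sqrt{q}\, \gamma_1, \qquad \gamma_1 = E \tanh(\beta\sqrt{q}Z + h), \qquad \Gamma_1^2 = \gamma_1^2,$$
where here and in the following $q$ denotes the unique solution of (1.4). Then, we define $\psi : [0, q] \to [0, q]$ by
$$\psi(t) = E \tanh\big(\beta\sqrt{t}Z + \beta\sqrt{q - t}Z' + h\big) \tanh\big(\beta\sqrt{t}Z + \beta\sqrt{q - t}Z'' + h\big)$$
and set recursively
$$\alpha_k = \psi(\alpha_{k-1}), \qquad \gamma_k = \frac{\alpha_k - \Gamma_{k-1}^2}{\sqrt{q - \Gamma_{k-1}^2}}, \qquad \Gamma_k^2 = \sum_{j=1}^{k} \gamma_j^2.$$
The following lemma collects important properties of $(\alpha_k)_{k \in \mathbb{N}}$, $(\gamma_k)_{k \in \mathbb{N}}$ and $(\Gamma_k)_{k \in \mathbb{N}}$.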
Lemma 3.1. ([5, Lemma 2.2, Corollary 2.3, Lemma 2.4], [6, Lemma 2])
1) $\psi$ is strictly increasing and convex in $[0, q]$ with $0 < \psi(0) < \psi(q) = q$. If (1.3) is satisfied, then $q$ is the unique fixed point of $\psi$ in $[0, q]$.
2) The sequence $(\alpha_k)_{k \in \mathbb{N}}$ is increasing and $\alpha_k > 0$ for every $k \in \mathbb{N}$. If (1.3) is satisfied, then $\lim_{k\to\infty} \alpha_k = q$ and if (1.3) is satisfied with a strict inequality, the convergence is exponentially fast.
3) For all $k \ge 2$, we have that $0 < \Gamma_{k-1}^2 < \alpha_k < q$ and that $0 < \gamma_k < \sqrt{q - \Gamma_{k-1}^2}$. If (1.3) is true, then $\lim_{k\to\infty} \Gamma_k^2 = q$ and, as a consequence, $\lim_{k\to\infty} \gamma_k = 0$.
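The recursion for $(\alpha_k)$, $(\gamma_k)$ and $(\Gamma_k^2)$ is straightforward to evaluate numerically. The sketch below is one way to do it, assuming Gauss-Hermite quadrature and using that, conditionally on $Z$, the two factors in $\psi$ are independent with the same law, so that $\psi(t) = E_Z\big[(E_{Z'} \tanh(\beta\sqrt{t}Z + \beta\sqrt{q-t}Z' + h))^2\big]$. Function names, the number of nodes and the clamping of $t$ to $[0, q]$ are choices of this example, not of [5, 6].

```python
import numpy as np

NODES, WEIGHTS = np.polynomial.hermite_e.hermegauss(60)
WEIGHTS = WEIGHTS / WEIGHTS.sum()  # normalize so that the weights sum to 1


def solve_q(beta, h, iters=500):
    """Fixed-point iteration for (1.4); the start value 0.5 is arbitrary."""
    q = 0.5
    for _ in range(iters):
        q = float(WEIGHTS @ np.tanh(beta * np.sqrt(q) * NODES + h) ** 2)
    return q


def psi(t, q, beta, h):
    """psi(t), computed as E_Z of the squared conditional mean over Z'."""
    t = min(max(t, 0.0), q)  # guard against rounding outside [0, q]
    outer = beta * np.sqrt(t) * NODES[:, None]       # shared Gaussian Z
    inner = beta * np.sqrt(q - t) * NODES[None, :]   # independent Gaussian Z'
    cond_mean = np.tanh(outer + inner + h) @ WEIGHTS
    return float(WEIGHTS @ cond_mean**2)


def bolthausen_sequences(beta, h, k_max):
    """Return q and the lists alpha_k, gamma_k, Gamma_k^2 for k = 1..k_max."""
    q = solve_q(beta, h)
    gamma_1 = float(WEIGHTS @ np.tanh(beta * np.sqrt(q) * NODES + h))
    alphas, gammas, big_gamma_sq = [np.sqrt(q) * gamma_1], [gamma_1], [gamma_1**2]
    for _ in range(2, k_max + 1):
        alpha = psi(alphas[-1], q, beta, h)
        gamma = (alpha - big_gamma_sq[-1]) / np.sqrt(q - big_gamma_sq[-1])
        alphas.append(alpha)
        gammas.append(gamma)
        big_gamma_sq.append(big_gamma_sq[-1] + gamma**2)
    return q, alphas, gammas, big_gamma_sq


q, alphas, gammas, big_gamma_sq = bolthausen_sequences(beta=0.5, h=0.5, k_max=6)
print("q =", q)
print("alpha_k =", alphas)
print("Gamma_k^2 =", big_gamma_sq)
```

For parameters well inside the region (1.3), the printed $\alpha_k$ and $\Gamma_k^2$ should increase towards $q$, in line with Lemma 3.1. For large $k$ the denominator $\sqrt{q - \Gamma_{k-1}^2}$ becomes small, so the sketch should not be iterated far beyond the point where quadrature error is comparable to $q - \Gamma_{k-1}^2$.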
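Next, we recall Bolthausen's modified interaction matrix. We define
$$g^{(1)} = g, \qquad \phi^{(1)} = \mathbf{1} \in \mathbb{R}^N, \qquad m^{(1)} = \sqrt{q}\, \mathbf{1} \in \mathbb{R}^N.$$
Assuming $g^{(s)}, \phi^{(s)}, m^{(s)}$ are defined for $1 \le s \le k$, we set
$$\zeta^{(s)} = \bar{g}^{(s)} \phi^{(s)}$$
and we define the $\sigma$-algebra $\mathcal{G}_k$ through
$$\mathcal{G}_k = \sigma\big( g^{(s)} \phi^{(s)},\ (g^{(s)})^T \phi^{(s)} : 1 \le s \le k \big).$$
Expectations with respect to $\mathcal{G}_k$ are denoted by $\mathbb{E}_k$. Furthermore, we set
$$h^{(k+1)} = h \mathbf{1} + \beta \sum_{s=1}^{k-1} \gamma_s \zeta^{(s)} + \beta \sqrt{q - \Gamma_{k-1}^2}\, \zeta^{(k)}, \qquad m^{(k+1)} = \tanh(h^{(k+1)}),$$
$$\phi^{(k+1)} = \frac{m^{(k+1)} - \sum_{s=1}^{k} \langle m^{(k+1)}, \phi^{(s)} \rangle \phi^{(s)}}{\big\| m^{(k+1)} - \sum_{s=1}^{k} \langle m^{(k+1)}, \phi^{(s)} \rangle \phi^{(s)} \big\|} \tag{3.1}$$
and we note that $\phi^{(k+1)}$ is $\mathbb{P}$-a.s. well-defined for all $k$ if $k < N$ [6, Lemma 5]. Finally, the modified interaction matrix $g^{(k+1)}$ is defined by
$$g^{(k+1)} = g^{(k)} - \rho^{(k)},$$
where
$$\rho^{(k)} = g^{(k)} \phi^{(k)} \otimes \phi^{(k)} + \phi^{(k)} \otimes (g^{(k)})^T \phi^{(k)} - \langle g^{(k)} \phi^{(k)}, \phi^{(k)} \rangle\, \phi^{(k)} \otimes \phi^{(k)}.$$
In particular, this means that $\bar{g}^{(k+1)}$ is equal to
$$\bar{g}^{(k+1)} = \bar{g}^{(k)} - \bar{\rho}^{(k)}, \qquad \bar{\rho}^{(k)} = \zeta^{(k)} \otimes \phi^{(k)} + \phi^{(k)} \otimes \zeta^{(k)} - \langle \zeta^{(k)}, \phi^{(k)} \rangle\, \phi^{(k)} \otimes \phi^{(k)}.$$
It is clear that $(\phi^{(s)})_{s=1}^{k}$ forms an orthonormal sequence of vectors in $\mathbb{R}^N$ and we denote by $P^{(k)}$ and $Q^{(k)}$ the corresponding orthogonal projections in $\mathbb{R}^N$, that is
$$P^{(k)} = \sum_{s=1}^{k} \phi^{(s)} \otimes \phi^{(s)} = (P^{(k)}_{ij})_{1 \le i,j \le N}, \qquad Q^{(k)} = 1 - P^{(k)} = (Q^{(k)}_{ij})_{1 \le i,j \le N}.$$
By [6, Lemma 3], $m^{(k)}$ and $\phi^{(k)}$ are $\mathcal{G}_{k-1}$-measurable for all $k \in \mathbb{N}$ and we also have that
$$g^{(k)} \phi^{(s)} = (g^{(k)})^T \phi^{(s)} = \bar{g}^{(k)} \phi^{(s)} = 0, \qquad \forall\, s < k.$$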
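The algebra behind the modified interaction can be checked on a small example. The sketch below applies the update $g^{(k+1)} = g^{(k)} - \rho^{(k)}$ along an orthonormal family and verifies that the resulting matrix annihilates every $\phi^{(s)}$ from both sides. As a simplifying assumption, the vectors $\phi^{(2)}, \phi^{(3)}$ are obtained by Gram-Schmidt from random vectors rather than from Bolthausen's magnetizations $m^{(k)}$; only the linear algebra is illustrated, not the probabilistic construction. The sketch also checks the identity $g^{(k+1)} = Q^{(k)} g\, Q^{(k)}$, which follows from the definition of $\rho^{(k)}$ by a direct computation.

```python
import numpy as np


def inner(x, y):
    """Normalized inner product <x, y> = (1/N) sum_i x_i y_i."""
    return float(x @ y) / x.size


def tensor(x, y):
    """Normalized tensor product: tensor(x, y) @ z == <y, z> x."""
    return np.outer(x, y) / x.size


def deflate(g, phi):
    """One step g -> g - rho, with rho built from g and a unit vector phi."""
    g_phi = g @ phi
    gt_phi = g.T @ phi
    rho = tensor(g_phi, phi) + tensor(phi, gt_phi) - inner(g_phi, phi) * tensor(phi, phi)
    return g - rho


rng = np.random.default_rng(1)
N, k = 50, 3
g = rng.normal(0.0, 1.0 / np.sqrt(N), size=(N, N))
np.fill_diagonal(g, 0.0)

# Stand-in orthonormal family: phi^(1) = 1, then Gram-Schmidt on random vectors.
phis = [np.ones(N)]
for _ in range(k - 1):
    v = rng.normal(size=N)
    for p in phis:
        v = v - inner(v, p) * p
    phis.append(v / np.sqrt(inner(v, v)))

g_k = g.copy()
for phi in phis:
    g_k = deflate(g_k, phi)

P = sum(tensor(p, p) for p in phis)
Q = np.eye(N) - P
print("max |g^(k+1) phi^(s)| =", max(np.abs(g_k @ p).max() for p in phis))
```

The representation $g^{(k+1)} = Q^{(k)} g\, Q^{(k)}$ also makes the covariance structure stated in the next proposition plausible at the level of linear algebra.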
Proposition 3.2. ([6, Prop. 4])
1) Conditionally on $\mathcal{G}_{k-2}$, $g^{(k)}$ and $g^{(k-1)}$ are Gaussian with conditional covariance, given $\mathcal{G}_{k-2}$, equal to
$$\mathbb{E}_{k-2}\, g^{(k)}_{ij} g^{(k)}_{st} = \frac{1}{N} Q^{(k-1)}_{is} Q^{(k-1)}_{jt}.$$
2) Conditionally on $\mathcal{G}_{k-2}$, $g^{(k)}$ is independent of $\mathcal{G}_{k-1}$. In particular, conditionally on $\mathcal{G}_{k-1}$, $g^{(k)}$ is Gaussian with the same covariance as in 1).
3) Conditionally on $\mathcal{G}_{k-1}$, the random variables $\zeta^{(k)}$ are Gaussian with
$$\mathbb{E}_{k-1}\, \zeta^{(k)}_i \zeta^{(k)}_j = Q^{(k-1)}_{ij} + \frac{1}{N} \phi^{(k)}_i \phi^{(k)}_j.$$
The main result of [5] is summarized in the following proposition.
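Proposition 3.3. ([5, Prop. 2.5], [6, Prop. 6]) For every $k \in \mathbb{N}$ and $s < k$, one has
$$\langle m^{(k)}, \phi^{(s)} \rangle \simeq \gamma_s, \qquad \langle m^{(k)}, m^{(s)} \rangle \simeq \alpha_s, \qquad \langle m^{(k)}, m^{(k)} \rangle \simeq q.$$
The next lemma collects a few auxiliary results that are helpful in the sequel.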
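Lemma 3.4. ([6, Lemmas 11, 14, 15(b)])
1) For every $k \in \mathbb{N}$, $\langle \phi^{(k)}, \zeta^{(k)} \rangle = \sqrt{2}\, \langle \phi^{(k)}, g^{(k)} \phi^{(k)} \rangle$ is unconditionally Gaussian with variance $2/N$.
2) For every Lipschitz continuous $f : \mathbb{R} \to \mathbb{R}$ with $|f(x)| \le C(1 + |x|)$ for some $C > 0$, one has for all $k \ge 2$ that
$$\lim_{N\to\infty} \mathbb{E}\, \Big| \frac{1}{N} \sum_{i=1}^{N} f(h^{(k+1)}_i) - E f(\beta\sqrt{q}Z + h) \Big| = 0.$$
3) For every $k \in \mathbb{N}$ and $t > 0$ it holds true that
$$\lim_{N\to\infty} \mathbb{P}\big( \|\zeta^{(k)}\| \ge 1 + t \big) = 0.$$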
4 Conditional Moments of Reduced Partition Function
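Using Bolthausen's magnetizations, we compute in this section the first two conditional moments of a suitably reduced partition function $\widetilde{Z}_N(S)$, for a subset $S \subset \{-1, 1\}^N$ defined below. This will suffice to establish Theorem 1.1, as explained in Section 5.
Let $\varepsilon > 0$ and $k \in \mathbb{N}$ be fixed. We define the set $S_{\varepsilon,k} \subset \{-1, 1\}^N$ through
$$S_{\varepsilon,k} = \Big\{ \sigma \in \{-1, 1\}^N : |\langle \sigma - m^{(k+1)}, \phi^{(s)} \rangle| \le \varepsilon/k,\ \forall\, 1 \le s \le k \Big\}, \tag{4.1}$$
with $\phi^{(s)}, \gamma_s$ from Section 3. We define the reduced partition function $Z_N^{(k+1)}(S_{\varepsilon,k})$ by
$$Z_N^{(k+1)}(S_{\varepsilon,k}) = \sum_{\sigma \in S_{\varepsilon,k}} p_{\mathrm{free}}(\sigma) \exp\Big( \frac{N\beta}{\sqrt{2}} \langle \sigma, g^{(k+1)} \sigma \rangle \Big) = \sum_{\sigma \in S_{\varepsilon,k}} p_{\mathrm{free}}(\sigma) \exp\Big( \frac{N\beta}{2} \langle \sigma, \bar{g}^{(k+1)} \sigma \rangle \Big), \tag{4.2}$$
where $p_{\mathrm{free}} : \{-1, 1\}^N \to (0, 1)$ denotes the coin-tossing measure
$$p_{\mathrm{free}}(\sigma) = \prod_{i=1}^{N} \frac{1}{2}\, \frac{\exp\big( h^{(k+1)}_i \sigma_i \big)}{\cosh\big( h^{(k+1)}_i \big)}. \tag{4.3}$$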
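The following lemma records that $p_{\mathrm{free}}(S_{\varepsilon,k}^c)$ is exponentially small in $N$.
Lemma 4.1. Let $\varepsilon > 0$, $k \in \mathbb{N}$, and let $S_{\varepsilon,k}$ and $p_{\mathrm{free}}$ be defined as in (4.1) and (4.3), respectively. Then, there exist $c, C > 0$, independent of $N$ and $\varepsilon$, such that
$$p_{\mathrm{free}}(S_{\varepsilon,k}) \ge 1 - C e^{-c N \varepsilon^2}. \tag{4.4}$$
Proof. By a standard union bound, we have that
$$p_{\mathrm{free}}(S_{\varepsilon,k}^c) \le k \max_{s=1,\dots,k} p_{\mathrm{free}}\Big( \Big\{ \sigma \in \{-1,1\}^N : \langle \sigma, \phi^{(s)} \rangle - \langle m^{(k+1)}, \phi^{(s)} \rangle > \frac{\varepsilon}{k} \Big\} \Big) + k \max_{s=1,\dots,k} p_{\mathrm{free}}\Big( \Big\{ \sigma \in \{-1,1\}^N : \langle m^{(k+1)}, \phi^{(s)} \rangle - \langle \sigma, \phi^{(s)} \rangle > \frac{\varepsilon}{k} \Big\} \Big),$$
which implies
$$p_{\mathrm{free}}(S_{\varepsilon,k}^c) \le k \max_{s=1,\dots,k} \inf_{\lambda \ge 0} \exp\Big( -N \Big[ \frac{\lambda\varepsilon}{k} - \frac{1}{N} \sum_{i=1}^{N} \log \frac{\cosh\big( h^{(k+1)}_i + \lambda \phi^{(s)}_i \big)}{\cosh\big( h^{(k+1)}_i \big)} + \lambda \langle m^{(k+1)}, \phi^{(s)} \rangle \Big] \Big)$$
$$\qquad + k \max_{s=1,\dots,k} \inf_{\lambda \ge 0} \exp\Big( -N \Big[ \frac{\lambda\varepsilon}{k} - \frac{1}{N} \sum_{i=1}^{N} \log \frac{\cosh\big( h^{(k+1)}_i - \lambda \phi^{(s)}_i \big)}{\cosh\big( h^{(k+1)}_i \big)} - \lambda \langle m^{(k+1)}, \phi^{(s)} \rangle \Big] \Big).$$
Using the pointwise bound $\log\cosh(x + y) \le \log\cosh(x) + y \tanh(x) + \frac{y^2}{2}$ for $x, y \in \mathbb{R}$, $\langle \phi^{(s)}, \phi^{(s)} \rangle = 1$ and the identity $\tanh h^{(k+1)} = m^{(k+1)}$, we obtain
$$p_{\mathrm{free}}(S_{\varepsilon,k}^c) \le 2k \inf_{\lambda \ge 0} \exp\Big( -\frac{N\lambda\varepsilon}{k} + \frac{N\lambda^2}{2} \Big) = 2k\, e^{-N\varepsilon^2/(2k^2)}.$$
This concludes (4.4) for $c = c_k = 1/(2k^2)$, $C = C_k = 2k$.
We notice that the constants $c, C > 0$ in (4.4) are independent of the realization of the disorder $\{g_{ij}\}$. Thus, a.s. in the disorder (so that $\phi^{(s)}$, $s = 1, \dots, k$, and hence $\bar{g}^{(k+1)}$ as well as $p_{\mathrm{free}}$ are well-defined), $S_{\varepsilon,k} \ne \emptyset$ for $N$ large enough.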
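The next lemma determines the first conditional moment of the reduced partition function $Z_N^{(k+1)}(S_{\varepsilon,k})$ and is valid in the full high temperature regime (1.3).
Lemma 4.2. Let $\varepsilon > 0$, $k \in \mathbb{N}$ and let $S_{\varepsilon,k}$, $Z_N^{(k+1)}(S_{\varepsilon,k})$ and $p_{\mathrm{free}}$ be as in (4.1), (4.2) and (4.3), respectively. Assume that $(\beta, h)$ satisfy the AT condition (1.3). Then
$$\lim_{\varepsilon\to 0} \lim_{k\to\infty} \limsup_{N\to\infty} \mathbb{E}\, \Big| \frac{1}{N} \log \mathbb{E}_k Z_N^{(k+1)}(S_{\varepsilon,k}) - \frac{\beta^2}{4}(1 - q)^2 \Big| = 0. \tag{4.5}$$
Proof. By Proposition 3.2, we have that
$$\mathbb{E}_k Z_N^{(k+1)}(S_{\varepsilon,k}) = \sum_{\sigma \in S_{\varepsilon,k}} p_{\mathrm{free}}(\sigma) \exp\Big( \frac{\beta^2 N^2}{4}\, \mathbb{E}_k \langle \sigma, g^{(k+1)} \sigma \rangle^2 \Big) = \sum_{\sigma \in S_{\varepsilon,k}} p_{\mathrm{free}}(\sigma) \exp\Big( \frac{\beta^2 N}{4} \Big( 1 - \sum_{s=1}^{k} \langle \sigma, \phi^{(s)} \rangle^2 \Big)^2 \Big).$$
Centering around $m^{(k+1)}$ yields
$$1 - \sum_{s=1}^{k} \langle \sigma, \phi^{(s)} \rangle^2 = 1 - \sum_{s=1}^{k} \langle m^{(k+1)}, \phi^{(s)} \rangle^2 - 2 \sum_{s=1}^{k} \langle \sigma - m^{(k+1)}, \phi^{(s)} \rangle \langle m^{(k+1)}, \phi^{(s)} \rangle - \sum_{s=1}^{k} \langle \sigma - m^{(k+1)}, \phi^{(s)} \rangle^2$$
so that
$$\sup_{\sigma \in S_{\varepsilon,k}} \Big| \Big( 1 - \sum_{s=1}^{k} \langle \sigma, \phi^{(s)} \rangle^2 \Big) - \Big( 1 - \sum_{s=1}^{k} \langle m^{(k+1)}, \phi^{(s)} \rangle^2 \Big) \Big| \le C \sup_{\sigma \in S_{\varepsilon,k}} \sum_{s=1}^{k} |\langle \sigma - m^{(k+1)}, \phi^{(s)} \rangle| \le C\varepsilon$$
for some $C > 0$, independent of $N$ and $k$. This implies with Lemma 4.1 that
$$\Big| \frac{1}{N} \log \mathbb{E}_k Z_N^{(k+1)}(S_{\varepsilon,k}) - \frac{\beta^2}{4} \Big( 1 - \sum_{s=1}^{k} \langle m^{(k+1)}, \phi^{(s)} \rangle^2 \Big)^2 \Big| \le C\beta^2\varepsilon + \frac{C}{N} \big| \log\big( 1 - C e^{-cN\varepsilon^2} \big) \big|.$$
Moreover, by Proposition 3.3, we have that $\lim_{N\to\infty} \sum_{s=1}^{k} \langle m^{(k+1)}, \phi^{(s)} \rangle^2 = \Gamma_k^2$ in $L^p(d\mathbb{P})$, for any $p \in [1, \infty)$, so that
$$\limsup_{N\to\infty} \mathbb{E}\, \Big| \frac{1}{N} \log \mathbb{E}_k Z_N^{(k+1)}(S_{\varepsilon,k}) - \frac{\beta^2}{4} \big( 1 - \Gamma_k^2 \big)^2 \Big| \le C\beta^2\varepsilon.$$
Finally, since $\lim_{k\to\infty} \Gamma_k^2 = q$ under the AT condition (1.3), by Lemma 3.1, we let $N \to \infty$, then $k \to \infty$ and then $\varepsilon \to 0$ which implies
$$\lim_{\varepsilon\to 0} \lim_{k\to\infty} \limsup_{N\to\infty} \mathbb{E}\, \Big| \frac{1}{N} \log \mathbb{E}_k Z_N^{(k+1)}(S_{\varepsilon,k}) - \frac{\beta^2}{4}(1 - q)^2 \Big| = 0.$$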
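The first step of the proof uses that, for a Gaussian matrix with covariance $\frac{1}{N} Q^{(k)}_{is} Q^{(k)}_{jt}$ (Proposition 3.2 with $k$ replaced by $k+1$), the variance of $\langle \sigma, g^{(k+1)} \sigma \rangle$ equals $\frac{1}{N} \big( 1 - \sum_{s} \langle \sigma, \phi^{(s)} \rangle^2 \big)^2$. The sketch below verifies this identity by contracting the covariance tensor explicitly for a small $N$. As in the earlier illustration, the orthonormal vectors are stand-ins produced by Gram-Schmidt from random vectors, which is an assumption of the example.

```python
import numpy as np


def inner(x, y):
    """Normalized inner product <x, y> = (1/N) sum_i x_i y_i."""
    return float(x @ y) / x.size


rng = np.random.default_rng(2)
N, k = 8, 2

# Stand-in orthonormal family (phi^(1) = 1, then Gram-Schmidt on a random vector).
phis = [np.ones(N)]
for _ in range(k - 1):
    v = rng.normal(size=N)
    for p in phis:
        v = v - inner(v, p) * p
    phis.append(v / np.sqrt(inner(v, v)))

P = sum(np.outer(p, p) / N for p in phis)
Q = np.eye(N) - P
sigma = rng.choice([-1.0, 1.0], size=N)

# Covariance tensor E g_ij g_st = Q_is Q_jt / N.
cov = np.einsum("is,jt->ijst", Q, Q) / N
# Var <sigma, g sigma> = (1/N^2) sum_{ijst} s_i s_j s_s s_t Cov(g_ij, g_st).
variance = np.einsum("i,j,s,t,ijst->", sigma, sigma, sigma, sigma, cov) / N**2
closed_form = (1.0 - sum(inner(sigma, p) ** 2 for p in phis)) ** 2 / N
print(variance, closed_form)
```

Multiplying the variance by $\beta^2 N^2 / 4$ gives the exponent $\frac{\beta^2 N}{4} (1 - \sum_s \langle \sigma, \phi^{(s)} \rangle^2)^2$ that appears in the proof.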
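The following lemma computes the second conditional moment of $Z_N^{(k+1)}(S_{\varepsilon,k})$ under the stronger high temperature condition (1.5).
Lemma 4.3. Let $\varepsilon > 0$, $k \in \mathbb{N}$ and let $S_{\varepsilon,k}$, $Z_N^{(k+1)}(S_{\varepsilon,k})$ and $p_{\mathrm{free}}$ be as in (4.1), (4.2) and (4.3), respectively. Assume that $(\beta, h)$ satisfy the condition (1.5). Then
$$\lim_{\varepsilon\to 0} \lim_{k\to\infty} \limsup_{N\to\infty} \mathbb{E}\, \Big| \frac{1}{N} \log \mathbb{E}_k \Big[ \big( Z_N^{(k+1)}(S_{\varepsilon,k}) \big)^2 \Big] - \frac{\beta^2}{2}(1 - q)^2 \Big| = 0. \tag{4.6}$$
Proof. Proceeding as in the previous lemma, we compute
$$\mathbb{E}_k \Big[ \big( Z_N^{(k+1)}(S_{\varepsilon,k}) \big)^2 \Big] = \sum_{\sigma,\tau \in S_{\varepsilon,k}} p_{\mathrm{free}}(\sigma) p_{\mathrm{free}}(\tau) \exp\Big( \frac{\beta^2 N^2}{4}\, \mathbb{E}_k \big( \langle \sigma, g^{(k+1)} \sigma \rangle + \langle \tau, g^{(k+1)} \tau \rangle \big)^2 \Big)$$
$$= \sum_{\sigma,\tau \in S_{\varepsilon,k}} p_{\mathrm{free}}(\sigma) p_{\mathrm{free}}(\tau) \exp\Big( \frac{\beta^2 N}{4} \Big( 1 - \sum_{s=1}^{k} \langle \sigma, \phi^{(s)} \rangle^2 \Big)^2 + \frac{\beta^2 N}{4} \Big( 1 - \sum_{s=1}^{k} \langle \tau, \phi^{(s)} \rangle^2 \Big)^2 \Big) \times \exp\Big( \frac{\beta^2 N}{2} \Big( \langle \sigma, \tau \rangle - \sum_{s=1}^{k} \langle \sigma, \phi^{(s)} \rangle \langle \phi^{(s)}, \tau \rangle \Big)^2 \Big)$$
$$= \sum_{\sigma,\tau \in S_{\varepsilon,k}} p_{\mathrm{free}}(\sigma) p_{\mathrm{free}}(\tau) \exp\Big( \frac{\beta^2 N}{2} \langle \sigma, Q^{(k)} \tau \rangle^2 \Big) \times \exp\Big( \frac{\beta^2 N}{2} \Big( 1 - \sum_{s=1}^{k} \langle m^{(k+1)}, \phi^{(s)} \rangle^2 \Big)^2 + N\, O(\beta^2 \varepsilon) \Big).$$
Arguing as in the previous lemma, we therefore see that it is enough to show that
$$\mathbb{E}\, \Big| \frac{1}{N} \log \sum_{\sigma,\tau \in S_{\varepsilon,k}} p_{\mathrm{free}}(\sigma) p_{\mathrm{free}}(\tau) \exp\Big( \frac{\beta^2 N}{2} \langle \sigma, Q^{(k)} \tau \rangle^2 \Big) \Big|$$
vanishes when $N \to \infty$. To this end, recall that by definition of $S_{\varepsilon,k}$, we have that
$$\sup_{\tau \in S_{\varepsilon,k}} \Big| \| Q^{(k)} \tau \|^2 - \big( 1 - \langle m^{(k+1)}, P^{(k)} m^{(k+1)} \rangle \big) \Big| \le C\varepsilon \tag{4.7}$$
for some $C > 0$ independent of $N$ and $k$. For fixed $\tau \in S_{\varepsilon,k}$, we therefore find with $\log\cosh(x + y) \le \log\cosh(x) + y \tanh(x) + y^2/2$ for $x, y \in \mathbb{R}$ and Markov's inequality
$$p_{\mathrm{free}}(S_{\varepsilon,k}) \le \sum_{\sigma \in S_{\varepsilon,k}} p_{\mathrm{free}}(\sigma) \exp\Big( \frac{\beta^2 N}{2} \langle \sigma, Q^{(k)} \tau \rangle^2 \Big) \le p_{\mathrm{free}}(S_{\varepsilon,k}) + \int_0^1 dt\, N\beta^2 t\, e^{\frac{N}{2}\beta^2 t^2}\, p_{\mathrm{free}}\big( \big\{ \sigma \in \{-1,1\}^N : |\langle \sigma, Q^{(k)} \tau \rangle| > t \big\} \big)$$
$$\le p_{\mathrm{free}}(S_{\varepsilon,k}) + 2 \int_0^1 dt\, N\beta^2 t\, \exp\Big( \frac{N t^2}{2(1 - q)} \Big[ \beta^2 (1 - q) - 2 + \frac{\| Q^{(k)} \tau \|^2}{1 - q} \Big] \Big) \exp\Big( \frac{N t}{1 - q}\, \| Q^{(k)} m^{(k+1)} \| \Big).$$
In particular, by (4.7) and because
$$\lim_{N\to\infty} \langle m^{(k+1)}, P^{(k)} m^{(k+1)} \rangle = \Gamma_k^2, \qquad \lim_{N\to\infty} \| Q^{(k)} m^{(k+1)} \|^2 = q - \Gamma_k^2,$$
in $L^p(d\mathbb{P})$ for $p \in [1, \infty)$, we obtain under the condition (1.5), i.e. $\beta^2 (1 - q) \le 1$, that
$$\lim_{\varepsilon\to 0} \lim_{k\to\infty} \limsup_{N\to\infty} \mathbb{E}\, \Big| \frac{1}{N} \log \sum_{\sigma,\tau \in S_{\varepsilon,k}} p_{\mathrm{free}}(\sigma) p_{\mathrm{free}}(\tau) \exp\Big( \frac{\beta^2 N}{2} \langle \sigma, Q^{(k)} \tau \rangle^2 \Big) \Big| = 0,$$
which implies (4.6).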
5 Proof of Theorem 1.1
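In this section we prove Theorem 1.1, based on Lemmas 4.2 and 4.3. Before we start, let us first re-center the Hamiltonian $H_N$ appropriately, as outlined in the introduction.
Using the notation of Section 3, we have that
$$\frac{H_N(\sigma)}{N} = \frac{\beta}{2} \langle \sigma, \bar{g} \sigma \rangle + \langle h, \sigma \rangle = \frac{\beta}{2} \langle \sigma, \bar{g}^{(k+1)} \sigma \rangle + \frac{\beta}{2} \sum_{s=1}^{k} \langle \sigma, \bar{\rho}^{(s)} \sigma \rangle + \langle h, \sigma \rangle.$$
In contrast to [6], instead of centering the spins $\sigma$ around $m^{(k+1)}$, we center the spins in $\langle \sigma, \bar{\rho}^{(s)} \sigma \rangle$ around $\gamma_s \phi^{(s)}$ in order to produce the right cavity field $h^{(k+1)}$. Notice that the remaining term $\langle \sigma, \bar{g}^{(k+1)} \sigma \rangle$ contains automatically centered spins around $\sum_{s=1}^{k} \gamma_s \phi^{(s)}$ (which approximately equals $m^{(k+1)}$), as $\bar{g}^{(k+1)} \phi^{(s)} = 0$ for $s < k + 1$. We thus write
$$\langle \sigma, \bar{\rho}^{(s)} \sigma \rangle = 2\gamma_s \langle \sigma, \bar{\rho}^{(s)} \phi^{(s)} \rangle + \langle \sigma - \gamma_s \phi^{(s)}, \bar{\rho}^{(s)} (\sigma - \gamma_s \phi^{(s)}) \rangle - \gamma_s^2 \langle \phi^{(s)}, \bar{\rho}^{(s)} \phi^{(s)} \rangle$$
$$= 2\gamma_s \langle \sigma, \zeta^{(s)} \rangle + \langle \hat{\sigma}^{(s)}, \bar{\rho}^{(s)} \hat{\sigma}^{(s)} \rangle - \gamma_s^2 \langle \phi^{(s)}, \zeta^{(s)} \rangle,$$
which follows from $\bar{\rho}^{(s)} \phi^{(s)} = \zeta^{(s)}$ and where we set $\hat{\sigma}^{(s)} = \sigma - \gamma_s \phi^{(s)}$. Hence
$$\frac{H_N(\sigma)}{N} = \frac{\beta}{2} \langle \sigma, \bar{g}^{(k+1)} \sigma \rangle + \langle h^{(k+1)}, \sigma \rangle + \frac{\beta}{2} \sum_{s=1}^{k} \langle \hat{\sigma}^{(s)}, \bar{\rho}^{(s)} \hat{\sigma}^{(s)} \rangle - \frac{\beta}{2} \sum_{s=1}^{k} \gamma_s^2 \langle \phi^{(s)}, \zeta^{(s)} \rangle + \beta \Big( \gamma_k - \sqrt{q - \Gamma_{k-1}^2} \Big) \langle \sigma, \zeta^{(k)} \rangle.$$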
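Since an exact evaluation of the free energy seems rather involved, let us notice here that for configurations $\sigma \in S_{\varepsilon,k}$ as defined in (4.1) instead, we have approximately $H_N(\sigma)/N \approx \frac{\beta}{2} \langle \sigma, \bar{g}^{(k+1)} \sigma \rangle + \langle h^{(k+1)}, \sigma \rangle$. Indeed, we find that
$$\langle \hat{\sigma}^{(s)}, \bar{\rho}^{(s)} \hat{\sigma}^{(s)} \rangle = 2 \langle \sigma - \gamma_s \phi^{(s)}, \zeta^{(s)} \rangle \big( \langle \phi^{(s)}, \sigma \rangle - \gamma_s \big) - \langle \phi^{(s)}, \zeta^{(s)} \rangle \big( \langle \phi^{(s)}, \sigma \rangle - \gamma_s \big)^2$$
with $\gamma_s \simeq \langle m^{(k+1)}, \phi^{(s)} \rangle$. Similarly, recall that $\langle \phi^{(s)}, \zeta^{(s)} \rangle \sim \mathcal{N}(0, 2/N)$ for each $s$ and
$$\Big| \gamma_k - \sqrt{q - \Gamma_{k-1}^2} \Big| \sup_{\sigma \in \{-1,1\}^N} |\langle \sigma, \zeta^{(k)} \rangle| \le \Big( \gamma_k + \sqrt{q - \Gamma_{k-1}^2} \Big) \| \zeta^{(k)} \|$$
with $\lim_{k\to\infty} |q - \Gamma_k^2| = \lim_{k\to\infty} \gamma_k = 0$ under (1.3), by Lemmas 3.1 and 3.4.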
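Thus, we obtain the simple lower bound
$$\frac{1}{N} \log Z_N = \frac{1}{N} \log \sum_{\sigma \in \{-1,1\}^N} e^{H_N(\sigma)} = \log 2 + \frac{1}{N} \sum_{i=1}^{N} \log\cosh\big( h^{(k+1)}_i \big) + \frac{1}{N} \log \sum_{\sigma \in \{-1,1\}^N} p_{\mathrm{free}}(\sigma)\, e^{H_N(\sigma) - N \langle h^{(k+1)}, \sigma \rangle}$$
$$\ge \log 2 + \frac{1}{N} \sum_{i=1}^{N} \log\cosh\big( h^{(k+1)}_i \big) + \frac{1}{N} \log \sum_{\sigma \in S_{\varepsilon,k}} p_{\mathrm{free}}(\sigma)\, e^{H_N(\sigma) - N \langle h^{(k+1)}, \sigma \rangle}$$
so that
$$\frac{1}{N} \log Z_N \ge \log 2 + \frac{1}{N} \sum_{i=1}^{N} \log\cosh\big( h^{(k+1)}_i \big) + \frac{1}{N} \log Z_N^{(k+1)}(S_{\varepsilon,k}) - \frac{C\beta\varepsilon}{k} \sum_{s=1}^{k} \| \zeta^{(s)} \| - C\beta \sum_{s=1}^{k} \| \zeta^{(s)} \|\, \big| \langle \phi^{(s)}, m^{(k+1)} \rangle - \gamma_s \big|$$
$$\qquad - \frac{\beta}{2} \sum_{s=1}^{k} \gamma_s^2 \langle \phi^{(s)}, \zeta^{(s)} \rangle - \beta \gamma_k \| \zeta^{(k)} \| - \beta \sqrt{q - \Gamma_{k-1}^2}\, \| \zeta^{(k)} \|. \tag{5.1}$$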
We now have all the necessary preparations for the proof of Theorem 1.1.
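The proof relies on the Paley-Zygmund inequality, which for a non-negative random variable $X$ with finite second moment and $\theta \in [0, 1]$ states that $\mathbb{P}(X \ge \theta\, \mathbb{E}X) \ge (1 - \theta)^2 (\mathbb{E}X)^2 / \mathbb{E}X^2$; the case $\theta = 1/2$ gives the factor $1/4$ used below. The following sketch checks the inequality exactly on a small discrete distribution; the distribution itself is an arbitrary illustration.

```python
from fractions import Fraction


def paley_zygmund_sides(values, probs, theta=Fraction(1, 2)):
    """Return (P(X >= theta * E X), (1 - theta)^2 (E X)^2 / E X^2) exactly."""
    mean = sum(p * v for v, p in zip(values, probs))
    second_moment = sum(p * v * v for v, p in zip(values, probs))
    lhs = sum(p for v, p in zip(values, probs) if v >= theta * mean)
    rhs = (1 - theta) ** 2 * mean**2 / second_moment
    return lhs, rhs


# A heavy-tailed toy distribution: most mass at 0, a rare large value.
values = [Fraction(0), Fraction(1), Fraction(10)]
probs = [Fraction(7, 10), Fraction(2, 10), Fraction(1, 10)]
lhs, rhs = paley_zygmund_sides(values, probs)
print("P(X >= E X / 2) =", lhs, ">=", rhs)
```

The bound is only useful when $\mathbb{E}X^2$ is comparable to $(\mathbb{E}X)^2$ on the exponential scale, which is exactly what Lemmas 4.2 and 4.3 provide for the reduced partition function under (1.5).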
Proof of Theorem 1.1. Up to minor modifications, we follow [6, Section 3] and we also abbreviate $RS(\beta, h) = \log 2 + E \log\cosh(\beta\sqrt{q}Z + h) + \beta^2 (1 - q)^2 / 4$.
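The quantity $RS(\beta, h)$ is easy to evaluate numerically once $q$ is known. The sketch below is one possible implementation, assuming Gauss-Hermite quadrature and a fixed-point iteration for (1.4); the function names and numerical parameters are illustrative. The test values $\beta = 0$ and $h = 0$ are limiting sanity checks only, since the paper assumes $h > 0$.

```python
import numpy as np

NODES, WEIGHTS = np.polynomial.hermite_e.hermegauss(80)
WEIGHTS = WEIGHTS / WEIGHTS.sum()  # quadrature rule for Z ~ N(0, 1)


def solve_q(beta, h, iters=2000):
    """Fixed-point iteration for q = E tanh^2(beta*sqrt(q)*Z + h)."""
    q = 0.5  # arbitrary starting value
    for _ in range(iters):
        q = float(WEIGHTS @ np.tanh(beta * np.sqrt(q) * NODES + h) ** 2)
    return q


def rs_free_energy(beta, h):
    """RS(beta, h) = log 2 + E log cosh(beta sqrt(q) Z + h) + beta^2 (1-q)^2 / 4."""
    q = solve_q(beta, h)
    arg = beta * np.sqrt(q) * NODES + h
    log_cosh = np.logaddexp(arg, -arg) - np.log(2.0)  # stable log cosh
    return float(np.log(2.0) + WEIGHTS @ log_cosh + beta**2 * (1.0 - q) ** 2 / 4.0)


print("RS(0.5, 0.5) =", rs_free_energy(0.5, 0.5))
```

At $\beta = 0$ the formula reduces to $\log(2\cosh h)$, and for $h = 0$, $\beta < 1$ the iteration drives $q$ to $0$ so that $RS$ agrees with the annealed value $\log 2 + \beta^2/4$ from the introduction.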
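By the Paley-Zygmund inequality, we have that
$$\mathbb{P}_k\Big( Z_N^{(k+1)}(S_{\varepsilon,k}) \ge \tfrac{1}{2}\, \mathbb{E}_k Z_N^{(k+1)}(S_{\varepsilon,k}) \Big) \ge \frac{\big( \mathbb{E}_k Z_N^{(k+1)}(S_{\varepsilon,k}) \big)^2}{4\, \mathbb{E}_k \big[ \big( Z_N^{(k+1)}(S_{\varepsilon,k}) \big)^2 \big]}.$$
Given $\delta_1 > 0$, Lemmas 4.2 and 4.3 imply
$$\mathbb{P}\Big( \frac{2}{N} \log \mathbb{E}_k Z_N^{(k+1)}(S_{\varepsilon,k}) \ge \frac{1}{N} \log\Big( 4\, \mathbb{E}_k \big[ \big( Z_N^{(k+1)}(S_{\varepsilon,k}) \big)^2 \big] \Big) - \delta_1 \Big) \ge \frac{1}{2}$$
if we choose $\varepsilon > 0$ sufficiently small, $k$ sufficiently large and $N \ge N_1(\varepsilon, k) \in \mathbb{N}$ sufficiently large. This also implies that
$$\mathbb{P}\Big( \mathbb{P}_k\Big( \frac{1}{N} \log Z_N^{(k+1)}(S_{\varepsilon,k}) \ge \frac{1}{N} \log \mathbb{E}_k Z_N^{(k+1)}(S_{\varepsilon,k}) - \frac{\log 2}{N} \Big) \ge e^{-\delta_1 N} \Big) \ge \frac{1}{2}.$$
On the other hand, applying Lemma 3.4 3), we note that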
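$$\lim_{N\to\infty} \mathbb{P}\Big( \frac{1}{k} \sum_{s=1}^{k} \| \zeta^{(s)} \| > 2 \Big) \le \lim_{N\to\infty} \sum_{s=1}^{k} \mathbb{P}\big( \| \zeta^{(s)} \| > 2 \big) = 0$$
and that
$$\lim_{N\to\infty} \mathbb{P}\big( \| \zeta^{(k+1)} \| > 2 \big) = 0.$$
Moreover, Lemma 3.4 and the fact that $\langle \phi^{(s)}, m^{(k+1)} \rangle \simeq \gamma_s$ by Proposition 3.3 imply
$$\lim_{N\to\infty} \Big( C\beta \sum_{s=1}^{k} \| \zeta^{(s)} \|\, \big| \langle \phi^{(s)}, m^{(k+1)} \rangle - \gamma_s \big| + \frac{\beta}{2} \sum_{s=1}^{k} \gamma_s^2 \langle \phi^{(s)}, \zeta^{(s)} \rangle \Big) = 0$$
as well as
$$\lim_{N\to\infty} \frac{1}{N} \sum_{i=1}^{N} \log\cosh h^{(k+1)}_i = E \log\cosh(\beta\sqrt{q}Z + h)$$
in probability. Now, the lower bound (5.1) implies that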
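$$\mathbb{P}\Big( \mathbb{P}_k\Big( \frac{1}{N} \log Z_N \ge \log 2 + \frac{1}{N} \sum_{i=1}^{N} \log\cosh\big( h^{(k+1)}_i \big) + \frac{1}{N} \log \mathbb{E}_k Z_N^{(k+1)}(S_{\varepsilon,k}) - \frac{\log 2}{N} - \mathcal{E}_{\varepsilon,k} \Big) \ge e^{-\delta_1 N} \Big) \ge \frac{1}{2},$$
where we defined the error $\mathcal{E}_{\varepsilon,k}$ by
$$\mathcal{E}_{\varepsilon,k} = \frac{C\beta\varepsilon}{k} \sum_{s=1}^{k} \| \zeta^{(s)} \| + C\beta \sum_{s=1}^{k} \| \zeta^{(s)} \|\, \big| \langle \phi^{(s)}, m^{(k+1)} \rangle - \gamma_s \big| + \frac{\beta}{2} \sum_{s=1}^{k} \gamma_s^2 \langle \phi^{(s)}, \zeta^{(s)} \rangle + \beta \gamma_k \| \zeta^{(k)} \| + \beta \sqrt{q - \Gamma_{k-1}^2}\, \| \zeta^{(k)} \|.$$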
2> 0, we may choose ε > 0 sufficiently small, k ∈ N sufficiently large and N ≥ N
2(k, ε) ∈ N sufficiently large such that
P
|E
ε,k| ≤ δ
24 ,
1 N
N
X
i=1