Variational characterization of the critical curve for pinning of random polymers

(1)

2013, Vol. 41, No. 3B, 1767–1805 DOI:10.1214/11-AOP727

©Institute of Mathematical Statistics, 2013

VARIATIONAL CHARACTERIZATION OF THE CRITICAL CURVE FOR PINNING OF RANDOM POLYMERS

BYDIMITRISCHELIOTIS1 ANDFRANK DENHOLLANDER

University of Athens and Leiden University

In this paper we look at the pinning of a directed polymer by a one-dimensional linear interface carrying random charges. There are two phases, localized and delocalized, depending on the inverse temperature and on the disorder bias. Using quenched and annealed large deviation principles for the empirical process of words drawn from a random letter sequence according to a random renewal process [Birkner, Greven and den Hollander,Probab. The-ory Related Fields148(2010) 403–456], we derive variational formulas for the quenched, respectively, annealed critical curve separating the two phases. These variational formulas are used to obtain a necessary and sufficient cri-terion, stated in terms of relative entropies, for the two critical curves to be different at a given inverse temperature, a property referred to as relevance of the disorder. This criterion in turn is used to show that the regimes of relevant and irrelevant disorder are separated by a unique inverse critical temperature. Subsequently, upper and lower bounds are derived for the inverse critical tem-perature, from which sufficient conditions under which it is strictly positive, respectively, finite are obtained. The former condition is believed to be nec-essary as well, a problem that we will address in a forthcoming paper.

Random pinning has been studied extensively in the literature. The present paper opens up a window with a variational view. Our variational formulas for the quenched and the annealed critical curve are new and provide valu-able insight into the nature of the phase transition. Our results on the inverse critical temperature drawn from these variational formulas are not new, but they offer an alternative approach, that is, flexible enough to be extended to other models of random polymers with disorder.

1. Introduction and main results.

1.1. Introduction.

I. Model. LetS=(Sn)n∈N0 be a Markov chain on a countable state spaceS in

which a given point is marked 0 (N0=N∪ {0}). WritePto denote the law ofS

givenS0=0 andEthe corresponding expectation. LetK denote the distribution

Received January 2011; revised July 2011.

1_{Supported in part by the DFG-NWO Bilateral Research Group “Mathematical Models from}

Physics and Biology.”

MSC2010 subject classifications.Primary 60F10, 60K37; secondary 82B27, 82B44.

Key words and phrases.Random polymer, random charges, localization vs. delocalization, quenched vs. annealed large deviation principle, quenched vs. annealed critical curve, relevant vs. irrelevant disorder, critical temperature.

(2)

of the first return time ofSto 0, that is,

K(n):=P(Sn=0, Sm=0∀0< m < n), n∈N.

(1.1)

We will assume that_n_∈_NK(n)=1 (i.e., 0 is a recurrent state) and

lim

n→∞

logK(n)

logn = −(1+α) for someα∈ [0,∞).

(1.2)

Letω=(ωk)k∈N0 be i.i.d. R-valued random variables with marginal

distribu-tionμ0. WriteP=μ⊗0N0to denote the law ofω, andEto denote the corresponding

expectation. We will assume that

M(λ):=E(eλω0_{) <}∞ ∀_λ∈R_,

(1.3)

and thatμ0has mean 0 and variance 1.

Letβ∈ [0,∞)andh∈R, and for fixedωdefine the lawPβ,h,ωn on{0} ×Sn,

the set ofn-steps paths inSstarting from 0, by putting

dPβ,h,ωn

dPn

((Sk)nk=0):=

1

Zβ,h,ωn

exp

_n₋₁

k=0

(βωk−h)1{Sk=0}

1_{Sn=0},

(1.4)

wherePnis the projection ofPonto{0} ×Sn. Here,β plays the role of the inverse

temperature,hthe role of the disorder bias, whileZnβ,h,ωis the normalizing

parti-tion sum. Note thatk=0 contributes to the sum, whilek=ndoes not. Also note that the path is tied to 0 at both ends. This is done for later convenience.

REMARK1.1. Note that (1.2) impliesp:=gcd[supp(K)] =1. Ifp≥2, then the model can be trivially restricted topN, so there is no loss of generality. More-over, if n∈NK(n) <1, then the model can be reduced to the recurrent case by

a shift of h. Similarly, the restriction to μ0 with mean 0 and variance 1 can be

removed by a scaling ofβand a shift ofh.

REMARK 1.2. The key example of the above setting is the simple random

walk onZ, for which p=2 andα=1₂ (Spitzer [19], Section 1). In that case the process(n, Sn)n∈N0 can be thought of as describing a directed polymer inN0×Z,

that is, pinned to the interfaceN0× {0}by random chargesβω−h; see Figure1.

When the polymer hits the interface at timek, it picks up a reward exp[βωk−h],

which can be either>1 or<1, depending on the value ofωk. Forh≤0 the polymer

(3)

FIG. 1. A directed polymer sampling random charges at an interface.

II. Free energy and phase transition. Thequenched free energyis defined as

fque(β, h):= lim

n→∞

1

nlogZ

β,h,ω

n .

(1.5)

Standard subadditivity arguments show that the limit existsω-a.s. and inP-mean, and is nonrandom; see, for example, Giacomin [11], Chapter 5, and den Hol-lander [8], Chapter 11. Moreover,fque(β, h)≥0 becauseZnβ,h,ω≥eβω0−hK(n),

n∈N, and limn→∞1_nlogK(n)=0 by (1.2). The lower boundfque(β, h)=0 is

attained whenSvisits the state 0 only rarely. This motivates the definition of two quenched phases,

L:= {(β, h):fque(β, h) >0},

(1.6)

D:= {(β, h):fque(β, h)=0},

referred to as thelocalizedphase, respectively, thedelocalizedphase.

Sinceh→fque(β, h)is nonincreasing for everyβ∈ [0,∞), the two phases are separated by aquenched critical curve

hque_c (β):=inf{h:fque(β, h)=0}, β∈ [0,∞),

(1.7)

withLthe region below the curve andDthe region on and above. Since(β, h)→ fque(β, h)is convex andD= {(β, h):fque(β, h)≤0}is a level set offque, it fol-lows thatDis a convex set andhquec is a convex function. Sinceβ=0 corresponds

to a homopolymer, we havehquec (0)=0; seeAppendix A. It was shown in

Alexan-der and Sidoravicius [2] thathquec (β) >0 forβ ∈(0,∞). Therefore we have the

qualitative picture drawn in Figure2. We further remark that limβ→∞h que c (β)/β

is finite if and only if supp(μ0)is bounded from above.

The mean value of the disorder isE(βω0−h)= −h. Thus, we see from Figure2

that for the random pinning model localization may even occur formoderately neg-ativemean values of the disorder, contrary to what happens for the homogeneous pinning model, where localization occurs only for a strictly positive parameter; see

Appendix A. In other words, even a globally repulsive random interface can pin

(4)

0 β

h

L D

FIG. 2. Qualitative plot ofβ→hquec (β).The fine details of this curve are not known.

Theannealed free energyis defined by

fann(β, h):= lim

n→∞

1

nlogE(Z

β,h,ω

n ).

(1.8)

Since

E(Zβ,h,ω_n )=E

exp

_n₋₁

k=0

[logM(β)−h]1_{Sk=0}

1_{Sn=0}

,

(1.9)

we have that fann(β, h) is the free energy of the homopolymer with parameter logM(β)−h. The associatedannealed critical curve

hann_c (β):=inf{h:fann(β, h)=0}, β∈ [0,∞),

(1.10)

therefore equals

hann_c (β)=logM(β).

(1.11)

Sincefque≤fann, we havehquec ≤hannc .

DEFINITION1.3. The disorder is said to berelevantfor a given choice ofK, μ0 andβwhenh

que

c (β) < hannc (β), otherwise it is said to beirrelevant.

Note: In the physics literature, the term relevant disorder is reserved for the situation where the disorder not only changes the critical value but also changes the behavior of the free energy near the critical value. In the present paper we adopt the more narrow definition above.

Our main focus in the present paper will be on derivingvariational formulas forhquec andhannc , and on investigating under what conditions onK,μ0 andβ the

(5)

1.2. Main results. This section contains three theorems and four corollaries, all valid subject to (1.2) and (1.3). To state these we need some further notation.

I. Notation. Abbreviate

E:=supp[μ0] ⊂R.

(1.12)

LetE:= _k_∈_NEk be the set offinite wordsconsisting oflettersdrawn fromE. LetP(EN)denote the set of probability measures oninfinite sentences, equipped with the topology of weak convergence. Writeθ for the left-shift acting onEN, andPinv(EN)for the set of probability measures that are invariant underθ.

ForQ∈Pinv(EN), letπ1,1Q∈P(E)denote the projection ofQonto the first

letter of the first word. Define the set

C:=

Q∈Pinv(EN):

E|

x|d(π1,1Q)(x) <∞

,

(1.13)

and on this set the function

(Q):=

E

xd(π1,1Q)(x), Q∈C.

(1.14)

We also need two rate functions on Pinv(EN), denoted by Iann andIque, which will be defined in Section2. These are the rate functions of the annealed and the quenched large deviation principles that play a central role in the present paper, and they satisfyIque≥Iann.

II. Theorems. With the above ingredients, we obtain the following characteri-zation of the critical curves.

THEOREM1.4. Fixμ0 andK.For allβ∈ [0,∞),

hque_c (β)=sup

Q∈_C[

β(Q)−Ique(Q)],

(1.15)

hann_c (β)=sup

Q∈C

[β(Q)−Iann(Q)].

(1.16)

We know that hann_c (β) = logM(β). However, the variational formula for

hann_c (β)will be important for the comparison withhquec (β).

Next, forβ∈ [0,∞)define the probability measures

dμβ(x):=

1

M(β)e

βx_d_μ

0(x), x∈E,

(1.17)

and

dqβ(x1, x2, . . . , xn):=K(n)dμβ(x1)dμ0(x2)× · · · ×dμ0(xn),

(1.18)

n∈N, x1, x2, . . . , xn∈E.

Further, let Qβ :=qβ⊗N∈Pinv(EN). Then Q0 is the probability measure under

(6)

0 β

h

hquec (β)

hann_c (β)

βc

FIG. 3. Uniqueness of the critical inverse temperatureβc.

fromμ0, whileQβ differs fromQ0 in that the first letter of each word is drawn

from the tilted probability distributionμβ. We will see thatQβ is the unique

max-imizer of the supremum in (1.16) [note thatQβ ∈Cbecause of (1.3)]. This leads

to the followingnecessary and sufficient criterion for disorder relevance.

THEOREM1.5. Fixμ0 andK.For allβ∈ [0,∞),

hque_c (β) < hann_c (β) ⇐⇒ Ique(Qβ) > Iann(Qβ).

(1.19)

What is appealing about (1.19) is that the gap between Ique andIann needs to be establishedonlyfor the measureQβ, which has a simple and explicit form. We

will see that the supremum in (1.15) is attained, which is to be interpreted as saying that there is a localization strategyatthe quenched critical line.

Disorder relevance ismonotoneinβ; see Figure3.

THEOREM1.6. For allμ0andKthere exists aβc=βc(μ0, K)∈ [0,∞]such

that

hque_c (β)

₌_hann

c (β), ifβ∈ [0, βc],

< hann_c (β), ifβ∈(βc,∞).

(1.20)

III. Corollaries. From Theorems1.4–1.6we draw four corollaries. Abbreviate

χ:=

n∈N

[P(Sn=0)]2, w:=sup[supp(μ0)].

(1.21)

COROLLARY1.7. Ifα=0,thenβc= ∞for allμ0.

COROLLARY1.8. Ifα∈(0,∞),then the following bounds hold:

(i) βc≥βc∗withβc∗=βc∗(μ0, K)∈ [0,∞]given by

β_c∗:=0∨sup{β:M(2β)/M(β)2<1+χ−1}.

(7)

(ii) βc≤βc∗∗withβc∗∗=βc∗∗(μ0, K)∈(0,∞]given by

β_c∗∗:=inf{β:h(μβ|μ0) > h(K)},

(1.23)

whereh(μβ|μ0)=

Elog(dμβ/dμ0)dμβ is the relative entropy of μβ w.r.t.μ0,

andh(K):= −_n_∈_NK(n)logK(n)is the entropy ofK.

COROLLARY1.9. Ifα∈(0,∞)andχ <∞,thenβc>0for allμ0.

COROLLARY1.10. Ifα∈(0,∞),thenβc<∞for allμ0 withμ0({w})=0

(which includesw= ∞).

We close with a conjecture stating that the conditionχ <∞in Corollary1.9is not only sufficient forβc>0 but also necessary. This conjecture will be addressed

in a forthcoming paper.

CONJECTURE1.11. Ifα∈(0,∞)andχ= ∞,thenβc=0for allμ0.

1.3. Discussion.

I. What is known from the literature? Before discussing the results in Sec-tion 1.2, we give a summary of what is known about the issue of relevant vs. irrelevant disorder from the literature. This summary is drawn from the papers by Alexander [1], Toninelli [20, 21], Giacomin and Toninelli [14], Derrida, Giacomin, Lacoin and Toninelli [9], Alexander and Zygouras [3, 4], Giacomin, Lacoin and Toninelli [12, 13] and Lacoin [18].

THEOREM1.12. Suppose that condition(1.2)is strengthened to

K(n)=n−(1+α)L(n)

(1.24)

withα∈ [0,∞)andLstrictly positive and slowy varying at infinity.

Then:

(1) βc=0whenα∈(1₂,∞).

(2) βc=0whenα=1₂ andlimn→∞[logn]δ−1L2(n)=0for someδ >0.

(3) βc>0whenα=1₂ andn∈Nn−1[L(n)]−2<∞.

(4) βc>0whenα∈(0,1₂).

(5) βc= ∞whenα=0.

The results in Theorem 1.12 hold irrespective of the choice of μ0; see

Re-mark 1.13 below. Toninelli [21] proves that if logM(λ)∼Cλγ as λ→ ∞ for someC∈(0,∞)andγ ∈(1,∞), thenβc<∞irrespective ofα∈(0,∞)andL.

Note that there is a small gap between cases (2) and (3) at the critical threshold

(8)

For the cases of relevant disorder, bounds on the gap between hann_c (β) and

hquec (β)have been derived in the above cited papers subject to (1.24). As β↓0,

this gap decays like

hann_c (β)−hque_c (β) ⎧ ⎪ ⎨ ⎪ ⎩

β2, ifα∈(1,∞), β2ψ (1/β), ifα=1, β2α/(2α−1), ifα∈1₂,1 (1.25)

for all choices ofL, withψslowly varying and vanishing at infinity whenL(∞)∈ (0,∞).

Partial results are known for α = 1₂. For instance, it is shown in Giacomin, Lacoin and Toninelli [13] that, under the condition in Theorem1.12(2), the gap decays faster than any polynomial, namely, roughly like exp[−β−2/δ], β ↓ 0, when L2(n) [logn]1−δ, n→ ∞. This implies that the disorder can at most bemarginally relevant, a situation where standard perturbative arguments do not work.

REMARK1.13. Some of the above mentioned results are proved for Gaussian disorder only, and are claimed to be true for arbitrary disorder subject to (1.3). Full proofs for arbitrary disorder are in [9, 13, 18, 21].

REMARK 1.14. The fact thatα= 1₂ is critical for relevant vs. irrelevant dis-order is in accordance with the so-calledHarris criterionfor disordered systems (see Harris [17]): “Arbitrary weak disorder modifies the nature of a phase transi-tion when the order of the phase transitransi-tion in the nondisordered system is<2.” The order of the phase transition for the homopolymer, which is briefly described inAppendix A, is<2 precisely whenα∈(1₂,∞)(see Giacomin [11], Chapter 2). This link is emphasized in Toninelli [20].

II. What is new in the present paper? The main importance of our results in Section 1.2is that theyopen up a new window on the random pinning problem. Whereas the results cited in Theorem1.12are derived with the help of a variety of estimation techniques, like fractional moment estimates and trial choices of local-ization strategies, Theorem1.4gives avariational characterizationof the critical curves, that is, new. (It is very rare indeed that critical curves for disordered sys-tems allow for a direct variational representation.) Theorem1.5gives a necessary and sufficient criterion for disorder relevance that, although not easy to handle, at least is explicit and offers a different handle. Theorem1.6shows that unique-ness of the inverse critical temperature is a direct consequence of this criterion, while Corollaries1.7–1.10show that the criterion can be used to obtain important information on the inverse critical temperature.

(9)

REMARK 1.16. Corollary 1.7 is the main result in Alexander and Zy-gouras [4].

REMARK1.17. Since (see Section8)

lim

β↓0M(2β)/M(β)

2₌₁_, _lim

β→∞h(μβ|μ0)=log[1/μ0({w})],

(1.26)

with the understanding that the second limit is∞whenμ0({w})=0, Corollary1.8

implies Corollaries1.9and1.10. Corollary1.10was noted also in Alexander and Zygouras [4].

REMARK1.18. Note thatχ=E(|I1∩I2|)withI1, I2two independent copies

of the set of return times ofS [recall (1.1)]. Thus, according to Corollary1.9and Conjecture 1.11, βc>0 is expected to be equivalent to the renewal process of

joint return times to be recurrent. Note that 1/P(I1 ∩I2 =∅)=1+χ−1 (see

Spitzer [19], Section 1), the quantity appearing in Corollary1.8(i).

REMARK 1.19. If μ0 is Bernoulli(1/2) on {−1,1}, (1.26) gives that

limβ→∞h(μβ|μ0)=log 2. For anyα >0, we can find a distributionK that

sat-isfies (1.2) andH (K) <log 2, and thus (1.23) implies thatβc=βc(μ0, K) <∞.

This shows that for α >0, the condition μ0({w})=0 is not (!) necessary for

βc<∞.

REMARK 1.20. As shown in Doney [10], subject to the condition of regular variation in (1.24),

P(Sn=0)∼

Cα

n1−α_L(n)

(1.27)

asn→ ∞withCα=(α/π )sin(απ )whenα∈(0,1).

Hence the condition χ <∞ in Corollary 1.9 is satisfied exactly for α ∈(0,1₂)

andL arbitrary, and for α= 1₂ andn∈Nn−1[L(n)]−2 <∞. This fits precisely

with cases (3) and (4) in Theorem1.12.

REMARK1.21. Corollary1.8(ii) is essentially Corollary 3.2 in Toninelli [21],

where the condition for relevance, h(μβ|μ0) > h(K), is given in an equivalent

form (see equation (3.6) in [21]). Note that, by (1.2),h(K) <∞whenα∈(0,∞).

(10)

FIG. 4. Cutting words out from a sequence of letters according to renewal times.

In Section5we reformulate this criterion to put it into a form, that is, more con-venient for computations. In Section6we use the latter to prove Theorem1.6. In Sections7–8we prove Corollaries1.7–1.10.Appendix Acollects a few standard facts about the homopolymer, whileAppendix Bprovides the details of the proof of a key lemma in Section3based on an approximation argument in [6].

2. Annealed and quenched LDP. In this section we recall the main results from Birkner, Greven and den Hollander [6] that are needed in the present paper. Section 2.1introduces the relevant notation, while Sections2.2and2.3state the relevant annealed and quenched LDP’s.

2.1. Notation. LetEbe a Polish space, playing the role of an alphabet, that is, a set ofletters. LetE:= _k_∈_NEkbe the set offinite wordsdrawn fromE, which can be metrized to become a Polish space.

Fixμ0∈P(E), andK∈P(N)satisfying (1.2). LetX=(Xk)k∈N0 be i.i.d.E

-valued random variables with marginal law μ0, and τ =(τi)i∈N i.i.d. N-valued

random variables with marginal lawK. Assume thatXandτare independent, and writeP∗to denote their joint law. Cut words out of the letter sequenceXaccording toτ (see Figure4), that is, put

T0:=0 and Ti:=Ti−1+τi, i∈N,

(2.1)

and let

Y(i):=(XTi−1, XTi−1+1, . . . , XTi−1), i∈N.

(2.2)

Under the lawP∗,Y =(Y(i))i∈N is an i.i.d. sequence of words with marginal

dis-tributionq0onEgiven by

dq0(x1, . . . , xn)

:=P∗Y(1)∈(dx1, . . . ,dxn)

(2.3)

=K(n)dμ0(x1)× · · · ×dμ0(xn), n∈N, x1, . . . , xn∈E.

The reverse operation of cuttingwords out of a sequence of letters isglueing words together into a sequence of letters. Formally, this is done by defining a con-catenationmapκfromENtoEN0_{. This map induces in a natural way a map from} P(EN)toP(EN0₎_{, the sets of probability measures on}_EN_and_EN0_{(endowed with}

the topology of weak convergence). The concatenationq₀⊗N◦κ−1 ofq₀⊗N equals

μN0

(11)

2.2. Annealed LDP. Let Pinv(EN) be the set of probability measures on

EN that are invariant under the left-shift θ acting on EN. For N ∈ N, let

(Y(1), . . . , Y(N ))perbe the periodic extension of theN-tuple(Y(1), . . . , Y(N ))∈EN

to an element ofEN, and define

RN :=

1

N

N−1

i=0

δθi_(Y(1)_,...,Y(N )₎per∈Pinv(EN).

(2.4)

This is the empirical process of N-tuples of words. The following annealed LDP is standard; see, for example, Dembo and Zeitouni [7], Section 6.5. For

Q∈Pinv(EN), letH (Q|q₀⊗N)be thespecific relative entropy ofQw.r.t.q₀⊗N de-fined by

H (Q|q₀⊗N):= lim

N→∞

1

Nh(πNQ|πNq ⊗N

0 ),

(2.5)

whereπNQ∈P(EN)denotes the projection ofQ onto the firstN words, h(·|·)

denotes relative entropy, and the limit is nondecreasing.

THEOREM2.1. The familyP∗(RN ∈ ·),N∈N,satisfies the LDP onPinv(EN)

with rateN and with rate functionIanngiven by

Iann(Q):=H (Q|q₀⊗N), Q∈Pinv(EN).

(2.6)

This rate function is lower semi-continuous,has compact level sets,has a unique zero atq₀⊗N,and is affine.

2.3. Quenched LDP. To formulate the quenched analog of Theorem 2.1, we need some more notation. Let Pinv(EN0₎ _{be the set of probability measures on} EN0_{that are invariant under the left-shift}_θ _{acting on}_EN0_{. For}_Q∈Pinv₍_EN₎_such

thatmQ:=EQ(τ1) <∞(whereEQdenotes expectation under the law Qandτ1

is the length of the first word), define

Q:=

1

mQ

EQ _τ₁₋₁

k=0

δ_θk_{κ(Y )}

∈Pinv_(EN0_).

(2.7)

Think ofQ as the shift-invariant version of Q◦κ−1 obtained after

randomiz-ing the location of the origin. This randomization is necessary because a shift-invariantQin general does not give rise to a shift-invariantQ◦κ−1.

For tr∈N, let[·]tr:E→ [E]tr= tr_n₌₁Endenote thetruncation mapon words defined by

y=(x1, . . . , xn)→ [y]tr:=(x1, . . . , xn∧tr), n∈N, x1, . . . , xn∈E,

(2.8)

(12)

[E]N_tr, and fromPinv(EN)toPinv([E]N_tr). Note that ifQ∈Pinv(EN), then[Q]tris an element of the set

Pinv,fin₍_EN₎_{= {}_Q_∈_Pinv₍_EN₎_:_m

Q<∞}.

(2.9)

THEOREM2.2. (Birkner,Greven and den Hollander[6])Assume(1.2).Then, for μ⊗N0

0 -a.s. all X, the family of(regular) conditional probability distributions

P∗(RN ∈ ·|X),N∈N,satisfies the LDP onPinv(EN)with rateN and with

deter-ministic rate functionIque given by

Ique(Q):=

Ifin(Q), ifQ∈Pinv,fin(EN),

lim

tr→∞I

fin₍_[_Q_]tr_), _otherwise_,

(2.10)

where

Ifin(Q):=H (Q|q₀⊗N)+αmQH (Q|μ⊗₀N0).

(2.11)

This rate function is lower semi-continuous,has compact level sets,has a unique zero atq₀⊗Nand is affine.

There is no closed form expression forIque(Q)whenmQ= ∞. For later

refer-ence we remark that, for allQ∈Pinv(EN),

Iann(Q)= lim

tr→∞I

ann₍_[_Q_]tr₎₌_sup

tr∈N

Iann([Q]tr),

(2.12)

Ique(Q)= lim

tr→∞I

que₍_[_Q_]tr₎₌_sup

tr∈N

Ique([Q]tr)

as shown in [6], Lemma A.1. A remarkable aspect of (2.11) in relation to (2.6) is that itquantifiesthe difference betweenIqueandIann. Note the explicit appearance of the tail exponentα. Also note thatIque=Iannwhenα=0.

3. Variational formulas: Proof of Theorem 1.4. In Section 3.1 we prove (1.16), the variational formula for the annealed critical curve. The proof of (1.15) in Sections 3.2–3.4, the variational formula for the quenched critical curve, is longer. In Section3.2we first give the proof for μ0 with finite support.

In Section3.3we extend the proof toμ0 satisfying (1.3). In Section3.4we prove

three technical lemmas that are needed in Section3.3.

3.1. Proof of(1.16).

PROOF. Recall from (1.17) and (1.18) that Qβ=qβ⊗N, and from (1.11) that

hann_c (β)=logM(β). Below we show that for everyQ∈Pinv(EN),

β(Q)−Iann(Q)=logM(β)−H (Q|Qβ).

(13)

Taking the supremum overQ, we arrive at (1.16). Note that the unique probability measure that achieves the supremum in (3.1) is Qβ, which is an element of the

setCdefined in (1.13) because of (1.3).

To get (3.1), note that H (Q|Qβ) is the limit as N → ∞ of [recall (1.17)

and (1.18)] 1

N

EN log _d_(π

NQ)

d(πNQβ)

(y1, . . . , yN)

d(πNQ)(y1, . . . , yN)

= 1

N

EN log _d_(π

NQ)

d(πNQ0)

(y1, . . . , yN)

× M(β)N

eβ[c(y1)+···+c(yN)]

d(πNQ)(y1, . . . , yN)

(3.2)

=logM(β)+ 1

Nh(πNQ|πNQ0)

−β 1 N

EN[c(y1)+ · · · +c(yN)]d(πNQ)(y1, . . . , yN),

where,c(y)denotes the first letter of the wordy. In the last line of (3.2), the limit as

N → ∞of the second quantity isH (Q|Q0)=Iann(Q), while the integral equals

N (Q)by shift-invariance ofQ. Thus, (3.1) follows.

3.2. Proof of(1.15)forμ0with finite support.

PROOF. The proof comes in three steps.

Step 1: An alternative way to compute the quenched free energy fque(β, h)

from (1.5) is through the radius of convergencezque(β, h)of the power series

n∈N

znZ_nβ,h,ω,

(3.3)

because

zque(β, h)=e−fque(β,h).

(3.4)

Write

Z_nβ,h,ω=

N∈N

0=k0<k1<···<kN=n

N

i=1

K(ki−ki−1)eβωki−1−h,

(3.5)

so that, forz∈(0,∞),

n∈N

znZ_nβ,h,ω=

N∈N

F_Nβ,h,ω(z),

(3.6)

where we abbreviate

F_Nβ,h,ω(z):=

0=k0<···<kN<∞

N

i=1

zki−ki−1_K(k

i−ki−1)eβωki−1−h.4

(14)

Step 2: We return to the setting of Section2. The letter space isE, the word space isE= _k_∈_NEk, the sequence of letters isω=(ωk)k∈N0, while the sequence

of renewal times is(Ti)i∈N0 =(ki)i∈N0. Each intervalIi:= [ki−1, ki)of integers

cuts out a wordωIi :=(ωki−1, . . . , ωki−1). Let

R_Nω =R_Nω((ki)Ni=0):=

1

N

N−1

i=0

δθi_(ω

I₁,...,ω_IN)per

(3.8)

denote the empirical process of N-tuples of words in ω cut out by the first N

renewals. Then we can rewriteF_Nβ,h,ω(z)as

F_Nβ,h,ω(z)=E

exp

N

E

τ (y)logz+βc(y)−hd(π1RNω)(y)

(3.9)

=e−N hEexp[N mR_Nωlogz+Nβ(RωN)]

,

where τ (y)and c(y) are the length, respectively, the first letter of the word y,

π1R_Nω is the projection of R_Nω onto the first word, while mRω_N and (RNω) are

the average word length, respectively, the average first letter of the first word un-derRω_N.

To identify the radius of convergence of the series in the left-hand side of (3.6), we apply the root test for the series in the right-hand side of (3.6) using the expres-sion in (3.9). To that end, let

Sque(β;z):=lim sup

N→∞

1

N logE

exp[N mR_Nω logz+Nβ(RNω)]

.

(3.10)

Then

lim sup

N→∞

1

N logF

β,h,ω

N (z)= −h+S

que

(β;z).

(3.11)

We know from (3.4) and the nonnegativity offque(β, h)thatzque(β, h)≤1, and we are interested in knowing when it is<1, respectively,=1 [recall (1.6)]. Hence, the sign of the right-hand side of (3.11) for z↑1 will be important as the next lemma shows.

LEMMA3.1. For allβ∈ [0,∞)andh∈R,

Sque(β;1−) < h ⇒ f (β, h)=0,

(3.12)

Sque(β;1−) > h ⇒ f (β, h) >0.

PROOF. The first line holds because, by (3.11),−h+Sque(β;1−) <0 implies that the sums in (3.6) converge for |z|<1, so that zque(β, h)≥1, which gives

fque(β, h)≤0. The second line holds because if−h+Sque(β;1−) >0, then there exists az0<1 such that−h+Sque(β;z0) >0, which implies that the sums in (3.6)

(15)

0 z

Sque(β;z)

hquec (β)

1

∞

FIG. 5. Qualitative plot ofz→Sque(β;z).

Lemma3.1implies that

hque_c (β)=Sque(β;1−).

(3.13)

The rest of the proof is devoted to computingSque(β;1−).

Step 3: Since μ0 has finite support,Q→(Q) is continuous. Therefore we

can apply Varadhan’s lemma to the expression in (3.10) forz=1 using the LDP of Theorem2.2. This gives

Sque(β;1)= sup

Q∈Pinv₍_EN₎

[β(Q)−Ique(Q)].

(3.14)

We would like to do the same for (3.10) withz <1, and subsequently take the limit

z↑1, to get (see Figure5)

Sque(β;1−)= sup

Q∈Pinv₍_EN₎

[β(Q)−Ique(Q)].

(3.15)

However, even thoughQ→(Q)is continuous (becauseμ0 has finite support),

Q→mQ is only lower semicontinuous. Therefore we proceed by first showing

that the termN mRω_Nlogzin (3.10) is harmless in the limit asz↑1.

LEMMA3.2. Sque(β;1−)=Sque(β;1)for allβ∈ [0,∞).

PROOF. SinceSque(β;1−)≤Sque(β;1), we need only prove the reverse

in-equality. The idea is to show that, for any Q∈Pinv(EN) and in the limit as

N → ∞, Rω_N can be arbitrarily close toQwith probability ≈exp[−N Ique(Q)]

whilemR_Nω remains bounded by a large constant. Therefore, letting N→ ∞

fol-lowed by z↑1, we can remove the term N mRω_Nlogz in (3.10). The details are

given inAppendix B.

(16)

3.3. Proof of (1.15) for μ0 satisfying (1.3). The proof stays the same up

to (3.13). Henceforth write C=C(μ0) to exhibit the fact that the setC in (1.13)

depends onμ0via its supportEin (1.12), and define

A(β):= sup

Q∈_C(μ0)

[β(Q)−Ique(Q)],

(3.16)

which replaces the right-hand side of (3.15). We will show the following.

LEMMA3.3. Sque(β;1−)=A(β)for allβ∈(0,∞).

PROOF. The proof of the lemma is accomplished in four steps. Along the way

we use three technical lemmas, the proof of which is deferred to Section3.4. Our starting point is the validity of the claim for μ0 with finite support obtained in

Lemma3.2. (Note that|E|<∞impliesC=C(μ0)=Pinv(EN).)

Step1:Sque(β;1−)≤A(β)for allβ∈(0,∞)whenμ0satisfies (1.3).

PROOF. We have Sque(β;1−)≤Sque(β;1). We will show that Sque(β;1)≤ A(pβ)/p for allp >1. Takingp↓1 and using the continuity of A, proven in Lemma3.4below, we get the claim.

ForM >0, let

M(Q):=

E

(x∧M)d(π1,1Q)(x).

(3.17)

Then, for anyp, q >1 such thatp−1+q−1=1, we have

EeNβ(RNω)=_E_eβ N

i=1c(yi)1{c(yi )≤M}_eβN_i₌₁c(yi)1{c(yi )>M}

≤EepβNi=1c(yi)1{c(yi )≤M}1/p_E_eqβN_i₌₁c(yi)1{c(yi )>M}1/q

(3.18)

≤EeNpβM(RNω)1/p_E_eqβ N

i=1c(yi)1{_{c(yi )>M}}1/q_,

wherey1, . . . , yN are theN words determiningRωN andc(yi)is the first letter of

theith word. Hence 1

N logE

eNβ(RNω)≤ 1 p

1

N logE

eNpβM(RNω)

(3.19)

+1

q

1

N logE

eqβNi=1c(yi)1{c(yi )>M}_.

SinceQ→M(Q)is upper semicontinuous, Varadhan’s lemma gives

lim sup

N→∞

1

N logE

eNpβM(RωN)≤ _sup

Q∈Pinv₍_EN₎

[pβM(Q)−Ique(Q)].

(3.20)

Clearly, Q’s with _E(x ∧0)d(π1,1Q)(x)= −∞ do not contribute to the

(17)

such Q we have Ique(Q)= ∞, by Lemma 3.5below, and M(Q) <∞. Since

M ≤, we therefore have

sup

Q∈Pinv₍_EN)

[pβM(Q)−Ique(Q)] ≤ sup

Q∈C(μ0)

[pβ(Q)−Ique(Q)]

(3.21)

=A(pβ).

Next, we use the following observation. For any sequence =(N)N∈N of

positive random variables on a space with probability measureP, we have

lim sup

N→∞

1

N logN≤lim supN→∞

1

N logE(N) P-a.s.,

(3.22)

by the first Borel–Cantelli lemma. Applying this to

N :=E

eqβ

N

i=1c(yi)1{c(yi )>M}

(3.23)

withE(N)=

E

eqβx1{x>M}_d_μ

0(x) N

=:(cM)N,

we get, after lettingN→ ∞in (3.19),

Sque(β;1)≤ 1

pA(pβ)+

1

q logcM.

(3.24)

By (1.3), we have cM < ∞ for all M > 0 and limM→∞cM = 1. Hence

Sque(β;1)≤A(pβ)/p.

Step2:Sque(β;1−)≥A(β)for allβ∈(0,∞)whenμ0has bounded support.

PROOF. In the estimates below, we abbreviate

Lω_N :=N mRω_N,

(3.25)

the sum of the lengths of the firstN words. The proof is based on a discretization argument similar to the one used in [6], Section 8. Forδ >0 andx∈E, letxδ:=

sup{kδ:k∈Z, kδ≤x}. The operation·extends to measures onE,EandENin the obvious way. Now,R_Nωδ satisfies the quenched LDP with rate functionI_δque, the quenched rate function corresponding to the measureμ0δ. Clearly,

EeLωNlogz+Nβ(RωN)≥_E_eLωNlogz+Nβ(RNωδ)_,

(3.26)

and so, by the results in Section3.2, we have

Sque(β;1−)≥ sup

Q∈C(μ0δ)

[β(Q)−I_δque(Q)].

(3.27)

For everyQ∈C(μ0), we have

(Q)=lim

δ↓0(Qδ), I

que_(Q)₌ _lim n→∞I

que

δn (Qδn),

(18)

whereδn=2−n. The first relation holds because(Qδ)≤(Q)≤(Qδ)+δ,

the second relation uses Lemma3.6(i) below. Hence the claim follows by picking

δ=δnin (3.27) and lettingn→ ∞.

Step 3: Sque(β;1−)≥A(β) for all β ∈(0,∞) when μ0 satisfies (1.3) with

support bounded from below.

PROOF. ForM >0 and x ∈E, let xM =x ∧M. This truncation operation acts onμ0by moving the mass in(M,∞)toM, resulting in a measureμM₀ with

bounded support and with associated quenched rate functionIque,M. LetRω,M_N be the empirical process ofN-tuples of words obtained fromRω_N defined in (2.4) after replacing each letterx∈EbyxM. We have

EeLωNlogz+Nβ(RωN)≥_E_eLωNlogz+Nβ(R ω,M N )_.

(3.29)

Combined with the result in Step 2, this bound implies that

S(β;1−)≥ sup

Q∈C(μM₀ )

[β(Q)−Ique,M(Q)].

(3.30)

For everyQ∈C(μ0), we have

(Q)= lim

M→∞(Q

M₎₌ _lim

M→∞

E(x∧M)d(π1,1Q)(x),

(3.31)

Ique(Q)= lim

M→∞I que,M

(QM).

The first relation holds by dominated convergence, and the second relation uses Lemma3.6(ii) below. It follows from (3.31) that

lim sup

M→∞

sup

Q∈C(μM₀ )

[β(Q)−Ique,M(Q)] ≥β(Q)−Ique(Q)

(3.32)

∀Q∈C(μ0),

which combined with (3.30) yields

S(β;1−)≥β(Q)−Ique(Q) ∀Q∈C(μ0).

(3.33)

Take the supremum overQ∈C(μ0)to get the claim.

Step4:Sque(β;1−)≥A(β)for allβ∈(0,∞)whenμ0satisfies (1.3).

PROOF. ForM >0 andx∈E, letx−M =x∨(−M). This truncation opera-tion acts onμ0 by moving the mass in(−∞,−M)to−M, resulting in a measure

(19)

As in Step 1, for anyp, q >1 such thatp−1+q−1=1, we have

EeLωNlogz+Nβ(R ω,−M N )

≤EeLωNlogz+Nβ(RωN)_e−β N

i=1c(yi)1{c(yi )<−M}

(3.34)

≤EepLωNlogz+Npβ(RωN)1/p_E_e−qβ N

i=1c(yi)1{c(yi )<−M}1/q_,

and hence

1

N logE

eLωNlogz+Nβ(R ω,−M N )

≤ 1

p

1

N logE

epLωNlogz+Npβ(RωN)

(3.35)

+1

q

1

N logE

e−qβNi=1c(yi)1{_{c(yi )<}−M}_.

LetN → ∞followed byz↑1. For the left-hand side, we have the lower bound in Step 3, while the second term in the right-hand side can be handled as in (3.22–

3.24). Therefore, recalling (3.10) and writingplogz=logzp, we get

sup

Q∈C(μ−₀M)

[β(Q)−Ique,−M(Q)] ≤ 1 pS

que_(pβ_;₁₋₎₊ 1

q logC−M

(3.36)

withC₋M:=

Ee

−qβx1_{x<−M}_d_μ

0(x).

LettingM→ ∞and using that limM→∞C−M =1 by (1.3), we arrive at

1

pS

que_(pβ,₁₋₎_≥_{lim sup}

M→∞

sup

Q∈C(μ−₀M)

[β(Q)−Ique,−M(Q)] ≥A(β),

(3.37)

where the last inequality is obtained via arguments similar to those follow-ing (3.30), which require the use of Lemma3.6(iii) below. Finally, letp↓1, and use the continuity ofβ→S(β;1−), proven in Lemma3.4below.

This completes the proof of Lemma3.3and hence of Theorem1.4.

3.4. Technical lemmas. In the proof of Lemma 3.3 we used three technical lemmas, which we prove in this section.

LEMMA 3.4. β → A(β) and β →Sque(β;1−) are finite and convex on

[0,∞)and,consequently,are continuous on(0,∞).

PROOF. For the first function, note that A(β) ≤ sup_Q_∈_C_(μ₀)[β(Q) − Iann(Q)] ≤logM(β) <∞ by (1.3) and (3.1), and convexity follows from the fact that Ais a supremum of linear functions. For the second function, note that

(20)

LEMMA 3.5. Ifμ, ν∈P (R) satisfyh(μ|ν) <∞ and_Eeλxdν(x) <∞for someλ >0,then_E(x∨0)dμ(x) <∞.

PROOF. The claim follows from the inequality

E

fdμ≤h(μ|ν)+log

E

efdν,

(3.38)

which is valid for all bounded and measurable f (see Dembo and Zeitouni [7], Lemma 6.2.13) and, by monotone convergence, extends to measurablef ≥0. Pick

f (x)=λ(x∨0),x∈E.

LEMMA3.6. For everyQ∈Pinv(EN):

(i) limn→∞Iδquen (Qδn)=I

que_(Q)_with_δ

n:=2−n;

(ii) limM→∞Ique,M(QM)=Ique(Q);

(iii) limM→∞Ique,−M(Q−M)=Ique(Q).

PROOF. (i) The proof proceeds by choosing an appropriate function I:

[0,1] →Rand proving that:

(a) I (0)=lim

δ↓0I (δ);

(b) I (0)≥I (δ1)≥I (δ2)

(3.39)

wheneverδ2=kδ1∈(0,1)for somek∈N.

Recalling (2.10) and (2.11), we see that we need the following choices forI:

(1) I (δ)=

N−1h(πNQδ|πNq0⊗Nδ), δ >0,

N−1h(πNQ|πNq₀⊗N), δ=0,

(2) I (δ)=

H (Qδ|q₀⊗Nδ), δ >0,

H (Q|q₀⊗N), δ=0,

(3.40)

(3) I (δ)=

N−1h(πNQδ|πNμ⊗₀N0δ), δ >0,

N−1h(πNQ|πNμ⊗0N0), δ=0,

(4) I (δ)=

H (Qδ|μ⊗N0

0 δ), δ >0,

H (Q|μ⊗N0

0 ), δ=0,

withN∈N. It is clear from the definition of specific relative entropy [recall2.5)] that if (a) and (b) hold for the choices (1) and (3), then they also hold for the choices (2) and (4), respectively. We will not actually prove (a) and (b) for the choices (1) and (3), but for the simpler choice

I (δ)=

h(μδ|μ0δ), δ >0, h(μ|μ0), δ=0.

(21)

The proof will make it evident how to properly deal with (1) and (3).

Let B(R) be the set of real-valued, bounded and Borel measurable functions on R and, for φ∈B(R) and δ >0, let φδ be the function defined by φδ(x):=

φ(xδ). As shown in Dembo and Zeitouni [7], Lemma 6.2.13, we have

h(μδ|μ0δ)= sup

φ∈B(R)

Rφdμδ−log

Re

φ_d_μ_0δ

(3.42)

= sup

φ∈B(R)

Rφδdμ−log

Re

φδ_d_μ

0

.

From this representation, property (b) follows for the choice in (3.41). Next, fix any ε >0 and take aφ such that_Rφdμ−log_Reφdμ0≥h(μ|μ0)−ε. Then,

since φδ converges pointwise to φ as δ↓0, the bounded convergence theorem

together with (3.42) give

lim inf

δ↓0 h(μδ|μ0δ)≥h(μ|μ0)−ε.

(3.43)

Hence lim infδ↓0I (δ)≥I (0)−ε. Since I (0)≥I (δ), property (a) follows after

lettingε↓0.

Having thus convinced ourselves that (3.39) and (3.40) are true, we now know that for anyQ∈Pinv(EN)the sequences

H (Qδ_n|q₀⊗Nδ_n), H (Qδ_n|μ⊗N0

0 δn), n∈N,

(3.44)

are increasing and converge toH (Q|q₀⊗N), respectively,H (Q|μ⊗N0

0 ). This

im-plies the claim forQwithmQ<∞[recall (2.11)]. ForQwithmQ= ∞we use

thatIque(Q)=sup_tr_∈_NI ([Q]tr)[recall (2.12)], to conclude thatI_δque_n (Qδ_n)is in-creasing and converges toIque(Q).

(ii)–(iii) The proof is similar as for (i).

4. Characterization of disorder relevance: Proof of Theorem1.5.

PROOF. We will need the following lemma, the proof of which is postponed.

LEMMA 4.1. The supremum sup_Q_∈_C[β(Q)−Ique(Q)] is attained for all

β∈(0,∞).

Let Q∗ be a measure achieving the supremum in Lemma 4.1. Suppose that

hquec (β)=hannc (β). Then

hque_c (β)=β(Q∗)−Ique(Q∗)≤β(Q∗)−Iann(Q∗)

(4.1)

≤β(Qβ)−Iann(Qβ)=hannc (β)=hquec (β),

where the second equality uses that Qβ achieves the supremum in (1.16) [with

(22)

equalities. However, sinceQβuniquely achieves the supremum in (1.16), we must

haveQ∗=Qβand thereforeIque(Qβ)=Iann(Qβ).

Conversely, suppose thatIque(Qβ)=Iann(Qβ). Then

hque_c (β)≥ [β(Qβ)−Ique(Qβ)] = [β(Qβ)−Iann(Qβ)] =hannc (β).

(4.2)

Sincehquec (β)≤hannc (β), this proves thath que

c (β)=hannc (β).

We now give the proof of Lemma4.1.

PROOF. The proof is accomplished in three steps. The claims in Steps 1 and 2

are obvious when the support of μ0 is bounded from above, because then is

bounded from above and upper semicontinuous. Thus, for these steps we may assume that the support ofμ0is unbounded from above.

Step1: The supremum can be restricted to the setC∩{Q∈Pinv(EN):Ique(Q)≤ γ}for someγ <∞.

PROOF. We first prove that

lim

a→∞ sup_Q_∈_C

(Q)=a

[β(Q)−Ique(Q)] = −∞.

(4.3)

To that end we estimate, fora∈(0,∞),

sup

Q∈C (Q)=a

[β(Q)−Ique(Q)] ≤ sup

Q∈C (Q)=a

[βa−h(π1,1Q|μ0)]

(4.4)

= sup

μ∈P(E)

E|x|dμ(x)<∞,

Exdμ(x)=a

[βa−h(μ|μ0)],

where we use that Ique(Q) ≥ Iann(Q) =H (Q|Q0) ≥ h(π1,1Q|μ0). The last

supremum is achieved by a measureμλof the form dμλ(x)=M(λ)−1eλxdμ0(x),

x∈E, withλsuch that_Exdμλ(x)=a[recall (1.17)]. To see why, first note that

such aλ=λ(a)exists because (λ→_Exdμλ(x))is continuous with value 0 at

λ=0 and limλ→∞

Exdμλ(x)=sup[supp(μ0)] =w, wherew= ∞by

assump-tion. Next note that, for any other measureμwith_Exdμ(x)=a, we have

h(μ|μλ)=h(μ|μ0)−λa+logM(λ)=h(μ|μ0)−h(μλ|μ0),

(4.5)

which shows thath(μ|μ0)≥h(μλ|μ0)with equality if and only ifμ=μλ.

Con-sequently,

sup

μ∈_P(E)

E|x|dμ(x)<∞,

Exdμ(x)=a

[βa−h(μ|μ0)] = β

Exdμλ(x)−h(μλ|μ0)

(4.6)

(23)

Clearly,a→ ∞impliesλ=λ(a)→ ∞, and so to prove (4.3) we must show that limλ→∞g(λ)= −∞.

To achieve the latter, note that a lower bound onh(μλ|μ0)is obtained by

apply-ing (3.38) tof (x):= ¯β(x∨0)for someβ > β¯ . This yields

g(λ)≤ −(β¯−β)

E

xdμλ(x)+log[M(β)¯ +1].

(4.7)

The integral in the right-hand side tends to infinity asλ→ ∞, and so (4.3) indeed follows.

Finally, recall the definition of A(β) in (3.16), which is finite because of Lemma3.4. Then, by (4.3), there is ana0<∞such that

sup

Q∈_C (Q)=a

[β(Q)−Ique(Q)] ≤A(β)−1 ∀a≥a0,

(4.8)

and so all Q∈C with β(Q)−Ique(Q) > A(β)−1 must satisfy (Q) < a0

and Ique(Q) < β(Q)+1−A(β)≤βa0+1−A(β)=:γ. Consequently, the

supremum can be restricted to the setC∩ {Q∈Pinv(EN):Ique(Q)≤γ}.

Step2:is upper semicontinuous on{Q∈Pinv(EN):Ique(Q)≤γ}for every

γ >0.

PROOF. From the definition of and the inequality h(π1,1Q|μ0) ≤

Ique(Q)≤γ, it follows that it is enough to show that the map μ→(μ):=

E(5.16), (x∨0)dμ(x)is upper semicontinuous onKγ := {μ∈P(E):h(μ|μ0)≤

γ}. To do so, let (μM)M∈N be a sequence in Kγ converging to μ weakly as

M→ ∞. Then

(μM)=

E[

(x∨0)∧n]dμM(x)+

E

x1_{x>n}dμM(x),

(4.9)

and so

lim sup

M→∞

(μM)≤

E[(x∨0)∧n]dμ(x)

(4.10)

+ sup

M∈N

E

x1_{x>n}dμM(x) ∀n∈N.

By the inequality in (3.38), we have

λ

Ex1{x>n}dμ

M_(x)_≤_h(μM_|_μ

0)+log

Ee

λx1_{x>n}_d_μ

0(x)

(4.11)

∀M, n∈N, λ >0,

and so

sup

M∈N

Ex

1_{x>n}dμM(x)≤

γ λ +

1

λlog

E

eλx1{x>n}_d_μ₀_(x).

(24)

By (1.3), the limit as n→ ∞ of the right-hand side is γ /λ. Since λ >0 is arbitrary, we conclude that the limit as n→ ∞ of the left-hand side is zero. Letting n→ ∞ in (4.10) and using monotone convergence, we therefore get lim sup_M_→∞(μM)≤(μ), as required.

Step3: Let(Q):=β(Q)−Ique(Q). Then, by Step 1, we have that for some

γ >0,

sup

Q∈_C

(Q)= sup

Q∈_C Ique_(Q)_≤_γ

(Q)≤ sup

Q∈_Pinv₍_EN₎

Ique_(Q)_≤_γ

(Q).

(4.13)

By Theorem2.2,Ique is lower semicontinuous. Hence, by Step 2,β−Ique is upper semicontinuous on the compact set{Q∈Pinv(EN):Ique(Q)≤γ}, achiev-ing its supremum at some Q∗. Let μ∗:=π1,1Q∗. Then, by (1.3), the inequality

in (3.38) gives

E

(x∨0)dμ∗(x)≤γ +log

E

exdμ0(x) <∞,

(4.14)

and, since(Q∗) >−∞, we also have_E(x∧0)dμ∗(x) >−∞, so thatQ∗∈C. Hence

sup

Q∈C

(Q)= sup

Q∈_Pinv₍_EN₎

Ique_(Q)_≤_γ

(Q)=(Q∗),

(4.15)

which completes the proof.

5. Reformulation of the criterion for disorder relevance. Note that, by (2.10) and (2.12), for α >0, the necessary and sufficient condition for rele-vance,Ique(Qβ) > Iann(Qβ), in Theorem1.5translates into

lim

tr→∞m[Qβ]trH

_[Qβ]tr|μ ⊗N0

0

>0.

(5.1)

In Lemma5.3below, we give two alternative expressions for the specific relative entropy appearing in (5.1). These expressions will be needed in Sections6and7.

I. Asymptotic mean stationarity. In what follows we will make use of the notion ofasymptotic mean stationarity(see Gray [16], Section 1.7). LetAbe a topologi-cal space and equipAN0_{with the product topology. A measure}P _on_AN0 _{is called}

asymptotically mean stationaryif for every Borel measurableG⊂AN0_,

P(G):= lim

n→∞

1

n

n−1

k=0

P(θ−kG) exists.

(25)

As in Section2,θ denotes the left-shift acting onAN0_{. If}P _{is asymptotically mean}

stationary, thenP is a stationary measure, called thestationary meanofP. ForQ∈Pinv(EN), recall from Section2.1thatκ(Q)∈P(EN0₎ _{is the}

proba-bility measure induced by the concatenation mapκ:EN→EN0 _{that glues a}

se-quence of words into a sese-quence of letters, that is, κ(Q)=Q◦κ−1. Our aim is to replace Q in (5.1) by κ(Q), which is not stationary but more

conve-nient to work with. These two probability measures are related in the following way.

LEMMA 5.1. IfmQ<∞,thenκ(Q)is asymptotically mean stationary with

stationary meanκ(Q)=Q.

PROOF. LetX:=κ(Y )∈EN0_{, where}_Y _{is distributed according to}_Q_{. Let}_I

denote the set of indicesi∈N0 where a new word starts (0∈I). For i∈N0, let

ri:=inf{j ∈N:i−j∈I}, that is, the distance fromito the beginning of the word

it belongs to. Forj∈I, letLj denote the length of the word that starts atj. Then, for anyG⊂EN0_{Borel measurable, we have}

n−1

i=0

κ(Q)(θiX∈G)=

n−1

i=0 i

k=0

Q(θiX∈G, ri=k)

(5.3)

= n−1

k=0 n−1

i=k

Q(θiX∈G, ri=k).

Next, note that

Q(θiX∈G, ri=k)

=Q(θiX∈G, i−k∈I, Li−k> k)

(5.4)

=Q(θiX∈G, Li−k> k|i−k∈I )Q(i−k∈I )

=Q(θkX∈G, L0> k)Q(i−k∈I ).

Hence, dividing the sum in (5.3) byn, we get

1

n

n−1

i=0

κ(Q)(θiX∈G)=

n−1

k=0

Q(θkX∈G, L0> k)fk,n,

(5.5)

where we abbreviate fk,n := n−1

n−k−1

j=0 Q(j ∈ I ). By the renewal theorem,

limn→∞fk,n=1/mQforkfixed. Since

∞

k=0

Q(L0> k)=mQ<∞,

(26)

we can apply the bounded convergence theorem, and conclude that

κ(Q)(G)= 1 mQ

∞

k=0

Q(θkX∈G, L0> k)

= 1

mQ ∞

k=0 ∞

j=k+1

Q(θkX∈G, L0=j )

(5.7)

= 1

mQ ∞

j=1 j−1

k=0

Q(θkX∈G, L0=j )=Q(G).

The last equality is simply the definition ofQin (2.7).

To complement Lemma5.1, we need the following fact stated in Birkner [5], Remark 5, where ergodicity refers to the left-shifts acting onENandEN.

LEMMA5.2. IfQ∈Pinv(EN)is ergodic andmQ<∞,thenQ∈Pinv(EN)

is ergodic.

An asymptotic mean stationary measure can be interchanged with its station-ary mean in several situations (see Gray [15], Chapter 6), for example, in relative entropy computations, as in Lemma5.3below. Before stating this lemma, we use an extension of the notion of specific relative entropy to measures that are not necessarily stationary. More precisely, for two measures P and Q on a product spaceAN, we define the specific relative entropy ofP w.r.t.Qas

H (P|Q):=lim sup

n→∞

1

nh(πnP|πnQ),

(5.8)

where πn is the projection onto the first n coordinates. For Q∈Pinv(EN), we

introduce the following Radon–Nikodym derivative:

fn(x):=

dπnκ(Q)

dμ⊗₀n (x), x∈E N0_.

(5.9)

With this notation, the main result of this section is the following.

LEMMA5.3. ForQ∈Pinv(EN)ergodic withmQ<∞,

H (Q|μ⊗N0

0 )=H (κ(Q)|μ ⊗N0

0 ),

(5.10)

= lim

n→∞

1

nlogfn(x) forκ(Q)-a.s.allx∈E N0_.

(5.11)