RECENT PROGRESS ON KAC’S PROBABILISTIC APPROACH
TO KINETIC THEORY
International Conference on Particle Systems and PDE’s,
Univ. Minho, December 7, 2012
Maria C Carvalho
This talk presents results of collaboration with Eric Carlen, and Michael Loss.
The Kac Collision Process
For N ∈ N, p ∈ R3 and E > |p|2, let SN,E,p be the set
consisting of N–tuples ~v = (v1, . . . , vN) of vectors vj in R3 with
1 N
N
X
j=1
|vj|2 = E and 1 N
N
X
j=1
vj = p .
In what follows, a point ~v ∈ SN,E,p specifies the velocities of a collection of N particles with unit mass.
We consider a Markov jump process on SN,E,p that was introduced by Mark Kac to describe a random binary collision process for the N particles.
When the collision process begins, associated to each pair (i, j), i < j, there is an exponential random variable Ti,j with parameter
λi,j = N N 2
!−1
|vi − vj|α , (0.1)
where 0 ≤ α ≤ 2, and α = 1 is the case of main interest, corresponding to “hard-sphere collisions”.
Ti,j is the waiting time for particles i and j to collide, and the set of these random times is taken to be independent. The first collision occurs at time
T = min{T } . (0.2)
At the time T, the pair (i, j) furnishing the minimum collide:
The state of the process “jumps” from (v1, . . . , vN) to (v1, v2, . . . , vi∗, . . . , vj∗, . . . , vN) ,
where only vi and vj have changed. Since the process models momentum and energy conserving collisions, we require that
vi∗ + vj∗ = vi + vj and |vi∗|2 + |vj∗|2 = |vi|2 + |vj|2 . Then by the parallelogram law, it follows that
Given vi and vj, the kinematically possible collisions of particles i and j may be parameterized in terms of a unit vector σ ∈ S2, the unit sphere in R3 as follows:
vi∗(σ) = vi + vj
2 + |vi − vj|
2 σ
vj∗(σ) = vi + vj
2 − |vi − vj|
2 σ
σ is selected according to the following law: There is a non-negative function b on [−1, 1] such that for any fixed σ′ ∈ S2,
Z
b(σ · σ′)dσ = 1
The object of our investigation is the spectral gap for the generator of the Markov semigroup associated to this process. For any continuos function f on SN,E,p, define
LN,αf (~v) = 1
h lim
h→0 E{f(~v(h) − f(~v)) | ~v(0) = ~v } . One readily computes that
LN,αf (~v) = −N N 2
!−1
X
i<j
|vi − vj|α×
Here,
(Ri,j,σ~v)k =
vi∗(σ) k = i vj∗(σ) k = j vk k 6= i, j
.
Introducing the notation
[f ](i,j)(~v) :=
Z
S2
b
σ · vi − vj
|vi − vj|
f (Ri,j,σ~v)dσ , we can write the generator more briefly as
L f (~v) = −N N!−1
X |v − v |α h
f (~v) − [f](i,j)(~v)i
Note that
cos θ := σ · vi − vj
|vi − vj| = vi∗ − vj∗
|vi∗ − vj∗| · vi − vj
|vi − vj| .
This shows that rates for the jump from ~v to Ri,j,σ~v and from Ri,j,σ~v to ~v are equal. This is the property of “detailed
balance” or “microscopic reversibility”. The analytic
expression of this is self-adjointness of the generator LN,α.
Define the Dirichlet form EN,α by
EN,α(f, f ) = −hf, LN,αf iL2(σN) . A simple computation shows that
EN,α(f, f ) = N 2
N 2
!−1
X
i<j
Z
SN,E,p
Z
S2 |vi − vj|α b
σ · vi − vj
|vi − vj|
[f (~v) − f(Ri,j,σ~v)]2 dσdσN . One sees from this expression that LN,α is a negative
Provided b is continuous at 1, LN,αf = 0 if and only if f is constant. We are interested in the spectral gap of the
operator LN,α on L2(SN,E,p, σN):
∆N,E,p = inf
EN,α(f, f ) : hf, 1iL2 = 0 and kfk2L2 = 1 . We now investigate the dependence of ∆N,E,p on N, E and p.
Scaling and dependence on E and p
For fixed N, the dependence of ∆N,E,p on E and p is quite simple: Consider the “shift and scaling transformation”
φE,p(v1, . . . , vN) := 1
pE − |p|2 (v1 − p, . . . , vN − p)
which identifies SN,E,p with SN,1,0. This point transformation induces the unitary operator UE,p from L2(SN,1,0, σN) to
L2(SN,E,p, σN) given by UE,pf = f ◦ φE,p A simple computation then shows that
EN,α(UE,pf, UE,pf ) = (E − |p|2)α/2EN,α(f, f ) .
The dependence of ∆N,E,p on N is not so simple.
Nonetheless, we have seen that the problem of estimating the quantity ∆N,E,p is essentially the same as the problem of estimating ∆N,1,0. We therefore simplify our notation:
DEFINITION 0.1 (Spectral gap). The spectral gap for the N particle Kac model is the quantity
∆N := ∆N,1,0 . (0.4)
The Kac Conjecture
Kac conjectured that
lim inf
N →∞ ∆N > 0 .
This has been proved in the case α = 0; see by Carlen, C., and Loss, and later Carlen, Geronimo and Loss proved that limN →∞ ∆N exists, and computed the exact value of this
limit for many choices of b. The first result in this direction was by Janvresse, who treated a one-dimensional
simplified version of the model that was also discussed by Kac. Her method gave no explicit lower bound.
All results up to now concerned the α = 0 case of uniform jump rates.
Recently, Carlen, C., and Loss have proved:
THEOREM 0.2 (Spectral gap for the Kac Model with 0 ≤ α ≤ 2). For
each function b on [−1, 1] that is continuous and strictly positive at 1,
and for each α ∈ [0, 2], there is a strictly positive constant C depending
only on b and α, and explicitly computable, so that
∆N ≥ C > 0
for all N.
The significance of the spectral gap
The Kac process was not introduced as a model of the actual, physical collision process in a gas of molecules.
Rather, it was introduced as the simplest process
conceivable from which one could deduce, in the limit of a large number of particles, the non-linear spatially
homogeneous Boltzmann equation, which is the basic evolution equation of kinetic theory.
Let
v′ = v + v∗
2 + |v − v∗|σ
2 , v∗′ = v + v∗
2 − |v − v∗|σ
2 .
The spatially homogeneous Boltzmann equation is a
non-linear equation for the evolution in t > 0 of a probability density n(v, t) for v ∈ R3:
∂
∂tn(v, t) = Q(n)(v, t) , where
Q(n)(v, t) = Z
R3×S2 B(v − v∗, σ)(n′n′∗ − nn∗)dσdv∗ , n = n(v, t), n′ = n(v′, t), n∗ = n(v∗, t) and n′∗ = n(v∗′ , t).
The function B(z, σ) in describes the rate at which the various kinematically possible collisions take place. This rate depends on the interaction between the molecules.
Maxwell determined that if the force law that governs the interaction between pairs of molecules in the dilute gas is an inverse power of the distance separating them, B takes the form
B(z, σ) = b(σ · z/|z|)|z|α
with the exponent α depending on the power in the interaction.
This is what motivates the particular form we have assumed for law used to select σ in the Kac process. The following ranges of α are usually distinguished:
α < 0 : soft collisions
α = 0 : Maxwellian molecules 0 < α ≤ 1 : hard collisions
α = 2 : super hard collisions
For hard sphere collisions, the most physically significant case of hard collisions,
We are interested in the rate of relaxation to equilibrium, both for the Kac process, and for the Boltzmann equation.
The equilibrium; i.e, steady state solutions of the Boltzmann equation are the Maxwellian distributions:
M (v) =
1 2πΘ
3/2
e−|v−u|2/2Θ
where Θ is a positive number, and u ∈ R3. To see that these are steady states, note that
M (v)M (v∗) = M (v′)M (v∗′ )
for all v, v and σ, and hence the integrand in Q vanishes
To quantify the rate of approach to equilibrium for the Boltzmann equation is a mathematically and physically significant problem.
In this regard, one quantity of interest is the spectral gap of the linearized Boltzmann equation .
To linearize the Boltzmann collision operator Q, fix Θ = 1 and u = 0, and write M to denote the corresponding
Maxwellian. Let H denote the Hilbert space
H = L2(R3, M (v)dv). Define the linearized Boltzmann operator L by
Q((1 + ǫh)M, (1 + ǫh)M )(v) = ǫM (v)Lh + O(ǫ2) .
This yields
Lh(v) = Z
R3×S2 B(|v−v′|, cos θ)[h(v′)+h(v∗′ )−h(v)−h(v∗)]M (v∗)dv∗dσ , and hence
hg, LhiH = −1 4
Z
R3×R3×S2 B(|v−v′|, cos θ)[g(v′)+g(v∗′ )−g(v)−g(v∗)]×
[h(v′) + h(v∗′ ) − h(v) − h(v∗)]M (v)M (v∗)dvdv∗dσ .
The spectral gap of the linearized Boltzmann operator is the quantity
Λ = inf
−hh, LhiH
khk2H : h ∈ (Ker(L))⊥ .
There is exactly one case in which it is relatively
straightforward to compute Λ: The case of Maxwellian molecules; i.e., α = 0.
In this case, the subspaces Hn of H that consist of
polynomials in v1, v2 and v3 of degree n or less are invariant subspaces of L. Since L is self-adjoint, this means that the
For the hard-spheres case, or indeed any other case, there is no known method for computing eigenvalues, and for a long time, there were no quantitative estimates whatsoever for the spectral gap Λ in these cases. The first quantitative estimate on Λ for hard collisions is a recent result by
Baranger and Mouhot. It works by making a comparison with the Maxwellian case. Here we shall prove:
THEOREM 0.3 (Spectral gap for Boltzmann via the Kac process). Let b
be continuous on [−1, 1] and strictly positive at 1, and let α ∈ [0, 2].
Let Λ be the spectral gap for the corresponding linearized Boltzmann equation, and let ∆N be the corresponding spectral gap for the Kac process. Then
lim sup
N →∞
∆N ≤ Λ .
Combining our two theorems yields a quantitative lower bound on Λ.
A word on the proof
The second theorem is much easier than the first. Use a trial function of the form
f (~v) =
N
X
j=1
ϕj(vj)
where ϕ is “built” out of the gap eigenfunction for the linearized collision operator. For such an f, one finds
EN,α(f, f ) = Λkfk22 ,
up to small errors (for large N), so the claim follows from the variational definition of ∆N.
Induction on the number of particles
We now explain how to estimate ∆N,α in terms of ∆N −1,α. We use a parameterization of SN in terms of SN −1 × B
where B is the unit ball.
First, for each k = 1, . . . , N, define πk : SN → B by πk(~v) = 1
√N − 1vk . (0.5)
(Note that because of the constraints PN
j=1 vj = 0 and PN
j=1 |vj|2 = N, the largest value of |vk| on SN is √N − 1.)
Define T1 : SN −1 × B → SN as follows:
T1(~y, v) =
√
N − 1v , β(v)y1 − 1
√N − 1v, . . . , β(v)yN −1 − 1
√N − 1v
, where
β2(v) = N
N − 1(1 − |v|2) .
The subscripted 1 in T1 indicates that the vector v from B went into the first place. We likewise define T2, . . . , TN by placing this coordinate in the corresponding position.
Z
SN
φ(~v)dσN = Z
B
Z
SN −1
φ(Tk(~y, v))dσN −1
dνN(v) .
where
dνN(v) = |S3N −7|
|S3N −4|(1 − |v|2)(3N −8)/2dv . Also, for i 6= k, j 6= k,
Ri,j,σ(Tk(~y, v)) = Tk(Ri,j,σ(~y), v) . We now have the means to relate EN,α to EN −1,α.
Define the projection (conditional expectation) operator
Pkφ(~v) :=
Z
SN −1
φ(Tk(~y, vk/√
N − 1))dσN −1 ,
Note that
EN,α(f, f |vk) = EN,α(f − Pkf, f − Pkf |vk) , and then one has
EN,α(f, f ) ≥ N
N − 1∆N −1× 1 N Z
N
|v |2 α/2 !
Define
P(α) = 1 N
N
X
k=1
N N − 1
1 − |vk|2 N − 1
α/2
Pk
and
W (α) = 1 N
N
X
k=1
N N − 1
1 − |vk|2 N − 1
α/2
.
LEMMA 0.4 (W(α) is constant for α = 0 and α = 2). For all ~v, W (0)(~v) = 1 while W(2)(~v) = 1 − 1
(N − 1)2 .
THEOREM 0.5. For all f ∈ L2(SN −1(√
N )) with kfk22 = 1 and with f orthogonal to the constants,
EN,α(f, f ) ≥ N
N − 1∆N −1
Z
SN
W(α)f2dσN − hf, P(α)f iL2(SN,σN)
.
Our goal is to prove from this a bound of the type
∆N ≥
1 − C N2
∆N −1 .
The point of this is that for any N0 we then have, by iteration, lim inf
N →∞ ∆N ≥
Y∞
n=N0
1 − C n2
∆N0−1 ,
and since 1/n2 is summable, the factor on the right is strictly positive.
Since
N
N − 1 = 1 + 1
N + O
1 N2
, We need
Z
SN
W(α)f2dσN − hf, P(α)f iL2(SN,σN)
= 1 − 1
N + O
1 N2
, where the coefficient of 1/N must be exactly 1.
As we now explain, this is easier to do for α = 0 and α = 2 than for α ∈ (0, 2) since in these cases only we have a
“good” pointwise bound on W(α).
Lower bound on ∆
Nfor α = 0 and α = 2.
For α = 0, the inductive relation reduces to EN,α(f, f ) ≥ N
N − 1∆N −1
Z
SN
f2dσN − hf, P(0)f iL2(SN,σN)
Define
µN = sup n
hf, P(0)f iSN : hf, 1iSN = 0 and kfkSN = 1 o , so that from the variational characterization of ∆N,
∆N ≥ ∆N −1 N
N − 1(1 − µN) .
The operator P(0) is an average over projections onto functions of a single particle’s velocity. That is, any eigenfunction f of P(0) necessarily has the form
f (~v) =
N
X
j=1
ϕj(vj)
for functions ϕj on the ball of radius √N − 1 in R3.
Determining the ϕj is then a problem in R3, no matter how large N is. While there are eigenfunctions of LN,α that have this simple form, most do not.
Define operators K and K2 by
Kϕ(v) = E{ϕ(v2) | v1 = v} , and
K2ϕ(u, v) = E{ϕ(v3, v4) | v1 = u, v2 = v} . The operators measure correlations on SN
P(0)
N
X
j=1
ϕ(vj)
= 1 N
N
X
j=1
(ϕ(vj) + (N − 1)Kϕ(vj))
Thus, the problem of estimating ∆N is reduced to the
problem of estimating the spectrum of K, and eventually,
∆2.
LEMMA 0.6 (Spectral gap for P(0)). For all N ≥ 3, µN = 1
N + 5N − 3
3N (N − 1)2 . (0.6) Applying this, we get:
∆N ≥ ∆N −1 N N − 1
1 − 1
N − 5N − 3 3N (N − 1)2
= ∆N −1
1 − 5N − 3 3(N − 1)3
We obtain the bound
lim inf
N→∞ ∆N ≥
Y∞
n=3
1 − 5n − 3 3(n − 1)3
!
∆2
∞ 5n − 3
Numerical computation yields Y∞
n=3
1 − 5n − 3 3(n − 1)3
≥ 0.236 .
Next, it is relatively easy to compute ∆2:
∆2 = 10
3 for b(x) = 2|x| .
Moreover, ∆2 is independent of α. Since v1 + v2 = 0,
|v1 − v2|2 = |2v2|2 = 2E = 4 .
For α = 2, we use:
LEMMA 0.7. For all N, all 0 ≤ α ≤ 2, and all f ∈ L2(SN), hf, P(α)f i ≤
N N − 1
α/2
hf, P(0)f i .
In particular, if f is orthogonal to the constants, then
hf, P(α)f i ≤
N N − 1
α/2
µNkfk22 .
Putting it all together, as before, we get
∆N ≥
1 − O
1 N2
∆N −1
Evaluating the constants, we obtain that there is a constant C such that C/N2 < 1 for all N ≥ 4 so that
∆N ≥
1 − C N2
∆N −1
A direct estimate on ∆3 then starts the induction.
The difficulty with α ∈ (0, 2).
Let e be any unit vector and consider
~ve :=
√
N − 1e, 1
√N − 1
e, . . . , 1
√N − 1 e
∈ SN . Then one readily computes, for α > 0,
W (α)(~ve) =
N N − 1
α−1
1 − 1 N − 1
α/2
= 1 − (1 − α/2) 1
N + O
1 N2
.
As before, we can estimate
hf, P(α)f iL2(SN,σN) ≤
1 − 1
N + O
1 N2
kfk22 .
Altogether,
Z
SN
W(α)f2dσN − hf, P(α)f iL2(SN,σN)
≤
1 − 2 − α/2
N + O
1 N2
kfk22 . (0.7)
How to proceed for 0 < α < 2:
Let Π denote the projection onto the space of functions orthogonal to the constants The operator ΠP(α)Π is self adjoint.
For any f orthogonal to the constants,
hf, P(α)f i = hf, ΠP(α)Πf i . (0.8) Now, decompose f as f = g + h where h is in the null space of ΠP(α)Π, and g is in the range. Notice that h and g are
orthogonal, so that
kfk22 = kgk22 + khk22 .
By the definition of h,
hf, P(α)f i = hg, P(α)gi , and hence
Z
SN −1(√ N )
W(α)f2dσ − hf, P(α)f iL2(SN −1(√N )) = Z
SN −1(√ N )
W(α)f2dσ − hg, P(α)giL2(SN −1(√N )) . (The formulas here and in the rest are for a simplified one dimensional model. They fit better on the slides, but the
Z
W (α)f2dσ − hf, P(α)f i
≥
1 − 1 − α/2 N − 1
khk22
+ 1
N
N
X
k=1
"
Z
SN −1(√ N )
w(α)(vk)[g − Pkg]2dσ
#
− 1 − α/2 N
Z N X
k=1
1 − vk2 N − 1
2
2ghdσ .
There is enough orthogonality that the last term,
1 − α/2 N
Z N X
k=1
1 − vk2 N − 1
2
2ghdσ
is negligible.
Since (1 − α/2) < 1, the multiple of 1/N in
1 − 1 − α/2 N − 1
khk22
is no problem.
For the remaining term, use the fact that
1 N
N
X
k=1
"
Z
SN −1(√ N )
w(α)(vk)[g − Pkg]2dσ
#
together with
g(~v) =
N
X
j=1
ϕ(vj) and
g − Pkg = X
ϕ(vj) − (N − 1)Kϕ(vk) .
Showing cross terms do not matter, as before, we get
1 N − 1
N
X
k=1
Z
SN −1(√ N )
m(vk)[g − Pkg]2dσ =
N
X
k=1
Z
SN −1(√ N )
m(vk)[X
j6=k
ϕ(vj) − (N − 1)Kϕ(vk)]2dσ (0.9)
where
m(v) =
1 − (1 − α/2)
1 − v2 N − 1
1 − v2 N − 1
+ 1
> 0 .
w(α)(v) =
N − v2 N − 1
α/2
≥ m(v) =
1 + (α/2)
1 − v2 N − 1
− (1 − α/2)
1 − v2 N − 1
2
. This reduces the weight to polynomials, facilitating the
remaining estimates.
Altogether, we get, for a computable constant C,
∆N ≥
1 − C N2
∆N −1
for all N ≥ Nα, where Nα is computable, and small. An
elementary probabilistic argument bounds ∆N from below, though with a bad N dependence. However, we use this only for the fixed value N = Nα, and then the inductive bound.