Convolution and analytic transforms - Evolution equations in non-commutative probability

Definition 5.5.1. Let ind ∈ {bool, free, mono, mono†}. We define the ind-convolution of two B-valued laws µ and ν as the law of X + Y when X and Y are ind-independent, the law of X is µ, and the law of Y is ν. The convolution is denoted by µ ⊞_ind ν, or alternatively

µ ⊎ ν (boolean case), µ ⊞ ν (free case), µ ▷ ν (monotone case), µ ◁ ν (anti-monotone case).

In order to verify that this definition makes sense, observe first that, using the GNS construction (Theorem 2.6.5) and the product space construction, there always exist independent operators X and Y in a B-valued probability space (A, E) such that the law of X is µ and the law of Y is ν. Second, the law of X + Y is uniquely determined by µ, ν, and the independence of X and Y, by Lemma 5.2.8.

More generally, given laws µ₁, . . . , µ_N, we may construct operators X₁, . . . , X_N which are independent with X_j having the law µ_j, using the GNS construction and the product space construction. Then it follows from the associativity considerations of the previous section that X₁ + · · · + X_N has the law µ₁ ⊞_ind (µ₂ ⊞_ind (· · · ⊞_ind µ_N) · · · ). Moreover, we also have µ₁ ⊞_ind (µ₂ ⊞_ind µ₃) = (µ₁ ⊞_ind µ₂) ⊞_ind µ₃, or in other words ⊞_ind is associative, and thus we may remove the parentheses when expressing an iterated ind-convolution.

Observation 5.5.2.

(1) The operations ⊎, ⊞, ▷, and ◁ are associative.

(2) The operations ⊞ and ⊎ are commutative.

(3) We have µ ▷ ν = ν ◁ µ.

Proof. The first claim follows from the preceding discussion of associativity. The second claim is true because the conditions defining free and boolean independence do not depend on the order of the subalgebras. For the third claim, reversing the order of the indices in the definition of monotone independence yields anti-monotone independence.

Our main task in this section is to develop analytic tools for computing the independent convolution of two laws. In the classical case, this role is played by the characteristic function (Fourier transform) of a law, given by ℱµ(ξ) = ∫ e^{ixξ} dµ(x), since addition of independent random variables, or equivalently classical convolution of laws, corresponds to multiplication of the Fourier transforms. In the non-commutative setting, this role is played by various fully matricial functions related to the Cauchy-Stieltjes transform.
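The classical statement can be checked numerically on a toy example. The following is only an illustrative sketch: the encoding of a finitely supported law as a dictionary from atoms to weights, and the names convolve and char_fn, are assumptions made for this example.

```python
import cmath

def convolve(mu, nu):
    """Law of X + Y for classically independent X ~ mu and Y ~ nu.

    Laws are finite dicts mapping atom -> weight (an illustrative encoding).
    """
    out = {}
    for x, p in mu.items():
        for y, q in nu.items():
            out[x + y] = out.get(x + y, 0.0) + p * q
    return out

def char_fn(mu, xi):
    """Characteristic function: sum of weight * exp(i * atom * xi)."""
    return sum(p * cmath.exp(1j * x * xi) for x, p in mu.items())

mu = {-1: 0.5, 1: 0.5}
nu = {0: 0.25, 2: 0.75}
xi = 0.7

# Difference between the transform of the convolution and the product of transforms.
gap = abs(char_fn(convolve(mu, nu), xi) - char_fn(mu, xi) * char_fn(nu, xi))
print(gap)
```

The printed gap should vanish up to floating-point rounding, which is the multiplicativity that the transforms below replace by additivity or composition.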

5.5.1 The boolean case

The results of this section can be found in [SW97] and [Ber06, Theorem 2.2] for the scalar case and in [PV13, §2 and §5.3] for the operator-valued case. The proof we give here is modeled on the proof for the free case in the next subsection (where we also explain the history and references).

Definition 5.5.3. For a B-valued law µ, we define the K-transform as K_µ(z) := z − F_µ(z).

Remark 5.5.4. We caution that some authors work instead with B_µ(z) = K̃_µ(z) or slight variants of this definition. We showed in Theorem 4.5.3 that K_µ(z) = µ(X)⁽ⁿ⁾ + G_σ(z) for some generalized law σ.

Theorem 5.5.5. K_{µ⊎ν}(z) = K_µ(z) + K_ν(z) as fully matricial functions.

Proof. Let X and Y be Boolean independent random variables in (A, E) which realize the laws µ and ν respectively. For z in M_n(B) with ‖z‖ < 1/rad(µ), define

U_X⁽ⁿ⁾(z) = (1 − zX⁽ⁿ⁾)⁻¹ − 1 = ∑_{k=1}^∞ (zX⁽ⁿ⁾)^k.

This is an A-valued fully matricial function. To simplify the notation, we will suppress all the superscripts (n), so that X will stand for X⁽ⁿ⁾, where n is the size of the matrix z. Note that

1 + E[U_X(z)] = E[(1 − zX)⁻¹] = G̃_µ(z) z⁻¹,

or in other words,

(1 + E[U_X(z)])⁻¹ = z F̃_µ(z).

Note that U_X(z) is in the closed span of B⟨X⟩₀. Define U_Y(z) analogously. Then

1 − zX − zY = (1 + U_X(z))⁻¹ + (1 + U_Y(z))⁻¹ − 1.

Therefore,

(1 − zX − zY)⁻¹ = [(1 + U_X(z))⁻¹ + (1 + U_Y(z))⁻¹ − 1]⁻¹

= (1 + U_X(z)) [1 − U_Y(z) U_X(z)]⁻¹ (1 + U_Y(z))

= (1 + U_X(z)) ∑_{k=0}^∞ (U_Y(z) U_X(z))^k (1 + U_Y(z)).
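The purely algebraic step in this computation, namely [(1 + U)⁻¹ + (1 + V)⁻¹ − 1]⁻¹ = (1 + U)[1 − VU]⁻¹(1 + V), can be sanity-checked on non-commuting matrices. The sketch below uses arbitrary 2×2 complex matrices U and V as stand-ins for U_X(z) and U_Y(z); the helper names and the chosen entries are assumptions made for illustration only.

```python
def mul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def add(a, b, sign=1):
    """Entrywise a + sign * b."""
    return [[a[i][j] + sign * b[i][j] for j in range(2)] for i in range(2)]

def inv(a):
    """Inverse of a 2x2 matrix via the adjugate formula."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]

I = [[1, 0], [0, 1]]
U = [[0.2, 0.1j], [0.3, -0.1]]   # stand-in for U_X(z), chosen small
V = [[-0.1, 0.4], [0.2j, 0.3]]   # stand-in for U_Y(z), does not commute with U

# Left side: [(1 + U)^-1 + (1 + V)^-1 - 1]^-1
lhs = inv(add(add(inv(add(I, U)), inv(add(I, V))), I, sign=-1))
# Right side: (1 + U) [1 - V U]^-1 (1 + V)
rhs = mul(mul(add(I, U), inv(add(I, mul(V, U), sign=-1))), add(I, V))

err = max(abs(lhs[i][j] - rhs[i][j]) for i in range(2) for j in range(2))
print(err)
```

The order of the factors matters here: the middle factor involves the product VU, not UV, when 1 + U stands on the left.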

Next, we take the expectation. Because U_X(z) and U_Y(z) are in the closures of M_n(B⟨X⟩₀) and M_n(B⟨Y⟩₀) respectively and because X and Y are Boolean independent, we have

E[(1 − zX − zY)⁻¹] = (1 + E[U_X(z)]) ∑_{k=0}^∞ (E[U_Y(z)] E[U_X(z)])^k (1 + E[U_Y(z)])

= [(1 + E[U_X(z)])⁻¹ + (1 + E[U_Y(z)])⁻¹ − 1]⁻¹.

Therefore,

G̃_{µ⊎ν}(z) z⁻¹ = [(1 + E[U_X(z)])⁻¹ + (1 + E[U_Y(z)])⁻¹ − 1]⁻¹.

By taking inverses,

z F̃_{µ⊎ν}(z) = (1 + E[U_X(z)])⁻¹ + (1 + E[U_Y(z)])⁻¹ − 1

= z F̃_µ(z) + z F̃_ν(z) − 1.

Since K̃_µ(z) = z⁻¹ − F̃_µ(z), we have 1 − z F̃_µ(z) = z K̃_µ(z), and the same holds for ν and µ ⊎ ν. Subtracting both sides of the previous equation from 1 therefore gives

z K̃_{µ⊎ν}(z) = z K̃_µ(z) + z K̃_ν(z)

for z in a neighborhood of 0. By Corollary 3.9.7, we have K_{µ⊎ν} = K_µ + K_ν on the upper half plane.
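As an illustration in the scalar case B = ℂ, take µ to be the symmetric Bernoulli law (δ₋₁ + δ₁)/2. Then G_µ(z) = z/(z² − 1), so K_µ(z) = 1/z, and additivity of the K-transform predicts that µ ⊎ µ has K-transform 2/z, which is the K-transform of (δ_{−√2} + δ_{√2})/2. The sketch below checks this numerically; the list-of-pairs encoding of a law and the function names are assumptions made for this example.

```python
import math

def cauchy(law, z):
    """Cauchy transform G(z) = sum_j w_j / (z - x_j) of a finitely supported law."""
    return sum(w / (z - x) for x, w in law)

def k_transform(law, z):
    """K(z) = z - F(z), where F(z) = 1 / G(z)."""
    return z - 1 / cauchy(law, z)

# Laws are lists of (atom, weight) pairs.
bernoulli = [(-1.0, 0.5), (1.0, 0.5)]
candidate = [(-math.sqrt(2), 0.5), (math.sqrt(2), 0.5)]  # conjectured Boolean square

z = 0.3 + 1.7j  # any point of the upper half plane
gap = abs(k_transform(candidate, z) - 2 * k_transform(bernoulli, z))
print(gap)
```

A vanishing gap at several points of the upper half plane is consistent with the candidate law being µ ⊎ µ, since a scalar law is determined by its Cauchy transform.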

5.5.2 The free case

The following analytic transforms were defined by Voiculescu [Voi86]. In the operator-valued case, the definition was developed by Dykema [Dyk07, §6].

Definition 5.5.6. For a B-valued law µ, we define F_µ(z) = G_µ(z)⁻¹ and Φ_µ(z) := F_µ⁻¹(z) − z, where F_µ⁻¹ denotes the functional inverse of F_µ and z is in the image of F_µ.

Remark 5.5.7. Many authors work with the R-transform R_µ(z) = Φ̃_µ(z) = Φ_µ(z⁻¹). We showed in Lemma 4.5.8 that Φ_µ is defined for Im z ≥ 2δ whenever δ > ‖Var_µ(1)‖^{1/2} and in Lemma 4.5.9 that R_µ(z) is defined in a fully matricial ball around zero.

The following result on the additivity of the R-transform was discovered in the scalar-valued case by Voiculescu [Voi86]. The original proof by Voiculescu used canonical realizations of a law µ by (non-self-adjoint) random variables on a Fock space, and this was adapted to the operator-valued setting by Dykema [Dyk07, §6]. This theorem can also be proved through the combinatorial apparatus of free cumulants due to Speicher [Spe94, Spe98]. The analytic proof presented here is due (in the scalar-valued setting) to Lehner [Leh01, Theorem 3.1].

Theorem 5.5.8. For Im z ≥ 2δ > 2‖Var_µ(1) + Var_ν(1)‖^{1/2}, we have Φ_{µ⊞ν}(z) = Φ_µ(z) + Φ_ν(z).

Also, for z in a fully matricial neighborhood of 0, we have R_{µ⊞ν}(z) = R_µ(z) + R_ν(z).

Proof. Let X and Y be freely independent random variables in (A, E) which realize the laws µ and ν.

We begin by analyzing R_µ(z) in a neighborhood of the origin. Now z⁻¹ + R_µ(z) is the functional inverse of G_µ(z) in a neighborhood of 0, which means that

E[(z⁻¹ + R_µ(z) − X)⁻¹] = z.

Multiplying by z⁻¹ on the right, we can rewrite this as E[(1 + zR_µ(z) − zX)⁻¹] = 1, or in other words, the A-valued fully matricial function

U_X(z) = (1 + zR_µ(z) − zX)⁻¹ − 1

has expectation zero. (Here, as in the previous case, we suppress the superscripts (n), so that U_X(z) stands for U_X⁽ⁿ⁾(z) and X denotes X⁽ⁿ⁾, where n is the size of the matrix z.) The same holds for the analogously defined function U_Y(z). We want to show that z⁻¹ + R_µ(z) + R_ν(z) is the functional inverse of G_{µ⊞ν}, which means that

G_{µ⊞ν}(z⁻¹ + R_µ(z) + R_ν(z)) = z, which after multiplying by z⁻¹ on the right is equivalent to

E[(1 + zR_µ(z) + zR_ν(z) − zX − zY)⁻¹] = 1.

We will rewrite the left hand side in terms of U_X(z) and U_Y(z) so that we can apply freeness together with the fact that U_X(z) and U_Y(z) have expectation zero. Note that

(1 + zR_µ(z) + zR_ν(z) − zX − zY)⁻¹

= [(1 + U_X(z))⁻¹ + (1 + U_Y(z))⁻¹ − 1]⁻¹

= (1 + U_X(z)) [(1 + U_Y(z)) + (1 + U_X(z)) − (1 + U_Y(z))(1 + U_X(z))]⁻¹ (1 + U_Y(z))

= (1 + U_X(z)) [1 − U_Y(z) U_X(z)]⁻¹ (1 + U_Y(z)).

Now because U_X(0) = 0 = U_Y(0), we know that if ‖z‖ is sufficiently small, then we can expand [1 − U_Y(z) U_X(z)]⁻¹ as a geometric series, and thus for small z,

(1 + zR_µ(z) + zR_ν(z) − zX − zY)⁻¹ = (1 + U_X(z)) ∑_{k=0}^∞ (U_Y(z) U_X(z))^k (1 + U_Y(z)).

Next, we take the expectation. Because U_X(z) and U_Y(z) have expectation zero and because X and Y are free, all the terms on the right hand side have zero expectation except the term 1 which comes from multiplying together the 1 from 1 + U_X(z), the 1 from the geometric series, and the 1 from 1 + U_Y(z). Therefore, as desired,

E[(1 + zR_µ(z) + zR_ν(z) − zX − zY)⁻¹] = 1.

This shows that

R_{µ⊞ν}(z) = R_µ(z) + R_ν(z)

holds in a neighborhood of zero.

This implies that Φ_{µ⊞ν} = Φ_µ + Φ_ν if Im z is sufficiently large, and hence by Corollary 3.9.7, we have Φ_{µ⊞ν} = Φ_µ + Φ_ν on H_{+,2δ}(B), provided that this lies inside the common domain of Φ_{µ⊞ν}, Φ_µ, and Φ_ν. Since Var_{µ⊞ν}(1) = Var_µ(1) + Var_ν(1) and all these elements are positive, we have ‖Var_{µ⊞ν}(1)‖ ≥ max(‖Var_µ(1)‖, ‖Var_ν(1)‖), and hence it is sufficient that δ > ‖Var_{µ⊞ν}(1)‖^{1/2}.
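A standard scalar illustration (B = ℂ) is the semicircle law of variance t, for which R(z) = tz in the convention used here, so that the free convolution of semicircle laws of variances s and t is the semicircle law of variance s + t. The sketch below takes these two facts as assumptions and checks numerically that z⁻¹ + R_µ(z) + R_ν(z) inverts the Cauchy transform of the semicircle law of variance s + t. The function name, the quadrature rule, and the sample values are choices made for this example.

```python
import math

def semicircle_cauchy(t, w, n=2000):
    """Cauchy transform of the semicircle law of variance t at w (Im w > 0).

    With x = 2*sqrt(t)*cos(theta) the density becomes (2/pi)*sin(theta)**2
    on [0, pi]; the integral is approximated by the midpoint rule with n nodes.
    """
    r = 2 * math.sqrt(t)
    total = 0
    for k in range(n):
        theta = math.pi * (k + 0.5) / n
        total += math.sin(theta) ** 2 / (w - r * math.cos(theta))
    return 2 * total / n

s, t = 1.0, 2.0
z = 0.1 - 0.05j            # small z in the lower half plane, so 1/z has Im > 0
w = 1 / z + s * z + t * z  # z^-1 + R_mu(z) + R_nu(z), assuming R(z) = variance * z

gap = abs(semicircle_cauchy(s + t, w) - z)
print(gap)
```

The point z is taken in the lower half plane only so that w lands in the upper half plane, where the quadrature for the Cauchy transform is well behaved.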

5.5.3 The (anti-)monotone case

The following result is due to [Mur00, Theorem 3.1] in the scalar-valued case and [Pop08, Theorems 3.2 and 3.7] in the operator-valued case, whose proof we follow here. Another proof in the scalar case is in [Ber05].

Theorem 5.5.9. We have F_{µ▷ν}(z) = F_µ(F_ν(z)) and F_{µ◁ν}(z) = F_ν(F_µ(z)) as fully matricial functions.

Proof. Let inv denote the fully matricial function z ↦ z⁻¹ where defined. Since F_µ = inv ∘ G̃_µ ∘ inv and inv is an involution, it suffices to show that G̃_{µ▷ν} = G̃_µ ∘ G̃_ν.

Let X and Y be monotone independent random variables in (A, E) realizing the laws µ and ν. We know that

E[f₀(Y) g₁(X) f₁(Y) . . . g_n(X) f_n(Y)] = E[E[f₀(Y)] g₁(X) E[f₁(Y)] . . . g_n(X) E[f_n(Y)]]

whenever f_j(Y) ∈ B⟨Y⟩₀ and g_j(X) ∈ B⟨X⟩₀. However, this also holds trivially if f_j(Y) ∈ B, and thus by linearity it holds when f_j(Y) ∈ B⟨Y⟩.

Now for ‖z‖ sufficiently small, we have

G̃_{µ▷ν}(z) = E[(1 − zX − zY)⁻¹ z]

= E[(1 − (1 − zY)⁻¹ zX)⁻¹ (1 − zY)⁻¹ z]

= E[ ∑_{k=0}^∞ [(1 − zY)⁻¹ zX]^k (1 − zY)⁻¹ z ].

Note that (1 − zY)⁻¹ is in the closure of M_n(B⟨Y⟩) and zX ∈ M_n(B⟨X⟩₀), and hence by monotone independence

G̃_{µ▷ν}(z) = E[ ∑_{k=0}^∞ [E[(1 − zY)⁻¹ z] X]^k E[(1 − zY)⁻¹ z] ]

= E[ ∑_{k=0}^∞ [G̃_ν(z) X]^k G̃_ν(z) ]

= E[(1 − G̃_ν(z) X)⁻¹ G̃_ν(z)]

= G̃_µ ∘ G̃_ν(z).

This equality extends to all z by Corollary 3.9.7. The anti-monotone case follows from the monotone case since µ ◁ ν = ν ▷ µ.
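A standard scalar illustration (B = ℂ) is the arcsine law of variance t, supported on (−√(2t), √(2t)), whose F-transform is √(z² − 2t) with the branch mapping the upper half plane to itself. Arcsine laws are known to form a monotone convolution semigroup in which variances add, so the composition rule predicts F_s ∘ F_t = F_{s+t}. The sketch below checks this numerically without using the closed form; the function name, the quadrature rule, and the sample values are assumptions made for this example.

```python
import math

def arcsine_F(t, z, n=2000):
    """F = 1/G for the arcsine law of variance t at z (Im z > 0).

    With x = sqrt(2t)*cos(theta) the arcsine density becomes d(theta)/pi on
    [0, pi]; G is approximated by the midpoint rule with n nodes.
    """
    r = math.sqrt(2 * t)
    g = sum(1 / (z - r * math.cos(math.pi * (k + 0.5) / n)) for k in range(n)) / n
    return 1 / g

s, t = 1.0, 2.0
z = 0.4 + 1.1j  # any point of the upper half plane

inner = arcsine_F(t, z)                 # F_nu(z), again in the upper half plane
gap = abs(arcsine_F(s, inner) - arcsine_F(s + t, z))
print(gap)
```

Because both laws here belong to the same semigroup, the two orders of composition agree in this example; for general µ and ν the compositions F_µ ∘ F_ν and F_ν ∘ F_µ differ, which reflects the non-commutativity of ▷.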

CHAPTER 6 Tools: Norm estimates and subordination
