• No results found

Foundations for measuring multivariate association between two complex random vectors

4 Correlation analysis

4.1 Foundations for measuring multivariate association between two complex random vectors

The correlation coefficient between two scalar real zero-mean random variables u and v is defined as

ρuv= √ Euv Eu2

Ev2 = √ Ruv

Ruu

Rvv. (4.1)

The correlation coefficient is a convenient measure for how closely u andv are related.

It satisfies−1 ≤ ρuv≤ 1. If ρuv= 0, then u and v are uncorrelated. If |ρuv| = 1, then u is a linear function ofv, or vice versa, with probability 1:

u= Ruv

Rvvv. (4.2)

4.1 Measuring multivariate association 87

0 1 2

0 1 2

(b)

0 1 2

0 1 2

(a)

Figure 4.1 Scatter plots of 100 sample pairs of u andv with different ρuv.

In general, foruv| ≤ 1,

ˆ u = Ruv

Rvvv (4.3)

is a linear minimum mean-squared error (LMMSE) estimate of u fromv, and the sign ofρuvis the sign of the slope of the LMMSE estimate. An expression for the MSE is

E| ˆu − u|2= RuuRu2v

Rvv = Ruu(1− ρu2v). (4.4) Example 4.1. Consider a mean, unit-variance, real random variable u and a zero-mean, unit-variance, real random variable n, uncorrelated with u. Now letv = au + bn for given real a and b. We find

ρuv =√ Ruv

Ruu

Rvv = a

a2+ b2.

Figure 4.1depicts 100 sample pairs of u andv, for uniform u and Gaussian n. Plot (a) shows the case a= 0.8, b = 0.1, which results in ρuv= 0.9923. Plot (b) shows a = 0.8, b = 0.4, which results in ρuv= 0.8944. The line is the LMMSE estimate v = au (or, equivalently, ˆu = aˆ −1v).

How may we define a correlation coefficient between a pair of complex random vectors to measure their multivariate association? We shall consider the extension from real to complex quantities and the extension from scalars to vectors separately, and then combine our findings.

4.1.1 Rotational, reflectional, and total correlations for complex scalars

Consider a pair of scalar complex zero-mean random variables x and y. As a straight-forward extension of the real case, let us define the complex correlation coefficient

ρx y = E x y

E|x|2

E|y|2 = Rx y

Rx x

Ryy

, (4.5)

88 Correlation analysis

0 1 2

0 1 2

(a) 0 1 2

0 1 2

(b)

Figure 4.2 Sample pairs of two complex random variables x and y withρx y= exp(jπ/2) and

|Rx y|/Ryy= 1.2. Plot (a) depicts samples of x and (b) samples of y, in the complex plane. For corresponding samples we use the same symbol.

which satisfies 0≤ |ρx y| ≤ 1. The LMMSE estimate of x from y is ˆx(y)= Rx y

Ryy

y= |Rx y| Ryy

ejRx yy, (4.6)

which achieves the minimum error

E| ˆx(y) − x|2= Rx x|Rx y|2 Ryy

= Rx x(1− |ρx y|2). (4.7) Hence, ifx y| = 1, ˆx(y) is a perfect estimate of x from y. Figure4.2depicts five sample pairs of two complex random variables x and y withρx y = exp(jπ/2), in the complex plane. Plot (a) shows samples of x and (b) the corresponding samples of y. We observe that (b) is simply a scaled and rotated version of (a). The amplitude is scaled by the factor|Rx y|/Ryy, preserving the aspect ratio, and the rotation angle isRx y = ρx y.

Now what about complementary correlations? Instead of estimating x as a linear func-tion of y, we may employ the conjugate linear minimum mean-squared error (CLMMSE) estimator

ˆx(y)= E x y

E|y|2y= Rx y

Ryy

y= | Rx y| Ryy

ejRx yy. (4.8) The corresponding correlation coefficient is



ρx y = Rx y

Rx x

Ryy

, (4.9)

with 0≤ |ρx y| ≤ 1, and the CLMMSE is E| ˆx(y)− x|2= Rx x−| Rx y|2

Ryy = Rx x(1− |ρx y|2). (4.10) Hence, if |ρx y| = 1, ˆx(y) is a perfect estimate of x from y. Figure4.3depicts five sample pairs of two complex random variables x and y with ρx y = exp(jπ/2), in the complex plane. Plot (a) shows samples of x and (b) the corresponding samples of y.

4.1 Measuring multivariate association 89

0 1 2

0 1 2

(a)

0 1 2

0 1 2

(b)

Figure 4.3 Sample pairs of two complex random variables x and y withρx y= exp(jπ/2) and|Rx y|/Ryy= 1.2. Plot (a) depicts samples of x and (b) samples of y, in the complex plane.

Samples of y correspond to amplified and reflected samples of x. The reflection axis is the dashed line, which is given byRx y/2 = ρx y/2 = π/4.

We observe that (b) is a scaled and reflected version of (a). The amplitude is scaled by the factor| Rx y|/Ryy, preserving the aspect ratio. Sincex =  Rx y− y, we have, with probability 1:

x −12 Rx y = −

y − 12 Rx y

. (4.11)

Thus, the reflection axis is Rx y/2 = ρx y/2, which is the dashed line in Fig.4.3.

Depending on whether rotation or reflection better models the relationship between x and y,|ρx y| or |ρx y| will be greater. We note the ease with which the best possible reflection axis is determined as half the angle of the complementary correlation Rx y

(or half the angle of the correlation coefficientρx y). This would be significantly more cumbersome in real-valued notation.

Of course, data might exhibit a combination of rotational and reflectional correlation, motivating use of a widely linear minimum mean-squared error (WLMMSE) estimator

ˆx(y, y)= αy + βy, (4.12)

where α and β are chosen to minimize E| ˆx(y, y)− x|2. We will be discussing WLMMSE estimation in much detail in Section 5.4. At this point, we content our-selves with stating that the solution is

ˆx(y, y)= ˆx(y)− ˆx[ ˆy(y)]+ ˆx(y)− ˆx[ ˆy(y)]

1− |ρyy|2

=(Rx yRyy− Rx yRyy)y+ ( Rx yRyy− Rx yRyy)y

R2yy− | Ryy|2 , (4.13) where Ryy= Ey2is the complementary variance of y,|ρyy|2= | Ryy|2/R2yyis the degree of impropriety of y, ˆy(y)= [ Ryy/Ryy]y is the CLMMSE estimate of y from y, and

ˆy(y)= [ Ryy/Ryy]yis the CLMMSE estimate of y from y.

90 Correlation analysis

Through the connection

E| ˆx(y, y)− x|2= Rx x(1− ¯ρ2x y), (4.14) we obtain the corresponding squared correlation coefficient

ρ¯x y2 =x y|2+ |ρx y|2− 2 Re[ρx yρx yρyy] 1− |ρyy|2

=(|Rx y|2+ | Rx y|2)Ryy− 2 Re(Rx yRx yRyy)

Rx x(R2yy− | Ryy|2) , (4.15) with 0≤ ¯ρx y2 ≤ 1. We note that the correlation coefficient ¯ρx y, unlikeρx y = ρyx and



ρx y = ρyx, is not symmetric in x and y: in general, ¯ρx y2 = ¯ρyx2 .

The correlation coefficient ¯ρx yis bounded in terms of the coefficientsρx yandρx yas max(|ρx y|2, |ρx y|2)≤ ¯ρx y2 ≤ min(|ρx y|2+ |ρx y|2, 1). (4.16) The lower bound holds because a WLMMSE estimator subsumes both the LMMSE and CLMMSE estimators, so we must have WLMMSE≤ LMMSE and WLMMSE ≤ CLMMSE. However, there is no general ordering of LMMSE and CLMMSE, which we write as LMMSE CLMMSE.

A common scenario in which the lower bound is attained is when y is maximally improper, i.e., Ryy = | Ryy| ⇔ |ρyy|2= 1, which yields a zero denominator in (4.15).

This means that, with probability 1, y = ejαy for some constantα, and Rx y = ejαRx y. In this case, y and ycarry exactly the same information about x. Therefore, WLMMSE estimation is unnecessary, and can be replaced with either LMMSE or CLMMSE esti-mation. In the maximally improper case, ¯ρx y2 = |ρx y|2= |ρx y|2. Two other examples of attaining the lower bound in (4.16) are either Rx y = 0 and Ryy= 0 (i.e., ρx y = 0 andρyy = 0), which leads to ¯ρ2x y = |ρx y|2, or Rx y = 0 and Ryy = 0 (i.e., ρx y = 0 and



ρyy= 0), which yields ¯ρx y2 = |ρx y|2; cf. (4.15).

The upper bound ¯ρ2x y = |ρx y|2+ |ρx y|2is attained when the WLMMSE estimator is the sum of the LMMSE and CLMMSE estimators: ˆx(y, y)= ˆx(y) + ˆx(y). In this case, y and ycarry completely complementary information about x. This is possible only for uncorrelated y and y, that is, a proper y. It is easy to see that Ryy = 0 ⇔ |ρyy|2= 0 in (4.15) leads to ¯ρ2x y = |ρx y|2+ |ρx y|2. The following example gives two scenarios in which the lower and upper bounds are attained.

Example 4.2. Attaining the lower bound. Consider a complex random variable y= ejα(u+ n),

where u is a real random variable andα is a fixed constant. Further, assume that n is a real random variable, uncorrelated with u, and Rnn = Ruu. Let x = Re (ejαu)= cos(α)u. The LMMSE estimator ˆx(y) = 12cos(α)e−jαy and the CLMMSE estimator ˆx(y)= 12cos(α)ejαy both perform equally well. However, they both extract the same information from y because y is maximally improper. Hence, a WLMMSE estimator has

4.1 Measuring multivariate association 91

no performance advantage over an LMMSE or CLMMSE estimator:x y|2= |ρx y|2= ρ¯2x y =12.

Attaining the upper bound. Consider a proper complex random variable y and let x be its real part. The LMMSE estimator of x from y is ˆx(y)= 12y, the CLMMSE estimator is ˆx(y)=12y, and the WLMMSE estimator ˆx(y, y)= 12y+12yproduces a perfect estimate of x. Here, rotational and reflectional models are equally appropriate.

Each tells only half the story, but they complement each other perfectly. It is easy to see that x y|2= |ρx y|2=12 and ¯ρx y2 = |ρx y|2+ |ρx y|2= 1. (This also shows that ρ¯2x y = ¯ρyx2 since it is obviously impossible to perfectly reconstruct y as a widely linear function of x.)

The following definition sums up the main findings of this section, and will be used to classify correlation coefficients throughout this chapter.

Definition 4.1. A correlation coefficient that measures how well a