Chapter 2 Constructing Correlation Networks for SuperMAG
2.3 Identifying Links in the Network
We wish to establish the extent of similarity between temporal variations of the vector time-series of magnetometer stations. The magnetometer time-series are inherently non-stationary as there are many long term trends present as well as bursty activity (e.g. substorms) and as such they bear resemblance to the geophysical time-series used to establish networks in a climate context. Radebach et al. [2013] lists a number of methods that have been used in the construction of climate networks. Examples include linear (Bravais-Pearson)
correlation [Tsonis and Roebber, 2004], (cross-)mutual information [Donges et al., 2009], a phase synchronization index based on the normalized Shannon entropy of the associated phase difference time-series [Yamasaki et al., 2009], the (cross-)mutual information of order patterns [Barreiro et al., 2011], event synchronization [Malik et al., 2012] or transfer entropy [Hlinka et al., 2013]. Details of their use in a climate context can be found in the above references and we briefly outlined mutual information in chapter 1. Parameters used to establish connections in climate networks are typically scalars such as temper- ature, pressure and rainfall. Here we have the vector time-series measurements of the magnetic field. While a single component of the magnetic field could in principle be used there are significant drawbacks to this; the magnetic compo- nents, which vary in response to ionospheric currents, depend on the relative spatial position of the magnetometer station with respect to the currents. Us- ing a single component means we may miss correlated pairs of signals that are in orthogonal components.
Canonical correlation [Brillinger, 1975] offers a potential solution to this. Jackel et al. [2001] has already investigated its use with respect to mag- netometer station pairs. Canonical correlation defines a new coordinate sys- tem X′ = [X1′(t), X2′(t), X3′(t)], Y′ = [Y1′(t), Y2′(t), Y3′(t)] for a given vector time-series pairX(t), Y(t) in which the cross-correlation coefficientrX1′,Y1′ be-
tween first canonical componentsX1′(t) and Y1′(t) is maximised. HereX′(t) =
RXX(t),Y′(t) =RYY(t) and RX andRY are the respective rotation matri- ces. These matrices are conceptually thought of as a rotation matrix but
it includes stretching and shearing, hence, Det(R) ̸= 1. In addition the
cross-correlation between the orthogonal canonical componentsrX1,Y2, rX1,X2, rY1,Y2... is zero, which is to say the covariance and cross-covariances matrices
for the rotated datasets are all diagonal. Determining the canonical cross- correlation coefficients and the rotation matrices involves finding the eigenval- ues and eigenvectors for the following matrices: CX=
∑−1 XX ∑ XY ∑−1 YY ∑ YX and CY = ∑−1 YY ∑ YX ∑−1 XX ∑
XY by solving the equations below
(CX−λXiI)aXi = 0 (2.3)
and
(CY−λYiI)aYi = 0 (2.4)
where covariance matrices ∑XX, ∑Y Y and cross-covariance matrices ∑XY
and ∑Y X are defined as ∑XY =,
E[(X1(t)−µX1) (Y1(t)−µY1)] E[(X1(t)−µX1) (Y2(t)−µY2)] E[(X1(t)−µX1) (Y3(t)−µY3)] E[(X2(t)−µX2) (Y1(t)−µY1)] E[(X2(t)−µX2) (Y2(t)−µY2)] E[(X2(t)−µX2) (Y3(t)−µY3)] E[(X3(t)−µX3) (Y1(t)−µY1)] E[(X3(t)−µX3) (Y2(t)−µY2)] E[(X3(t)−µX3) (Y3(t)−µY3)] ,
with similar definitions for ∑XX, ∑Y Y and ∑Y X. AboveµX is the expecta-
tion value of X, E[X] = µX. The eigenvalues relate to the cross-correlation
coefficients for the canonical components, λXi′ = r2Xi′Yi′, λYi′ = rY2i′Xi′ and aX′i
andaYi′, are the eigenvectors that form the rows of the rotation matrices RX′
and RY′ respectively. The canonical correlation coefficients obey the relation
rY′
1X1′ ≥rY2′X2′ ≥rY3′X3′.
To identify connections in the network we only use the first canonical component of our rotated dataset. Other information such as the relative contribution of each of the original components to the respective canonical components may hold useful information, however, using this goes beyond the scope of what we wish to accomplish. Unlike Pearson correlation, which allows
for anti-correlation, canonical coefficients can only take values 0≤rY′
iX′i ≤1.
In effect, any anti-correlated variations are rotated into positive correlations. We use canonical correlation between the windowed (the length of which depends on the aims of the analysis) segments of pairs of vector magnetome- ter time series to quantify similarity between pairs of stations as a function of time. The time-series are de-trended with a linear fit within each corre-
lation window. We calculate the canonical correlation between the ith and
jth station for all possible station pairs to form a cross-correlation matrix (or weighted network), Cij(t). Cij(t) contains the correlation coefficient for the
first canonical component for each station pair and each windowed segment.
The Cij matrix could be used as a weighted adjacency matrix for the
network instead of a binary adjacency matrix. However, many of theCij values
are a result of “random correlation” and do not constitute physically related measurements, that is, it is dominated by noise. A “random correlation” here means obtaining a correlation coefficient from a pair of time series that is likely to of occurred by chance (based on a presupposed statistical significance). We outline how we determine the false positive rate in section 2.4.3. By using a
threshold CT we can obtain a binary adjacency matrix Aij which, given an
appropriate choice forCT, will contain less noise.