1.6 Construction of the Dictionary and Estimation Criteria
1.6.5 Two Step Estimation of TID Parameters
The MSTID model presented in Equation 1.8 is characterized by three parameters:
{λ, θ, ϕ}. As the reconstruction error depends on the parameters through trigono-metric functions, and a division, this error is more sensitive to some parameters than others. Specifically, the impact of small deviations of the parameters{λ, θ}on the fi-nal performance is much higher than the case of ϕ. The method is extremely precise in the estimation of propagation azimuth and wavelength, but the velocity estima-tion is less accurate, possibly because it is computed from differences of ϕ at each snapshot. Taking this into consideration, the estimation strategy will have two steps, in the first step, the parameters{λ, θ}are estimated using a dictionary fine-grained in the range of these parameters and rough in{ϕ}. In the second step, the estimation is done with λ and θ fixed to the values of the first step, and the parameter ϕ will be used to construct a new dictionary with a fine grained range of values.
The first approach for solving the problem (1.14) was based on coordinate descent, as explained in detail in Hastie, Tibshirani, and Wainwright (2015). For our imple-mentation, we have used a method for solving the problem which yields equivalent solutions but is computationally faster. This method is the LARS, see for instance Efron et al. (2004) and Hastie, Tibshirani, and Wainwright (2015). The computa-tional complexity of the LARS algorithm is equivalent to the ordinary least squares estimation which has a complexity of the order O p2N, where p is the number of elements of the dictionary, and N the number of pierce points.
The effect of splitting the estimation into two steps also has an important impact on the computational requirements of the problem. If the granularity (i.e., incre-ments) on the values of each parameter were of n, the size of the dictionary would be O(n3). By dividing the estimation into two phases, the dictionary of the struc-ture in Equation 1.19 gives a requirement of O(n2)for the first phase. The resulting nonzero-weight elements of the dictionary of the detected MSTID in the first phase M (where M N) reduces the total number of operations on the second phase to O(Mn). The range of values n needs to be adapted to the problem at hand, which depends on the margin of phases, frequencies, and azimuths of the MSTIDs that we want to detect.
1.6.5.1 Step I: Estimation of{λ, θ}and the Number of TIDs
For the estimation of {λ, θ}, we can simplify the dictionary by means of simple trigonometry, the parameter ϕ of the elements Ti(x, y)of the dictionary (See Equa-tion 1.8) is absorbed by the amplitude of the wave. Thus the element Ti(x, y), for the snapshot t is:
αiTi(x, y) = βicos 2π
λi (x·cos θi+y·sin θi)
+γisin 2π
λi (x·cos θi+y·sin θi)
(1.16) which can be rearranged as two new dictionary elements as,
Tβi(x, y) = cos 2π
λi (x cos θi+y sin θi)
(1.17) Tγi(x, y) = sin 2π
λi (x cos θi+y sin θi)
(1.18) where βi and γi are the amplitudes of the pair orthogonal sinusoidal basis, which satisfy the relationship of βi = αicos(ϕi)and γi = −αisin(ϕi). Note the values of βiand γi are used only for deciding the set of elements of the dictionary Ti(x, y)to be used in the following step. And we decided to work with an augmented dictio-nary, with elements of the form shown in equations 1.17 and 1.18, and estimate the amplitudes βiand γi by means of the LASSO algorithm.
Although the phase ϕi can be obtained from βiand γi, by their ratio and the use of the arctan function, this method is extremely unreliable, which justifies postponing the estimation of ϕi for step II. The estimation of ϕ is done in step II keeping the fixed values λ and θ estimated in step I.
The dictionary DI for step I has the following structure:
DI = [Dβ, Dγ] = [Tβ1, Tβ2, ..., TβN, Tγ1, Tγ2, ..., TγN] (1.19) where Tβi and Tγi denotes the elements of the dictionary defined in Equation 1.17 and Equation 1.18 with the corresponding parameters θβi, λβiand θγi, λγi. We denote the vectors of the weights associated with each estimated parameter as: βt and γt. Note that we did not introduce a constraint in the elements of the vectors βtand γt
so that they should be non-null simultaneously; we decided to take as candidates of MSTID the union of indices i found in both vectors, to filter the values in step II. That is, some of the candidates to be detected MSTIDs, will be discarded in the second step. The map V(x, y)at epoch t can be expressed as follows,
V(x, y) =DI· [ˆβt, ˆγt]T = Dβ· ˆβt+Dγ·γˆt (1.20)
where the vector of coefficients ˆβt and ˆγt are estimated by means of the LASSO algorithm. The elements of the dictionary DIassociated with the coefficients ˆβtand
ˆ
γtdifferent from zero, determine the wavelengths ˆλtand directions ˆθtof each MSTID candidate and are the input for the next step.
The parameter ϕtcan can be approximately computed by,
ˆ
ϕt= −arctanγˆt
ˆβt
(1.21)
Nevertheless, we have found that this estimate was extremely unreliable and in-consistent between different snap shots. Note that the solution from LASSO is the approximated decomposition of detrended VTEC map V(x, y, t)with the deviation of D· [ˆβt, ˆγt]. The method we described above in the text gives a reliable estimation for the parameters of ˆθtand ˆλtfrom the improved dictionary, but a weak estimation for ϕt. In order to improve the reliability of our estimation, we use another LASSO step in order to have a better estimate of ϕt.
1.6.5.2 Step II: Estimation of ϕ
In step I at epoch t, we detect a set of M possible MSTIDs candidates. We define the parameters of the ith detected MSTID at epoch t as the pair(ˆλi, ˆθi), which remains fixed in step II, where we create a new smaller dictionary for estimating the ϕi associ-ated with each pair(ˆλi, ˆθi), from a new variable that is more suitable. We will denote anal-ysis showed that the expression 1.23 has a lower sensitivity to the estimation error of ϕj than 1.22. Hence, the dictionary is parametrized by B(ˆλ
iˆθi)j.
The structure for the elements j associated with the pair(ˆλi, ˆθi)of the dictionary of step II, will be as follows:
T(ˆλ to a given ϕ, and the structure of the new dictionary, which we will denote as DI I is created by concatenation of the dictionaries specific for each MSTID associated with each pair(ˆλi, ˆθi)detected in step I. Therefore, the dictionary associated with a given MSTID Dˆλiˆθi, will consist of a set of waveforms that characterize the MSTID estimated for fixed values of ˆθi and ˆλi and the range of values B(ˆλ
iˆθi)j related to the
phase ϕj of the waveform.
DI I = [Dˆλ
1ˆθ1, Dˆλ2ˆθ2, ..., DˆλMˆθM] = [Tˆλ
1ˆθ11, Tˆλ1ˆθ12, ..., TˆλMˆθMN] (1.24) Thus, if the range of possible values of B(ˆλ
iˆθi)j is N, the size of the dictionary DI I is M×N (M is the number of candidate MSTIDs from step I).
Note that dividing the estimation into two steps, in addition to solving the estima-tion uncertainty in ϕj, reduces the computational needs. That is, the combined size of the two dictionaries is 2N2+MN, which is much smaller than the size N3needed for simultaneously computing all the parameters. For instance, if the resolution of θ, λ and B are 2◦, 2 km and 0.1 km respectively, assuming M≤10 in each snapshot, the size of the dictionary is N3 ≈ 108versus 2N2+MN ≈106. The value of N was variable and depended on the resolution of each variable. Typically the value of N was around 500.