Chapter 2: Compressive Sensing for Cognitive Radio: A Literature Review
2.4 Compressive Sensing
2.4.2. The CS Theoretical Framework
2.4.2.1. The CS Concept
The theoretical treatment of compressive sensing considers a discrete-time signal
x
, which can be viewed as anN×1
column vector inR
N with elementsx(n),
n=1,2,...,N
. Any signal inR
N can be represented in terms of an orthonormal basis ofN×1
vectors{ }
N1i
i
ψ
= (Strang, 2016). Using theN×N
basis matrixΨ = [ψ| ψ
1. .
1
=
i ix
s
ιψ
Ν =∑
orx = Ψs
(2.4)where
s
is theN×1
vector and represents the signal in theΨ
domain (Baraniuk,2007). Both
x
ands
are equivalent representations of the signal. Usually for radio frequency signals,x
is the time domain representation ands
is the frequency or Fourier domain representation. The signalx
isK-
sparse if onlyK
s
i coefficients arenonzero and all the others are exactly zero. The signal
x
is compressible when only a fews
icoefficients have large values and the others are relatively small. The fact thatcompressible signals are well approximated by
K-
sparse representations forms the foundation of transform coding (Mallat, 1999). Unfortunately, the procedure under transform coding works is wasteful, as it was mentioned in 2.4.1. The number of samplesN
may be large even ifK
is much smaller. Moreover, all theN
s
i coefficientshave to be computed, even though only
K
of them is useful. Lastly, the locations of the larges
icoefficients have to be known, thus an extra burden.Compressive sensing addresses these inefficiencies by directly acquiring a compressed signal representation without going through the intermediate stage of acquiring
N
samples (Candès et al., 2006a; Donoho, 2006). A linear measurement process is employed consisted ofM < N
vectors inR
N{ }ϕ
i i1Μ
= . Each of
measurements is the inner product between
x
and the vectorφ
i that is,,
i
y
=
x
ϕ
ι=ϕ
ιΤx
, where.
Tdenotes transposition. If the measurements are arrangedin an
M×1
vectory
and the measurement vectorsφ
i in anM
×N
measurementmatrix
Φ, y
can be expressed asy=Φx
and from equation (2.4) above it can bewritten as:
where Θ = ΦΨ is an M× matrix. The measurement process is not adaptive, N
meaning that
Φ
is fixed and does not depend on the signalx
(Candès & Romberg,2005; Tsaig & Donoho, 2004).
The problem now is i) to design a measurement matrix
Φ
so that the vitalinformation in the
K-
sparse or compressible signalx
is not lost by the dimensionality reduction fromx ϵ R
N toy
ϵ R
M and ii) a reconstruction algorithm to recovers
(andsubsequently
x
) from onlyM
<
N
measurements, since (2.5) forms an underdetermined system of linear equations and the traditional solving methods do not apply. For these two requirements to be fulfilled, the following should hold:a. Matrix
Θ
must preserve the Euclidean length of all theK-
sparse vectorsx
, a property called Restrictive Isometry Property (RIP) (Candès & Tao, 2005; Candès & Wakin, 2008). If this holds, then (2.5) can be solved with onlyM < N
measurements provided thatK
≤
M
(Candès & Tao, 2006; Candès et al., 2006b).b. Matrix
Φ
must be largely incoherent with the basis matrixΨ
, in whichthe signal is
K-
sparse. Coherence measures the largest correlation between any two elements ofΦ
andΨ
(Donoho & Huo, 2001). Theincoherence between
Φ
andΨ
has as result that if a signal has a denserepresentation in
Φ
, then it should have a sparse representation inΨ
(Baraniuk, 2007; Candès & Wakin, 2008). The positive effect of the incoherence between
Φ
andΨ
is that the larger it is the fewermeasurements
M
are required for stable and accurate signal recovery (Donoho & Elad, 2003; Tropp et al., 2006a). In addition, it must be noted thatM
gets smaller asK
becomes smaller as well, or in other words, signal sparsity becomes greater (Candès & Wakin, 2008), hence the CS efficiency is enhanced.Both RIP of
Θ
and incoherence betweenΦ
andΨ
can be achieved with highwith elements drawn from a normal, Gaussian, Bernoulli or other sub-Gaussian distributions. Such matrices obey the RIP (Candès & Wakin, 2008); Baraniuk et al., 2007) and in addition are largely incoherent with any fixed orthonormal basis
Ψ
(Candès & Wakin, 2008).
The random matrices have an interesting property. When
Φ
is random thenΘ=ΦΨ
will also be a random matrix for which the RIP holds regardless of the choiceof
Ψ
(Baraniuk, 2007; Candès & Wakin, 2008; Baraniuk et al., 2007; Baraniuk et al.,2017), and thus (2.5) can be solved. Therefore, the focus of the problem is not on the design of the measurement matrix
Φ
, but the on finding a basisΨ
to whichx
issparse. This property of the random matrices is called universality and constitutes a significant advantage to using them to construct the measurement matrices (Baraniuk et al., 2017).
An attractive property of a random matrix
Φ
is that the measurements it derivesare democratic, meaning that each measurement carries roughly the same amount of information about the signal being acquired, thus rendering recovery possible by using any subset of
M
measurements (Davenport et al., 2009; Laska et al., 2011). Thus, one can be robust enough to the loss or corruption of a small fraction of the measurements (Baraniuk et al., 2017).2.4.2.2. Signal Recovery
The signal reconstruction algorithm aims to find the sparsest vector
s
which is consistent with measurement vectory=Φx
(Fornassier & Rauhut, 2015). This naturally leads to solving thel
0-mimization problem (Eldar & Kutyniok, 2012):min ǁsǁ
0 subject toy = Φx = ΦΨs = Θs
(2.6)where the
l
0norm counts the number of non-zero entries in vectors
. The solution of(2.6) has been proved to be NP-hard (Mallat & Zhang, 1993; Natarajan, 1995; Muthukrishnan, 2005). One way to bypass this problem is to solve the
l
1-min ǁsǁ
1 subject toy = Φx = ΦΨs = Θs
(2.7)where the
l
1 norm represents the sum of the absolute values of all the entries ofs
. Theresulting problem, also known as basis-pursuit, is considered a convex optimization problem, is computationally feasible and can be posed as a linear programme (Donoho & Elad, 2003; Chen et al., 1998; Elad & Bruckstein, 2002; Boyd & Vanderberghe, 2004). By exploiting the signal sparsity and the RIP property of matrix
Θ
, thel
1-minimization approach to the solution equation 2.8 can exactly recover thesignal with high probability (Candès et al., 2006a; Donoho, 2006; Candès & Romberg, 2017).
There are also other signal recovery approaches, which include greedy algorithms, such as orthogonal matching pursuit (Tropp, 2004; Davenport & Wakin, 2010; Needell & Vershynin, 2010) and iterative thresholding (Blumensath & Davies, 2009; Daubechies et al., 2004; Tropp & Needell, 2009). The greedy algorithms rely on iterative approximation of the signal coefficients and support. This is done by either iteratively identifying the support of the signal until a convergence criterion is met, or by alternatively by obtaining an improved estimate of the sparse signal at each iteration that attempts to account for the mismatch to the measured data (Eldar & Kutyniok, 2012).
2.4.2.3. Compressive vs. Classical Sampling
CS differs from classical sampling in three important respects (Eldar & Kutyniok, 2012). First, sampling theory typically considers infinite length, continuous-time signals. In contrast, CS is a mathematical theory focused on measuring finite- dimensional vectors in
R
N. Second, rather than sampling the signal at specific points in time, CS systems typically acquire measurements in the form of inner products between the signal and more general test functions (measurement vectors). This is in fact in the spirit of modern sampling methods which similarly acquire signals by more general linear measurements (Walden, 1999b; Eldar & Mishali, 2009; Unser, 2000). Thirdly, the two frameworks differ in the manner in which they deal with signal recovery, i.e., the problem of recovering the original signal from the compressivemeasurements. In the Nyquist-Shannon framework, signal recovery is achieved through sinc interpolation (Marks II, 1991), a linear process that requires little computation and has a simple interpretation. In CS, however, signal recovery is typically achieved using highly nonlinear methods.