c
NEAR-OPTIMAL INVERSION OF INCOHERENT SCATTER RADAR MEASUREMENTS: CODING SCHEMES, PROCESSING TECHNIQUES, AND
EXPERIMENTS
BY
ROMINA NIKOUKAR
DISSERTATION
Submitted in partial fulο¬llment of the requirements
for the degree of Doctor of Philosophy in Electrical and Computer Engineering in the Graduate College of the
University of Illinois at Urbana-Champaign, 2010
Urbana, Illinois Doctoral Committee:
Associate Professor Farzad Kamalabadi, Chair Professor Erhan Kudeki
Professor Steven J. Franke Associate Professor Minh N. Do
ABSTRACT
Accurate and eο¬cient estimation of the key ionospheric state parameters such as electron density, ion composition, and electron and ion temperatures is required to understand fundamental issues of terrestrial plasma physics such as redistribution of energy and momentum, and coupling within terrestrial upper atmosphere regions. This work focuses on developing a modern coding scheme and inversion methodology using Arecibo incoherent scatter radar (ISR) to achieve eο¬cient, and near-optimal estimates of the key ionospheric parameters. In particular, this work considers two aspects of the ISR inversion problem: (i) ISR lag estimates at individual altitudes, and (ii) modulation techniques that can provide more accurate estimates with a speciο¬c range resolution. These two aspects suggest a unifying framework for ISR inversion in which modern computational technology and ISR methodology are utilized in a robust estimation procedure.
The ο¬rst contribution of this work is the development of a discrete forward model for F-region incoherent scatter measurements, where long pulses are utilized in transmission. The range smear-ing imposed on measurements by long pulses is modeled as a one-dimensional convolution along the range in the simpliο¬ed case where the receiver sampling is instantaneous. The next major phase of this research is to develop an eο¬cient hybrid technique that allows for estimation of plasma parameters by removing range smearing from measurements. The inversion technique incorporates both quadratic and edge-preserving regularization approaches in order to provide smooth plasma auto-correlation function (ACF) lag proο¬les in the presence of noise while still resolving the sharp gradients.
Another contribution of this thesis is to develop a technique for optimal modulation design in ISR experiments. The optimal resolution supported by ISR measurements is used as one criterion for the optimal design. The model order selection framework is applied to the problem at hand to ο¬nd the optimal resolution. The results indicate that, compared with a long pulse,
amplitude-modulated codes yield ο¬ner range resolutions with nearly similar parameter estimation errors or smaller estimation errors with the same range resolution. In medium to high SNR scenarios, a smaller on-oο¬ ratio of the transmitted waveform is recognized as a determining factor for allowing more freedom in removing range ambiguities as well as resulting in improved statistical accuracy for integrated data in range and lag directions. In order to ο¬nd the optimal amplitude modulation for Arecibo ISR measurements in medium to high SNR scenarios, a modiο¬ed form of the sequential backward selection algorithm is applied to the space of all amplitude modulated pulses with a certain on-oο¬ ratio. Due to the vast search space, there is no possibility for an exhaustive search. Therefore, the problem of ο¬nding the optimal amplitude modulation is viewed as an optimization problem. Three optimality criteria, namely, sum of squared errors, uniformity of estimation errors, and condition number of convolution matrices, are considered.
The ο¬nal contribution of this work was implementing and conducting several experiments in April 2004, August 2005, and July 2006 using the incoherent scatter radar at Arecibo Observatory, and applying the inversion technique to estimate the plasma parameters. In these experiments, the original mode of MRACF was modiο¬ed to utilize amplitude modulation. The results of all these experiments verify that when the SNR is suο¬ciently high, compared with an unmodulated long pulse, improved range resolution with nearly the same statistical accuracy is obtained when an amplitude modulation is utilized.
The results of the developed methodology and experimental design of this work can be extended to other incoherent scatter radars (such as Jicamarca radar in Peru, and advanced modular incoherent scatter radar in Alaska) to improve the estimation task in other altitude and latitude regions, and to extract many further ionospheric parameters such as electric ο¬eld strength, conductivity, current, and neutral wind speed.
To my father, my mother, and my sisters, and
my husband, Hossein, for their love and support
ACKNOWLEDGMENTS
I would like to express my deepest gratitude to my adviser, Dr. Farzad Kamalabadi, for his guidance, understanding, and patience during my graduate studies at the University of Illinois. He encouraged me to grow not only as a scientist but also as an instructor and an independent thinker. I am also thankful to him for encouraging the use of correct grammar and consistent notation in my writings and commenting on countless revisions of this manuscript. My special thanks goes to Dr. Erhan Kudeki for getting my graduate research started on the right foot, and always being there to listen and give advice. I am deeply grateful to him for the long discussions that helped me sort out the technical details of my work. I am also thankful to Dr. Steven Franke and Dr. Minh Do for commenting on my ideas and helping me understand and enrich them.
I am deeply indebted to Dr. Michael Sulzer, Dr. Nestor Aponte, and Dr. Sixto Gonzalez for their knowledge and advice. Their valuable comments and directions not only helped me in my research, but also taught me precious lessons for my future academic career. I also thank the staο¬ at the Arecibo Observatory for their support during my data collection.
Most importantly, none of this would have been possible without the incredible support of my parents, sisters, and husband. My family to whom this dissertation is dedicated, has been a constant source of love, concern, support, and strength all these years. My special thanks goes to my parents who made it possible for me to pursue and complete my PhD degree. I am also thankful to all my friends at the University of Illinois for their friendship and support.
TABLE OF CONTENTS
CHAPTER 1 INTRODUCTION . . . . 1
1.1 Motivation . . . 1
1.2 Context and research contribution . . . 3
1.3 Dissertation overview . . . 6
CHAPTER 2 PRELIMINARIES . . . . 7
2.1 The basics of incoherent scattering of radio waves . . . 7
2.1.1 Variation of the spectrum as a function of parameters . . . 8
2.2 Principles of pulsed radar operations, the soft-target radar equation, and the ambi-guity function . . . 9
2.3 Conventional coding schemes in incoherent scatter experiments . . . 13
2.3.1 Long-pulse technique . . . 13
2.3.2 Multi-pulse technique . . . 13
2.3.3 Alternating codes . . . 15
2.4 Current methodologies in incoherent scatter inversion . . . 16
2.4.1 Statistical framework for incoherent scatter inversion . . . 17
2.4.2 Height-by-height analysis . . . 20
2.4.3 Full-proο¬le analysis . . . 21
CHAPTER 3 PROPOSED TECHNIQUE FOR INVERSION OF INCOHERENT SCATTER MEASUREMENTS . . . 24
3.1 Forward model . . . 25
3.2 Inverse model . . . 27
3.2.1 Deconvolution . . . 27
3.2.2 Nonlinear least-squares (NLLS) optimization . . . 34
3.3 Extension to 2-dimensions . . . 35
3.4 Numerical results . . . 37
3.4.1 Hybrid technique versus conventional processing techniques . . . 37
3.4.2 Quadratic versus edge-preserving regularization techniques . . . 44
CHAPTER 4 INVESTIGATION OF CODING SCHEMES FOR ISR MEASUREMENTS . . . 47
4.1 Error covariance estimates . . . 48
4.2 Fundamental resolution in ISR measurements . . . 52
4.2.1 Model order selection framework . . . 52
4.2.2 Model order selection method . . . 54
CHAPTER 5 EXPERIMENTAL INVESTIGATION OF CODING SCHEMES
WITH ARECIBO ISR . . . 67
5.1 F-region measurements at Arecibo . . . 67
5.1.1 Multiple radar auto-correlation function . . . 68
5.2 Modiο¬ed MRACF experiments at Arecibo . . . 69
5.3 Observations . . . 70
5.3.1 July 2006 experiment . . . 70
5.3.2 August 2005 experiment . . . 75
5.3.3 April 2004 experiment . . . 87
CHAPTER 6 OPTIMAL MODULATION IN F-REGION INCOHERENT SCATTER EXPERIMENT DESIGN . . . 99
6.1 Experiment design in view of subset selection . . . 100
6.2 Statistical formulation . . . 101
6.3 Optimality criteria . . . 102
6.4 Sequential backward selection algorithm . . . 103
6.5 Numerical results . . . 104
CHAPTER 7 CONCLUSIONS . . . 110
7.1 Modeling . . . 110
7.2 Inversion technique . . . 111
7.3 Experiment design . . . 111
7.4 ISR experiments and parameter estimation results . . . 112
7.5 Future directions . . . 113
7.5.1 Arecibo topside mode . . . 113
7.5.2 Advanced modular incoherent scatter radar . . . 114
7.5.3 Jicamarca radar . . . 115
CHAPTER 1
INTRODUCTION
1.1 Motivation
Incoherent scatter radar (ISR) is the most powerful ground-based technique for studying the Earthβs ionosphere. The incoherent scatter (IS) echo is the result of the scattering of electromagnetic energy, radiated from radar, by electrons in the ionospheric plasma, which are themselves controlled by much slower, massive positive ions. The frequency spectrum of the received signal provides information about electron and ion temperatures, electron density, ion composition and velocity. The analytical relationship between the spectrum and these parameters has been well established in the literature, e.g. [Dougherty and Farely, 1960, 1961; Farley, 1966, 1969; Hagfors, 1961, 1971]. With the extraction of these parameters from incoherent scatter measurements, one can deduce many further ionospheric parameters such as ion composition, electric ο¬eld strength, conductivity and current, neutral air temperature and wind speed.
Although the exact forward theory of incoherent scatter was established more than four decades ago, inversion, the estimation of parameters from incoherent scatter spectra, has remained an open-ended problem, due to two major factors. The ο¬rst complication stems from the fact that variation of diο¬erent plasma parameters may give rise to similar changes in the IS spectrum [Vallinkoski, 1988], e.g. the distinction between changes of the spectra due to ion composition or temperature ratio is very diο¬cult. The same is true for ion mass and ion temperature for ο¬xed temperature ratio. This implies that spectrum, or equivalently its Fourier transform, i.e. auto-correlation function (ACF), needs to be retrieved as accurately as possible, since small errors in ACF may yield large errors in estimated parameters. The second factor is the range smearing of information from one altitude over a number of altitudes, which is due to the length of the transmitted pulse.
The transmitted pulse should be long enough to provide suο¬cient spectral resolution in measuring the spectrum, or equivalently, to permit ionospheric ACF to be measured with suο¬cient lag extent. On the other hand, at any given time, the received signal includes contributions from a volume which extends ππ2 in the radar line-of-sight direction, where π is the speed of light and π is the length of the pulse.
An example of the eο¬ect of range smearing is shown in Figure 1.1 where diο¬erent lags of the ACF are plotted prior to and after imposing the range ambiguity for an un-coded modulation of length 280 πs with a 430 MHz radar (thin and thick curves, respectively). The estimation of ionospheric parameters from the range-smeared ACF would result in a greater ion temperature than electron temperature, which is not physical [Nikoukar et al., 2008]. Conventional processing techniques utilize various ways to tackle the inclusion of range smearing into analysis. Range-gate analysis, for example, is based on the assumption of constant plasma parameters at each range gate (distance covered by the pulse length) and compensates for the eο¬ect of range smearing by a simple triangular weighting. This method, although simple and fast, is based on an unrealistic assumption and suο¬ers from the coarse resolution of estimated parameters. Full-proο¬le analysis, which attempts to estimate ionospheric parameters at all altitudes simultaneously, on the other hand, considers a full model for range smearing. This technique, however, suο¬ers from its signiο¬cant computational cost, which can be mitigated by coarser interpolation of the parameter space, hence sacriο¬cing estimation accuracy. The limitations of these processing techniques have motivated the search for more robust approaches to the problem of ISR inversion that meet the challenges of modern ISRs. The goal of this research is to develop a modern ISR parameter estimation methodology to achieve eο¬cient, near-optimal estimates of ionospheric parameters with a ο¬ne resolution. Primarily, this research will focus on the F-region ionosphere over the Arecibo Observatory, where the eο¬ect of molecular ions and collision can be simply neglected in the analysis. Moreover, the beam direction over Arecibo can be far from perpendicular to the magnetic ο¬eld lines, where the incoherent scatter process is well known. This work will consider two aspects of the ISR inversion problem: (i) ISR lag estimate at individual altitudes and (ii) modulation techniques that can provide more accurate estimates with a speciο¬c range resolution. These two aspects suggest a unifying framework for ISR inversion in which modern computational technology and ISR methodology are utilized in a robust estimation procedure.
0 50 100 150 200 250 300 0
0.5 1
Time lag ( sec)
Normalized ACF
at a given height
Figure 1.1 Normalized theoretical and measured ACF for a long un-coded pulse transmission and a constant ionosphere (thin and thick curves, respectively). Range smearing makes the shape of ACF so distorted that results of the ο¬t by standard nonlinear least-squares methods become poor. Notice that the above ACFs have been generated using a radar frequency of 430 MHz for a single ion ionosphere (oxygen). The lag spacing is 10 πs and an un-coded long pulse of length 280 πs is considered for transmission.
1.2 Context and research contribution
Since the development of ISR theory and construction of large incoherent scatter radars, such as the Arecibo radar, various methodologies were invented to deal with diο¬culties attributed to ISR parameter estimation. Generally speaking, they fall into two major categories of analysis techniques and coding schemes. To date numerous coding schemes have been primarily designed to make direct measurements of plasma ACF possible despite the pulse length. There are two main classes of coding techniques, amplitude modulation and phase modulation schemes, used for transmission. Amplitude modulation schemes such as double-pulse and multi-pulse techniques, pioneered in [Farley, 1969, 1972], exploit short pulses to obtain the spectra at individual altitudes. They, however, suο¬er from great sensitivity to the background and receiver noise. These techniques are suitable only for parameter estimation around the F-region peak where the back-scattered signal is rather high. Phase modulation schemes, such as alternating codes, maintain the ο¬ne spatial resolution of measurements, while exploiting the full duty cycle of radar, and are not as sensitive to the noise level. They, however, are only appropriate for low signal-to-noise ratio (SNR) scenarios,1
where the lag estimate errors are independent from each other. As a result, in the case of the Arecibo radar, where the transmitted power is high and the transmitting and receiving antenna area is large, this technique loses its appeal over other existing coding techniques.
Currently, the usual mode of operation for this type of measurements at Arecibo Observatory is the simple long-pulse technique. The major diο¬culty associated with the long-pulse modulation is that it can only provide range-smeared (indirect) measurements of plasma ACF. The current analysis techniques developed to deal with this range smearing of information suο¬er from severe sensitivity to noise, coarse spatial resolution, or signiο¬cant computational expense. Although the relationship between the measured signal ACF and the plasma ACF at individual altitudes has been thoroughly derived in the work of Lehtinen and Huuskonen [1996], to date no systematic approach exists to exploit this relationship in favor of more eο¬cient methodologies. In this work, we present an eο¬cient, near-optimal approach to the problem of incoherent scatter radar inversion which is extremely robust in the presence of system and background noise. The key to this new method is to consider the system of measurements as a combination of two linear and nonlinear systems. We formulate the estimation of plasma parameters as the result of linear inversion of measured signal ACF (to achieve plasma ACF at individual altitudes) and then of nonlinear inversion of plasma ACF (to obtain plasma parameters). Although the proposed analysis approach attempts to obtain estimates of plasma ACF at individual altitudes (separated by the distance equal to lag resolution), it is only able to partially remove the smearing; and thus there is a limit to the resolution of the estimated physical parameters.
Another drawback associated with un-coded long-pulse measurements is that they suο¬er from high correlation between ACF lag estimate errors. This correlation plays an important role in ISR error analysis, that is, determining the uncertainty level in estimated parameters. First, in order to avoid underestimation of error bars, one has to account for the full error covariance matrix.2 In addition, when SNR is high, random ο¬uctuations in the ACF estimates originate
in part from the self-noise, which is generated by the uncorrelated scatter from the transmitted signal from all simultaneously illuminated altitudes [Lehtinen and Huuskonen, 1996]. When the self-noise is dominating, the accuracy of observations cannot be increased by further increasing the signal strength because the random error in data increases at the same rate. Another eο¬ect of the correlation between lag estimate errors is related to the range integration. Because the estimates are highly correlated in high SNR regimes, the addition of π ACFs from adjacent altitudes does
2An error covariance matrix is formed from the correlation between errors of diο¬erent lag estimates received at
not yield an βπ-fold increase in estimation accuracy as would be the case for fully independent
observations.
In this work we present amplitude modulation (AM) as the solution to reducing the correlation between ISR lag estimate errors. This reduction in correlation, in turn, allows for improving range resolution especially in high SNR regimes. For this purpose, we study and compare the performance of amplitude modulation with that of long-pulse modulation in terms of statistical accuracy and range resolution by means of both numerical simulations and real data analysis. We show that the correlation between lag estimate errors is highly aο¬ected by the envelop of the modulated waveform for a given SNR, and that amplitude modulation reduces this correlation signiο¬cantly. We show that this lower correlation results in improved accuracy when measurements are integrated in range and lag. Using model order selection methods, we also derive a mathematical measure for estimating the fundamental range resolution supported by ISR measurements.
Moreover, we present parameter estimation results of several F-region experiments that we con-ducted with Arecibo radar in 2004, 2005, and 2006. In these experiments, long-pulse and various amplitude modulation codes were alternated in transmission to make a direct comparison between the estimation results possible. The on-oο¬ ratio of the transmitted waveform is represented as a major determining factor in the performance of an amplitude modulated code. However, slight dif-ferences in performances of AM pulses with the same on-oο¬ ratio necessitate consideration of other related factors, such as uniformity of the number of integrated heights for diο¬erent lags, or condi-tion number of convolucondi-tion matrices. In this work, we present close-to-optimal waveform design for ISR experiments using subset selection methods. Our method, however, provides preliminary results and its convergence properties have yet to be developed.
The developed ISR methodology can also be extended to other incoherent scatter radars such as Jicamarca radar in Peru or advanced modular incoherent scatter radar (AMISR) in Poker Flat, Alaska. Although the requirements of the plasma correlation measurements for these radars are quite diο¬erent from those for the Arecibo radar, the developed methodology can be expanded to meet the requirements.
1.3 Dissertation overview
This dissertation is organized, as follows: Chapter 2 presents an overview of the incoherent scatter theory, the incoherent scatter radar equation, and the concepts of range smearing and ambiguity. It also covers the conventional coding schemes and conventional incoherent scatter data process-ing techniques. The ο¬rst contribution of this work is the development of a hybrid technique that allows for eο¬cient estimation of plasma parameters from ISR measurements. In Chapter 3, a tech-nique is developed which combines regularization theory and deconvolution methods. Two diο¬erent regularization methods, quadratic and edge-preserving methods, are considered. A statistical in-terpretation of the proposed algorithm is given using statistical formulation. Numerical results are presented.
Another contribution, presented in Chapters 4, 5, and 6, is the development of a framework for determining the fundamental resolution, and optimal experiment design, in ISR measurements. We also compare the performances of long-pulse and amplitude modulation, which results in lower correlation in lag estimate errors. Numerical estimates of ACF lag variance reduction are presented for a long pulse and several amplitude modulated pulses when measurements are integrated in range and lag directions. Moreover, the improvement in range resolution for estimated parameters is presented for AM pulses using numerical simulation in Chapter 4. Chapter 5 describes several ISR experiments conducted using the radar at Arecibo observatory, and presents a compelling comparison between the performances of long pulse and AM pulses with diο¬erent on-oο¬ ratios, as well as AM pulses with the same on-oο¬ ratio. In Chapter 6, optimal design of the transmitted waveform in F-region ISR experiments is considered, and a practical algorithm is proposed for ο¬nding the optimal pulse conο¬guration.
Finally, Chapter 7 provides concluding remarks, summarizes the dissertationβs contributions, and discusses some future research directions.
CHAPTER 2
PRELIMINARIES
2.1 The basics of incoherent scattering of radio waves
The idea of incoherent scatter radars was initiated by W. Gordon in 1958. He suggested that if a powerful beam of radio waves with a frequency well above the plasma frequency was sent vertically through the ionosphere, an extremely small but still measurable amount of power would scatter back to the ground by small-scale random ο¬uctuation of electrons in the ionosphere. The backscattered signal could be roughly estimated as the sum of the powers scattered from each electron, as given by the Thompson electron cross-section [Kudeki, 2003; Nygren, 1996]. Since the electrons are randomly distributed in the ionosphere, they scatter incoherently, hence the name βincoherent scatter,β and the power scattered by each is summed to obtain the total signal. Moreover, owing to the thermal motions of the electrons, the scattered radiation would be subject to a Doppler broadening, with a frequency deviation of the order of (πΎπ /ππ)12/π0, where πΎ is the Boltzmann constant, π is the
temperature, ππ is the electron mass, and π0 is the radar wavelength.
The ο¬rst successful ISR experiment was performed by Bowles in 1958 using extremely high-powered equipment. His results indicated that the returned power was of the order of the power predicted by Gordon; however, the observed Doppler broadening was much smaller than what Gordon had anticipated. Diο¬erent groups of scientists investigated this discrepancy [Dougherty and
Farely, 1960, 1961; Farley, 1966; Hagfors, 1961, 1971]; and although they used diο¬erent theoretical
approaches, they all arrived at the following conclusion: Provided that the Debye length (the distance over which the inο¬uence of the electric ο¬eld of an individual charged particle is felt by the other charged particles inside the plasma [Bittencourt, 2004]) is suο¬ciently large, the ions play an important role in determining the Doppler spread, although it is the electrons that do the actual
scattering and that the eο¬ect of ions in the form of Coulomb collisions must be considered in the electron scattering.
In the next section, we describe the basic idea behind the incoherent scatter of the radio waves with pulsed incoherent scatter radars, and elaborate on the expression of the incoherent scatter spectrum, or equivalently its Fourier transform, auto-correlation function, as a function of iono-spheric parameters. We then demonstrate the eο¬ect of varying parameters on the resulting spectra or ACFs by a few examples.
2.1.1 Variation of the spectrum as a function of parameters
In the presence of a magnetic ο¬eld (B) in a plasma with Coulomb interactions and a beam direction away from perpendicular to B, the spectrum of the electron density ο¬uctuation, < β£ππ(k, π)β£2 >,
can be represented by < β£ππ(k, π)β£2 > = 2π0 β£π(π2β2 π+ π) + ππππ½(ππ)β£2 π π{π½(ππβ2πΆππ)} + β£π + πππ½(ππ)β£2 π π{π½(ππβ2πΆππ)} β£π(π2β2 π+ 1 + π) + πππ½(ππ) + ππππ½(ππ)β£2 (2.1)
where < . > denotes the expectation operation, π0 represents electron density, π is the electron-ion
temperature ratio ππ/ππ, βπ is the Debye length, and ππ represents the normalized velocity of the
particle π . The Gordyeve-type integral π½(ππ ) is described by
π½(ππ ) =
β« β
0 ππ‘π
βπππ‘πβ[π‘24 sin2πΌ+π2π 1 sin2(ππ π‘2 )cos2πΌ] (2.2)
where πΌ is the magnetic aspect angle (the complement of the angle between propagation vector k and magnetic ο¬led B), and π denotes the normalized gyrofrequency of the species π (Ξ©π ), i.e.
ππ = πβΞ©2πΆπ π (see [Kudeki, 2003] for a complete discussion).
In thermal equilibrium where ππ= ππ, the spectrum consists of two peaks close to the origin with
widths corresponding to ion thermal velocities. Therefore, the spectrum of a plasma has a small dip at the central frequency corresponding to no Doppler shift. In order to provide a qualitative picture of the eο¬ect of various parameters on the spectrum (and its Fourier transform equivalent, ACF), spectra (and ACFs) at various parameter combinations are plotted in Figure 2.1 (a) and (b). Plots
β200 β10 0 10 20 0.2 0.4 0.6 0.8 1 Frequency (kHz) β200 β10 0 10 20 0.2 0.4 0.6 0.8 1 Frequency (kHz) 0 100 200 300 400 500 β0.5 0 0.5 1 Lag (Β΅ s) T i = 500 K T i = 1000 K T i = 1500 K T i = 2000 K 0 100 200 300 400 500 β0.5 0 0.5 1 Lag (Β΅ s) Β΅ = 1 Β΅ = 1.5 Β΅ = 2 Β΅ = 2.5
Figure 2.1 Variation of the ISR spectrum and ACF as a function of ion temperature (left panels) and temperature ratio (right panels), respectively.
in the left panels show the variation of the spectrum and ACF as a function for ο¬xed temperature ratio (π = 2) and as a function of ion temperature, whereas plots in the right panels show the variation of the spectrum and ACF for ο¬xed ion temperature (ππ = 1000 K) and as a function of
temperature ratio. All plots have been generated for frequency of 430 MHz and a 100% oxygen ionosphere. As seen from the plots, for a ο¬xed π, the ion line becomes broadened as ion temperature increases. Moreover, the peak-to-valley ratio also increases with increasing temperature ratio with ion temperature ο¬xed.
In the next section, we describe how we can infer the plasma spectrum or ACF from an incoherent scatter radar.
2.2 Principles of pulsed radar operations, the soft-target radar
equation, and the ambiguity function
In order to introduce the soft-target radar equation, we ο¬rst provide a quick overview of the principles of pulsed radar ISR operation. For this purpose, let us denote the envelope and the length
T Sampling
Time range
r=c*t /2
t seconds
Figure 2.2 Transmission and reception scenario in a typical incoherent scatter radar. A pulse (modulated or unmodulated) of length π is transmitted through the ionosphere, and the return is sampled at πΏπ‘ intervals.
of the transmitted waveform in the radar experiment by π0(π‘) and π , respectively. Suppose the
pulse is a baseband waveform modulated by a sinusoidal wave of frequency π0. The corresponding
wavelength and wave vector will be π0 = 2πππ0 and k0 = 2ππ0Λk, respectively, where π is the speed of
light and Λk is the direction of wave transmission. The returned signal is then sampled with a period
πΏπ‘ (Figure 2.2) [Nikoukar et al., 2008].
The scattered signal from the electron density ο¬uctuations in the ionosphere can be represented by [Kudeki, 2003] π£(π‘) β β« rπ(r, π‘)π βπ2π0ππ 0(π‘ β2ππ )πr (2.3)
where π(r, π‘) represents the ο¬uctuating component of the plasma electron density at range r = πΛr along the direction Λr. The proportionality factor, π(r), describes all geometrical eο¬ects and can be written as:
π(r) = π ππππΊ(Λr)π2π π
0π2 (2.4)
where ππ β 2.8Γ10β15m is the classic electron radius, π πππ represents the radiation resistance of the
antenna, and πΊ(Λr) is the antenna gain along the direction Λr. Note that the received signal spectrum is centered at the carrier frequency, π0, instead of zero, as the antenna output is modulated by the
carrier signal. At the coherent detector, the input is mixed with the signal derived from a local oscillator centered at the carrier frequency and is low-pass ο¬ltered. This task brings the output signal to the baseband again, centered at zero frequency. The exponential term in (2.3) represents
this mixing operation.
Since there are many scattering electrons in the ionosphere, the scattered signals (or received voltages) can be treated as Gaussian random variables to a very good approximation, according to the central limit theorem [Papoulis, 1986]. Therefore, instead of working directly with voltage sam-ples, we model their joint statistics in the form of the auto-correlation function, < π£(π‘)π£(π‘ + π) >, where π represents the time lag and (.) denotes the conjugation operation, respectively.
From (2.3), the ACF of the received voltage can be expressed as
< π£(π‘)π£(π‘ + π) > β β« rβ² β« r < π(r, π‘)π(r β², π‘ + π) > π0(π‘ β2π π )π0(π‘ + π β 2πβ² π )πβ2ππ0(π β²βπ) πrπrβ² (2.5)
where rβ² = πβ²Λrβ² denotes the range πβ² along the direction Λrβ². Within the integrand above <
π(r, π‘)π(rβ², π‘ + π) > is the space-time ACF of the density ο¬uctuation π(r, π‘). With the
assump-tion that ο¬uctuaassump-tions have homogenous and staassump-tionary statistics which vanish rapidly with an increasing magnitude of x β‘ rβ²β r, we can proceed as
< π£(π‘)π£(π‘ + π) > β β« rπ0(π‘ β 2π π )π0(π‘ + π β 2π π )πr β« x< π(r, π‘)π(x + r, π‘ + π) > π β2πk0.xπx (2.6)
The inner integral represents a spatial Fourier transform, denoting the ACF of electron density ο¬uctuations at altitude π, time lag π, and the wave vector (k = β2k0), as follows:
π (k, π, π) β‘
β«
x< π(r, π‘)π(x + r, π‘ + π) > π
β2πk0.xπx (2.7)
Notice that this ACF of electron density ο¬uctuations is the same as the plasma ACF whose an-alytical expression (as a function of radar and plasma parameters) was already introduced in the previous section.
By inserting (2.7) into (2.6) we obtain [Kudeki, 2003]:
< π£(π‘)π£(π‘ + π) > β β« rπ0(π‘ β 2π π )π0(π‘ + π β 2π π )π (k, r, π)πr (2.8)
The above equation which demonstrates the relationship between the received voltage and the target in a statistical sense is called the soft-target radar equation and provides the necessary background for estimation of ionospheric parameters. Inspection of this equation brings us to the following two observations:
1. The incoherent scatter radar equation holds for each lag of ionospheric ACF independently from other time lags. Hence, to evaluate the plasma ACF at time lag π, we need to consider only the same lag of the received signal ACF.
2. Using incoherent scatter radar, it is not possible to achieve point estimations of the plasma ACF. Instead, weighted averages of this quantity over a ο¬nite-range interval are obtained. The weights are merely dependent on the modulated waveform and vary from lag to lag. The functions describing the averaging operation on the underlying plasma ACF are called soft-target radar ambiguity functions, (pπ(π‘) = π0(π‘)π0(π‘ + π)). These functions essentially indicate
that the signal coming from range π contains information from several altitudes, where the altitude interval is equal to the distance covered by the product of the waveform and its shifted version. Note that the zeroth lag contains the most range smearing of information. As we move to farther lags, this altitude interval decays as the common part between the pulse and its shifted version diminishes.
Notice that in a more general case, the ambiguity functions are dependent on both range vari-ables and time lag, as developed in Lehtinen and Huuskonen [1996] and Holt et al. [1992]. This dependence on time lag is caused by a non-ideal receiver, whose impulse response contains the time average of previous samples. For now, however, we assume the receiver has a suο¬ciently narrow impulse response, and as such the ambiguity in the lag direction is negligible. The expansion of the radar ambiguity function to the 2-dimensional case is considered in Section 3.3.
Also note that our deο¬nition of the soft-target ambiguity function diο¬ers from that of hard-target radar applications where it is deο¬ned as (see for example [Blahut, 2004])
π(π, π) =
β«
π0(π‘)π0(π‘ + π)πβπ2πππ‘ππ‘ (2.9)
variable, π, and doppler resolution, π. The hard-target radar equation is used as a measure of detectability of two hard targets with separation in range and velocity. The diο¬erence between the two deο¬nitions is due to diο¬erent ο¬lters being used in radar receivers. In a hard-target application, normally a matched ο¬lter is used, whereas in soft-target radars, a receiver with a boxcar impulse response of short length is often utilized. Through the rest of this work, we use the term βambiguity functionβ to refer to soft-target radar applications only.
2.3 Conventional coding schemes in incoherent scatter
experiments
In this section, we describe the conventional coding schemes that are used in typical F-region incoherent scatter experiments. These modulation techniques include long-pulse and multi-pulse techniques, and alternating codes.
2.3.1 Long-pulse technique
In long-pulse transmission, let π0(π‘) = 1 for0 β€ π‘ < π , where π represents the length of the pulse.
In this case, the one-dimensional ambiguity function ππ(π‘) can be represented as follows:
ππ(π‘) = ⧠ ⨠ β© 1 if 0 β€ π‘ < π β π 0 otherwise
In other words, the ambiguity function is a square pulse whose width decreases as the time lag increases. Therefore, in long-pulse transmission, it is not possible to retrieve point estimations of the plasma ACF directly from the data. To do so, one needs to account for the range ambiguity and has to devise methods that are described in the next section as well as the next chapter.
2.3.2 Multi-pulse technique
The class of multi-pulse codes pioneered by Farley [1972] is a set of amplitude modulated pulses, which can achieve point estimation of ionospheric ACF (nonzero) lag estimates at individual
alti-tudes without ambiguity. A classical pulse-code modulation meets the following conditions: 1. The code consists of π pulses of equal duration (bit length).
2. The distance of any pair of pulses is an integer multiple of the smallest (basic) inter-pulse distance (π), and all distances are diο¬erent. The lag resolution is set to be equal to the basic inter-pulse distance.
Although the multi-pulse technique can provide the ionospheric ACF estimation at individual altitudes, its major drawback is its low duty cycle, and thus its high sensitivity to the background noise level. For example, a six-pulse code with a bit length of 17 πs, will have a total length of 306 πs (see Figure 2.3). This conο¬guration yields a duty cycle of only 33%. As the number of short pulses increases, the duty cycle decreases. For example, a 7-pulse code has a duty cycle of 26% only. In order to facilitate the full radar duty cycle, and therefore to reduce the sensitivity of the method, the technique is usually implemented in the form of interlaced codes. In an experiment with these interlaced codes, the transmitted frequency is changed rapidly so that gaps are not left in the transmission. Another drawback of the multi-pulse codes is that the zero lag measured by this technique is normally discarded. The reason is that the shape of the ambiguity function for the zeroth lag diο¬ers from that of others, and its corresponding scattering volume consists of a number of individual volumes instead of a single volume. The use of multi-pulse zero lag data, however, can be used to improve incoherent scatter radar power proο¬le accuracy, as suggested by Lehtinen
and Huuskonen [1986]. 0 50 100 150 200 250 300 β1 β0.5 0 0.5 1 1.5 2 time (Β΅s) 6βpulse
Figure 2.3 Six-pulse technique. The baud is 17 πs and the pulses are located at 0, 17, 136, 187, 221, and 289 πs.
2.3.3 Alternating codes
Alternating codes are a series of phase-coded pulses which are transmitted one at a time through the ionosphere, and once all the pulses are transmitted, the cycle starts over again. Each single pulse consists of a combination of elementary pulses (bauds) with signs Β±1. These signs are changed from pulse to pulse in a way that the ambiguity function due to all of these pulses has a single peak on a zero background.
The working principle of this coding method is shown in Figure 2.4. The matrix on the left shows a set of four phase-coded pulses (scan count), where each pulse consists of four elementary pulses with varying signs. The matrix marked by π1 denotes the sign of the ambiguity functions
of the ο¬rst lag. This is obtained by multiplying each column of the matrix by its adjacent one. The matrices in the square region are obtained by further multiplying each column of π1 by the
ο¬rst through third column. According to the ο¬gure, all but one of the the columns of each matrix sum to zero. Recalling that each column in the original envelope of the pulses corresponds to one particular altitude, one can easily see that the undesired signals from the other two altitudes have been canceled out, while the signal from the altitude of interest is preserved. The same argument holds for other lags of the ambiguity function, π2 and π3. The only diο¬erence is that as we
consider higher lags, the number of columns (altitudes) decreases, until a single column is left for
π3.
In their work, Lehtinen and Huuskonen [1996] have presented an analytical solution to the prob-lem of choosing signs of every bit in all scans such that the whole set shows the desirable properties of single peaks for the ambiguity functions. For this purpose, they exploit one of the properties of the Hadamard matrices, which is orthogonality of rows [Harwit and Sloane, 1979]. As a result, certain arrangements of the rows or columns achieve the analytical cancelations. They have also shown that in situations where the received impulse response has a ο¬nite width, as opposed to a
πΏ-like response, the required number of scans in each cycle is twice the number of elementary pulses
in each scan. This requirement reduces the time resolution of the measurements by a factor of 1/ππ, where ππ is the number of scan counts in one complete cycle.
Although alternating codes guarantee the exact cancelation of the signal contribution from un-wanted regions, they fail to remove the covariances between the lag-estimate errors in an ISR
lses W1
-Figure 2.4 Working principle of the alternating codes. Ambiguity functions are shown for all possible three lags (ππ, where π is the lag number). The matrices next to each ambiguity matrix
are formed by multiplying each column (altitude) of this matrix by other columns (altitudes). This multiplication results in the canceled contributions from these columns, while preserving the signal from the ο¬rst altitude.
experiment. Covariance calculations, however, require the lag proο¬le matrices from all the codes, not only the ο¬nal results. This requirement makes the method even more computationally expen-sive, but makes it more eο¬cient when the SNR is very low and the lag estimate errors are nearly independent.
2.4 Current methodologies in incoherent scatter inversion
Let us assume that one of the above-mentioned coding schemes has been utilized in an ISR experi-ment, and estimates of the received signal ACF lags (or plasma ACF lags provided that multi-pulse or alternating codes are used) are available. In this section we provide an overview of the sta-tistical framework for inversion of incoherent scatter radar measurements to obtain estimates of ionospheric parameters. We formulate the problem of inversion in terms of the maximum likelihood (ML) principle. Next, we explain conventional inversion methodologies, height-by-height analysis and the full-proο¬le technique.
function
g(a)
m (ACF or spectrum)
+
ππ (noise)
a(Te, Ti, Ne, p, ...) g: nonlinear incoherent scatter
Figure 2.5 Incoherent scatter process as a hypothetical system, where π which is a vector of plasma parameters is the input of the system, π, the plasma spectrum or ACF, is the output, and the nonlinear function of incoherent scatter, which relates the parameters to plasma spectrum, is denoted by π(.).
2.4.1 Statistical framework for incoherent scatter inversion
Let us consider the incoherent scatter process as a system, shown in Figure 2.5 and denoted by g(.), in which the ionospheric state parameters form the input, a, and the spectrum or ACF of the received signal forms the output, m. The relationship between the measurements and the desired parameters can be written as
m = π(a) + π (2.10)
where π denotes the noise of the system and in the most general case is signal dependent. However, it gains Gaussian characteristics as the measurements are integrated over many transmissions. Therefore, it can be well described by its mean, π =< π >, and covariance matrix βπ =< πππ >.
A nonzero mean would indicate biases, and without loss of generality can be considered as zero. The conditional probability density function now can be expressed by
π(mβ£a) = 1 (2π)π/2β£Ξ£πβ£12 exp ( β12(m β π(a)πΞ£β1π (m β π(a))) ) (2.11)
where π is the number of data points. The joint probability density function of m and a can be described as π(m, a) = π(a)π(mβ£a) βΌ π(a) 1 (2π)π/2β£Ξ£πβ£12 exp ( β12(m β π(a))πΞ£β1π (m β π(a)) ) (2.12)
Let us now suppose that the a priori density of the parameters (π(a)) is approximately constant in the regions where the conditional probability density function is signiο¬cantly diο¬erent from zero.
The ML estimate is then equivalent to minimizing the quadratic form in the exponent in (2.12), i.e. π2 = β1
2(m β π(a)πΞ£β1π (m β π(a))). Thus, under the assumption of Gaussian errors, the
ML estimate can be obtained via the quadratic or least-squares optimization procedures. If the matrix βπ is diagonal, this quadratic form is reduced to the sum of the diο¬erences between the components of the predicted data, π(a), and the components of the measurements weighted by the inverses of the diagonal elements ofβπ. On the other hand, in situations whereβπ is not diagonal, the quadratic form of (2.12) deο¬nes a generalized least-squares estimation method.
Linear approximation
Let us suppose that the correct values of the parameters are a0 and the predictive measurements
are denoted by m0 = π(a0). If the errors are small, the solution to the least-squares problem is very
close to a0, and as such we can use the Taylor expansion of the ο¬rst degree to state the nonlinear
quadratic form of (2.12)
m β m0 = π(a) β π(a0) + π = A(a β a0) + π (2.13)
where Aππ = βgπ/βaπ is the partial derivative of the incoherent scatter function with respect to
plasma parameters. Thus the errors derived are only valid in the limit of small ο¬uctuations in the measurements. In practice this condition can be satisο¬ed by using longer integration times. Replacing a β a0 and m β m0 by a and m, respectively, yields
m = Aa + π (2.14)
The linear formulation is necessary and useful because the likelihood or posteriori distribution, and thus the solution, can be expressed in analytical forms. Here, we focus on the ML estimate where, with the assumption of the Gaussian noise, the likelihood can be described as
π(mβ£a) βΌ exp
(
β12(m β Aa)πΞ£β1π (m β Aa) )
(2.15)
In order to bring the expression for the transitional density to a standard quadratic form, we ο¬rst consider the Cholesky decomposition of the inverse noise covariance matrix as Ξ£β1
β
πis positive semi-deο¬nite). We proceed by simpliο¬cation of (2.15) and using the following identity
B(BTB)β1BT = I π(mβ£a) βΌ exp ( β12(m β Aa)πDπD(m β Aa)) = exp (
β12(Dm β DAa)π(DA[(DA)T(DA)]β1(DA)T)(Dm β DAa) ) = exp ( β12(AπΞ£β1π m β AπΞ£β1π Aa)π(AπΞ£β1π A)β1(AπΞ£β1π m β AπΞ£β1π Aa) ) (2.16)
Factoring the term Q = AπΞ£β1
π A and further simpliο¬cation yields
π(mβ£a) βΌ exp ( β12(Qβ1AπΞ£β1 π m β a)πQ(Qβ1AπΞ£β1π m β a) ) (2.17)
The ML estimate, which minimizes the likelihood, can be analytically expressed as
aππΏ = Qβ1AΞ£β1π m (2.18)
where its error covariance matrix is given by
Qβ1 =(AπΞ£β1π A)β1 (2.19)
The matrix Q is called the Fisher information matrix [Poor, 1994]. Clearly, all the measurement components make certain contribution to the Fisher information matrix.
The statistical inversion as described above is applicable to ISR inversion only in situations where the plasma ACF is readily available, by utilizing multi-pulse or alternating codes, for example. In cases where the modulation does not provide ambiguity-free measurements, one has to account for the eο¬ect of ambiguity in some way. Below, we review two of such methods, namely, height-by-height analysis and the full-proο¬le technique. In the next chapter, we introduce a hybrid technique for the inversion of ISR measurements.
r1 r3 r2 T lower lags Higher lags Pulse length
Figure 2.6 Range-time diagram in a long-pulse transmission. The ο¬tting is performed on the data from altitudes π1, π2, and π3 only. The number of altitudes contributing to the signal from π1
decreases as we move to higher lags.
2.4.2 Height-by-height analysis
The height-by-height analysis has been traditionally used in the analysis of long-pulse modulation in which measurements suο¬er from range ambiguity. The method relies on the assumption that parameters do not vary over a range-gate (an altitude interval which, in the most common case, is equal to the distance covered by the transmitted pulse). One then ο¬ts the plasma parameters by least-squares methods to the chosen measured ACF using the theory of incoherent scatter at individual altitudes with range-gate separation (altitudes π1, π2, and π3 in Figure 2.6 separated by
range-gates).
Prior to least-squares estimation, however, a number of modiο¬cations should be applied to the received signal ACF. These changes are required to compensate for the range smearing, and without them the shape of the ACF becomes so distorted that the results of least-squares ο¬tting become infeasible. One major reason is that the ACF of the received signal is the average of the ACF from several altitudes, and due to the nonlinear nature of the problem, the estimated parameters need not be close to the averages of the plasma parameters within the scattering volume.
Triangular weighting is a method which compensates the eο¬ect of range smearing and is based on weighing diο¬erent lags of the measured ACF to make the eο¬ective weight of all lags equal. With a transmitted pulse of π s, the zeroth lag of the received signal ACF is the superposition of signals coming from π diο¬erent altitudes separated by 1 s, whereas the (π β 1)th lag measured signal ACF results from the signal coming from only one altitude. Thus, in eο¬ect, the range smearing weighs the zeroth lag π times higher than the (π β 1)th lag, and the task of triangular weighting is to
undo this weighing process.
The hight-by-height analysis is also called range-gate analysis, as the data is processed at each range-gate separately. The technique, although simple and fast, suο¬ers from the underlying un-realistic assumption along with the coarse resolution of estimated parameter proο¬les. Introduced biases in parameter proο¬les have also been reported in electron density in regions below the peak height [Lehtinen and Huuskonen, 1996; Holt et al., 1992].
2.4.3 Full-proο¬le analysis
In this section, we ο¬rst formalize a simple version of the idea of the full-proο¬le analysis technique. We next explain the issues regarding practical implementations of such methods.
As opposed to the hight-by-height technique where only a few parameters are estimated using a nonlinear optimization procedure, in the full-proο¬le method the unknown is a much longer vector giving the plasma variables at ionospheric-height grid points. This vector can be constructed from the elements of the following matrix
a = β‘ β’ β’ β’ β’ β’ β’ β’ β£ ππ(π1) ππ(π1) ππ(π1) π(π1) β β β ππ(π2) ππ(π2) ππ(π2) π(π2) β β β ... ... ... ... ... ππ(ππ) ππ(ππ) ππ(ππ) π(ππ) β β β β€ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β¦ (2.20)
where ππ represent diο¬erent grid points for π = 1, β β β , π. ππ, ππ, ππ and π denote the electron
temperature, ion temperature, electron density, and composition, respectively. The method starts with initial estimates for all parameters at all altitudes. It then computes the theoretical ACF based on the parameters, imposes the range smearing (by taking into account the pulse shape), and makes a weighted comparison to the data, where weights are obtained from the data error covariances. The technique then proceeds by updating the parameter values so that the weighted diο¬erence between the actual data and the theoretical data is minimized.
The formulation above is in principle useful, but it is not the most eο¬cient way to perform the full-proο¬le analysis. The basic reason is that in order for the model to be accurate enough, the
p
Figure 2.7 Flowchart of the eο¬ective implementation of the full-proο¬le technique.
whose spacing is equal to the receiver sampling period. Unfortunately, considering such a dense grid increases the number of parameters of the optimization search space, which is rather costly in terms of computational power requirements, and it will be useful to develop more eο¬cient ways for analysis.
Attempts have been made to develop eο¬ective implementation of the full-proο¬le technique based on a hierarchy of grids and interpolation methods between the grids [Lehtinen et al., 1996; Holt
et al., 1992]. The diο¬erence in interpolation methods makes the distinction among these various
implementations. Holt et al. [1992] suggest Spline interpolation whereas Lehtinen et al. [1996] use Lagrange and linear interpolations throughout a hierarchy of grids. Figure 2.7 represents the ο¬owchart of such methods.
Although performing the interpolation reduces the computational cost, the technique remains computationally expensive. One basic reason is due to the requirements of nonlinear computational techniques. One requirement is the computation of the derivative of the minimization function with respect to search variables (parameters in the coarse grid). Even though analytical expressions of
the derivatives of the lags of the theoretical auto-correlation function with respect to the ionospheric parameters are available, they cannot be exploited in the optimization procedures of the full-proο¬le techniques. Therefore, forward diο¬erence, which slows down the speed of computation, is the only method that can be used for derivative calculation.
The full-proο¬le technique is optimal in the sense that all the available information, such as the complete model of the ambiguity function and the full error covariance matrix, can be incorporated in the analysis. However, the computational cost of the method limits its routine implementation. In the next chapter, we develop the theory of a new hybrid inversion technique which aims at obtaining estimates that are close to optimal at a fraction of the usual computational cost. The technique is based on a correction to the eο¬ect of the transmitted waveform on the ACF lag proο¬les through a deconvolution process, and subsequent estimation of parameters from the plasma ACF at individual altitudes.
CHAPTER 3
PROPOSED TECHNIQUE FOR INVERSION OF
INCOHERENT SCATTER MEASUREMENTS
The goal of this chapter is to introduce an eο¬cient near-optimal technique for estimation of iono-spheric parameters from incoherent scatter measurements. The technique is based on a correction to the eο¬ect of the transmitted waveform on the ACF lag proο¬les through a deconvolution process, and subsequent estimation of parameters from the plasma ACF at individual altitudes. In this work, we focus on long-pulse measurements and investigate the performance of the hybrid tech-nique on simulated data (this chapter) and actual incoherent scatter radar measurements (Chapter 5).
In development of the hybrid technique, we exploit both the simplicity of height-by-height analysis and the accuracy of full-proο¬le methods through considering the full model of ambiguity to present a simple, fast, and accurate method without the limitations of each of these common, currently used methods. For this purpose, we revisit the forward (direct) model of the incoherent scatter process as the form of the 1-dimensional convolution of the ionospheric ACF across range at each lag. We then present the matrix framework of convolution. The inversion technique is then formulated as the deconvolution of the lag proο¬les followed by the minimization of a least-squares cost function. Two diο¬erent regularization methods for performing the deconvolution are discussed. Extension of the method to 2-dimensions is also discussed. The materials presented in Sections 3.1, 3.2, and 3.4.1 follow closely the description of the procedure reported by Nikoukar et al. [2008].
3.1 Forward model
This section describes the discretization of the radar equation (Chapter 2) and the corresponding matrix framework of the problem. Let πβ² = ππ‘
2 β π and rewrite the radar equation as
< π£(π‘)π£(π‘ + π) > β β« π0(2π β² π )π0(π + 2πβ² π )π (k, ππ‘ 2 β πβ², π)ππβ² (3.1) where ππ‘
2 refers to the altitude from which the signal is received (reference altitude). We discretize
and approximate both sides of the above equation by a Reimann sum, as below
< π£(2πΞππ )π£(2πΞππ + π) > β ββ
π=ββ
π0(2πΞππ )π0(π +2πΞππ )π (k, (π β π)Ξπ, π)Ξπ βπ (3.2)
where Ξπ = sampling period2 (πΏπ‘)Γπ, and π and π are indexing terms. Note that π can range from
π β π π
2Ξπ = π β πΏπ‘π = π β ππ to π, where ππ is the number of altitudes in each range-gate. This
discretization is necessary since eventually the data and the resolution of the ο¬nal parameter grid, Ξπ, are restricted by the sampling time of the receiver.
Following the notation introduced in Chapter 2, we replace π0(2πΞππ )π0(π +2πΞππ ) in (3.2) by pπ(π)
to obtain < π£(2πΞππ )π£(2πΞππ + π) > β π β π=πβππ pπ(π)π (k, (π β π)Ξπ, π) βπ (3.3)
Note that the above equation describes the relationship between the input and output of a linear time-invariant system, where the plasma ACF at certain time lag, π, is the input, and the measured voltage ACF at the same time lag is the output. Moreover, the impulse response of the system is expressed as
pπ(π) = π0(2πΞππ )π0(π +2πΞππ ) for π = 0, 1, β β β , ππ and π = 0, Ξπ, β β β , π (3.4)
where Ξπ represents the time lag increment. Notice that pπ(π) preserves its form over diο¬erent
range-gates. Therefore, the index π can be considered to vary over a range-gate only, rather than being dependent on particular altitude π.
The above scheme can be visualized as incorporating the eο¬ects introduced by the transmitted pulse into a number of low-pass ο¬lters, each of which aο¬ects only one lag proο¬le. These eο¬ects take place in the form of weighted averaging of the lag proο¬les, where the weights are determined by the ambiguity function at each time lag π. Thus, the ο¬lter shape at each lag in the time domain is determined by the product of the pulse shape and its shifted version.
Once we envision range smearing as a ο¬ltering system, we can describe the relationship between its input and output, i.e. the plasma ACF at individual altitudes and the measured voltages ACF, as a convolution process; that is,
< π£(2πΞππ )π£(2πΞππ + π) > = pπ(π) β π (k, πΞπ, π) for βπ and π = 0, Ξπ, β β β , π (3.5)
where * represents the convolution operation. With expansion of (3.5) with respect to all possible values of π‘ and ο¬xed value of π, we can represent this convolution relationship in a matrix framework as yπ = Pπmπ for βπ (3.6) where yπ = β‘ β’ β’ β’ β’ β’ β’ β£ π£(π2Ξπ π )π£(π2Ξππ + π)β£π=ππ π£(π2Ξπ π )π£(π2Ξππ + π)β£π=ππ +1 ... π£(π2Ξπ π )π£(π2Ξππ + π)β£π=π β€ β₯ β₯ β₯ β₯ β₯ β₯ β¦ , mπ = β‘ β’ β’ β’ β’ β’ β’ β£ π (k, πΞπ, π)β£π=1 π (k, πΞπ, π)β£π=2 ... π (k, πΞπ, π)β£π=π β€ β₯ β₯ β₯ β₯ β₯ β₯ β¦ Pπ = β‘ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β£ pπ(π)β£π=ππ β β β pπ(π)β£π=1 πβπz}|{π 0 0 pπ(π)β£π=ππ β β β pπ(π)β£π=1 πβ(ππ +1) z}|{ 0 ... πβπz}|{π 0 pπ(π)β£π=ππ β β β pπ(π)β£π=1 β€ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β¦
Notice that yπ is the vector of measurement ACF at time lag π from all altitudes (lag proο¬les).
Similarly, mπ includes the altitude proο¬le (for π altitudes) of the true plasma ACF at the same
A more complete model of ISR measurement can be rewritten as
yπ = Pπmπ+ ππ βπ (3.7)
where ππ represents the measurement error at time lag π. In general this error is signal-dependent
especially when the backscattered signals are strong due to high electron densities or high transmit-ted power. Because of this dependance, it is not possible to obtain an estimate of the lag proο¬les using only one single transmission. Instead, the measured ACFs should be added for several pulse transmissions to improve statistical accuracy (integration). The data is typically integrated for several seconds. Although signal-dependent, the noise gains Gaussian characteristics as signal is integrated over many pulses (according to the central limit theorem). Therefore, it can be well described by its mean, π =< ππ >, and covariance matrix, Ξ£ππ =< πππππ >. The covariance matrix will be diagonal if the errors in diο¬erent lags are independent. Otherwise it will have nonzero oο¬-diagonal elements.
3.2 Inverse model
In the previous sections we established the relationship between the plasma ACF at individual altitudes and the received voltage ACF as a convolution process, where the shape of the convolving function is dependent on the pulse envelope as well as the time lag values. We exploit this property in our proposed inversion method, a detailed description of which is presented in the following sections.
3.2.1 Deconvolution
The major motivation for the deconvolution of the lag proο¬les is to remove the range smearing from the measured signal ACF and obtain the plasma ACF at single altitudes. The elimination of range ambiguity allows us to use analytical derivatives of the theoretical ACF lags with respect to ionospheric parameters, as opposed to forward diο¬erences, in least-squares optimization algorithms and, hence, to reduce the computational cost signiο¬cantly. Furthermore, when using deconvolution methods, one does not require the imposition of unrealistic assumptions on the parameter proο¬les,
such as stationarity, for the whole range-gate as is the case in height-by-height analysis.
Regularization
To perform the deconvolution task, we can use methods such as inverse ο¬ltering and least-squares analysis [Blahut, 2004; Lagendijk and Biemond, 2000; Karl, 2000]. These methods, although straightforward and easy to implement, suο¬er from a common drawback, which is the instabil-ity of the solution in the face of perturbations to data. This major drawback raises the need for regularization. Through regularization, we impose a priori knowledge about the underlying process to stabilize the solution in the presence of noise and to permit the identiο¬cation of physically reason-able estimates of parameters of interest. A regularization method can be considered as a modiο¬ed least-squares technique, where the modiο¬cations appear in the form of additional constraints to the residual norm deο¬ned in (3.8) as side constraint norms. More precisely, we can represent the regularized estimate as the solution to the following minimization problem
Λ mπ,πππ= arg minm π ( β£β£yπβ Pπmπβ£β£2Ξ£β1 ππ + β π πππΆπ(mπ) ) βπ (3.8)
where ππ and πΆπ are the πth regularization parameter and regularization functional, respectively.
The ο¬rst term controls data ο¬delity (i.e. how closely the solution ο¬ts the data), whereas the second term (the regularization term) controls how well the solution matches our prior knowledge. The role of the regularization parameter can be viewed as controlling the trade-oο¬ between the impact of data and the impact of a priori knowledge on the solution.
In what follows we introduce two methods of regularization which use diο¬erent side functionals.
A: Tikhonov regularization The most common regularization method is the Tikhonov regu-larization with a quadratic functional [Karl, 2000; Demoment, 1989]. The general expression for the Tikhonov method is
Λ mπ,π ππβππππ£= arg minm π ( β£β£yπ β Pπmπβ£β£2Ξ£β1 ππ + π 2β£β£Lm πβ£β£2 ) βπ (3.9)
enforces a roughness penalty and, hence, a smoothness constraint. As an example, the discretized ο¬rst-order gradient operator can be represented as
L = β‘ β’ β’ β’ β’ β’ β’ β’ β£ 1 β1 0 β β β 0 1 β1 0 β β β ... ... ... ... ... 0 β β β 1 β1 β€ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β¦ (3.10)
Note that β£β£zβ£β£ππ = (βππ§ππ)1/π. In this case β£β£Lmπβ£β£2 is a measure of the variability of the estimate.
Therefore, the overall functionality of the method can be visualized as penalizing large gradients of the plasma ACF lag proο¬les, resulting in smoother lag proο¬les where the degree of smoothness depends on the value of the regularization parameter.
The solution to the minimization in (3.8), i.e. the Tikhonov regularized estimate, can be obtained as the solution to the following set of equations
(PππΞ£β1ππ Pπ+ π2LπL) Λmπ = PππΞ£β1ππ yπ βπ (3.11)
B: Total variation (TV) regularization One drawback associated with Tikhonov regular-ization is that it severely penalizes the sharp gradients in ACF lag proο¬les; therefore, if there is a natural sharp gradient in the electron density proο¬le, for example, it will not be recovered by the Tikhonov method.
Total variation regularization is a nonlinear technique that tries to preserve sharp gradients in proο¬les [Karl, 2000; Vogel and Oman, 1996]. The general expression for the TV method is
Λ mπ,π π = arg minm π ( β£β£yπ β Pπmπβ£β£2Ξ£β1 ππ + β£β£Lmπβ£β£1 ) βπ (3.12)
The β1norm used in the TV technique does not penalize the sharp edges in lag proο¬les as severely
as the quadratic norm used in the Tikhonov method. Thus it is well suited for situations where the plasma parameter altitude proο¬les contain sharp gradients.