Rochester Institute of Technology
RIT Scholar Works
Theses
Thesis/Dissertation Collections
8-1-1994
Blind equalization
Daniel Diguele
Follow this and additional works at:
http://scholarworks.rit.edu/theses
This Thesis is brought to you for free and open access by the Thesis/Dissertation Collections at RIT Scholar Works. It has been accepted for inclusion
in Theses by an authorized administrator of RIT Scholar Works. For more information, please contact
.
Recommended Citation
Approved by:
BLIND EQUALIZATION
by
Daniel Diguele
A Thesis Submitted
in
Partial Fulfillment
of the
Requirements for the Degree of
MASTER OF SCIENCE
in
Electrical Engineering
Prof. S. Dianat (Thesis Advisor)
Prof.
J.
Delorenzo
Prof. S. W. Mclaughlin
Prof. R. Unnikrishnan (Department Head)
DEPARTMENT OF ELECTRICAL ENGINEERING
COLLEGE OF ENGINEERING
ROCHESTER INSTITUTE OF TECHNOLOGY
ROCHESTER, NEW YORK
Abstract
An
equalizeris
an adaptivefilter
that
compensatesfor
the
non-idealcharacteristics of a communicationchannel
by
processing the
received signal.The
adaptive algorithm searchesfor
the
inverse impulse
response ofthe
channel,
andit
requiresknowledge
ofatraining
sequence,
in
orderto
generatean error signal
necessary
for
the
adaptive process.There
arepracticalsituationswhere
it
wouldbe
highly
desirable
to
achievecomplete adaptationwithout
the
use of atraining
sequence,
hence
the the term
"blind". Examples
ofthese
situations aremultipointdata
networks,
high-capacity
line-of-sight
digital
radio,
andreflection seismology.A
blind
adaptive algorithmhas
been
developed,
based
on simplified equalization criteria.These
criteria arethat the
second- and
fourth-order
moments ofthe
input
and output sequencesareequalized.
The
algorithmis
entirely
driven
by
statistics, only requiring
knowledge
ofthe
variance ofthe
input
signal.Because
ofthe
insensitivity
ofhigher-order
statisticsto
Gaussian
processes, the
algorithm performswellwhen additive white
Gaussian
noiseis
presentin the
channel.Simulations
arepresented
in
whichthe
newblind
equalizerdeveloped
is
comparedto
otherAcknowledgements
I
wouldlike
to
expressmy
appreciationofDr.
S.
Dianat,
my
Thesis
Advisor,
who oversawthis
projectfrom
the
beginning. Withouth
his
keen
insight
and guidanceI
wouldhave
neverfinished
this
work,
andI
amtruly
grateful
for
his
making time
for
mein
his
very
busy
schedule.I
wouldalsolike
to thank
Dr.
M. R.
Raghuveer for
the opportunity
he
gave meto
startstudying
communications channels and adaptive equalization under
his
guidance.Table
of
Contents
Page
Abstract
ii
Acknowledgements
iii
Table
ofContents
iv
Chapter
1
:Introduction
1
Chapter 2:
Adaptive
Equalization
4
Intersymbol
Interference
5
Linear
Transversal Equalizers
9
Zero-forcing
criterion10
Mean-Square-Error
criterion12
The
LMS
algorithm15
Decision-directed
equalization18
Decision-Feedback Equalizers
19
Chapter 3:
Higher-Order
Statistics
21
Definitions:
Moments
and cumulantsofstochastic signals22
Moments
and cumulants ofstochastic processes ...24
Properties
ofmoments and cumulants28
Equalization
withhigher-order
statistics30
The
Bussgang
deconvolution
techniques
31
Special
cases ofthe
Bussgang
algorithm:The
Sato
algorithm32
Page
A
simplified set ofconditionsfor
equalization36
Constrained
criterion38
Unconstrained
criteria42
Chapter
4
:Blind
equalization46
Chapter
5
:Simulations
and results55
Case
1
:One-pole
channel57
Case
2
:One-pole
channel with additive colored noise63
Case
3
:One-pole
time-varying
channel66
Case
4
:Two-pole
channel69
Case
5:
Three-pole
channel78
Case
6
:Three-pole,
two-zero
channel82
Conclusions
86
References
89
Chapter
1
Introduction
Considerable
efforthas been devoted
in the
pastthree
decades
to the
study
ofdata-transmission
systemswhichmake efficientuse ofthe
availablepower andchannel
bandwidth.
Increasingly,
werely
oncomputercommunications
for
transmission
of vastamounts ofdata. The
needfor high
speed
data
transmissions
overanalog telephone
channelshas
primarily
been
met
by
the
appearance offast
modems whichcarry
digital
data
overthese
voice-bandwidthchannels.
Analog
channelsdeliver distorted
versions oftheir
input
signals.The
transmission
ofdigital
data
over such channelsis limited
by
the
non-idealtransformations
performedby
the
channel onthe
signalsbeing
transmitted.
For
bandwidth-limited
channels(such
asvoice-gradetelephone channels), the
chief
determining
factor
in the
design
ofhigh-speed
transmission
systemsis
Intersymbol
Interference
(ISI),
whichis
causedby
time-dispersion
in the
transmit
filter,
the transmission medium,
andthe
receivefilter. Other
limiting
factors
onthe
channelperformance arethe
possible additions ofbackground
thermal noise,
impulse
noise,
and channelfading.
Equalization
dates
back
to the
use ofloading
coilsto
improve
the
characteristics of
twisted-pair
telephone
cables.An
equalizer compensatesfor
receivedsignal.
In
mostcases, the
channel characteristics are notknown
beforehand,
sothe
equalizerin
fact
consists of anadaptivefilter
that
searchesfor
the
inverse impulse
response ofthe
channel.An
adaptivefiltering
algorithmrequires
knowledge
ofatraining
sequencein
orderto
form
an error signalnecessary
for
the
adaptive process.Since
the transmitter
andthe
receiver areusually
physically separated, there
aretwo
common waysto
generate a replica ofthe
desired
response atthe
receiver .One
suchway
is
by
using
atraining
sequenceknown
to
both
the
transmitter
andthe
receiver.A
decision-directed
method can alsobe
used whichdoes
not requireatraining
sequence,
provided a good replica of
the transmitted
sequenceis
being
produced atthe
output of a
decision device
atthe
receiver.There
arepracticalsituations whereit
wouldbe
highly
desirable
to
achievecomplete adaptation without
the
use of atraining
sequence.In
this
context,
blind
deconvolution
algorithmshave
received much attentionrecently.Blind
equalizers are adaptivefiltering
algorithmsdesigned
suchthat
they
do
not need an externally-supplied
desired
responseto
generate an errorsignal.Instead,
an estimate ofthe
training
sequenceis
generatedby
applying
a nonlinear
transformation
onthe
data
sequencespassing through the
channel.One
example of a systemin
whichblind
equalization canbe
very
helpful
is
in
a multipointdata
network,
where commonproblemsinvolve
severevariations
in
channelcharacteristics,
orsimply that
a receiver was notpowered on
during
initial
synchronizationofthe
network.In
aheavily
loaded
large
multipointnetwork,
data
throughput
is increased
andthe
burden
ofmonitoring the
networkis
easedif
someform
ofblind
equalizationis built
into
radio,
wheremultipathfading
is
abig
problem;
and reflectionseismology,
wherethe traditional
method oflinear-predictive
deconvolution
ignores
valuablephase
information
containedin the
reflectionseismogram,
unlikethe
blind
deconvolution
method^10\
In
many
practical situationsthe
channelimpulse
responsemany
notbe
minimum-phase
(i.e.
notallpoles andzeros areinside
the
unitcircle).Examples
ofsuch non-minimumphase systemsinclude telephone
channelsand
fading
radio channels.Equalization
of suchchannels requiresthe
identification
ofboth
the
magnitude andthe
phase ofthe
system'stransfer
function. The
magnitude canbe
identified
using
second-orderstatistics ofthe
output signal.
The
phaseinformation,
however,
is
more complicatedto extract,
andit involves
the
calculation ofhigher-order
statisticsf24^.
The
purposeofthis
workis
to
develop
anewblind
deconvolution
algorithm well suited
for
problems wheretracking
ofhigher-order
statistical variationsis
needed.The
convergencebehavior
ofthe
algorithm willbe
investigated,
andits
performance willbe
comparedto
other well-knownadaptive equalization schemes.
Chapter
2
provides a complete review ofthe
field
ofadaptiveequalization.
Chapter
3 introduces
higher-order
statistics and some ofthe
conceptsbehind
their
usein
equalization.In
Chapter
4,
the
development
ofthe
newblind
equalizationalgorithmis
shown.Chapter
5
presentsthe
results ofusing this
new algorithm withavariety
of communicationschannels,
and comparesthese
resultsto those
obtained with otheralgorithms.Finally,
the
conclusion and suggestionsfor
further
study
canbe found
atthe
end ofChapter
2
Adaptive
Equalization
An
equalizer compensatesfor
the
non-ideal characteristics ofacommunication channel.
The
term
equalizationis
synonymous withdeconvolution
andinverse
modeling.The
waveform ofthe transmitted
signalarrives at
the
receiverconvolvedwiththe
impulse
response ofthe
channel(Fig.
1),
hence
the
use ofthe
term
deconvolution,
to
expressthe
operation ofrestoring the
original signal.In
orderto
deconvolve
achannel,
it is
necessary to
model
the
inverse
ofits
frequency
response,
wherefromthe term
inverse
modeling
is
derived.
The inverse
model ofanunknown systemis,
itself,
a systemwithafrequency
responseapproximating
asmuchas possiblethe
reciprocal ofthe
unknown
frequency
response.In
the
case ofcommunicationschannels, time
dispersion is
the
most commontype
ofdistortion,
giving
riseto
Intersymbol
Interference (ISI). A
dispersive
channelis
onein
which signals atdifferent
frequencies
travel
withdifferent
velocities,
ordifferent
group
delays.
Other
types
oftransformations
performedby
the
channel arefrequency
translation
andnonlinearor
harmonic distortion.
Also,
corruption ofthe
input
waveformfactors
such asbackground
thermal
noiseprocesses,
impulsive
noise andchannel
fading.
Input data
sequence Encoderand
Transmitter filter
Channel
Outputdata
sequence Decision
Deviceand
Decoder
Noise
*
Adaptive Equalizer
Figure
1.
Data
transmission
systemReceiver filter
I SamrSampler
Intersymbol
Interference
(ISI)
Intersymbol interference
appearsin
all pulse-modulationsystems:frequency-shift
keying (FSK),
phase-shiftkeying (PSK),
quadrature amplitudemodulation
(QAM)
and pulse-amplitudemodulation(PAM).
Figure
1
represents a generalized equivalentmodel ofadigital
communicationsystem.
For
the
sake ofsimplicity,
it
willbe
assumedthat the
"channel"
includes the
effects ofthe transmitter
filter,
the modulator, the
actual
transmission
channel,
andthe
demodulator (see Fig.
2).
The
outputsignal
y(t)
is
the
superposition ofthe
impulse
response ofthe
channelh(t)
to
each symbol
in
the
input
sequence x(n)(i.e.
convolution)
plus additive whiteNoise
N(t)
InPutfrJ
Channel I&(Y)
mt Output x(n)^
|
h(t)
_|^W
y(t)Figure 2
y(t)
=Y^xtmit
-kTs)
+
N(t)
(1)
k
where:
Ts
:=signaling interval
(seconds),
andthus
1/Ts
:=data
transmission
rate(symbols/second)
Sampling
the
outputy(t)every
Ts
seconds :y(nTs
)
=^x(k)h(nTs
-kTs)+
N(nTs
)
(2)
k
Making
the
substitutionnTs
=n,
the
following
expressionin the
discrete-time
domain is
obtained:y(n)
= x(n)+^x(k)hn_k
+N(n)
(3)
The
summationterm
in
equation3 is
the
interference from
neighboring
symbols
(ISI),
andit
canbe
seento
consist ofpast samples ofthe
input
data
sequencex(n),
weightedby
samplesofthe
channelimpulse
responseh(t). The
first
term
is
the
desired
signalx(n),
andthe
last
is
the
additive whiteGaussian
The
ISI is
zeroif
andonly
if
the
channelimpulse
responsehas
zerocrossings at
Ts-spaced
intervals,
that
is,
if
h[(n-k)Ts]=0 for
k*n in
equation3.
When
the
impulse
responsehas
suchuniformly-spaced zero crossings(see
Figure
3
below),
it
satisfiesNyquist's
first
criterion, namely that the
channelhave
no responsebeyond
twice the
Nyquist bandwidth
fn=l/2Ts.
In
channelswhich exhibit
time
dispersion (and
thus
ISI)
the
zero crossingsin
the
impulse
responsehave
movedsothat
they
do
not occur atregularTs-spaced intervals.
(a)
Impulse
responseh(t)
-l/2Ts l/2Ts
(b)
Frequency
responseH(z)
The
effectofIntersymbol
Interference
canbe
seenin
practicefrom
atrace
ofthe
receivedsignal,
on an oscilloscope withits
time
base
synchronizedto the
symbol rate.Such
atrace
is
called an"eye
pattern".Figure
4
shows aneye pattern
for
abinary
PAM
system withnoISI.
If
a channel satisfiesthe
zero
ISI
condition, the
"eye"is
then
fully
openandthere
areonly two
distinct
levels
(for
the
binary
PAM
case athand)
atthe sampling time.
The
peakdistortion
(also
shownin
Fig.
4)
is
the
ISI
that
occurs whenthe
data
patternis
such
that
allthe
intersymbol interference
terms
in
Eq. 3
addup to
producethe
maximum
deviation from
the
desired
signal atthe sampling
time.
Signal
level
Sampling
timeFigure
4.
Outline
ofabinary
eye pattern.The
purpose of anequalizeris
to
minimizethe
ISI,
which would alsominimize
the probability
ofanincorrect
decision
atthe
receiver.Without
the
equalizer, the
eyepatternwouldnotbe
open asin Fig.
4,
but
instead it
would showawidedisparity
in
the
positive andnegative sinepulses,
indicating
the
Linear
Transversal Equalizers
The
mostcommonly
usedfilter
structurein
channel equalizationis
the
transversal
filter,
alsoknown
astapped-delay
line
filter,
shownin
Fig. 5. In
such a
filter
the
currentand pastvalues ofthe
received signaly(n)arelinearly
weighted
by
the
equalizer coefficientswk,
andthen
summedto
producethe
output z(n).
Received
signal
y(n)
l
y(V)>r~l
-2)>y(n-M+2)^_|
1
y(n-M+1)
y
w2
*
Figure
5. Transversal
equalizer.The
equalizeroutputis
givenby:
M-l
z(n)=
^wky(n-k)
k=0
where:
M
:=Number
ofequalizer weights(taps).
wk
:=Equalizer
weights.Output
signal
z(n)
Eq.
4 is
called afinite
convolutionsum,
because
it
convolvesthe
finite
impulse
response ofthe
equalizerfilter
{wk)
withthe
filter
input
(y(n)}
to
produce
the
filter
output{z(n)}.
Zero-forcing
criterion
If
the
equalizercoefficientswk,
k=0,l,....,M-l
are chosento
force
the
samples of
the
combinedchannel and equalizerimpulse
responseto
zero at allbut
one ofthe
M
Ts-spaced instants
in the
space ofthe equalizer, then
we callsuch anequalizera
zero-forcing
(ZF)
equalizert14l
8
^wkh{t-kTs)
k=0
t0-Ts
t0
t0+Ts
t0+2Ts
t0+3Ts
t0+4Ts
t0+5Ts
t0+6Ts
t0+7Ts t0+8Ts
t0+9Ts to+10Ts
V
1
'
Span
of9-tap
ZF
equalizerFigure 6.
Combined
impulse
response ofa channel andzero-forcing
equalizerin tandem
&2\
Let
Sn
be
the
convolution sumofhn
andwn
(see
Fig.
7),
that
is,
the
impulse
response ofthe
combined channel plus equalizerimpulse
responses: COsn
=Y.WkK-k
(5)
The
equalizerwk
is
assumedto
have
aninfinite
number oftaps.
Its
output atthe
n^hz(n)
= s0x(n)+x(k)sn_ji
+
^wiN(n-l)
k*n
l=-oo
(6)
The first
term
in
Eq. 6
represents a scaledversion ofthe
input (which
is
the
desired
symbol atthe
output).The
secondterm
is
the
intersymbol
interference,
andthe third
is
the
contribution ofthe
additive whiteGaussian
noise.The
peakdistortion,
whichis
the
peak value ofthe
secondterm
in
(6)
is:
D=
f|sn|
(7)
n*0
d=
I
n=
-n0
*Lwkhn-k
k=-(8)
Hence
the
peakdistortion D
is
afunction
ofthe
equalizerweightswk-If
the
equalizerhas
aninfinite
number oftaps,
it is
possibleto
choosethe
tap
weights suchthat
D=0. This
would meanthat
the
combined channel plus equalizerimpulse
response Sn=0for
all nexceptn=0.That
is,
the
intersymbol
interference
canbe
completely
eliminated.The
valuesofthe
equalizerweightsthat
would achievethis
aredetermined
by
the
condition:oo _
__
V
z,
_J
1
for
n=0sn
~Lwkrin-k
-jo
for
n^O(9)
Input
ix(n)
Channel
H(z)
Noise,
N(n)
*iy
y(n)Equalizer
W(z)
Output
z(n)
Taking
the
z-transformofEq. 9:
S(z)
=W(z)H(z)=l
Solving
for W(z):
W(z)
=(10)
(ID
H(z)
So
complete elimination ofthe
intersymbol interference
requiresthe
useofan
inverse filter
to the
channelH(z).
Such
aninverse filter is
calledazero-forcing
filter.
Notice,
however,
that
such perfectinverse filter
requiresthe
useofan
infinite
number oftap
weights,
whichis
notfeasible
in
practice.Mean-Square-Error
(MSE)
criterion
In
this criterion, the
equalizer weight coefficientswk
are adjustedto
minimize
the
mean-square value ofthe error,
defined
as:e(n)
=x(n)-z(n)
(12)
Where
x(n)is
the input
(desired
signal)to the
communicationssystem,
and z(n)is
the
output ofthe
equalizer(Fig.
7).
Let
usdefine
the
costfunction J
to
be
minimized
for
the
MSE
criterion as:J
={|eU)|2}
(13)
Where
E{}
standsfor
the
expectationoperation.By
substituting
(12)
into (13):
J
=El\x(n)-z(n)\2\
(14)
and
(4),
modifiedfor
an equalizer withaninfinite
number oftaps,
into (14):
J
=E
So
the
costfunction J
is
a quadraticfunction
ofthe
equalizercoefficients{wk}. This function
canbe
easily
minimizedwithrespectto the
(wk)
to
yieldaninfinite
set oflinear
equationsfor
the
(wk)
coefficients[20l
Another
way to
obtainthe
setoflinear
equationsfor
the
equalizercoefficients
{wk)
is
to
invoke
the orthogonality
principlein
mean-squareestimation,
i.e.
to
selectthe
weights(wk)
suchthat the
errore(n)is
orthogonalto the
input
signal sequenceto the
equalizer(y(n)} t2];
E{e(n)y*(n-l)}
=0
- <I
< oo(16)
where
the
y*denotes
the
conjugate ofthe
sequence{y(n)j,
in
casethe
data
is
complex-valued.
Substituting
e(n)into
(16):
E
{(
\
x(n)-^Wfryin-k)
y*(n-l)
=0
(17)
Taking
the
summation andthe
equalizer coefficients outofthe
expectation:CO
X
wkE{y(n-k)y
*{n
-I)}
=0
=E{x(n)y
*(n
-1)}
k=
for
-oo <I
< oo(18)
In
orderto
evaluatethe
expectationsin
(18),
an expressionis
neededfor
y(n).
Suppose
the
channelhas
afrequency
responseH(z),
whereH(z)
is
apolynomialof
degree L. Thus:
y(n)=
^hjx(n-j)
+
N(n)
j=0
(19)
y(n)
:=Data
sequencegoing
into
the
equalizer.hj
:=Impulse
responseofthe
communications channel.x(n)
:=Input
data
sequenceinto
the
channel.N(n)
:=Additive
whiteGaussian
noise sequence.Substituting
(19)
into
(18),
we obtaint^:
L
*E{y(n-k)y*(n-l)}=
h
fhj+l_k
+
N08lk
+
No5lk
fr
\l-k\<L
0
otherwiseand,
'
h*_t
for
-L<1<00
otherwise(20)
E{x{n)y*(n-l)}
=(21)
where:
5lk
:={
0,
\,kJ
(Knecker
delta)
No
:=Spectral
density
ofthe
additive whiteGaussian
noise.T"hh
'=Autocorrelation function
ofthe
channelimpulse
response=
E{h*{n)h{n
+
k)}=
X
hlhk+n
k
=0,l,....,L
(22)
=0
L
:=Number
of roots ofthe
polynomialH(z),
the
frequency
responseof
the
channel.Substituting
Eqs.
(20)
and(21)
into
(19),
andtaking
the
z-transform:W(z)
H(z)H*(z-1)
+
N0
H^-iz'1)
(23)
And
solving
for
W(z)
weobtainthe transfer
function
ofthe
equalizerbased
onW(z)
=U
/
(24)
H{z)H*{z~1)
+
N0
Equalizers built
using the
MSE
criterionare more robustthan
ZF
equalizers,
because
the
equalizercoefficients arechosento
minimizethe
mean-square-error
(the
sum ofthe
squaresofallthe
ISI
terms
plusthe
noisepowerat
the
outputofthe
equalizer).As
wesaw, the
ZF
criterion neglectsthe
effectof noise.
A finite-length ZF
equalizer willminimizethe
peakdistortion (worst
case
ISI)
only
if
the
peakdistortion
before
equalizationis
less
than
100%,
i.e. if
the
binary
eye patternis
initially
opent14l
This
conditionis
oftennotmet,
especially
athigh
speeds onbad
channels,
whichis
why
mostcurrenthigh
speedvoice-bandmodems use
MSE
equalizers.The Least-Mean-Square
(LMS)
algorithm
The
mostcommonequalizer(Fig.
8)
updatemethodinvolves
updating
each weight coefficient
every time
asymbol comesthrough
the
communicationsystem.
What
makesthis
possibleis
that the
MSE is
a quadraticfunction
ofthe
equalizer coefficients{wk),
as seenin
the
previous section.In
orderto
minimizethe
error signal(Eq.
12),
a costfunction
wasdefined
(Eq.
13)
which representedthe
mean-square value(energy
signal)
ofthe
error.Why
choosethe
squareofthe
errorasthe
objectivefunction? The fourth
powerof
the
error wouldhave
been just
asvalid.The
reasonis
that
we canuse agradient-search algorithm
to
find
the
lowest
pointin
the
bowl-shaped
quadratic performancesurface,
andto
find
the
gradientthe
derivative has
to
be
taken.
solve, especially
when comparedto the
non-linearhigher-order
equations wewouldobtain otherwise.
Additive
Gaussian
noise
N(n)
Input
x(k) ,
Channel
H(z)
-*L@
&+-Desired
signal
x(n)
Figure
8.
LMS
equalizer.Thus,
finding
the
optimum equalizer weightsinvolves
finding
the
pointalong the
performance surface wherethe
gradientis
zero.Let
the
vector ofequalizer weights
be
defined
as:W
=[w0
wj, w2
(25)
The
costfunction
J
=El
\e(n)\
> couldbe
approximatedby
time-averaging,
but
that
methodis
too time
andmemory
consuming.Instead,
acoarse estimatewill
be
used,
simply replacing the
expectation operationby
its
currentrealization.
Hence
atthe
k"1realization:But
z(k)
is
the
outputofthe equalizer,
andit
canbe
expressedin
vectorform
as:
z(k)
=YT(k)W(k)
(27)
where:
YT(k)
=[y(k)
y(k-l)
y(k-2)
y(k-L+
l)]
(28)
and
W(k)
=[w0(k)
wx(k)
w2(k)
(29)
L
:=Number
of equalizertap
weights.So
substituting
(27)
into (26):
Jk~
x(k)-YT(k)W(k)
(30)
Then
the
gradient canbe
expressed as:VtJ
=e2(k)
<9W(k)
(31)
=
2e(k)
de(k)
<?W(k)
2e(k)-x(k)-YT(k)W(k)
<9W(k)
Thus:
V
kJ = -2e(k)Yl
(k)
(32)
To initialize
the algorithm,
startwithaninitial
guessW0,
and movein
the
direction
ofdecreasing
gradient magnitude.The
search algorithmhas
the
general
form:
W(A
+
l)
=W(A)-//VfcJ
(33)
where:
U :=
Step
size.And
sothe LMS
algorithmis:
W(/e
+
1)
=W(k)
+
2fue(k)Y
l
(k)
(34)
The LMS
algorithmis
very
important
andwidely
usedbecause
ofits
simplicity
and ease ofcomputation.It does
not require off-line gradientestimations orrepetitions of
data.
However,
atevery
step
during
the
equalization
process, the
desired
response(usually
in the
form
ofatraining
sequence)must
be
known
in
orderto
generatethe
error signale(k)in
(34).
Decision-directed
equalization
Another
type
ofequalization uses an estimateofthe
error signaldifferent
from
that
ofthe
LMS
algorithm.Instead
ofusing
atraining
sequenceto
generate an errorsignal,
it
uses an estimateofthe input.
Additive
Gaussian
noise
N(n)
Input
x(k) ,
Channel
H(z)
*L0
*_!_-?
/
Equalizer
W(z)
Output
z(n)
z(n)
Error
signal
I
Decision
device
<P^
e(n) \s__xx(k)
Figure
9.
Decision-directed
equalizer.The
algorithm usedis
the
same asthe
LMS
algorithm,
but
the
errorsignal
differs
asfollows
(Fig.
9)
:e(k)
=x(k)-z(k)
The
error signalis
derived
from
the
receiverestimate,
whichis
notnecessarily
correct.For
this reason,
it is
best
to
usethis type
ofequalizerwithlow-noise,
low-distortion
channels.But
the
mostcommon use ofthe
decision-directed
equalizeris
afterthe
channelhas
already
been
equalized(i.e.
the
eyepattern
is
already
open)using
another,
morerobust,
algorithm.Then,
this
equalizer can
track
slowvariationsin
the
channelcharacteristics.Decision-Feedback (DFE)
equalizers
This
type
of equalizeris
very
usefulfor
channels withsevere amplitudedistortion. It
has
anonlinear structurethat
usesdecision
feedback
to
cancelthe
interference from
symbolswhichhave
already
been detected.
Assuming
past
detected
valuesto
be
correct, the
ISI
contributedby
these
symbols canbe
canceledexactly
by
subtracting
anappropriately
weighted version ofthese
symbols
from
the
output ofthe
equalizer.Input
xOO,Noise
N(n)
Channel
H(z)
yny
Feedforward
equalizerWF(z)
e(k)Desired
signal(Training
signal) z(k)Feedback
equalizerWB(z)
Error
signa e(k)Training
signalDecision
device
x(k:Estimate
oftheinpu^
x(k) e(k)The DFE
equalizer consists oftwo
sections(see
Fig.
9)
: afeedforward
section,
whichis
like
the
linear
transversal
equalizerdiscussed
earlier;
and afeedback
section,
whichis
the
one usedto
removethat
partofISI
causedby
the
pastdetected
symbolsfrom
the
equalizer output.The
equalizer output canbe
expressed as:z(k)
=YT(k)VyTF(k)
+
XT(k)WB(k)
(36)
where
the
equalizertaps
areupdated asfollows:
WF(k
+
1)
=WF(&)
+
/Je(k)YT
(k)
(37)
and
Chapter
3
Higher-Order Statistics
There
arethree
mainreasonsfor
using
higher-order
statisticsin
signalprocessing
t18^
:1.
To
extractinformation
due
to
deviations from Gaussianness (normality).
2.
To
estimatethe
phase of non-Gaussian parametric signals.3.
To detect
and characterizethe
non-linear properties ofmechanisms whichgenerate
time
series via phase relations oftheir
harmonic
components.The
first
two
motivations presentedaboveapply
directly
to this
workonadaptive equalization.
The
first
oneis
due
to the
fact
that
for
Gaussian
processes,
all polyspectra of orderhigher
than
two
areidentically
zero.In
signal
processing applications,
any
periodicor quasi-periodic signal canbe
characterized as
non-Gaussian,
while most additive noise processes arewhiteGaussian.
Hence
working
withhigher-order
statisticshas
the
advantage of notbeing
affectedby
Gaussian
noise.The
secondmotivationis
based
onthe
fact
that
higher-order
spectrapreserve
the
phaseinformation
ofnon-Gaussian signals.In
the
previouschapterwe sawequalizationmethods
based
onleast-squares
criteria.They
arewidely
usedbecause
they
yieldlinear
equationsthat
areeasy to
solve.But
the
information,
andhence
those
approaches cannot copewith systemsthat
arenon-minimum phase.
Definitions
Moments
and cumulants of stochastic signals
Let's
define
a setof n random variables[x\
,x2
,3
, ,xn
}
.Their
joint
moments of order r =k\
+
k2
+
+&n
are givenby
t19^
Mom
1
'2
' 'n
**<*N_*
<n
_ r
dr(cQ1,C02,.
...,con)dCD^dCDr?
, ,dcO^
n
(39)
a>i
=co2
=....=con
=0where
Ol^,^,...^^^^^^2^^)}
(40)
is
their
joint
characteristicfunction.
Another
form
ofthe
joint
characteristicfunction
is
defined
asthe
natural
logarithm
ofO
(
CO
1
,C02
, ,COn
)
:x(co1,co2,....,con)
=Ln[(co1,co2,....,con)}
(41)
The
joint
cumulantsof order r ofthe
same set of randomvariablesaredefined
asthe
coefficientsin
the
Taylor
expansion ofthe
secondcharacteristicCum
,xh
,xK
n -(-j)'drx(co1,co2,....,con)
M
dco^dco^
dC0
12
nK
(42)
fl)1
=fl)2=....=COn=0
Thus
the
joint
cumulantscanbe
expressedin terms
ofthe
joint
moments of a set of randomvariables.
The
relationships canbe
obtainedby
substituting:
<&(fi>l)=
l
+
ja>im1--^-m2
+
-mk+.
(43)
into
(39), (41),
(42)
andworking
outdifferentiations
aboutzero.For
example,
the
moments ofthe
random variable{xi}:
m\
=Mom\x\\
=_5{xx}
m2
=Mom[xixi]
=E\x+
\
7723
=M'omfaiix^i]
=El
X": >7724
=Mom\xiX\XiXi\
=E\x,
\
are related
to
its
cumulantsby:
c\
=Cum[xi]
=mi
r i 9
c2
=CumyxiXi
\
=m2
-m*C3
=CM7n[xi^iXl]
=7723
-37722^1
+
2772^
C4
=Cum[xix
i^i^i]
=7724
-477237721
=3tt22
+
12m2mi
-6772?
The
generalrelationship
between
moments of[x^
,x2
,X3
, ,xn
\
Cum
1 9 O >>*'.,n=
EH)p"1(p-i)!s-\ r
X7
E
ies
iesi
j
[ies2
where
the
summationextends over all partitions(
Sj
,
s2
, >sp
)>
P=l>2,...,n,
of
the
set ofintegers (1,2,..
..,n).For
example, the
setofintegers
(1,2,3)
canbe
partitioned
into:
p=l si=
{1,2,3}
p=2
s1={l}
s2={2,3}
si={2)
s2={l,3}
s1={3}
s2={l,2}
p=3
si={l)
s2={2}
s3={3}
And hence
the
third
ordercumulant ofthe
random variables[x
i
,x2
,x3
j
is:
Cw772[x1x2x3
]
=E{x1x2xs
}
-E{xi
}E{x2x3
}
-E{x2
}E{xix3
}
-E{x3}E{xix2
}
+
2E{x1}E{x2}E{x3
}
(45)
Moments
and cumulants
of
stochastic processes
Let
{ X(k)
},
wherek
=0,
1, 2,
....be
astationary
randomprocess.If
the
momentsof{ X(k) }
up to
order nexist, then:
Mom[X(k),X(k
+
r1),....,X(k+
zn_1)]
==
E{X(k)X(k
+ r1)....X(k +
Tn_1)}
The
momentsdepend
only
onthe time
differences
T\
,T2
,. . .Tj
=0,
1, 2,
...for
alli.
Hence,
simplifying the notation, the
momentsofastationary
random process canbe
written as:m^(rltT2t...trn.1)
=E{X(k)X(k+r1)....X(k
+
Tn.1)}
(46)
and
the
n"1order cumulants:
C*(Ti,T2,...,Tn-]_)
=Cum[X(k),X(k
+
T1),....,X(k+Tn_1)]
(47)
Combining
(44),
(46)
and(47)
the
following
relationshipsbetween
moments andcumulants ofa
stationary
random process{
X(k)
}
are obtained:First-order
cumulants:CX
=
E{X(k)}
=rn*
Mean
value(48)
Second-order
cumulants:Set
ofintegers
:(0,1)
Partitions:
p=lS^
={0,1}
p=2
si
={0}
S2
={1}
c^(T1)
=E{X(k)X(k
+
T1)}-E{X(k)}E{X(k+r1)}
(49)
And in
terms
ofmoments:c2
(Tl)
=m2
(Ti)-fm*
1
Covariance
sequence(50)
=
mx2(-Tl)-[mxl)
=
cx2(-r1)
where:
mo
(Ti)
=E{X(k)X(k+
Third-order
cumulants:Set
ofintegers:
(0,1,2)
Partitions:
p=lSj
={0,1,2}
p=2
Si
={0}
s2
={1,2}
Si
={1}
s2
={0,2}
Si
={2}
s2
={0,1}
p=3
Si
={0}
S2
={1}
s3
={2}
CX(Ti,T2)
=E{X(k)X(k+Ti)X{k+T2)}
-E{X(k)}E{X(k+Ti)X(k
+
T2)}
-E{X(k
+
Ti)}E{X(k)X(k+T2)}
-E{X{k
+
T2)}E{X{k)X(k+Ti)}
+2E{X(k)}E{X(k
+
Ti)}E{X(k+
t2
)}
(51)
So
in terms
ofmoments:c3(ri,T2)
= mx(Ti,T2)-mxmx(T2-T1)-mxmx(T2)-mxmx(Ti)
+
2([mx)j
(52)
Fourth-order
cumulants:Set
ofintegers:
(0,1,2,3)
Partitions:
p=ls1
={0,1,2,3}
p=2
Si
={0,1}
s2
={2,3}
s1
={0,2}
s2
-{1,3}
Si
={0,3}
s2
={1,2}
Si
={0}
s2
={1,2,3}
Si
={1}
s2
={0,2,3}
Si
={2}
s2
={0,1,3}
Si
={3}
p=3
s1
={0,1}
s2
={2}
s3
={3}
sx
={0,2}
s2
={1}
s3
={3}
S!
={0,3}
s2
={1}
s3
={2}
Sl
={1,2}
s2
={0}
s3
={3}
sx
={1,3}
s2
={0}
s3
={2}
sx
={2,3}
s2
={0}
s3
={1}
p=4
sx
={0}s2
={1}
s3
={2}
s4
={3}
CX4{Tl,T2,T3)
=E{X(k)X(k+Ti)X(k
+
T2)X(k+T3)}
-E{X{k)X(k
+
Ti)}E{X(k+T2)X(k+T3)}
-E{X{k)X{k+T2)}E{X(k+Ti)X(k+T3)}
-E{X(k)X(k
+
t3
)}{X(&
+
Ti
)X(&
+
T2
)}
-{X()}{X(&+ Ti)X(&+ T2)X(/e
+
t3)}
-E{X(yfe
+
Ti)};{X(^)Z(^+
T2)X(^+T3)}
-E{X(k
+
t2
)}E{X(k)X(k
+
T1)X(k+
t3
)}
-E{X(^
+
T3)}E{X(^)X(^+
T1)X(/e+T2)}
+2E{X{k)X(k +
Ti)}E{X(k+
t2
)}E{X(k
+
t3
)}
+2{X(&)X(& +
t2
)}E{X(k
+
Ti)}E{X(k+
t3
)}
+2E{X(k)X(k +
t3
)}E{X(k
+
Ti)}E{X(k+
t2
)}
+2{X(/e +
ti)X(A+
r2
)}{X()}{X(&
+
t3
)}
+2E{X(k +
Ti)X(k+
T3)}E{X{k)}E{X(k+
T2)}
+2E{X(k
+ r2)X(k +
T3)}E{X(k)}E{X(k+Ti)}
-6E{X(k)}E{X(k+Ti)}E{X(k
+
T2)}E{X(k+z3)}
(53)
In
terms
ofmoments:cl{Ti,T2,T3)
=mx(Ti,T2,z3)-mx(Ti)mx(T3-T2)
-mx{T2)mx(T3-Ti)-mx(T3)mx(T2-Ti)
-mxmx(Ti,T3)-mxmx(Ti,T2)
+2(t7x)
mx(T1)
+
2(mx)
m|(T2)
+
2(mj)
m*{r3)
+2(l)
m2(T3
-Tl)
+
2(ml
)
ml{H-*2)
+2(mj)
772^(T2-T!)-6(772*)
(54)
The
kurtosis
y?is
defined
asthe
fourth-order
cumulant ofazero-meanstochastic process
(i.e.
mx
=
E{X(k)}
=0)
for
whichTi
=T2
=T3
=0:
yx=cl(0,0,0)
=E{x4(k)}-3E2{x2{k)}
(55)
If
Ti
=T2
=T3
=I
and 772-, =0,
the
fourth-order
cumulantis:
c\
(1,1,1)
=E{x(k)X3{k
+
l)}-3E{X(k)X{k
+
l)}E{x2{k
+
l)\
(56)
Properties
of
moments and
cumulants
From
HT];
1.
Mom[aixi,a2x2
,....,anxn} =
aia2....anMom[xi,x2,....,xn]
Cum[aiXi,a2x2,....,anxn]
=aia2....anCum[xi,x2,....,xn\
where
(
a1
,a2
,>an) are constants.
2.
Moments
and cumulantsare symmetricfunctions
in
their
arguments:Mom[xi,x2,x3\-
Mom[x2,xi,x3]
=Mom
3. If
the
randomvariables{xi,X2,....,Xn}
canhedivided into
two
ormoregroups which are
statistically
independent,
their
nth-ordercumulantis
identical
to zero;
i.e.,
Cum[
xi
,x2
,....,xn
]
=
0
whereas,
in
general,
Mom[xi,x2
,....,xn
\
*
0.
4. If
the
setsof random variables[xi,x2,....,xn
}
and{yi,y2,----,yn}
are
independent,
then:
Cum[x1+y1,x2
+y2,....,xn
+
yn]
=Cum[xly...,xn]
+
Cum[yi,...,yn]
whereas
in
general:Mom[xi+yi,x2
+y2,....,xn +yn
]
=E{(xi+yi)(x2
+y2)....(xn
+
yn)}
Mom[xi,...,xn]
+
Mom[y1,...,yn]
However,
for
the
random variables \yi,X\,x2, ,Xn
}
wehave
that:
Cum[xi+yi,x2,....,xn]
=Cum[xi,x2,....,xn]
+
Cum[yi,x2,....,xn]
and
Mom[xi+yi,x2,....,xn
]
-Mom[x1,x2
,....,xn]
+
Mom[yi,x2
,....,xn]
5. If
the
set ofrandom variables{rti
,x2
, ,xn
}
is
jointly
Gaussian,
then
allthe information
abouttheir
distribution is
containedin the
moments ofordern
<
2
.Therefore,
allmoments of order greaterthan two
(
72>
2 )
have
no newinformation to
provide.This
leads
to the
fact
that
alljoint
cumulants of ordern
>
2
areidentical to
zerofor Gaussian
random vectors.So
the
cumulants ofEqualization
with
higher-order
statistics
The
purpose ofblind
equalizationis
to
identify
the
inverse
ofanunknownlinear
time-invariant
(possibly
non-minimumphase) system withoutany
physical access
to the
systeminput
signal.Such
operationrequiresthe
identification
ofboth
the
magnitude andthe
phase ofthe
system'stransfer
function.
Identification
ofthe
magnitude canbe
accomplishedwithsecond-order statistics
alone,
but
finding
the
phaseinvolves
the
higher-order
statisticsof
the
received signal.In
this sense, in
orderto
find
the
phase ofthe
system'stransfer
function,
someform
ofnonlinearity
mustbe
used.Depending
on wherethe
nonlineartransformation
is
being
applied onthe
data,
three
important
families
ofblind
equalization algorithmshave
appearedt17l;
1.
The
Bussgang
algorithms,
wherethe nonlinearity
is
in the
outputofthe
adaptive equalization
filter.
2.
The Polyspectra
algorithms,
wherethe nonlinearity
is
in the
input
ofthe
adaptiveequalizer
filter,
and3.
The
algorithms wherethe nonlinearity
is inside
the
equalizationfilter,
i.e.,
nonlinear
filter
(e.g.
Volterra)
orneuralnetwork.The
Bussgang
algorithms aregenerally
implemented
withLMS-based
approaches.
Of
the three
families
mentionedabove, the
Bussgang
algorithmshave
by
far
the
lowest
computationalcomplexity,
whichis
only
slightly
greaterthan that
of a conventional adaptive equalizerequippedwithatraining
phase.Hence,
for
the
rest ofthis
workI
willconcentrateonly
onBussgang-
type
The
Bussgang
deconvolution
techniques
These
algorithms areiterative
deconvolution
schemesthat
utilizememoryless nonHneaar
transformation
atthe
outputofthe
equalizerto
generate a
"desired"
signal
{an
estimate ofthe
input
signal)
ateachiteration.
Fig.
10
shows ablock diagram
ofablind
deconvolution
scheme:(Channel
output)Received
signal _.
y(n)
Transversal
filter
W(z)
7
z(n)
Zero-memory
nonlinear estimator
____
Desired
signal x(KW*<!>*
LMS
algorithm e(n)
Figure
11.
Blind
equalizer.The
key
componentin
this
schemeis
the zero-memory
nonlinearestimator,
which allows usto
estimatethe
input
data
sequence x(n) giventhe
deconvolved
sequence z(n).The
mean-square error(MSE)
may
be
usedto
determine
the
best
estimateofx(n)
givenz(n).The
choice ofthis
optimizationcriterion yields a conditional meanestimator
that
is
both
sensible androbust
1121.
Given
the
observationz(n), the
conditional mean estimatex(n)
ofthe
random variable
(input
signal)
x(n)is
written asE{x|z}
(dropping
the
time
indexes
for
convenience ofpresentation),
whereE{ }
denotes
the
expectationx =
E[x\z}=
\xpx
(x\z)dx
(57)
CO
where:
px
(x\z)
is
the
conditionalprobability
density
function
ofxgiven z(a
posterioridensity)
Using
Bayes' rule onthe
conditionalpdf,
(57)
becomes:
7
p7{z\x)pY(x)
x=
xFzK
'
,
fdx
(58)
Pz(z)
1
x = -
\xpz(z\x)px(x)dx
(59)
where:
pz
\z\x)
~N\
x{ri),
G
-kt]
,i.e.,pz
(z\x)
is
normally
distributed
with2
meanx(n)
(the input
signal),
andvariance(7^-,
the
variance ofthe
additive
Gaussian
noiseN(n).
px
(x)
is
the probability
density
function
ofthe
input
signal,
x(n).If
x(n)is
zero-meanGaussian,
with varianceG
,i.e.
px
(x)
~
N\
0,
G
J,
then
(59)
reducesto t17]
:a2
x(n)=
,
z(n)
(60)
x+N
In
general,
convergence ofthe
Bussgang
algorithmis
notguaranteed.A
provided
by
Beneviste
et al.&\
but
unfortunately this
infinite length
equalizerassumption
is
unrealisticandunattainablein
practice.To
date,
nozero-memory
nonlinearfunction
has been found
whichguarantees globalconvergence of
the
blind
equalizer,
andhence,
the
problem remains open.Special
cases of
the
Bussgang
algorithm
The
Sato
algorithm:
The
Sato
algorithmt23!
consists ofminimizing
anon-convexcostfunction
ofthe
form:
J(n)
=El[x(n)-z(n)]2\
(61)
where
(refer
to
Fig.
1
1)
:z(n)
:=Output
ofthe transversal
filter.
x(n)
:=Estimate
ofthe transmitted
input
signal.The
estimate ofthe transmitted input
signalis
obtainedusing
the
following
zero-memory
nonlinearity:x(n)=
ysgn[z(n)]
(62)
where
sgn[
]
refersto
the
signumfunction,
and/is
a constantwhich setsthe
gainofthe equalizer,
andis
defined by:
E\x2(n)}
7
=f ,
J
(63)
E{x(n)}
So
the
gainofthe
equalizeris
obtainedfrom
the
statisticsofthe
input
signal x(n).
The
Sato
algorithmfor blind
equalization wasinitially
introduced
to
deal
withthe
one-dimensionalM-ary
PAM
signals.It
is
a robustalgorithm,
andit is
superiorto the
decision-directed
equalizer,
althoughits
rate ofconvergencedecision-directed
algorithm(35),
exceptfor
the
gainfactor y,
whichis
dependent
onthe
input data.
Beneviste
et alShave
provedthat
the
Sato
algorithm can achieve globalconvergenceif
the probability
density
function
ofthe transmitted
data
sequence canbe
approximatedby
asub-Gaussianfunction
such asthe
uniform
distribution
^12\
However,
this
resulthas
been
disputed
by
otherresearchers, reporting
poorperformance ofSato's
algorithm.Just
asin
the
case ofthe
decision-directed
equalizer,
it
has been
establishedthat
almost alwaysSato's
algorithmconvergesto the
correctsolution once
the
eye patternhas been
opened.The
Godard
algorithm:The Godard
algorithm^
consists ofminimizing
anonconvexcostfunction
ofthe
form:
J(n)
=E-
\z(n)\p
-R}
2
(64)
where
p
is
a positiveinteger,
andRp
is
a positivereal constantdefined
as:E\\x(nfp\
Rp
=7
r
(65)
E[\x(n)\P]
The
Godard
algorithmis
designed
to
penalizedeviations
ofthe
blind
equalizeroutputz(n)
from
a constant modulus^12\
Godard
wasthe
first
to
propose afamily
of constantmodulusblind
equalizersfor
usein
two-dimensional digital
communication systems.The
constantRp
is
chosen suchThe
equalizertap
weights{
Wn
}
in
Godard's
algorithm areupdatedusing
an
LMS-type
algorithm^
:W(k
+
1)
=W(k)
+ pe
*(k)
YT(k)
(66)
where:
p
:=Step
size parameter.e*
(k)
:=Conjugate
ofthe
error signal.T
Y
(k)
:=Input
vectorto the
equalizer.The
error signale(k)
is
generated asfollows:
e(k)
=z(k)\z(k)\p-2l[Rp-\z(k)\p)j
(67)
It
canbe
seenfrom
the
abovedefinition
of error signal(67)
andfrom
the
definition
ofthe
costfunction in
(64)
that this
adaptation algorithmdoes
notrequire carrier phase
recovery
.Consequently,
it
runsslower,
althoughit
presents
the
advantageofdecoupling
the
problems of carrier phaserecovery
and
ISI
cancellationfrom
each otherI12l.
Two
particularchoices ofthe
integer
p
aboveyieldimportant
cases:Case
1:
p
=1
In
this
casethe
costfunction
in
(64)
becomes:
J(n)
=E{[\z(n)\-Ri]2\
(68)
where:
E\\x(nf)
#1
=A,
, ,n
(69)
E{\x(n)\}
which can
be
viewed as a modification ofthe
Sato
algorithm.Case
2:
p
=2
J(n)
=E-where:
o i2
\z(nf-R2
(70)
E\\x(n)\4}
^2=-^
T(7D
E\\x(nf\
This
caseis
referredto in the
literature
asthe
constant modulusalgorithm
(CMA)
6\
andis
the
mostwidely
usedin
practiceandthe
mostwidely
investigated
blind
equalizationalgorithm.Although
Godard
showedin
his
original paperthat the
algorithm wouldconverge
to the
global minimum and achieve perfectequalization,
providedit
was
initialized
in
a specialmanner, there
areconflicting
reportsin the
literature
sincethen.
Some have
demonstrated
that
it
is
possiblefor
the
Godard
algorithmto
exhibitill
convergencedue
to the
existence oflocal (i.e.
false)
minima.A
simplified set of
conditions
for
equalization
A
sufficient conditionfor
equalizationis
that the
probability
distribution
of
the individual
recovered symbols atthe
equalizer outputbe
equalto the
probability
distribution
ofthe
individual
transmitted
symbols atthe
channelinput.
This
conditionled
to the
formulation
of ageneral class of criteriathat
converge
to the
desired
response underthe
assumptionthat the
input
distribution
belongs
to
acertainfamily
ofcontinuous-typedistributions
t3l
Note,
however,
that
in digital
communicationsthe
input
distributions
are ofA
new simplified set of conditionsfor
equalizationhas
been derived
by
Shalvi
andWeinstein
f24]>
showing that the necessary
and sufficient conditionsfor
equalization arethat the
second-and
fourth-order
moments ofthe
individual input
and output symbolsbe
equal.HombinecLsystera
S(z)
L_.__.____." ~
T
Input
i^Channel
H(z)
y(n)
Blind
Equalizer
[
Outpi
x(n)
i z(n
Figure
12.
Simplified blind
deconvolution
model.The
objectiveis
to
setthe
taps
{
wk
}
ofthe
equalizer sothat the
outputsequence z(n)
is identical to the
input
sequencex(n), up to
a constantdelay
andpossibly
a constant phase shift.The
impulse
response ofthe
combined systemS(z)
(see Fig.
12)
canbe
expressed asthe
convolution sum ofthe
impulse
responses of
the
channelH(z)
andthe
equalizerW(z):
oo
Sk=hk*wk=
Twlhk-l
(72)
l
=-ooEqualization
condition:We
wantto
setthe
vector of equalizer weightsW
=[
w0
w^ w2
1
sothat
the
combined systemS
=[s0
Si S2
j
is
avectorhaving
only onenonzero component
in
whichthe
magnitude equals one:S
= e^[00 1 0
0]T
(73)
where
6
is
the
phaseshift,
andthe
nonzero componentis
delayed
anunknownConstrained
criterion
In
this
section a criterionfor
equalizationis developed
assuming that the
input
sequenceis independent
andidentically
distributed
(i.i.d.).
Start
by
expressing the
input-output
relationship
as: ooz(k)
= x(k)*sk=^six(k-l)
(74)
Z
=-ooBy
squaring
both
sidesof(74)
andtaking
expected values an expressionis
obtainedfor
the
variance ofthe
output sequencein
terms
ofthe
variance ofthe
input
andthe
impulse
response ofthe
combined system :E\z2
(k)}
= E{x2 (k)^s2(75)
And
similarly,
by
raising
both
sides of(74)
to the
fourth
power,
taking
expected valuesand
using
(75),
an expressionis
obtainedfor
the
kurtosis
ofthe
output sequence
in terms
ofthe
variance ofthe
input
andthe
impulse
responseof
the
combined system:K(z)
=K{x)^sf
(76)
I
where
K( )
representsthe
kurtosis
asdefined in Eq.
(55).
Eqs.
(75)
and(76)
form
the
basis
ofthe
following
theorem
^24\
Theorem:
If
Jz2(/e)}
=J*2()}
then
a)\K(z)\<\K(x)\,
b)
|i_'(2;)|
=|i_'(x)|
if
andonly if the
impulse
response ofthe
combinedT
Proof:
Let
S
=[s0
Si S2
]
be
a vectorof complex variables suchthat
o
Y,\sl\
<-V<-Then:
I
2
2
i
i
(77)
Hence
if
X isZ
I
=^>
then:
D
XM4^i
where
equality
holds
if
andonly
if
S has
at most one nonzero component.I
I
\sl
I
2)
XISZ
I
=1
if
andonly
if
S has
one nonzero component ofmagnitude1.
I
And
sothe
prooffollows
immediately
from
recalling
(75)
and(76).
Hence,
by
the
abovetheorem,
anecessary
and sufficientconditionfor
equalization
is
that
E
I
z(k)\
=E<x
(k)\
and\K(z)\
=\K
(x)\,
whichis
muchsimpler
than
having
to
equalize all moments ofthe probability
distributions.
It
canbe
shown as wellthat
if
the
input
and outputsequences arereal-valued,
orif
the
input
sequenceis
complex-valued suchthat
El
x(k)}
=0
(e.g.
whenthe
realandimaginary
parts of x(k) arestatistically
uncorrelatedwith
the
samevariance), then the
condition|i(z)|
=|ii_"(:x;)|
canbe
replacedby
E{\z(kf}
=E{\x(kf].
Equalization
criterion:Maximize
\K(z)\
subjectto
Elz2(k)\
= Eix2(k)\
This follows from
the
abovetheorem,
sinceby
equating the
variances ofthe
input
and output sequences weareconstraining the
impulse
response ofi2
the
combined systemto
X \sl
\
=1
(see Eq- 75). And
if that
is
the case,
/
then the
kurtosis
ofthe
output sequence will alwaysbe
smallerthan
or equalto the
kurtosis
ofthe
input
sequence.Hence
by
maximizing
|i^(_j)|
in
the
adaptation