First order Reed-Muller codes - Notes on Coding Theory J I Hall pdf

In 1954, I.S. Reed and D.E. Muller introduced independently a class of binary codes of length 2m, for any integerm, associated with Boolean logic. Thefirst of these codes, for eachm, fits naturally into the context of Hamming codes.

A code dual to a binary extended Hamming code is called afirst order Reed- Muller code, denotedRM(1, m) where m is the redundancy of the associated

first order Reed-Muller code

Hamming code. Any code that is equivalent to afirst order Reed-Muller code is alsofirst order Reed-Muller. We shall concentrate on the specific codeRM(1, m) with generator matrixELm.

The associated dual Hamming code is sometimes called a shortened first order Reed-Muller code or a simplex code. The dual Hamming code can be shortenedfirst order

Reed-Muller code simplex code

4.3. FIRST ORDER REED-MULLER CODES 57 easily recovered from RM(1, m). Indeed by first choosing all the codewords of

RM(1, m) that have a 0 in theirfirst coordinate position and then deleting this now useless coordinate, wefind the dual Hamming code. This is clear when we consider how the matrixELmwas constructed by bordering the matrixLm, the generator matrix of the dual lexicographic Hamming code. (See page 51.)

Having earlier constructed the generatorELmas a matrix in bordered block form, we now examine it again, but blocked in a diﬀerent manner. Notice that

RM(1,1) =_F2

2,RM(1,2) is the parity check code of length 4, andRM(1,3) is

a self-dual extended [8,4] Hamming code. Examples. EL1= 0 1 1 1 and EL2=   0 0 1 1 0 1 0 1 1 1 1 1   EL3=     0 0 0 0 1 1 1 1 0 0 1 1 0 0 1 1 0 1 0 1 0 1 0 1 1 1 1 1 1 1 1 1     EL4=       0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 0 0 0 0 1 1 1 1 0 0 0 0 1 1 1 1 0 0 1 1 0 0 1 1 0 0 1 1 0 0 1 1 0 1 0 1 0 1 0 1 0 1 0 1 0 1 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1      

For i between 2m−1 _{and 2}m_{, the column} _m_{-tuple containing the binary rep-} resentation of i is just that for i₋2m−1 _{with its leading 0 replaced by a 1.}

Therefore, if we ignore the top row ofELm, then the remainingmrows consist of anm_×2m−1 matrix repeated twice. Indeed this repeated matrix is nothing other thanELm−1. We now observe the recursive construction:

ELm=

0_{· · ·} 0 1_{· · ·}1

ELm−1 ELm−1 .

( 4.3.1) Theorem. For each m, the first order Reed-Muller code RM(1, m) is a binary linear [2m, m+ 1,2m−1] code.

Proof. CertainlyRM(1, m) is linear of length 2m_{, and its dimension}_m_{+ 1} is evident from the generator matrixELm. From their generator matricesEL1

andEL2, it is easy to see thatRM(1,1) (=F22) andRM(1,2) (the parity check

code of length 4) both have minimum distance 2m−1.

We now verify the minimum distance in general by induction, assuming that we already have dmin(RM(1, m−1)) = 2m−2. LetC1 be RM(1, m−1) with

minimum distance d1 = 2m−2, and let C2 be the repetition code of length

2m−1_{, whose minimum distance is therefore} _d

2= 2m−1. The generator matrix

ELm forRM(1, m) is then constructed from the generatorsG1 =ELm−1 and

G2 = [1· · · 1] according to the recipe of Problem 4.1.5. Therefore, by that

problem, we have

as claimed. 2

( 4.3.2) Theorem. The first order Reed-Muller codeRM(1, m) consists of a unique word of weight0, namely0, a unique word of weight 2m_{, namely} _{1, and} 2m+1₋₂_{words of weight}₂m−1_.

Proof. The last row of the generator matrix ELm is 1; so 0 and 1 are the unique codewords of weight 0 and 2m_{, respectively. By Theorem 4.3.1 the} linear code RM(1, m) has no codewordscof weight between 0 and 2m−1_{, and}

so it also has no codewords1+cof weight between 0 and 2m−1_{. That is, it has}

no codewords of weight between 2m−1 _{and 2}m_{. Therefore all codewords other} than0and 1have weight exactly 2m−1_. ₂

( 4.3.3) Corollary. The dual of the binary Hamming code of redundancy

mconsists of a unique word of weight 0, namely0, and 2m₋₁_{words of weight} 2m−1_.

Proof. In recovering the dual Hamming code fromRM(1, m), we shorten the code by taking all codewords that begin with 0 and then delete that position. In particular the codeword1of RM(1, m) does not survive. But by Theorem 4.3.2 all other nonzero codewords ofRM(1, m) have weight 2m−1_{. As only zeros}

are deleted, all the nonzero codewords of the dual Hamming code also will have weight 2m−1_. ₂

These dual Hamming codes are equidistant codesin that distinct codewords equidistant codes

are at a fixed distance from each other, here 2m−1_{. They satisfy the Plotkin}

bound 2.3.8 with equality. (The proof of the Plotkin bound as given in Prob- lem 3.1.5 compares the minimum distance with the average distance between codewords. For an equidistant code these are the same.)

For a binary wordx_∈_Fn

2, consider the corresponding wordx∗ ∈{+1,−1}n

gotten by replacing each 0 by the real number +1 and each 1 by₋1. ( 4.3.4) Lemma. If x,y ∈ Fn

2, then as vectors of real numbers x∗·y∗ =

n₋2dH(x,y). In particular if x,y∈F22h withdH(x,y) =h, thenx∗·y∗= 0.

Proof. The dot product of two_±1 vectors is the number of places in which they are the same minus the number of places where they are diﬀerent. Here that is (n₋dH(x,y))−dH(x,y). 2

LetRM(1, m)± _{be the code got by replacing each codeword}_c_of_RM₍₁_{, m}₎ with its_±1 versionc∗_{. List the codewords of} _RM₍₁_{, m}₎± _as_c∗

1,c∗2, . . . ,c∗2m+1. ( 4.3.5) Lemma. Ifc∗_∈RM(1, m)± then also ₋c∗_∈RM(1, m)±. We have

c∗ i ·c∗j = 2m if c∗i =c∗j = ₋2m _if _c∗ i =−c∗j = 0 if c∗ i =±c∗j .

4.3. FIRST ORDER REED-MULLER CODES 59

Proof. As 1 _∈ RM(1, m) we have (1+c)∗ ₌ ₋_c∗ _∈ _RM₍₁_{, m}₎±_{. By} Theorem 4.3.2, if distinct b,c_∈ RM(1, m) with b =1+c, then dH(b,c) = 2m−1_{. The lemma follows from Lemma 4.3.4.} ₂

We use this lemma as the basis of a decoding algorithm. When a vectorr is received, calculate each of the dot productsr_·c∗

i, fori= 1, . . . ,2m+1. Then decode to that codewordc∗

j that maximizes the dot product.

In fact this can be done a little more eﬃciently. Arrange our listing of

RM(1, r) so thatc∗

i+2m =−c∗_i, for eachi= 1, . . . ,2m. That is, the second half of the list is just the negative of the first half. In decoding, we calculate only those dot products from thefirst halfr_·c∗

i, fori= 1, . . . ,2m, and select thatj that maximizes the absolute value_|r_·c∗

j|. The received wordris then decoded to c∗

j ifr·c∗j is positive and to−c∗j ifr·c∗j is negative.

We organize this asHadamard transform decoding. Setn= 2m_{, and let}_H

n Hadamard transform decoding be the n_×n matrix whose ith _{row is the codeword} _c∗

i. The dot products of Lemma 4.3.5 then give

HnHn =n In×n,

since the negative of a row ofHn is never a row. Upon receiving the vector r, we calculate its Hadamard transform r =Hnr . If rj is that coordinate of r that has the largest absolute value, then we decodertoc∗

j in caserj>0 or to to ₋c∗

j in caserj<0.

An important aspect of Hadamard transform decoding is that it is a soft decision algorithm rather than a hard decision algorithm. We need not require that the received vector r have entries _±1. Its entries can be arbitrary real numbers, and the algorithm still works without modification.

Example. Consider the codeRM(1,3)±which comes fromRM(1,3) of lengthn= 23_{= 8 with generator matrix}_EL3_{. Let}

H8=             +1 +1 +1 +1 +1 +1 +1 +1 +1 ₋1 +1 ₋1 +1 ₋1 +1 ₋1 +1 +1 ₋1 ₋1 +1 +1 ₋1 ₋1 +1 ₋1 ₋1 +1 +1 ₋1 ₋1 +1 +1 +1 +1 +1 ₋1 ₋1 ₋1 ₋1 +1 ₋1 +1 ₋1 ₋1 +1 ₋1 +1 +1 +1 ₋1 ₋1 ₋1 ₋1 +1 +1 +1 ₋1 ₋1 +1 ₋1 +1 +1 ₋1             ,

soH8H8 = 8I8×8. The codewords ofRM(1,3)±are then the rows ofH8

and their negatives.

Suppose we receive the vectorr= (1,1,₋1,1,₋1,₋1,1,1). This has Hadamard transform H8r = (2,₋2,2,₋2,2,₋2,6,2). The entry with largest absolute value isr7= 6>0, so we decode to

c∗7= (+1,+1,−1,−1,−1,−1,+1,+1).

If next we receiver= (₋.7,1,0,₋.8,₋.9,1, .9,₋1), then

The entry with largest absolute value isr4 =₋6.3<0, so we decode to

−c∗4= (−1,+1,+1,−1,−1,+1,+1,−1).

( 4.3.6 )Problem. Assume that you are using the codeRM(1,3)± _{of the example.} Use Hadamard transform decoding to decode the received word

(.5, .4,₋.6, .5, .6,₋.3, .5,₋.6 ).

For any positive integern, a _±1 square matrixHn of sidenthat satisfies

HnHn =n In×n

is called a Hadamard matrix. If we take as a code the rows of Hn and their Hadamard matrix

negatives, then Hadamard transform decoding will work exactly as described above. Such a code (or its_{0,1_}counterpart) is called aHadamard code. Hadamard code

( 4.3.7 ) Problem. Prove that a Hadamard matrixHn must have n= 1,2 orn a

multiple of4. (Remark. It is a long-standing conjecture of combinatorial design theory that the converse of this problem is true: for each suchn, there exists a Hadamard matrix.)

Begin with a Hadamard code of length n. Choose those ncodewords that start with +1, drop that position, and translate back to a_{0,1_}code. The result is a binary code of lengthn₋1 and sizenwhich is equidistant of distancen/2. A code constructed in this fashion is ashortened Hadamard code. Starting with shortened Hadamard code

the matrixH8of the example above, we recover fromRM(1,3) andRM(1,3)±

the [7,3] dual Hamming code.

( 4.3.8 )Problem. Lethbe a positive integer. LetC be a binary equidistant code of length2h₋1and size2hwith distanceh.

(a)Prove that C is a shortened Hadamard code.

(b)Prove thatC meets the Plotkin bound 2.3.8 with equality.

Although any Hadamard matrix can be used to design a code that allows Hadamard transform decoding, there are certain advantages to be gained from using those matrices that come from Reed-Muller codes as described. The existence of a soft decision algorithm is good, but we hope to implement it as eﬃciently as possible. Consider decoding using the matrixH8 of the example.

Each decoding process requires 63 operations, 56 additions for calculating the 8 dot products and 7 comparisons among the answers. (By convention the operation of negation is considered to make no contribution.) Certain annoying repetitions are involved. For instance, both the second and the sixth rows ofH8

begin +1,₋1,+1,₋1; so the corresponding calculationr1−r2+r3−r4is made

twice during the decoding process. Can this and other patterns withinH8 be

exploited? The answer is “yes,” and it is this fact that makes a matrix derived from RM(1, m) a better choice than other Hadamard matrices with the same dimensions.

4.3. FIRST ORDER REED-MULLER CODES 61 LetH1= [1], a 1×1 Hadamard matrix, and define recursively a 2m+1×2m+1

matrix in block form

H2m+1= +H2 m +H₂m +H2m −H₂m . Then H2= +1 +1 +1 ₋1 , and H4=     +1 +1 +1 +1 +1 ₋1 +1 ₋1 +1 +1 ₋1 ₋1 +1 ₋1 ₋1 +1    =     +1 +1 +1 +1 +1 ₋1 +1 ₋1 +1 +1 ₋1 ₋1 +1 ₋1 ₋1 +1    .

The matrix H8 is that of the example. This construction can be continued,

for all m. The matrix H2m produced is a Hadamard matrix associated with

RM(1, m)± _{and the Reed-Muller code} _RM₍₁_{, m}_{) whose generator matrix is}

ELm. The recursive construction ofH2m is related to that ofEL_mand admits a streamlined implementation of decoding forRM(1, m) andRM(1, m)±_{, using} the so-called Fast Hadamard Transform orF HT algorithm. For instance,F HT

decoding of RM(1,3)± can be achieved with 31 operations rather than the 63 counted previously.

The Reed-Muller codes in general, and the codeRM(1,3) in particular, are important codes that often arise as constituents of larger codes. It is therefore worthwhile to have decoding algorithms that are as eﬃcient as possible. Sun and Van Tilborg have given a soft decision algorithm forRM(1,3) that is related to

F HT decoding but only needs, on the average, 14 operations with worst-case performance of 17 operations.

Chapter 5

Generalized Reed-Solomon

Codes

In 1960, I.S. Reed and G. Solomon introduced a family of error-correcting codes that are doubly blessed. The codes and their generalizations are useful in prac- tice, and the mathematics that lies behind them is interesting. In the first section we give the basic properties and structure of the generalized Reed-Solomon codes, and in the second section we describe in detail one method of algebraic decoding that is quite eﬃcient.

In document Notes on Coding Theory J I Hall pdf (Page 66-73)