Cryptanalysis of a Composited NLFSR for a de Bruijn Sequence

Since the function In

k contains (k +1) product-of-sum terms whose algebraic degrees

are high and the Hamming weights of these product-of-sum terms are low, as a result, the function In

k can be approximated by a linear function or a constant

function with high probability. In this section, we first investigate the success probability of approximating the function In

k by the zero function. We then study

the cycle decomposition of an approximated recurrence relation after a successful approximation of the feedback function.

5.3.1 Hamming Weights of the Product-Of-Sum Terms

Before calculating the success probability of approximating the function In

k by the

zero function, we need to derive the Hamming weight of a product-of-sum term, since In

k is a sum of (k + 1) product-of-sum terms.

Proposition 5.3.1 For an integer r ≥ 1, the Hamming weight of Xp

r is equal to

2r_.

Proof For any product term X₀p, the r-order composition is of the form

X_rp = Y i∈Zop Ui· Y i∈Zep Vi

where Ui is a sum of 2c variables and Vi is a sum of 2c variables and constant 1, c

is the Hamming weight of r. For simplicity, we assume that r = 2l_{, l ≥ 0. To find}

the Hamming weight of Xp

r, there are two cases arise.

Case I: When 1 ≤ p ≤ r + 1 If r = 2l_{, then U}

i and Vj can be written as Ui = xi + xi+r, i ∈ Zop, Vj = (xj +

i ∈ Zp

o and j ∈ Zep. This implies

x1 = 1 + x1+r = 1 + x1+2r = · · · = 1 + xl1 = 0/1 x2 = x2+r = x2+2r = · · · = xl2 = 0/1 .. . ... xp = 1 + xp+r = 1 + xp+2r = · · · = 1 + xln = 0/1, if p is odd xp = xp+r = xp+2r = · · · = xlp = 0/1, if p is even

where li ≤ p + r, i = 1, 2, ..., p. Note that Xrp is a function in (p + r) variables.

For an (p + r)-tuple with Xp

r = 1, the values at 2p positions are determined by

the values at p positions, which follows from the above set of equations and the remaining (p+r−2p) positions can take any binary values. Hence, the total number of (p + r)-tuples for which Xp

r = 1 is given by 2p· 2r−p = 2r.

Case II: When p ≥ r + 1 Similarly, Xp

r = 1 if and only if Ui = 1 and Vj = 1 for all i ∈ Zop and j ∈ Zep. This

implies x1 = 1 + x1+r = 1 + x1+2r = · · · = 1 + xl1 = 0/1 x2 = x2+r = x2+2r = · · · = xl2 = 0/1 .. . ... xr−1 = 1 + x2r−1= · · · = 1 + xlr−1 = 0/1 xr = x2r = · · · = xlr = 0/1

where li ≤ (p + r), i = 1, 2, ..., r. According to the above system of equations, the

binary values at (p + r) positions are determined by the binary values at r positions and these r positions can take any values. Hence, the total number of (p + r)-tuples for which Xp

r = 1 is given by 2r.

Considering Ui = 1 and Vj = 1 for all i ∈ Zop and j ∈ Zep as a system of linear

the Hamming weight of Xp

r is equal to the number of solutions of the system of

linear equations, which is equal to 2p+r−r _{= 2}r _{for any positive integer r.} ₂

Proposition 5.3.2 For any integer r ≥ 1, the Hamming weight of Jn−1

r is equal

to 2r_.

Proof The proof is similar to the proof of Proposition 5.3.1. 2

Proposition 5.3.3 For any integer k ≥ 1 and n ≥ 2, the Hamming weight of function In

k is equal to 2k+1. One can approximate function Ikn by the zero function

with probability (1 −₂n−11 −

1 2n+k−1).

Proof By Proposition 5.3.1, the Hamming weight of X_jn+k−1−j, i.e, H(X_jn+k−1−j) is equal to 2j_{, for 0 ≤ j ≤ k−1. Note that X}n+k−1−j

j = 1 is a system of linear equations

with (n+k−1−j) equations and (n+k−1) unknown variables and Supp(X_jn+k−1−j) contains the set of all solutions. It is not hard to show that the support of X_in+k−1−i and X_jn+k−1−j are disjoint for 0 ≤ i 6= j ≤ n − 1. Again, (∪k−2_j=0Supp(X_jn+k−1−j)) ⊂ Supp(J_kn−1), and Supp(X_k−1n+k−1) and Supp(J_kn−1) are disjoint. Then the cardinality of the support of In

k is equal to (2k+2k−1−

Pk−2

j=02

j_{) = (2}k₊₂k−1₋₂k−1_{+1) = 2}k_+1.

Hence, the Hamming weight of In

k is 2k+ 1.

Since the Hamming weight of In

k is 2k+ 1, the number of inputs for which Ikn

takes the value zero is equal to (2n+k−1_{− 2}k_{− 1). Hence, one can approximate the}

function In

k by the zero function with probability (1 − 1 2n−1 −

2n+k−1). 2

Proposition 5.3.4 For any n, k ≥ 1, the nonlinearity of function I_kn is equal to NIn

k = H(I

k) = 2k+ 1 for n > 3.

Proof It is well-known that the nonlinearity of a Boolean function can be obtained by calculating the minimum distance between the function and all affine functions [23]. In Proposition 5.3.3, we calculated the cardinality of Supp(In

k), which is

equal to (2k _{+ 1). The minimum distance between I}n

k and all affine functions is

achieved only for the zero function, and that is equal to (2k_{+ 1). For all other}

nonzero functions, the least distance between In

k and an affine function can be

(2n+k−2_{− (2}k_{+ 1)). For n > 3, (2}n+k−2_{− (2}k_{+ 1)) > (2}k_{+ 1). Therefore, for n > 3,}

the nonlinearity of In

For a small value of k, the function In

k can be approximated by the zero function

or a linear function due to its low nonlinearity.

5.3.2 Cycle Structure of an Approximated Recurrence Re-

lation

By Propositions 5.3.3 and 5.3.4, the function I_kn can be approximated by the zero function with probability about (1 − ₂n−11 ). As a consequence, Eq. (5.1) can be approximated as follows

Rn_k,a(x0, x1, ..., xn+k) = ((xn+ x0) + G(x1, x2, ..., xn−1)) ◦ ψk (5.4)

= g(x0, x2, ..., xn−1) ◦ ψk.

The recurrence relation Rn

k,a = 0 is called an approximated recurrence relation.

In the following proposition, we provide the cycle structure of an approximated recurrence relation.

Lemma 5.3.5 For an integer k ≥ 1, Ω(Rn

k,a) = Ω(g) ⊕ Ω(ψk), i.e., any sequence

x ∈ Ω(Rn

k,a) can be written as x = b + c, where b’s minimal polynomial is the

same as the minimal polynomial of a span n sequence that is generated by g and c’s minimal polynomial is (1 + x)k _{and ⊕ denotes the direct sum operation.}

Proof Let s be a span n sequence generated by g and let h(x) the minimal polynomial of s. Then, h(x) = h1(x) · h2(x) · · · hr(x), where hi’s are distinct irreducible

polynomials of degree less than or equal to n and the value of r depends on the sequence s, see [45, 48, 82]. If hi(x) = (1 + x) for some i, then the sequence s

is not a span n sequence. On the other hand, the minimal polynomial of ψk _is

(1 + x)k_{. Again, the minimal polynomial of a sequence generated by ψ}k _{is a factor}

of (1 + x)k_{. As h(x) does not contain the factor (1 + x), the minimal polynomial}

of s and the minimal polynomial of ψk _{are relatively prime with each other. Then,}

by Lemma 3.2.3, any sequence x ∈ Ω(Rn

k,a) can be represented by x = b + c where

b ∈ Ω(g) and c ∈ Ω(ψk_{). Hence, the cycle decomposition of R}n

k,a is a direct sum of

Ω(g) and Ω(ψk_{), i.e., Ω(R}n

Proposition 5.3.6 The cycle decomposition of Rn

k,a, i.e., Ω(Rnk,a) contains 2 ·

(Γ2(k) + 1) cycles with (Γ2(k) + 1) cycles of period at least (2n− 1) and (Γ2(k) + 1)

cycles of period at most 2dlog2ke, where Γ

2(k) is the number of all coset leaders

modulo (2k_{− 1).}

Proof For any positive integer k ≥ 1, the cycle decomposition of ψk is the cycle decomposition of polynomial (1 + x)k, which contains sequences with period 2dlog2ie_{, 1 ≤ i ≤ k, and the number of cycles is given by (Γ}

2(k) + 1) including the

zero cycle (see [43], Th. 3.4, page-42). Again, the cycle decomposition of g contains only two cycles, one is a cycle of length 2n− 1 and the other one is the zero cycle of length one. Therefore, by Lemma 5.3.5, Ω(Rn_k,a) contains 2 · (Γ2(k) + 1) cycles

where (Γ2(k) + 1) cycles are of length at least 2n− 1 and (Γ2(k) + 1) cycles are of

length at most 2dlog2ke_. 2

Remark 5.3.7 If the function Rn

k is approximated by the function (Rnk,a+ J n−1 k )

with high probability, then the number of cycles in Ω(Rn k,a+J

n−1

k ) equals (Γ2(k)+1),

and the period of a sequence in Ω(Rn k,a+ J

n−1

k ) is bounded below by 2n.

Proposition 5.3.8 Let Ω(Rn

k,a) be the cycle decomposition of Rnk,a. For any se-

quence x ∈ Ω(Rn

k,a) with period at least 2n− 1, the linear complexity of x is bounded

below by the linear complexity of the sequence generated by g.

Proof We already showed in Lemma 5.3.5 that any sequence x ∈ Ω(Rn

k,a) can be

written as x = b + c where b ∈ Ω(g), c ∈ Ω(ψk_{), and the minimal polynomial of b}

is coprime with the minimal polynomial of c. Since the minimal polynomial of b is coprime with the minimal polynomial of c, the linear complexity of x is equal to the sum of the linear complexities of b and c. Therefore, the linear complexity of x is greater or equal to the linear complexity of sequence b generated by g. Hence,

the assertion is established. 2

Remark 5.3.9 Using recurrence relation (5.1) with G as a linear function, one can generate a de Bruijn sequence with period 2n+k _{and linear complexity at least}

sequence is not suitable for using it as a building block in designing a cryptographic primitives such as PRSGs or stream ciphers, because in the entire sequence most of the bits are linearly related to the internal state bits and only at H(In

k) positions

the bits are nonlinearly related to the internal state bits due to the nonlinear term In

k, which is vulnerable against a cryptanalytic attack. For a more detailed analysis

on both linearly and nonlinearly composed de Bruijn sequences, we refer the reader to [71]. On the other hand, if the function g is nonlinear, then the bits of the de Bruijn sequence will be nonlinearly related to the internal state bits of the NLFSR, thereby a cryptanalytic attack would be more complex.

Propositions 5.3.3, 5.3.6, and 5.3.8 suggest that in order to generate a strong de Bruijn sequence by the composited construction, the starting span n sequence generated by g should have good randomness properties, particularly, long period and an optimal or near-optimal linear complexity. If an attacker is successful in approximating the feedback function Rn_k by the feedback function (g ◦ ψk), then the security of the sequence generated by R_kn depends on the security of the sequence generated by g. If the de Bruijn sequences are used as building blocks in PRNGs and stream ciphers, an attack would not have direct access to a de Bruijn sequence.

In document Design and Analysis of Cryptographic Pseudorandom Number/Sequence Generators with Applications in RFID (Page 90-95)