Prime testing - Complexity of Algorithms

Let mbe an odd natural number, we want to decide whether it is a prime. We have seen in the previous chapter that this problem is in NP∩co−NP. The witnesses described there did not lead, however, (at least for the time being) to a polynomial-time prime test. We will therefore give first a new, more complicated NP description of compositeness.

Theorem 7.3.1 “Little” Fermat Theorem Ifm is a prime thenam−1₋₁ _{is divisible by}_m

for all natural numbers1≤a≤m−1.

If—for a givenm—the integeram−1₋_{1 is divisible by}_m_{then we say that}_a_{satisfies the}

Fermat condition.

The Fermat condition, when required for all integers 1 ≤a ≤m−1, also characterizes primes:

Lemma 7.3.2 An integer m >0 is a prime if and only all integers1 ≤a≤m−1 satisfy the Fermat condition.

Indeed, if mis composite, then we can choose foraan integer not relatively prime tom, and thenam−1₋_{1 is obviously not divisible by}_m_.

A further nice feature of the Fermat condition is that it can be checked in polynomial time for givenmanda. This was discussed in chapter 5.

Of course, we cannot check the Fermat condition for everya: this would take exponential time. The question is therefore to whicha’s should we apply it?

We could just trya= 2. This is in fact not a bad test, and it reveals the non-primality of many (in a sense, most) composite numbers, but it may fail. For example, 561 = 3·11·17 is not a prime, but 561|2560₋_{1. Any other specific choice of}_a_{would have similar shortcoming.}

The next idea is to select as randomaand check Fermat condition. Ifmis a prime, then

awill of course satisfy it. Suppose thatmis not prime, then at least thosea’s not relatively prime tomwill violate it. Unfortunately, the number of sucha’s may be miniscule compared with the number of all choices fora, and so the probability that our random choice will pick one is negligible. (In fact, we can compute the greatest common divisor of a and m right away: if we can find an anot relatively prime tom, then this will yield a proper divisor of

m.)

So we need to usea’s relatively prime tom. Unfortunately, there are composite numbers

m(the so-called pseudo-primes) for which the Fermat condition is satisfied forallarelatively prime tom; for such numbers, it will be especially difficult to find an integeraviolating the condition. (Such a pseudo-prime is e.g. 561 = 3·11·17.)

7.3. PRIME TESTING 127 But at least of m is not a pseudo-prime, then the random choice for a works. This is guaranteed by the following lemma.

Lemma 7.3.3 Ifm is not a prime and not a pseudo-prime then at most half of the integers

a,1≤a≤m−1 relatively prime tom satisfies the Fermat condition. Note that none of the non-relatively-primea’s satisfy the Fermat condition.

Proof. Sincem is not a pseudoprime, there is at least one b relatively prime tom such that bm−1₋_{1 is not divisible by}_m_{. Now if}_a_{is “bad”, i.e.,}_am−1₋_{1 is divisible by} _m_{, then}

ab modmis “good”:

(ab)m−1₋_{1 = (}_am−1₋₁₎_bm−1₊_bm−1₋₁_,

and here the first term is divisible by mbut the second is not.

Hence for everya that is “bad”, we find anothera (namelyab modm) that is “good”. It is easy to see that differenta’s yield different “good” numbers. Thus at least half of the

numbers amust be good. ¤

Thus, ifmis not a pseudo-prime then the following randomized prime test works: check whether a randomly chosen integer 1≤a≤m−1 satisfies the Fermat condition. If not then we know that m is not a prime. If yes then repeat the procedure. If we found 100 times, independently of each other, that the Fermat condition is satisfied then we say that m is a prime. It can still happen that m is composite, but if the probability that we picked an integer asatisfying the condition is less than 1/2 at every step, and hence the probability that this occurs 100 times in a row is less than 2−100_.

Unfortunately, this method fails for pseudoprimes (it finds them prime with large probability). It turns out that one can modify the Fermat condition just a little to overcome this difficulty. Let us write the numberm−1 in the form 2k_M _where_M _{is odd. We say that} _a

satisfies the Miller conditionif at least one of the numbers

aM₋₁_{, a}M _{+ 1}_{, a}2M_{+ 1}_{, a}4M _{+ 1}_{, . . . , a}2k−1_M

+ 1

is divisible by m. Note that the product of these numbers is just am−1₋_{1. Hence every}

number satisfying the Miller condition also satisfies the Fermat condition.

Ifmis a prime then it divides this product, and hence it divides one of these factors, i.e., everyasatisfies the Miller condition. Ifmis composite then, however, it could happen that some asatisfies the Fermat condition but not the Miller condition (m can be a divisor of a product without being the divisor of any of its factors).

Thus the Miller condition provides a potentially stronger primality test than the Fermat condition. The question is: how much stronger?

We will need some fundamental facts about pseudoprimes. Lemma 7.3.4 Every pseudoprimem is(a)Odd

(b)Squarefree (not divisible by any square). Proof.

(a)Ifm >2 is even thena=m−1 will violate the Fermat condition, since (m−1)m−1_≡

−16≡1 (modm).

(b) Assume that p2 _| _m_{; let} _k _{be the largest exponent for which} _pk _| _m_{. Then} _a ₌

m/p−1 violates the Fermat condition since the last two terms of the binomial expansion of (m/p−1)m−1_are₋₍_m₋₁₎₍_m/p_{) + 1}_≡_m/p_{+ 1}_6≡_{1 (mod}_pk_{) (all earlier terms are divisible}

bypk_{) and if an integer is not divisible by}_pk _{then it is not divisible by}_m_either. _¤

Lemma 7.3.5 Let m=p1p2· · ·pt where the pi’s are different primes. The relation ak ≡1

(modm) holds for alla relatively prime to m if and only if pi−1 divides k for all i with

1≤i≤t.

Proof. Ifpi−1 divides k for alli with 1≤i≤t then ak−1 is divisible bypi according

to the little Fermat Theorem and then it is also divisible by m. Conversely, suppose that

ak _≡_{1 (mod}_m_{) for all} _a_{relatively prime to} _m_{. If e.g.} _p

1−1 would not divide k then let

g be a primitive root modulo p1 (the existence of primitive roots was stated in Chapter 5)

According to the Chinese Remainder Theorem, there is a residue class h modulo m with

h ≡g (modp1) and h≡ 1 (modpi) for all i ≥ 2. Then (h, m) = 1 and p1 6 |hk−1, so

m 6 |hk₋_1. _¤

Corollary 7.3.6 The number m is a pseudoprime if and only if m=p1p2· · ·pt where the

pi’s are different primes, t≥2, and(pi−1) divides (m−1)for all iwith 1≤i≤t.

7.3.7 Remark This is how one can show about the above example 561 that it is a pseudo- prime.

Now we can prove the main fact that makes the Miller test better than the Fermat test. Theorem 7.3.8 If m is a composite number then at least half of the numbers 1, . . . , m−1 violate the Miller condition.

Proof. Since we have already seen that the lemma holds for non-pseudoprimes, in what fol- lows we can assume thatmis a pseudoprime. Letp1· · ·pt(t≥2) be the prime decomposition

7.3. PRIME TESTING 129 ofm. By the above, these primes are all odd and distinct, and we have (pi−1)|(m−1) = 2kM

for alliwith 1≤i≤t.

Letl be the largest exponent with the property that none of the numberspi−1 divides

2l_M_{. Since the numbers}_p

i−1 are even whileM is odd, such an exponent exists (e.g. 0) and

clearly 0≤l < k. Further, by the definition ofl, there is aj for whichpj−1 divides 2l+1M.

Therefore pj−1 divides 2sM for all s with l < s≤ k, and hence pj divides a2

s_M

−1 for all primitive residue classesa. Consequentlypj cannot dividea2

s_M

+ 1 which is larger by 2, and hencemdoes not dividea2s_M

+ 1 either. If thereforeais a residue class that is a Miller accomplice thenmmust already be a divisor of one of the remainder classesaM₋₁_{, a}M_{+ 1,}

a2M _{+ 1}_{, . . .}_,_a2l−1_M

+ 1, a2l_M

+ 1. Hence for each sucha, the numberm divides either the product of the first l+ 1, which is (a2l_M

−1), or the last one, (a2l_M

+ 1). Let us call the primitive residue class amodulo man “accomplice of the first kind” if a2l_M

≡1 (modm) and an “accomplice of the second kind” ifa2l_M

≡ −1 (modm).

Let us estimate first the number of accomplices of the first kind. Consider an indexiwith 1 ≤i ≤t. Since pi−1 does not divide the exponent 2lM, Lemma 7.3.5 implies that there

is a number a not divisible bypi for whicha2

l_M

−1 is not divisible by pi. The reasoning

of Lemma 7.3.3 shows that then at most half of the mod pi residue classes will satisfy the

Fermat comdition belonging to the above exponent, i.e. such thata2l_M

−1 is divisible bypi.

According to the Chinese Remainder Theorem, there is a one-to-one correspondence between the primitive residue classes with respect to the productp1· · ·ptas modulus and thet-tuples

of primitive residue classes modulo the primes p1, . . . , pt. Thus, modulo p1· · ·pt, at most

a 2t_{-th fraction of the primitive residue classes is such that every} _p

i divides (a2

l_M

−1). Therefore at most a 2t_{-th fraction of the mod}_m_{primitive residue classes are accomplices of}

the first kind.

It is easy to see that the product of two accomplices of the second kind is one of the first kind. Hence multiplying all accomplices of the second kind by a fixed one of the second kind, we obtain accomplices of the first kind, and thus the number of accomplices of the second kind is at least as large as the number of accomplices of the first kind. (If there is no accomplice of the second kind to multiply with then the situation is even better: zero is certainly not greater than the number of accomplices of the first kind.) Hence even the two kinds together make up at most a 2t−1_{-th part of the primitive residue classes, and so (due}

to t≥2) at most a half. ¤

Lemma 7.3.9 For a given m and a, it is decidable in polynomial time whether a satisfies the Miller condition.

For this, it is enough to recall from Chapter 3 that the remainder of ab _modulo _c _is

computable in polynomial time. Based on these three lemmas, the following randomized algorithm, called the Miller–Rabin test, can be given for prime testing:

7.3.10 Algorithm We choose a number between 1 and m−1 randomly and check whether it satisfies the Miller condition. If it does not thenmis composite. If it does then we choose a newa. If the Miller condition is satisfied 100 times consecutively then we declare that m

is a prime.

If m is a prime then the algorithm will certainly assert this. If m is composite then the number a chosen randomly violates the Miller condition with probability 1/2. After hundred independent experiments the probability will therefore be at most 2−100 _{that the}

Miller condition is not violated even once, i.e., that the algorithm asserts thatmis a prime. 7.3.11 Remarks

1. Ifmis found composite by the algorithm then — interestingly enough — we see this not from its finding a divisor but from the fact that one of the residues violates the Miller condition. If at the same time, the residueadoes not violate the Fermat condition, then

mcannot be relatively prime to each of the numbersaM₋_1,_aM_{+ 1,}_a2M_{+ 1,}_a4m_{+ 1,}

· · ·, a2k−1_M

+ 1, therefore computing its greatest common divisors with each, one of them will be a proper divisor of m. No polynomial algorithm (either deterministic or randomized) is known for finding a factorization in the case when the Fermat condition is also violated. This problem is significantly more difficult also in practice than the testing of primality. We will see in the section on cryptography that this empirical fact has important applications.

2. For a given m, we can try to find an integer a violating the Miller condition not by random choice but by trying out the numbers 1,2, etc. It is not known how small is the first such integer if m is composite. Using, however, a hundred year old conjecture of analytic number theory, the so-called Generalized Riemann Hypothesis, (which is too technical to be formulated here) one can show that it is not greater than logm. Thus, this deterministic prime test works in polynomial time if the Generalized Riemann Hypothesis is true.

We can use the prime testing algorithm learned above to look for a prime number with

ndigits (say, in the binary number system). Choose namely a numberkrandomly from the interval [2n−1_,₂n₋_{1] and check whether it is a prime, say, with an error probability of at}

7.4. RANDOMIZED COMPLEXITY CLASSES 131

In document Complexity of Algorithms (Page 132-137)