(by M H Sørensen) - Computability and Complexity: From a Programming Perspective

11.1 Introduction

In the introduction to this book we mentioned Hilbert’s famous list of open problems at the International Congress of Mathematicians in 1900. The tenth problem is stated as follows:

Given a Diophantine equation with any number of unknown quantities and with rational integral numerical coefficients: to devise a process according to which it can be determined by a finite number of operations whether the equation is solvable in rational integers.

In modern terms, the problem is to give an algorithm which for a polynomial equation with integer coefficients can decide whether or not it has a solution in integers. An equation of this form is called Diophantine, after the Greek mathematician Diophantus from the third century, who was interested in such equations.

Hilbert’s Tenth problem is an example of a problem which is of independent interest in another field than computability theory, namely number theory. For instance, Fermat’s famous Last Theorem states that the equation

(p+ 1)n+3+ (q+ 1)n+3= (r+ 1)n+3

has no solution in natural numbers forp, q, r, n. Whether this is true has long been one of the most famous open problems in number theory.1 _{For each fixed}_{n, Fermat’s Last} Theorem is an instance of Hilbert’s Tenth problem, provided we restrict solutions to the natural numbers—this restriction is not essential as we shall see shortly. Thus, an algorithm deciding for a Diophantine equation whether any solution exists in the natural numbers would prove or disprove Fermat’s Last Theorem for each fixed n. Conversely, it has later been realized that unsolvability of Hilbert’s Tenth problem would imply unsolvability of many other decision problems in number theory and analysis.

1_{Wiles has recently given a proof of Fermat’s last Theorem which seems to be widely accepted, see}

Annals of Mathematics, May 1995.

From the proof of G¨odel’s famous theorem [54] it follows that every recursively enumerable setAcan be defined by a Diophantine equation preceded by a finite number of existential and bounded universal quantifiers. In his doctoral dissertation, Davis [32, 33] showed that all but one of the bounded quantifiers could be eliminated. Hence, any recursively enumerable setAcan be defined by a Diophantine equation E(x, y, z, x1, . . . , xn,)

as follows:

x∈A⇔ ∃y,∀z≤y,∃x1, . . . ,∃xn:E(x, y, z, x1, . . . , xn)

This form of definition was subsequently calledDavis Normal Form.

To prove that Hilbert’s Tenth problem is unsolvable it remains to eliminate the single bounded universal quantifier; that is, to show that any recursively enumerable setAcan be defined by a Diophantine equationE(x, x1, . . . , xn) thus:

x∈A⇔ ∃x1, . . . ,∃xn:E(x, x1, . . . , xn)

Indeed, ifAis any recursively enumerable, non-recursive set, then an algorithm deciding for any xwhetherE(x, x1, . . . , xn) has a solution, i.e., whether there are x1, . . . , xn such

thatE(x, x1, . . . , xn) holds, would also yield a method to test membership inA, which is

impossible.

While Davis showed how to simplify the form of equations necessary for defining all recursively enumerable sets, Robinson [153] attacked the problem from the other side. She showed that several sets could be defined by Diophantine equations. She also studied so-called exponential Diophantine equations, which allow unknowns in the exponents, and in particular showed, under what is now known as the Julia Robinson hypothesis, that any set definable by an exponential Diophantine equation is also definable by a Diophantine equation.

Davis and Putnam finally managed to eliminate the last bounded quantifier from the Davis Normal Form using the Chinese remainder Theorem and Robinson’s exponential Diophantine equations. The result, sometimes called the Bounded Quantifier Theorem, states in its original form that, if there are arbitrarily long arithmetic progressions consisting entirely of primes, then every recursively enumerable set can be defined by an exponential Diophantine equation. Robinson subsequently managed to eliminate the as- sumption regarding primes (which is still open) and simplify the proof. The resulting joint paper by Davis, Putnam, and Robinson [38] stating that every recursively enumerable set can be defined by an exponential Diophantine equation is now a classic.

It follows from Robinson’s earlier result that to prove the unsolvability of Hilbert’s Tenth problem, it is sufficient to prove that the Julia Robinson hypothesis is true. This

remained an open problem, believed by many to be false, until it was proved ten years later in 1970 by the young Russian mathematician Matiyasevich [116].

In this chapter we give an account of the unsolvability of Hilbert’s Tenth problem, leaving out the details of Matiyasevich’s result. The first section introduces exponential Diophantine equations. The second section develops certain tools that are used in the third section to prove the Davis-Putnam-Robinson Theorem. The fourth section considers Hilbert’s Tenth problem.

11.2 Exponential Diophantine equations and sets

We begin by making the notions of the introduction precise. As is customary we shall be concerned with solutions in natural numbers rather than in integers. We also show that this is an inessential modification.

Definition 11.2.1

1. A functionf:INn→IN isexponential polynomial if it can be written f(x1, . . . , xn) =t

where tis defined by the following grammar with 1≤i≤nandN∈IN:2 t::=xi|N|t1·t2|t1+t2|t1−t2|tt12

2. An equation f(x1, . . . , xn) = 0 is exponential Diophantine iff is exponential poly-

nomial. Asolution is a tuple (a1, . . . , an)∈INn withf(a1, . . . , an) = 0.

3. A setA⊆INnisexponential Diophantineif there exists an exponential polynomial f :INm+n→IN such that

(a1, . . . , an)∈A⇔ ∃x1, . . . ,∃xm:f(a1, . . . , an, x1, . . . , xm) = 0

4. An exponential polynomial not using the last clause fortis apolynomial. Diophan- tine equations and sets are defined by polynomials, like exponential Diophantine equations and sets are defined by exponential polynomials. 2 A few examples illustrating the definition may be useful.

2_{The construction}_tt2

1 may lead outside the integers, e.g.,x

1−y_with_x_{= 2 and}_y_{= 2. Such situations} will be tacitly avoided in what follows.

Example 11.2.2 1. The function

f(x, y, z) = 3x+ 5xy−71z5

is a polynomial, where we write z5 _{instead of}_z_·_z_·_z_·_z_·_{z. Therefore,} 3x+ 5xy−71z5= 0

is a Diophantine equation, and the set of all natural numbers x such that there exists y, zwith 3x+ 5xy−71z5_{= 0 is a Diophantine set.}

2. The function

f(x, y) =x−2y is a polynomial, so

x−2y= 0

is a Diophantine equation. Therefore the set of all even numbers is Diophantine; indeed, it is the set of all natural numbersxsuch that there exists aywithx−2y= 0, i.e.,x= 2y.

3. The function

f(p, q, r, n) = (p+ 1)n+3+ (q+ 1)n+3−(r+ 1)n+3 is an exponential polynomial. Hence,

(p+ 1)n+3+ (q+ 1)n+3−(r+ 1)n+3= 0

is an exponential Diophantine equation. Therefore the set of all x, y, z >0 such that for somek≥3,xk+yk=zk, is exponential Diophantine. 2 In the introduction Diophantine equations hadinteger solutions, but in the preceding definition their solutions werenatural numbers. However, the problem of deciding whether an arbitrary (exponential) Diophantine equation has solution in integers is equivalent to the problem of deciding whether an arbitrary (exponential) Diophantine equation has solution in natural numbers.

To reduce the former problem to the latter, note that there is a solution in integers to the equation

f(x1, . . . , xn) = 0

if and only if there is a solution in natural numbers to the equation f(p1−q1, . . . , pn−qn) = 0

For the opposite reduction, recall that any natural number can be written as the sum of four squares (see, e.g., the appendix to [123]). Hence, there is a solution in natural numbers to the equation

f(x1, . . . , xn) = 0

if and only if there is a solution in integers to the equation f(p2₁+q₁2+r2₁+s2₁, . . . , p2_n+q2_n+r_n2+s2_n) = 0

In conclusion, we have simplified the problem inessentially by considering only natural number solutions.

In a similar vein, we may allow equations of form f(x1, . . . , xn) =g(y1, . . . , ym)

whereg(y1, . . . , ym) is not simply 0, since this is equivalent to

f(x1, . . . , xn)−g(y1, . . . , ym) = 0

We may allow conjunctions of equations

f(x1, . . . , xn) = 0∧g(y1, . . . , ym) = 0

since this conjunction of equations has a solution if and only if there is a solution to the ordinary equation

f(x1, . . . , xn)·f(x1, . . . , xn) +g(y1, . . . , ym)·g(y1, . . . , ym) = 0

Similarly we may allow disjunctions of equations

f(x1, . . . , xn) = 0∨g(y1, . . . , ym) = 0

since this disjunction of equations has a solution if and only if there is a solution to the ordinary equation

f(x1, . . . , xn)·g(y1, . . . , ym) = 0

11.3 Encoding of finite sequences

We shall give a proof of the Davis-Putnam-Robinson theorem using encodings of counter machine computation executions. The idea will be clearer in the next section. Here it suffices to note that for this purpose it will be necessary to have available a means of expressing facts about objects in a sequence of finite, but unknown, length.

There are several such techniques available. The best known, first employed by G¨odel [54], uses the Chinese Remainder Theorem. In the present setting this technique has the disadvantage that it makes it rather hard to express certain necessary operations as exponential Diophantine equations. Therefore another technique was invented by Matiyasevich [118], which we present in this section.

Definition 11.3.1 Fora, b∈IN, let a= n X i=0 ai2i (0≤ai≤1), b= n X i=0 bi2i (0≤bi≤1)

Thebitwise less-than relationab is defined by:

ab⇔ ∀i∈ {0, . . . , n}:ai≤bi 2

The rest of this section is devoted to showing thatab is an exponential Diophantine relation, i.e., that

{(a, b)∈IN×IN|ab}

is a Diophantine set according to Definition 11.2.1. We prove this using two lemmas. The first lemma is due to Robinson [153], and the proof is a modification of Robinson’s proof due to Matiyasevich [120].

Lemma 11.3.2 Define fork≤n,

= n!

(n−k)!k!

and let n_k= 0 whenk > n. The relationm= n_kis exponential Diophantine.

Proof. First, the less-than relation is exponential Diophantine, since a < b⇔ ∃x:a+x+ 1 =b

Second, let [N]B

k be thek’th digit of N written in baseB. For instance, since 5 in base

2 is 101 and we count from the right starting from 0, we have [5]2₂= [5]2₀= 1

and

[5]2₃= [5]2₁= 0 The relationd= [N]B

k is exponential Diophantine since

Finally, by the binomial theorem (B+ 1)n= n X k=0 _n k Bk Note that n_k

is thek’th digit of (B+ 1)n _{written in base} _{B, provided} n k

< B for all k. This, in turn, holds if B >2n _{(see the exercises). Hence,} _m₌ n

k is exponential Diophantine: m= _n k ⇔ ∃B:B= 2n+ 1∧m= [(B+ 1)n]B_k 2

The second lemma necessary to prove that the bitwise less-than relation is exponential Diophantine involves a bit of elementary number theory, which has been banished to the exercises.

Lemma 11.3.3 nk⇔ n_k

is odd

Proof. See the exercises. 2

Proposition 11.3.4 The bitwise less-than relation is exponential Diophantine.

Proof. The relationm= n_kis exponential Diophantine by Lemma 11.3.2. The relation “m is odd” is also exponential Diophantine:

mis odd ⇔ ∃x:m= 2x+ 1

Hence, the proposition follows by Lemma 11.3.3. 2

Ifab thenais also digitwise less thanbusing any other base B, provided the base is a power of 2. The converse does not generally hold; it may be that B is a power of 2, a is digitwise less thanb in base B, and yet a6b. However, if B is a power of 2,a is digitwise less thanb in base B, and all the digits of b in B are 0 or 1, then also ab. All this is perhaps best explained with an example.

Example 11.3.5 For instance, 3443, as can be seen from the first two rows in Fig- ure 11.1. Moreover, 34 is also digitwise less than 43 with base 4, as can be seen from the last two rows in the figure. The reason is that every group of two coefficientsx·2i+1_+y_·₂i

in the base 2 representation is packed into a single coefficientx·2 +y in the base 4 representation. If, in the base 2 representation, all bits in a number are less than or equal to those in another number, then the same holds in the base 4 representation; that is, if x1≤x2 andy1≤y2 thenx1·2 +y1≤x2·2 +y2.

43 = 1·25+ 0·24 + 1·23+ 0·22 + 1·21+ 1·20 34 = 1·25_{+ 0}_·₂4 ₊ ₀_·₂3_{+ 0}_·₂2 ₊ ₁_·₂1_{+ 0}_·₂0

43 = 2·42 + 2·41 + 3·40

34 = 2·42 ₊ ₀_·₄1 ₊ ₂_·₄0

Figure 11.1: Digitwise comparisons in base 2 and 4.

43 = 1·25_{+ 0}_·₂4 ₊ ₁_·₂3_{+ 0}_·₂2 ₊ ₁_·₂1_{+ 1}_·₂0 18 = 0·25_{+ 1}_·₂4 ₊ ₀_·₂3_{+ 0}_·₂2 ₊ ₁_·₂1_{+ 0}_·₂0

43 = 2·42 ₊ ₂_·₄1 ₊ ₃_·₄0

18 = 1·42 ₊ ₀_·₄1 ₊ ₂_·₄0

Figure 11.2: More digitwise comparisons in base 2 and 4.

On the other hand, 18 is digitwise less than 43 in base 4, but 18643, as can be seen from Figure 11.2. The reason is that a group of two coefficients x1·2i+1+y1·2i in the base 2 representation of a may fail to be digitwise less than the corresponding two coefficients x2·2i+1+y2·2i in the base 2 representation of b, even if it holds that x1·2 +y1≤x2·2 +y2. For instance, this happens ifx1< x2 andy1> y2.

However, if all the coefficients in the base 4 representation are 0 or 1, i.e.,x1 andx2

are 0, then this cannot happen. 2

11.4 The Davis-Putnam-Robinson Theorem

In this section we show that any recursively enumerable setAis exponential Diophantine. As mentioned in Section 11.1, the result is due to Davis, Putnam, and Robinson [38]. The present proof is due to Jones and Matiyasevich [74], and is somewhat more in the spirit of this book than the original recursion-theoretic proof.

Any recursively enumerable setA can be represented by a counter machinepin the sense that thatx∈Aiff [[p]](x) terminates. This follows from the fact that counter ma- chines can express all partial recursive functions. The idea, then, is to formalize the execution of any counter machinepby an exponential Diophantine equationf(x, z1, . . . , zk) = 0

Before proceeding with the general construction it will be useful to review an example, taken from [156], which illustrates how this is done.

Example 11.4.1 Consider the following counter machine p: I1: ifX1=0goto4; I2: X1:=X1−1; I3: ifX2=0goto1; I4: stop

We assume that every subtraction command Il: X:=X-1 is preceded by a command

Il−1:if X=0 goto k. We also assume that for every conditionalIl:if X=0 goto k, Ik

is not a subtraction command. This implies that a counter will never be decremented below 0. We write a stop command at the end of the program, and assume that all counters have value 0, when the program terminates. These assumptions present no loss of generality.

Recall that the input is stored in counter X1. If the input to pis 2, then the computation has length 7, (we count a single-step computation to have length 0), and the following commands are executed in order:

1,2,3,1,2,3,1,4

The whole execution, including information about values of counters and the current program point, can be represented by the matrix in Figure 11.3.

7 6 5 4 3 2 1 0 = t 0 0 0 1 1 1 2 2 = x1,t 0 0 0 0 0 0 0 0 = x2,t 0 1 0 0 1 0 0 1 = i1,t 0 0 0 1 0 0 1 0 = i2,t 0 0 1 0 0 1 0 0 = i3,t 1 0 0 0 0 0 0 0 = i4,t

Figure 11.3: Representation of execution of counter machine.

The twox-rows represent the values of the two counters before stept, counting the first step as step 0. For instance,X1 has value 2 before step 0 and 1, sox1,0andx1,1are both 2. X1 has value 1 before step 2, 3, and 4, sox1,2, x1,3, and x1,4 are all 1. The i-rows

express which command is executed in step t. For instance, in step 0, command 1 is executed, soi1,0 is 1, and in step 2 command 3 is executed, and thereforei3,2 is 1.

Instead of representing the values of, say X1, by a row of numbers, we can pack the information into a single number

t=0 xi,tbt

where y= 7 is the length of the computation and b is a number larger than all the numbers in the matrix. With this idea the whole matrix becomes the system of equations in Figure 11.4. 0·b7+ 0·b6+ 0·b5+ 1·b4+ 1·b3+ 1·b2+ 2·b1+ 2·b0 = x1 0·b7₊ ₀_·_b6₊ ₀_·_b5₊ ₀_·_b4₊ ₀_·_b3₊ ₀_·_b2₊ ₀_·_b1₊ ₀_·_b0 ₌ _x 2 0·b7₊ ₁_·_b6₊ ₀_·_b5₊ ₀_·_b4₊ ₁_·_b3₊ ₀_·_b2₊ ₀_·_b1₊ ₁_·_b0 ₌ _i 1 0·b7+ 0·b6+ 0·b5+ 1·b4+ 0·b3+ 0·b2+ 1·b1+ 0·b0 = i2 0·b7₊ ₀_·_b6₊ ₁_·_b5₊ ₀_·_b4₊ ₀_·_b3₊ ₁_·_b2₊ ₀_·_b1₊ ₀_·_b0 ₌ _i 3 1·b7₊ ₀_·_b6₊ ₀_·_b5₊ ₀_·_b4₊ ₀_·_b3₊ ₀_·_b2₊ ₀_·_b1₊ ₀_·_b0 ₌ _i 4

Figure 11.4: Numeric representation of execution of counter machine.

Thus every computation of p on some input x can be represented by certain values of x, x1, x2, i1, . . . , i4, y, b. These values satisfy certain properties corresponding to the details of the computation. For instance, in all executions ofpcommand 2 is followed by execution of command 3. Thus in Figure 11.4 wheneverbj_{has coefficient 1 in}_i

2,bj+1has coeffient 1 ini3. This is a purely numerical relationship between the values ofi2 andi3. These relationships can be expressed as a set of equations such that every computation ofpon some input gives a solution to the equations, and every solution to the equations

correspond to a computation ofp. 2

The idea, in general, is now to translate any counter machinepinto an exponential Dio- phantine equation such that if [[p]](x) terminates, then the details of the computation— the number of steps, the values of the counters, etc.— give a solution to the equation, and conversely, for every solution of the equation, there is a corresponding terminating computation.

Proof. LetA be any recursively enumerable set and p be a counter machine such that x∈Aiff [[p]](x) terminates. Supposephas form

p=I1. . .In (with counters X1, . . . ,Xm) (11.1)

We now derive an exponential Diophantine equation

f(x, x1, . . . , xm, i1, . . . , in, y, b, U) = 0 (11.2)

such that

[[p]](x) terminates ⇔ (11.2) has a solution (11.3)

More precisely we derive 12 equation schemes which can be combined into a single conjunction using the technique in Section 11.2.

1. First of all, we need a base b for the representation in Figure 11.4. Recall that b must be larger than all the coefficients in order for the representation to be correct. Since the initial value of counterX1isxand the other counters are initialized to 0, no counter value can exceedx+y wherey is the number of steps of the computation. Therefore,

b= 2x+y+n (1)

is large enough. We shall need later two additional facts aboutb, both satisfied by the above choice: thatbis a power of 2 and thatb > n.

2. It will be convenient to have a number whose representation in base bis a string of lengthy consisting entirely of 1’s. This is the numberby−1₊_{· · ·}₊_b_{+ 1. This number} satisfies the equation

1 +bU=U+by (2)

In document Computability and Complexity: From a Programming Perspective - Free Computer, Programming, Mathematics, Technical Books, Lecture Notes and Tutorials (Page 187-200)