Ruler-and-Compass Constructions
7.1 Approximate Constructions and More General Instruments
For the technical drawing expert we emphasise that we are discussing exact con-structions. There are many approximate constructions for trisecting the angle, for instance, but no exact methods. Dudley (1987) is a fascinating collection of approxi-mate methods that were thought by their inventors to be exact. Figure 10 is a typical example. To trisect angle BOA, draw line BE parallel to OA. Mark off AC and CD equal to OA, draw arc DE with centre C and radius CD. Drop a perpendicular EF to OD and draw arc FT centre O radius OF to meet BE at T. Then angle AOT approxi-mately trisects angle BOA. See Exercise 7.10.
FIGURE 10: Close—but no banana.
The Greeks were well aware that by going outside the Platonic constraints, all three classical problems can be solved. Archimedes and others knew that angles can be trisected using a marked ruler, as in Figure 11. The ruler has marked on it two points distance r apart. Given∠AOB = θ draw a circle centre O with radius r, cutting OA at X, OB at Y. Place the ruler with its edge through X and one mark on the line OY at D; slide it until the other marked point lies on the circle at E. Then∠EDO
= θ /3. For a proof, see Exercise 7.3. Exercise 7.14 shows how to duplicate the cube using a marked ruler.
Setting your compasses up against the ruler so that the pivot point and the pen-cil effectively constitute such marks also provides a trisection, but again this goes beyond the precise concept of a ‘ruler-and-compass construction’. Many other uses of ‘exotic’ instruments are catalogued in Dudley (1987), which examines the history of trisection attempts. Euclid may have limited himself to an unmarked ruler (plus compasses) because it made his axiomatic treatment more convincing. It is not en-tirely clear what conditions should apply to a marked ruler—the distance between the marks causes difficulties. Presumably it ought to be constructible, for example.
The Greeks solved all three problems using conic sections, or more recondite curves such as the conchoid of Nichomedes or the quadratrix (Klein 1962, Coolidge
FIGURE 11: Trisecting an angle with a marked ruler.
1963). Archimedes tackled the problem of Squaring the Circle in a characteristically ingenious manner, and proved a result which would now be written
31071< π < 317
This was a remarkable achievement with the limited techniques available, and refine-ments of his method can approximate π to any required degree of precision.
Such extensions of the apparatus solve the practical problem, but it is the theo-retical one that holds the most interest. What, precisely, are the limitations on ruler-and-compass constructions? With the machinery now at our disposal it is relatively simple to characterise these limitations, and thereby give a complete answer to all three problems. We use coordinate geometry to express problems in algebraic terms, and apply the theory of field extensions to the algebraic questions that arise.
7.2 Constructions in C
We begin by formalising the notion of a ruler-and-compass construction. Assume that initially we are given two distinct points in the plane. Equivalently, by Euclid’s Axiom 1, we can begin with the line segment that joins them. These points let us choose an origin and set a scale. So we can identify the Euclidean plane R2with C, and assume that these two points are 0 and 1.
Euclid dealt with finite line segments (his condition (1) above) but could make them as long as he pleased by extending the line (condition (2)). We find it more convenient to work with infinitely long lines (modelling an infinitely long ruler), which in effect combines Euclid’s conditions into just one: the possibility of drawing the (infinitely long) line that passes through two given points. From now on, ‘line’ is always used in this sense.
Constructions in C 91 If z1, z2∈ C and 0 ≤ r ∈ R, define
L(z1, z2) = the line joining z1to z2 (z16= z2) C(z1, r) = the circle centre z1with radius r > 0 We now define constructible points, lines, and circles recursively:
Definition 7.2. For each n ∈ N define sets Pn,Ln, andCnof n-constructible points, lines, and circles, by:
P0 = {0, 1}
L0 = /0 C0 = /0
Ln+1 = {L(z1, z2) : z1, z2∈Pn}
Cn+1 = {C(z1, |z2− z3|) : z1, z2, z3∈Pn}
Pn+1 = {z ∈ C : z lies on two distinct lines in Ln+1} ∪
{z ∈ C : z lies on a line in Ln+1and a circle inCn+1} ∪ {z ∈ C : z lies on two distinct circles in Ln+1}
FIGURE 12: The setP1. Figure 12 shows that
P1= {−1, 0, 1, 2,1 ± i√ 3
2 }
Lemma 7.3. For all n ∈ N,
Pn⊆Pn+1 Ln⊆Ln+1 Cn⊆Cn+1
and each is a finite set.
Proof. The inclusions are clear. Let pnbe the number of points inPn, lnthe number of lines inLn, and cnthe number of circles inCn. Then
|Ln+1| ≤ 12pn(pn+ 1)
|Cn+1| ≤ pn12pn(pn+ 1)
|Pn+1| ≤ 12ln+1(ln+1+ 1) + 2lncn+ cn+1(cn+1+ 1)
bearing in mind that a line or circle meets a distinct circle in ≤ 2 points. By induction, all three sets are finite for all n.
We formalise a Euclidean ruler-and-compass construction using these sets. The intuitive idea is that starting from 0 and 1, such a construction generates a finite sequence of points by drawing a line through two previously constructed points, or a circle whose centre is one previously constructed point and whose radius is the distance between two previously constructed points, and then defining a new point using intersections of these.
Definition 7.4. A point z ∈ C is constructible if there is a finite sequence of points z0= 0, z1= 1, z2, z3, . . . zk= z (7.1) such that zj+1lies in at least one of:
L(zj1, zj2) ∩ L(zj3, zj4) L(zj1, zj2) ∩C(zj3, |zj4− zj5|) C(zj1, |zj2− zj3|) ∩C(zj4, |zj5− zj6|) where all ji≤ j and the intersecting lines and circles are distinct.
In the first case, the lines must not be parallel in order to have non-empty in-tersection; in the other cases, the line must meet the circle and the two circles must meet. These technical conditions can be expressed as algebraic properties of the zj.
We can now prove:
Theorem 7.5. A point z ∈ C is constructible if and only if z ∈ Pnfor some n∈ N.
Proof. Let z ∈ C be constructible, using the sequence (7.1). Inductively, it is clear that z = zk∈Pk.
Conversely, let z ∈Pk. Then we can find a sequence zj∈Pj, where 0 ≤ j ≤ k, satisfying (7.1).
To characterise constructible points, we need:
Definition 7.6. The Pythagorean closure Qpyof Q is the smallest subfield K ⊆ C with the property:
z∈ K =⇒ ±√
z∈ K (7.2)
The Pythagorean closure of Q exists because every subfield of C contains Q, so Qpy is the intersection of all subfields of C satisfying (7.2).
Constructions in C 93 The main theorem of this section is:
Theorem 7.7. A point z ∈ C is constructible if and only if z ∈ Qpy. Equivalently,
∞ [
n=0
Pn= Qpy (7.3)
Pre-proof Discussion.
We can summarise the main idea succinctly. Coordinate geometry in C shows that each step in a ruler-and-compass construction leads to points that can be expressed using rational functions of the previously constructed points together with the square root of a rational function of those points. Conversely, all rational functions of given points can be constructed, and so can square roots of given points. Therefore anything that can be constructed lies in Qpy, and anything in Qpycan be constructed.
The details require some algebraic computations in C and some basic Euclidean geometry. We prove Theorem 7.7 in two stages. In this section we show that
(A) Pn⊆ Qpyfor all n ∈ N.
In the next section, after describing some basic constructions for arithmetical opera-tions and square roots, we complete the proof by establishing
(B) If z ∈ Qpythen z ∈Pnfor some n ∈ N.
Equation (7.3) is an immediate consequence of (A) and (B).
Proof of Part (A). Part (A) follows by coordinate geometry in C ≡ R2. The details are tedious, but we give them for completeness. Use induction on n. Since P0= {0, 1} ⊆ Q, we have P0∈ z. Suppose inductively thatPn⊆ Qpy, and let z ∈Pn+1. We have to prove that z ∈ Qpy.
There are three cases: line meets line, line meets circle, circle meets circle.
Case 1: Line meets line. Here {z} = L(z1, z2) ∩ L(z3, z4) where the zj∈Pn⊆ Qpy (induction hypothesis) and the lines are distinct. Therefore there exist real α, β such that
z= αz1+ (1 − α)z2
z= β z3+ (1 − β )z4 Therefore
α =β (z3− z4) + z4− z2 z1− z2 Since α, β ∈ R, we also have
α =β (z3− z4) + z4− z2 z1− z2
where the bar is complex conjugate. These two equations have a unique solution for
α , β because we are assuming that the lines meet at a unique point z, and the solution is:
α = z2(z4− z3) + z2(z3− z4) − z3z4+ z4z3 (z1− z2)(z3− z4) + (z4− z3)(z1− z2) β = z3(z1− z2) + z3(z2− z1) − z2z1+ z1z2
(z4− z3)(z2− z1) + (z1− z2)(z4− z3) so α, β ∈ Qpy. Then z = αz1+ (1 − α)z2∈ Qpy.
Case 2: Line meets circle. Here z ∈ L(z1, z2) ∩ C(z3, |z4− z5|) where the zj ∈ Pn⊆ Qpy(induction hypothesis). Let r = |z4− z5|. There exist α, θ ∈ R such that
z= αz1+ (1 − αz2) z= z3+ reiθ Therefore
α (z1− z2) + z2 = z3+ reiθ α (z1− z2) + z2 = z3+ re−iθ
where we take the complex conjugate to get the second equation. We can eliminate θ to get
(α(z1− z2) + z2− z3)(α(z1− z2) + z2− z3) = reiθ.re−iθ = r2= (z4− z5)(z4− z5) which is a quadratic equation for α with coefficients in Qpy. Since the quadratic formula involves only rational functions of the coefficients and a square root, α ∈ Qpy. Therefore z ∈ Qpy.
Case 3: Circle meets circle. Here z ∈ C(z1, |z2− z3|) ∩ C(z4, |z5− z6|) where the zj ∈Pn⊆ Qpy (induction hypothesis). Let r = |z2− z3|, s = |z5− z6|. There exist θ , φ ∈ R such that
z = z1+ reiθ z = z4+ seiφ Take conjugates and eliminate θ , φ as above to get
(z − z1)(z − z1) = r2 (z − z4)(z − z4) = s2
Solving for z and z (left as an exercise) we find that z satisifies a quadratic equation with coefficients in Qpy. Therefore z ∈ Qpy.