The Euclidean Algorithm - Factorisation of Polynomials

Factorisation of Polynomials

3.1 The Euclidean Algorithm

In number theory, one of the key concepts is divisibility: an integer a is divisible by an integer b if there exists an integer c such that a = bc. For instance, 60 is divisible by 3 since 60 = 3 · 20, but 60 is not divisible by 7. Divisibility properties of integers lead to such ideas as primes and factorisation. We wish to develop similar ideas for polynomials.

Many important results in the factorisation theory of polynomials derive from the observation that one polynomial may always be divided by another provided that a ‘remainder’ term is allowed. This is a generalisation of the Remainder Theorem, in which f is assumed to be linear.

Proposition 3.1 (Division Algorithm). Let f and g be polynomials over K, and suppose that f is non-zero. Then there exist unique polynomials q and r over K, such that g = fq + r and r has strictly smaller degree than f.

Proof. Use induction on the degree of g. If ∂g = −∞ then g = 0 and we may take q = r = 0. If ∂g = 0 then g = k is an element of K. If also ∂f = 0 then f is an element of K, and we may take q = k/f and r = 0. Otherwise ∂f > 0 and we may take q = 0 and r = g. This starts the induction.

Now assume that the result holds whenever the degree of g is less than n, and let ∂g = n > 0. If ∂f > ∂g, then we may as before take q = 0, r = g. Otherwise

f = a_m t^m + · · · + a₀        g = b_n t^n + · · · + b₀

where a_m ≠ 0 ≠ b_n and m ≤ n. Let

g₁ = b_n a_m⁻¹ t^(n−m) f − g

Since the terms of highest degree cancel (which is the object of the exercise) we have ∂g₁ < ∂g. By induction there are polynomials q₁ and r₁ over K such that g₁ = fq₁ + r₁ and ∂r₁ < ∂f. Let

q = b_n a_m⁻¹ t^(n−m) − q₁        r = −r₁

Then

fq + r = b_n a_m⁻¹ t^(n−m) f − q₁f − r₁ = (g + g₁) − g₁ = g

so g = fq + r; clearly ∂r < ∂f as required.

Finally we prove uniqueness. Suppose that

g = fq₁ + r₁ = fq₂ + r₂        where ∂r₁, ∂r₂ < ∂f

Then f(q₁ − q₂) = r₂ − r₁. By Proposition 2.2, the polynomial on the left has higher degree than that on the right, unless both are zero. Since f ≠ 0 we must have q₁ = q₂ and r₁ = r₂. Thus q and r are unique.

With the above notation, q is called the quotient and r is called the remainder on dividing g by f . The inductive process we employed to find q and r is called the Division Algorithm.
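The inductive step of the proof translates directly into a loop: cancel the leading term of the current remainder against a suitable multiple of f, and repeat until the degree drops below ∂f. The following is a minimal sketch over K = Q, not a definitive implementation. It assumes polynomials are stored as Python lists of coefficients with the constant term first, and the names trim and poly_divmod are illustrative choices, not notation from the text.

```python
from fractions import Fraction

def trim(p):
    """Drop trailing zeros so that the last entry is the leading coefficient."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def poly_divmod(g, f):
    """Return (q, r) with g = f*q + r and deg r < deg f.

    Assumed representation: coefficient lists, constant term first;
    the zero polynomial is the empty list.
    """
    f = trim([Fraction(c) for c in f])
    r = trim([Fraction(c) for c in g])
    if not f:
        raise ZeroDivisionError("f must be non-zero")
    q = [Fraction(0)] * max(len(r) - len(f) + 1, 0)
    while len(r) >= len(f):
        shift = len(r) - len(f)      # the exponent n - m in the proof
        coeff = r[-1] / f[-1]        # the coefficient b_n / a_m in the proof
        q[shift] = coeff
        for i, a in enumerate(f):    # subtract coeff * t^shift * f
            r[i + shift] -= coeff * a
        r = trim(r)                  # the leading term has cancelled
    return trim(q), r

# Dividing t^2 + 1 by 2t: q represents t/2 and r represents 1.
q, r = poly_divmod([1, 0, 1], [0, 2])
```

Exact rational arithmetic (Fraction) is used so that the leading terms cancel exactly; floating-point coefficients would need a tolerance in trim.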

Example 3.2. Divide g(t) = t⁴ − 7t³ + 5t² + 4 by f(t) = t² + 3 and find the quotient and remainder.

Observe that

t²(t² + 3) = t⁴ + 3t²

has the same leading term as g. Then

g − t²(t² + 3) = −7t³ + 2t² + 4

which has the same leading term as

−7t(t² + 3) = −7t³ − 21t

Therefore

g − t²(t² + 3) + 7t(t² + 3) = 2t² + 21t + 4

which has the same leading term as

2(t² + 3) = 2t² + 6

Therefore

g − t²(t² + 3) + 7t(t² + 3) − 2(t² + 3) = 21t − 2

So

g = (t² + 3)(t² − 7t + 2) + (21t − 2)

and the quotient q(t) = t² − 7t + 2, while the remainder r(t) = 21t − 2.
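A result of this kind is easy to check independently by multiplying out fq + r and comparing with g. The sketch below does this for Example 3.2; it assumes coefficient lists with the constant term first, and poly_mul and poly_add are hypothetical helper names introduced only for this check.

```python
def poly_mul(p, q):
    """Multiply two coefficient lists (constant term first)."""
    if not p or not q:
        return []
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def poly_add(p, q):
    """Add two coefficient lists, padding the shorter one with zeros."""
    n = max(len(p), len(q))
    p = p + [0] * (n - len(p))
    q = q + [0] * (n - len(q))
    return [a + b for a, b in zip(p, q)]

g = [4, 0, 5, -7, 1]   # t^4 - 7t^3 + 5t^2 + 4
f = [3, 0, 1]          # t^2 + 3
q = [2, -7, 1]         # t^2 - 7t + 2
r = [-2, 21]           # 21t - 2

check = poly_add(poly_mul(f, q), r)   # should reproduce g
```

Here check comes out equal to g, and r has degree 1 < 2 = ∂f, so the pair (q, r) satisfies both conditions of Proposition 3.1.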

The next step is to introduce notions of divisibility for polynomials, and in particular the idea of ‘highest common factor’ which is crucial to the arithmetic of polynomials.

Definition 3.3. Let f and g be polynomials over K. We say that f divides g (or f is a factor of g, or g is a multiple of f) if there exists some polynomial h over K such that g = fh. The notation f | g will mean that f divides g, while f ∤ g will mean that f does not divide g.

Definition 3.4. A polynomial d over K is a highest common factor (hcf) of f and g if d | f and d | g and, further, whenever e | f and e | g, we have e | d.

Note that we have said a highest common factor rather than the highest common factor. This is because hcf’s need not be unique. However, the next lemma shows that they are unique apart from constant factors.

Lemma 3.5. (1) If d is an hcf of the polynomials f and g over K, and if 0 ≠ k ∈ K, then kd is also an hcf for f and g.

(2) If d and e are two hcf’s for f and g, then there exists a non-zero element k ∈ K such that e = kd.

Proof. (1) Clearly kd | f and kd | g. If e | f and e | g then e | d, so that e | kd. Hence kd is an hcf.

(2) If d and e are hcf’s then by definition e | d and d | e. Thus e = kd for some polynomial k. Since e | d the degree of e is less than or equal to the degree of d, so k must have degree ≤ 0. Therefore k is a constant, and so belongs to K. Since 0 ≠ e = kd, we must have k ≠ 0.

We shall prove that any two non-zero polynomials have an hcf by providing a method to calculate one. This method is a generalisation of the technique used by Euclid (Elements Book 7 Proposition 2) around 300 BC for calculating hcf’s of integers, and is accordingly known as the Euclidean Algorithm.

Algorithm 3.6 (Euclidean Algorithm).

Ingredients: Two polynomials f and g over K, both non-zero.

Recipe: For notational convenience let f = r₋₁, g = r₀. Use the Division Algorithm to find successively polynomials q_i and r_i such that

r₋₁ = q₁r₀ + r₁        ∂r₁ < ∂r₀
r₀ = q₂r₁ + r₂        ∂r₂ < ∂r₁
r₁ = q₃r₂ + r₃        ∂r₃ < ∂r₂
. . .
r_i = q_{i+2} r_{i+1} + r_{i+2}        ∂r_{i+2} < ∂r_{i+1}
. . .
(3.1)

Since the degrees of the r_i decrease, we must eventually reach a point where the process stops; this can happen only if some r_{s+2} = 0. The last equation in the list then reads

r_s = q_{s+2} r_{s+1}        (3.2)

and it provides the answer we seek:

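Theorem 3.7. With the above notation, r_{s+1} is an hcf for f and g.

Proof. First we show that r_{s+1} divides both f and g, using descending induction to show that r_{s+1} | r_i for all i. Clearly r_{s+1} | r_{s+1}, and (3.2) shows that r_{s+1} | r_s. By (3.1), if r_{s+1} | r_{i+2} and r_{s+1} | r_{i+1} then r_{s+1} | r_i. Hence r_{s+1} | r_i for all i; in particular r_{s+1} | r₀ = g and r_{s+1} | r₋₁ = f.

Now suppose that e | f and e | g. By (3.1) and induction, e | r_i for all i. In particular, e | r_{s+1}. Therefore r_{s+1} is an hcf for f and g, as claimed.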

Example 3.8. Let f = t⁴ + 2t³ + 2t² + 2t + 1, g = t² − 1 over Q. We compute an hcf as follows:

t⁴ + 2t³ + 2t² + 2t + 1 = (t² + 2t + 3)(t² − 1) + 4t + 4
t² − 1 = (4t + 4)(¼t − ¼)

Hence 4t + 4 is an hcf. So is any non-zero rational multiple of it, in particular, t + 1.
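The recipe of Algorithm 3.6 amounts to repeated division until the remainder vanishes. The sketch below is one way to carry this out over Q, under the assumption that polynomials are coefficient lists with the constant term first. The names poly_hcf and monic are illustrative, and the division helper is repeated in compact form only so that the block stands alone.

```python
from fractions import Fraction

def trim(p):
    """Drop trailing zeros; the zero polynomial becomes []."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def poly_divmod(g, f):
    """Divide g by a non-zero f: return (q, r) with g = f*q + r, deg r < deg f."""
    r = trim(g)
    q = [Fraction(0)] * max(len(r) - len(f) + 1, 0)
    while len(r) >= len(f):
        shift, coeff = len(r) - len(f), r[-1] / f[-1]
        q[shift] = coeff
        for i, a in enumerate(f):
            r[i + shift] -= coeff * a
        r = trim(r)
    return trim(q), r

def poly_hcf(f, g):
    """Return the last non-zero remainder r_{s+1} of the chain (3.1)."""
    prev = trim([Fraction(c) for c in f])   # r_{-1}
    cur = trim([Fraction(c) for c in g])    # r_0
    while cur:
        _, rem = poly_divmod(prev, cur)
        prev, cur = cur, rem
    return prev

def monic(p):
    """Scale a non-zero polynomial so that its leading coefficient is 1."""
    return [c / p[-1] for c in p]

f = [1, 2, 2, 2, 1]    # t^4 + 2t^3 + 2t^2 + 2t + 1
g = [-1, 0, 1]         # t^2 - 1
d = poly_hcf(f, g)     # represents 4t + 4, as in Example 3.8
d1 = monic(d)          # represents t + 1
```

Dividing by the leading coefficient, as monic does, picks out one representative among the constant multiples allowed by Lemma 3.5; this normalisation is a convenience of the sketch, not something the algorithm requires.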

We end this section by deducing from the Euclidean Algorithm an important property of the hcf of two polynomials.

Theorem 3.9. Let f and g be non-zero polynomials over K, and let d be an hcf for f and g. Then there exist polynomials a and b over K such that

d = af + bg

Proof. Since hcf’s are unique up to constant factors we may assume that d = r_{s+1} where equations (3.1) and (3.2) hold. We claim as induction hypothesis that there exist polynomials a_i and b_i such that

d = a_i r_i + b_i r_{i+1}

This is clearly true when i = s + 1, for we may then take a_i = 1, b_i = 0. By (3.1)

r_{i+1} = r_{i−1} − q_{i+1} r_i

Hence by induction

d = a_i r_i + b_i (r_{i−1} − q_{i+1} r_i)

so that if we put

a_{i−1} = b_i        b_{i−1} = a_i − b_i q_{i+1}

we have

d = a_{i−1} r_{i−1} + b_{i−1} r_i

Hence by descending induction

d = a₋₁ r₋₁ + b₋₁ r₀ = af + bg

where a = a₋₁, b = b₋₁. This completes the proof.

The induction step above affords a practical method of calculating a and b in any particular case.
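As a rough illustration of that method, the sketch below records the quotients produced by the Euclidean Algorithm and then runs the recurrence a_{i−1} = b_i, b_{i−1} = a_i − b_i q_{i+1} backwards from (a, b) = (1, 0). It works over Q with coefficient lists (constant term first); all function names are hypothetical, and the helpers are repeated so the block is self-contained.

```python
from fractions import Fraction

def trim(p):
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def poly_divmod(g, f):
    """Divide g by a non-zero f: return (q, r) with g = f*q + r."""
    r = trim(g)
    q = [Fraction(0)] * max(len(r) - len(f) + 1, 0)
    while len(r) >= len(f):
        shift, coeff = len(r) - len(f), r[-1] / f[-1]
        q[shift] = coeff
        for i, a in enumerate(f):
            r[i + shift] -= coeff * a
        r = trim(r)
    return trim(q), r

def poly_mul(p, q):
    if not p or not q:
        return []
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return trim(out)

def poly_sub(p, q):
    n = max(len(p), len(q))
    return trim([(p[i] if i < len(p) else 0) - (q[i] if i < len(q) else 0)
                 for i in range(n)])

def bezout(f, g):
    """Return (d, a, b) with d = a*f + b*g, where d is the hcf r_{s+1}."""
    prev = trim([Fraction(c) for c in f])
    cur = trim([Fraction(c) for c in g])
    quotients = []                       # q_1, q_2, ..., q_{s+2}
    while cur:
        q, rem = poly_divmod(prev, cur)
        quotients.append(q)
        prev, cur = cur, rem
    a, b = [Fraction(1)], []             # the case i = s + 1 of the proof
    for q in reversed(quotients):        # descending induction
        a, b = b, poly_sub(a, poly_mul(b, q))
    return prev, a, b

# Example 3.8: d = 4t + 4, a = 1, b = -(t^2 + 2t + 3).
d, a, b = bezout([1, 2, 2, 2, 1], [-1, 0, 1])
```

For the polynomials of Example 3.8 this gives 4t + 4 = 1 · f − (t² + 2t + 3) · g, which agrees with the first line of that computation.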

3.2 Irreducibility

Now we investigate the analogue, for polynomials, of prime numbers. The concept required is ‘irreducibility’. In particular, we prove that every polynomial over a subring of C can be expressed as a product of irreducibles in an ‘essentially’ unique way.

An integer is prime if it cannot be expressed as a product of smaller integers. The analogue for polynomials is similar: we interpret ‘smaller’ as ‘smaller degree’. So the following definition yields the polynomial analogue of a prime number.

Definition 3.10. A non-constant polynomial over a subring R of C is reducible if it is a product of two polynomials over R of smaller degree. Otherwise it is irreducible.

Examples 3.11. (1) All polynomials of degree 1 are irreducible, since they certainly cannot be expressed as a product of polynomials of smaller degree.

(2) The polynomial t² − 2 is irreducible over Q. To show this we suppose, for a contradiction, that it is reducible. Then

t² − 2 = (at + b)(ct + d)

where a, b, c, d ∈ Q. Dividing out if necessary we may assume a = c = 1. Then b + d = 0 and bd = −2, so that b² = 2. But no rational number has its square equal to 2 (Exercise 1.2).

(3) However, t² − 2 is reducible over the larger subfield R, for now

t² − 2 = (t − √2)(t + √2)

This shows that an irreducible polynomial may become reducible over a larger subfield of C.

(4) The polynomial 6t + 3 is irreducible in Z[t]. Although it has factors

6t + 3 = 3(2t + 1)

the degree of 2t + 1 is the same as that of 6t + 3. So this factorisation does not count.

(5) The constant polynomial 6 is irreducible in Z[t]. Again, 6 = 2 · 3 does not count.

Any reducible polynomial can be written as the product of two polynomials of smaller degree. If either of these is reducible it too can be split up into factors of smaller degree . . . and so on. This process must terminate since the degrees cannot decrease indefinitely. This is the idea behind the proof of:

Theorem 3.12. Any non-zero polynomial over a subring R of C is a product of irreducible polynomials over R.

Proof. Let g be any non-zero polynomial over R. We proceed by induction on the degree of g. If ∂g = 0 or 1 then g is automatically irreducible. If ∂g > 1, then either g is irreducible or g = hk where ∂h, ∂k < ∂g. By induction, h and k are products of irreducible polynomials, whence g is such a product. The theorem follows by induction.

Example 3.13. We can use Theorem 3.12 to prove irreducibility in some cases, especially for cubic polynomials over Z. For instance, let R = Z. The polynomial

f(t) = t³ − 5t + 1

is irreducible. If not, then it must have a linear factor t − α over Z, and then α ∈ Z and f(α) = 0. Moreover, there must exist β, γ ∈ Z such that

f(t) = (t − α)(t² + βt + γ)
     = t³ + (β − α)t² + (γ − αβ)t − αγ

so in particular αγ = −1. Therefore α = ±1. But f(1) = −3 ≠ 0 and f(−1) = 5 ≠ 0.

Therefore no such factor exists.
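The argument of this example, that an integer root α must divide the constant term, can be mechanised. The sketch below assumes a polynomial with integer coefficients and non-zero constant term, stored as a list with the constant term first; evaluate and integer_roots are hypothetical names. As in the example, an empty list of roots shows irreducibility over Z only for a monic polynomial of degree at most 3, where any proper factorisation must involve a linear factor t − α.

```python
def evaluate(coeffs, x):
    """Evaluate the polynomial at x by Horner's rule (constant term first)."""
    value = 0
    for c in reversed(coeffs):
        value = value * x + c
    return value

def integer_roots(coeffs):
    """List the integer roots, testing only the divisors of the constant term."""
    c0 = coeffs[0]
    if c0 == 0:
        raise ValueError("constant term must be non-zero")
    divisors = [d for d in range(1, abs(c0) + 1) if c0 % d == 0]
    return [s * d for d in divisors for s in (1, -1)
            if evaluate(coeffs, s * d) == 0]

f = [1, -5, 0, 1]              # t^3 - 5t + 1
roots = integer_roots(f)       # empty: neither 1 nor -1 is a root
values = (evaluate(f, 1), evaluate(f, -1))   # (-3, 5), as in the text
```

For polynomials of degree 4 or more the absence of integer roots does not settle irreducibility, since a factorisation into two quadratics is not detected by this test.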

Irreducible polynomials are analogous to prime numbers. The importance of prime numbers in Z stems in part from the possibility of factorising every integer into primes, but even more so from the uniqueness (up to order) of the prime factors.

Likewise the importance of irreducible polynomials depends upon a uniqueness theorem. Uniqueness of factorisation is not obvious; see Stewart and Tall (2002) Chapter 4. In certain cases it is possible to express every element as a product of irreducible elements, without this expression being in any way unique. We shall heed the warning and prove the uniqueness of factorisation for polynomials. To avoid technical issues like those in Examples 3.11(4,5), we restrict attention to polynomials over a subfield K of C. It is possible to prove more general theorems by introducing the idea of a ‘unique factorisation domain’; see Fraleigh (1989) Chapter 6.

For convenience we make the following:

Definition 3.14. If f and g are polynomials over a subfield K of C with hcf equal to 1, we say that f and g are coprime, or f is prime to g. (The common phrase ‘coprime to’ is wrong. The prefix ‘co’ and the ‘to’ say the same thing, so it is redundant to use both.)

The key to unique factorisation is a statement analogous to an important property of primes in Z, and is used in the same way:

Lemma 3.15. Let K be a subfield of C, f an irreducible polynomial over K, and g, h polynomials over K. If f divides gh, then either f divides g or f divides h.

Proof. Suppose that f ∤ g. We claim that f and g are coprime. For if d is an hcf for f and g, then since f is irreducible and d | f, either d = kf for some k ∈ K, or d = k ∈ K. In the first case f | g, contrary to hypothesis. In the second case, 1 is also an hcf for f and g, so they are coprime. By Theorem 3.9, there exist polynomials a and b over K such that

1 = af + bg

Then

h = haf + hbg

Now f | haf, and f | hbg since f | gh. Hence f | h. This completes the proof.

We may now prove the uniqueness theorem.

Theorem 3.16. For any subfield K of C, factorisation of polynomials over K into irreducible polynomials is unique up to constant factors and the order in which the factors are written.

Proof. Suppose that f = f₁ . . . f_r = g₁ . . . g_s where f is a polynomial over K and f₁, . . . , f_r, g₁, . . . , g_s are irreducible polynomials over K. If all the f_i are constant then f ∈ K, so all the g_j are constant. Otherwise we may assume that no f_i is constant, by dividing out all of the constant terms. Then f₁ | g₁ . . . g_s. By an obvious induction based on Lemma 3.15, f₁ | g_j for some j. We can choose notation so that j = 1, and then f₁ | g₁. Since f₁ and g₁ are irreducible and f₁ is not a constant, we must have f₁ = k₁g₁ for some constant k₁. Similarly f₂ = k₂g₂, . . . , f_r = k_r g_r where k₂, . . . , k_r are constant. The remaining g_l (l > r) must also be constant, or else the degree of the right-hand side would be too large. The theorem is proved.

In document Ua24b.galois.theory.fourth.edition (Page 70-77)