ALGEBRAIC NUMBER THEORY AND QUADRATIC RECIPROCITY

(1)

HENRY COHN, JOSHUA GREENE, JONATHAN HANKE

1. Introduction

These notes are from a series of lectures given by Henry Cohn during MIT’s Independent Activities Period in January, 1995. They were first revised by Henry Cohn and Jon Hanke at PROMYS 1995 and then Joshua Greene worked on further revision at PROMYS 1996. The main references are A Classical Introduction to Modern Number Theory by Ireland and Rosen and Algebraic Theory of Numbers by Weyl; the proofs of uniqueness of ideal factorizations and the finiteness of the class number are taken from Ireland and Rosen, and the proof of quadratic reciprocity is taken from Weyl. Several theorems from algebra are stated without proof. In all cases, proofs can be found inAlgebra by M. Artin.

The goal of these notes is to develop the basic properties of rings of integers in algebraic number fields, and then to deduce the law of quadratic reciprocity in an especially clear and compelling way. We have tried to avoid using much algebra. (In particular, no Galois theory is required.) However, a few fundamental definitions are needed. One ought to know the definitions of rings (which we assume to be commutative and have multiplicative identities) and fields, and of vector spaces and dimension; the main non-obvious fact which we will assume is that the dimension of a finite-dimensional vector space is well-defined.

We will want to use the language of ideals, but we will not assume any prior familiarity with them. Supposenis an integer, and consider the set (n) of multiples ofn. This set has several useful properties: the sum of any two elements of (n) is in (n), and ifais in (n) andbis any integer, thenabis in (n). Any non-empty subset I of a ring R with these two properties (a, b ∈I implies a+b ∈I anda ∈I, b∈R impliesab∈I) is called anideal. The intuition is that every ideal is the set of multiples of some element. For some rings, such asZ, this claim is true, as we will prove below. For some rings, it is false, but we can imagine that the ideal is the set of multiples of some “ideal element” not contained in the actual ring (hence the name “ideal”).

Given any elementa∈R, we define (a) to be the set of multiples ofa. Clearly, (a) is an ideal, called theprincipal ideal generated bya. (Sometimes it is denoted byaRorRa.) A ring in which every ideal is of this form (and which also is an integral domain, i.e.,ab= 0 impliesa= 0 orb= 0) is called aprincipal ideal domain (or PID).

More generally, we can look at ideals generated by several elements. We define (a1, a2, . . . , an) ={a1b1+· · ·+anbn:b1, . . . , bn∈R}.

This set is easily shown to be an ideal, and is in fact the smallest ideal ofRcontaininga1, . . . , an. As we will see in Theorem 2.2 (in the casen= 2), in Zthis ideal is generated by the greatest common divisor (gcd) ofa1, . . . , an.

The main use of ideals is that one can “mod out” by them. Given an idealI, the quotient ringR/I is defined basically by identifying two elements of R iff they differ by an element of I. For example, if n∈Z, the ring Z/(n) is the ring of integers modulo n. We assume some familiarity with this process, although perhaps not expressed in this language, so we will not pause to prove the basic properties of quotient rings. For the most part, they are evident from the definitions.

Date_{: July, 1996. Note that this is an unfinished draft, and should be treated as such, although it will likely never be}

finished.

(2)

2. Ideals inZ and Unique Factorization

Lemma 2.1 (Division Algorithm in Z). For integers a and b with b 6= 0, there exist integers q and r such that a=bq+r and0≤r <|b|.

We omit the proof (which is straightforward using induction or well ordering). Theorem 2.2. Zis a principal ideal domain.

Proof. LetI 6= (0) be an ideal in Z. It seems thatI should be generated by its least positive element. Letg be that element, and letnbe any element ofI. Since we would like to show thatnis a multiple of g, we dividenbyg. By the division algorithm, there existqandrsuch thatn=gq+r with 0≤r < g. Sincer=n−gq,r∈I. Becausegis the smallest positive element ofIand 0≤r < g,r= 0 and therefore

nis a multiple ofg, as desired.

Proposition 2.3. Fora, b∈Z, there existc, d∈Zsuch that ac+bd= gcd(a, b).

Proof. Consider the ideal (a, b) generated byaandb. Since Zis a PID, (a, b) = (g) for someg. Theng divides each ofaandb. Sinceg∈(a, b), there existc and dsuch thatg =ac+bd. Thus, any common divisor ofaandb dividesg. We see thatg= gcd(a, b), and thatg is a linear combination ofaandb. Using a variant of Euclid’s algorithm (related to continued fractions), one can find efficiently thecand dwhich Proposition 2.3 says exist.

Definition 2.4. Aprimeis an integerpother than±1 whose only factors are±1 and±p. (In particular, we allow negative primes.)

Theorem 2.5. If pis prime andpdividesab, then pdivides aor pdivides b.

Proof. Supposepdoes not dividea. Then sincepis prime,pandaare relatively prime, i.e., their gcd is 1. By Proposition 2.3, there existcanddsuch thatpc+ad= 1. Multiplication bybyieldspbc+abd=b. By assumptionpdividesaband thereforeabd, and clearlypdividespbc, so pdividesb. Theorem 2.6(Unique Factorization inZ). Factorization of non-zero integers into primes is unique, up to rearrangement of the factors and change of their signs.

Note that we consider 1 to have the empty prime factorization: it is the product of no primes. Proof. We first need to establish the existence of prime factorizations. The proof is most easily organized by induction. The base case 2 is trivial because 2 is already prime. Now, say thatnis a natural number and assume that all natural numbers less thann have a prime factorization. Ifnis prime already, then we are done. Ifnis composite, then there are two natural numbersa, b such that ab=n anda, b < n. By inductiona=p1p2· · ·prandb=q1q2· · ·qs for primespi, qj. Thenp1p2· · ·prq1q2· · ·qs=nand this is a prime factorization.

The next step is to show that the prime factorization is unique. Suppose there were integers with several prime factorizations. Then there would be a non-zero integern with several prime factorizations such that |n|is as small as possible, sayn=p1· · ·ps=q1· · ·qt. Then p1 divides q1· · ·qt so by Theorem 2.5 (iterated) it divides some qi, without loss of generality q1. Since p1 and q1 are prime, we must have p1 = ±q1. Then|n/p1| < |n|, but n/p1 = p2· · ·ps = ±q2· · ·qt has two different prime factorizations.

This is a contradiction, so prime factorizations are unique.

There are shorter proofs of unique factorization, but they are tricky and unrelated to algebraic number theory. This proof, however, generalizes nicely, as we will see in the next lecture.

Remark 2.7. There is a strong analogy in number theory between the ringZof integers and the ringK[x] of polynomials over a fieldK. (The analogy is strongest whenKis a finite field, but for these results any field will do.) Making only a few changes, we can carry over all of the results of this section toK[x]. The main change is that the right way to measure the size of a polynomial is by its degree. For reference, we restate the results of this section in terms of polynomials:

(3)

Lemma 2.8. For f, g ∈ K[x] with g 6= 0, there exist q, r ∈ K[x] such that f =gq+r and r = 0 or degr <degg.

Theorem 2.9. K[x]is a principal ideal domain.

Proposition 2.10. Forf, g∈K[x], there existh, k∈K[x]such that f h+gk= gcd(f, g). Theorem 2.11. Iff is irreducible andf divides gk, thenf divides g orf dividesk.

Theorem 2.12. Factorization of non-zero polynomials into irreducibles is unique, up to rearrangement of the factors and multiplication by units (here, units are just non-zero constants from K).

3. Modular Arithmetic

Theorem 3.1. If ais relatively prime ton, then there exists bsuch that ab≡1 (modn).

Proof. Proposition 2.3 tells us that there existbandcsuch thatab+nc= 1. Hence,ab≡1 (modn). Corollary 3.2. The group of units modulom,(Z/mZ)×_{, is}_{_a₊_m_Z_∈_Z_/m_Z_{: (a, m) = 1}_}_.

Corollary 3.3. If pis prime, then Z/pZis a field.

Theorem 3.4 (Euler’s Theorem). If ais relatively prime to n, then aϕ(n)_≡_{1 (mod}_{n), where} _ϕ(n)_is the number of integers from1 ton that are relatively prime ton.

Proof. By Theorem 3.1,ais invertible modulon. Let{u1, . . . , uϕ(n)}be a system of integers representing the elements of (Z/nZ)×_{. It follows that, modulo}_{n, the numbers}_{au1, au2, . . . , auϕ(n)}_{are another system} of representatives for (Z/nZ)×_{. Multiplying these tells us that}

u1·u2· · ·uϕ(n)≡au1·au2· · ·auϕ(n) (modn).

We can cancel theui to getaϕ(n)_≡_{1 (mod}_n).

Corollary 3.5(Fermat’s Little Theorem). Ifpis prime, then ap_≡_a _(mod _p).

Proof. For an integera, there are only two possibilities: ais relatively prime toporpdividesa. In the former case Euler’s Theorem tells us thataϕ(p)_≡_{1 (mod}_{p). Recall that}_{ϕ(p) is the number of positive} integers less thanpand relatively prime top. Becausepis prime,ϕ(p) =p−1. Thus,ap_≡_a.

Ifpdivides a, thena≡0 (modp) so ap _≡_a _(mod_{p) trivially.} The logic that leads to Corollary 3.3 also proves the following result:

Corollary 3.6. If K is a field andf ∈K[x] is irreducible, thenK[x]/(f)is a field. 4. Field Extensions

This section and the next summarize the basic definitions and facts for field extensions. Some famil-iarity with field theory may be necessary to follow this, but we hope not too much.

Definition 4.1. Given fieldsKand LwithK⊂L, we callLanextensionofK. We often refer to the extension L/K. (Note that this does not indicate a quotient ring.) We can regard L as a vector space over K. The extension L/K isfinite ifL is a finite-dimensional K-vector space. The degreeof L/K is the dimension ofLas aK-vector space, and is denoted [L:K].

Proposition 4.2(Multiplicativity of Degrees). Given field extensionsL/K andK/F, if two of the three degrees [L:K],[K:F], and[L:F]are finite, then the third is, and

[L:F] = [L:K][K:F].

Sketch of Proof. If α1, . . . , αn form a basis of L/K, and β1, . . . , βm form a basis of K/F, then one can check thatαiβj (i= 1, . . . , n,j = 1, . . . , m) form a basis ofL/F. Definition 4.3. Given an extensionL/K, an elementα∈LisalgebraicoverKif it satisfies a non-zero polynomial equation with coefficients inK.

(4)

Proof. The infinitely many elements 1, α, α2_{, . . .} _{cannot be linearly independent. A linear dependence} between them gives a polynomial equation, showing thatαis algebraic. Proposition 4.5. SupposeL/K is a field extension. Each elementαofLthat is algebraic over Kis the root of a unique irreducible polynomial over K (unique up to multiplication by non-zero constants, that is). Ifαis the root of an irreducible polynomial of degree n, thenK(α)/K is finite, and has degree n. Proof. Consider the map π:K[x]→K[α] that substitutes αforxin a polynomial. The kernel of this map is a principal ideal (sinceK[x] is a PID), generated by a polynomialf, which is the polynomial of least degree which hasαas a root. This polynomial is irreducible, since if it factored, thenαwould be a root of one of the factors. Further,f divides any polynomial which hasαas a root because kerπ= (f). Therefore, up to multiplication by a constant, it is the only irreducible polynomial to haveαas a root.

Now suppose that f(x) has degree n. The ring K[α] is isomorphic to K[x]/(f(x)) (the quotient of the polynomial ringK[x] by the ideal generated byf(x)). In particular,K[α] has a basis 1, α, . . . , αn−1 consisting ofnelements. Corollary 3.6 implies thatK[α] is a field, from which it follows thatK(α) =K[α],

andK(α) is a degreenextension ofK.

Definition 4.6. Ifαis algebraic overK, then the unique monic irreducible polynomial satisfied byαis called theminimal polynomial ofαoverK.

Theorem 4.7. Given any extensionL/K, the set F of elements ofLwhich are algebraic overK forms a field.

Proof. Notice that for anyα∈K,αis the root of a polynomial (x−α)∈K[x]. Therefore,αis algebraic overK, soK⊂F.

Supposeαand β are algebraic over K. Since K(α, β) is a finite extension ofK(α), which is a finite extension ofK, we see thatK(α, β) is a finite extension ofK, by Proposition 4.2. Hence, Proposition 4.4 implies thatα+β andαβ are algebraic. Thus,F is a ring. Since

anαn₊_an

−1αn−1+· · ·+a1α+a0= 0 implies

an+an−1α−1+· · ·+a1α−n+1+a0α−n= 0,

we see thatα−1 _{is algebraic if}_α₆_{= 0 is. Hence,}_F _{is a field.} Proposition 4.8. Let L/K be an extension of fields of characteristic 0. If α, β ∈ L, then for all but finitely many choices ofc∈K,α+cβ generates K(α, β)overK, i.e. K(α, β) =K(α+cβ).

The proof of this proposition can be found in Weyl’s book or Artin’s (see the references section). Corollary 4.9. Any finite extensionL/K of fields of characteristic 0 is generated by a single element, i.e., there is someα∈Lsuch that L=K(α).

5. Conjugates and Symmetry

Definition 5.1. A polynomial f(x1, . . . , xn) ofn variables is called symmetric if for each permutation σof{1, . . . , n},

f(x1, . . . , xn) =f(xσ(1), . . . , xσ(n)).

Definition 5.2. For 1≤i≤n, thei-th elementary symmetric polynomial si(x1, . . . , xn) in the variables x1, . . . , xn is the sum of all products ofiof the variables. For example,

s1=x1+· · ·+xn, s2=X

j<k

xjxk =x1x2+x1x3+· · ·+x1xn+x2x3+x2x4+· · ·+xn₋1xn,

and

(5)

Theorem 5.3 (Fundamental Theorem on Symmetric Polynomials). Every symmetric polynomial is a polynomial in the elementary symmetric polynomials. In other words, if f is a symmetric polynomial of n variables, then there exists a polynomial g such that f(x1, . . . , xn) = g(s1, . . . , sn). (This holds for polynomials with coefficients in any ring.)

We omit the proof of this theorem. It can be found in Artin’s book. Also, a solution is sketched in Exercise 7 below.

Note that since

(y−x1)· · ·(y−xn) =yn−s1yn−1+· · ·+ (−1)nsn,

the coefficients of the polynomial with rootsx1, . . . , xn are plus or minuss1, . . . , sn. It is common to try to express some function of the roots of a polynomial in terms of its coefficients. We see now that if that function is a polynomial, then this can be done if (and obviously only if) the polynomial is symmetric.

For example,x2

1+· · ·+x2n is symmetric, and

x21+· · ·+x2n =s21−2s2. The product

Y

j<k

(xj−xk)

is not symmetric, so it cannot be expressed in terms of elementary symmetric polynomials. However, its square

Y

j<k

(xj−xk)2

is symmetric, and can therefore be expressed in terms of elementary symmetric polynomials. The expres-sion is something of a mess, however. Incidentally, this is the discriminant of the polynomial

yn−s1yn−1+· · ·+ (−1)nsn.

Proposition 5.4. Letf be an irreducible polynomial over a fieldKof characteristic 0. In any extension of K,f has no multiple roots.

Proof. Recall that a multiple root off is the same as a common root off andf0_{. Since}_f _{is irreducible} and degf0 _<_deg_f_{, they are relatively prime. (Note that this requires that the characteristic of}_f _{be 0.} In characteristicp, we could havef0 _{= 0. For example, the derivative of}_xp₋_a_is_pxp−1_{= 0. However,} this cannot happen in characteristic 0. In characteristic 0, f0 _{is not zero. Then since deg}_f0 _<_deg_f_, _f0 is not divisible byf, and thus sincef is irreducible, we must have gcd(f, f0) = 1.)

In particular, there existg and k such thatf g+f0_k _{= 1. This implies that} _f _and _f0 _{are relatively} prime over any extension ofK, and hence thatf has no multiple roots in any extension of K. Note the use of the derivative in the preceding proof. We used only purely formal properties of it, and can thus define it by the formula for differentiating a polynomial (so we needn’t worry about limits ifK is a weird field).

Definition 5.5. Anembedding of a fieldK into a fieldLis a one-to-one homomorphismK ,→L. Let us consider the possible embeddings ofQinto another field. Say thatKis a field, andσ:Q,→K is an embedding. Thenσ(1)6= 0 because σis injective. Further, for anya∈Q×,

σ(a) =σ(1·a) =σ(1)σ(a).

Cancellingσ(a) from both sides (we are in a field here!), we obtainσ(1) = 1. This, and the fact that σ is additive, uniquely specifiesσon all ofZ. Givenr∈Q, we can writer=m/nfor appropriately chosen integers,m, n. We already know thatσ(m) andσ(n) are specified, andσ(r) =σ(m)(σ(n))−1_{. Therefore,} σis uniquely determined on all ofQ, so there can be at most one embedding ofQinto a given field. Also, if the characteristic ofK is 0 then the map taking 1∈Qto 1∈Kdoes define an embedding.

In particular,Qsits as a subset ofCand this argument shows that there is no other way to embedQ

inC.

In the rest of this section, the results mention the field Qexplicitly. They can all be stated in more general forms, but for our purposes that won’t be necessary.

(6)

Proposition 5.6. An extension ofQof degreenhas exactly nembeddings into C.

Proof. By Corollary 4.9, the extension is Q(α) for some α, and by Proposition 4.5, α is a root of an irreducible polynomial f(x) of degree n. Suppose that then roots of f(x) in Care α1, . . . , αn. These roots must all be distinct, by Proposition 5.4.

For 1≤ i≤n, we can define a map σi : Q(α)→ Cthat takes α to αi. Note that σi has no choice about where to takeQ, so it is uniquely determined by the image of α. Also, note that there really is such as map. To see this, recall that Q(αi) is isomorphic toQ[x]/(f(x)). To get σi, we just compose the isomorphism fromQ(α) toQ[x]/(f(x)) that takesαtoxwith the isomorphism fromQ[x]/(f(x)) to

Q(αi) that takesxtoαi.

Now we have at least n embeddings fromQ(α) to C. Say thatτ is an embedding from Q(α) to C. Then τ(f(α)) =τ(0) = 0. However, because τ fixesQ and τ is a homomorphism,τ(f(α)) =f(τ(α)). The image ofαunder an embedding must, therefore, be a root off. Thus,τ is one of theσi, so we have

exactlynembeddings.

Definition 5.7. SupposeK/Qis an extension of degree n, with embeddingsσ1, . . . , σn into C. Given α∈K, theconjugates ofαareσ1(α), . . . , σn(α). Thetrace ofαis

tr(α) =σ1(α) +· · ·+σn(α) and thenormofαis

nm(α) =σ1(α)· · ·σn(α).

The conjugates, norm, and trace of α depend not only on α, but also on K. For example, in Q, nm(2) = 2, but in Q(i) the two conjugates of 2 are 2 and 2, so nm(2) = 4. Because of this, we may sometimes denote the norm and trace by nmK and trK.

Also, given any finite extensionK/F, one can define relative norms and traces fromKtoF. However, we won’t need to use them in these notes.

Proposition 5.8. Givenαin a finite extensionKofQ,nm(α)andtr(α)are inQ. More generally, any symmetric polynomial in the conjugates ofαis inQ.

Proof. By Corollary 4.9, we can suppose K = Q(θ). Also, say that the degree of K over Q is n, so

{1, θ, θ2_{, . . . , θ}n−1_}_{is a vector space basis for}_K_over_Q_{. Then}_α_{can be written as a}_Q_{-linear combination} of these basis elements, and this is the same as writingαas a polynomial inθ withQcoefficients. Letg be such a polynomial, so thatα=g(θ).

The conjugates of α are then given by g(σi(θ)), where σ1, . . . , σn are the embeddings of K into

C. In other words, they are given by the same polynomial, with θ replaced by its conjugates. Any polynomial p(σ1(α), . . . , σn(α)) that is symmetric in the conjugates of α then becomes a polynomial p(g(σ1(θ)), . . . , g(σn(θ)), which is symmetric in the conjugates of θ. However, we know thatθ is a root of a polynomial overQof degreen. Thus, the elementary symmetric polynomials in the conjugates ofθ are rational, since they are plus or minus the coefficients of the irreducible polynomial ofθ over Q. By the Fundamental Theorem on Symmetric Polynomials, we see thatpis also rational. Proposition 5.9. Suppose we have a finite extension K/Q, andα, β∈K. If the embeddings of K into

Care σ1, . . . , σn, then the polynomials

Y

i,j

(x−σi(α)−σj(β))

and

Y

i,j

(x−σi(α)σj(β))

have rational coefficients.

Proof. Pick one of the two polynomials and designate it as F(x). Consider the coefficients of x in F. Each coefficient is symmetric separately in the conjugates ofαand the conjugates ofβ. Focus onal, the

(7)

coefficient of xl_{. This coefficient} _{al, is a polynomial in the conjugates of} _{β, with coefficients which are} conjugates ofα. That isal is some element of

h(X1, . . . , Xn)∈Q[σ1(α), . . . , σn(α)][X1, . . . , Xn]

evaluated at (σ1(β), . . . , σn(β)). Then, because the coefficients of h are symmetric polynomials in the σi(α), these coefficients are actually rational. This means thath(X1, . . . , Xn) is really inQ[X1, . . . , Xn]. Thus,al is a rational polynomial in the conjugates ofβ, but it is also symmetric in the conjugates ofβ. Therefore,alis rational. Becausealwas an arbitrary coefficient,F is itself inQ[x]. Note that this gives polynomials overQsatisfied byα+β andαβ, thus giving another, more concrete proof that they are algebraic. In general, this is how one gets a polynomial overQsatisfied by an algebraic number. For example, consider a rootαof

√

2x4+√3x3−x2+√6.

Of course, α ought to be algebraic, but the equation we have here does not have rational coefficients. Instead, it has coefficients in the field K = Q(√2,√3). To get rational coefficients, we simply try to make everything symmetric in the conjugates of the coefficients. The fieldK has degree 4 overQ. Its four embeddings intoCare the identity, the map sending √2 to−√2 and √3 to itself, the map sending

√

2 to−√2 and√3 to−√3, and the map sending√2 to itself and√3 to−√3. Now consider the product

(√2x4₊√_3x3₋_x2₊√₆₎₍₋√_2x4₊√_3x3₋_x2₋√₆₎₍₋√_2x4₋√_3x3₋_x2₊√₆₎₍√_2x4₋√_3x3₋_x2₋√_6). This simplifies to

4x16−12x14+ 5x12−6x10−48x9−23x8−36x6−12x4+ 36.

Sinceαis a root of this polynomial overQ,αis indeed algebraic over Q. In general, the reasoning used in the proof of Proposition 5.9 shows that this always works.

6. Problems

Learning the material well requires more than just reading the notes. Besides reviewing what was done and possibly doing additional reading, working problems is very useful.

Each problem has a difficulty rating from 1 to 3. Problems rated 1 are meant to be relatively easy; everyone should work on them to see how well they understand the material from the preceding sections. Problems rated 2 should be doable. Problems rated 3 can be tricky or time consuming; one should work on them only if they look interesting.

(1) 1. Imitating the proof of Theorem 2.5, use Proposition 2.3 to prove the following result: Given a, b, c∈Zsuch thatadividesbcand gcd(a, b) = 1, adividesc.

(1.5) 2. Supposepis prime. By Fermat’s Little Theorem, the polynomialxp−1₋_{1 has the roots 1,}_{2, . . . , p}₋ 1 modulop. How does it factor inZ/pZ? What can one conclude about (p−1)! modulop? (The result is called Wilson’s Theorem, although Wilson didn’t prove it. Supposedly, he stated it as a conjecture, and then claimed that nobody could prove it because there was no good notation for dealing with primes. Gauss disproved Wilson’s claim by proving his conjecture.)

(1) 3. Consider the extensionQ(√2)/Q. What is the trace of 3 + 2√2? What is its norm? (1.5) 4. Consider the extensionQ(21/3_)/_Q_{. What is the trace of 2 + 2}1/3₋₂2/3_{? What is its norm?} (1.5) 5. Find an element α∈Q(√2,√3) such that Q(√2,√3) =Q(α). Find the irreducible polynomial

satisfied byα.

(2) 6. Prove that 21/3₋₂1/5_6∈_Q₍√_2,√_3).

(2.5) 7. Prove the Fundamental Theorem on Symmetric Polynomials as follows. First, we define an ordering on the monomials in the variablesx1, . . . , xn, so that we can talk about the leading term of a polynomial in several variables. For the ordering, we first order by degree, and then break ties using lexicographic (dictionary) ordering. For example, x3

1 comes beforex21 because of their degrees, andx2

1x22x3 comes beforex21x2x23because although they have the same degrees and the exponents ofx1are the same, the exponent of x2 inx2

(8)

Now show that we can systematically express any symmetric polynomial in terms of elementary symmetric polynomials as follows. Suppose we have a symmetric polynomialp. Show that there is a unique product of elementary symmetric polynomials such that subtracting a multiple of it will cancel the leading term of p. Prove that doing this repeatedly eventually gives 0, thereby expressingpin terms of elementary symmetric polynomials.

For example, suppose n = 3, and we have the polynomial p(x1, x2, x3) = (x1−x2)2_(x2₋ x3)2_(x1₋_x3)2_{. The leading term of} _p_is_x4

1x22. From p, we subtracts21s22, since it has the same leading term. The remaining polynomial has leading term −4x4

1y1z1, so we add to it 4s31s3. Continuing in this way, we eventually find that

(x1−x2)2_(x2₋_x3)2_(x1₋_x3)2₌_s2

1s22−4s31s3−4s32+ 18s1s2s3−27s23. (3) 8. Prove Proposition 4.8.

7. The Gaussian Integers

Definition 7.1. TheGaussian integers are the ringZ[i] ={a+bi:a, b∈Z_}.

Note that the conjugate of a Gaussian integer α is just its complex conjugate, which we denote ¯α. Then its norm is nm(α) =αα. This is the square of its absolute value, of course. It is a more convenient¯ measure of size than the absolute value is. One reason is that it is always an integer, while its square root is not. Another, deeper reason has to do with quotients ofZ[i]. Given an elementa+biofZ[i], we can form the quotient ringZ[i]/(a+bi)Z[i], just as we form Z/nZfromZ. One can show that

Z[i]/(a+bi)Z[i]

= nm(a+bi),

i.e.,Z[i]/(a+bi)Z[i] has nm(a+bi) elements.

It turns out thatZ[i] is in many ways analogous toZ. For example, Gaussian integers factor uniquely into primes. It turns out that theorems about factorization inZ[i] have some surprising consequences, so in this section we will prove unique factorization inZ[i].

Of course, we have to be careful about what we mean by unique factorization. InZ, the numbers±1 have a special role. Because they are units (i.e., have multiplicative inverses), they can occur arbitrarily in factorizations. InZ[i], the units are±1 and±i. Note that these are exactly the elements of norm 1.

In a factorization inZ[i], we do not want to distinguish between a number and that number multiplied by a unit, just as we do not distinguish between numbers and their negatives in a factorization inZ. Two Gaussian integers are called associates of each other if each is a unit times the other. For example, 2 is an associate of−2i. In the same way, one can define units and associates in any ring.

We now can generalize the definition of prime. For reasons to be explained in Section 9, we use the term “irreducible” rather than “prime.” (In general, we call irreducibles primes when we are dealing with a ring in which factorizations are unique. Irreducible polynomials are an exception to this.)

Definition 7.2. Anirreducible in a ring is an element which is not a unit, and whose only factors are units or associates.

Note that whether an element is irreducible depends on the ring in which you view it. For example, 2 is irreducible inZ, but inZ[i] it factors as 2 =i(1−i)2_.

Now that we have a notion of irreducibles and units, we can say what it means for a ring to have unique factorization.

Definition 7.3. An integral domainR is called aunique factorization domain (UFD) if each non-zero element ofR factors as a product of irreducibles and units, and the irreducible factors are unique up to rearrangement and multiplication by units.

We will show thatZ[i] is a UFD. To do this, we will imitate the proof inZ. We will prove an analogue of the division algorithm inZ[i], and deduce from that thatZ[i] is a PID. Then essentially the same proof as in the case ofZwill show thatZ[i] is a UFD.

In formulating the division algorithm in Z[i], we seem to run into a problem thatZ[i] is not ordered, so we can’t compare two Gaussian integers. However, this is not really a problem, since we can simply use the ordering on their norms.

(9)

Proposition 7.4 (Division Algorithm inZ[i]). For α, β∈Z[i] with β 6= 0, there existκ, ρ ∈Z[i] such that α=βκ+ρand0≤nm(ρ)<nm(β).

Before proving this, recall how we divide Gaussian integers in practice. Suppose we want to obtain a quotient and remainder from the division of 7−5i by 3 + 4i. The main idea is to convert this into a problem of dividing integers. First, note that we have

7−5i 3 + 4i =

(7−5i)(3−4i) (3 + 4i)(3−4i)=

1−43i

25 = (1/25) + (−2 + 7/25)i.

We choose−2ias the quotient because−2iis close to (1/25) + (−2 + 7/25)i. In general, the quotient cannot be chosen uniquely. Then, we have

7−5i= (3 + 4i)(−2i) + (−1 +i),

and indeed nm(−1 +i) = 2<25 = nm(3 + 4i) so we see that we have chosen the quotient well to obtain a remainder with smaller norm than that of the divisor.

Proof. As above, we divideαβ¯by nm(β). Supposeαβ¯=x1+x2i, withx1, x2∈Z. Using division inZ, we can divide each of x1 and x2 by nm(β). We choose the quotient so that instead of getting the least positive remainder, we get the remainder with the smallest absolute value. Then we have

x1= nm(β)q1+r1 and x2= nm(β)q2+r2, withq1, q2, r1, r1∈Z,|r1| ≤nm(β)/2, and|r2| ≤nm(β)/2. It follows that

αβ¯= nm(β)(q1+q2i) + (r1+r2i) = ¯ββ(q1+q2i) + (r1+r2i).

Every term in this equation is divisible by ¯β except possibly for r1+r2i, so it also must be divisible by ¯

β. Write r1+r2i= ¯βρwithρa Gaussian integer, and also write κ=q1+q2i. Thenα=βκ+ρand nm(ρ) = nm(r1+r2i)

nm( ¯β) = (r2

1+r22) nm(β) ≤

(nm(β)2_{/4 + nm(β)}2_/4)

nm(β) =

nm(β) 2 .

Note that the proof shows that we can choose a quotient such that nm(ρ)≤nm(β)/2, which is better than nm(ρ) < nm(β). Also, note that the quotient and remainder are not necessarily unique. For example,

3 = (1 +i)(2−i)−i and 3 = (1 +i)(1−2i) +i.

Now that we have the division algorithm, exactly the same proof as in the case ofZ shows that the Gaussian integers form a PID, and hence a UFD (see Proposition 2.3 to Theorem 2.6). As before, one needs to prove the existence of factorizations. In fact, one can prove that factorizations exist in any PID. One can also prove it directly in cases like Z[i]. For example, suppose there were non-zero, non-unit Gaussian integers that didn’t factor into irreducibles. Choose one with minimal norm, sayz. It can’t be irreducible itself, so z must factor non-trivially. Then its factors have smaller norm, so they factor into irreducibles, and combining those two factorizations gives one forz, which is a contradiction.

8. Sums of Squares and Splitting of Primes

Now that we know that Gaussian integers factor uniquely into primes, we can ask what their factor-izations look like. In particular, one interesting and important question is how ordinary integers factor in Z[i]. Of course, we only need to answer this for prime1_{integers, which we call “rational primes” to} distinguish them from primes inZ[i]. It turns out that this is connected to whether the prime is a sum of two squares.

Proposition 8.1. If pis a rational prime, thenpis a sum of two squares iffpis not prime in Z[i]. For example, 5 = 12_{+ 2}2_{= (1 + 2i)(1}₋_{2i), but 3 is prime in}_Z_[i].

1_{In this section, “prime” means “positive prime.” When dealing with factorizations in various rings, it is more convenient} not to try to normalize the primes, as one does inZ by looking only at positive primes. However, when dealing with congruences, it is convenient to consider only positive primes, because−3 behaves like 3, which is a prime congruent to 3

(10)

Proof. Ifp=a2₊_b2_{, then}_p_{= (a}₊_bi)(a₋_{bi), so}_p_{is not prime in} _Z_{[i]. Conversely, suppose}_p_factors asαβ withα, β∈Z[i] and not units. Then taking norms givesp2= nm(p) = nm(α)nm(β). Sinceαand β are not units, nm(α) and nm(β) are not 1, so they must each bep. Thus,pis the norm of a Gaussian integer. However, nm(a+bi) =a2₊_b2_{, so that means}_p_{is the sum of two squares.} Proposition 8.2. If pis a rational prime, then p is not prime in Z[i] iff there is a square root of −1 modulo p, i.e., iff there exists an integerhsuch that h2_{≡ −}_{1 (mod}_p).

Proof. Ifpis not prime inZ[i], then Proposition 8.1 implies thatp=a2₊_b2 _{for some}_{a, b}_∈_Z_{. Because} pis a rational prime anda, b∈Z, neitheranorbcan be a multiple of p, so they are units modulop. If we takeh≡a/b (mod p), thenh2_≡_a2_/b2_{≡ −}_{1 (mod}_p).

Now we prove the converse. Supposeh2_{≡ −}_{1 (mod}_{p). Then}_p_divides_h2_{+ 1. In}_Z_[i],_h2_{+ 1 factors} as (h+i)(h−i). Note thatpdoes not divide h+i(since it would have to divide the imaginary part), andpdoes not divideh−i. Sincepdivides the product (h+i)(h−i) but neither factor,pmust not be

prime inZ[i].

Notice that this last conclusion of this proof depends on the fact thatZ[i] has unique factorization. So far, we have shown thatpis the sum of two squares iff−1 is a square modulop. Using the following lemma, we will be able to determine when−1 is a square modulop:

Lemma 8.3 (Euler’s Criterion). If pis an odd, rational prime, and a6≡0 (modp), then ais a square modulo piffa(p−1)/2_≡_{1 (mod}_{p). If}_a_{is not a square modulo}_{p, then}_a(p−1)/2_{≡ −}_{1 (mod}_p).

Proof. Ifais a square modulop, thena≡b2 _(mod _{p) for some}_{b. Then}_a(p−1)/2_≡_bp−1_≡_{1 by Fermat’s} Little Theorem.

Note that there are (p−1)/2 non-zero squares modulop(since there are (p−1)/2 pairs of non-zero square roots, which are plus or minus each other). The polynomialx(p−1)/2₋_{1 can have only (p}₋_1)/2 roots modulop, sinceZ/pZis a field. Therefore, ifais not a square modulop, thena(p−1)/2_6≡_{1 (mod}_p). Because ap−1 _≡ _{1 (mod}_p), _a(p−1)/2 _{is a square root of 1 modulo} _{p. Since} _p _{is prime, if it is not 1,}

then it must be−1, modulop.

Corollary 8.4. If pis an odd prime, then pis a sum of two squares iffpis1 modulo 4.

At this point, it is easy to completely classify the primes inZ[i]. First, note that Corollary 8.4 and Proposition 8.1 together imply that all rational primes ≡ 3 (mod 4) remain prime in Z[i]. Rational primes p, which are ≡ 1 (mod 4) factor into p = π¯π, but π is now prime in Z[i]: if αβ = π, then nm(αβ) = nm(π) = p, but nm(αβ) = nm(α)nm(β), so this gives a factorization of p in Z. One of nm(α),nm(β) must be 1, so αβ is a trivial factorization ofπ. A similar argument reveals that (1 +i) is prime.

We will now show that, up to associates, these are the only primes in Z[i]. Say that we start with π∈Z[i] a prime. Then we already know how to factor nm(π) inZ, so write

π¯π= nm(π) = 2rp1p2· · ·psq1q2· · ·qt

where thepi are rational primes congruent to 1 (mod 4), and theqj are rational primes congruent to 3 (mod 4). When we pass to Z[i] and factor further into primes, 2r ₌_ir_{(1 +}_i)2r_, _pi ₌_πi_πi_¯ _{and the} _qj remain prime.

Therefore,

π¯π=ir(1 +i)2rπ1π1¯ · · ·πsπsq1q2¯ · · ·qt.

Because π is prime and factorization into primes is unique inZ[i], π must be an associate of one of the primes on the right hand side. Therefore,πwas already on our list of Gaussian primes.

9. Failure of Unique Factorization

Ideally, every ring would have unique factorization. However, that is not the case, even for interesting rings. It turns out thatZ[√−2] is a UFD. (One can conclude from this that an odd prime is of the form x2_{+ 2y}2 _iff₋_{2 is a square modulo the prime.) However,}_Z_[√₋_{3] is not. In it, one has}

(11)

Each factor has norm 4. Since there is nothing of norm 2, they are irreducible. However, the only units are ±1, so none of them are associates of each other. It follows that 4 has two different factorizations into irreducibles inZ[√−3].

Surprisingly, one can recover unique factorization by adding in additional elements. Suppose we don’t just look at the ringZ[√−3] but also include the quotients (1±√−3)/2. These quotients are units, since their product is 1. With these new units, the factorization above ceases to be a problem. This suggests that the ringZ[√−3] isn’t big enough to be really interesting. However, we would like a ring inQ(√−3) which can help us understand this field in the same way that Z helps us understand Q. In the field

Q(√−3), the ringZ[√−3] is not the right ring to look at.

Definition 9.1. Letω= (−1 +√−3)/2. TheEisenstein integers are the ringZ[ω] ={a+bω:a, b∈Z_}. We have to check that the set {a+bω : a, b ∈ Z_} really is a ring. The non-obvious part is closure under multiplication. This is fine once we observe thatω2₌₋_ω₋_{1. Note also that}_ω _{is a cube root of} 1. This is one reason why the Eisenstein integers are important.

In fact, Z[ω] is a UFD. One can prove this using methods like those we used with Z[i]. (In other words, we prove a division algorithm, and then use this to show that the ring is a PID and hence a UFD.) This might lead one to expect that given any ringZ[√n], we can add in some additional elements to get unique factorization into primes, and that it can be proved by proving the division algorithm. However, this is not the case. It is true that adding enough extra elements does eventually bring us to a ring with unique factorization (e.g., the whole fieldQ(√n)), but this isn’t what we want at all!

Some rings, such asZ[√−5], are even worse thanZ[√−3]. InZ[√−5], examples such as 2·3 = (1 +√−5)(1−√−5)

show that factorization into irreducibles is not unique. One might hope to enlarge the ring to something likeZ[(1+√−5)/2]. However, that leads into awful complications. UnlikeZ[i], there’s no reasonable basis. For example,Z[i] has the basis 1, i. This is called an integral basis, because every element ofZ[i] can be expressed as a linear combination of 1 andi with integer coefficients. However, Z[(1 +√−5)/2] has no integral basis. One could try 1,(1+√−5)/2, but this fails because ((1+√−5)/2)2₌₋_3/2+(1+√₋_5)/2, and−3/2 is not an integer.

In fact, we will see later that no matter how one extends Z[√−5] (withinQ(√−5)), it no longer has an integral basis. Because of this, extending the ring by adding additional elements is not a reasonable way to proceed.

Even when the ring is a UFD, one can’t necessarily prove it by first proving a division algorithm. For example, inQ(√−19), the appropriate ring to look at isZ[(1 +√−19)/2]. This ring is a UFD, but it has no division algorithm. If one tries to imitate the proof of the division algorithm in Z[i], one finds that the remainder doesn’t end up being small enough. In fact, one can prove that there can be no division algorithm, even though factorization is unique.

These examples show that the question of whether factorizations are unique can be quite subtle. It might seem that dealing with fields such as Q(21/3_{) would be hopelessly difficult. However, we will} see that there is a beautiful way to salvage unique factorization when it fails, and prove it when it doesn’t fail. Instead of factoring numbers, we will factor ideals. In certain rings, even when the elements themselves don’t factor uniquely, the ideals will. Non-principal ideals will provide additional factors to make everything work out.

10. Problems

(2) 1. Prove the division algorithm inZ[√−2], and conclude that it is a UFD.

(2) 2. Prove that an odd primepcan be written asx2_{+ 2y}2 _with_{x, y}_∈_Z_iff₋_{2 is a square modulo}_p. (1) 3. Letpbe an odd prime. Show that ifpcan be written in the form x2_{+ 5y}2 _with _{x, y} _∈_Z_{, then}

−5 is a square modulop. Show that the converse is false.

(2.5) 4. Investigate the arithmetic ofZ[√−5]. To what extent, and how, do the results we proved forZ[i] break down?

(1.5) 5. Show thatZ[(1 +√−5)/2] contains 1/2n _{for each}_{n, and conclude that it has no integral basis.} (3.5) 6. DoesZ[21/3_{] have a division algorithm?}

(12)

11. Quadratic Reciprocity

We have seen that an odd primepis a sum of two squares iff−1 is a square modulop, and is of the formx2_{+ 2y}2 _iff₋_{2 is a square modulo}_{p. Examples such as these, as well as other considerations, make} it very interesting to know when an integernis a square modulo p. It is easy to show that the product of two numbers is a square iff either both are squares, or neither are. Because of this, we see that it is enough to know which primes reduce to squares modulop. That is taken care of by the famous Law of Quadratic Reciprocity:

Theorem 11.1 (Law of Quadratic Reciprocity). Let pandq be distinct, positive, odd primes. If either is1modulo 4, thenpis a square moduloqiffqis a square modulo p. If both are 3modulo 4, thenpis a square modulo qiffqis not a square modulo p. Furthermore,2 is a square modulopiffp≡ ±1 (mod 8) and−1is a square modulopiffp≡1 (mod 4).

We will prove this theorem later in the course. It is one of the most amazing theorems in mathematics: who could imagine that whetherpis a square modulo q has anything to do with whetherqis a square modulop?

As an application of Quadratic Reciprocity, consider the question of when 3 is a square modulop. We see that ifp≡1 (mod 4), then 3 is a square modulopiffpis a square modulo 3, i.e., iffp≡1 (mod 3). Forp≡3 (mod 4),p6= 3, we see that 3 is a square modulopiffpis not a square modulo 3, i.e., iffp≡2 (mod 3). Combining these, we see that for primesp >3, 3 is a square modulopiffp≡ ±1 (mod 12).

12. Algebraic Integers

Definition 12.1. Anumber field is a finite extension ofQ.

We normally consider number fields to be embedded in C. (Sometimes this is mildly inconvenient. However, it is often convenient. For example, then one embedding of it into C is the identity map, so each element is one of its own conjugates.)

Given a number fieldK, we would like to have a ringOK⊂K that stands in the same relation toK asZdoes toQ. We callOK thering of integersinK. (This is the reason for the term “rational integers” for ordinary integers.) IfK =Q(i), then OK should beZ[i]. However, we have seen that things can be more complicated. IfK=Q(√−3), then OK should beZ[ω], whereω is a complex cube root of 1.

There are several properties whichOK should have. First,K should be the field of fractions of OK. In other words, every element ofK should be the ratio of two elements ofOK.

Second,OK should have an integral basis, which is defined as follows:

Definition 12.2. Anintegral basisfor a ringRconsists of elementsω1, . . . , ωn ∈Rsuch that eachα∈R can be written uniquely in the formα=a1ω1+· · ·+anωn, witha1, . . . , an ∈Z.

It follows immediately that a ring has an integral basis iff its additive group is a free abelian group on finitely many generators.

Finally,OKshould have unique factorization, if that is possible. (In many cases, such asK=Q(

√ −5), it is not possible.)

To construct such a ring OK, we need the notion of an algebraic integer. Once we have defined algebraic integers, we will show that of all the rings contained in K, only the ring of algebraic integers can have all the properties we would like (i.e., the quotient field is K, there is an integral basis, and unique factorization holds). Of those three properties, the only one that ever fails is unique factorization. Definition 12.3. A number is analgebraic integer if it is the root of a monic polynomial with coefficients inZ.

For example, (−1 +√−3)/2 is an algebraic integer, since it is a root of x2₊_x_{+ 1 = 0. However,} (1 +√−5)/2 is not an algebraic integer. This will follow from Proposition 12.9, since the irreducible polynomial of (1 +√−5)/2 is x2₋_x_{+ 3/2.}

Definition 12.4. Given a number fieldK, thering of integers OK in K is defined to be the set of all algebraic integers inK.

(13)

Proof. Supposeα, β∈ OK. Letα1, . . . , αn be the roots of some monic polynomial overZsatisfied byα, and letβ1, . . . , βm be the roots of some monic polynomial overZsatisfied byβ. Then the polynomials

Y

i,j

(x−αi−βj)

and

Y

i,j

(x−αiβj)

are monic, and have integer coefficients. This follows by an argument identical to that given in the proof of Proposition 5.8.

Since these polynomials are monic polynomials overZwithα+β andαβ as roots,α+β andαβ are

both algebraic integers.

Proposition 12.6. SupposeRis a subring of a number fieldK, and thatR has an integral basis. Then R⊂ OK.

In fact, we can prove the same results with a weaker hypotheses. We do not really need to require that the integral basis be linearly independent. It is enough for it to spanR.

Proposition 12.7. SupposeRis a subring of a number fieldK, and that for somen,Rcontains elements ω1, . . . , ωn such that for all x∈ R, there exista1, . . . , an ∈ Z such that x= a1ω1+· · ·+anωn. Then R⊂ OK.

Proof. Letx∈R. For eachi, there exista1i, . . . , ani such thatxωi=a1iω1+· · ·+aniωn. Consider the following simultaneous equations in the variablesy1, . . . , yn:

(a11−x)y1+a21y2+· · ·+an1yn = 0 a12y1+ (a22−x)y2+· · ·+an2yn = 0

.. . a1ny1+a2ny2+· · ·+ (ann−x)yn = 0

These equations have a non-zero solution, namely yi = ωi. Hence, the determinant of the matrix of coefficients must be zero. Expanding this determinant out gives a monic polynomial with integer

coefficients satisfied byx.

In fact, it isn’t surprising that we can prove results like Proposition 12.7, because its hypotheses imply thatR has an integral basis. To see this, we apply the following theorem:

Theorem 12.8. Every torsion-free, finitely-generated abelian group is a free abelian group on finitely many generators. Every subgroup of a finitely generated free abelian group is a finitely generated free abelian group itself.

A proof of this theorem can be found in Artin’s book. One can sometimes use ad hoc arguments to avoid citing this theorem, but we won’t bother.

Note that the hypotheses of Proposition 12.7 essentially say that the additive group ofRis generated by finitely many elements ω1, . . . , ωn. The additive group is clearly torsion-free (since that just means it has no elements of finite order), so we see that it is a free abelian group on finitely many generators. Hence,Rhas an integral basis.

Proposition 12.9. A algebraic number is an algebraic integer if and only if its minimal polynomial has integer coefficients.

We have already found this theorem useful in showing that 1+√₂−5 isn’t an algebraic integer.

Proof. What we need to prove is that ifαis an algebraic integer, then the minimal polynomial of αhas integer coefficients. Consider the ringZ[α].

Regardless of whatαis, the additive group of this ring is generated by 1, α, α2_{, . . .. Suppose}_α_satisfies a monic polynomial of degreenwith integer coefficients. Let us write this polynomial asxn₊_{q(x) where} q ∈Z[x] and has degree at mostn−1. Then for k ≥n, we can use the polynomial to writeαk _{as an}

(14)

integer linear combination of lower powers ofα. For example,αn ₌₋_{q(α) and the general case follows} by induction.

We see, then, that 1, α, . . . , αn−1 _{generate the additive group of} _Z_{[α]. Since it is finitely-generated,} and is obviously torsion-free, it has an integral basis, say ofmelements.

That integral basis is also a basis of Q(α)/Q, so the degree of αmust be the same as the size of the integral basis, (i.e.,m=n). The proof of Proposition 12.7 shows thatαsatisfies a monic polynomial over

Zof degreen. Since this polynomial has the same degree as the minimal polynomial ofα, they must be the same. Thus, the minimal polynomial ofαhas integer coefficients. Proposition 12.10. For a number field K, K is the field of fractions of OK. Moreover, given any α∈K, there existsd∈Zsuch thatdα∈ OK.

Proof. Sinceα∈K, αsatisfies a polynomial equation

xn+an−1xn−1+· · ·+a1x+a0= 0,

with a0, a1, . . . , an₋1 ∈ Q. Choose dto be divisible by the denominators of a0, a1, . . . , an−1. Thendα satisfies the equation

xn+dan₋1xn−1+· · ·+dn−1a1x+dna0= 0,

which has integer coefficients.

Lemma 12.11. Ifω1, . . . , ωn are a basis ofK/Q, andσ1, . . . , σn are the embeddings ofK intoC, then square of the determinant of the matrixM withMij=σi(ωj)is a non-zero rational number.

Proof. One can change from any basis to any other via a matrix with rational entries. This changes det(M)2 _{by a factor of the square of the determinant of that change of basis matrix. Thus, we need only} prove that det(M)2_∈_Q_{for one basis, and it will follow for all the others.}

LetK=Q(θ) (again, this follows from Corollary 4.9). We choose the basis 1, θ, . . . , θn−1_{. Then det(M}₎ is a Vandermonde determinant, and det(M)2 _{is therefore the discriminant of the minimal polynomial of} θ. As we saw in the section on symmetric polynomials, this is rational. Furthermore, by Proposition 5.4, all of the roots are distinct and the discriminant is non-zero. Proposition 12.12. For a number fieldK,OK has an integral basis.

Proof. Letσ1, . . . , σnbe the embeddings ofKintoC. Consider a basisω1, . . . , ωn ofK/Qsuch that each ωi is an algebraic integer. (We can do this simply by taking any basis and multiplying each element of it by a suitable rational integer.) We will show that there is ad∈Z such that every element ofOK is an integer linear combination of ω1/d, . . . , ωn/d. Then the additive group ofOK is a subgroup of a free abelian group onω1/d, . . . , ωn/d, and is hence finitely generated, soOK has an integral basis.

Supposeα∈ OK. We know thatαcan be expressed uniquely asa1ω1+· · ·+anωnwitha1, . . . , an∈Q. Thesea1, . . . , an are determined by being the solutions of the following simultaneous linear equations:

a1σ1(ω1) +· · ·+anσ1(ωn) = σ1(α) ..

.

a1σn(ω1) +· · ·+anσn(ωn) = σn(α)

Note that the determinant of the coefficient matrix (σi(ωj)) is non-zero by Lemma 12.11. Thus, theai exist and are unique.

Letdbe the square of the determinant of the coefficient matrix of these equations. We use Cramer’s Rule to solve the simultaneous equations. This gives the solutionsai as the ratio of two determinants. If we multiply numerator and denominator by the denominator, then the denominator becomesd, and the numerator is the product of two determinants.

Becausedis the square of the determinant of a matrix of algebraic integers,dis an algebraic integer. By Lemma 12.11, d ∈ Q. Therefore, d is an integer (by Proposition 12.9). For essentially the same reason thatdis an algebraic integer, the numerators of the fractions givingai are also algebraic integers. Becauseaianddare rational, the numerators must be rational as well, so they are also ordinary integers.

(15)

Therefore, we have proved that each ai is an integer divided by a fixed integer d. Therefore, every element of OK is an integer linear combination of ω1/d, . . . , ωn/d. As we saw above, this implies that

OK has an integral basis.

Note that the proof of Proposition 12.12 lets us find an integral basis for OK (and thereby findOK itself) with only a finite amount of computation. There are ways to do the computation slightly more efficiently, but we won’t go into that here.

Proposition 12.13. SupposeK is a number field, and R is a subring ofK such thatK is the field of fractions ofR. IfR has unique factorization, thenOK ⊂R.

Proof. Letα∈ OK. SinceKis the field of fractions ofR,α=p/qfor somep, q∈R. SinceRhas unique factorization, we choosepandqto be relatively prime.

Becauseαis an algebraic integer, it satisfies a monic polynomial equation xn+an₋1xn−1+· · ·+a1x+a0= 0 witha0, a1, . . . , an₋1∈Z. Therefore,

pn+an−1pn−1q+· · ·+a1pqn−1+a0qn = 0.

It follows thatq divides pn_{. Now we use the fact that}_R _{has unique factorization. Since}_q _{is relatively} prime top, it is relatively prime topn_{. The only way it can divide}_pn_{is if}_q_{is a unit. Then since}_α₌_p/q

andqis a unit, we haveα∈R. Thus,OK⊂R.

Therefore, the only subring of a number fieldK that can possibly have our three properties isOK. It always has the first two, but usually it is not a UFD. When we discuss ideal factorization, we will deal with that problem.

We now show that in the case of quadratic number fields, the ring of integers is what we expect. Proposition 12.14. Let dbe a square-free integer. Then the ring of integers in Q(√d)isZ[√d] unless d≡1 (mod 4), in which case it is Z[(1 +√d)/2].

Proof. Becausedis square-free,Q(√d)6=Q. The only algebraic integers inQare the integers, so we find the rest of the integers inQ(√d) if we restrict our attention to an arbitrary elementα=a+b√d∈Q(√d), witha, b∈Qandb6= 0. It is a root of the irreducible polynomial

x2₋_2ax_{+ (a}2₋_db2_). Thus, it is an algebraic integer iff 2a∈Zand a2₋_db2_∈_Z_.

From this, we see that (2a)2−d(2b)2∈Z, sod(2b)2∈Z. Sincedis square-free, we then have (2b)2∈Z, which implies 2b ∈Z. Now let a= a0_{/2 and} _b ₌ _b0_{/2. We must have}_a0_{, b}0 _∈ _Z_{. The number} _α_{is an} algebraic integer iff (a0₎2₋_d(b0₎2 _{is divisible by 4. When} _d _6≡_{1 (mod 4), this is true iff} _a0 _and _b0 _are even. Whend≡1 (mod 4), it is true iffa0 _and_b0 _{have the same parity. This is equivalent what we were}

trying to prove.

Note thatZ[(1 +√d)/2] is the same as Z[(±1±√d)/2].

13. Determining the Ring of Integers

LetK be a number field, andOK its ring of integers. As we have seen,OK can sometimes be larger than one might guess. For example, ifK=Q(√−3), thenOK=Z[(1 +√−3)/2], rather thanZ[√−3]. In this section, we will develop an algorithm for determiningOK, and apply it to certain cyclotomic fields. Supposeω1, . . . , ωn is a basis K/Q, and σ1, . . . , σn are the embeddings of K into C. Let M be the matrix whose i, j entry is σi(ωj). Then the rational numberd= det(M)2 _{is called the discriminant of} the basis. Recall that in proving Proposition 12.12 we proved the following:

Proposition 13.1. SupposeK is a number field, andω1, . . . , ωn is a basis ofK/Qsuch that eachωi is an algebraic integer. Letdbe the discriminant of the basis. Then every element ofOK can be expressed in the form(a1ω1+· · ·+anωn)/dwith a1, . . . , an ∈Z.

(16)

Given a basis of K/Q consisting of integers, we can therefore find all the algebraic integers with only a finite amount of computation. We simply need to find all the algebraic integers of the form (a1ω1+· · ·+anωn)/dwith 0≤ai< d.

For an example, consider the fieldK=Q(ζ) withζa primitivep-th root of unity, forpan odd prime. We will show thatOK =Z[ζ].

First, we show that the polynomialxp−1₊_xp−2₊_{· · ·}₊_x_{+ 1 (which has}_ζ_{as a root) is irreducible.} Proposition 13.2(Eisenstein’s Criterion). Letf(x)∈Z[x]be a polynomial of degreensuch thatf(x)≡

xn _(mod _p)_{(i.e., the leading term is}₁_modulo_p_{and the other coefficients are}₀_modulo_{p), and such that} the constant term of f(x)is not divisible byp2_{. Then} _f_(x)_{is irreducible.}

Proof. Supposef(x) =g(x)h(x) withg(x) and h(x) non-constant. We have xn ≡g(x)h(x) (modp), so for someiandc6≡0 (modp), we haveg(x)≡cxi _(mod _{p) and}_h(x)_≡_c−1_xn−i _(mod_{p). Since}_{g(x) and} h(x) are not constant, we cannot havei= 0 ori=n. Therefore, the constant terms ofg(x) andh(x) are divisible byp. It follows that the constant term off(x) =g(x)h(x) is divisible by p2_{. This contradicts}

our hypotheses, sof(x) must be irreducible.

If we setx=y+ 1, thenxp−1₊_{· · ·}₊_x_{+ 1 = ((y}_{+ 1)}p₋_{1)/y. Now,} (y+ 1)p₋₁

y =y

p−1₊p 1

yp−2+· · ·+

_p

k+ 1

yk· · ·+

_p

p−1

,

which satisfies the hypotheses of Eisenstein’s Criterion. Since the change of variables doesn’t affect irreducibility, we see that xp−1₊_xp−2₊_{· · ·}₊_x_{+ 1 is irreducible. Therefore [K} _: _Q_{] =}_p₋_{1, and the} numbers 1, ζ, . . . , ζp−2 _{form a basis for}_K/_Q_{. Note that this basis consists of algebraic integers. We will} show that it is in fact an integral basis forOK.

Lemma 13.3. Supposeaandb are not divisible byp. Then(1 +ζa)/(1 +ζb)is a unit inOK. Proof. There exists acsuch thata≡bc (mod p). Then

1 +ζa 1 +ζb =

1 +ζbc

1 +ζb = 1 +ζ

b₊_ζ2b₊

· · ·+ζb(c−1)∈ OK.

For the same reason, its reciprocal is also inOK, so it is a unit inOK. Letλ= 1−ζ. Since

xp−1+xp−2+· · ·+x+ 1 = p−1

Y

i=1

(x−ζi),

we havep=Qp−1

i=1(1−ζi). Applying Lemma 13.3, we see thatpdiffers fromλp−1by a unit factor. Proposition 13.4. The discriminant of the basis1, ζ, . . . , ζp−2 _is_±_pp−2_.

Proof. The discriminant d is given by the square of a Vandermonde determinant. That is, given the matrix M =     

1 ζ ζ2 _{· · ·} _ζp−1

1 (ζ2₎ _(ζ2₎2 _{· · ·} _(ζ2₎p−1 ..

. ... ... . .. · · ·

1 (ζp−1₎ _(ζp−1₎2 _{· · ·} _(ζp−1₎p−1

     ,

thend= (det(M))2_{. We have}

d=



 Y

i>j

ζi−ζj 



2

.

In this product, we get a lot of units times (p−1)(p−2) factors ofλ. Therefore,dis a unit timespp−2_, sinceλp−1 _{is a unit times}_{p. Since}_d_∈_Q_and_pp−2_∈_Q_{, the unit factor must be in}_Q_{. Since it is rational} and an algebraic integer, it must be an integer. The only units inZare±1, sod=±pp−2_.

(17)

Proposition 13.5. _OK=Z[ζ].

Proof. Suppose thatOK 6= Z[ζ]. From Proposition 13.1, we see that there exist a0, . . . , ap−2 ∈Z, not all divisible byp, such that (a0+a1ζ+· · ·+ap−2ζp−2)/p∈ OK. (This is true sincepis the only prime factor of the discriminant. If there were any other algebraic integers, multiplying by a power ofpwould produce one of this form.)

We now change from the basis 1, ζ, . . . , ζp−2 _{to 1, λ, . . . , λ}p−2_{. Then it is not hard to check that there} are integersb0, . . . , bp−2, not all divisible byp, such that

a0+a1ζ+· · ·+ap−2ζp−2=b0+b1λ+· · ·+bp−2λp−2. We will show that eachbi must be divisible byp, which will be a contradiction.

We begin withb0. InOK,pdividesb0+b1λ+· · ·+bp−2λp−2. Sinceλdividesp, we see thatλdivides b0. If b0 is not divisible by p, then there exist a1, a2 ∈Z such that b0a1+pa2 = 1. Then since λ is a common factor of b0 and p, we must have that λdivides 1. However, since pis a unit timesλp−1, that would imply thatpis a unit, which is a contradiction. Therefore,b0 must be divisible byp.

Now we see that λp−2 _divides _b1₊_b2λ₊_{· · ·}₊_bp

−2λp−3. From this, we can deduce as above thatλ dividesb1, and then thatpdividesb1. Continuing in this way, we see thatpdivides eachbi, which means that (b0+b1λ+. . .+bp₋2λp−2_)/p_∈_Z_[ζ],_{and therefore we must have}_O

K =Z[ζ].

14. Ideal Factors

We have seen that in some rings of integers, such asZ[√−5], factorization is not unique. We could try to enlarge the ring to make it unique, but if we do so while staying within the fieldQ(√−5), we find that the enlarged ring never has an integral basis.

Instead, we introduce “ideal factors.” These new factors will not actually be elements of a ring, but they will restore unique factorization. They will correspond to ideals in the ring of integers.

To see how this will restore unique factorization, consider the two factorizations 2·3 = (1 +√−5)(1−√−5)

inZ[√−5]. Define the ideals

p= (2,1 +√−5),

q= (3,1 +√−5), and

r= (3,1−√−5).

The notation is meant to suggest thatp is the gcd of 2 and 1 +√−5. Of course, in the ring Z[√−5], 2 and 1 +√−5 have no common factors. However, if we are going to reconcile the two factorizations of 6, then there should be such an elementp.

The basic idea is this: 2 and 1 +√−5 do not generate the unit ideal (1) =Z[√−5]. If they did, then there would be two algebraic integers α, β ∈ Z[√−5] such that 2α+ (1 +√−5)β = 1. Multiply both sides by 1−√−5 to obtain 2α(1−√−5) + 2(3β) = 1−√−5. This says that 2 divides 1−√−5. but we know that it doesn’t. Because 2 and 1 +√−5 do not generate the unit ideal, they ought to have some common factor. However, they do not have any non-trivial common factor inZ[√−5].

The ideal p is not a principal ideal (since if it were, it would be generated by a common factor of 2 and 1 +√−5 and would therefore be the unit ideal). However, like all ideals, it behaves like the set of multiples of some number. We will think ofpas defining an “ideal factor,” which doesn’t correspond to any element ofZ[√−5].

Recall the following definitions:

Definition 14.1. LetR be a ring, and I and J ideals inR. Theproduct IJ is the ideal generated by all productsij withi∈Iandj ∈J. Aprime ideal P is an ideal other thanRsuch that ifa, b∈R and ab∈P, thena∈P or b∈P. Amaximal ideal is an ideal M ⊂Rsuch that M 6=R, butR is the only ideal properly containingM.

(18)

There are two useful facts which we should mention now, but whose proof we delay to a later section. First, any ideal different from the ring itself is contained in a maximal ideal. Second, maximal ideals are always prime.

From time to time, we might write “ideal” when we mean “non-zero ideal.” This is especially likely to happen when we are talking about prime ideals.

The following lemma is often useful. We omit the proof, since it is easy and probably well known. Lemma 14.2. An ideal P 6= R in an arbitrary ring R is prime iff for all ideals A and B, AB ⊂ P impliesA⊂P or B⊂P. Also, P is prime (orP =R) iff R/P is an integral domain.

We will now show that p, q, and r are prime ideals. Note that the principal ideal generated by a numberαis prime iffαis prime, so we think of this as saying that the “ideal elements” generating p,q, andrare prime.

To see this, we look at the quotientsZ[√−5]/p,Z[√−5]/q, andZ[√−5]/r. These are all very similar, so we will deal only with the first. For it, we have

Z[√−5]/p=

Z[√−5] (1 +√−5)

/(2) =Z/(2),

which is a field (and therefore an integral domain). We can check in the same way thatqandrare prime. Now we will show that how these factors reconcile the two factorizations of 6 inZ[√−5]. In general, when dealing with ideal factorizations, instead of looking at actual elementsαof the ring of integers, we will look at the principal ideals (α). To start off with, we will show that the principal ideal (2) factors as

p2.

We havep2 = (22_,_{2(1 +}√₋_5),_{(1 +}√₋₅₎2_{) = (4,}_{2 + 2}√₋_5,₋_{4 + 2}√₋_{5). Since 2 = (2 + 2}√₋₅₎₋ (4 + (−4 + 2√−5)), it follows that 2∈p2, and we havep2⊂(2), so (2) =p2. In the same way, we can show that (3) =qr, (1 +√−5) =pq, and (1−√−5) =pr. Thus, both factorizations of 6 can be refined top2qr. Of course, this doesn’t prove that 6 has only one factorization into prime ideals, but at least it looks encouraging.

15. Uniqueness

We will now prove unique factorization for ideals in rings of integers (except for one proposition which we will put off until section 16). We begin with the following very important result:

Proposition 15.1. LetKbe a number field, and letOK denote the ring of integers ofK. Every non-zero ideal of OK divides a non-zero principal ideal.

We will prove this in section 16. Part of the importance of this result is that it says that there are no unnecessary ideal factors. At first, this might seem obvious. However, it is false in most rings. For example, consider the ringZ[x], which is a UFD but not a PID. One non-principal ideal inZ[x] is (2, x). This ideal does not divide any non-zero principal ideal. To see this, suppose thatIis an ideal, and (2, x)I is principal, say (2, x)I= (f(x)). For allg(x)∈I, we must havef(x)|2g(x) andf(x)|xg(x), sof(x)|g(x), and henceI⊂(f(x)). We can then conclude that (2, x)(f(x)) = (f(x)), and therefore that (2, x) = (1). However, 16∈(2, x), so this is impossible. Note that this last step depends on a cancellation law. In a general ring with arbitrary idealsAC=BC, we cannot cancel, but it isn’t hard to show that cancellation always holds whenC is principal.

So we see that Proposition 15.1 isn’t completely obvious. It will be the key to almost all of our results in this section.

We begin with the following two results:

Proposition 15.2(Cancellation Law). SupposeA,B, andCare ideals ofOK, withC6= 0. IfAC=BC, thenA=C.

Proof. LetC0 _{be an ideal such that}_CC0 _{is principal, say}_CC0_{= (α), with}_α₆_{= 0. Then} (α)A= (α)B,

(19)

Proposition 15.3 (“To Contain is to Divide”). SupposeA and B are non-zero ideals ofOK. Then A divides B iffA containsB.

Proof. Clearly, if A divides B, then A contains B. Suppose B ⊂ A. Let A0 _{be an ideal such that} AA0 _{= (α) with}_α₆_{= 0. Then}_BA0 _⊂_{(α). Let}_C_{= (BA}0_{)/α, (i.e., the set of all} _β _{such that} _αβ_∈_BA0_). Then C is an ideal of OK. Also, because BA0 _⊂_{(α), it is easy to check that} _{C(α) =} _BA0_{. In other} words,CAA0₌_BA0_{. By cancellation, we have}_AC₌_B_{, so}_A_divides_B_. Now we prove that all ideals inOK factor into primes. (This is not as obvious as it was in rings such asZ, although it is not hard to prove.) First, we need the following lemma:

Lemma 15.4. LetRbe a ring, andIan ideal ofR. Then idealsJ1 ofRthat containI are in one-to-one correspondence with the idealsJ2 of R/I viaJ1=ϕ−1_{(J2), where}_ϕ_:_R_→_R/I _{is the canonical map.} Proof. The substance of this lemma is to check thatϕ−1 _{carries ideals of}_R/I _{to ideals of} _R_containing Iand thatϕtakes ideals ofRcontainingIto ideals inR/I. We will leave the details here to the reader. After the two maps are shown to be well-defined, it follows immediately that the two maps are inverses

and, thus, that each is bijective.

Because a field is the same as a ring without any proper ideals (every non-zero element has a reciprocal iff every non-zero principal ideal is the unit ideal), we deduce the following corollary:

Corollary 15.5. An ideal I in a ring R is maximal iffR/I is a field.

Note that because every field is an integral domain, it follows that every maximal ideal is prime. Lemma 15.6. LetI be a non-zero ideal ofOK. Then I∩Z₆= (0).

Proof. Letα∈I andα6= 0. Sinceαis an algebraic integer, it satisfies a polynomial equation αn₊_an

−1αn−1+· · ·+a1α+a0= 0

withai∈Z. We can takea06= 0. Becausea0∈(α)⊂I, we haveI∩Z₆= (0). Proposition 15.7. LetI be a non-zero ideal ofOK. Then OK/I is finite.

Proof. Leta >0 be inZ_∩I. Since (a)⊂I,OK/(a) maps surjectively ontoOK/I. Ifω1, . . . , ωn form an integral basis forOK, then we see that every element ofOK/(a) can be represented by a unique element

a1ω1+· · ·+anωn

with 0≤ai< afor eachi. Therefore,OK/(a) hasan _{elements, and}_O_K/I _{has at most}_an _{elements, and}

is therefore finite.

Proposition 15.8. Every non-zero ideal ofOK factors into prime ideals.

Proof. LetI be an ideal ofOK. Combining Lemmas 15.4 and 15.7 shows that only finitely many ideals ofOK containI. Therefore,I has only finitely many factors.

Now letP1be a maximal ideal containingI. Then P1 divides I, and I=P1I1 for some ideal I1. We can continue in this way. SinceI has only finitely many factors, this process must eventually halt. This gives a factorization ofIinto prime ideals (since maximal ideals are prime). Lemma 15.9. Every finite integral domain is a field.

Proof. Supposex6= 0 is an element of a finite integral domain. Since there are infinitely many powers of x, we must havexi₌_xj _{for some}_{i > j. Because we are working in an integral domain, that implies that} x(i−j)= 1. Therefore, xis invertible. Since every non-zero element of the integral domain is invertible,

it is a field.

Proposition 15.10. Every non-zero prime ideal of OK is maximal.

Proof. SupposeP is a non-zero prime ideal ofOK. Then OK/P is finite, by Proposition 15.7. SinceP is prime,OK/P is an integral domain. Therefore, it is a field, soP is maximal (by Corollary 15.5).