MATH32062 Algebraic Geometry
1 Affine algebraic varieties
1.1 Definition of affine algebraic varieties
We want to define an algebraic variety as the solution set of a collection of polynomial equations, or equivalently, a set where every element of a set of polynomials vanishes. Then every combination of those polynomials vanishes on the set, so we can use the ideal generated by those polynomials to define the algebraic variety. This motivates the definition below. Using ideals instead of concrete polynomials also turns out to be more useful for proving most of the theorems about algebraic varieties.
In the following, K denotes an arbitrary field, unless specified otherwise.
Definition. An affine algebraic variety is a set of the form
V(I) = {(a
1, a
2, . . . , a
n) ∈ K
n| f (a
1, a
2, . . . , a
n) = 0, ∀f ∈ I}, where I is an ideal I / K[x
1, x
2, . . . , x
n].
It is clear from the definition that if I
1⊆ I
2, then V(I
1) ⊇ V(I
2), but I
1$ I
2does not imply V(I
1) % V(I
2), see examples 7 and 8 below. Conversely, V(I
1) % V(I
2) does not imply I
1$ I
2. (Try to think of an example where I
16⊂ I
2, but V(I
1) ⊃ V(I
2).)
Examples:
1. If I = {0}, then V(I) = K
n.
2. If I = K[x
1, x
2, . . . , x
n], then V(I) = ∅.
3. Let I = hx
2+ y
2− 1i / R[x, y], then V(I) is the circle of radius 1 centred at the origin defined by the equation x
2+ y
2= 1 in R
2.
4. Let f
1, f
2, . . . , f
mbe polynomials of degree 1 in x
1, x
2, . . . x
n, and let I = hf
1, f
2, . . . f
mi be the ideal generated by them. Then V(I) is the set of solutions of the system of linear equations f
1= f
2= . . . = f
m= 0. Such a variety is called an affine subspace of K
n. This, in particular, includes sets containing a single point. If P = (b
1, b
2, . . . , b
n) ∈ K
n, then the ideal I = hx
1− b
1, x
2− b
2, . . . x
n− b
ni defines the variety V(I) = {P }.
5. Let I = hy
2− x
3i / R[x, y], then V(I) is the cuspidal cubic curve, also
called Neile’s semicubical parabola after William Neile, who computed its
arc length in 1657. It is defined by the equation y
2= x
3in R
2, shown below
on the left.
-0.5 0.5 1.0 1.5 2.0 2.5
-4 -2 2 4
-1 1 2
-4 -2 2 4
6. Let I = hy
2− x
3− x
2i / R[x, y], then V(I) is the nodal cubic curve defined by the equation y
2= x
3+ x
2in R
2, shown above on the right.
7. If I = hx, yi / R[x, y], then V(I) = {(0, 0)} ⊂ R
2. 8. If I = hx
2, y
2i / R[x, y], then V(I) = {(0, 0)} ⊂ R
2.
9. If I = hx
2+ y
2+ z
2− 1, x
2− x + y
2i / R[x, y, z], then V(I) ⊂ R
3is a “figure of eight” curve, shown below.
-2 -1
0 1
2 -2
-1 0
1 2
-2 -1
0 1 2
10. The special linear group SL(n, R) consisting of all n × n real matrices
with determinant 1 is also an affine algebraic variety. The set of all n × n
real matrices is R
n2, the determinant is a polynomial in the entries of the matrix, therefore SL(n, R) = V(hdet(A) − 1i).
Note: Examples 7 and 8 show that different ideals can define the same variety.
The exact nature of the correspondence between ideals and varieties will be investigated later in Section 1.3.
What do affine algebraic varieties look like? There is not a way of telling whether a set is an affine algebraic variety or not just by looking at it. Her- wig Hauser’s gallery of algebraic surfaces https://homepage.univie.ac.
at/herwig.hauser/bildergalerie/gallery.html contains a large number of algebraic surfaces and shows that affine algebraic varieties can have lots of different shapes.
Proposition 1.1 Let K be a field. Let f
1, f
2, . . . , f
m∈ K[x
1, x
2, . . . , x
n] and let I = hf
1, f
2, . . . , f
mi. Then
V(I) = {(a
1, a
2, . . . , a
n) ∈ K
n| f
i(a
1, a
2, . . . , a
n) = 0, ∀i, 1 ≤ i ≤ m}.
Proof. f
i∈ I for each i, 1 ≤ i ≤ m, so if (a
1, a
2, . . . , a
n) ∈ V(I), then f
i(a
1, a
2, . . . , a
n) = 0 for every i, 1 ≤ i ≤ m.
Conversely, assume that f
i(a
1, a
2, . . . , a
n) = 0 for every i, 1 ≤ i ≤ m. Any f ∈ I can be written as f =
m
P
i=1
f
ig
ifor suitable g
i∈ K[x
1, x
2, . . . , x
n], 1 ≤ i ≤ m, therefore
f (a
1, a
2, . . . , a
n) =
m
X
i=1
f
i(a
1, a
2, . . . , a
n)g
i(a
1, a
2, . . . , a
n) = 0, so (a
1, a
2, . . . , a
n) ∈ V(I).
Definition. A ring R is Noetherian if and only if all of its ideals can be generated by finitely many elements. (Equivalently, R is Noetherian if and only if for any increasing sequence of ideals I
1⊆ I
2⊆ I
3⊆ . . . ⊆ I
n⊆ I
n+1. . . there exists an N such that I
n= I
Nfor all n ≥ N .)
Theorem 1.2 (Hilbert Basis Theorem) (Not examinable) If K is a field, then the polynomial ring K[x
1, x
2, . . . , x
n] is Noetherian for any n ≥ 0.
Note: The Hilbert Basis Theorem justifies why we only considered ideals generated by a finite set of polynomials in Proposition 1.1. The combination of Proposition 1.1 and Theorem 1.2 shows that any affine algebraic variety is the set of solutions of finitely many polynomial equations.
The proposition below describes some methods which can be used to con-
struct further affine algebraic varieties from known ones.
Proposition 1.3 (i) Let V
1= V(I
1), V
2= V(I
2), . . . , V
k= V(I
k) be affine algebraic varieties in K
n. Then
V
1∪ V
2∪ . . . ∪ V
k= V(I
1∩ I
2∩ . . . ∩ I
k) = V(I
1I
2. . . I
k) is also an affine algebraic variety.
(ii) Let V
α= V(I
α), α ∈ A be affine algebraic varieties in K
n. Then
\
α∈A
V
α= V
X
α∈A
I
αis also an affine algebraic variety.
(iii) Let V
1= V(I
1) ⊆ K
m, V
2= V(I
2) ⊆ K
nbe affine algebraic varieties.
Then V
1× V
2⊆ K
m× K
n∼ = K
m+nis also an affine algebraic variety.
Proof. (i) Let’s assume first that k = 2. Let P ∈ V
1∪ V
2and let f ∈ I
1∩ I
2. If P ∈ V
1, then f (P ) = 0 because f ∈ I
1, if P ∈ V
2, then f (P ) = 0 because f ∈ I
2. Therefore in either case we have f (P ) = 0, so P ∈ V(I
1∩ I
2), V
1∪ V
2⊆ V(I
1∩ I
2). We also have V(I
1∩ I
2) ⊆ V(I
1I
2), therefore
V
1∪ V
2⊆ V(I
1∩ I
2) ⊆ V(I
1I
2) (1.1) (I
1I
2⊆ I
1∩ I
2, since all the elements of the form i
1i
2, i
1∈ I
1, i
2∈ I
2are contained in both I
1and I
2by the definition of an ideal, and if all the generators of I
1I
2are elements of I
1∩ I
2, then necessarily I
1I
2⊆ I
1∩ I
2. This means that the elements of V(I
1∩ I
2) have to satisfy all the polynomials in I
1I
2, therefore V(I
1∩ I
2) ⊆ V(I
1I
2).)
Let now P ∈ K
n\ (V
1∪ V
2). Then P / ∈ V
1, so there exists f
1∈ I
1such that f
1(P ) 6= 0. Similarly, P / ∈ V
2, so there exists f
2∈ I
2such that f
2(P ) 6= 0.
Then f
1f
2∈ I
1I
2and (f
1f
2)(P ) = f
1(P )f
2(P ) 6= 0, therefore P / ∈ V(I
1I
2).
This implies V(I
1I
2) ⊆ V
1∪ V
2. By combining this with (1.1), we obtain V
1∪ V
2= V(I
1∩ I
2) = V(I
1I
2).
For k > 2, use induction on k.
(ii) Assume that P ∈ T
α∈A
V
α. Then P ∈ V
α= V(I
α) for every α ∈ A, so f
α(P ) = 0 for every α ∈ A and f
α∈ I
α. Any f ∈ P
α∈A
I
αcan be written as f = P
α∈A
f
αwith f
α∈ I
αfor every α ∈ A and f
α= 0 for all but finitely many α. Hence f (P ) = P
α∈A
f
α(P ) = P
α∈A
0 = 0, so P ∈ V
P
α∈A
I
α. This implies
\
α∈A
V
α⊆ V
X
α∈A
I
α. (1.2)
Assume now that P ∈ K
n\ T
α∈A
V
α. Then there exists α
0∈ A such that P / ∈ V
α0, therefore there exists f ∈ I
α0such that f (P ) 6= 0. Now f ∈ P
α∈A
I
α, therefore P / ∈ V( P
α∈A
I
α). This implies that
V
X
α∈A
I
α⊆ \
α∈A
V
α.
By combining this with (1.2), we obtain V
X
α∈A
I
α= \
α∈A
V
α.
(iii) Let x
1, x
2, . . . , x
mbe co-ordinates on K
m, and y
1, y
2, . . . , y
nco- ordinates on K
n. Let J
1= hI
1i / K[x
1, x
2, . . . , x
m, y
1, y
2, . . . , y
n]. (I
1is an ideal in K[x
1, . . . , x
m], J
1is the ideal generated by the elements of I
1in the bigger ring K[x
1, x
2, . . . , x
m, y
1, y
2, . . . , y
n].) We claim that V(J
1) = V
1× K
n. Let P = (a
1, a
2, . . . , a
m, b
1, b
2, . . . , b
n) ∈ V(J
1). f (P ) = 0 for every f ∈ I
1, since I
1⊂ J
1. As f only involves the variables x
1, x
2, . . . , x
m,
0 = f (P ) = f (a
1, a
2, . . . , a
m, b
1, b
2, . . . , b
n) = f (a
1, a
2, . . . , a
m).
Hence (a
1, a
2, . . . , a
m) ∈ V(I
1) = V
1, so P ∈ V
1× K
n, therefore V(J
1) ⊆ V
1× K
n.
Let now P = (a
1, a
2, . . . , a
m, b
1, b
2, . . . , b
n) ∈ V
1× K
n. Let f ∈ J
1. f can be written as f =
r
P
i=1
f
ig
i, where f
i∈ I
1and g
i∈ K[x
1, x
2, . . . , x
m, y
1, y
2, . . . , y
n].
Now f
i(P ) = f
i(a
1, a
2, . . . , a
m, b
1, b
2, . . . , b
n) = f
i(a
1, a
2, . . . , a
m) = 0. (The second equality holds because f
ionly involves on the variables x
1, x
2, . . . , x
m, and the last equality holds because P ∈ V
1× K
n.) Therefore f (P ) =
r
P
i=1
f
i(P )g
i(P ) = 0, so P ∈ V(J
1). This means V
1×K
n⊆ V(J
1), by combining this with V(J
1) ⊆ V
1× K
nproved above, we obtain V(J
1) = V
1× K
n. Similarly, if we define J
2= hI
2i/K[x
1, x
2, . . . , x
m, y
1, y
2, . . . , y
n], then V(J
2) = K
m× V
2.
Hence
V
1× V
2= (V
1× K
n) ∩ (K
m× V
2) = V(J
1) ∩ V(J
2) = V(J
1+ J
2)
by (ii).
In practical calculations, the ideals defining the varieties are given by sets of generators.
To find a generating set for an ideal defining the union of the varieties, take all the possible products of the generators, one factor from each ideal.
To find a generating set for an ideal defining the intersection of the varieties, take the union of the generating sets of the ideals.
To find a generating set for an ideal defining the Cartesian product of the varieties, take the union of the generating sets of the ideals, but remember that this ideal will be in a different ring.
Examples:
1. Find an ideal I / R[x, y] such that V(I) = {(1, 1), (2, −3)}.
Let I
1= hx − 1, y − 1i and I
2= hx − 2, y + 3i. Then V(I
1) = {(1, 1)} and V(I
2) = {(2, −3)}. By Proposition 2.3 (i),
I = I
1I
2= h(x − 1)(x − 2), (x − 1)(y + 3), (y − 1)(x − 2), (y − 1)(y + 3)i is a suitable ideal.
2. Let V
1= V(hx
21+ x
22− 1i) ⊂ R
2and let V
2= V(hy
1− y
2i) ⊂ R
2. Then V
1× V
2= Vhx
21+ x
22− 1, y
1− y
2i ⊂ R
4is a cylinder.
Definition. Let K be a field and let V ⊆ K
nbe an affine algebraic variety.
W is a subvariety of V if and only if W ⊆ V and W is also an affine algebraic variety.
Remark. By Proposition 1.3 (i) and (ii), affine algebraic varieties behave
like closed sets in a metric space. This motivates the introduction of the
Zariski topology on an affine algebraic variety, in which the closed sets are the
subvarieties and the open sets are the complements of subvarieties. There
are, however, certain differences from metric spaces, for example, any two
non-empty Zariski open sets in R
nhave a non-empty intersection.
1.2 Affine spaces
Affine spaces are the simplest algebraic varieties, they are the solutions sets of system of linear equations, i. e., they are defined by ideals generated by degree 1 polynomials.
Definition. Let K be a field, and let V be a finite dimension vector space over K. U ⊆ V is an affine subspace of V if and only if U = ∅ or U = u
0+ W = {u
0+ w | w ∈ W }, where u
0∈ U and W is a linear subspace of V . (Linear subspace or vector subspace of a vector space means subspace as defined in MATH10202 or MATH10212.)
Examples: K
nas a subspace of itself, ∅, any set containing a single point are all affine subspaces of K
n. The line y = 2x + 5 is an affine subspace of R
2, in this case we can take W = span{(1, 2)} and u
0= (0, 5).
Proposition 1.4 Let U = u
0+W be a non-empty affine subspace in a vector space V . Then U = u + W for any u ∈ U , in other words, u
0can be chosen to be an arbitrary element of U , but W is uniquely determined by U . Proof. Let u ∈ U , then u = u
0+ (u − u
0), so u − u
0∈ W . We have
u + W = {u + w | w ∈ W } = {u
0+ (u − u
0) + w | w ∈ W }.
(u − u
0) + w ∈ W as W is a vector space, therefore u + w ∈ u
0+ W for every w ∈ W , hence u + W ⊆ u
0+ W . Similarly,
u
0+ W = {u
0+ w | w ∈ W } = {u + (u
0− u) + w | w ∈ W } ⊆ u + W, therefore u + W = u
0+ W = U .
W can be obtained as U − u = {v − u | v ∈ U } for any u ∈ U .
Definition. The dimension of an affine space U = u
0+ W is defined to be dim W . (Sometimes it is convenient to define dim ∅ = −1.)
Definition. Let K be a field. K
n, considered as an affine subspace of itself is called an n-dimensional affine space over K, and is denoted by A
n(K) or A
nif the field is understood.
Definition. Let V
1, V
2be vector spaces. An affine map from V
1to V
2is
defined to be a function Φ : V
1→ V
2which can be written in the form
Φ(x) = T (x) + b, where T : V
1→ V
2is a linear transformation and b ∈ V
2.
It is easily checked that the composition of affine maps is also an affine
map. The invertible affine maps V → V form a group. Within this group
the translations form a normal subgroup and the quotient by this normal
subgroup is the group of invertible linear transformations on V .
Proposition 1.5 (Not examinable) Let V
1, V
2be vector spaces and let Φ : V
1→ V
2be an affine map.
(i) For any affine subspace U
1⊆ V
1, the image Φ(U
1) is also an affine subspace of V
2.
(ii) For any affine subspace U
2⊆ V
2, the preimage Φ
−1(U
2) is also an affine subspace of V
1.
Proof. Let x
1, x
2, . . . , x
mbe the co-ordinates on V
1, y
1, y
2, . . . , y
nthe co-ordinates on V
2. Let Φ(x) = T (x) + b, where T : V
1→ V
2is a linear transformation and b = (b
1, b
2, . . . , b
n)
T∈ V
2is a vector. Let {t
ij}
1≤i≤n,1≤j≤mbe the matrix of T with respect to the co-ordinates on V
1and V
2.
(i) Let U
1⊆ V
1be an affine subspace. If U
1= ∅, Φ(U
1) = ∅, too. Otherwise, we can write U
1= u
1+ W
1, where u
1∈ U
1and W
1is a linear subspace of V
1. Then
Φ(U
1) = {Φ(u
1+ w) | w ∈ W
1} = {T (u
1+ w) + b | w ∈ W
1}
= {(T (u
1) + b) + T (w) | w ∈ W } = (T (u
1) + b) + T (W )
As T (W ) is a linear subspace of W
2, it follows that Φ(U
1) is an affine sub- space, as claimed.
(ii) Let now U
2be an affine subspace of V
2.
There exist degree 1 polynomials f
1, f
2, . . . , f
kin y
1, y
2, . . . , y
nsuch that U
2= {(y
1, y
2, . . . , y
n) ∈ V
2| f
i(y
1, y
2, . . . , y
n) = 0, ∀i, 1 ≤ i ≤ k}.
Then
Φ((x
1, x
2, . . . , x
m)
T) =
b
1+
m
X
j=1
t
1jx
j, b
2+
m
X
j=1
t
2jx
j, . . . , b
n+
m
X
j=1
t
njx
j T,
therefore Φ((x
1, x
2, . . . , x
m)
T) ∈ U
2if and only if
f
ib
1+
m
X
j=1
t
1jx
j, b
2+
m
X
j=1
t
2jx
j, . . . , b
n+
m
X
j=1
t
njx
j= 0
for every i, 1 ≤ i ≤ k. These are linear equations in the x
i, 1 ≤ i ≤ m, therefore Φ
−1(U
2) is an affine subspace of V
1, as claimed.
Definition. Let X, Y be subsets of a vector space V . X and Y are affine
equivalent if and only if there exist mutually inverse affine maps Φ, Ψ : V →
V such that Φ(X) = Y and Ψ(Y ) = X. It is easy to verify that affine
equivalence is an equivalence relation.
Examples:
1. All affine subspaces of the same dimension are affine equivalent. This can be proved by showing that any k-dimensional affine subspace is affine equivalent to the linear subspace spanned by the first k standard unit vectors.
2. Any two triangles in R
2are affine equivalent. An easy way to prove it is to show that any triangle is affine equivalent to the triangle with vertices (0, 0), (1, 0) and (0, 1). See https://personalpages.manchester.ac.uk/
staff/gabor.megyesi/teaching/MATH32062/triangles.pdf for an exam- ple of calculating an explicit affine equivalence between two triangles.
3. Similarly, any two parallelograms in R
2are because all parallelograms affine equivalent to the unit square.
4. Plane conics
A conic (short for a conic section) is the set of points satisfying an equation of the form ax
2+ bxy + cy
2+ dx + ey + f = 0 with at least one of a, b, c not 0.
By a sequence of change of co-ordinates corresponding to affine transforma- tions and by multiplying the equation by a non-zero scalar, every such equa- tion can be reduced to one of finitely many standard forms, which shows that there are finitely many affine equivalence classesc of conics. The tranforma- tion of the points is given inverse of the change of co-ordinates, for example, if the new co-ordinates are x
0= 2x and y
0= y + 5, then its effect on the points is the map (x, y) 7→ (x/2, y − 5).
The procedure consists of four steps. The co-ordinates after the ith step will be denoted by x
iand y
i, the coefficients by a
i, b
i, . . . , f
i.
Step 1. If a 6= 0, let x
1= x, y
1= y. If a = 0 but c 6= 0, let x
1= y, y
1= x.
If a = c = 0, then let x
1= (x + y)/2, y
1= (x − y)/2. In this case b 6= 0 and bxy = b(x
21− y
21). We have achieved that a
1, the coefficient of x
21in the new equation is not 0. If necessary, we can multiply the equation by −1 to make a
1positive.
Step 2. Complete the square with respect to x
1, let x
2= √
a
1x
1+ b
12a
1y
1+ d
12a
1and y
2= y
1. After this step, the equation will be of the form x
22+ c
2y
22+ e
2y
2+ f
2= 0.
Step 3. If c
26= 0, complete the square with respect to y
2, let x
3= x
2and y
3= p|c
2|(y
2+e
2/(2c
2)), then the equation will have the form x
23±y
23+f
3= 0.
If c
2= 0 but e
26= 0, let x
3= x
2and y
3= e
2y
2+ f
2, so the equation becomes
x
23+ y
3= 0. If c
2= e
2= 0, then let x
3= x
2and y
3= y
2, so the equation will have the form x
23+ f
3= 0.
Step 4. If f
36= 0, then let x
4= x
3/p|f
3|, y
4= y
3/p|f
3| and after the change of variable divide the whole equation by |f
3|. After this, the form of the equation will be the same, but f
3will be ±1. If f
3= 0, then let x
4= x
3, y
4= y
3.
At the end of this step, either the equation is x
24+ y
4= 0, or it contains x
24, possibly ±y
42and possibly ±1 and no other term. x
24− y
24− 1 = 0 can be tranformed into x
24− y
42+ 1 = 0 by swapping x
4and y
4, so these two are affine equivalent.
Therefore any variety in R
2defined by a degree 2 equation is affine equivalent to one of the following:
Case 1. x
2+ y
2− 1 = 0 (ellipses) Case 2. x
2+ y
2= 0 (single point) Case 3. x
2+ y
2+ 1 = 0 (∅)
Case 4. x
2− y
2− 1 = 0 (hyperbolas) Case 5. x
2− y
2= 0 (two intersecting lines) Case 6. x
2+ y = 0 (parabolas)
Case 7. x
2− 1 = 0 (two parallel lines) Case 8. x
2= 0 (“double line”)
Case 9. x
2+ 1 = 0 (∅)
It is also true, but requires a separate proof, that these cases, apart from the two empty sets are not affine equivalent. See https://personalpages.
manchester.ac.uk/staff/gabor.megyesi/teaching/MATH32062/conic.pdf for examples of transforming the equation of a conic to one of the above forms.
Over C we can use essentially the same method to reduce the equation of a conic to one of the above forms. The difference is that we do not need to multiply the equation to ensure a
1> 0 in Step 1 and there is no need to take the modulus of c
2and f
3in Steps 2 and 3, resp. If a term occurs in the final equation, it will always have coefficient +1, so we will end up with one of Cases 3, 6, 8 or 9.
Over C, the change of co-ordinates x
0= x, y
0= iy converts Case 1 into Case 4 and vice versa, so all ellipses and hyperbolas are affine equivalent over C.
By a similar argument, Cases 1, 3 and 4 are affine equivalent over C, similarly
Cases 2 and 5, and Cases 7 and 9.
1.3 The correspondence between varieties and ideals
Definition. Let X ⊆ A
n. The ideal of X is defined as
I(X) = {f ∈ K[x
1, x
2, . . . , x
n] | f (a
1, a
2, . . . , a
n) = 0, ∀(a
1, a
2, . . . , a
n) ∈ X}.
I(X) is an indeed ideal of K[x
1, x
2, . . . , x
n].
It is clear from the definition that X
1⊆ X
2implies I(X
1) ⊇ I(X
2).
Proposition 1.6
(i) If J / K[x
1, x
2, . . . , x
n], then J ⊆ I(V(J )).
(ii) X ⊆ V(I(X)) for any X ⊆ A
n(K). Equality holds if and only if X is an affine algebraic variety.
Proof. (i) If f ∈ J and P ∈ V(J ) then f (P ) = 0 by the definition of V, therefore f ∈ I(V(J )) by the definition of I, so J ⊆ I(V(J )) as claimed.
(ii) It is also clear from the definitions that X ⊆ V(I(X)) for any set X ⊆ A
nand that if equality holds then X is an affine algebraic variety.
The only thing that remains to be proved that if X is an algebraic va- riety, then X = V(I(X)). Assume that X = V(J ) for some ideal J / K[x
1, x
2, . . . , x
n]. By (i), J ⊆ I(V(J )), therefore X = V(J ) ⊇ V(I(V(J ))) = V(I(X)). We have already seen that X ⊆ V(I(X)), therefore X = V(I(X)), as required.
Definition. A field K is algebraically closed if and only if every polynomial of degree at least 1 in K[x] has a root in K.
Examples: C and ¯ Q (the field of algebraic numbers in C) are algebraically closed. Q, R and the finite fields are not algebraically closed.
Remark. For any field K, there exists a minimal algebraically closed field containing it, which is unique up to isomorphism, this field is called the algebraic closure of K. For example, the algebraic closure of R is C and the algebraic closure of Q is ¯ Q
Definition. Let R be a (commutative) ring. Let I /R be an ideal. The radical of I, denoted by √
I or rad I, is
√
I = {a ∈ R | ∃n ∈ N such that a
n∈ I}.
I is called a radical ideal if and only if √ I = I.
Algebraic facts: For any ideal I, √
I is also an ideal, I ⊆ √
I and p√
I = √ I, i. e., √
I is a radical ideal.
Theorem 1.7 (Hilbert’s Nullstellensatz) Let K be an algebraically closed field.
(i) Every maximal ideal of K[x
1, x
2, . . . , x
n] is of the form hx
1− a
1, x
2− a
2, . . . , x
n− a
ni, where a
i∈ K, 1 ≤ i ≤ n.
(ii) If J is an ideal of K[x
1, x
2, . . . , x
n], J 6= K[x
1, x
2, . . . , x
n] , then V(J ) 6= ∅.
(iii) I(V(J )) = √
J for any J / K[x
1, x
2, . . . , x
n].
Proof. (i) Not proved here.
Note that hx
1−a
1, x
2−a
2, . . . , x
n−a
ni is a maximal ideal of K[x
1, x
2, . . . , x
n] for any field K. We can define a ring homomorphism K[x
1, x
2, . . . , x
n] → K, f 7→ f (a
1, a
2, . . . , a
n), whose kernel is exactly hx
1− a
1, x
2− a
2, . . . , x
n− a
ni.
Since the image is a field, the kernel is a maximal ideal. The substance of this part is that over an algebraically closed field the converse is also true.
(ii) As J 6= K[x
1, x
2, . . . , x
n], there exists a maximal ideal m containing J . By (i), m = hx
1− a
1, x
2− a
2, . . . , x
n− a
ni for some a
i∈ K, 1 ≤ i ≤ n.
Therefore {(a
1, a
2, . . . , a
n)} = V(m) ⊆ V(J ), so V(J ) 6= ∅.
(iii) If f ∈ √
J , then f
k∈ J for some positive integer k. Hence f
k(P ) = (f (P ))
k= 0 for every P ∈ V(J ), therefore f (P ) = 0 for every P ∈ V(J ), i. e., f ∈ I(V(J )). This means that √
J ⊆ I(V(J )) without assuming that K is algebraically closed.
We need to prove the other inclusion, I(V(J )) ⊆ √
J . Let f ∈ I(V(J )).
Let’s introduce a new variable y. Let f
1, f
2, . . . , f
rbe a set of generators for J , and let J
1= hf
1, f
2, . . . , f
r, f y − 1i / K[x
1, x
2, . . . , x
n, y].
We claim that V(J
1) = ∅. Assume that P = (a
1, a
2, . . . , a
n, b) ∈ V(J
1) ⊆ A
n+1. As f
i∈ J
1for every i, 1 ≤ i ≤ r, we have f
i(a
1, a
2, . . . , a
n) = 0 for every i, 1 ≤ i ≤ r, so (a
1, a
2, . . . , a
n) ∈ V(J ). Therefore f (a
1, a
2, . . . , a
n) = 0, too, but then (f y − 1)(P ) = −1, so P / ∈ V(J
1) after all. Therefore V(J
1) = ∅, as claimed.
By (ii), this implies that J
1= K[x
1, x
2, . . . , x
n, y], in particular 1 ∈ J
1. This means that we can write
1 = (f y − 1)g
0+
r
X
i=1
g
if
ifor suitable g
0, g
1, . . . , g
r∈ K[x
1, x
2, . . . , x
n, y]. Let y
Nbe the highest power
of y occurring in any of g
0, g
1, . . . , g
r. Let’s multiply the above equality
by f
N. Whenever there is a power y, say y
k, occurring in g
ifor some i,
0 ≤ i ≤ r, we can write y
kf
Nas (f y)
kf
N −k, so for each i, 0 ≤ i ≤ r,
f
Ng
i(x
1, x
2, . . . , x
n, y) = G
i(x
1, x
2, . . . , x
n, f y) for a suitable polynomial G
i.
Therefore
f
N= (f y − 1)G
0(x
1, x
2, . . . , x
n, f y) +
r
X
i=1
f
iG
i(x
1, x
2, . . . , x
n, f y).
By substituting y = 1/f into the above equality, we obtain
f
N=
r
X
i=1
f
iG
i(x
1, x
2, . . . , x
n, 1).
The right-hand side is an element of J , therefore f ∈ √
J as required.
The function V from ideals of K[x
1, x
2, . . . , x
n] to affine algebraic varieties
in A
nis not injective, and the function I from subsets of A
nto ideals of
K[x
1, x
2, . . . , x
n] is neither injective nor surjective. However, the combina-
tion of Proposition 1.6 and the Nullstellensatz implies that V and I create
a bijection between radical ideals of K[x
1, x
2, . . . , x
n] and affine algebraic
varieties in A
n.
1.4 Irreducibility
Definition. An affine algebraic variety W is reducible if and only if it can be written as W = V
1∪V
2, where W
1, W
2are also affine algebraic varieties, W
16=
V 6= V
2. If W is not reducible, it is called irreducible. (The irreducibility of W is equivalent to saying that if W = V
1∪ V
2, where W
1, W
2are also affine algebraic varieties, then W
1= V or W
2= V .)
Examples:
1. The variety V(hx
2−y
2i) ⊂ R
2is reducible, since it is the union of V(hx−yi) and V(hx + yi), which are the lines y = x and y = −x.
2. Any variety consisting of a single point is obviously irreducible.
3. Warning: Graphs can be useful in finding the irreducible components, but they can also be misleading. The graph below shows the the curve
y
2+ (x
2− 4)(x
2− 1) = 0
in R
2. Topologically it consists of two disjoint connected components, but it is irreducible, the two parts cannot be separated algebraically.
-2 -1 1 2
-2 -1 1 2
Theorem 1.8 Every affine algebraic variety V can be decomposed into a union V = V
1∪ V
2∪ . . . ∪ V
ksuch that every V
i, 1 ≤ i ≤ k, is an irreducible affine algebraic variety and V
i6⊆ V
jfor i 6= j. The decomposition is unique up to the ordering of the components.
Definition. The V
i, 1 ≤ i ≤ k, in the above theorem are called the irreducible components of V .
Proof. We call an affine algebraic variety good if it can be written as the union of finitely many irreducible affine algebraic varieties and bad otherwise. We shall prove first that all affine algebraic varieties are good.
Assume that there exists a bad affine algebraic variety.
Claim: There exists a bad affine algebraic variety W such that for all affine
algebraic varieties W
0, I(W ) $ I(W
0) implies that W
0is good.
Assume that this claim is false. Let V
1be a bad affine algebraic variety, then there exists a bad variety V
2such that I(V
1) $ I(V
2). Similarly, there exists a bad variety V
3such that I(V
2) $ I(V
3). Continuing like this, we can construct an infinite sequence of bad varieties V
1, V
2, V
3, . . . , such that
I(V
1) $ I(V
2) $ I(V
3) $ . . . $ I(V
m) $ I(V
m+1) $ . . . .
The I(V
m), m = 1, 2, 3, . . . are ideals in the polynomial ring K[x
1, x
2, . . . , x
n] for some n, which is Noetherian by the Hilbert Basis Theorem (Theorem 1.2), so it does not contain a strictly increasing infinite sequence of ideals.
This contradiction shows that the claim is true.
Let now W be a bad affine algebraic variety with the property described in the claim. W cannot be irreducible, since any irreducible variety is good, because it is the union of one irreducible variety. Therefore W is reducible, there exist affine algebraic varieties W
1and W
2such that W
16= W 6= W
2and W = W
1∪ W
2. W
i⊂ W implies I(W
i) ⊇ I(W ) for i = 1, 2. By Proposition 1.6 (ii), V(I(W )) = W and V(I(W
i)) = W
ifor i = 1, 2, so W
i6= W implies I(W
i) % I(W ) for i = 1, 2. By the assumption on W , this implies that W
1and W
2are both good, they are unions of finitely many irreducible affine algebraic varieties. Then W = W
1∪ W
2is also the union of finitely many irreducible affine algebraic varieties, contradicting the assumption that W is bad.
This shows that every affine algebraic variety V can be written as a finite union of irreducible affine algebraic varieties, then discarding the varieties which are contained in one of the terms, we can write V = V
1∪ V
2. . . ∪ V
kwith V
iirreducible for each i, 1 ≤ i ≤ k and V
i6⊆ V
jif i 6= j.
Let V = V
10∪ V
20∪ . . . ∪ V
l0be another decomposition of V into irreducible varieties with V
i06⊆ V
j0for i 6= j. Then
V
1= V
1∩ V = (V
1∩ V
10) ∪ (V
1∩ V
20) ∪ . . . ∪ (V
1∩ V
l0).
As V
1is irreducible, there exists i such that V
1= V
1∩ V
i0, i. e., V
1⊆ V
i0. By applying this argument to V
i0, we obtain that there exists j such that V
i0⊆ V
j, therefore V
1⊆ V
j. This implies j = 1 by our assumption on the V
1, V
2, . . . , V
k, and therefore V
1= V
i0.
By applying this argument to each V
r(1 ≤ r ≤ k) and V
s0(1 ≤ s ≤ l), we obtain a bijection between {V
1, V
2, . . . , V
k} and {V
10, V
20, . . . , V
l0}, so the decompositions are the same apart from possibly the order of the components.
Definition. Let R be a commutative ring. An ideal I / R is called a prime
ideal if and only if ab ∈ I implies a ∈ I or b ∈ I.
Proposition 1.9 An affine algebraic variety W is irreducible if and only if I(W ) is prime.
Proof. We shall prove the contrapositive, namely that W is reducible if and only if I(W ) is not prime.
Assume that W is reducible, then W can be written as W = W
1∪ W
2, where W
1and W
2are affine algebraic varieties and W 6= W
1, W 6= W
2. W
1⊂ W implies I(W
1) ⊇ I(W ), but V(I(W )) = W and V(I(W
1)) = W
1by Proposition 1.6 (ii), therefore I(W
1) 6= I(W ), so I(W
1) % I(W ). Let f
1∈ I(W
1) \ I(W ). Similarly, let f
2∈ I(W
2) \ I(W ). If P ∈ W
1, then (f
1f
2)(P ) = 0 because f
1(P ) = 0, if P ∈ W
2, then (f
1f
2)(P ) = 0 because f
2(P ) = 0, therefore (f
1f
2)(P ) = 0 for every P ∈ W , so f
1f
2∈ I(W ).
Neither f
1, nor f
2is an element of I(W ), therefore I(W ) is not prime. This shows that if I(W ) is prime, then W is irreducible.
Assume now that I(W ) is not prime. Let f
1, f
2be polynomials such that f
1, f
2∈ I(W ), but f /
1f
2∈ I(W ). Let
W
1= {P ∈ W | f
1(P ) = 0} = W ∩ V(hf
1i) = V(hI(W ), f
1i) and similarly
W
2= {P ∈ W | f
2(P ) = 0} = W ∩ V(hf
2i) = V(hI(W ), f
2i).
W
1, W
2are affine algebraic varieties. Clearly W
1⊆ W , W
2⊆ W , but W
16= W since f
1∈ I(W ) and similarly W /
26= W since f
2∈ I(W ). For any / P ∈ W , 0 = (f
1f
2)(P ) = f
1(P )(f
2)(P ), so f
1(P ) = 0 or f
2(P ) = 0. In the first case, P ∈ W
1, in the second P ∈ W
2, therefore W = W
1∪ W
2and W is reducible. This shows that if W is irreducible, then I(W ) is prime.
Remark. The criterion involves I(W ), so if W = V(J ) for some ideal J , the primality or otherwise of J is not sufficient in general to tell whether W is irreducible or not. If the field is algebraically closed, then I(V(J )) = √
J by the Nullstellensatz. If J is prime, then it is automatically radical, so I(V(J)) = J and V(J) is irreducible. If J is radical but not prime, then V(J) is reducible.
The functions V and I create a bijection between prime ideals of K[x
1, x
2, . . . , x
n] and irreducible affine algebraic varieties in A
n.
Properties of irreducible varieties and examples
1. Irreducibility is preserved under affine equivalence. If X and Y are affine
equivalent affine algebraic varieties, then X is irreducible if and only if Y
is. Let Φ : X → Y be an affine equivalence and assume that X can be
written as X = X
1∪ X
2, where X
1, X
2are also affine algebraic varieties and X
16= X 6= X
2, then Y = Φ(X
1) ∪ Φ(X
2), Φ(X
1), Φ(X
2) are also affine algebraic varieties and Φ(X
1) 6= Y 6= Φ(X
2).
2. Affine subspaces over an infinite field K are irreducible. (Note that al- gebraically closed fields are infinite.) Every affine subspace is equivalent to one of the form x
1= x
2= . . . = x
k= 0 for some k. It can be proved that I(V(hx
1, x
2, . . . , x
ki)) = hx
1, x
2, . . . , x
ki if K is infinite and that hx
1, x
2, . . . , x
ki/K[x
1, x
2, . . . , x
n] is a prime ideal directly from the definition.
3. Let K be an algebraically closed field. Let H ⊂ A
n(K) be a hypersur- face, that is, an algebraic variety defined by an ideal generated by a sin- gle polynomial f ∈ K[x
1, x
2, . . . , x
n], H = V(hf i). f can be factorised as f = f
1α1f
2α2· · · f
rαr, where the f
i, 1 ≤ i ≤ r, are irreducible polynomials such that f
iis not a scalar multiple of f
jif i 6= j, and α
i, 1 ≤ i ≤ r, are positive integers. This factorisation is unique up to scalar factors.
We shall now show that phfi = hf
1f
2· · · f
ri. Let g ∈ phfi. g can be factorised as g = g
1β1g
β22· · · g
βss, where the g
i, 1 ≤ i ≤ s, are irreducible polynomial such that g
iis not a scalar multiple of g
jif i 6= j, and β
i, 1 ≤ i ≤ s, are positive integers. g ∈ phfi means that there exists a positive integer m such that g
m= g
mβ1 1g
2mβ2· · · g
smβs∈ hf i, therefore for each i, 1 ≤ i ≤ r, f
i|g
mβ1 1g
2mβ2· · · g
smβs. As f
iis irreducible, it must divide g
jfor some j, 1 ≤ j ≤ s, so f
i|g. The f
iare pairwise coprime, therefore it follows that f
1f
2. . . f
r|g, g ∈ hf
1f
2· · · f
ri. Conversely, if g ∈ hf
1f
2· · · f
ri, then g
max{α1,α2,...,αr}∈ hf i. This shows that I(H) = phfi = hf
1f
2· · · f
ri.
By Proposition 1.3 (i), S
ri=1
V(hf
ii) = Vhf
1f
2· · · f
ri = H. By the argument used previously, hf
ii / K[x
1, x
2, . . . , x
n] is a prime ideal, therefore it is radical, so I(V(hf
ii)) = hf
ii by the Nullstellensatz, by using the fact that hf
ii is prime again, it follows that V(hf
ii) is irreducible. V(hf
ii) does not contain V(hf
ji) if i 6= j, because f
jis not a multiple of f
i. This shows that the irreducible components are V(hf
ii), 1 ≤ i ≤ r.
In particular, H is irreducible if and only if r = 1, i. e., f is the power of a single irreducible polynomial.
Lemma 1.10 Let n ≥ 2 be an integer, let K be a field and let p ∈ K[x
1, x
2, . . . , x
n] be a polynomial which does not involve the variable x
ifor some i, 1 ≤ i ≤ n, and which is not a square of a polynomial in K[x
1, x
2, . . . , x
n]. Then x
2i− p is irreducible in K[x
1, x
2, . . . , x
n].
Proof. We can assume for simplicity that i = n. Let’s assume that for a
contradiction that x
2n−p can be factorised. By considering it as a polynomial
in x
nwith coefficients in K[x
1, x
2, . . . , x
n−1], there are two possibilities: either
one factor has degree 2 in x
nand the other degree 0, or both factors have degree 1.
In the first case the factors are ax
2n+ bx
n+ c and d, where a, b, c, d ∈ K[x
1, x
2, . . . , x
n−1]. From the coefficient of x
2nwe get ad = 1, therefore a and d must be constants, so we do not get a proper factorisation.
In the second case the factors are ax
n+ b and cx
n+ d with a, b, c, d ∈ K[x
1, x
2, . . . , x
n−1]. Similarly to the previous case, from the coefficient of x
2nwe get ac = 1, therefore a and c must be constants. By dividing the first factor by a and multiplying the second by a, we can assume that a = c = 1.
Then x
2n− p = (x
n+ b)(x
n+ d) = x
2n+ (b + d)x
n+ bd, so by comparing the coefficient of x
nand the degree 0 term in x
nwe get b + d = 0 and −p = bd.
Hence d = −b, so −p = −b
2, p = b
2, but p is not a square, this is a contra- diction, therefore x
2n− p is irreducible as claimed. .
This simple lemma has lots of useful consequences.
4. Ellipses, hyperbolas, parabolas are irreducible over C and R.
x
2+ y
2− 1 = x
2− (1 − y
2) and 1 − y
2is not a square, therefore x
2+ y
2− 1 is irreducible in C[x, y] by Lemma 1.10 and hx
2+ y
2− 1i / C[x, y] is a prime ideal. Prime ideals are radical, so by the Nullstellensatz (Theorem 1.7), I(V(hx
2+ y
2− 1i)) = hx
2+ y
2− 1i. By Proposition 1.9, V(hx
2+ y
2− 1i) is irreducible over C as hx
2+ y
2− 1i / C[x, y] is a prime ideal.
It can also be proved that I(V(hx
2+ y
2− 1i)) = hx
2+ y
2− 1i over R, too. The main idea is that if x
2+ y
2− 1 does not divide f ∈ R[x, y], then f (x, y) = x
2+ y
2− 1 = 0 only has finitely many solutions, but the unit circle x
2+ y
2− 1 = 0 has infinitely many points over R, therefore f / ∈ I(V(hx
2+ y
2− 1i)). The rest of the proof is the same as over C.
Any ellipse is affine equivalent to V(hx
2+y
2−1i) over either C or R, therefore all ellipses are irreducible.
The proof for hyperbolas and parabolas is similar.
5. If p(x) ∈ C[x] is a polynomial of odd degree, then V(hy
2− p(x)i) is irreducible over C, as p(x) cannot be a square.
This includes as special cases the nodal cubic curve, V(hx
3+ x
2− y
2i), the
cuspidal cubic curve V(hx
3− y
2i) and elliptic curves (to be studied later in
the course).
-1 1 2
-4 -2 2 4
-0.5 0.5 1.0 1.5 2.0 2.5
-4 -2 2 4
Nodal cubic Cuspidal cubic
6. Let C = V(hx
2y
2+ y
4− x
2− 2y
2+ 1i) ⊂ A
2(C). This polynomial factorises as
x
2y
2+ y
4− x
2− 2y
2+ 1 = (y
2− 1)
2+ x
2(y
2− 1) = (x
2+ y
2− 1)(y
2− 1)
= (x
2+ y
2− 1)(y − 1)(y + 1)
y − 1 and y + 1 have degree 1, so they are irreducible. We have already showed that x
2+ y
2− 1 is irreducible. Therefore the irreducible components of C are the lines V(hy − 1i), V(hy + 1i) and V(hx
2+ y
2− 1i).
-2 -1 1 2
-1.5 -1.0 -0.5 0.5 1.0 1.5