Number Theory for Cryptography (Lecture Notes)



DTU SP²

University of Luxembourg

Gabor Wiese

In these lectures (8 hours taught in November 2020), we mention some topics from (algebraic) number theory as well as some related concepts from (algebraic) geometry that can be useful in cryptography. We cannot go deeply into any of the topics and most results will be presented without any proofs.

One of the things that one encounters are ‘ideal lattices’. In the examples I saw, this was nothing but (an ideal in) an order in a number field, which is one of the concepts that we present here in its mathematical context (i.e. embedded in a conceptual setting). It has been noted long ago (already in the 19th century) that number fields and function fields of curves have many properties in common. Accordingly, we shall also present some basic topics on affine plane curves and their function fields. This leads us to mention elliptic curves, however, only in an affine version (instead of the better projective one); we cannot go deeply into that topic at all.

The material presented here is classical and very well known. Large parts of these lecture notes are taken from my lecture notes for the lectures Commutative Algebra and Algebraic Number Theory (the latter written in collaboration with Sara Arias-de-Reyna) for the Master in Mathematics at the University of Luxembourg [Wie]. They were, in turn, heavily influenced by a number of sources, such as Neukirch: Algebraic Number Theory [Neu99], Lorenzini: An Invitation to Arithmetic Geometry [Lor96], Atiyah-Macdonald: Introduction to Commutative Algebra [AM69], Stevenhagen: Number Rings [Ste17].

1 Integers and functions

1.1 Number fields and field extensions

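Example 1.1. Let $d \in \mathbb{Z}$ be different from $0, 1$ and not a square. Put

\[ \mathbb{Q}(\sqrt{d}) = \{ a + b\sqrt{d} \in \mathbb{C} \mid a, b \in \mathbb{Q} \} \subseteq \mathbb{C}. \]

We say that $\mathbb{Q}(\sqrt{d})$ is a quadratic field.

In particular, $\mathbb{Q}(\sqrt{d})$ is a field: we can add, subtract, multiply and divide elements of $\mathbb{Q}(\sqrt{d})$ without leaving this set (except division by zero, of course).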

Exercise: Make these four operations explicit.
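One possible way to check an answer to this exercise is the following minimal Python sketch. It stores $a + b\sqrt{d}$ as a pair `(a, b)` of rationals; the function names `q_add`, `q_sub`, `q_mul`, `q_inv`, `q_div` are hypothetical and chosen only for this illustration.

```python
from fractions import Fraction

# An element a + b*sqrt(d) of Q(sqrt(d)) is stored as a pair (a, b) of Fractions.

def q_add(x, y):
    return (x[0] + y[0], x[1] + y[1])

def q_sub(x, y):
    return (x[0] - y[0], x[1] - y[1])

def q_mul(x, y, d):
    # (a + b*sqrt(d)) * (c + e*sqrt(d)) = (ac + d*be) + (ae + bc)*sqrt(d)
    (a, b), (c, e) = x, y
    return (a * c + d * b * e, a * e + b * c)

def q_inv(x, d):
    # 1/(a + b*sqrt(d)) = (a - b*sqrt(d)) / (a^2 - d*b^2); the denominator
    # vanishes only for a = b = 0 because d is not a square.
    a, b = Fraction(x[0]), Fraction(x[1])
    n = a * a - d * b * b
    if n == 0:
        raise ZeroDivisionError("cannot invert 0 in Q(sqrt(d))")
    return (a / n, -b / n)

def q_div(x, y, d):
    return q_mul(x, q_inv(y, d), d)

x = (Fraction(1), Fraction(2))     # 1 + 2*sqrt(2)
y = (Fraction(3), Fraction(-1))    # 3 - sqrt(2)
print(q_mul(x, y, 2))              # the product, as a pair (a, b)
print(q_div(x, x, 2))              # the element 1, i.e. the pair (1, 0)
```

The inverse is obtained by multiplying numerator and denominator with the conjugate $a - b\sqrt{d}$, which is the only non-obvious step among the four operations.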

Example 1.2. Let $n \in \mathbb{Z}_{\ge 1}$ be a positive integer. Let $\zeta_n = \exp(2\pi i/n) = \cos(2\pi/n) + i\sin(2\pi/n) \in \mathbb{C}$.

Note that $\zeta_n$ lies on the unit circle. It is a primitive $n$-th root of unity, meaning $\zeta_n^n = 1$ and $\zeta_n^j \neq 1$ for all $1 \le j \le n-1$. Put

\[ \mathbb{Q}(\zeta_n) = \Big\{ \sum_{j=0}^{n-1} a_j \zeta_n^j \in \mathbb{C} \;\Big|\; a_0, \dots, a_{n-1} \in \mathbb{Q} \Big\}. \]

We say that $\mathbb{Q}(\zeta_n)$ is the $n$-th cyclotomic field. In cryptography, one sometimes encounters $\mathbb{Q}(\zeta_{2^m})$ and its ‘ideal lattice’ (ring of integers, see below).

In particular, $\mathbb{Q}(\zeta_n)$ is a field. If this is not clear here, it will become clear below.

Both these examples are number fields.

Definition 1.3. A number field is a field $F \subset \mathbb{C}$ which is a finite dimensional $\mathbb{Q}$-vector space. That means that there are elements $y_0, \dots, y_{n-1} \in F$ such that every $x \in F$ can be written as

\[ x = \sum_{j=0}^{n-1} a_j y_j \]

with unique $a_0, \dots, a_{n-1} \in \mathbb{Q}$.

Even though number fields (and ideal lattices) are one of the principal motivations for these objects in this course, we shall work more abstractly so that we are flexible and can transport results to other settings.

Definition 1.4. Let $F$ be a field. If $K \subseteq F$ is a subfield of $F$, then we say that $K \subseteq F$ is a field extension.

A different piece of notation (used in mathematical literature) for a field extension is $F/K$.

The following lemma is fundamental because it allows us to use linear algebra in number theory!

Lemma 1.5. Let $K \subseteq F$ be a field extension. Then $F$ is a $K$-vector space.

More precisely: As a field, $F$ has addition and multiplication (and also their ‘inverses’: subtraction and division). To make $F$ into a $K$-vector space, we need an addition; we just take the addition that we already have. We also need a ‘scalar multiplication’, i.e. we must be able to multiply an element of $K$ with an element of $F$; we again take the multiplication that we already have. Then it is very easy to check that the associativity, commutativity and distributivity relations that one has in a field imply all the axioms in the definition of a vector space.

Definition 1.6. Let $K \subseteq F$ be a field extension. The degree of the field extension is defined as

\[ [F : K] := \dim_K(F), \]

the dimension of $F$ as a $K$-vector space. This can be finite or infinite.

If $[F : K]$ is finite, then we say that $K \subseteq F$ is a finite field extension.

Remark 1.7. A number field $F$ is hence a field (subfield of $\mathbb{C}$) such that $\mathbb{Q} \subseteq F$ is a finite field extension.

Definition 1.8. Let $K \subseteq F$ be a field extension. A polynomial $f \in K[X]$ is said to be the minimal polynomial of $a \in F$ over $K$ if

• it is monic, i.e. its leading coefficient is $1$,

• $a$ is a root of $f$, i.e. $f(a) = 0$,

• any polynomial $g \in K[X]$ such that $g(a) = 0$ is a multiple of $f$ (in particular, if $g \neq 0$, then $\deg(f) \le \deg(g)$).

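Example 1.9. Let $a \in \mathbb{Q}$. Its minimal polynomial over $\mathbb{Q}$ is $m_a(X) = X - a \in \mathbb{Q}[X]$.

Example 1.10. Let us consider quadratic fields.

(a) Consider $\sqrt{2} \in \mathbb{C}$. Its minimal polynomial over $\mathbb{Q}$ is $m_{\sqrt{2}}(X) = X^2 - 2 \in \mathbb{Q}[X]$.

(b) Consider $\frac{1+\sqrt{5}}{2} \in \mathbb{C}$. Its minimal polynomial over $\mathbb{Q}$ is $X^2 - X - 1 \in \mathbb{Q}[X]$.

(c) Consider $\frac{1+\sqrt{-5}}{2} \in \mathbb{C}$. Its minimal polynomial over $\mathbb{Q}$ is $X^2 - X + \frac{3}{2} \in \mathbb{Q}[X]$.

Example 1.11. Let $p$ be a prime number and $\zeta_p = \exp(2\pi i/p) \in \mathbb{C}$. Its minimal polynomial over $\mathbb{Q}$ is $X^{p-1} + X^{p-2} + \dots + X + 1 \in \mathbb{Q}[X]$. This is the $p$-th cyclotomic polynomial.

More generally, the minimal polynomial of $\zeta_n$ for $n \in \mathbb{Z}_{\ge 1}$ exists and can be written down rather explicitly. It is called the $n$-th cyclotomic polynomial.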
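To illustrate how explicit the cyclotomic polynomials are, here is a small Python sketch. It relies on the standard factorisation $X^n - 1 = \prod_{d \mid n} \Phi_d(X)$, which is not proved in these notes and is taken as an assumption here; the function names `divide_exact` and `cyclotomic` are hypothetical.

```python
from functools import lru_cache

def divide_exact(num, den):
    """Quotient of integer polynomials (coefficients listed from the
    constant term upwards); den must be monic and divide num."""
    num = list(num)
    quot = [0] * (len(num) - len(den) + 1)
    for i in reversed(range(len(quot))):
        q = num[i + len(den) - 1]          # current leading coefficient
        quot[i] = q
        for j, c in enumerate(den):        # subtract q * X^i * den
            num[i + j] -= q * c
    if any(num):
        raise ValueError("division is not exact")
    return quot

@lru_cache(maxsize=None)
def cyclotomic(n):
    """Coefficients of the n-th cyclotomic polynomial, constant term first."""
    poly = [-1] + [0] * (n - 1) + [1]      # X^n - 1
    for d in range(1, n):
        if n % d == 0:                     # divide out Phi_d for proper divisors d
            poly = divide_exact(poly, cyclotomic(d))
    return tuple(poly)

print(cyclotomic(5))   # coefficients of X^4 + X^3 + X^2 + X + 1
print(cyclotomic(8))   # coefficients of X^4 + 1
```

For a prime $p$ the only proper divisor is $1$, so the sketch reproduces the polynomial of Example 1.11.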

Definition 1.12. Let $K \subseteq F$ be a field extension. An element $a \in F$ is called algebraic over $K$ if it has a minimal polynomial in $K[X]$.

The field extension $K \subseteq F$ is called algebraic if every $a \in F$ is algebraic over $K$.

Example 1.13. In all preceding examples, we saw numbers that are algebraic over $\mathbb{Q}$.

The famous number $\pi$, defined as the ratio of the circumference of a circle over its diameter, is not algebraic over $\mathbb{Q}$ by a famous theorem of Lindemann. One says that it is a transcendental number (over $\mathbb{Q}$).

Proposition 1.14. Any finite field extension $K \subseteq F$ (in particular, any number field $F$) is algebraic, so every $a \in F$ has a minimal polynomial $m_a \in K[X]$.

Proof. The powers $1 = a^0, a = a^1, a^2, a^3, \dots$ must be $K$-linearly dependent because the dimension is finite. The minimal polynomial is the shortest non-zero equation of the form

\[ a^n + c_{n-1} a^{n-1} + \dots + c_1 a^1 + c_0 a^0 = 0 \]

with $c_i \in K$.

1.2 Integers

Example 1.15. The minimal polynomial of $a \in \mathbb{Q}$ over $\mathbb{Q}$ is $m_a(X) = X - a \in \mathbb{Q}[X]$. Note the following:

\[ a \in \mathbb{Z} \iff m_a(X) \in \mathbb{Z}[X]. \]

So, we have

\[ \mathbb{Z} = \{ a \in \mathbb{Q} \mid m_a(X) \in \mathbb{Z}[X] \}. \]

Definition 1.16. If $S$ is a ring and $R \subseteq S$ is a subring, then we also speak of a ring extension (in analogy to the terminology used for field extensions).

In the sequel, we shall often be concerned with a number field $F$ and consider the ring extension $\mathbb{Z} \subset F$.

Definition 1.17. Let $R \subseteq S$ be a ring extension. Then $a \in S$ is called integral over $R$ if there is a monic polynomial $f \in R[X]$ such that $f(a) = 0$.

We first clarify which algebraic elements are integral in the case of interest to us.

Lemma 1.18. Let $F$ be a number field. Then $a \in F$ is integral over $\mathbb{Z}$ if and only if its minimal polynomial $m_a$ has coefficients in $\mathbb{Z}$.

More generally, let $R$ be an integrally closed integral domain (see below; $\mathbb{Z}$ is an example) with field of fractions $K$. Let $K \subseteq F$ be a finite field extension. Then $a \in F$ is integral over $R$ if and only if its minimal polynomial $m_a(X)$ has coefficients in $R$.


Proof. We only prove the first statement. The general statement requires more technology and will be skipped.

One implication is clear (and works without any assumption on $R$ in the general case). For the other one, let $f \in \mathbb{Z}[X]$ be any monic polynomial such that $f(a) = 0$. Consider $f$ as an element of $\mathbb{Q}[X]$. As such, it is a multiple of the minimal polynomial $m_a(X) \in \mathbb{Q}[X]$, i.e.

\[ f(X) = m_a(X) \cdot h(X) \]

for some polynomial $h(X) \in \mathbb{Q}[X]$. Note that $h(X)$ is necessarily monic. Now, a theorem of Gauß tells us that if a monic polynomial with coefficients in $\mathbb{Z}$ factors into two monic polynomials, then both of them also have coefficients in $\mathbb{Z}$, proving $m_a(X) \in \mathbb{Z}[X]$.

We reconsider the same examples as above.

Example 1.19. Let $a \in \mathbb{Q}$. As its minimal polynomial over $\mathbb{Q}$ is $m_a(X) = X - a \in \mathbb{Q}[X]$, we have: $a \in \mathbb{Z}$ if and only if $a$ is integral over $\mathbb{Z}$.

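Example 1.20. Let us consider quadratic fields.

(a) Consider $\sqrt{2} \in \mathbb{C}$. As its minimal polynomial over $\mathbb{Q}$ is $m_{\sqrt{2}}(X) = X^2 - 2 \in \mathbb{Z}[X]$, it follows that $\sqrt{2}$ is integral over $\mathbb{Z}$.

(b) Consider $\frac{1+\sqrt{5}}{2} \in \mathbb{C}$. As its minimal polynomial over $\mathbb{Q}$ is $X^2 - X - 1 \in \mathbb{Z}[X]$, it follows that $\frac{1+\sqrt{5}}{2}$ is integral over $\mathbb{Z}$.

(c) Consider $\frac{1+\sqrt{-5}}{2} \in \mathbb{C}$. As its minimal polynomial over $\mathbb{Q}$ is $X^2 - X + \frac{3}{2} \notin \mathbb{Z}[X]$, it follows that $\frac{1+\sqrt{-5}}{2}$ is not integral over $\mathbb{Z}$.

Example 1.21. Let $p$ be a prime number and $\zeta_p = \exp(2\pi i/p) \in \mathbb{C}$. As its minimal polynomial over $\mathbb{Q}$ is $X^{p-1} + X^{p-2} + \dots + X + 1 \in \mathbb{Z}[X]$, it follows that $\zeta_p$ is integral over $\mathbb{Z}$. More generally, the same conclusion holds for all $\zeta_n$ for $n \in \mathbb{Z}_{\ge 1}$.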

1.3 Ring of integers and integral ring extensions

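Definition 1.22. For a number field $F$, define the ring of integers of $F$ as

\[ \mathbb{Z}_F := \{ a \in F \mid a \text{ is integral over } \mathbb{Z} \}. \]

We revisit the above examples once more.

Example 1.23. Ring of integers of $\mathbb{Q}$: As seen above, $\mathbb{Z}_{\mathbb{Q}} = \mathbb{Z}$, i.e. $\mathbb{Z}$ is the ring of integers of $\mathbb{Q}$.

Example 1.24. Let $d \neq 0, 1$ be a squarefree integer. The ring of integers of $\mathbb{Q}(\sqrt{d})$ is

(1) $\mathbb{Z}[\sqrt{d}]$, if $d \equiv 2, 3 \pmod 4$,

(2) $\mathbb{Z}[\frac{1+\sqrt{d}}{2}]$, if $d \equiv 1 \pmod 4$.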
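The dichotomy in Example 1.24 can be explored experimentally with Lemma 1.18. The following Python sketch assumes the elementary computation that $x = a + b\sqrt{d}$ with $b \neq 0$ has minimal polynomial $X^2 - 2aX + (a^2 - db^2)$ over $\mathbb{Q}$; the function names `min_poly_coeffs` and `is_integral` are hypothetical.

```python
from fractions import Fraction

def min_poly_coeffs(a, b, d):
    """For x = a + b*sqrt(d) with b != 0, return (t, n) such that the minimal
    polynomial of x over Q is X^2 - t*X + n, i.e. t = 2a and n = a^2 - d*b^2."""
    a, b = Fraction(a), Fraction(b)
    return (2 * a, a * a - d * b * b)

def is_integral(a, b, d):
    """Integrality over Z via Lemma 1.18: the minimal polynomial must lie in Z[X]."""
    a, b = Fraction(a), Fraction(b)
    if b == 0:
        return a.denominator == 1          # minimal polynomial is X - a
    t, n = min_poly_coeffs(a, b, d)
    return t.denominator == 1 and n.denominator == 1

half = Fraction(1, 2)
print(is_integral(half, half, 5))    # (1 + sqrt(5))/2,  d = 5 is 1 mod 4
print(is_integral(half, half, -5))   # (1 + sqrt(-5))/2, d = -5 is 3 mod 4
```

The two calls correspond to parts (b) and (c) of Example 1.20.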

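Example 1.25. Let $n \in \mathbb{Z}_{\ge 1}$ and $\zeta_n = \exp(2\pi i/n) \in \mathbb{C}$. The ring of integers of the $n$-th cyclotomic field $\mathbb{Q}(\zeta_n)$ is

\[ \mathbb{Z}_{\mathbb{Q}(\zeta_n)} = \mathbb{Z}[\zeta_n] = \Big\{ \sum_{j=0}^{n-1} a_j \zeta_n^j \in \mathbb{C} \;\Big|\; a_0, \dots, a_{n-1} \in \mathbb{Z} \Big\}. \]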


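Attention: the powers $\zeta_n^j$ for $0 \le j \le n-1$ never form a basis when $n \ge 2$ (there is linear dependence), for prime $n$ as well as for composite $n$: if $n = p$ is prime, they sum to zero. For instance, for $n = 4$, we have $\zeta_4 = i$, and $1, i, i^2 = -1, i^3 = -i$ are not $\mathbb{Q}$-linearly independent.

Exercise: Work out a basis for $n = 2^m$.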

Let us add an abstract definition, which we will mostly (but not exclusively) use with $R = \mathbb{Z}$ and $S = \mathbb{Z}_K$ for a number field $K$.

Definition 1.26. Let $S$ be a ring and $R \subseteq S$ a subring.

(a) The set $R_S = \{ a \in S \mid a \text{ is integral over } R \}$ is called the integral closure of $R$ in $S$ (compare with the algebraic closure of $R$ in $S$; the two notions coincide if $R$ is a field).

An alternative name is: normalisation of $R$ in $S$.

(b) $S$ is called an integral ring extension of $R$ if $R_S = S$, i.e. if every element of $S$ is integral over $R$ (compare with algebraic field extension; the two notions coincide if $R$ and $S$ are fields).

(c) $R$ is called integrally closed in $S$ if $R_S = R$.

(d) An integral domain $R$ is called integrally closed (i.e. without mentioning the ring in which the closure is taken) if $R$ is integrally closed in its fraction field.

(e) Let $a_i \in S$ for $i \in I$ (some indexing set). We let $R[a_i \mid i \in I]$ (note the square brackets!) be the smallest subring of $S$ containing $R$ and all the $a_i$, $i \in I$.

Note that we can see $R[a]$ inside $S$ as the image of the ring homomorphism

\[ \Phi_a : R[X] \to S, \quad \sum_{i=0}^{d} c_i X^i \mapsto \sum_{i=0}^{d} c_i a^i. \]

Remark 1.27. The ring of integers of $K$ is the integral closure of $\mathbb{Z}$ in $K$; this explains the piece of notation $\mathbb{Z}_K$. An alternative notation that one often encounters in mathematical texts is $\mathcal{O}_K$.

Example 1.28. Every UFD (unique factorisation domain) is integrally closed. In particular, $\mathbb{Z}$ and polynomial rings $F[X_1, \dots, X_n]$ over a field $F$ are integrally closed.

In the next statements, we will speak of $R$-modules for a ring $R$. An $R$-module is nothing else than a vector space (exactly the same definition), except that we allow the coefficients to be in a ring, whereas the notion of vector space is restricted to coefficients in fields.

Proposition 1.29. Let $R \subseteq S \subseteq T$ be rings.

(a) For $a \in S$, the following statements are equivalent:

(i) $a$ is integral over $R$.

(ii) $R[a] \subseteq S$ is a finitely generated $R$-module.

(b) Let $a_1, \dots, a_n \in S$ be elements that are integral over $R$. Then $R[a_1, \dots, a_n] \subseteq S$ is integral over $R$ and it is finitely generated as an $R$-module.

(c) Let $R \subseteq S \subseteq T$ be rings. Then ‘transitivity of integrality’ holds:

\[ R \subseteq T \text{ is integral} \iff S \subseteq T \text{ is integral and } R \subseteq S \text{ is integral.} \]

(d) $R_S$ is a subring of $S$.

(e) Any $t \in S$ that is integral over $R_S$ lies in $R_S$. In other words, $R_S$ is integrally closed in $S$ (justifying the name).

Proposition 1.30. Let $R$ be an integral domain, $K = \mathrm{Frac}(R)$, $K \subseteq F$ a finite field extension and $S := R_F$ the integral closure of $R$ in $F$. Then the following statements hold:

(a) Every $a \in F$ can be written as $a = \frac{s}{r}$ with $s \in S$ and $0 \neq r \in R$.

(b) $F = \mathrm{Frac}(S)$ and $S$ is integrally closed.

(c) If $R$ is integrally closed, then $S \cap K = R$.

1.4 Trace and norm in field extensions

We now systematically do linear algebra in field extensions. Let $K \subseteq F$ be a finite field extension of degree $[F : K] = n$. Let $a \in F$. Note that multiplication by $a$:

\[ T_a : F \to F, \quad x \mapsto ax \]

is $F$-linear and, thus, in particular, $K$-linear. Once we choose a $K$-basis of $F$, we can represent $T_a$ by an $n \times n$-matrix with coefficients in $K$, also denoted $T_a$.

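Example 1.31. The complex numbers $\mathbb{C}$ have the $\mathbb{R}$-basis $\{1, i\}$ and with respect to this basis, any $z \in \mathbb{C}$ is represented as $\binom{x}{y} = x + yi$. Now, take $a = \binom{b}{c} = b + ci \in \mathbb{C}$. We obtain $T_a = \begin{pmatrix} b & -c \\ c & b \end{pmatrix}$, as we can easily check:

\[ T_a(z) = az = (b + ci)(x + yi) = (bx - cy) + (cx + by)i \quad\text{and}\quad T_a(z) = \begin{pmatrix} b & -c \\ c & b \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} bx - cy \\ cx + by \end{pmatrix}. \]

As an aside: You may have seen this matrix before; namely, writing $a = r(\cos(\varphi) + i\sin(\varphi))$, it looks like $r \begin{pmatrix} \cos(\varphi) & -\sin(\varphi) \\ \sin(\varphi) & \cos(\varphi) \end{pmatrix}$, i.e. it is a rotation matrix times a homothety (stretching) factor.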

We can now do linear algebra with the matrix $T_a \in \mathrm{Mat}_n(K)$.

Recall that the trace of a matrix $M = (m_{i,j})_{1 \le i,j \le n} \in \mathrm{Mat}_n(K)$ is defined as

\[ \mathrm{Tr}(M) = \sum_{i=1}^{n} m_{i,i}, \]

the sum of its diagonal entries.

Definition 1.32. Let $K \subseteq F$ be a field extension of degree $[F : K] = n$. Let $a \in F$. The trace of $a$ in $K \subseteq F$ is defined as the trace of the matrix $T_a \in \mathrm{Mat}_n(K)$ and the norm of $a$ in $K \subseteq F$ is defined as the determinant of the matrix $T_a \in \mathrm{Mat}_n(K)$:

\[ \mathrm{Tr}_{F/K}(a) := \mathrm{Tr}(T_a) \quad\text{and}\quad \mathrm{Norm}_{F/K}(a) := \det(T_a). \]

Note that trace and norm do not depend on the choice of basis by a standard result from linear algebra.

Example 1.33. Let us continue the example above. Let $z = x + yi \in \mathbb{C}$. Then $\mathrm{Tr}_{\mathbb{C}/\mathbb{R}}(z) = 2x = 2\,\mathrm{Re}(z)$ and $\mathrm{Norm}_{\mathbb{C}/\mathbb{R}}(z) = x^2 + y^2 = |z|^2$.
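The same computation can be carried out in any quadratic field. The sketch below is an illustration under the assumption that we work in $\mathbb{Q}(\sqrt{d})$ over $\mathbb{Q}$ with the basis $(1, \sqrt{d})$, rather than in $\mathbb{C}$ over $\mathbb{R}$; for $d = -1$ the matrix has the same shape as in Example 1.31. The function names are hypothetical.

```python
from fractions import Fraction

def mult_matrix(a, b, d):
    """Matrix of x -> (a + b*sqrt(d)) * x on Q(sqrt(d)) in the basis (1, sqrt(d)).
    The columns are the coordinates of the images of the basis vectors:
      (a + b*sqrt(d)) * 1       = a   + b*sqrt(d)
      (a + b*sqrt(d)) * sqrt(d) = b*d + a*sqrt(d)
    """
    a, b = Fraction(a), Fraction(b)
    return [[a, b * d],
            [b, a]]

def trace(m):
    return m[0][0] + m[1][1]

def det(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

T = mult_matrix(3, 4, -1)      # the element 3 + 4i of Q(i)
print(T)                       # same shape as the matrix T_a of Example 1.31
print(trace(T), det(T))        # trace 2*3 and norm 3^2 + 4^2
```

In general the sketch gives trace $2a$ and norm $a^2 - db^2$ for $a + b\sqrt{d}$, which for $d = -1$ agrees with the formulas of Example 1.33.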

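Lemma 1.34. Let $K \subseteq F$ be a finite field extension.

(a) $\mathrm{Tr}_{F/K}$ defines a group homomorphism $(F, +) \to (K, +)$, i.e.

\[ \mathrm{Tr}_{F/K}(a + b) = \mathrm{Tr}_{F/K}(a) + \mathrm{Tr}_{F/K}(b) \quad\text{for all } a, b \in F. \]

(b) $\mathrm{Norm}_{F/K}$ defines a group homomorphism $(F^\times, \cdot) \to (K^\times, \cdot)$, i.e.

\[ \mathrm{Norm}_{F/K}(a \cdot b) = \mathrm{Norm}_{F/K}(a) \cdot \mathrm{Norm}_{F/K}(b) \quad\text{for all } a, b \in F. \]

Proof. (a) The trace of a matrix is additive and $T_{a+b} = T_a + T_b$ because $T_{a+b}(x) = (a + b)x = ax + bx = T_a(x) + T_b(x)$ for all $x \in F$.

(b) The determinant of a matrix is multiplicative and $T_{a \cdot b} = T_a \circ T_b$ because $T_{a \cdot b}(x) = abx = T_a(T_b(x))$ for all $x \in F$.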

Lemma 1.35. Let $K \subseteq F$ be a finite field extension of degree $[F : K] = n$. Let $a \in F$.

(a) Let $f_a = X^n + b_{n-1}X^{n-1} + \dots + b_1 X + b_0 \in K[X]$ be the characteristic polynomial of $T_a \in \mathrm{Mat}_n(K)$. Then $\mathrm{Tr}_{F/K}(a) = -b_{n-1}$ and $\mathrm{Norm}_{F/K}(a) = (-1)^n b_0$.

(b) Let $m_a = X^d + c_{d-1}X^{d-1} + \dots + c_1 X + c_0 \in K[X]$ be the minimal polynomial of $a$ over $K$. Then $\{1, a, a^2, \dots, a^{d-1}\}$ forms a $K$-basis of $K(a)$ and $[K(a) : K] = d$. Moreover, the matrix $T_a'$ representing the map $K(a) \xrightarrow{x \mapsto ax} K(a)$ with respect to this $K$-basis equals

\[ T_a' = \begin{pmatrix} 0 & 0 & \cdots & 0 & -c_0 \\ 1 & 0 & \cdots & 0 & -c_1 \\ 0 & 1 & \cdots & 0 & -c_2 \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & \cdots & 1 & -c_{d-1} \end{pmatrix}. \]

(c) With $d = [K(a) : K]$ and $e = [F : K(a)]$ one has $m_a(X)^e = f_a(X)$.

Proof. Exercise.
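As a sanity check (not a proof), parts (a) and (b) can be tested on examples in the case $F = K(a)$, where the characteristic polynomial equals the minimal polynomial by part (c). The following Python sketch builds the matrix $T_a'$ from the coefficients $c_0, \dots, c_{d-1}$ and compares trace and determinant with $-c_{d-1}$ and $(-1)^d c_0$. The helper names `companion`, `det` and `trace` are hypothetical.

```python
from fractions import Fraction

def companion(c):
    """Matrix T_a' of Lemma 1.35 (b) for X^d + c[d-1] X^(d-1) + ... + c[0]:
    ones below the diagonal, last column (-c[0], ..., -c[d-1])."""
    d = len(c)
    m = [[Fraction(0)] * d for _ in range(d)]
    for i in range(1, d):
        m[i][i - 1] = Fraction(1)
    for i in range(d):
        m[i][d - 1] = -Fraction(c[i])
    return m

def det(m):
    """Determinant by Laplace expansion along the first row (fine for small sizes)."""
    if len(m) == 1:
        return m[0][0]
    total = Fraction(0)
    for j, entry in enumerate(m[0]):
        if entry != 0:
            minor = [row[:j] + row[j + 1:] for row in m[1:]]
            total += (-1) ** j * entry * det(minor)
    return total

def trace(m):
    return sum(m[i][i] for i in range(len(m)))

c = [-2, 0, 0]                        # X^3 - 2, minimal polynomial of the cube root of 2
T = companion(c)
print(trace(T) == -c[-1])             # Tr = -c_{d-1}
print(det(T) == (-1) ** len(c) * c[0])  # Norm = (-1)^d c_0
```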

We need some important pieces of terminology from field theory.

Definition 1.36. Let $K \subseteq F$ and $K \subseteq L$ be field extensions. Denote

\[ \mathrm{Hom}^{\mathrm{field}}_K(F, L) = \{ \varphi : F \to L \mid \varphi \text{ field homomorphism s.t. } \forall x \in K : \varphi(x) = x \}, \]

the set of field homomorphisms from $F$ to $L$ that are the identity on $K$.

Let $\overline{K}$ be an algebraic closure of $K$ (i.e. an algebraic extension $K \subseteq \overline{K}$ such that every non-constant polynomial in $\overline{K}[X]$ has a zero (and hence all zeros) in $\overline{K}$).

We say that a finite field extension $K \subseteq F$ is separable if $[F : K] = \#\mathrm{Hom}^{\mathrm{field}}_K(F, \overline{K})$.

Example 1.37. If $K$ is a field of characteristic $0$, i.e. a field that contains (a subfield isomorphic to) $\mathbb{Q}$, then any finite field extension $K \subseteq F$ is separable. If $K$ is a finite field, the same conclusion holds.

However, if $K$ is an infinite field of characteristic $p > 0$, then there are non-separable extensions. One encounters this when working with elliptic curves over finite fields.


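Proposition 1.38. Let $K \subseteq F$ be a finite separable field extension, $\overline{K}$ an algebraic closure of $K$ containing $F$. Let, furthermore, $a \in F$ and $f_a$ be the characteristic polynomial of $T_a$. Then the following statements hold:

(a) $f_a(X) = \prod_{\sigma \in \mathrm{Hom}^{\mathrm{field}}_K(F, \overline{K})} (X - \sigma(a))$,

(b) $\mathrm{Tr}_{F/K}(a) = \sum_{\sigma \in \mathrm{Hom}^{\mathrm{field}}_K(F, \overline{K})} \sigma(a)$, and

(c) $\mathrm{Norm}_{F/K}(a) = \prod_{\sigma \in \mathrm{Hom}^{\mathrm{field}}_K(F, \overline{K})} \sigma(a)$.

Corollary 1.39. Let $K \subseteq F \subseteq L$ be finite separable field extensions. Then

\[ \mathrm{Tr}_{L/K} = \mathrm{Tr}_{F/K} \circ \mathrm{Tr}_{L/F} \quad\text{and}\quad \mathrm{Norm}_{L/K} = \mathrm{Norm}_{F/K} \circ \mathrm{Norm}_{L/F}. \]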

We now apply norm and trace to integral elements. We again state the result in general, but one can think of $R = \mathbb{Z}$, $F$ a number field and $S = \mathbb{Z}_F$ its ring of integers.

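Lemma 1.40. Let $R$ be an integrally closed integral domain, $K$ its field of fractions, $K \subseteq F$ a separable finite field extension and $S$ the integral closure of $R$ in $F$. Let $s \in S$. Then the following statements hold:

(a) $\mathrm{Tr}_{F/K}(s) \in R$ and $\mathrm{Norm}_{F/K}(s) \in R$.

(b) $s \in S^\times \iff \mathrm{Norm}_{F/K}(s) \in R^\times$.

Proof. (a) directly follows from $S \cap K = R$.

(b) ‘$\Rightarrow$’: Let $s, t \in S^\times$ such that $ts = 1$. Then

\[ 1 = \mathrm{Norm}_{F/K}(1) = \mathrm{Norm}_{F/K}(st) = \mathrm{Norm}_{F/K}(s)\,\mathrm{Norm}_{F/K}(t), \]

exhibiting an inverse of $\mathrm{Norm}_{F/K}(s)$ in $R$.

‘$\Leftarrow$’: Assume $\mathrm{Norm}_{F/K}(s) \in R^\times$ and let $r \in R$ be its inverse. Then

\[ 1 = r\,\mathrm{Norm}_{F/K}(s) = r \prod_{\sigma \in \mathrm{Hom}_K(F, \overline{K})} \sigma(s) = \Big( r \prod_{\mathrm{id} \neq \sigma \in \mathrm{Hom}_K(F, \overline{K})} \sigma(s) \Big) s = ts, \]

exhibiting an inverse to $s$ in $S$.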

1.5 Discriminant

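Definition 1.41. Let $K \subseteq F$ be a finite separable field extension of degree $n = [F : K]$. Let $\overline{K}$ be an algebraic closure of $K$. Further, let $\mathrm{Hom}^{\mathrm{field}}_K(F, \overline{K}) = \{\sigma_1, \dots, \sigma_n\}$ and let $\alpha_1, \dots, \alpha_n \in F$ be a $K$-basis of $F$. Form the matrix

\[ D(\alpha_1, \dots, \alpha_n) := (\sigma_i(\alpha_j))_{1 \le i,j \le n} = \begin{pmatrix} \sigma_1(\alpha_1) & \sigma_1(\alpha_2) & \cdots & \sigma_1(\alpha_n) \\ \sigma_2(\alpha_1) & \sigma_2(\alpha_2) & \cdots & \sigma_2(\alpha_n) \\ \vdots & \vdots & \ddots & \vdots \\ \sigma_n(\alpha_1) & \sigma_n(\alpha_2) & \cdots & \sigma_n(\alpha_n) \end{pmatrix}. \]

The discriminant of $(\alpha_1, \dots, \alpha_n)$ is defined as

\[ \mathrm{disc}(\alpha_1, \dots, \alpha_n) := \big( \det D(\alpha_1, \dots, \alpha_n) \big)^2. \]

The trace pairing on $K \subseteq F$ is the bilinear pairing

\[ F \times F \to K, \quad (x, y) \mapsto \mathrm{Tr}_{F/K}(xy). \]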


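Example 1.42. (a) Let $0, 1 \neq d \in \mathbb{Z}$ be a squarefree integer and consider $K = \mathbb{Q}(\sqrt{d})$. Computations (exercise) show:

\[ \mathrm{disc}(1, \sqrt{d}) = 4d \quad\text{and}\quad \mathrm{disc}\Big(1, \frac{1+\sqrt{d}}{2}\Big) = d. \]

(b) Let $f(X) = X^3 + aX + b \in \mathbb{Z}[X]$ be an irreducible polynomial and consider $K = \mathbb{Q}[X]/(f)$. Let $\alpha \in \mathbb{C}$ be any root of $f$, so that we can identify $K = \mathbb{Q}(\alpha)$ and $1, \alpha, \alpha^2$ is a $\mathbb{Q}$-basis of $K$.

Computations (exercise) show $\mathrm{disc}(1, \alpha, \alpha^2) = -4a^3 - 27b^2$.

Remark 1.43. The discriminant of a polynomial $f$ can also be computed as the resultant of $f$ and its formal derivative $f'$.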

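Proposition 1.44. Let $K \subseteq F$ be a finite separable field extension of degree $n = [F : K]$. Let $\alpha_1, \dots, \alpha_n$ be a $K$-basis of $F$. Then the following statements hold:

(a) Let $D := D(\alpha_1, \dots, \alpha_n)$. Then $D^{\mathrm{tr}} D$ is the Gram matrix of the trace pairing with respect to the chosen $K$-basis. That is,

\[ D^{\mathrm{tr}} D = \big( \mathrm{Tr}_{F/K}(\alpha_i \alpha_j) \big)_{1 \le i,j \le n}. \]

(b) We have

\[ \mathrm{disc}(\alpha_1, \dots, \alpha_n) = \det(D)^2 = \det(D^{\mathrm{tr}} D) = \det \big( \mathrm{Tr}_{F/K}(\alpha_i \alpha_j) \big)_{1 \le i,j \le n}. \]

(c) Let $C = (c_{i,j})_{1 \le i,j \le n}$ be an $n \times n$-matrix with coefficients in $K$ with $\det C \neq 0$ and put $\beta_i := \sum_{j=1}^{n} c_{i,j} \alpha_j$ for $i = 1, \dots, n$ (i.e. the $\beta_i$ are obtained by applying $C$ to the column vector of the $\alpha_j$). Then

\[ \mathrm{disc}(\beta_1, \dots, \beta_n) = \det(C)^2 \, \mathrm{disc}(\alpha_1, \dots, \alpha_n). \]

(d) If $F = K(a)$, then

\[ \mathrm{disc}(1, a, \dots, a^{n-1}) = \prod_{1 \le i < j \le n} \big( \sigma_j(a) - \sigma_i(a) \big)^2, \]

where $\sigma_1, \dots, \sigma_n$ are the $K$-field homomorphisms $F \to \overline{K}$.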

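Proof. (a) Let $\sigma_1, \dots, \sigma_n$ be the $K$-field homomorphisms $F \to \overline{K}$. Then we have

\[ D^{\mathrm{tr}} D = \begin{pmatrix} \sigma_1(\alpha_1) & \sigma_2(\alpha_1) & \cdots & \sigma_n(\alpha_1) \\ \sigma_1(\alpha_2) & \sigma_2(\alpha_2) & \cdots & \sigma_n(\alpha_2) \\ \vdots & \vdots & \ddots & \vdots \\ \sigma_1(\alpha_n) & \sigma_2(\alpha_n) & \cdots & \sigma_n(\alpha_n) \end{pmatrix} \begin{pmatrix} \sigma_1(\alpha_1) & \sigma_1(\alpha_2) & \cdots & \sigma_1(\alpha_n) \\ \sigma_2(\alpha_1) & \sigma_2(\alpha_2) & \cdots & \sigma_2(\alpha_n) \\ \vdots & \vdots & \ddots & \vdots \\ \sigma_n(\alpha_1) & \sigma_n(\alpha_2) & \cdots & \sigma_n(\alpha_n) \end{pmatrix} \]

\[ = \begin{pmatrix} \sum_{k=1}^{n} \sigma_k(\alpha_1\alpha_1) & \sum_{k=1}^{n} \sigma_k(\alpha_1\alpha_2) & \cdots & \sum_{k=1}^{n} \sigma_k(\alpha_1\alpha_n) \\ \sum_{k=1}^{n} \sigma_k(\alpha_2\alpha_1) & \sum_{k=1}^{n} \sigma_k(\alpha_2\alpha_2) & \cdots & \sum_{k=1}^{n} \sigma_k(\alpha_2\alpha_n) \\ \vdots & \vdots & \ddots & \vdots \\ \sum_{k=1}^{n} \sigma_k(\alpha_n\alpha_1) & \sum_{k=1}^{n} \sigma_k(\alpha_n\alpha_2) & \cdots & \sum_{k=1}^{n} \sigma_k(\alpha_n\alpha_n) \end{pmatrix} \]

\[ = \begin{pmatrix} \mathrm{Tr}_{F/K}(\alpha_1\alpha_1) & \mathrm{Tr}_{F/K}(\alpha_1\alpha_2) & \cdots & \mathrm{Tr}_{F/K}(\alpha_1\alpha_n) \\ \mathrm{Tr}_{F/K}(\alpha_2\alpha_1) & \mathrm{Tr}_{F/K}(\alpha_2\alpha_2) & \cdots & \mathrm{Tr}_{F/K}(\alpha_2\alpha_n) \\ \vdots & \vdots & \ddots & \vdots \\ \mathrm{Tr}_{F/K}(\alpha_n\alpha_1) & \mathrm{Tr}_{F/K}(\alpha_n\alpha_2) & \cdots & \mathrm{Tr}_{F/K}(\alpha_n\alpha_n) \end{pmatrix}. \]

So, the $(i, j)$-entry of the matrix $D^{\mathrm{tr}} D$ equals $\mathrm{Tr}_{F/K}(\alpha_i\alpha_j)$. Hence, $D^{\mathrm{tr}} D$ is the Gram matrix of the trace pairing with respect to the chosen $K$-basis.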


(b) is clear.

(c) Exercise.

(d) Exercise.
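Proposition 1.44 (b) gives a practical way to compute discriminants without knowing the embeddings $\sigma_i$: only traces are needed. The following Python sketch is a check on examples, not a solution of the exercises. It treats a power basis $1, a, \dots, a^{n-1}$ and uses that $T_{a^k} = T_a^k$, with $T_a$ the matrix from Lemma 1.35 (b). All function names are hypothetical.

```python
from fractions import Fraction

def companion(c):
    """Matrix of multiplication by a in the basis 1, a, ..., a^(n-1), where a is
    a root of X^n + c[n-1] X^(n-1) + ... + c[0] (see Lemma 1.35 (b))."""
    n = len(c)
    m = [[Fraction(0)] * n for _ in range(n)]
    for i in range(1, n):
        m[i][i - 1] = Fraction(1)
    for i in range(n):
        m[i][n - 1] = -Fraction(c[i])
    return m

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = Fraction(0)
    for j, entry in enumerate(m[0]):
        if entry != 0:
            minor = [row[:j] + row[j + 1:] for row in m[1:]]
            total += (-1) ** j * entry * det(minor)
    return total

def disc_power_basis(c):
    """disc(1, a, ..., a^(n-1)) as the determinant of the Gram matrix
    (Tr(a^(i+j))) of the trace pairing, 0 <= i, j <= n-1."""
    n = len(c)
    T = companion(c)
    power = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]  # T^0
    traces = []
    for _ in range(2 * n - 1):            # exponents 0, ..., 2n-2
        traces.append(sum(power[i][i] for i in range(n)))
        power = matmul(power, T)
    gram = [[traces[i + j] for j in range(n)] for i in range(n)]
    return det(gram)

print(disc_power_basis([-5, 0]))      # X^2 - 5: disc(1, sqrt(5))
print(disc_power_basis([1, 1, 0]))    # X^3 + X + 1: disc(1, a, a^2)
```

The results can be compared with the formulas $4d$ and $-4a^3 - 27b^2$ of Example 1.42.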

Proposition 1.45. Assume the setting of the previous proposition. The discriminant $\mathrm{disc}(\alpha_1, \dots, \alpha_n)$ is non-zero and the trace pairing on $K \subseteq F$ is non-degenerate.

1.6 Integral bases, orders and lattices in number fields

To proceed, we need to introduce some more terminology on modules (‘vector spaces over rings’). It is again the same as for vector spaces.

Definition 1.46. Let $R$ be a ring and let $M$ be an $R$-module.

Let $m_i \in M$ with indices $i \in I$ (some set) be a collection of elements.

The collection $(m_i)_{i \in I}$ is called a generating set of $M$ if for every $m \in M$ there is a finite subset of indices $J \subseteq I$ and for every $j \in J$ there is $r_j \in R$ such that $m = \sum_{j \in J} r_j m_j$.

If $M$ possesses a generating set that is finite, then $M$ is called finitely generated as $R$-module.

The collection $(m_i)_{i \in I}$ is called $R$-linearly independent or $R$-free if for all finite subsets of indices $J \subseteq I$ the only linear combination equal to zero,

\[ 0 = \sum_{j \in J} r_j m_j \]

with $r_j \in R$ for $j \in J$, is the one where $r_j = 0$ for all $j \in J$.

An $R$-free generating set of $M$ is called an $R$-basis of $M$. An $R$-module $M$ that possesses an $R$-basis is called $R$-free.

The number of elements in an (and any) $R$-basis of $M$ is called the $R$-rank of $M$, denoted $\mathrm{rk}_R(M)$.

Remark 1.47. If $R = K$ is a field, then any $R$-module is a $K$-vector space and by results from linear algebra it is automatically free and possesses a $K$-basis. Moreover, in that case, dimension and rank are the same.

However, if $R$ is not a field, then there are in general non-free modules, which also do not have any basis (but they all have generating sets).

We include the structure theorem of finitely generated modules over PIDs (principal ideal domains). The main example is $R = \mathbb{Z}$ or a polynomial ring in one variable over a field. The proof is not very difficult, but not included here.

Theorem 1.48. Let $R$ be a principal ideal domain and $M$ a finitely generated $R$-module. Then the following statements hold:

(a) Assume that $M$ is a free $R$-module of rank $m$. Then any submodule $N$ of $M$ is finitely generated and free of rank $\le m$.

(b) An element $m \in M$ is called a torsion element if there is $0 \neq r \in R$ such that $rm = 0$. The set $M_{\mathrm{torsion}} = \{ m \in M \mid m \text{ is a torsion element} \}$ is an $R$-submodule of $M$.

(c) $M$ is a free $R$-module $\iff M_{\mathrm{torsion}} = \{0\}$.

(d) There is an integer $m$ such that

\[ M \cong M_{\mathrm{torsion}} \oplus \underbrace{R \oplus \dots \oplus R}_{m \text{ times}}. \]

We need one more piece of terminology for modules.

Definition 1.49. An $R$-module $M$ is called Noetherian if all its submodules are finitely generated. A ring $R$ is Noetherian if it is Noetherian as $R$-module; this is equivalent to asking that all ideals of $R$ are finitely generated.

Example 1.50. Every principal ideal domain (PID) is Noetherian, so, in particular, $\mathbb{Z}$ is Noetherian.

Moreover, the polynomial ring in $n$ variables over a Noetherian ring is Noetherian (this is called Hilbert’s Basissatz).

Definition 1.51. Let $R \subseteq S$ be an integral ring extension. If $S$ is free as an $R$-module, then, by definition, an $R$-basis of $S$ (i.e. a free generating system) exists and is called an integral basis of $S$ over $R$.

We point out that, if $S$ is an integral domain (as it always will be in this lecture), then an $R$-basis of $S$ is also a $K$-basis of $F = \mathrm{Frac}(S)$ with $K = \mathrm{Frac}(R)$.

Note that, in general, there is no reason why an integral ring extension $S$ should be free as an $R$-module.

This is, however, the case for the rings of integers, as the following proposition shows. We first need a lemma.

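Lemma 1.52. Let $R$ be an integrally closed integral domain, $K$ its field of fractions, $K \subseteq F$ a separable finite field extension and $S$ the integral closure of $R$ in $F$.

(a) For any $K$-basis $\alpha_1, \dots, \alpha_n$ of $F$, there is an element $r \in R \setminus \{0\}$ such that $r\alpha_i \in S$ for all $i = 1, \dots, n$.

(b) Let $\alpha_1, \dots, \alpha_n \in S$ be a $K$-basis of $F$ and let $d = \mathrm{disc}(\alpha_1, \dots, \alpha_n)$ be the discriminant of this basis. Then $dS \subseteq R\alpha_1 + \dots + R\alpha_n$.

Proof. (a) By Proposition 1.30 (a), we can write $\alpha_i = \frac{s_i}{r_i}$ with $r_i \in R$ and $s_i \in S$ for all $i = 1, \dots, n$. Hence, we may take $r = r_1 \cdot \ldots \cdot r_n$.

(b) Let $s = \sum_{j=1}^{n} x_j \alpha_j$ be an element of $S$ with $x_j \in K$ for $j = 1, \dots, n$. We show $ds \in R\alpha_1 + \dots + R\alpha_n$. Note that the elementary properties of the trace yield

\[ \mathrm{Tr}_{F/K}(\alpha_i s) = \sum_{j=1}^{n} \mathrm{Tr}_{F/K}(\alpha_i \alpha_j) x_j \in R. \]

We can rewrite this in matrix form using

\[ M = D^{\mathrm{tr}} D = \begin{pmatrix} \mathrm{Tr}_{F/K}(\alpha_1\alpha_1) & \cdots & \mathrm{Tr}_{F/K}(\alpha_1\alpha_n) \\ \vdots & \ddots & \vdots \\ \mathrm{Tr}_{F/K}(\alpha_n\alpha_1) & \cdots & \mathrm{Tr}_{F/K}(\alpha_n\alpha_n) \end{pmatrix}. \]

Now:

\[ M \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix} = \begin{pmatrix} \sum_{j=1}^{n} \mathrm{Tr}_{F/K}(\alpha_1\alpha_j) x_j \\ \vdots \\ \sum_{j=1}^{n} \mathrm{Tr}_{F/K}(\alpha_n\alpha_j) x_j \end{pmatrix} \in R^n. \]

Multiplying through with the adjoint matrix $M^{\#}$ yields

\[ M^{\#} M \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix} = \det(M) \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix} = d \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix} \in R^n. \]

Thus, $dx_i \in R$ for all $i = 1, \dots, n$ and, consequently, $ds \in R\alpha_1 + \dots + R\alpha_n$.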


Proposition 1.53. Let $R$ be a principal ideal domain, $K$ its field of fractions, $K \subseteq F$ a finite separable field extension and $S$ the integral closure of $R$ in $F$.

(For instance, we can take $R = \mathbb{Z}$, $K = \mathbb{Q}$, $F$ a number field and $S = \mathbb{Z}_F$ its ring of integers.)

Then every finitely generated $S$-submodule $0 \neq M \subseteq F$ is a free $R$-module of rank $[F : K]$. In particular, $S$ possesses an integral basis over $R$. Moreover, $S$ is a Noetherian ring.

Proof. As principal ideal domains are unique factorisation domains and, hence, integrally closed, we may apply Lemma 1.52 to obtain a $K$-basis $\alpha_1, \dots, \alpha_n \in S$ of $F$ and we also have $dS \subseteq R\alpha_1 + \dots + R\alpha_n =: N$ with $d = \mathrm{disc}(\alpha_1, \dots, \alpha_n)$. Note that $N$ is a free $R$-module of rank $n = [F : K]$.

Let $m_1, \dots, m_k \in M$ be a generating system of $M \subseteq F$ as $S$-module. As the $m_i$ are elements of $F$, by Proposition 1.30 (a) there is $r \in R$ such that $rm_i \in S$ for all $i = 1, \dots, k$, whence $rM \subseteq S$. Hence, $rdM \subseteq dS \subseteq N$. Consequently, Theorem 1.48 yields that $rdM$ is a free $R$-module of rank at most $n$.

Of course, the $R$-rank of $rdM$ is equal to the $R$-rank of $M$. Let $0 \neq m \in M$. Then $Nm \le Sm \le M$, showing that $n$, the $R$-rank of $N$ (which is equal to the $R$-rank of $Nm$), is at most the $R$-rank of $M$. Since finite direct sums of Noetherian modules are Noetherian, it follows that $S$ is Noetherian.

For the rest of this section we specialise to the case of number fields.

Definition 1.54. Let $F$ be a number field. A subring $\mathcal{O}$ of $\mathbb{Z}_F$ is called an order of $F$ if $\mathcal{O}$ has an integral basis of length $[F : \mathbb{Q}]$. Equivalently, the index $(\mathbb{Z}_F : \mathcal{O})$ as abelian groups is finite.

Corollary 1.55. Let $F$ be a number field and $\mathbb{Z}_F$ the ring of integers of $F$. Then the following statements hold:

(a) $\mathbb{Z}_F$ is an order of $F$, also called the maximal order of $F$.

(b) Let $\mathcal{O}$ be an order of $F$ and $0 \subsetneq I \trianglelefteq \mathcal{O}$ be an ideal. Then $I$ is a free $\mathbb{Z}$-module of rank $[F : \mathbb{Q}]$ and the quotient $\mathcal{O}/I$ is finite (i.e. has finitely many elements; equivalently, the index $(\mathcal{O} : I)$ is finite).

Proof. (a) It is a trivial consequence of Proposition 1.53 that $\mathbb{Z}_F$ is a free $\mathbb{Z}$-module of rank $[F : \mathbb{Q}]$ because $\mathbb{Z}_F$ is a $\mathbb{Z}_F$-module generated by a single element, namely $1$. In particular, $\mathbb{Z}_F$ has an integral basis and, hence, is an order of $F$.

(b) Since $I \subseteq \mathcal{O}$ is a subgroup and $\mathcal{O}$ is a free abelian group, $I$ is free of rank $\mathrm{rk}(I) \le \mathrm{rk}(\mathcal{O}) = [F : \mathbb{Q}]$.

Let $0 \neq x \in I$, then $x\mathcal{O} \subseteq I$. As $\mathrm{rk}(\mathcal{O}) = \mathrm{rk}(x\mathcal{O}) \le \mathrm{rk}(I)$, it follows that $\mathrm{rk}(I) = \mathrm{rk}(\mathcal{O}) = [F : \mathbb{Q}]$.

The quotient of any two free $\mathbb{Z}$-modules of the same rank is $\mathbb{Z}$-torsion by Theorem 1.48. Hence, $\mathcal{O}/I$ is an abelian group generated by finitely many elements of finite order, hence, it is a finite group.

Example 1.56. Let $f \in \mathbb{Z}[X]$ be an irreducible monic polynomial and let $F = \mathbb{Q}[X]/(f)$ be the number field defined by $f$. Then $\mathbb{Z}[X]/(f)$ is an order in $F$, called the equation order of $f$. Note that it usually is NOT the maximal order, i.e. it is usually not the ring of integers. It may have some kind of ‘singular points’. More on that later.

Definition 1.57. Let $F$ be a number field with ring of integers $\mathbb{Z}_F$ and $0 \neq \mathfrak{a} \subset F$ be a finitely generated $\mathbb{Z}_F$-module. The discriminant of $\mathfrak{a}$ is defined as $\mathrm{disc}(\alpha_1, \dots, \alpha_n)$ for any $\mathbb{Z}$-basis $\alpha_1, \dots, \alpha_n$ of the free $\mathbb{Z}$-module $\mathfrak{a}$ (see Proposition 1.53). (By Proposition 1.44 (c), this definition does not depend on the choice of $\mathbb{Z}$-basis because the basis transformation matrix is invertible with integral entries and thus has determinant $\pm 1$.) The discriminant of $F$ is defined as $\mathrm{disc}(\mathbb{Z}_F)$.

Proposition 1.58. Let $F$ be a number field and $\mathbb{Z}_F$ its ring of integers. Let $0 \neq \mathfrak{a} \subseteq \mathfrak{b} \subset F$ be two finitely generated $\mathbb{Z}_F$-modules. Then the index $(\mathfrak{b} : \mathfrak{a})$ is finite and satisfies

\[ \mathrm{disc}(\mathfrak{a}) = (\mathfrak{b} : \mathfrak{a})^2 \, \mathrm{disc}(\mathfrak{b}). \]
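As a small worked illustration of this formula (an example chosen here, not taken from the lecture), consider $F = \mathbb{Q}(i)$ with $\mathfrak{b} = \mathbb{Z}_F = \mathbb{Z}[i]$ and the ideal $\mathfrak{a} = (1+i)\mathbb{Z}[i]$, which has the $\mathbb{Z}$-basis $1+i$, $i(1+i) = -1+i$. The two embeddings are the identity and complex conjugation.

```latex
\begin{align*}
\mathrm{disc}(\mathfrak{b}) &= \mathrm{disc}(1, i)
  = \det\begin{pmatrix} 1 & i \\ 1 & -i \end{pmatrix}^2 = (-2i)^2 = -4, \\
\mathrm{disc}(\mathfrak{a}) &= \mathrm{disc}(1+i,\, -1+i)
  = \det\begin{pmatrix} 1+i & -1+i \\ 1-i & -1-i \end{pmatrix}^2 = (-4i)^2 = -16, \\
(\mathfrak{b} : \mathfrak{a})^2 &= \frac{\mathrm{disc}(\mathfrak{a})}{\mathrm{disc}(\mathfrak{b})}
  = \frac{-16}{-4} = 4, \qquad\text{so } (\mathfrak{b} : \mathfrak{a}) = 2.
\end{align*}
```

The value $\mathrm{disc}(1, i) = -4$ agrees with Example 1.42 (a) for $d = -1$, and the index $2$ equals $\mathrm{Norm}_{F/\mathbb{Q}}(1+i) = 1^2 + 1^2$.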


1.7 Fields and rings of functions (on a plane curve)

In order to keep the technicalities low, we shall only work in an affine setting and not in a projective one.

Definition 1.59. Let $K \subseteq F$ be a field extension. Let $n \in \mathbb{Z}_{\ge 1}$. The set of $F$-points of affine $n$-space is defined as $\mathbb{A}^n(F) := F^n$ (i.e. $n$-dimensional $F$-vector space).

Let $S \subseteq K[X_1, \dots, X_n]$ be a subset. Then

\[ \mathcal{V}_S(F) := \{ (x_1, \dots, x_n) \in \mathbb{A}^n(F) \mid f(x_1, \dots, x_n) = 0 \text{ for all } f \in S \} \]

is called the set of $F$-points of the affine (algebraic) set belonging to $S$.

If the set $S$ consists of a single non-constant polynomial, then $\mathcal{V}_S(K)$ is also called a hypersurface in $\mathbb{A}^n(K)$.

If $n = 2$ and $S = \{f\}$ with non-constant $f$, then $\mathcal{V}_S(K)$ is called a plane curve (because it is a curve in the plane $\mathbb{A}^2(K)$). Its $F$-points are defined as $\mathcal{V}_S(F)$ for $K \subseteq F$ a field extension.

Convention: When the number of variables is clear, we write $K[X]$ for $K[X_1, \dots, X_n]$. In the same way a tuple $(x_1, \dots, x_n) \in \mathbb{A}^n(K)$ is also abbreviated as $x$ if no confusion can arise.

The letter ‘V’ is chosen because of the word ‘variety’ or ‘vanishing set’.

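Example 1.60. (a) $K = \mathbb{R}$, $n = 2$, $K[X, Y] \ni f(X, Y) = aX + bY + c$ non-constant. Then $\mathcal{V}_{\{f\}}(\mathbb{R})$ is a line ($y = -\frac{a}{b}x - \frac{c}{b}$ if $b \neq 0$; if $b = 0$, then it is the line with $x$-coordinate $-\frac{c}{a}$ and any $y$-coordinate).

(b) $K = \mathbb{R}$, $n = 2$, $K[X, Y] \ni f(X, Y) = X^2 + Y^2 - 1$. Then $\mathcal{V}_{\{f\}}(\mathbb{R})$ is the circle in $\mathbb{R}^2$ around the origin with radius $1$.

(c) $K = \mathbb{Q}$, $f(X, Y) := X^2 + Y^2 + 1$. Note $\mathcal{V}_{\{f\}}(\mathbb{R}) = \emptyset$, but $(0, i) \in \mathcal{V}_{\{f\}}(\mathbb{C})$.

(d) $K = \mathbb{F}_2$, $f(X, Y) := X^2 + Y^2 + 1 = (X + Y + 1)^2 \in \mathbb{F}_2[X, Y]$. Because of $f(a, b) = 0 \iff a + b + 1 = 0$ for any $a, b \in L$, where $L/\mathbb{F}_2$ is a field extension, we have

\[ \mathcal{V}_{\{f\}}(L) = \mathcal{V}_{\{X+Y+1\}}(L), \]

which is a line.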

Example 1.61. (a) Let $K = \mathbb{Q}$ and consider $f(X, Y) = X^2 + Y^2 - 1$ and the $\mathbb{Q}$-points of the associated curve $C = S^1 = \mathcal{V}_{\{f\}}(\mathbb{Q})$. They correspond in a precise way to primitive Pythagorean triples $(a, b, c)$ for $a, b, c \in \mathbb{Z}$ and $a^2 + b^2 = c^2$. For details see an exercise.

Note that this is a nice and first illustration of the deep relations between geometry and number theory (algebra). We will encounter several in this course.

(b) Let $K$ be a field and consider $f(X, Y) = X^2 + Y^2$.

The only solution of the form $(x, 0)$ is $(0, 0)$ in any field $K$. Suppose now $(x, y)$ is a solution with $y \neq 0$. Then $x^2 = -y^2$, or $z^2 = -1$ with $z = \frac{x}{y}$.

Hence, $\mathcal{V}_{\{f\}}(K) = \{(0, 0)\}$ if and only if $X^2 = -1$ has no solution in $K$.

In particular, $\mathcal{V}_{\{f\}}(\mathbb{R}) = \{(0, 0)\}$ (but: $\mathcal{V}_{\{f\}}(\mathbb{C}) = \mathcal{V}_{\{X - iY\}}(\mathbb{C}) \cup \mathcal{V}_{\{X + iY\}}(\mathbb{C})$, union of two lines) and $\mathcal{V}_{\{f\}}(\mathbb{F}_p) = \{(0, 0)\}$ if and only if $p \equiv 3 \pmod 4$.
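The last statement is easy to test by brute force for small primes. The following Python sketch lists the $\mathbb{F}_p$-points of $X^2 + Y^2 = 0$, representing $\mathbb{F}_p$ by the integers $0, \dots, p-1$; the function name `affine_points` is hypothetical.

```python
def affine_points(p):
    """All (x, y) in F_p x F_p with x^2 + y^2 = 0, for a prime p."""
    return [(x, y) for x in range(p) for y in range(p)
            if (x * x + y * y) % p == 0]

for p in [2, 3, 5, 7, 11, 13, 17, 19, 23]:
    only_origin = affine_points(p) == [(0, 0)]
    print(p, p % 4, only_origin)
```

For $p = 2$ the point $(1, 1)$ also lies on the curve, which is consistent with the criterion because $-1 = 1$ is a square in $\mathbb{F}_2$.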

Example 1.62. Let $K$ be a field and $f(X) = X^3 + aX^2 + bX + c \in K[X]$ be a separable polynomial (meaning that it has no multiple zeros over $\overline{K}$).

Any plane curve of the form $\mathcal{V}_{\{Y^2 - f(X)\}}$ is called an elliptic curve.
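For a finite field $\mathbb{F}_p$ the affine points of such a curve can simply be enumerated. The sketch below does this for the sample curve $Y^2 = X^3 + X + 1$ over $\mathbb{F}_5$. The curve, the prime and the function name `curve_points` are choices made for this illustration only; separability of $f$ is assumed and not checked by the code.

```python
def curve_points(a, b, c, p):
    """Affine F_p-points of Y^2 = X^3 + a X^2 + b X + c for a prime p.
    The polynomial on the right is assumed to be separable over F_p."""
    return [(x, y) for x in range(p) for y in range(p)
            if (y * y - (x ** 3 + a * x * x + b * x + c)) % p == 0]

# Sample curve Y^2 = X^3 + X + 1 over F_5. Its discriminant
# -4*1^3 - 27*1^2 = -31 is non-zero modulo 5, so f has no multiple zeros.
points = curve_points(0, 1, 1, 5)
print(len(points))
print(points)
```

Since the lecture only treats the affine version, the point at infinity of the projective curve does not appear in this list.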


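Definition 1.63. Let $\mathcal{X}$ be a subset of $\mathbb{A}^n(K)$.

The coordinate ring $K[\mathcal{X}]$ of $\mathcal{X}$ is the ring of functions $\mathcal{X} \to K$ that are described by polynomials. More precisely, it is the image of the ring homomorphism

\[ \varphi : K[X] \to \mathrm{Maps}(\mathcal{X}, K), \quad f \mapsto \big( (x_1, \dots, x_n) \mapsto f(x_1, \dots, x_n) \big) \]

(with $+$ and $\cdot$ on $\mathrm{Maps}(\mathcal{X}, K)$ defined pointwise: $(f + g)(x) := f(x) + g(x)$ and $(f \cdot g)(x) := f(x) \cdot g(x)$).

The kernel of $\varphi$ is called the vanishing ideal of $\mathcal{X}$:

\[ \mathcal{I}_{\mathcal{X}} := \ker(\varphi) = \{ f \in K[X] \mid f(x) = 0 \text{ for all } x \in \mathcal{X} \}. \]

By the isomorphism theorem, we have $K[\mathcal{X}] \cong K[X]/\mathcal{I}_{\mathcal{X}}$.

Proposition 1.64. Let $K$ be a field and $f \in K[X, Y]$ a nonconstant irreducible polynomial. Let $C = \mathcal{V}_f(K)$ be the associated plane curve. Assume that $\mathcal{V}_f(K)$ is infinite (which is automatic if $K = \overline{K}$ is algebraically closed).

Then the vanishing ideal $\mathcal{I}_C$ is $(f)$ and the coordinate ring $K[C]$ is isomorphic to $K[X, Y]/(f)$.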

Example 1.65. • Linef(X, Y) :=X−Y + 2∈R[X, Y],L:=Vf(R):

We haveIL= (X−Y + 2), i.e. that the vanishing ideal ofLis the principal ideal generated byf.

This is a consequence of Proposition 1.64.

We compute the structure of the coordinate ring in this case. Consider the ring homomorphism:

ϕ:R[X, Y]→R[T], g(X, Y)7→g(T, T + 2).

Note that this homomorphism is chosen such thatX−Y + 2gets mapped toT−(T+ 2) + 2 = 0 and so lies in the kernel. We now prove that the kernel is equal toIL(and hence to(X−Y + 2)).

Letg∈ker(ϕ). This meansg(T, T + 2)is the zero polynomial. If we now take a point(x, y) ∈ L, then it satisfiesy = x+ 2, whenceg(x, y) = g(x, x+ 2) = 0because it is equal tog(T, T + 2) evaluated atT =x. This meansg∈ IL, as claimed.

From the isomorphism theorem, we now obtain that the coordinate ring is just the polynomial ring in one variable:

R[L] =R[X, Y]/IL=R[X, Y]/(X−Y + 2)∼=R[T].

In other words, the coordinate functions satisfy the equalityx2 =x1+ 2.

• Parabola f(X, Y) := X² − Y + 2 ∈ R[X, Y], P := V_f(R):

Again by Proposition 1.64 we have I_P = (X² − Y + 2).

With arguments similar to those used before, we conclude that the coordinate ring is

R[P] = R[X, Y]/I_P = R[X, Y]/(X² − Y + 2) ≅ R[T],

where the last isomorphism is given by sending the class of g(X, Y) to g(T, T² + 2). So, it is again isomorphic to the polynomial ring in one variable.

• Hyperbola f(X, Y) := XY − 1 ∈ R[X, Y], H := V_f(R):

We again have I_H = (XY − 1) by Proposition 1.64. This time we obtain

R[H] = R[X, Y]/(XY − 1) ≅ R[X, 1/X] = { ∑_{i=e}^{f} a_i Xⁱ | e, f ∈ Z, a_i ∈ R } ⊂ R(X) := Frac(R[X]).


Note that this ring is not isomorphic to the polynomial ring in one variable. For, suppose to the contrary that there is a ring isomorphism ϕ : R[X, 1/X] → R[T]. As X is a unit, so is ϕ(X). Thus, ϕ(X) ∈ R[T]^× = R^× is a constant polynomial, and so is ϕ(1/X) = ϕ(X)⁻¹. Nonzero constants are units as well, so ϕ maps them to constants. Consequently, the image of ϕ lands in R, contradicting the surjectivity.
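The substitution homomorphisms used for the line and the parabola in Example 1.65 can be checked mechanically. The following is a minimal sketch with a hand-rolled polynomial representation (the helper names `trim`, `add`, `mul`, `power` and `substitute` are hypothetical and not part of the lecture): a polynomial in T is a list of coefficients from degree 0 upwards, and a polynomial in X, Y is a dictionary mapping exponent pairs (i, j) to coefficients.

```python
def trim(a):
    """Drop trailing zero coefficients; the zero polynomial becomes []."""
    a = list(a)
    while a and a[-1] == 0:
        a.pop()
    return a

def add(a, b):
    """Sum of two polynomials in T."""
    out = [0] * max(len(a), len(b))
    for i, c in enumerate(a):
        out[i] += c
    for i, c in enumerate(b):
        out[i] += c
    return trim(out)

def mul(a, b):
    """Product of two polynomials in T."""
    if not a or not b:
        return []
    out = [0] * (len(a) + len(b) - 1)
    for i, ca in enumerate(a):
        for j, cb in enumerate(b):
            out[i + j] += ca * cb
    return trim(out)

def power(a, n):
    """n-th power of a polynomial in T, for an integer n >= 0."""
    out = [1]
    for _ in range(n):
        out = mul(out, a)
    return out

def substitute(g, px, py):
    """Return g(px(T), py(T)) where g = {(i, j): c} stands for sum of c * X^i * Y^j."""
    out = []
    for (i, j), c in g.items():
        out = add(out, mul([c], mul(power(px, i), power(py, j))))
    return out

T = [0, 1]  # the polynomial T

# Line: X - Y + 2 under X -> T, Y -> T + 2
line = {(1, 0): 1, (0, 1): -1, (0, 0): 2}
line_image = substitute(line, T, [2, 1])

# Parabola: X^2 - Y + 2 under X -> T, Y -> T^2 + 2
parabola = {(2, 0): 1, (0, 1): -1, (0, 0): 2}
parabola_image = substitute(parabola, T, [2, 0, 1])

# A polynomial outside the kernel: XY on the line maps to T(T + 2) = 2T + T^2
xy_image = substitute({(1, 1): 1}, T, [2, 1])

print(line_image, parabola_image, xy_image)
```

Both defining polynomials are sent to the zero polynomial, as expected. The hyperbola would need Laurent polynomials (negative exponents of T), which this sketch does not model.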

Definition 1.66. Let X ⊆ Aⁿ(K) be a subset. We say that X is reducible if there are two affine sets X₁ = V_{S₁}(K), X₂ = V_{S₂}(K) ⊆ Aⁿ(K) such that

X ⊆ X₁ ∪ X₂

and

X ⊈ X₁ and X ⊈ X₂.

An affine set X ⊆ Aⁿ(K) is called an affine variety if X is irreducible (i.e. not reducible).

Example 1.67. • Let f(X, Y) = XY ∈ R[X, Y]. Then V_f(R) is the union of the x-axis and the y-axis, so clearly V_f(R) is reducible. More precisely,

V_f(R) = V_X(R) ∪ V_Y(R).

• The line L defined by X − Y + 2 is irreducible. Attention: it is reducible for the usual real topology (take two closed ‘half lines’ overlapping).

• The hyperbola H is also irreducible. This is a consequence of the next proposition, since the coordinate ring R[H] is an integral domain.

We can now formulate a topological statement on an affine algebraic set as a purely algebraic statement on the coordinate ring! This kind of phenomenon will be encountered all the time in the sequel of the lecture.

Proposition 1.68. Let ∅ ≠ X ⊆ Aⁿ(K) be an affine set. Then the following statements are equivalent:

(i) X is irreducible (i.e. X is a variety).

(ii) The coordinate ring K[X] is an integral domain.
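For the reducible set V_f(R) with f = XY from Example 1.67, the failure of (ii) can be seen directly: the coordinate functions x and y are both nonzero in the coordinate ring, but their product is zero. The following is a small sketch that checks this on a finite sample of integer points (the sample and the helper names are illustrative assumptions; a finite sample proves that x and y are nonzero functions, while the vanishing of xy on all of V_f(R) holds by the definition of V_f).

```python
# Integer sample points on V_{XY}(R): points on the x-axis and on the y-axis
points = [(t, 0) for t in range(-3, 4)] + [(0, t) for t in range(-3, 4)]

def x_coord(pt):
    """The coordinate function x restricted to the curve."""
    return pt[0]

def y_coord(pt):
    """The coordinate function y restricted to the curve."""
    return pt[1]

# Neither coordinate function vanishes identically on the sample ...
x_is_zero_function = all(x_coord(pt) == 0 for pt in points)
y_is_zero_function = all(y_coord(pt) == 0 for pt in points)

# ... but their product does: x and y are zero divisors in the coordinate ring
product_is_zero_function = all(x_coord(pt) * y_coord(pt) == 0 for pt in points)

print(x_is_zero_function, y_is_zero_function, product_is_zero_function)
```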

Definition 1.69. Let X ⊆ Aⁿ(K) be an affine variety (so that its coordinate ring K[X] is an integral domain).

Then the function field of X is the field of fractions of K[X]. It is denoted K(X).

The elements in the function field are thus fractions f/g with f, g ∈ K[X] and g ≠ 0.

IMPORTANT: We cannot view these fractions inside Maps(X, K) because the denominator of f(x₁, …, x_n)/g(x₁, …, x_n) may be zero for some (x₁, …, x_n) ∈ X. So, an element f/g of the function field only gives us a map

X \ V_g(K) → K, (x₁, …, x_n) ↦ f(x₁, …, x_n)/g(x₁, …, x_n).

It is not everywhere defined. If one introduces a suitable topology, one can see that it is defined on an open set.

We will go more into that when we discuss local vs. global below.
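The set on which a fraction can be evaluated depends on the chosen representative. As a sketch (the unit circle x² + y² = 1 over Q and the two fractions below are illustrative choices, not taken from this part of the lecture): in the function field of the circle, (1 − x)/y and y/(1 + x) are the same element, because (1 − x)(1 + x) = 1 − x² = y² on the circle. The first representative cannot be evaluated at (1, 0), the second one can. The rational points are produced by the standard parametrisation t ↦ ((1 − t²)/(1 + t²), 2t/(1 + t²)).

```python
from fractions import Fraction

def circle_point(t):
    """Rational point on x^2 + y^2 = 1 for the parameter t."""
    t = Fraction(t)
    d = 1 + t * t
    return ((1 - t * t) / d, 2 * t / d)

def r1(x, y):
    """Representative (1 - x) / y; returns None where the denominator vanishes."""
    return None if y == 0 else (1 - x) / y

def r2(x, y):
    """Representative y / (1 + x); returns None where the denominator vanishes."""
    return None if x == -1 else y / (1 + x)

params = [1, 2, Fraction(1, 3), -5]

# Where both representatives are defined they agree (both equal the parameter t)
agree = all(r1(*circle_point(t)) == r2(*circle_point(t)) == t for t in params)

# At (1, 0) only the second representative can be evaluated
at_one_zero = (r1(*circle_point(0)), r2(*circle_point(0)))

# At (-1, 0) neither of these two representatives can be evaluated
at_minus_one_zero = (r1(Fraction(-1), Fraction(0)), r2(Fraction(-1), Fraction(0)))

print(agree, at_one_zero, at_minus_one_zero)
```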


1.8 Morphisms between curves

Definition 1.70. Let C₁ = V_f(K) and C₂ = V_g(K) be affine plane curves. A map ϕ : C₁ → C₂ is a morphism of curves if it is given by polynomials in the following sense: there are ϕ₁, ϕ₂ ∈ K[C₁] such that for all (x, y) ∈ C₁ we have

ϕ(x, y) = (ϕ₁(x, y), ϕ₂(x, y)) ∈ C₂.

We speak of an isomorphism if there is also a morphism ψ : C₂ → C₁ such that ψ ∘ ϕ = id and ϕ ∘ ψ = id.

Example 1.71. Let f(X, Y) = X² + Y² − 1 and g(X, Y) = X + Y − 1 be polynomials (say in R[X, Y]).

The associated curves V_f(R) and V_g(R) are the unit circle and the line through (0, 1) and (1, 0) (of slope −1). Then

ϕ : V_f(R) → V_g(R), (x, y) ↦ (x², y²)

is a morphism of curves.
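That ϕ really lands on the line follows from x² + y² = 1. This can also be checked numerically on a few points; the sketch below uses exact rational points on the circle obtained from the standard parametrisation t ↦ ((1 − t²)/(1 + t²), 2t/(1 + t²)), which is an auxiliary choice and not part of the example.

```python
from fractions import Fraction

def circle_point(t):
    """Rational point on x^2 + y^2 = 1 for the parameter t."""
    t = Fraction(t)
    d = 1 + t * t
    return ((1 - t * t) / d, 2 * t / d)

def f(x, y):
    """Defining polynomial of the unit circle."""
    return x * x + y * y - 1

def g(x, y):
    """Defining polynomial of the line through (0, 1) and (1, 0)."""
    return x + y - 1

def phi(x, y):
    """The morphism (x, y) -> (x^2, y^2)."""
    return (x * x, y * y)

samples = [circle_point(t) for t in (0, 1, -2, Fraction(3, 7))]

on_circle = all(f(x, y) == 0 for x, y in samples)
images_on_line = all(g(*phi(x, y)) == 0 for x, y in samples)

print(on_circle, images_on_line)
```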

Proposition 1.72. Let ϕ : C₁ → C₂ be a morphism of curves given by ϕ₁, ϕ₂ ∈ K[C₁].

Then the map

ϕ* : K[C₂] → K[C₁], g ↦ ϕ*(g) := g ∘ ϕ

is a K-algebra homomorphism.

Explicitly, we have ϕ*(g)(x, y) = g(ϕ(x, y)) = g(ϕ₁(x, y), ϕ₂(x, y)).
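For the morphism of Example 1.71 the pullback is simply precomposition with (x, y) ↦ (x², y²). The following sketch (the function names and the test functions `g1`, `g2` are hypothetical) evaluates pullbacks on a few exact points of the circle. The most informative check is the last one: X + Y − 1 is zero in the coordinate ring of the line, and its pullback x² + y² − 1 is zero in the coordinate ring of the circle, which is what makes the pullback well-defined on classes modulo the vanishing ideal.

```python
from fractions import Fraction

def circle_point(t):
    """Rational point on x^2 + y^2 = 1 for the parameter t."""
    t = Fraction(t)
    d = 1 + t * t
    return ((1 - t * t) / d, 2 * t / d)

def phi(x, y):
    """The morphism of Example 1.71 from the circle to the line."""
    return (x * x, y * y)

def pullback(g):
    """Return the function g o phi on the circle."""
    return lambda x, y: g(*phi(x, y))

# Two illustrative functions on the line, given by polynomials in X and Y
g1 = lambda X, Y: 3 * X - Y
g2 = lambda X, Y: X * Y + 2

samples = [circle_point(t) for t in (0, 1, -2, Fraction(3, 7))]

# The pullback respects sums and products (checked pointwise on the samples)
additive = all(
    pullback(lambda X, Y: g1(X, Y) + g2(X, Y))(x, y)
    == pullback(g1)(x, y) + pullback(g2)(x, y)
    for x, y in samples
)
multiplicative = all(
    pullback(lambda X, Y: g1(X, Y) * g2(X, Y))(x, y)
    == pullback(g1)(x, y) * pullback(g2)(x, y)
    for x, y in samples
)

# The defining polynomial of the line pulls back to the zero function on the circle
line_pulls_back_to_zero = all(
    pullback(lambda X, Y: X + Y - 1)(x, y) == 0 for x, y in samples
)

print(additive, multiplicative, line_pulls_back_to_zero)
```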

We also have a converse.

Proposition 1.73. Let C₁, C₂ be affine plane curves. Let ψ : K[C₂] → K[C₁] be a K-algebra homomorphism. Put ϕ₁ := ψ(X + I_{C₂}) ∈ K[C₁] and ϕ₂ := ψ(Y + I_{C₂}) ∈ K[C₁].

Then the map

ϕ : C₁ → C₂, (x, y) ↦ (ϕ₁(x, y), ϕ₂(x, y))

is well-defined and ϕ* = ψ.

Proof. We must check that (ϕ₁(x, y), ϕ₂(x, y)) lies in C₂ whenever (x, y) lies in C₁. For that purpose, let g ∈ I_{C₂} be any polynomial in the vanishing ideal of C₂. Note that

g(X + I_{C₂}, Y + I_{C₂}) = g(X, Y) + I_{C₂} = 0 + I_{C₂}.

Consequently, we have

0 = ψ(g(X + I_{C₂}, Y + I_{C₂})) = g(ψ(X + I_{C₂}), ψ(Y + I_{C₂})) = g(ϕ₁, ϕ₂) ∈ K[C₁].

Hence,

g(ϕ₁(x, y), ϕ₂(x, y)) = 0 for all (x, y) ∈ C₁.

Applying this to the polynomial defining C₂, which lies in I_{C₂}, shows that (ϕ₁(x, y), ϕ₂(x, y)) lies in C₂.

It suffices to check the equality ϕ* = ψ on the generators X + I_{C₂}, Y + I_{C₂} ∈ K[C₂]:

ϕ*(X + I_{C₂})(x, y) = ϕ₁(x, y) = ψ(X + I_{C₂})(x, y),

and similarly for the other one.

Now we do the same thing again, but not with the coordinate rings, but their quotient fields, i.e. function fields. In order for the latter to be defined, we will impose that the curves are irreducible.


Definition 1.74. Let C₁ = V_f(K) and C₂ = V_g(K) be irreducible affine plane curves.

A rational map ϕ of curves from C₁ to C₂ is given by rational functions ϕ₁, ϕ₂ ∈ K(C₁) such that for all (x, y) ∈ C₁ where both ϕ₁ and ϕ₂ are defined, we have

ϕ(x, y) = (ϕ₁(x, y), ϕ₂(x, y)) ∈ C₂.

We use the piece of notation ϕ : C₁ ⇢ C₂.

Proposition 1.75. Let ϕ : C₁ ⇢ C₂ be a rational map of irreducible affine plane curves given by ϕ₁, ϕ₂ ∈ K(C₁).

Then the map

ϕ* : K(C₂) → K(C₁), g ↦ ϕ*(g) = g ∘ ϕ

is a K-field homomorphism, i.e. a (necessarily injective) field homomorphism that is the identity on K. In other words, we have field extensions

K ⊂ ϕ*(K(C₂)) ⊆ K(C₁).

The extension ϕ*(K(C₂)) ⊆ K(C₁) is finite.

Explicitly, we have ϕ*(g)(x, y) = g(ϕ(x, y)) = g(ϕ₁(x, y), ϕ₂(x, y)).

As before, we also have a converse with a similar proof.

Proposition 1.76. Let C₁, C₂ be irreducible affine plane curves. Let ψ : K(C₂) → K(C₁) be a K-field homomorphism. Put ϕ₁ := ψ(X + I_{C₂}) ∈ K(C₁) and ϕ₂ := ψ(Y + I_{C₂}) ∈ K(C₁).

Then the map

ϕ : C₁ ⇢ C₂, (x, y) ↦ (ϕ₁(x, y), ϕ₂(x, y))

is a well-defined rational map and ϕ* = ψ.

Example 1.77. Let F be a field and let E₁ and E₂ be two elliptic curves over F. In line with the approach in these lectures, we see the elliptic curves as affine plane curves, but we equip them with ‘a point at infinity’ O. Then they have the structure of an abelian group with O as neutral element.

An isogeny between E₁ and E₂ is a morphism of curves

ϕ : E₁ → E₂

such that ϕ(O) = O. It is known that if ϕ is not the zero map, then it is surjective with finite kernel and in that case we obtain the injection of function fields

ϕ* : F(E₂) → F(E₁).

The degree of this field extension is called the degree of the isogeny. Moreover, the isogeny is called separable if the function field extension has this property.

The most important isogenies are those given by multiplication by an integer m ∈ Z:

[m] : E → E, P ↦ mP

for any elliptic curve E. Its degree is m².
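The degree m² can be made tangible through the kernel: for a separable isogeny, the number of points in the kernel over an algebraic closure equals the degree (a standard fact that is not stated above). The following is a minimal sketch for m = 2, assuming the illustrative curve y² = x³ − x over F₇, where the right-hand side splits as x(x − 1)(x + 1), so all 2-torsion points are already defined over F₇. The point at infinity O is modelled as `None`, and the usual chord-and-tangent formulas for short Weierstrass equations are used; the names `ec_add` and `ec_mul` are hypothetical.

```python
p = 7          # illustrative prime (assumption, not from the lecture)
a, b = -1, 0   # illustrative curve y^2 = x^3 - x over F_7
O = None       # the point at infinity

def ec_add(P, Q):
    """Chord-and-tangent addition on y^2 = x^3 + a*x + b over F_p."""
    if P is O:
        return Q
    if Q is O:
        return P
    x1, y1 = P
    x2, y2 = Q
    if x1 == x2 and (y1 + y2) % p == 0:
        return O  # Q = -P (this includes doubling a point with y = 0)
    if P == Q:
        # tangent slope; pow(n, -1, p) is the modular inverse (Python 3.8+)
        lam = (3 * x1 * x1 + a) * pow(2 * y1 % p, -1, p) % p
    else:
        # chord slope
        lam = (y2 - y1) * pow((x2 - x1) % p, -1, p) % p
    x3 = (lam * lam - x1 - x2) % p
    y3 = (lam * (x1 - x3) - y1) % p
    return (x3, y3)

def ec_mul(m, P):
    """Compute [m]P for an integer m >= 0 by repeated addition."""
    R = O
    for _ in range(m):
        R = ec_add(R, P)
    return R

# All F_7-rational points, including O
points = [O] + [
    (x, y) for x in range(p) for y in range(p)
    if (y * y - (x**3 + a * x + b)) % p == 0
]

# Kernel of [2]: the points P with 2P = O
kernel_of_2 = [P for P in points if ec_mul(2, P) is O]

# [2] fixes O and is compatible with the group law
fixes_O = ec_mul(2, O) is O
is_homomorphism = all(
    ec_mul(2, ec_add(P, Q)) == ec_add(ec_mul(2, P), ec_mul(2, Q))
    for P in points for Q in points
)

print(kernel_of_2, fixes_O, is_homomorphism)
```

For other values of m the full kernel of [m] is in general only visible over an extension of the base field, so counting F₇-rational points need not give m².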