R ENDICONTI
del
S EMINARIO M ATEMATICO
della
U NIVERSITÀ DI P ADOVA
J. W. S. C ASSELS
Computer-aided serendipity
Rendiconti del Seminario Matematico della Università di Padova, tome 93 (1995), p. 187-197
<http://www.numdam.org/item?id=RSMUP_1995__93__187_0>
© Rendiconti del Seminario Matematico della Università di Padova, 1995, tous droits réservés.
L’accès aux archives de la revue « Rendiconti del Seminario Matematico della Università di Padova » (http://rendiconti.math.unipd.it/) implique l’accord avec les conditions générales d’utilisation (http://www.numdam.org/conditions).
Toute utilisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
J. W. S.
CASSELS (*)
Number
Theory
is anexperimental
science.Unexpected regulari-
ties are noticed in the results of calculations and
only subsequently
areexplanations
andproofs
found. Forexample,
the Law ofQuadratic Reciprocity
was knownby experiment
to Euler and otherslong
beforethe first
complete proof
was foundby
Gauss: and Gauss’ firstproof
is averification of a kind which would not have
suggested
itself if the Lawwas not
already
knownempirically.
The advent of electroniccomputers
has revolutionized the
possibilities
of numericalexperimentation
andso
greatly
increased the number of theoremssuggested
in this way and ofconjectures
stillawaiting proof.
Another boon ofcomputers
is thatone can often
easily
test aconjecture suggested
in other ways: itwould
be foolish to
expend
intellectual energy inattempting
to prove it if a littleexperimentation produces
acounter-example.
I am not a
computer expert,
but I am fortunate inhaving
friendswho are, and in
having
been a witness ofearly developments.
The firstapproaches
to electroniccomputing
were madeduring
the war withspecialized
devices for ballistic orcryptographic
purposes.They
lackedversatility,
and if you wanted to do another calculation you had to un-(1)
«Coinedby
HoraceWalpole
upon the title of thefairy
tale "The threeprinces
ofSerendip",
the heroes of which "werealways making
discoveries,by
accident and
sagacity,
ofthings they
were not in quest of".»Serendip
is an oldname for Sri Lanka =
Ceylon.
An oftenquoted example
islooking
for a needle ina
haystack
andfinding
the farmer’sdaughter.
(1)
Lecturegiven
in Padova in October 1993 and elsewhere.(*) Indirizzo dell’A.:
Department
of Pure Mathematics and MathematicalStatistics, University
ofCambridge,
16 Mill Lane,Cambridge
CB2 1SB, U.K.plug
andreplug
wires orsomething
of the sort. At least inCambridge,
it is claimed that the true mark of an electronic
computer
is that it will readin,
store and act on a program. I waspresent
at a demonstrationin,
Ithink, May
1949 of almost the first program on the very first com-puter
EDSAC. This was a roomful ofequipment
but waslaughably
weak
by today’s
standards.Slowly
itprinted
out theprime
numbers up to 100. At least this is what it wassupposed
todo,
but itprinted
out thesquares of
primes
as well: the program contained abug!
There was no such
profession
ascomputer
science in thosedays.
Several of my fellow research students in number
theory
found thechallenges
of the newdiscipline congenial
and moved into it. Imight
add that I have
always encouraged
my researchpupils
to learn aboutcomputers. Apart
from the more obviousadvantages,
ifthey
turn outnot to be creative mathematicians
they will,
atleast,
haveacquired
asaleable skill.
There were
computers
before there were electroniccomputers.
Inparticular,
there were machines whichoperated
cardsprepared by punching
holes. The machine sorted themby sensing
the holes mechan-ically
or,later, optically.
These machines wereprimarily
forbusiness,
but were
occasionally
used for mathematics. Oneapplication
was’toWaring’s Problem,
that is forgiven n (say n
=5)
tofinding
an s suchthat every
positive integer x
is the sum of at most s n-th powers ofposi-
tive
integers.
Theanalytic approach naturally produces
an s such thatthis is true for all
large enough
x, say for x > xo . Then toget
anelegant
formulation one has to check for the
remaining r %
xo . This was adrudgery
for which thepunched
card machines were well suited. The Soviet mathematician Buchstab(1940)
made a moresophisticated
ap-plication.
For asieving problem
he defined a functionrecursively:
toevaluate it he used
punched
cards.Incidentally,
theshape
of thepunched
cards is still reflected in theprocrustean
format of the FOR-TRAN statement.
Although
electroniccomputers
weredeveloped
for less serious pur- poses,they
were soon used for numbertheory. Perhaps
the earliest ap-plication
wasby
von Neumann and Goldstine(1953)
to aconjecture
ofKummer about the distribution of certain
trigonometric
sums involv-ing
a cubic residue character( « Kummer sums»).
Kummer had made theconjecture
on the basis of hand calculations. Von Neumann and Goldstine carried the calculations much further anddestroyed
itsplau- sibility. Programming
was a much moreexacting
artthen,
both be-cause of the feeble power of the machines and because the modern pro-
gramming languages
did notexist,
so it all had to be done in «machine code». The programmer was Mrs AtleSelberg.
Theproblem
has nowbeen cleared up
by
R. Heath-Brown and S. J. Patterson(1979).
Before
proceeding further,
we must recall some facts aboutelliptic
curves over the rational
For
present
purposes, anelliptic
curve eover Q
isgiven by
where the aj E
~,
and thepolynomial
on theright
hand side has no re-peated
factors. Apoint 3i = ( x, y )
on e is said to be rational if x, y There is asingle point
on e atinfinity,
which is also rationalby
defini-tion. A fundamental
fact,
which we do not provehere,
is that the set C~of rational
points
on e form an abelian group with o as zero. Oneputs
precisely
whenthe rj
are collinear(Figure 1):
inparticular,
The difficult
thing
is to prove that addition is associative.A celebrated result is Mordell’s finite basis theorem: the group 0153 is
finitely generated. This,
I note inpassing,
is asplendid example
of
serendipity, though
notcomputer-aided.
He wasaiming
at some-Fig. 1.
(3)
For a fuller discussion in the same vein see, forexample,
the author’s book Cassels (1991).thing quite
different.Later,
that wasproved by Siegel:
the finitebasis theorem
being
akey ingredient (4).
We shall need to know a little of the ideas which enter into a modern
proof
of Mordell’stheorem,
and consider thespecial
casewith
distinct ej E
Z. If(x, y)
is a rationalpoint
on e, it is easy to see that there are u,v, t
e Z such that X =ujt2,
y = are fractions in theirlowest terms. It follows that
It is
readily
verified that thegreatest
common divisor of the first twofactors on the
right
hand side divides el - e2etc.,
and so,taking
thesquarefree
kernels of the three factors we havefor some sl , s2 , S3 where the
triplet
ml , m2 ,m3c= Z
is from a finiteset. On
eliminating u
from(6)
weget
thefollowing
three simultaneousquadratic equations
in of whichonly
two areindepen-
dent :
The
triplet {mi,
’Yn2,corresponds precisely
to the the class of g =( x, y )
EG
modulo2(B.
To seewhy
there should be this link between theexpressions (6)
and the group structure of0153,
we suppose that e isgiven by (4)
and that thepoints rj=(xj,yj)
offigure
1 lie on the lineY = IX + n. The values of
the Xj
are obtainedby eliminating Y,
and soare
given by
(~)
For a historical discussion, see Cassels (1986).On
substituting an ej
for X we see thatis a square. It is not difficult to
supply
the rest of theargument.
Theproof
of finitegeneration
now uses a differenttype
ofargument
that wedo not need to go into here. Since it easy to determine the torsion
part (5)
of@j,
aknowledge
of the number oftriplets
m2 , forwhich
(7)
is solublegives
the rank of (S(the
number ofgenerators
of in- finiteorder).
Similar resultsapply
to ageneral e
not of thespecial
form(4).
Note that we have shown that the rank of (S is
finite,
but we have noway to determine that
rank,
even inprinciple.
All we have is an upper bound. Toget
theprecise rank,
we should have to decide forgiven
I ml,
m,2 , whether there is a of(7)
or not.For a
long
time there was no infallible decisionprocedure:
as thelogi-
cians would say, the rank was not
effectively computible.
It isonly
inthe last few years that this situation has
begun
tochange.
If we can prove that some more of the
equations (7)
areinsoluble,
we
get
asharper
bound for the rank. We recall thetheory
of asingle equation
where F is a
quadratic
form with rational coefficients. The equa- tionhas no solution
(6)
inQ
as there are none in the real field R.Similarly,
the
equation
has no solution in
Q
onconsidering
the behaviour(7)
of apossible
sol-ution modulo powers of 3: that
is,
there is no solution in the 3-adic field We say that aDiophantine equation
is«everywhere locally
soluble»if the are solutions in R and in every
p-adic
fieldQp
and that it is «solu- bleglobally»
if there is a solution inQ.
Hassesuccinctly
reformulated(~) Only
the2-primary
part is relevant here.(6 )
Asalways,
we exclude the trivial solution in which all the variables are 0.(7)
For y,2:}
is a solution, we may suppose that x, y, z have no com- mon factor. Then x, y must be divisibleby
3, and (12) nowimplies
that z is divisi-ble
by
3. Contradiction!existing
criteria for thesolubility
of asingle equation (10),
where F is aquadratic
form: «If there is a solutioneverywhere locally,
then there isa solution
globally».
There are other situations to which such a state- mentapplies.
It is known as a «Hasseprinciple»
or«local-global principle-.
There is no
local-global principle
for simultaneousequations
of thetype (7).
For any we can,however, always
decidewhether
(7)
iseverywhere locally
soluble. Thisgives
a better upper bound for the rank of (M. We can check whether this bound is the actual rankby searching (preferably by machine!)
for solutions. If one canproduce
a solution for all theeverywhere locally
soluble(7),
then theupper bound is the actual rank. If a
thorough
search fails to find the re-quired solutions,
then there is astrong suspicion
that the rank is small-er than the upper bound. As a matter of
experimental observation,
theupper bound is
usually
attained. But notalways.
After extensive com-putations
Selmer(1954)
made thetotally unexpected conjecture:
thedifference
between the upper bound and the actual rank isalways
even.
Actually
Selmer worked not withgeneral
curves(1)
but withwhere A E Z is
given.
For this he found upper bounds for the rankby,
methods in
which, roughly speaking, multiplication by
3[more precise- ly, multiplication by V--31 replaces multiplication by
2. He made ex-tensive calculations
by
hand andcompleted
them in one of the earliestuses of a
computer(8)
The curve(13)
isbirationally equivalent over 0
to
which is of
type (1).
Hence there is a bound for the rank via Selmer found evidence that the difference between the two bounds isalways
even.The most
plausible
way in which even numbersnaturally
arise is asthe rank of a
skew-symmetric
matrix. In themean-time,
for anyelliptic
curve e Tate and Shafarevich had constructed a group,
usually
denotedby III,
whichgives
the obstruction to alocal-global principle.
With sev-eral
years’
hard work Imanaged
tofind(9)
askew-symmetric
form on(1)
Selmerexemplifies
theearly
connection between numbertheory
and com- puters. He has been called the father of computer science inNorway.
(9)
Cassels (1962). Forgeneralizations
see Tate (1962) and Milne (1986).III whose kernel is the set of
infinitely
divisible elements. If one as- sumes that theonly infinitely-divisible
element is0,
thisimplies
Selmer’s
conjectures.
We now come to the celebrated
conjectures
of Birch and Swinner-ton-Dyer (1965),
which arose from a search for somequantitative
local-global principle
forelliptic
curvesanalogous
toSiegel’s quantitative
lo-cal-global principle
forquadratic
forms. We were thencolleagues
inCambridge,
so I had aring-side
seat. As I rememberit,
theconjectures
were
gradually
refined from a vague hunchby
the interaction of com-puter experiments
and theoretical considerations. The finalconjecture
connects a
quantity depending
on 01 with the behaviour of an L-function attached to the curve. There was onemissing piece:
a number occurredwhich could not be accounted for. For most of the curves
investigated
this number was 1: it was
always
aninteger, though
there was no obvi-ous reason
why
this shouldhappen,
and indeed it wasalways
a square.The
skew-symmetric
form on III ensures that itsorder,
iffinite,
is asquare, so a natural guess was that this was the unidentified number.
This was then
supported by
other evidence.The
Birch-Swinnerton-Dyer conjectures
have set theagenda
formuch that has
happened
since.They
have beenwidely generalized
andtested,
butonly recently
areproofs beginning
to emerge.One check on the truth of these
conjectures
is thatthey
canpredict
the existence of
points
on curves which are notimmediately apparent.
Andrew Bremner observed that for
prime
p theBirch-Swinnerton-Dy-
er
conjectures imply
the existence of agenerator
of infinite order onthe curves
in addition to the
point (0, 0)
of order2,
and he devised atechnique
tolook for it. I was
roped
in to the search. We lookedat p %
1000. For p == 877 the
generator
is(u/v, rls),
where(Cassels
and Bremner(1984)).
In fact the existence of thesegenerators
is
predicted by
the Selmerconjecture,
butBirch-Swinnerton-Dyer
even
predicts
the size of the numbers(the height
of thegenerator).
Bremner
(1988)
and Bremner and Buell(1993)
have carried the search further. Otherexamples
ofgenerators
ofgreat height
have beengiven
by Zagier.
We now
change
thesubject
to cubic surfacesf (X, Y, Z, T)
=0,
where the cubic form
f
has rational coefficients. Mordellconjectured
that there is a
local-global principle
and there wereproofs
for surfaces of somespecial types.
Forexample,
Selmer showed that theprinciple
holds for
«diagonal»
cubic surfacesprovided
that is a rational cube. HoweverSwinnerton-Dyer
gavean
ingenious example
of a form for which thelocal-global principle
fails.There remained the
possibility
that theprinciple
holds for all«diago-
nal»
surfaces,
whether or notthey satisfy
Selmer’s condition. This seemed to meunlikely.
At that time MikeGuy
was my research stu- dent andalready
hooked oncomputers.
Isuggested
that he look for alikely counter-example.
So heprogrammed
thecomputer
to look for all surfaces(17)
with coefficients up to 50 which(i)
areeverywhere locally
soluble and
(ii)
have no solutions with the variables up to 50. In theprimitive
state ofcomputers
at the time this was no small achievement.There was a snag. The demand for
computer
time washigh
and pro- grams had to be fed indirectly,
no on-line facilities.Consequently
keenusers,
particularly
ifthey
werejunior,
did all their work atnight.
I ama normal sort of
chap
who worksby day. Although
I knew at secondhand that
Guy
had a list ofpotential counter-examples,
I could notget
them. This deadlock
persisted
for some time.Finally,
John HortonConway,
who works bothby day
andby night,
gave me, not the wholelist,
but what hethought
was thesimplest specimen
I took this with me on
holiday. Fortunately
from thispoint
ofview,
theweather was atrocious: I had
plenty
of time tothink,
and was able toprove that there is no
global point (Cassels
andGuy (1966)).
Note that
although
thecomputer
was crucial to thediscovery,
itdoes not affect the
logical
status of the result. This would have been ex-actly
the same if theAngel
Gabriel hadappeared
to me in a dream andsaid
«why
nottry (18)?».
In fact thesimplest counter-example
onGuy’s
list is not
(18)
butwhich involves
only
twoprimes,
while the coefficients of all the other listed forms contain at least three. Bremner(1978)
showedby
a ratherdifferent
argument
that there are noglobal points
on(19).
Manin
(1974)
hasproduced
a rathersophisticated invariant,
which heconjectures
is theonly
obstruction to thelocal-global principle
for cubicsurfaces. For
«diagonal»
surfacesColliot-Th6l6ne, Kanevsky
and San-suc
(1987) give supporting computational
evidence.They
confined at-tention to
diagonal
surfaces because it is difficult to teach even a power- ful modern machine tocompute
Manin’s invariant forgeneral
forms.We conclude with a
problem
of a very differenttype.
LetE
C[Z],
bepolynomials
withcomplex
coefficients.When does
have a factor in
C[X, Y]?
A familiarexample is f =
g, when there is the factor X -Y,
or, moregenerally, f (X)
=H(F(X)), g(Y)
=H(G(Y))
withfactor
F(X) - G(Y). Davenport,
Lewis and Schinzel(1961)
found anoth-er
example.
Letbe the 4th
Chebyshev polynomial.
ThenMore
generally,
takef = T4(F(X)), g(I~ _ -T4(G(~). Davenport,
Lewis and Schinzel were rash
enough
tosuggest
that this exhausts thepossibilities.
It seemed a
good
idea to look at the associated Riemann surfaces.The
equation f (X)
= Zgives
acovering
of the Riemannsphere
ofZ,
and
similarly
for = Z. We can combine thecoverings by looking
atthe
pair (X, Y)
overZ,
whereThen
f (X) -
factorsprecisely
when thecovering (22)
consists ofmore than a
single piece.
This reduces theproblem
to apurely
combi-natorial one about the behaviour at the ramification
points.
The mostpromising
case is whenf,
g have the samedegree n (say).
MikeGuy programmed
this onEDSAC2,
the successor toEDSAC,
and hegot
asfar as n = 12 before EDSAC2 was killed: its memory was
required
forthe
newly-acquired
TITAN(1°),
which was much morepowerful
butwhich,
ashappened
in thosedays, required programming
in anentirely
different way.
Guy
found new solutions of the combinatorialproblem
for n =7,
11.(10)
TITAN was a cut-down and modified ATLAS, a computer much ahead of its time. It is atragedy
of British post-war economicpolicy
that resources wererefused to
develop
it, but lavished on suchclearly
uneconomicprestige projects
as Concorde.
This left the
question
offinding
thecorresponding polynomials f
and g.We were still
pondering
how to do this whenBryan
Birch visited Cam-bridge. Guy
told him theproblem
over lunch andBryan
then and there found thepolynomials
for n = 7. Over dinner he found those for n = 11.The
polynomials
for n = 7 contain aparameter
t. Withput
Then
where
The formulae for n = 11 are
similar,
but morecomplicated
and withouta
parameter.
I talked about this at the 15th Scandinavian
Congress (Cassels (1968)),
where I learned thatTverberg
had found the 7 factorization from a differentpoint
of view.Subseqently
it has been shown that the classification of thesimple
finite groups(!) implies
that there are essen-tially only finitely
many further such curiousfactorizations,
andthey
have all been
determined,
at least assphere coverings (Feit (1980)).
I learned
only recently
that Birch hadalready
beenconsidering
cov-erings
of the Riemannsphere
in a different context.They
are also thestarting point
of anunpublished
research program of Grothendieck which has attracted much attentionrecently
and which was the themeof a recent conference at
Luminy. [See:
The Grothendiecktheory of
dessins
d’enfants (ed.
L.Schneps),
LMS LectureNotes,
200(1994).]
I
hope
that I have now saidenough
to convince you of my thesis that thetheory
of numbers is anexperimental subject
and thatnowadays
the sensible way to
experiment
isusually
on acomputer.
REFERENCES
BIRCH, B.J. - H.P.F. SWINNERTON-DYER, Notes on
elliptic
curves, I, II, J.Reine
Angew.
Math., 212 (1965), pp. 7-25; 218 (1965), pp. 79-108.BREMNER,
A.,
Some cubicsurfaces
with no rationalpoints,
Math. Proc. Cam-bridge
Philos. Soc., 84 (1978), pp. 219-223.BREMNER, A., On the
equation
Y2 =X(X2
+p),
in NumberTheory
and itsAppli-
cations (ed. R. A. MOLLIN), pp. 3-22, NATO ASI Series, 265C (1988) (Kluwer).
BREMNER, A. - D. A. BUELL, Three
points of
greatheight
onelliptic
curves, Math.Comp.,
61 (1993), pp. 111-115.BREMNER, A. - J. W. S. CASSELS, On the
equation
Y2 =X(X2
+p),
Math.Comp.,
42 (1984), pp. 257-264.BUCHSTAB, A. A, Sur la
décomposition
des nombrespairs
en somme de deux composants dont chacune estformée
d’un nombre borné defacteurs
pre-miers,
Comptes
Rendus(Doklady)
Acad. Sci. URSS, 29 (1940), pp. 544-548.CASSELS, J. W.
S.,
Arithmetic on curvesof
genus 1, IV Proof of theHauptver-
mutung, J. ReineAngew.
Math., 211 (1962), pp. 95-112.CASSELS, J. W.
S.,
Mordell’sfinite
basis theoremrevisited,
Math. Proc. Cam-bridge
Philos. Soc., 100 (1986), pp. 31-41.CASSELS, J. W. S., Lectures on
elliptic
curves. London MathematicalSociety
Student Texts, 24 (1991) (corrected
reprint,
1992).CASSELS, J. W. S., Factorization
of polynomials
in several variables, Proceed- ings of the 15th ScandinavianCongress,
Oslo 1968, pp. 3-17, Lecture Notes inMathematics, 118
(Springer).
CASSELS, J. W. S. - M. J. T. GUY, On the Hasse
principle for
cubicsurfaces,
Mathematika, 13 (1966), pp. 111-120.COLLIOT-THELENE,
J.-L. - D. KANEVSKY - J.-J. SANSUC,Arithmétique
dessurfaces cubiques diagonales,
Lecture Notes in Mathematics, 1290 (1987),pp. 1-108.
DAVENPORT, H. - D. J. LEWIS - A. SCHINZEL,
Equations of the form f (x)
=g(y), Quarterly
Journal (Oxford), 12 (1961), pp. 304-312.FEIT, W., Some consequences
of
theclassification of finite simple
groups, Proc.Symposia
Pure Math., 27 (1980), pp. 175-181. (American Math. Soc.) HEATH-BROWN, D. R. - S. J. PATTERSON, The distributionof
Kummer sums atprime
arguments, J. ReineAngew.
Math., 310 (1979), pp. 111-130.MANIN, YU. V., Cubic Forms:
Algebra, Geometry,
Arithmetic, Translated from the Russianby
M. HAZEWINKEL, North-Holland (1974).MILNE, J. S., Arithmetic
Duality
Theorems,Perspectives
in Mathematics, 1 (1986) (Academic Press).SELMER, E. S., A
conjecture concerning
rationalpoints
on cubic curves, Math.Scand., 3 (1954), pp. 49-54.
TATE, J.,
Duality
theorems in Galoiscohomology
over numberfields,
Proc. In-tern
Congress
Math., Stockholm (1962), pp. 234-241.VON NEUMANN, J. - H. H. GOLDSTINE, A numerical
study of
aconjecture of
Kummer, Math. Tables AidsComp.,
7 (1953), pp. 133-134.Manoscritto pervenuto in redazione il 18 novembre 1993.