J
OURNAL DE
T
HÉORIE DES
N
OMBRES DE
B
ORDEAUX
J
AMES
W. C
OGDELL
On sums of three squares
Journal de Théorie des Nombres de Bordeaux, tome
15, n
o1 (2003),
p. 33-44
<http://www.numdam.org/item?id=JTNB_2003__15_1_33_0>
© Université Bordeaux 1, 2003, tous droits réservés.
L’accès aux archives de la revue « Journal de Théorie des Nombres de Bordeaux » (http://jtnb.cedram.org/) implique l’accord avec les condi-tions générales d’utilisation (http://www.numdam.org/conditions). Toute uti-lisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte-nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
On
sumsof three
squares
par JAMES W. COGDELL
RÉSUMÉ. Nous nous intéressons à la
question
de savoirquand
un entier d’un corps de nombres totalement réel est somme de trois carrés d’entiers du corps, et
plus
généralement,
s’il peut êtrereprésenté
par une formequadratique
ternaire définiepositive
etentière sur le corps. Dans un travail récent avec
Piatetski-Shapiro
et
Sarnak,
nous avons montré que tout entier totalementpositif
etsans facteur carré assez
grand
possède
unereprésentation
intégrale
globale
si et seulement s’il en est de même localement partout, résolvant ainsi pour l’essentiel le dernier cas ouvert du onzièmeproblème
de Hilbert. Dans cetarticle,
nous exposons les idées dela démonstration de ce résultat.
ABSTRACT. We address the
question
of when aninteger
in ato-tally
real number field can be written as the sum of threesquared
integers
from the field and moregenerally
whether it can berepre-sented
by
apositive
definiteintegral
ternaryquadratic
form overthe field. In recent work with
Piatetski-Shapiro
and Sarnak wehave shown that every
sufficiently large
totally positive
square freeinteger
isglobally integrally
represented
if andonly
if it isso
locally
at allplaces,
thusessentially resolving
theremaining
open case of Hilbert’s eleventhproblem.
In this paper wegive
anexposition
of the ideas in theproof
of this result.The
question
of when aninteger
isrepresentable
as a sum of squares has along
venerablehistory.
Moregenerally,
Hilbert’s eleventhproblem
asks(among
otherthings)
whichintegers
areintegrally represented by
agiven
quadratic
form over a number field. The case ofbinary quadratic
formsis
equivalent
to thetheory
of relativequadratic
extensions and their classgroups and class fields as
developed by
Hilbert. For forms in four or morevariables the situation is
quite
different and has been understood for sometime. The case of three variables has remained open.
Manuscrit requ le 19 novembre 2001.
The author was supported in part by the NSA. While at the Institute for Advanced Study,
where the original stages of the work was performed, the author was supported in part by a
grant from the Ellentuck Fund and in part the research was conducted for the Clay Mathematics Institute.
The
problem
of whichintegers
in a number field k arerepresented by
the genus of a
quadratic
form iscompletely
answeredby
Siegel’s
massformula,
whichgives
the number of solutions in terms ofproducts
of local densities[23].
If there isonly
one class in the genus ofq(x),
then this answers thequestion
ofrepresentability
for q. If n > 4and q
is indefiniteat some archimedean
place
of k thenSiegel
showedby
analytic
methodsthat a number is
represented by
one form in a genus iff it isrepresented by
all forms in the genus, thus
reducing
theglobal representability question
to local ones
[24].
These results were recovered and extended to indefiniteternary
forms viaalgebraic
techniques by
Kneser[12]
and Hsia[7]
utilizing
spinor
genera. So we will restrict ourselves to the case ofpositive
definiteintegral
forms over atotally
real field k. If the number of variables isat least
five,
one canproceed
eitherby analytic
methods,
using
boundstowards
Ramanujan
for Hilbert modularforms,
oralgebraic
methods[8]
to prove that there is an effective constant
Cq
such that if a thering
ofintegers
ofk,
istotally positive
and its normN(a)
>Cq
then a isrepresented by
q iff it isrepresented
by
q for everycompletion kv
of k. When the number of variables is four one must add aprimitivity
condition on therepresentations,
bothlocally
andglobally
[8].
Recently,
in collaboration with I. I.Piatetski-Shapiro
and P.Sarnak,
we have established ananalogue
of this result in the case of
positive
definiteintegral
ternary
quadratic
formsvia
analytic
methods[3].
Theorem. Let k be cc
totally
real numberfield
and letq(x)
be apositive
definite integrals
ternary
quadratic
form
over k. Then there is anineffective
constant
Cq
such thatif a
is atotally positive
squarefree integer of
k withN(a)
>Cq
then a isrepresented integrally by q
locally integrally
represented
over everycompletions kv of
k.The result was
previously
known for k= ~ by
Duke and Schulze-Pillot[6, 18].
Of
special
interest is the case of theternary
form+ ~2 +
x3
giving
the result on sums of squares in a number field alluded to in the title.Corollary.
Let k be atotally
real numberfield.
Then there is anineffec-tive constant
Ck
such that everytotally positive
squarefree integer
a withN(a)
>Ck
is the sumof
threeintegrals
squaresiff
it is the sumof
threeintegral
squareslocally for
eachplace
vof
k.Of course
over Q Legendre
hasgiven
us theprecise
answer: a E Z is thesum of three squares iff a is not of the form
4’~(8m
+7).
Over a numberfield, partial
results have been obtainedby algebraic
methodsby
Donkarsuch a can be
represented
but do notgive
the local toglobal
result wepresent
here.In this note I would like to describe our
proof
of this theorem in thesimplified
case of ktotally
real of class number one.This paper is an
expanded
version of the lecture Ipresented
at the XXII JournéesArithmétiques
2001 in Lille. I would like to thank theorganizers
of JA 2001 for theopportunity
tospeak
on thistopic.
I would also liketo thank my
collaborators,
I. I.Piatetski-Shapiro
and P.Sarnak,
forallow-ing
me topresent
this summary of our work here. Inparticular,
I thankP. Sarnak for
reading
andproviding
critical comments on an earlier versionof this note.
1. Theta series
We take k to be a
totally
real number fieldhaving
class number one. Letd =
(k : Q)
be thedegree
of 1~ overQ.
We let 0 be thering
ofintegers
of k. Let V be a vector space of dimension three over k
equipped
with apositive
definiteintegral
quadratic
formq (x) .
We will let L =C73
denotethe
integral
lattice in V. Soq(x)
E 0 for every x E L.The
proof
we willgive
isanalytic
in nature. Hence webegin
with thetheta series associated to
q(x)
andL, Siegel’s analytic
class invariant[23],
where T . This is a Hilbert modular form of
weight
3/2
for anappro-priate
congruencesubgroup
r c8L2(O).
Its Fourierexpansion
isgiven
by
where E
Llq(x)
=all
[
is therepresentation
number of aby
L.
There are two related theta series. Let
Spn(L)
denote thespinor
genusof L and
Gen(L)
the genus of L[18, 19]. (Whether
we fix the lattice andwork with the genus of the form or fix the form and work with the genus
of the lattice is all the same. Here we fix the
form q
and vary the lattice into be the
weighted
average over the genus ofL,
Siegel’s
analytic
genusinvariant
[23],
whereo(M)
is the order of the group of units ofM,
andsimilarly
forSpn(L)).
By
the work ofSiegel
[23]
we know that the coefficientsrq(T,
Gen(L))
aregiven by
aproduct
of local densities and is non-zero iff a islocally
represented
integrally
by
(q, Lv)
for allcompletions.
So we need to be ableto relate
rq(a, L)
andrq(a,
Gen (L)).
By
classical results ofSiegel,
analgebraic proof
of which can be foundin
Walling
[27],
we know that bothand
are cusp forms of
weight
3/2.
Now there are twotypes
of cusp forms ofweight
3/2.
Recall that we have the Shimuracorrespondence
between cuspforms of
weight
3/2
for F and modular forms ofweight
2 for anappropriate
subgroup
r’ cPGL2(O).
If we denoteby
53~2(r)
the cusp forms associatedto theta series attached to one dimensional
quadratic
forms and itsorthogonal
complement
withrespect
to the Petersson innerproduct
then53~2(T)
consists ofprecisely
those cusp forms that lift to cusp forms in52 (F’)
[25].
Following
the work of Schulze-Pillot(see
[19, 20]
for relatedresults)
we can conclude thatand
Fortunately
the Fourier coefficients of the forms in(r)
arequite
sparse.In
fact,
it is known(see
[7, 12])
that outside of anexplicitly
computable
finite number of
"genus exceptional"
square classes we haveSo for square free a there are
only
a finite number of such genusexceptions
which we can avoid
by
taking
~xsufficiently
large.
Hence for all butfinitely
many square free cx we have
From the
computation
of the local densities(see
[19])
one can concludethat for a
locally represented
a one has an ineffective lower boundwith the
implied
constantindependent
of a. Theineffectivity
comes froman
application
of theBrauer-Siegel
theorem and is the source of theinef-fectivity
of our theorem. Thus our result will follow if we canproduce
anestimate for the Fourier coefficients
a(a)
ofhalf-integral weight
forms inSg/2
(r)
of the formfor some fixed 6 > 0. This program was carried out in the case of k
= Q
by
Duke and Schulze-Pillot[6]
with akey ingredient being
estimates onthe Fourier coefficients of
half-integral weight
forms due to Iwaniec[9]
and Duke[5].
2. Fourier coefficients and L-functions
By
now, a common way to estimate Fourier coefficients of modular formsof half
integral weight
is toappeal
toWaldspurger’s
formula[26]
relating
these coefficients to central values of the L-functions of the Shimura lift.Waldspurger
established this relationonly
for k= Q
butrecently
hisre-sult has been
generalized
tototally
real fieldsby
Shimura[22]
using
the Shimuracorrespondence
andby
Baruch and Maousing Jacquet’s
relativetrace formula
[I].
We will use Baruch and Mao’s version of this relation.Note that it suffices to prove our estimate for Hecke
eigenforms
sincethere is
always
a basiscoming
from such. Let if denote thecuspidal
repre-sentation of
SL2 (A)
generated by
ourf
E Fix an additivecharac-ter 1f;
ofkBA
such that 1f =8(if,1/J)
is the Shimura lift of 7r to acuspidal
representation
ofPGL2(A).
If a is a square freeinteger
of O and we letthen we know
[25]
where Xa is the
quadratic
character associated to the extensionClassically
xa is a ray class character mod(a).
Let cp
be the new form onS5d
associated to the new vector in the space of 7r and leta(a)
denote itsa-Fourier coefficient.
Let S denote the finite set of
places
of k where 7r,7i-,
or 1/J
is ramified.Let
Soo
denote the archimedeanplaces
of k and letSa
denote the set of finiteplaces v
where 0. Letbe the finite
(or classical)
L-function of 7r. Then the formula of Baruch andMao
[1]
can be stated as:where the are certain local constants
given
as ratiosinvolving
localnorms, local Fourier
coefficient,
and local L-values. Similar formulas hadbeen
given
earlierby
Shimura[22].
For v 0
S Baruch and Mao canexplicitly
compute
thec, (a)
and for v ES,
a finite set ofplaces, they
can estimatethem as a varies. As a consequence one
gets
an estimatewith the
implied
constantindependent
of a.Note that the
convexity
bound onL(l,
7r (9 as cx varies(see
[10])
iswhich would
give
This is better than the Hecke
bound,
but not sufficient for our purposes. Wemust beat the
convexity
bound to obtain a first non-trivial estimate towardsRamanujan
for the Fourier coefficients ofhalf-integral
weight
forms over atotally
real number field.3.
Subconvexity
Let 1r be a
cuspidal automorphic representation
ofPGL2 (A)
correspond-ing
to aholomorphic
Hilbert modular form of evenweight
k =(1~, ... , k) .
Let
W(-r)
be the associated new form. Let Xi be any ray class charactermodulo an ideal a. We will let Xl also denote its associated idele class
char-acter. The
key
to ourproof
of the statedtheorem,
and a result of interestin its own
right,
is thefollowing breaking
ofconvexity
for 0Xl) in
the conductoraspect
as X, varies.Theorem. We have
2uith the
implied
constantdepending
on E butindependent of
a.Here
again by
L(s, 7r
(8)Xl)
we mean the classical(or finite)
L-functionIf we
apply
this result to ourprevious
situation with X, = xa then this willcomplete
theproof
of our first Theorem.We would now like to describe the
proof
of thisTheorem,
still in the caseof class number one. We will work with the
L-function
in its additive form. To thisend,
we writewhere
tl+
is the group oftotally positive
units in U = This is relatedto the Fourier
expansion
of cp
where
dK
is the different ofK,
by
=We first use the
approximate
functionalequation
forL(s,
(see
forexample
[13, 17]).
Thisgives
anexpression
of the formas two sums of
length
essentially X,
where we have taken X =N(a).
HereVi
andV2
are functionshaving
Vi(O)
=1,
smooth,
andrapidly
decaying
atinfinity. By using
a smoothdyadic
subdivision it suffices to estimate sumsof the
shape
where now W is smooth of
compact
support say
in the interval(.1, 2),
soconcentrated near 1. There are
approximately
log(N(a))
such sums upto
N(a).
The crucial contribution for us will beagain
when X is of sizeN(a).
Note that the presence of the cutoff function W inJ(xi)
forcesQ -
so that if we setWV(x) = VX W (x)
we have W is stillsmooth with
compact support
in(~,2)
andwhere
Our estimate will
proceed by placing
S(Xl)
into afamily
and thenusing
arithmetic
amplification.
For ageneral
exposition
of thesetechniques
one can refer to the talk of P. Michel at the XXII JourneesArithm6tiques
2001[13].
Toget
anappropriate
family
we need to work with a sum over thefull set of
totally positive integers.
To this end we let F be a smoothedcharacteristic function of a fundamental domain for the action of the
totally
positive
unitsU+
on thehyperboloid
definedby
= 1 ink~
=Rd+
which satisfies
for every x E
k+
withN(x)
= 1. We extend this to alltotally positive u
by
setting
F(u)
= Fu
Then we can writeNote that both W and F act as cutoff functions - W
cutting
off inand F
cutting
off in the"argument"
of p.We now
place
S(Xl)
in afamily.
We do not use thefamily
of allS(x)
where X
runs over all ray class characters mod a, whichmight
seem morenatural but could be too sparse.
Indeed,
theimage
ofU+
in(O/a)"
can belarge
[15].
Instead we use thefamily
of allS(X)
as X runs over all charactersof the group
(0/a)x.
LetC(a)
denote this group of characters. Then wewill consider the average value of in this
family
E
IS(x)12.
Note C(a)that the
length
of the sum isIC(a)l
which isgiven by
thegeneralized
Euler totient function~(a)
and istrivially
boundedby
N(a)
and is at least of size unlike the group of ray class characters.In addition to
averaging
over thisfamily,
we will utilize thetechnique
ofarithmetic
amplification.
To this end we will take anauxiliary
parameter
M which will be of size
X6
with 5 small. Itsprecise
value will be determined in the course of theargument.
We take a setfvl
oftotally positive integers
which should berelatively
prime
to a and all have norm boundedby
M.There should be
roughly
M of them andthey
should bebalanced,
in that for the archimedeanembeddings v each vv
should beroughly
of the samesize. For each v we take a coefficient
c(v)
such thatI
= 1. We thenTo obtain an estimate on our
original
S(Xl)
we will takearithmetically
defined coefficients
c(v)
=Xi 1(v)
thusamplifying
the termIS(XI)12
by
afactor
of M2.
To utilize
A,
weexpand
the norm squares,interchange
the order ofsum-mation,
andperform
the character sum overC(a).
Thisyields
We
split
this sum into twoterms,
thediagonal
D and the offdiagonal
OD,
whereand
The
diagonal
term is estimatedsimply using
theRamanujan
bounds for the known in this caseby
Brylinski
and Labesse[2],
namely
I
and then
analysing
the size of the sums determinedby
the cutofffunctions. These
yield
The off
diagonal
term is moreinteresting.
Let us write it aswhere in
B (v1,
v2, h)
we have resummed over~~
whichonly
has an effect onthe cutoff functions F. The terms v2,
h)
are estimatedusing
severalfor
suitably
large
a where s =(s1, ... , sd)
EcJ
QD(s,
vl, v2,h)
is atype
ofDirichlet series
where we have used a multi-index
notation,
so t-s = where.~1, - .. , .~d
are theimages
of under the dembeddings
of k into R This Dirichlet series carries the arithmetic information inB(vl,
v2,h).
The func-tionH(s)
isessentially
the Mellin transform of the cutoff functions.H (s )
isrelatively
simple
to handle. It isentire,
rapidly decreasing
in and can be estimatedby
where as is common we have written sj =
aj +
itj.
The
interesting
bit of the estimate is in the Dirichlet seriesD(s,
vl, v2,h).
It isessentially
a Dirichlet series formed withproducts
of shifted Fouriercoefficients.
Selberg
has shown how toapproach
such Dirichlet series via Poinear6 series[21].
To thisend,
let us setg(T)
= Thenthere is a Poincax6 series
Ph(T, s)
such that when wecompute
the Petersson innerproduct
of g withPh(s)
we findOne now
expands
this innerproduct spectrally
via Parseval’s formula. If we letfojl
be a suitable orthonormal basis of Maass cusp forms thenwhere
c(s)
is a similarexpression
involving
the continuousspectrum
and is estimated in a similar manner. Sarnak hasdeveloped
ageneral
method forestimating
(g,
CPj)
(see
forexample
[16])
which in this caseyields
where rj =
(rj,l, - - - , rj,d)
is thespectral
parameter
ofOj -
The term(0j, Ph (s))
isexpressed
in terms of the h-Fourier coefficientpj (h)
of~~
and the associated archimedean r-factorsinvolving
thespectral
parameters
r~. We then estimate theseusing
the bounds towardsRamanujan
in both the archimedean and non-archimedeanaspects
due to Kim and Shahidi[11].
The sum is then controlled
using Weyl’s
law. A similartype
of estimate can be found[14]
and in theappendix
of[17].
In the final
analysis,
these estimatesgive
that the Dirichlet series has ananalytic
continuation to the domainRe(sj)
> 2 + 1
and in thisregion
satisfieswhere
the hj
are theimages
of h under the dembeddings
of k into IR. Notethat
the )
in theboundary
of the domain of continuation comes from thearchimedean estimates towards
Ramanujan
whilethe )
in theexponent
ofN (h)
is from the non-archimedean bound towardsRamanujan
of Kim and Shahidi.Returning
now to ourexpression
ofB(V1, V2, h)
in terms of the inverse Mellintransform,
we can now shift the lines ofintegration
toRe(sj) =
1/2
which in turn results in
When we combine D and OD and choose M =
X6
togive
them thesame order of
growth
in X we find that M =X7~65.
Nowtaking
X =N(a)
to
get
the dominant term from ourpartition
weget
an estimate of ouramplified
sumWe now take
c(v)
= toamplify
the term we are interested in.Then
estimating
this one termby
the entire sum we findor
which
finally gives
as desired.
Note that to our
knowledge
this estimate is better than the current best bounds even in the case k =Q.
References °
[1] M. BARUCH, Z. MAO, Central value of automorphic L-functions, preprint, 2000.
[2] J.-L. BRYLINKSKI, J.-P. LABESSE, Cohomologie d’intersection et fonctions L de certaines variétés de Shimura. Ann. Sci. École Norm. Sup.(4) 17 (1984), 361-412.
[3] J. W. COGDELL, I. I. PIATETSKI-SHAPIRO, P. SARNAK, Estimates on the critical line for Hilbert modular L-functions and applications, in preparation.
[4] E. N. DONKAR, On sums of three integral squares in algebraic number fields. Amer. J. Math., 99 (1977), 1297-1328.
[5] W. DUKE, Hyperbolic distribution problems and half-integral weight Maass forms. Invent.
Math., 92 (1988), 73-90.
[6]
W. DUKE, R. SCHULZE-PILLOT, Representations of integers by positive ternary quadraticforms and equidistribution of lattice points on ellipsoids. Invent. Math. 89 (1990), 49-57.
[7] J. S. HSIA, Representations by spinor genera. Pacific J. Math. 63 (1976), 147-152.
[8] J. S. HSIA, Y. KITAOKA, M. KNESER, Representations of positive definite quadratic forms. J. Reine Angew. Math. 301 (1978), 132-141.
[9] H. IWANIEC, Fourier coefficients of modular forms of half integral weight. Invent. Math. 87
(1987), 385-401.
[10] H. IWANIEC, P. SARNAK, Perspectives on the analytic theory of L-functions. Geom. Funct. Anal. (GAFA), Special Volume, Part II (2000), 705-741.
[11] H. KIM, F. SHAHIDI,
Cuspidality
of symmetric powers of GL(2) with applications. Duke Math. J. 112 (2002), 177-197.[12] M. KNESER, Darstellungsmasse indefiniter quadratischer Formen. Math. Zeit. 77 (1961), 188-194.
[13] P. MICHEL, Familles de fonctions L de formes automorphes et applications. J. Théor. Nom-bres Bordeaux 15 (2003), 275-307.
[14] Y. PETRIDIS, P. SARNAK, Quantum unique ergodicity for SL2(O)BH3 and estimates for L-functions. J. Evol. Equ. 1 (2001), 277-290.
[15] D. ROHRLICH, Non-vanishing of L-functions for GL(2). Invent. Math. 97 (1989), 381-403.
[16] P. SARNAK, Integrals of products of eigenfunctions. Internat. Math. Res. Notices (1994) 251-260.
[17] P. SARNAK, Estimates for Rankin-Selberg L-functions and quantum unique ergodicity. J. Funct. Anal 184 (2001), 419-453.
[18] R. SCHULZE-PILLOT, Thetareihen positiv definiter quadratischer Formen. Invent. Math. 75
(1984), 283-299.
[19] R. SCHULZE-PILLOT, Darstellungsmasse von Spinorgeschlechtern ternärer quadratischer
Formen. J. Reine Angew. Math. 352 (1984), 114-132.
[20] R. SCHULZE-PILLOT, Ternary quadratic forms and Brandt matrices. Nagoya Math. J. 102
(1986), 117-126.
[21] A. SELBERG, On the estimation of Fourier coefficients of modular forms. Proc. Sympos.
Pure Math., Vol. VIII, pp. 1-15 Amer. Math. Soc., Providence, R.I., 1965.
[22] G. SHIMURA, On the Fourier coefficients of Hilbert modular forms of half-integral weight.
Duke Math. J. 72 (1993), 501-557.
[23] C. L. SIEGEL, Über die analytische Theorie der quadratischer Formen I. Ann. of Math. 36
(1935), 527-606; II, 37 (1936), 230-263; III, 38 (1937) 212-291.
[24] C. L. SIEGEL, Indefinite quadratische Formen und Funktionentheorie I. Math. Ann. 124
(1951) 17-54; II, 366-387.
[25] J.-L. WALDSPURGER, Correspondance de Shimura. J. Math. Pures Appl. 59 (1980), 1-113.
[26]
J.-L. WALDSPURGER, Sur les coefficients de Fourier des formes modulaires de poids demi-entier. J. Math. Pures Appl. 60 (1981), 375-484 .[27] L. WALLING, A remark on differences of theta series. J. Number Theory 48 (1994), 243-251. James W. COGDELL
Department of Mathematics Oklahoma State University Stillwater OK 74078 USA