A NNALI DELLA
S CUOLA N ORMALE S UPERIORE DI P ISA Classe di Scienze
E. B OMBIERI
A. J. V AN DER P OORTEN
J. D. V AALER
Effective measures of irrationality for cubic extensions of number fields
Annali della Scuola Normale Superiore di Pisa, Classe di Scienze 4
esérie, tome 23, n
o2 (1996), p. 211-248
<http://www.numdam.org/item?id=ASNSP_1996_4_23_2_211_0>
© Scuola Normale Superiore, Pisa, 1996, tous droits réservés.
L’accès aux archives de la revue « Annali della Scuola Normale Superiore di Pisa, Classe di Scienze » (http://www.sns.it/it/edizioni/riviste/annaliscienze/) implique l’accord avec les conditions générales d’utilisation (http://www.numdam.org/conditions). Toute utilisa- tion commerciale ou impression systématique est constitutive d’une infraction pénale.
Toute copie ou impression de ce fichier doit contenir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
Extensions of Number Fields
E. BOMBIERI * - A. J. VAN DER POORTEN ** - J. D. VAALER ***
1. - Introduction
A basic
problem
ofDiophantine approximation
is togive
effective measuresof
irrationality
foralgebraic
numbers. We formulate thisproblem
as follows.Let k be an
algebraic
numberfield, v
aplace
ofk, and ||v
the associatedabsolute value on the
completion kv.
Here wenormalize I I v exactly
as in[5], [7]
or[8].
Let K be a finite extension of khaving degree
r > 2. We assumethat K has at least one
embedding
in thecompletion kv
and then weidentify
K with a fixed
embedding
of K inkv. If f3 belongs
to k we writeH (~C3)
forthe absolute
height
of thecorresponding projective point /3
=[~]
inI~1 (k).
We warn the reader here that this
height H ( )
differs from the Weilheight HweiO )
because we use the rather than the~-norm,
for theplaces
at oo. In any case, we have
so that the two
heights
arecomparable. However,
the use of theprojective height H ( )
leads to neater andsharper
results and suggests itselfnaturally
inthis context.
If a
belongs
tokv
but not to k then Dirichlet’s theorem asserts the existence of apositive
constantC1
1depending
on a andinfinitely
many distinctfl
in ksuch that
By
a measure ofirrationality
for a with respect to v and k we understand apositive
number f1 for which aninequality
of thetype
* Research of the first author was supported by Macquarie University.
** Research of the second author was supported partially by grants from the Australian Research Council and Macquarie University, and a research agreement with Digital Equipment Corporation.
*** Research of the third author was supported by a grant from the National Science Foundation, DMS-8701396.
Pervenuto alla Redazione il 26 giugno 1995.
°
is valid for
all f3
in k withH (,C3) > C3.
HereC2
andC3
arepositive
constantsindependent
off3.
If the constantsC2
andC3
can becomputed
fromknowledge
of a and f1 then the measure of
irrationality
is said to be effective. In view of(1.1)
any measure ofirrationality It
mustsatisfy
/,t > 2.If a
belongs
to K but not to k then the fundamental theorem of Roth states that 2 + ~ is a measure ofirrationality
for every 8 > 0.Unfortunately,
all known
proofs
of Roth’s theorem result in constantsC2
andC3
which arenot
effectively computable. Alternatively,
Roth’s theorem asserts thatOur
knowledge
of effective measures ofirrationality
toralgebraic
numbersis still rather limited. To
begin
with there is thesimple
butimportant
Liouvillebound
which is
easily
obtained from theproduct
formula. This shows that r is aneffective measure of
irrationality
for any a in K but not in k.There are
essentially
three methods forobtaining improvements
in the Li- ouville bound in which all constants can beeffectively
determined.Historically
the first of these is in the work of Baker
[1], [2], although
a forerunner of it isimplicit
in theearly
work of Thue[17].
This method is based on theexplicit
construction of Pade
approximations
toalgebraic
functions. At present it pro- videsgood
effective measures ofirrationality
forspecial numbers,
inparticular
for certain cubic irrationalities and for r-th roots of certain rational numbers.
See
Chudnovsky [10].
The second method is based on Baker’s
theory
of linear forms inlogarithms
of
algebraic
numbers. The firstgeneral improvement
on Liouville type boundswas obtained
by
Baker[3].
This had farreaching
consequences,allowing
forexample
the effective solution of thegeneral
Thueequation F(x, y)
= m, whereF (x , y )
EZ[x, y] is
abinary
form with at least three distinct linear factors inC[x, y].
In the form obtained
by
Feldman[ 11 ],
the methodyields
an effectivemeasure of
irrationality
for a when r > 3 which isslightly
smaller than r.Although
theimprovement
on the Liouville bound isusually
verysmall,
thismethod
applies
to allalgebraic
numbers a. Forspecial
numbers such as cuberoots of
positive integers,
veryexplicit
bounds have beengiven recently by
Baker and Stewart
[4].
The third method has been
developed by
Bombieri[5] ]
andby
Bombieriand Mueller
[7].
Thisapproach
combines elements of theoriginal
non-effective methods of Thue andSiegel
with animportant
result ofDyson
forestablishing
the
non-vanishing
of theauxiliary polynomial
in two variables. This method isnot unrelated to the Pade
approximation method,
and it may be considered as atwo-variable Pade
approximation
method in which theapproximants
do notquite
reach the maximum order of
approximation
one can achieve. In some favorablecases, this method leads to the construction of
triples (K, k, v)
for which aneffective measure of
irrationality
much smaller than r can be established. Animportant
feature of the method is that the measure ofirrationality
which isobtained holds for all a in K but not in k.
We define to be the
greatest
lower bound for effective measures ofirrationality
of a, relative to the absolute value v.In the present paper we formulate and prove a new
equivariant
version ofthe Thue-Siegel principle
whichapplies only
tospecial
numbers in the Galois closure ofKlk.
In order to define thesespecial
numbers it is necessary for the Galois group to have afaithful, projective representation
inPGL(2, k).
Westudy
here in detail therepresentation
of thesymmetric
groupS3
inPGL(2, Q) generated by
the classical substitutions z 1-+ 1 - z and z HI/z.
If isa cubic extension we obtain
by
our methods verygood
effective measures ofirrationality
whichapproach
thelimiting
value 2 in favorable cases.Suppose,
for
example,
that m is alarge positive integer and K C R
is the cubic extensionof Q
formedby adjoining
theunique
real root ofx 3
+ mx + 1. Then we obtainan effective measure of
irrationality ttv(a) 2 +10 (log m ) -1 J3
for all generatorsa of
K/Q.
Another
interesting application
of the newequivariant
form of the Thue-Siegel principle
in this paper has been worked out in detail in Bombieri[6].
Thegroup in
question
now is thecyclic
group of order r,acting by multiplication by
r-th roots of
unity.
The results obtained there aresufficiently
strong toimply, by
known reductionsgoing
back toSiegel
andBaker,
a newproof
of theBaker-Feldman
improvement
of Liouville’sinequality,
valid for everyalgebraic
number.
The method used in the present work
originates
in the paper of Bombieri[5]
and continues to useDyson’s
Lemma in an essential way. We also introducesome new technical devices. Let h’ be a cubic extension of
k, identify K
withan
embedding
of K inkv,
and then fix anembedding
of the Galois closure L ofKlk
inkv.
We determine afaithful, projective representation
a -P a
ofG = into
PGL(2, Q)
and then define a subsetA(P)
cP~(L) by
Our next step is to consider how well the
point ° 1
in can be approx-imated
by points A
inA(P).
For this purpose we introduce a suitable metric8,
on which is inducedby 11,
onkv.
In Theorem 4 we show that[~]
cannot have two excellent
approximations ÀI
1 andA2
inA ( P )
for whichand
H(A2)
are bothlarge.
This is our new form of theThue-Siegel principle
for
special points.
It isexpressed
as aninequality
in which all constants aregiven explicitly
and we refer to it now as theThue-Siegel inequality.
Theproof
of Theorem 4 is
given
in Sections 4through
8.During
the course of theproof
we take
advantage
of the fact that the action of G onpoints
in isstrictly
controlled.
It remains then to demonstrate how effective measures of
irrationality
canbe obtained from Theorem 4. It turns out to be very convenient to reformulate the basic
problem
of lower boundsfor I a -
in terms of theprojective
metric
3,.
Thus we let a =f"1
inP~(~), j3
= in andprovide
lower bounds for
3, (a, 3).
Since3v (m, ,Ci) ~a -
our lower boundsapply automatically
toIn Theorem 5 we show that if
a
I inA ( P )
is an excellentapproximation
to
[ Oil
1 thenis an effective measure of
irrationality
for each generator a of the cubic extension Of courseÀ
1qualifies
as an excellentapproximation
to° 1 precisely
when 3.
In Section 9 we assume that
f (x) = x3
+ px + q,p # 0,
is irreducible ink[x]
and that the smallest root off
inkv
generatesK / k .
We use theroots of
f
to construct apoint a
I inA(P).
Then we estimate the measureof
irrationality
in terms of p and q. This allows us to construct cubic extensions of k which have an effective measure ofirrationality
close to 2by making
a suitable choice of p and q. Aprecise
upper bound for withexplicit
constants isgiven
in Theorem 17.Suppose,
forexample,
that I~ isgenerated by
theunique
real root ofwhere p > 0 and q
arerelatively prime integers.
We get a non-trivial effective measure ofirrationality
for each generator of as soon as p > for a suitable effective
C(s)
and any E > 0. The measure ofirrationality
isIn our
proof
of theThue-Siegel inequality
it is necessary to construct anauxiliary polynomial
withprescribed vanishing
and small coefficients. We note that thisstep
has now been disconnected from any conditionsinvolving
irrationalalgebraic
numbers. Werequire
that thepolynomial
bebihomogeneous
withinteger coefficients,
and that it vanish tohigh
order at thepoints ([~],
,[~D,
, ,
and (
In this respect, our
auxiliary polynomials
are universal for theproblem
of effective
approximation
of cubic irrationals. Here we use asimple
form ofSiegel’s
lemma to construct them. Weremark, however,
that a direct construction may lead to a much betterunderstanding
of these universalpolynomials
andthereby drastically improve
andmodify
theapplication
of our method.2. -
Inequalities
for theprojective
metricLet k be an
algebraic
numberfield, v
aplace
of kand kv
thecompletion
of k with respect to v. We will use two absolute
values I I v
fromv which are determined
exactly
as in ourprevious
papers, such as[5], [7]
or[8].
Inparticular
we have = for all x inkv,
where d =[k : Q]
and dv = [kv : Qv]’
These absolute values haveunique
extensions toQv,
thecompletion
of analgebraic
closure ofkv.
We extend to a norm on finite dimensional vector spaces overQv
as follows. Ifis a column vector in
QN
we setand Ixlv
= in both cases. Let ei,e2,... eN denote the standard basisvectors in
S2v .
For each subset I c{ 1, 2,..., N }
let ei be thecorresponding
standard basis vector in the exterior
algebra
We recall that this is agraded algebra
in which each
subspace
has dimension(~)
and basis vectors(ej :
I I I = n } .
As usual weidentify 52,~
with thesubspace
so thatwhenever I =
f i
ii 2
...{ 1, 2, ... , N }
is not empty.Then I I v
extends to
by applying (2.1 )
to the basisle, :
I c{ 1, 2,..., N } } .
We
identify
elements of with M x N matrices overQv.
Then we extend
I I v
to such matrices Aby setting
If v t
oo and A = we find thatIf
v oo
let A* denote thecomplex conjugate
transpose of A and letdenote the
eigenvalues
of thepositive
semi-definite matrix A * A . It follows thatWe define a map
by 17v(A) = Obviously qv(A)
for 0 inQv
andtherefore 17v is well defined as a map
We note that = 1 if and
only
if theidentity
= holds forall x in In the
special
case N = 2 the identities(2.2)
and(2.3)
can beused to
verify
the formulaWe define a second map
by
Since
Dv(ax, by) = y)
for all 0 and 0 inSZv,
it follows thatDv
iswell defined as a map
It can be shown that
3v
is a metric on and the induced metrictopology
coincides with the
quotient topology
determinedby
thenorm 11,
onQN.
Thisobservation,
but with a different definition for8v,
is due to Neron[12].
The fact that our definition isequivalent
to N6ron’s was establishedby Rumely [15].
If N = 2
and v ~ oo
then C andy)
is the cordal distance betweenpoints
x and y on the Riemannsphere P~(C)
normalized so that the diameterof the
sphere
is 1. ..In case N = 2 the maps i7v and
3v satisfy
the basicinequality
for all A in
PGL(2, Qv)
and x, y in Toverify
theinequality
on theleft of
(2.6)
we note thatThen the
inequality
on theright
of(2.6)
follows sincer~v (A-1 )
=17v(A).
Thisresult continues to be true if N ? 3, as shown
by
K.-K. Choi[9];
aproof
of theweaker
inequality
with inplace
of1 (A)
can be found in S.Tyler [ 18] .
If
j3
occurs inkN
we define itsheight by
where the
product
extends over allplaces v
of k.By
theproduct
formula thisheight
is well defined onWe mention
here,
as an illustration of what we have done sofar,
twotheorems which show how classical results can be formulated in our
setting.
If a
belongs
topN-1 (kv)
for someplace
v of k then we may ask how well a can beapproximated by points j3
inIP’N -1 (k)
withH(,C3)
boundedby
asuitable parameter. In order to state a
reasonably precise
result letwhere
Ok
is the discriminant of k andThen we have the
following projective
form of Dirichlet’s theorem.THEOREM 1.
Let cx belong
toJl1’N-I (kv),
Tbelong
tokv
and assume that 1I r Iv.
Then there exists
,C3
in(k)
such thatIf cx
belongs
toJIDN-1 (kv)
but not toP-1 (k)
then(i)
and(ii)
can becombined to establish the existence of
infinitely
many distinct0
insuch that
This formulation of Dirichlet’s theorem is similar to, but
slightly
moreprecise than,
that obtainedby
Schmidt[16],
Theorem 1. Aproof
has beengiven by
S.
Tyler [18],
Theorem 3.2.Next we consider lower bounds for the
projective
distance8v (a, j3)
and indoing
so we restrict our attention to theprojective
lineSuppose
thenthat
in
homogeneous
coordinates with aalgebraic
over k ofdegree
r > 2. As K =k (a )
is embedded inkv
there exists aplace
w of K with16 v, [ I~w : kv]
=1, and =
It follows thatfor all x and y in =
Pl (k,).
Ifj3
occurs in c thenwhere the
product
on theright
of(2.11)
extends over allplaces w
of K. Sincej3 ~
a we conclude thatThis is the Liouville lower bound for the
projective
metric. If r = 2 then(2.10)
with N = 2 shows that this bound is
essentially sharp.
3. - The
Thue-Siegel inequality
We assume
throughout
this section that a inkv
isalgebraic
over kof degree 3,
that K =k(a) C kv,
and that a’ and a" areconjugates
of a inkv.
Let L =
k(a, a’, a") c kv
denote the Galois closure of the extensionWe write , ,,
for the
corresponding points
inP~(L).
For each a in G =Gal(L/ k)
thereexists a
unique
elementPGL(2, L)
such thatClearly
a -Q,
is afaithful, projective representation
of G inPGL(2, L).
Also, there exists a
unique
element tP inPGL(2, L)
such thatIt follows that the
conjugate representation
Q’ -Pa acts
to permute the elements of theset { [ ~ ] , [ 1 ] ’ [ o ] } ~
If G isnoncyclic
then{ Pa :
Q EG) =!9,
whereis exactly the
subgroup
ofPGL(2, Q)
which permutes the elements of the set If G iscyclic
thenIn
particular,
a -P (J
is afaithful, projective representation
of G inPGL(2, Q).
LEMMA 2. Let A
belong
toPl (L).
Then theidentity
holds
for
all a in Gif and only if 1 A
occurs inPl (k).
PROOF. If cr is in G then
But occurs in
° , 1
and it follows that =Pa4.’a.
In a similar manner we find that and This shows that = in
PGL(2, L)
for each cr in G.If
cr -1 (~)
= for all a in G’ thenHence
belongs
to On the otherhand,
if occurs in then B = withj3
inIPl (k).
Thus we havefor all a in G. This proves the lemma.
We define the subset
where the second
equality
follows from theprevious
lemma. Thus11 (P)
isexactly
the set of all cross ratios[a, a’, a"; 3]
withj3
in We note thatA(P)
does notdepend
on our choice ofgenerator
a for the extensionK / k .
For suppose that K = with I and
a 1
thecorresponding points
inP~(L).
Then there exists aunique
element T inPGL(2, k)
such that a =Wale
Thereforeand the
corresponding
set of cross ratios isAlso,
theonly points
inI~1 (k,)
which are fixedby
each element of(Pa :
cr EG}
are
[6
I I where 03B66 is aprimitive
sixth root of1,
and these are fixedpoints only
when G iscyclic.
It follows thatA(P)
does not intersectPl
We now consider the
problem
ofgiving good
effective lower bounds forthe
projective
distancew ° , 1 a
for all À inA(P).
Toward this end it will be convenient to define the exponent ofapproximation
for all A inA (P),
to be
so that
We also set
at each
place
w of L andA
simple
calculationusing (2.2)
and(2.3)
shows that1J(P) = 4(3
+5 1/2).
If[L : k]
= 6 and a’ and a" occur inkv
then the Liouville bound(2.12) implies
that
ev(A)
6 for all A inA(P).
In fact this estimate can besubstantially improved simply by using
thedefining
property ofA(P).
THEOREM 3. For each
point A
inA (P)
we have 0e" (a)
3.PROOF. Our initial
embedding
of L intokv
determines aplace
w of L suchthat
v
andIf
[L :
= 3 the result follows as in our derivation of the Liouville bound(2.12). Only
thespecial
case[L :
= 6requires
a separateargument.
The group G acts onplaces w
of L such thatw v,
where v is afixed
place
of k. If cr is in G then is theplace
of L determinedby
With respect to the
projective
metrics8w
andpoints
A inA(P)
this actionresults in the
identity
for all cr in G. We have
log
The desired
inequality plainly
follows from this.For each
pair
ofpoints a
1 andA2
inA(P)
we defineand
The main technical result of this paper is a formulation of the
Thue-Siegel principle
as aninequality
which isexplicit
in all constants. Theinequality
bounds the
product
whenevera
1 andA2
occur inA(P). By
Theorem 3 we have
9,
but ifr(Ai, a2)
andA2)
are. smallthen our
inequality
shows that is not muchlarger
than 6. Theprecise
result is as follows.THEOREM 4. and
~2
arepoints
inA(P)
thenAfter several
preliminary inequalities,
ourproof
of Theorem 4 will begiven
in Section 8.
If 8 > 0 then
by
a familiarargument (3.6) implies
thatis a finite set. Of course the argument which leads to this conclusion does not
provide
an upper bound forH (A)
when Abelongs
to the set(3.7).
In orderto obtain an effective result we must be able to determine a
point a
1 inA(P)
for which
is less than 3. Then will be an effective measure of
irrationality
foreach generator a of the cubic extension K =
kv.
THEOREM 5. Assume that
a in A ( P ) satisfies
3. Then we havefor all ,Ci
inI~1 (k).
Theimplied
constant in(3.9) depends
inA,
and o.PROOF. From the mean value theorem we have the
elementary inequality
whenever x and y are
positive
real numbers. IfA2 belongs
toA(P)
thenand, using (3.6),
we find thatLet 4D be the
unique
element ofPGL(2, L)
which satisfies(3.1)
and writewhere the
equality
on theright
follows from(2.4).
Ifj3
occurs in(k)
thenAnd from
(2.6)
we conclude thatAs occurs in
A(P)
theinequalities (3.10)
and(3.12)
can be combined. Inthis way we obtain the bound
The result
plainly
followsby appealing
to(3.11).
We now turn our attention to the
proof
of Theorem 4.4. - Local estimates
Throughout
this section Mand
N arenonnegative integers, w
is aplace
ofthe number field L and
Qw
is thecompletion
of analgebraic
closureLw.
Weidentify
the vector spaceEw
= withbihomogeneous polynomials
in
S2w [x, y] having bidegree (M, N).
We define two norms onEw.
The firstof these we denote
by ||w and
defineby setting
As
(x, y)
-~ is a well defined map from into[0, oc),
it follows thatis an alternative definition for this norm. The group
GL(2,
xGL(2, Qw)
acts on
Ew
asfollows,
if(A, B)
EGL(2, S2w)
xGL(2, S2w)
we define P(A,B) :Ew by
1 I 1IThen = p(AC, BD) and therefore
(A, B) -+
is arepresentation
of
GL(2, S2w)
xGL(2, S2w)
inGL(Ew).
LEMMA 6. Let F E
Ew
and(A, B)
EGL(2, S2w)
xGL(2, S2w).
Then we havePROOF. For each x in the
inequality
holds,
andsimilarly
withA,
x and Mreplaced by
B, y and N. The lemma followseasily by using (4.3).
Next we introduce a second norm on
Ew.
Indoing
so it will be convenientto set - ~ . ~ ,
Let
F(x, y)
begiven by (4.1).
If oo we defineand
if w ~ oo
we setLEMMA 7. Let
F(x, y)
occur inEw. If w ~’
oo thenand
if w oo
thenPROOF.
If w t
oo theidentity (4.5)
can be established as in[13],
Lemma2. Assume then that
w oo.
Letand write a for the
unique
rotational invariant measure on the Borel subsets of S such = 1. It followsusing [14], Proposition
1.4.8 and1.4.9,
thatWe conclude that
To
verify
theremaining inequality
in(4.6),
let x E S andy E S .
Thenby
Cauchy’s inequality
This shows that
whenever x E S and
y E S.
Theinequality
of theright
of(4.6)
followsimmediately.
Suppose
now that N = 0 and so F inEw
issimply
ahomogeneous polynomial
in which can be written asIf M is
positive
and F is notidentically
zero then there exist nonzero vectors~ 1, ~2 , ... , ~M
in such thatLEMMA 8. Let
F (x) in Ew be given by (4.7)
and(4.8). If w {
oo thenand
if
wI
00 thenPROOF. If oo then
(4.9)
is Gauss’ lemma forhomogeneous polynomials.
This can also be established as in
[13],
Lemma 2. We assume thenthat w ~ 100.
Let J denote a subset of
{ 1, 2, ... , M} having cardinality I J I
and write(4.8)
as
Then we have
where It follows that
and
This shows that
Next we define
with S and cr as in our
proof
of Lemma 7. As or isrotationally
invariant andthe
unitary
groupU(2, S2w)
actstransitively on S,
we find that I restricted to S is constant. The value of this constant isFor 0 in
and £ #
0 in we haveCombining (4.12)
and(4.13)
leads to theidentity
Finally,
we use(4.8), (4.14),
and Jensen’sinequality
to conclude thatThis verifies the bound
The
remaining inequalities
in(4.10)
follow from(4.6).
We return to consideration of
bihomogeneous polynomials
and assume nowthat M and N are
positive integers.
Werequire
twopositive
realparameters 9
and r. Then we define
We also define
to be the
subspace
of allpolynomials F(x, y)
inEw
such thatThus
F(x, y) belongs
toyw precisely
when its vector of coefficients(m, n)
-fm,n
issupported
on r. -In our next lemma we make use of thefollowing elementary inequality:
if0 s
1 and 0 t 1 thenLEMMA 9. Let F be a
polynomial
in :and -
, Thenwe have
PROOF.
Suppose that w t
oo. Thenfollows
using (4.16).
If
w oo
weapply Cauchy’s inequality
to deduce thatWe raise both sides of
(4.18)
to the powersw /2
andapply (4.16).
The desiredinequality (4.17)
followsimmediately.
Let F in
Ew
begiven by (4.1 )
and let(A, B )
occur inGL(2,
xGL(2, S2w).
Then acts on F and so also on the vector of coefficients(m, n)
-fm,n.
We writeso that
(m, n)
-~(P(A, B) f )m,n
is the vector of coefficients ofP(A, B) F.
Thenwe define
(If
F isidentically
zero we set its Indexequal
tooo.)
For K > 0 we haveand so it suffices to consider the Index
only
when 9MrN = 1. It will benotationally convenient, however,
to use all fourparameters
in laterapplications.
As p is a
representation
we have thesimple identity
Also,
0 andb ;
0belong
to thenThe map
is therefore well defined for
(A, B )
inPGL(2,
xPGL(2, Clearly
thepolynomial
F occurs inyw
if andonly
ifwhere
THEOREM 10. Let F be a
polynomial
inEw,
x E and 1 Assume thatfor
somepair (A, B)
in PGL Then we havePROOF.
Using (4.20)
we find thatTherefore we may
apply
Lemma 9 and obtain theinequality
Then Lemma 6 and Lemma 7
imply
thatThe result follows
by combining (4.22)
and(4.23).
5. - Global estimates
Let
L / k
be a Galois extension ofalgebraic
number fields with G =Gal(L/k).
We assume that a -1fro-
is afaithful, projective representation
of G in
PGL(2, k).
At eachplace v
of k we defineBecause of the way we have normalized absolute values we have
where the
product
extends over allplaces w
of L such thatfor almost all
places v
of k we defineNext we define
The group G acts on
places w
of L suchthat w I v
where v is a fixedplace
of k. If cr is in G then a w is the
place
of L determinedby
With respect to the
projective
metrics8w
andpoints
À in this actionresults in the
identity
for all c~ in G.
Let E = denote the L-vector space of
bihomogeneous poly-
nomials
in
L[x, y] having bidegree (M, N).
LEMMA 11. Let
F(x, y)
be apolynomial
in E and let~1 and À2
bepoints
inA(1f¡).
Assume thatA2) =,4
0 and thatfor
all a in G. Thenfor
anyplace
vof k
and anyembedding of
L intokv
we havePROOF. We
apply (4.21)
at eachplace w
of L. In this way we obtain the estimate, - ,
Taking
theproduct
over allplaces w
leads to the boundAn
embedding
of L intokv
determines aplace
iu of L such thatin I v
andat all
points
x and y in AsL / k
is a Galoisextension,
is constant on
places w with w I v
and thereforeFinally,
we use(5.5)
and the fact that G actstransitively
to conclude thatThis proves the lemma.
6. -
Dyson’s
lemmaIn this section we assume that Q is an
algebraically
closed field of char- acteristic zero andF(x, y)
is abihomogeneous polynomial
ofbidegree (M, N) having
coefficients in Q. Letdenote the stabilizer of Then
whenever U and V
belong
to stab Thatis,
the mapis constant on all
right
cosets of xstab~ (°) )
inPGL(2, Q)
xxPGL(2, Q).
Now let(a,,3) belong
to xPl (0)
and select(A, B)
sothat and i . We set
By
ourprevious
remarks index is well defined. We note that theanalogue
of(4.12)
is theidentity
We now describe the
projective
form ofDyson’s
lemma. Werequire
thefollowing objects:
DYSON’S LEMMA. Let F be a
bihomogeneous polynomial of bidegree (M, N)
in
S2 [x, y]
which is notidentically
zero.If
,for each j
=1, 2, ... ,
J thenThis can be obtained from the affine formulation
given
in[5]
with minormodifications. It can also be
obtained, again
with minormodifications,
fromthe more
general
result ofVojta [19].
For our purposes it will be useful tohave a second version which
specifically
bounds the index.COROLLARY 12. Let F be a
bihomogeneous polynomial of bidegree (M, N)
inQ [x, y]
which is notidentically
zero. Then we havePROOF. Select
Kj, j
=1, 2,...,
J, so thatIt follows that and
By Dyson’s
lemma we conclude thatThe
corollary
followsby taking
the supremum on the left hand side of(6.3)
over all values of the
parameters Kj, j
=1, 2,...,
J.In
applications
ofDyson’s
lemma it is necessary to estimate how the indexchanges
when a differentialoperator
isapplied
to F. Letbe a second
bihomogeneous polynomial
ofbidegree (R, S)
inQ[x, y]
and letbe the
corresponding
linearpartial
differentialoperator.
If(a, j3) belongs
towe find that
where
12
is the involutionIn the
special
case theinequality (6.4)
follows from theexpansion
The
general
case can then be establishedby selecting
A and B so that~ and
using
theidentity
where A’ and B’ are the transpose of A and B
respectively. (Note
also thatA’I2A
=12
andB’I2B
=12
inPGL(2, Q).)
Again
let A and Bsatisfy A-’ [0]
I = a andB-1 [0]
I =0.
We say thatthe
pair
ofnonnegative integers (R, S)
is critical with respect to the index at(a,
ifand
If
(R, S)
is critical at(a, j3)
then(6.4)
and(6.5)
can be used to show thatif and
only
ifAlternatively,
if(R, S)
is critical at(ct, 3)
thenif and
only
if7. - The
auxiliary polynomial
Here we assume that
0 9 l, 0 i 1
and also that 9t2/3.
Thenwe define
and
Let
0 ci
1 c2 oo . Thenwhere M -~ oo, N - oo, in such a way that
Therefore
is
positive.
Inparticular, I
> 0 if M and Nsatisfy (7.1 )
and aresufficiently large.
THEOREM 13. Let M and N be
positive integers
such that(7.1 )
holds and- ~ I
> 0. Then there exists abihomogeneous polynomial
of bidegree (M, N)
iny]
such that(i)
F is notidentically
zero,(v)
thecoefficients fm,n satisfy
(vi)
eitherPROOF. We will construct F so that its coefficients
fm,n
aresupported
on r’.This insures that
(ii)
and(iv)
hold. We alsorequire
thatfor all
pairs (i, j)
in r" and thisimplies
that(iii)
is satisfied. Thus we needa nontrivial solution in
integers to r" I homogeneous
linearequations in I r’ I
variables.
By
thesimplest
form ofSiegel’s
lemma there exists such a solution withIt remains then to estimate the
right
hand side of(7.4)
asin such a way that
(7.1 )
holds.Let
for 0 x
1, 1/1(0) == 1/1(1)
=0,
and defineThen we have
and
By combining (7.5), (7.6)
and(7.7)
we find thatNext we observe that
and in a similar manner
It follows that
Finally,
it can be shown thatin the
region 0 0 1, 0 -c 1,
and 8 i2/3.
The bound(v) plainly
follows from
(7.4), (7.8), (7.9)
and(7.10).
In the
polynomial
Ffails
tosatisfy (vi)
then we mayreplace
F withIt is easy to
verify
that(7.11)
continues tosatisfy (i)-(v).
8. - Proof of the
Thue-Siegel inequality
We now prove Theorem 4 in the
slightly
stronger formwhere and (
In
doing
so we may assume thatAi
1 and~2 belong
toA ( P )
andsatisfy
and
For if any of these conditions fail to hold then the basic
inequality (8.1 )
followsfrom Theorem 3.
Let 9 C PGL(2, Q)
denote thesubgroup
on theright
handside of
(3.2)
which isisomorphic
toS3. Then 9
acts onI~1 (kv)
and every orbit has six elements withexactly
threeexceptions.
Theexceptional
orbits arewhere §6
is aprimitive
sixth root of 1. If A inA(P) belongs
to any of theseexceptional
orbits then 2 followseasily.
Hence we may also assume thatÀI
andA2
each have orbitscontaining
sixpoints
under the action of Q.Let 0 0
1,
0 i1, with!
8 i~.
Then let M and N bepositive integers
such thatWe note tha