A NNALI DELLA
S CUOLA N ORMALE S UPERIORE DI P ISA Classe di Scienze
R OGER D. N USSBAUM J OEL E. C OHEN
The arithmetic-geometric mean and its generalizations for noncommuting linear operators
Annali della Scuola Normale Superiore di Pisa, Classe di Scienze 4
esérie, tome 15, n
o2 (1988), p. 239-308
<http://www.numdam.org/item?id=ASNSP_1988_4_15_2_239_0>
© Scuola Normale Superiore, Pisa, 1988, tous droits réservés.
L’accès aux archives de la revue « Annali della Scuola Normale Superiore di Pisa, Classe di Scienze » (http://www.sns.it/it/edizioni/riviste/annaliscienze/) implique l’accord avec les conditions générales d’utilisation (http://www.numdam.org/conditions). Toute utilisa- tion commerciale ou impression systématique est constitutive d’une infraction pénale.
Toute copie ou impression de ce fichier doit contenir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
The Arithmetic-Geometric Mean and Its Generalizations for Noncommuting Linear Operators
ROGER D. NUSSBAUM* - JOEL E.
COHEN+
Introduction
The
arithmetic-geometric
mean(to
be defined in amoment)
is the limitof an iterative process that
operates recursively
onpairs
ofpositive
realnumbers. For over two
centuries,
an enormous amount of effortby
somegreat
mathematicians
has been devoted tounderstanding
and togeneralizing
the
arithmetic-geometric
mean. There have been twosimple
reasonswhy
allthis attention has been devoted to what is in essence a very humble idea.
First,
the limit has animportant meaning
or use that apriori
couldhardly
besuspected
from the definition of the iterative process.
Specifically,
the limit can be used tocompute elliptic integrals,
which are of substantial mathematical and scientificinterest. Second,
the iterative process converges to its limit withexceptional rapidity (quadratically-also
to be definedlater),
so that very few iterativesteps
are
required
toapproximate
the limit veryclosely.
A
large
classical literature concernsgeneralizations
of the arithmetic-geometric
mean, or what could be called means and their iterations(see [6]).
This paper concerns extensions of the
arithmetic-geometric
mean and of the classicalgeneralizations
from the case that the variables are real numbers to thecase that the variables are linear
operators.
As in the case ofpositive
numberswe are interested in three
questions (for
eachgeneralization):
the existence of alimit,
thespeed
of convergence to thelimit,
andpossible explicit
formulas for the limit(for example,
in terms ofelliptic integrals). Though
themachinery
wehave
developed
and the results we have obtained aresubstantial,
as witnessedby
thelength
of this paper, our success inachieving
all three aims is notcomplete. First,
we prove the existence of a limit for all the iterations we considerformally
here. But for otherinteresting iterations,
which appear to beplausible operator-theoretic generalizations
of thearithmetic-geometric
mean for* Partially
supportedby
National Science Foundation grant DMS 85-03316.+
Partially
supported by National Science Foundation grant BSR 84-07461.Pervenuto alia Redazione il 20 Giugno 1987.
positive numbers,
we observenumerically
anapparent
convergence to a limit but are unable toexplain
the observationmathematically.
Hence we do notbelieve we have the last word on the existence of limits for
generalizations
of the
arithmetic-geometric
mean to linearoperators. Second,
forsimplicity
we establish a
quadratic
rate of convergenceonly
for the "monsteralgorithm"
considered
in Section 3below, although
we believe a similaranalysis
can beused to
determine,
in all the otherexamples
we treat, whether convergence is linear orquadratic. Third,
weinterpret
thelimiting
linearoperator
in terms ofelliptic integrals only
for a small subset of the iterations whose limits we proveto exist. Even
classically, explicit integral
formulas for limits of iterated means are knownonly
for a fewexamples
which are very close to thearithmetic- geometric
mean.However,
in our case(see
Section4),
wegive
afamily,
indexedby
real numbers A > 1, of reasonable definitions of thearithmetic-geometric
mean of two linear
operators
A andB;
but weonly
obtain anexplicit integral
formula when A = 1.
On
balance,
the results of this paper arelargely
foundational: we prove the existence of limits for a widevariety
ofoperator-theoretic generalizations,
many
apparently
new, of thearithmetic-geometric
mean.Though
our successin
finding explicit integral
formulas for the limits islimited,
it ispossible
that these
results,
and futureextensions,
will provepractically important
fornumerical
algorithms
tocompute
functions of matrices that can be derived from matrixelliptic integrals.
We now sketch the
arithmetic-geometric
mean and our results moreprecisely.
If A and B are
positive
realnumbers,
define a mapf by
If
fk
denotes the kth iterate off,
it is not hard to prove that there is apositive
number M =
M (A, B)
such thatThe number M is called the
"arithmetic-geometric
mean of A and B" or the AGM of A and B." FirstLanden,
thenLagrange
andfinally
Gaussobserved independently
thatLagrange
andLegendre
used this observation tocompute elliptic integrals.
Historical references to this work and to some of Gauss’
deeper
work on theAGM can be found in
[6]
and[15].
An enormous literature
concerning
"means and their iterations" touches on a wide range of mathematics[6].
Forexamples,
if A and B arepositive reals, 0 a, Q 1
and[11]
or p > 0, q > 0 and
then
where M is a
positive
numberdepending
on A andB,
and or p, q. Onecan also
study functions f
which are functions of mvariables,
m > 2, andtry
to prove
analogues
of(0.4).
Borchardt[9]
considered the mapfor
positive
realsA, B, C
and D andproved (this
is the easypart
of hiswork)
that .
where M is a
positive
numberdepending
onA, B,
C and D.Many
otherexamples
are mentioned in Section 2 below.A first
goal
of this paper is to describe reasonableanalogues
of the AGMand its
generalizations
when all the variables arepositive definite, bounded,
self-adjoint
linearoperators
on a Hilbert space.Abbreviating
thephrase "positive definite, bounded, self-adjoint
linearoperator"
to"positive
definiteoperator",
the first
question
is: what should be theanalogue
of(for A
and Bpositive reals)
when A and B arepositive definite
Moregenerally,
if satisfies di >
0,
1 i m,and
1 andAi,
1S i
m,_ _
are
positive reals,
what is a reasonableanalogue
ofA" 2 Am
when thevariables
Ai,
1i
m, arepositive
definiteoperators?
Wesuggest
that am
reasonable
analogue of it Aat
is;=1 ’
If,
forpositive
definiteoperators 1 j
m, and a real numberr #
0 wedefine ---
one can prove that
in the
operator
normtopology,
so oursuggestion
dovetailsnicely
withcertain
reasonable means.
Using (0.7),
wegive operator-valued analogues
of mapsf
like those mentionedbefore,
and we prove convergence off k ~ A, B ~
in thestrong operator topology.
Forexample,
a veryspecial
case of results in Section 2 is that if 0 a,fi
1, and A and B arepositive
definiteoperators,
andthen there exists a
positive
definiteoperator E
such thatThe first two sections of this paper deal with the convergence of very
general operator-valued
versions of extensions of the AGM. In Section1,
wegive
results(Theorems
1.1 and1.2)
which enable one to prove convergence in the strong operatortopology
of certain sequences ofn-tuples
ofpositive
definitelinear
operators.
Anexample
would be(Ak, Bk)
-- withf
as in(0.8).
The
key
idea in Sections 1 and 2 is toexploit
theconcavity
of certain maps A --+g(A),
forpositive
definiteA,
and to use the beautiful classicaltheory
ofLoewner. In the
applications
in Section2,
we useonly
theconcavity
of themaps A -
log
A andAp (0
p1)
and theconvexity
ofA --; A-1;
thefull Loewner
machinery
is not needed.The arguments
simplify
in the case of finite-dimensional matrices. Theorem1.2,
inparticular,
is not needed in the finite-dimensional case.In Section
2,
we use the convergence results of Section 1 to proveoperator-
valued versions of convergence theorems for iterates of many classical means.
Our convergence theorems
suggest
that our conventions were reasonable andprovide
an answer to thequestion
raised in[6,
p.196]
of how to extendthe usual means to
noncommuting
variables. The mapsf
we consider are notusually order-preserving,
so thegeneral
convergence results in Section 4 of[26]
(see [27]
for asummary)
are notapplicable.
In Section
3,
we extend the domain ofM(A, B),
theoperator-valued
AGMof A and
B,
topairs
of bounded linearoperators
which are notnecessarily
positive
definite andself-adjoint,
and prove that(A, B) - M(A, B)
isanalytic.
The
analogues
of thesequestions
are considered for a moregeneral
"monsteralgorithm"
introduced in[6].
We also consider the commutative case(AB
=BA)
and prove an
integral
formula forM(A, B) analogous
to that when A and B arereal. The commutative case was also treated in
[32],
but the discussion thereseems
incomplete.
There are
already
numerous papersconcerning operator-valued
versionsof the AGM and other means. Section 4 of our paper
displays
the connection between ouroperator-valued
definition of the AGM and one introducedby Fujii [19]
and Ando and Kubo[5].
We prove that the two definitions are ingeneral
different.
However,
there is a continuum of "reasonable" definitions of anAGM, parametrized by
A > 1, such that A = 1corresponds
to that ofFujii-Ando-Kubo
and A = 00
corresponds
to ours. For eachA >
1 and eachpair
ofpositive
definite operators A and B there exists in the limit a
positive
definiteoperator Ex
which is the AGM of A and B for thealgorithm corresponding
toA,
andgenerally E~
u.1. -
Convergence
criteria for sequences of linearoperators
We recall some standard notation and results. If X and Y are
complex
Banach spaces, we denote
by
the set of boundedcomplex
linearoperators from X to
Y;
X* = will denote the continuouscomplex
linear maps from X
to C,
thecomplex
numbers. If X =Y,
we shall writeL (X)
instead ofC (X, X). £(X, Y)
is a Banach space in the standard norm,IIAII
= X andllxll 1}.
If A G~(X,X), a(A)
will denote thespectrum
ofA,
sois not one-one and
onto}.
If D is an open
neighborhood
ofa(A)
andf :
D - C isanalytic,
we definef (A)
in terms ofCauchy’s integral
formula:where r is a finite union of
simple,
closed rectifiable curves in D which containsa (A)
in its interior. Anexposition
of the basic results about this functional calculus can be found in[17], [33]
or[36].
Recall that if ~4 denotes
the algebra
of functions which areanalytic
onan open
neighborhood of a (A),
then the mapf
-+f ( A)
definedby ( 1.1 )
isan
algebra homomorphism and a (f (A))
=/(cr(~4)). If g
isanalytic
on an openneighborhood
ofQ(B),
where B =f (A),
theng(B) = (g o f) (A).
If X and Y are real Banach spaces,
,~ ( ~, Y )
denotes thebounded,
real linear maps from X to Y. If A eC (X) and X
denotes thecomplexification
ofX,
then A can be extended
uniquely
to acomplex
linear mapA : fi - X,
and wedefine
a (A),
the spectrum ofA,
to bea (A).
Iff
isanalytic
on aneighborhood
of
a(A)
and =f (z) (where
w denotes thecomplex conjugate
ofw),
thenf ( A) ( ~) ~ X,
so we definef ( A)
= in this case.Aside from the norm
topology
onC (X, X),
there are alsolocally
convextopologies
called the"strong operator topology"
and the "weakoperator
topology" (see [17], Chapter 6).
If(Ak )
is a sequence of bounded linearoperators
in and,~ (~, X ) ,
then(A k) approaches
A in the strongoperator topology
as k - ocif,
forX,
and
Ak approaches
A in the weakoperator topology if,
for X and- -
If
is a sequence of bounded linearoperators
whichapproaches
abounded
linearoperator
A in the weakoperator topology,
we shall writeSimilarly,
if(Ak)
converges to A in thestrong operator topology
we shall writeand if we shall write
In this paper we shall also deal with sequences
( A ~ k ~ )
of orderedm-tuples
ofbounded linear
operators,
sowhere
A i (k) c £(X, Y)
for1 - j :!~
m. We shall say theA (k)
converges in thestrong operator topology
to them-tuple
A =(Ai, A2, ... , Am)
and writeA{k~
- A ifSimilarly,
we shall writeA~ k ~ ~ A
if- A;
for1 j m and All
~ .~1if
A~k ~
~ A for1 j rra.
"
If ~ is a
complex
Hilbert space with innerproduct
x, y > and.~ (H),
then A isself-adjoint
ifAx, y
>= x,Ay
> for all x, y e H.If A is
self-adjoint and f :
C is a continuous map, one can definef (A).
This definition agrees with that in(1.1) when f
isanalytic
on an openneighborhood
of A. If A EZ (H)
isself-adjoint,
we shall say that A is"positive
semidefinite"
(sometimes
callednonnegative definite)
iffor all x E H and A is
"positive
definite" if there exists e > 0 such thatfor all x E .~I with
We abbreviate
"positive
semidefinite" as"p.s.d.", "nonnegative
definite" as"n.n.d." and
"positive
definite" as"p.d.".
Positive semidefiniteness induces apartial ordering
on the set ofself-adjoint operators
A E~(~):
if A and B arebounded, self-adjoint operators,
we writeA B
if B- A ispositive
semidefinite.Henceforth,
whenever we say that A EC (H)
ispositive definite
orpositive semidefinite,
it will be assumed that A isself-adjoint.
If .~ denotes the set of
bounded, self-adjoint p.s.d.
linear mapsin.£ (H),
then K is an
example
of a cone(with
vertex at0)
in a Banach space Y =£(H);
and
K°,
the interior ofI~,
is the set ofself-adjoint, p.d. operators
in,~ ~ ~) .
In
general,
if C is a subset of a Banach spaceZ,
we say that C is a cone(with
vertex at0)
if C is aclosed,
convex subset of Z and(a)
if x EC,
thenC for all real numbers t > 0 and
(b)
if x E C - then-x ¢
C. A coneC induces a
partial ordering
on Zby x y
if andonly
if y - x E C. If D is a subset of a Banach spaceZi , Cl
is a cone inZl
andC2
is a cone in a Banach spaceZ2
andf : .D ---; Z2
is a map, we saythat f
isorder-preserving (with respect
to thepartial orderings
induced inZj by C~ )
if for all x and y in D suchthat x y (in
thepartial ordering
inducedby G1)
one hasf { x) f ( y)
(in
thepartial ordering
inducedby C2). Usually
we shall haveZ1 - Z2
and01
=C2.
If D is convex, we say thatf :
D -Z2
is "convex"(with respect
tothe
partial ordering
inducedby C’2 )
if for all xand y
in D and all real numberst with 0 t 1, one has
We shall say that
f
isstrictly
convex iff
is convex and forall x =1= y
in Dfor
We shall say that
f
is concave(strictly concave)
if -f
is convex(strictly convex).
Our first lemma is well-known for real-valued functions. The
proof
in ourgenerality
isessentially
the same and we omit it.LEMMA 1.1. Let D be a convex subset
of
a Banach spaceZi, O2
a coneitz a Banach space
Z2
andf :
D -Z2
a map which is continuous on line segnlents in D(so
the map t -f ~ ~ 1 -
+ty),
0 ~ t 1, is continuousfor
all x and y in
D). If,
with respect to thepartial ordering
inducedby C2,
onehas
for
all x and y inD,
thenf
is convex.If f :
D -+Z2
is convex andfor all x
and y in D with y one has
then
f
isstrictly
convex.If 1 j
n, arenonnegative
real numberssuch
that and
1 j :!~,
n, are anypoints
in D andf
is convex, thenIf f
isstrictly
convex and Sj > 0for
1j n,
thenequality
holds in(1.6) if
and
only if
all thepoints
xj areequal for 1 j
n.We shall
eventually
need somecontinuity
results for the strongoperator topology.
LEMMA 1.2.
Suppose
that(Lk)
is a sequenceof
bounded linear mapsof
a
complex
Banach space X toitself
and that(Lk)
converges to abounded
linear operator L in the strong operator
topology.
Assume thata(Lk)
CBand a (L) C B,
where B is a compact subsetof
thecomplex numbers,
that D is a bounded openneighborhood of
B such that r = a D consistsof
afinite number of simple,
closedrectifiable
curves and thatf
is acomplex-valued function
which is
defined
andanalytic
on an openneighborhood of
D.If
there exists a constant M such thatfor
allthen
j(Lk)
--~j(L). If
is an entirefunction,
thenf (Lk)
--~f(L). If
X is a Hilbert space and all the operators
Lk
are normal(or self-adjoint), f (Lk)
~f ~L)~
PROOF. For A E r and a fixed x E X one has
Applying
the estimate in(1.7) yields
The uniform boundedness
principle implies
that there is a constantMi
suchthat
L ~~ ~ M,
for allk >
1.(A -
is continuous in theoperator norm,
(1.8) implies
that there is a constantindependent
of A eF,
such that ~
for all
Inequality (1.8)
and the factthat Lk )
--; Limply
thatA version of the
Lebesgue
dominated convergence theorem nowimplies
thatso
f (Lk)
-f ~L~.
If .X is a Hilbert
space, A
E r andLk
isnormal,
it follows that A -Lk
and (A -
arenormal,
so(see [36])
Because
o(Lk) g
B and r aredisjoint compact
sets,(1.10) implies
that(1.7)
issatisfied,
sof (Lk)
2013~f (L) by
the firstpart
of the lemma.Finally, suppose X
is acomplex
Banach space andf
is entire. If R = >11,
we can take D =2RI
and for A E wehave
-
so
Thus the first
part
of the lemmaimplies
thatf (Lk)
--;f (L)
in this case also.0 .
REMARK > . >. The obvious
analogue
of Lemma 1.2 for the weakoperator
topology
is false. Let H be12
and let1 }
be the standard orthonormal basis for1~ .
For n >1,
define aself-adjoint operator An : H -
Hby
for
for
One can
easily
provethat (An > -
0, but(A 2) -
I, theidentity.
Before
stating
our first theorem we recall some basic facts about matrices withnonnegative
entries. If M is an n x n matrix all of whose entries arenonnegative,
M is called "irreducible"if,
for each orderedpair ~i, j )
with1 S i, i
n, there exists aninteger
p > 1(possibly dependent
on~i, j ) )
suchthat the
entry
in row I andcolumn j
of MP isstrictly positive.
The matrix M is called"primitive"
if there exists aninteger p >
1 such that all entries of MPare
strictly positive.
If ~VI is an irreducible matrix withnonnegative
entriesand
r denotes the
spectral
radius ofM,
then r > 0 and there exists aunique (within
scalar
multiples)
column vector u such that all entries of u arepositive
andIf M is
primitive
and if one definesMl
=(where
r is thespectral
radiusof
M),
then for any nonzero vector x, all of whosecomponents
arenonnegative,
one has
where u is the
eigenvector
in(1.11)
and a is apositive
numberdepending
onx.
If M = is a matrix with
nonnegative entries,
M is called "column- stochastic" ifn
for and M is "row-stochastic" if
for
It is an
elementary
exercise in thetheory
ofnonnegative
matrices that thespectral
radius of any column-stochastic(or row-stochastic)
matrixequals
one.Furthermore,
a trivialargument
shows that theproduct
ofcolumn-stochastic (or row-stochastic)
matrices is column-stochastic(or row-stochastic).
Now suppose that M is a
column-stochastic, primitive
matrix withnonnegative
entries and let u be a column vector, all of whose entries arepositive,
such thatIf we normalize u
by demanding
we know that u is
unique.
DefineMco
to be the n x n matrix all of whose columnsequal
u. If1 ~ j
n, denotes the standard orthonormal basisof 5i. n,
we know that isthe jth
column ofMk,
and because~~
iscolumn-stochastic, ( 1.12) implies
thatWe conclude from the
previous equation that,
if M is column-stochastic andprimitive,
thenIf .~ is a cone in a Banach space
X,
let ~’ denote the cone which is the n-fold Cartesianproduct
of K. Let Y denote the n-fold Cartesianproduct
ofX with any of the standard norms. If M is an n x n matrix with
nonnegative entries,
M induces a bounded linearmap W
of Y to Yby
where
It is easy to check that
W(C)
CC,
thatW(C - { 0 } j ç
C -{ 0 }
if no row of.~ is
identically
zero, and that(if KO
isnonempty) Co
if no columnof ~I is
identically
zero.We are in a
position
to state our first theorem. Forsimplicity,
we restrictourselves to the cone of
p.s.d.
bounded linearoperators
on a Hilbert space, but versions of thefollowing
theorem can begiven
for moregeneral
cones.THEOREM 1.1. Let K denote the cone
of positive semidefinite, self-adjoint
bounded linear operators on a Hilbert space H. Let C denote the
n-fold
Cartesian
product of K,
C = K x K x ... x K. Let Y denote then-fold
Cartesian
product of
X =£(H, B)
withitself,
Y == X x X x... x X.Suppose
that
f : CO
--;CO
is a continuous map andc~ : K° -;
X is a continuous mapand Y
by
Assume that
for
every A ECO
there exist B ECO
andpositive
reals asuch that
and
for
0, where thepartial ordering
in(1.14)
and(1.15)
is inducedby
Cand f ~
isthe j th
iterateof f.
Assume that there exists an n x n,primitive, column-stochastic
matrix M suchthat, for
every A ECo,
Let u denote the
unique
column vector such that allcomponents of
uiof
u arepositive
andand
and
let 1ri
denote theprojection
onto its ith coordinate.If, for
A ECo,
we
define
. ,~, B ,., ’.. - ,so
~~~~ _ 1ri f k (A)),
there exists E EKO
such thatFurthermore, for
1 i n, one hasPROOF. If u is the
eigenvector
in the statement of thetheorem, define,
forA = a function :
by
the formulaInequality ( 1.16) implies,
if A= (A,, A2, - - .,
thatIf we define
Ek - ~ ( f k ~ .~ ) ~ , Ek -
is anincreasing
sequence ofbounded,
self-adjoint
operators, and( 1.14)
and( 1.15) imply
thatEk - El
is boundedabove
(in
thepartial ordering
onK).
Thus as k - oo, E with E -Ei self-adjoint. By iterating inequality (1.16)
we seethat,
forany k
> 1,The remarks
preceding
the theoremimply
thatgiven
any e > 0, there existsNx
> 1 such that for all k> N1,
where is the matrix with all columns
equal
to u, and(1.22)
means thatfor 1
i, j
~ n, thei, j entry
ofMk
isgreater
than orequal
to thei, j entry
of (1 - e) M,, -
If k >Ni,
and 1 i n, it follows thatFor a
given x E H and e
> 0 there existsN2
such thatfor
because
.E~
--~ E.Combining inequalities (1.23)
and(1.24) yields
for k >N1 + ~2 ~
which
implies
that(using inequality (1.22)
andrecalling
that isbounded in
norm)
If,
for some and somez,
one hasthen
inequalities (1.26)
and(1.27) imply
The above
inequality
contradicts the fact thatso
inequality (1.27)
must be false. We conclude from(1.26)
thatStandard arguments using
thepolarization
nowimply
thatThere are several obstacles to
using
Theorem 1.1. The firstproblem,
ofcourse, is to prove the existence
of 0
and M as in Theorem 1.1 forexamples
ofinterest to us. We shall use the classical results of C. Loewner
concerning
theconcavity
oforder-preserving
maps from the cone K ofp.s.d.
bounded linearoperators
of a Hilbert space H toL (H).
However,
evenassuming
that we can establish thehypotheses
of Theorem1.1 in
examples
ofinterest,
Theorem 1.1provides inadequate
information when H is infinite dimensional. If H is finitedimensional,
theweak,
strong andoperator
normtopologies
onL (H)
areidentical,
so Theorem 1.1implies
for
If 0
is one-one with norm-continuousinverse,
one concludes thatIf ~Y is infinite
dimensional,
one wouldhope
that there exists ap.d. bounded
linear
operator
G such thatfor : -.
However, as Remark 1.1
shows,
the weakoperator
convergence in( 1.18)
may be very far from thestrong operator
convergencehoped
for in(1 .29).
We now
begin
to address there deficiencies.If K is the cone of
p.s.d.
bounded linearoperators
in£(H)
and H is aHilbert space, we need to know when certain maps defined on D =
.H°
areorder-preserving
or concave orstrictly
concave. The first lemma isLoewner’s
theorem[23, 24] concerning order-preserving
maps onK’;
anexposition
ofLoewner’s
theory
isgiven
in[ 16] .
LEMMA 1.3.
(C. Loewner)
Let K denote the coneof positive semidefinite,
bounded linear maps
of
a Hilbert space H toitself. Suppose that f : (0, 00)
-+ 3-is a continuous, real-valued map and that
f
has ananalytic
extension toU =
~z
c C : 0 or(Im(z)
= 0 andRe(z)
>0) )
such that > 0for
all z such thatIm(z)
> 0. Then the map A -+f (A) for
A E D == isorder-preserving
with respect to thepartial
order inducedby
K.Functions that
satisfy
thehypotheses
of Lemma 1.2 aref (z)
=log(z)
and-
zp,
op
1, andlog(z)
will be theexample
of most interest here.One can
give
asimple
and self-containedproof
that ifA,
B EKO
andA B,
thenB-1::; (although
we shall not doso). Using
thisfact,
we now provedirectly
that themaps A - log(A>
and A --->AP,
0 p 1, areorder-preserving
on If
~’°
andEt == (1 - t~ I
+tE,
where I denotes theidentity
map, then one caneasily
prove thatAn
algebraic manipulation gives
If A and .B are in
KO
Band At
= andBt
=for 1, then
Bt for 0 f t
1, soz
for 0t
1 andUsing (1.31)
and(1.32)
one finds thatlog(A) log(B).
That BP if A Band 0 p 1 follows
by
a similarargument
from the formula(see [21],
p.286)
We also need
concavity
results for maps of~°
toLEMMA 1.4.
(Ando [3]) Suppose
thatK,
Handf
are as in Lemma 1.3.Then, for A E K° ,
the mapA - f (A) is
concave and the map A -A f ~ A)
isconvex
(with
respect to thepartial ordering from K).
Ando states Lemma 1.4 for
finite-dimensional
Hilbert spaces(Theorem
4in
[3]),
but the sameargument,
based on Loewner’stheory,
works forgeneral
Hilbert spaces.
Lemma 1.4 is not
quite adeguate
for our purposes. We need strictconcavity
and
convexity results,
and in fact we shall need aproperty (see
Theorem 1.2below) analogous
to theproperty
of uniformconvexity
for norms,. Such results will follow from the strictconvexity
of the mapA -~ A~ 1
for A EKO,
and thisstrict
convexity
wasproved independently by
P. Whittle[34,
Lemma1]
and I.Olkin and J. Pratt
[29].
Whittle’s lemma is stated for finite-dimensional Hilbert spaces, but theproof applies
ingeneral
andyields
thefollowing
lemma.LEMMMA 1.5.
([34]
and[29])
Let H be a Banach space and suppose thatA, B c L (H) - If
A is a real number such that 0 A 1 andA,
B and~ 1 - À)A
+ ÀB are one-one and onto, thenIf H is a Hilbert space and K denotes the cone of
positive
semidefiniteoperators
in£ (H),
then the map A -4A-1
ofKG
toKO
isstrictly
convex.By exploiting
Lemma 1.5 we obtain thefollowing sharpening
of Ando’stheorem
(Lemma 1.4).
See also Bendat and Sherman[8].
LEMMA 1.6. Let
f,
K and H be as in Lemma 1.3.If
there do not existreal constants a and
(3
such thatf (x)
= 0, thenfor
A cKO,
athe map A -4
f (A)
isstrictly
concave.If
there do not exist real constants aand -y such that
f (x~ -
et -~-1x-l for
all x ~ 0, then A --~ isstrictly
convex on
KG.
PROOF. Loewner’s
theory implies
that for A not anegative real, À i
0,where a is
real, Q
> 0and p
is a finitenonnegative
Borel measure withsupport
in
( - oo, 0~ .
If we assume thatf (x)
is not an affine map,then p
is not the zeromeasure. From
(1.35)
and theidentity
we obtain
The map is
obviously
concave and Lemma 1.5implies
that foreach t ~ 0 and for A G
K°
the mapis
strictly
concave so(1.36) implies
A ~f (A)
is concave(Lemma 1.4).
Toprove
that A -- f ( A)
isstrictly
concave, takeA, K°
B anddefine
Equation (1.36) gives
The strict
concavity
of A ->-A-1 implies
that there existto
0, with to in thesupport of M,
x E H and 6 > 0 such thatfor all t 0 and strict
inequality
holds ininequality (1.38)
forUsing
this information in(1.37)
we findwhich proves strict
concavity.
Starting
from(1.36)
we see thatIt is easy to prove
directly
that A ~A2
isstrictly
convex on K. Therefore(because Q > 0)
the map ~4 2013~ aA + is convex on K andstrictly
convexif B
> 0. Somealgebraic manipulation
shows thatfor t ~ 0. Lemma 1.5
implies
that for each t 0, the mapis convex
and, in fact, strictly
convex if t 0. If thesupport of /-z
hasnonempty intersection
with(- oc, 0),
the same kind ofproof
as used before shows thatA -4Af (A)
isstrictly
convex(A
EKO).
If thesupport Of /-z
isfol, (1.35) implies
thatfor 0. If
(3
> 0, themap A --+ A f ( .A )
isgiven by A --
a A +in this case and hence is
strictly
convex.If/3=0, f ~ ~ ) = a - ~ a -1, contrary
to our
assumption.
0When
f ( z ~
= or 0 p 1, Lemma 1.6 can beproved directly
by using (1.31)
and(1.33)
and Lemma 1.5.An immediate consequence of Lemmas 1.1 and 1.6 is:
COROLLARY 1.1. Let H be a Hilbert space.
Suppose
thatA.1’, 1 j :5
rn,are
bounded, self-adjoint, p.d.
linear mapsof
H to H.If
sj,1 !,~ j S
m, arem
positive
numbers such thatE
sj = 1, itfollows
thatj=l
Equality
holds in(1.40) if
andonly if
all the operatorsAj
areequal. If
0 ci 1,
Equality
holdsin (1.41) if
andonly if
all the operatorsAi
areequal.
REMARK 1.2.
Suppose
thatAi
and sj are as above and that pand q
arereal numbers such that
1 ~ p
q.Defining
a =pq-1
andCorollary
1.1
implies
3
One
only
needs 0 p q to deriveinequality (1.42).
Since p > 1, Lemma 1.3implies
that B --~ isorder-preserving
onK,
so one obtains from(1.42)
For
positive
realinequality (1.43)
is a classical result[20].
Unfortunately,
when H is infinite dimensional Lemma 1.6 isinadequate
for our purposes. We need to
exploit
strictconcavity
and strictconvexity
in amore
quantitative
way,analogous
to the idea of uniformconvexity
for a norm.The next lemma illustrates the sort of uniform
convexity
we need for the case of thestrictly
convex map A -A-1
when A ispositive
definite.LEMMA 1.7.
Suppose
thatAi,
1 It m, arep.d.,
bounded linear mapsof
a Hilbert space H intoitself
and 0.1 _for
1 i m, where a and{3
marepositive
reals. Assume that ak, 1 k m, arepositive
reals such that2:
ak = 1.Then, for
1i, j ~
m,k=l
where the
inequality refers
to thepartial ordering
inducedby
the coneof positive semidefinite
bounded linear maps.PROOF. Because A - is convex, the left side of
(1.44)
isalways greater
than orequal
to zero. To prove(1.44)
it suffices(by relabelling)
toprove it when i =
1, j = 2.
If A > QI and
B > aI,
then thespectral mapping
theoremimplies
A‘~ B‘1
andand
Using inequality (1.45)
in(1.34) gives
If we define and
inequality (1.46) gives
If we define
inequality
and the
convexity
of A --+ A - 11give
Inequality (1.48)
isprecisely
the statement of the lemma for i = 1and j =
2.0
The next
theorem,
when combined with Theorem1.1,
will enable us to prove convergence in thestrong operator topology.
THEOREM 1.2. Let K be the cone
of positive semidefinite bounded
linearmaps
of
a Hilbert space H intoitself.
Let C denote them-fold
Cartesianproduct of
K withitself.
Thus C CY,
where Y is them-fold
Cartesian pro-duct
of
X =£(H)
withitself.
Assume that(B(k)), k >
0, is a sequence inC° ,
and writeB ~ k ~
=( B 1~ j , B~k ~ , ... ~ Suppose that ( 0, aa)
- ’-I isa real-valued
function
such that limdo(x)ldx
= 0 and suchthat 0
has an:1:-+00
analytic
extension to U ={z E C : Im (z) =
0 or(Im(z) =
0 andRe(z)
>0))
and > 0
for
all z such thatIm(z)
> 0. Assume that there existpositive
numbers