C OMPOSITIO M ATHEMATICA
P ETER F LOR
On groups of non-negative matrices
Compositio Mathematica, tome 21, n
o4 (1969), p. 376-382
<http://www.numdam.org/item?id=CM_1969__21_4_376_0>
© Foundation Compositio Mathematica, 1969, tous droits réservés.
L’accès aux archives de la revue « Compositio Mathematica » (http:
//http://www.compositio.nl/) implique l’accord avec les conditions géné- rales d’utilisation (http://www.numdam.org/conditions). Toute utilisation commerciale ou impression systématique est constitutive d’une infrac- tion pénale. Toute copie ou impression de ce fichier doit contenir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
376
On
groups
ofnon-negative
matricesby Peter Flor
Index of notation
Wolters-Noordhoff Publishing
Printed in the Netherlands
J: some group of
non-negative
matricesI: the neutral element of 9 N = ker I
S = im 1
P : the cone of
non-negative
vectors in R"P0 = IP
A1, ···, Ak:
theedges
ofPo
a, E
Aj (j
= 1, ···,k),
a, ~ 0B -
{e1, ···, en},
the set of unit vectors in RnM ==NnP
Mj = {x ~ P | Ix ~ Aj}
Bo
= N ~ EEj = (Mi
nE)BE0 (j
= 1,···, k) K,:
the linear hull ofEj
ai =
bj+cj, bj ~ Ki,
c, c- M.There has been some
interest,
in recent years, in groups of non-negative
matrices. Brown[1]
discovered thestriking
fact that any such group is finite if it iscompact,
and Schwarz([6], § 4)
gavea detailed
description
of groups of stochastic matrices. In these papers, groups ofnon-negative
matrices were studied in the context ofsemigroup theory.
Both authors restricted their atten- tion tocompact
groups.In the
present
paper afairly complete description
of all groups ofnon-negative
matrices will begiven,
without the restriction ofcompactness.
This includes aproof
of Brown’s theorem as well asan extension of some results of Schwarz. Our method is
quite elementary, contrasting
with Brown’sproof
which uses Lietheory.
Indeed it was the desire to prove Brown’s theorem in an
elementary
manner which led to the results
presented
here.377
Let P C Rn be the cone of vectors with
non-negative
compo- nents, and denoteby J
any group ofnon-negative n
X n matrices.Non-negativity
means 9P C P. Denoteby
I the unit element of J. 1 is anidempotent
matrix. So Rn = N (D S where N = ker I=
(x ~ Rn| Ix
=0}
and S = im I ={IxBx
ERn}.
The relationsG = GI
= IG,
I = GG-1= G-1Gimply
N = kerG,
S = im G forevery G E 9. Thus dim S is the rank of every element of J
(the
rank
of J,
as we shall say forshortness).
OnS, e
acts as a trans-formation group in the usual sense; in
particular,
Ir = x for everyx ~ S.
Let
Po
= P nS,
G E J. We have GP C P and also GP CS,
thatis,
GP CPo. Furthermore, Po
=I Po
=GG-1 P0
CGPo
C GP. SoGP =
GPo
=Po,
for every G E 9.Like P, Po
is a convexpyramid.
We shall have tostudy
certainextreme subsets of
Po.
A subsetQ
of a convex cone C is calledextreme if
Q
is a convex cone anda E e, b E C, a + b ~ Q imply
a E
Q,
b EQ.
A convex cone C isspanned by
a set D C C if Cis the set of all linear combinations of elements of D with non-
negative
coefficients.Alternatively,
D spans C if C is the smallestconvex cone
containing
D. Thefollowing
lemma is obvious.LEMMA 1.
Il
D spans C andQ
is extreme inC,
D nQ
spansQ.
Define
M = N ~ P, E0 = N ~ E
where E ={e1,
e2’ ...,en} is
the set of unit vectors in Ryl. M is extreme in
P,
and so isspanned by E0, by
Lemma 1. Further letA1, ···, Ak
be theedges
ofPo,
define
Mj = (r
EP|Ix ~ Aj},
and letEj
be thecomplement
ofEo
inMi n E, for i
= 1, 2, ···, k.Every M,
is extreme in P since theAj
are extreme in
Po.
Soby
Lemma 1again, Mj
isspanned by E0 ~ Ej. For y ~ l, we have Mj ~ Ml = M, Ej n El ==
0.Now select a non-zero
vector aj
on everyAj.
Forevery
thereis a
unique decomposition aj
=bj + cj where bj
is in the linear hull ofE; , and ci
E M.lai = aj ~
0 ;so aj ~ M, bj ~
0. Since theBi
aremutually disjoint, the b,
arelinearly independent.
Further-more, ci E M which is
spanned by E0;
this isdisjoint
from everyBi (1 = 1, - - -, k).
So any linear relationinvolving the a,
wouldentail the same relation for the
bj.
We haveproved
LEMMA 2: al, ... ak are
linearly independent.
Po
= P n S is apointed
closed convexpyramid,
thatis,
thesolution set of a
system
of weakhomogeneous
linearinequalities,
of maximal rank. It is well known that a closed
pointed
convexpyramid
isspanned by
itsedges. (See [4],
pp. 40-43, or[3],
Corollary 1B. )
SoP,
isspanned by
al, ···, ak. On the otherhand,
Po = IP
containsIe1, ···, Ien;
so its linear hull containsS,
andtherefore is
equal
to S. This shows that S is the linear hull of{a1, ···, ak}; by
Lemma 2,{a1, ···, ak}
is a basisof S,
and dim S = k.Now it is easy to describe the way in which J
operates
on S.Since
GPo
=Po
for every G EJ,
Gpermutes
theedges
ofPo:
where
03BBj(G)
> 0, and 03C0G is somepermutation
of the set{1,
2, ’ ’ ’,k}. Clearly, 03BBj(I)
= 1 forall j, and n,
is theidentity.
Once I is
given,
and N, a1, ’ ’ ’, ak have been defined as above,equation (1) completely
describes allpossible
G. Indeed let03BB1, ···, 03BBk
be anypositive numbers,
and 03C0 anypermutation
of{1, ···, k}.
DefineG(03BB1, ···, 03BBk; 03C0)
to be that matrix G which satisfiesSince
the a,
are a basis ofS,
and Rn = N E9S,
a matrixG(03BB1, ···, Â,; 03C0)
isuniquely
definedby (2).
1 claim that G is non-negative,
i.e. that GP C P. NowGP0
=Po
is obvious from(2)
andthe fact that
the 03BBj
arepositive.
Furthermore, we have IN ={0}, Iai = ai (j = 1, ···, k); so G = GI, GP = GIP = GP0 = P0.
A
simple
calculation shows that the differentG(03BB1, ···, 03BBk; 03C0) multiply according
to the ruleIn an abstract
setting,
wherethe 03BBj
are elements of anarbitrary
group
B,
the group definedby (1) (or (3))
was introducedby
Ore
[5]
who called it thecomplete
monomial groupof degree
k overB. So we have
proved
THEOREM 1.
Every
maximal groupof non-negative
matricesof
rank k is
isomorphic
to thecomplete
monomial groupof degree
k overthe reals.
(Of
course,by
anelementary
theorem ofsemigroup theory,
every
subgroup
of asemigroup
is contained in aunique
maximalsubgroup. )
In
fact,
ourproof
shows somewhat more thanjust
the isomor-phism
stated in Theorem 1 : for any two maximal groups of non-negative n
X n matrices whose ranks areidentical,
one is theimage
of the other under some inner
automorphism
of thesemigroup
ofall n n matrices. This is not so, however, within the
semigroup Hn
of allnon-negative
n n matrices. Indeed consider the idem-potents
379
both of rank two. The
equation Ii X = 0
has non-trivial solutions in:Y?3
for i = 2 but not for i = 1. So there is noautomorphism
of
H3,
inner or not, which maps(the
maximalsubgroup
contain-ing) h
onto(the
maximalsubgroup containing) I2 .
The considerations
preceding
Theorem 1give
acomplete description
of all maximal groups ofnon-negative
matrices pro- vided we know allpossible systems (N,
a1, ···,ak).
Some informa-tion about these systems is
implicit
in our next theorem which describes thenon-negative idempotent matrices, generalizing
Doob’s
description
of stochasticidempotents ([2],
Theorem2).
THEOREM 2. Let I be a
non-negative idempotent
matrixof
rank k.There exists a
permutation
matrix P such thatwhere
the J,
arenon-negative idempotent
matricesof
rank one, thatis,
matrices(03B1s03B2t),
as > 0,03B2s ~
0,03A303B1s03B2s
= 1.Conversely,
every matrix(4) where J is given by (5)
while A and B arearbitrary
non-negative
matricesof appropriate sizes,
isidempotent
andof
rank k.Remember that the
change
from I to P-’IP means that acertain
permutation
issimultaneously performed
on the rows andcolumns of I. This
operation
is anisomorphism
for matrix multi-plication
and thus does not interfere withidempotence.
We
begin by proving
aparticular
case of Theorem 2. Assume that no columnof
I is zero. This means thatEo
isempty.
Assumefurther that no row
of
I is zero either. It follows that every ei occursin the canonical
decomposition
of some xE S,
and so of some aj;in other
words,
uEj =
E. Now letKi
be the linear hull ofEj.
Then Rn =
K,
~ ··· QQKk . Furthermore, IKi
is the linethrough
a,, for every
j.
So I is reducedby
theKi,
and each restriction ofI to
some Kj
has rank one.By rearranging
the coordinates in sucha way that the unit vectors
belonging
to eachKi
aregrouped together,
we find a matrixwhere
the Jj
areidempotents
of rank one. So we haveproved
Theorem 2 for the
particular
case indicated above.In the
general
case, webegin by grouping
the indicesi = 1, 2, ···, n in four sets
according
to whether the i-th row and column of I are both zero, or the i-th row is zero but the i-th column is not, and so on.By simultaneously rearranging
rows andcolumns,
we find a matrixwhere
Pi
is apermutation matrix,
andK, L,
M are such that K and L have no zero row in common, and K and M have no zerocolumn in common.
I1
isidempotent:
K2 =K,
KL =L,
MK = M,ML = N. Since K and L = KL have no zero row in common, K cannot have a zero row.
Similarly,
no column of K is zero.By
thefirst
part
of theproof,
a furtherpermutation performed
simultane-ously
on the rows and columns of Kproduces
a matrix of the form(5).
Thiscompletes
theproof
of Theorem 2 in one direction. Theconverse is trivial.
Equation (3)
shows that themapping x :
G - 03C0G is a homo-morphism
of 9 into thesymmetric
groupIn
the case in whichJ is
bounded,
more can be said.LEMMA 3.
If J is bounded,
03C0 is anisomorphism.
This is an immediate consequence of
LEMMA 4.
If J is bounded,
andi f for
someindex j
andfor
someG e
9, 03C0G(j) == j,
thenÂi (G)
== 1.Under the
assumptions
made in Lemma 4,Gaj = 03BBj(G)aj
andG-laj == (03BBj(G))-1aj;
soby iteration,
381 Since 9 is
bounded,
so is the set{Ctaj|t
= 0, ±1,±2, ···}.
SinceÂj (G)
> 0, thisimplies 03BBj(G)
= 1.To derive Lemma 3, assume that G E ker 03C0, i.e. 7&G is the
identity.
So
by
Lemma 4,Gaj
= ai forevery
Since S is the linear hull of the a; , it follows that Gx = x for every x ES,
i.e. G = I.c:9 thus is
isomorphic
to a group ofpermutations
on k elements.Actually, J
itself acts as a group ofpermutations
on a set ofsuitably
chosenmultiples
03BC1a1, ···, Pkak. For suppose the indices iand i
are in the same class oftransitivity
withrespect to 03C0(J).
If G and H are elements of J
satisfying 03C0G(i)
=03C0H(i)
=i,
Lemma 4 shows that
Gai
=Hai (=
somemultiple
ofaj).
So it ispossible
to choose 03BC1, ···, 1 Yk in such a way thatfor all
j. (This
choice isunique,
up to a commonfactor,
if andonly if 03C0(J)
istransitive.)
We also see that 9 is contained in(at least)
one maximal bounded group of
non-negative
matrices: once multi-ples di
= !liai have beenfixed,
consider the set of all linear transformationsvanishing
on N andpermuting
thedj.
We haveproved
THEOREM 3.
Every
bounded groupof non-negative
matrices iscontained in at least one maximal bounded group
of non-negative
matrices.
Every
maximal bounded groupo f non-negative
matrices isisomorphic
to thesymmetric
groupJk
where k is the ranko f
thematrices in the group.
In
particular,
every bounded group ofnon-negative
matrices is finite. This is Brown’s theorem.In the
special
case of stochasticmatrices,
our Theorem 3 isrelated to Theorem 8 of
[6]
which says that every maximal group of stochastic matrices of rank k isisomorphic
to//k.
To deducethis fact from our Theorem 3, we have to show that if J consists of stochastic
matrices,
then among the maximal bounded groups ofnon-negative
matricescontaining J
there is one, atleast,
whichconsists
entirely
of stochastic matrices.(Since
the stochastic n X nmatrices form a
semigroup,
the theorem onsemigroups
mentionedat an earlier occasion
implies
that there is at mostone. )
Now a
non-negative
matrix G is stochastic if andonly
if Ge = ewhere e =
e1+e2+ ···
+en. Let e =x+03A3kj=103BCjaj, x
E N. Since Iis
stochastic, e
= Ie= 0+03A303BCjaj.
So x = 0. ThusIf Ci
EE=,
say,03BClbl
is theonly
term on theright
whose i-thcoordinate may be different from zero. Since the i-th coordinate of e is one, we have
Ili =1=
0, for allj.
Now for every G eJ,
So
This shows that 9 acts as a group of
permutations
on the set{03BCjaj|J
- 1, 2, ···,kl,
and may be extended to the group of all linear transformationsvanishing
on N andpermuting
the !liai.In the
proof
of Theorem 1 it was shown that transformations of this kind arenecessarily non-negative.
Sincethey
leave invariante
= 03A303BCjaj, they
are stochastic. This proves Schwarz’s theorem.In
concluding,
1 amglad
toacknowledge
asuggestion
from thereferee which resulted in
simplifying
theproofs
of Lemma 2,Theorem 2, and Schwarz’s theorem.
REFERENCES
D. R. BROWN
[1] On clans of non-negative matrices. Proc. Amer. Math. Soc. 15 (1964) 671-674.
J. L. DOOB
[2] Topics in the theory of Markoff chains. Transactions Amer. Math. Soc. 52 (1942)
37-64.
A. J. GOLDMAN and A. W. TUCKER
[3] Polyhedral convex cones. In: Kuhn-Tucker (ed.), Linear inequalities and related systems. Annals of Mathematics Studies 38, Princeton University Press, Prince- ton, N.J. (1956) pp. 19-40.
H. MINKOWSKI
[4] Geometrie der Zahlen. Teubner, Leipzig und Berlin 1910.
O. ORE
[5] Theory of monomial groups. Transactions Amer. Math. Soc. 51 (1942) 15-64.
0160T. SCHWARZ
[6] On the structure of the semigroup of stochastic matrices. Publ. Math. Inst.
Hungar. Acad. Sci., Ser. A, 9 (1964) 297-311.
(Oblatum 16-10-68)