A NNALES DE LA FACULTÉ DES SCIENCES DE T OULOUSE
P ETER M CCULLAGH
Quasi-symmetry and representation theory
Annales de la faculté des sciences de Toulouse 6
esérie, tome 11, n
o4 (2002), p. 541-561
<http://www.numdam.org/item?id=AFST_2002_6_11_4_541_0>
© Université Paul Sabatier, 2002, tous droits réservés.
L’accès aux archives de la revue « Annales de la faculté des sciences de Toulouse » (http://picard.ups-tlse.fr/~annales/) implique l’accord avec les conditions générales d’utilisation (http://www.numdam.org/conditions).
Toute utilisation commerciale ou impression systématique est constitu- tive d’une infraction pénale. Toute copie ou impression de ce fichier doit contenir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
- 541 -
Quasi-symmetry and representation theory (*)
PETER
MCCULLAGH (1)
E-mail: [email protected]
Annales de la Faculte des Sciences de Toulouse Vol. XI, n° 4, 2002
pp. 541-561
RESUME. - La quasi-symetrie est un modele pour les tableaux carres dont les lignes et les colonnes sont indexees par un meme ensemble.
Dans un modele log-lineaire, la quasi-symetrie est associee a un sous es-
pace vectoriel QSn de certaines fonctions reelles definies sur n x n. Une
permutation des niveaux des facteurs induit une transformation lineaire QSn - QSn faisant de ceci une representation du groupe symétrique Sn Toutefois, il a plusieurs representations du groupe qui sont totalement
insatisfaisantes en tant que modeles lineaires ou log-lineaires. La quasi- symetrie est aussi une representation hereditaire du groupe symetrique, qui dit que la restriction de Q Sn+ 1 au sous tableau principal n x nest egale a QSn Une autre maniere de dire la meme chose est que, en tant que
sequence d’espaces vectoriels, QS est une representation de la catégorie des applications injectives sur des ensembles finis. Cette propriete est d’une importance fondamentale pour les modeles statistiques. Cet article
a pour but de lister toutes les sous representations hereditaires par des ma-
trices carrees reelles et d’expliquer comment celles-ci peuvent etre utilisees pour la construction de modeles. Nous concluons qu’il y a exactement six modeles log-lineaires non triviaux pour un tableau de contingence carre.
ABSTRACT. - Quasi-symmetry is a model for square contingency tables
with rows and columns indexed by the same set. In a log-linear model, quasi-symmetry is associated with a vector subspace
QSn
of certain real- valued functions on n x n. Permutation of factor levels induces a linear transformation QSn, making this into a representation of the symmetric group Sn However, there are many group representations thatare totally unsatisfactory as linear or generalized linear models. Quasi- symmetry is also a hereditary group representation, which is to say that the restriction of
QS n+ 1
to the leading n x n sub-array is equal toQSn.
.Another way of saying the same thing is that, as a sequence of vector spaces, QS is a representation of the category of injective maps on finite sets. This property is of fundamental importance for statistical models.
This paper sets out to list all hereditary sub-representations by real-valued square matrices, and to explain how these may used in model construction.
We conclude that there are exactly six non-trivial log-linear models for a
square contingency table.
~ * ~ Recu le 18 septembre 2001, accepte le 18 septembre 2002
( 1 ) Department of Statistics, University of Chicago, Chicago, 11 60637, U.S.A.
1. Introduction
A real-valued square matrix
f
is said to bequasi-symmetric
oforder n,
if there exists a
vector g
E Rn and asymmetric
matrix h such thatEquivalently,
sincehi~
= 9i+ gj + hij
issymmetric,
the condition canbe written in the form
fij
=hij
+ 9j. . Theoriginal
definition ofquasi- symmetry, given by
Caussinus(1965)
in the context ofpositive
matricesand
log-linear models,
involvesmultiplication
rather than addition. Thatis to say, a
positive
matrix m isquasi-symmetric
in theoriginal
sense ifthe matrix
log
m withcomponents log
mij isquasi-symmetric
in the senseof
(1).
If
f, f’
are bothquasi-symmetric,
there exists a vectorg’
and a symme- tric matrix h’ such thatf ~
+ .Consequently,
for all scalars cx,ct’,
the linear combination
03B1f
+ct’ f’
is alsoquasi-symmetric.
That is to say, the set ofquasi-symmetric
matrices of order n is a vector space. We writeQSn
Cnoting that, for n > l, QSn
has dimension(n2
+ 3n -2)/2.
The co-dimension for the residual
degrees
of freedom is(n - 1 ) (n - 2) /2.
As many
applied
statisticians are well aware,quasi-symmetry
is no or-dinary
vectorsubspace.
It arises as a statistical model in all manner ofapparently
unrelatedapplications,
such as thefollowing.
(i)
Case-control studies inepidemiology (Pike, Casagrande
and Smith1975;
Darroch1981; McCullagh 1982);
( ii )
Reversible Markov chains(Kelly, 19 7 9 ) ;
(iii) Gravity
models inmigration studies, (Stewart, 1948; Upton, 1985);
(iv)
Transmissiondisequilibrium
models ingenetics, (Spielman
et al.1993;
Ewens and
Spielman 1995;
and Sethuraman andSpeed 1997), (v)
Matchedpairs designs (Fienberg
and Larntz1976),
(vi)
Citation studies in science(Stigler, 1994).
The
closely
relatedBradley-Terry
model(Bradley
andTerry, 1952)
isfrequently
used forassessing
teamstrengths (Barry
andHartigan 1993),
and for
rating
teams orplayers
in tournaments(Joe 1990). Although
thederivations seem to have little in common, it is hard to believe that there is not a common theme.
The chief aim of this paper is to uncover what it is that
distinguishes quasi-symmetry
fromordinary
run-of-the-mill vectorsubspaces,
and to de-scribe the entire class of vector
subspaces
that have the sameproperties.
Quasi-symmetry
has twoproperties,
both relevant to statisticalmodels,
that set it
apart
from thegreat majority
ofsubspaces.
The firstproperty
is connected with
representation theory
for thesymmetric
group(Diaconis, 1988),
the idea herebeing
that the order in which the factor levels are listed is sometimesentirely arbitrary.
Apermutation
7r ESn
actsnaturally
onvectors
f
E TZnby composition
such thatf
is carried to7r* f
with compo- nents(7r* f)i
= . The samepermutation
also actsnaturally
on square matrices inRn2 by composition
such that the matrixf
is carried to7r* f by
This is a linear transformation
nn 2 -t nn 2
which could beexpressed
in theform of matrix
multiplication
if that werehelpful.
Iff
isquasi-symmetric
with the
decomposition ( 1 ),
theimage 7r* f
is alsoquasi-symmetric:
such that h’ is
symmetric.
Inalgebraic terminology,
the association of each group element 7r with a linear transformation 7r*:QSn
is a rep-resentation *:
Sn
-GL(QSn)
of thesymmetric
groupby
invertible lin-ear transformations --
QSn
on a vector space. The induced linear transformation is not in fact ahomomorphism
in the usual sense, but ananti-homomorphism:
the groupproduct
is carried to thecomposition cp* ~r* : ~Zn
-- in reverse order. This reversal is of little consequence for grouprepresentations,
but it isimportant
in the criticalproperty
of inheritance.The second
property
ofquasi-symmetry
is not so much aproperty
of the vectorsubspace
CRn 2
as aproperty
of the sequence ofsubspaces
C
~Z 1 ~ , QS’2
CR , ... ~ .
That is to say, the secondproperty applies
toquasi-symmetry
as a statistical modelformula,
defined in a similar mannerfor each
integer n >
0. Iff
EQSn,
theleading (n - 1 )
x(n - 1 )
sub-matrixis also
quasi-symmetric,
and thus in This humble inheritance prop-erty guarantees
that the sub-modelsQS 4
CR42
,QS5
CR52
and so on,are alike and have similar
interpretations. Generally speaking,
the numberof recorded levels is rather
arbitrary,
and determined to some extentby
theinvestigator through
selection andaggregation. Any pattern
that arises asa consequence of such
capricious
choices is of little scientific interest. Fora model to be
sensible,
it must beimmune,
or at leastrobust,
to selectionof levels
and,
ifrelevant, aggregation
of levels. Selection andaggregation
are not invertible
transformations,
so grouprepresentations
do notbegin
toaddress this inheritance
property,
which is the criticalingredient
that dis-tinguishes
apossibly
sensible statistical model from adefinitely silly
model(McCullagh, 2000).
It is easy to see, for
example
that thediagonal matrices,
thesymmetric
matrices and the
skew-symmetric
matrices aregroup-invariant
and also have the inheritanceproperty. So, although quasi-symmetry
is veryspecial,
it iscertainly
notunique
in thisrespect.
With a littleeffort,
it ispossible
toexhibit a number of other
group-invariant
sequences, such as the constantmatrices,
that also have the inheritanceproperty.
The purpose of this paper is toprovide
acomplete catalogue
of statistical models that occur as sub-representations
inRn2.
.2.
Representation
ofinjective
maps2.1. Model formula
We set out in this section to
identify
all model formulae for squaretwo-way
tables with rows and columns indexedby
the same labels. For thispurpose, a model formula is defined as follows. For
each n >
0 a modelformula M identifies a vector
subspace liIn
C Rnxn that is closed under simultaneouspermutation
of rows and columns.Moreover,
the sequence is such thatMn
is embedded inMn+1 by selecting
theleading
n x nsub-array,
i.e.by
aprojection
that deletes the last row and column. We discuss in section 3 what modifications arerequired
to make the definitionappropriate
forcontingency
tables andlog-linear models,
where the issue ofaggregation
of levels also arises.2.2.
Composition
Let n
= {I, ... , n~,
so thatn2
= n x n is the set of orderedpairs
of real numbers~ (i, j ) :1 i, j n).
It may behelpful
to think ofn2
as atable,
or array. For the
moment,
this is an array ofempty slots,
but the intentionis to fill it with real numbers as in a square matrix or
contingency
table. Afunction
f : n2 ~
R is a square matrix withcomponents fij
=f(i,j),
eachof which is a real number. Such a function is
typically displayed
as a squarematrix,
such asfor n = 4.
Consider now the
injective
map cp: m --~ n, in which m = 3 and n =4,
defined
by
To say that the map is
injective
is to say that it isone-to-one,
i.e. distinctpoints
in the domain m =~1, 2, 3~
have distinctimages
in n =( 1 , 2, 3, 4~.
With
f
defined as in(2),
thecomposition p* f
is a square matrix of order 3 whosecomponents
are the values off
on there-arranged sub-array
That is to say,
p* f
is the 3 x 3 matrixNote that
diagonal
elements remain on thediagonal,
andoff diagonal
ele-ments remain
off-diagonal.
It is clear that
for all scalars ct,
a’, showing
that the induced transformationcp* : Rn2
--Rm2
is a linear transformation from matrices of order n into matrices of order m. Note that the direction ofp*
is reversed relative to p.2.3.
Representation
Denote
by
I the set of allinjective
maps on finite sets. It is sufficient forour purposes to
pretend that,
for eachinteger ?~ ~ 0,
there isonly
one set ofsize n,
namely
n= {1,..., , n~.
To each map (/?: n ~ n’ therecoresponds
adomain n = dom p and a codomain
n’
= cod p: theimage
pn Cn’
is the set ofpoints
px E n’ such that x E n. It isimportant
in all that follows not to confuse theimage
with the codomain. Two maps p: n -~ n’ and n’ -~ n"are said to be
composable
ifdom 03C8
= cod p, in which case thecomposition
~~cp:
n -~ n" is also a map. Eachidentity
map n -~ n isevidently injective,
and the
composition
ofcomposable injective
maps is alsoinjective.
The setI of
injective
maps on finite sets constitutes acategory, containing
eachidentity
and closed undercomposition.
A contravariant
representation
of Iby surjective
linear maps is a func-tion,
here denotedby 03C6 ~ 5p*,
that associates with each 1-1 map (/?: m ~ n in I asurjective
linear mapVm
-~Vn
on vector spaces in such a way thatidentity
andcomposition
arepreserved.
That is to say, theidentity
map1 : n --~ n is carried to the
identity
linear transformation 1 * :Vn
-~Vn . Also,
~:
n --~ n’ is carried to~* : Vn~
--Vn,
and thecomposition
m ~ n’ towhich is the
composition
ofimages
in reverse order.The standard zero-order
representation
of l associates with each cp: m -~n in
I,
theidentity
map -- TZ. The first-order standardrepresentation
of I associates with each cp: m ~ n in
I,
thesurjective
linear transformationcp*
: Rn ~ Rmby
functionalcomposition p* f
=f
o p. Thecomponents
of the transformed vector are(p* f )2
=f~(i),
which is a selectedsubset,
orsample,
of theoriginal components.
Our interest is in squarematrices,
andspecifically
in vector spaces of square matrices.Accordingly,
the second- order standardrepresentation
of Z~ associates with each p: m --~ n inI,
the
surjective
linear transformationcp*
-- alsoby
functionalcomposition
on square matrices as described and illustrated in the
preceding
section.The action of an
injective
map on a table ormulti-way
array of num-bers is to
select, permute
and re-label factorlevels,
forexample by selecting
varieties in a
variety
trial or treatment levels in a horticulturalexperiment.
Statistical
sampling
is an instance of the action of aninjective
map, in fact arandomly
choseninjective
map whoseimage
is the set ofsampled
units.
Ordinarily,
in factorialdesigns
theinjective
maps actindependently
on the levels of each factor. For a statistical
design
with klogically
unre-lated
factors,
a linear model issub-representation
of theproduct category Z~
in the kth-order tensorproduct
of first-orderrepresentations.
The struc-ture of these
sub-representations
isprecisely
that of the factorialmodels,
also called hierarchical interaction
models,
in 1-1correspondence
with thefree distributive lattice on k
generators. But,
forhomologous
factors(Mc- Cullagh, 2000),
the action on the rows is the same as the action on thecolumns,
and the matrices are square. That is to say, the structure of thesub-representations
in the second-order standardrepresentation
of I is dif- ferent from thesub-representations
of theproduct category I2
in the tensorproduct
of first-orderrepresentations.
Forexample, symmetry
and skew-symmetry
andquasi-symmetry
aresub-representations,
but these are notfactorial models.
2.4. Orbits
The
symmetric
groupSn
acts on nby permutation.
The action is said tobe transitive
because,
to each orderedpair
ofpoints (i, j )
therecorresponds
a
permutation
7r ESn
such that~r (i) = j.
Thesymmetric
groupSn
also actson the square array n x n
by permutation,
the samepermutation applied
to rows as to columns. In other
words,
the orderedpair (i, j )
is carried to(7r(i),7r(j)).
. This action has twoorbits,
thediagonal
elements(i, i),
andthe
off-diagonal
elements(i, j )
for i~ j
In thecategory
ofinjective
maps, eachinjective
map p: m ~ n also preserves group orbits. Thediagonal
orbitdiag(m2)
is carriedinjectively
todiag(n2),
andsimilarly
for theoff-diagonal
orbit. The orbit
partition implies
that the second-order standardrepresenta-
tion of I may bedecomposed
as the direct sum of twosub-representations,
the real-valued functions on the
diagonal
orbit and the real-valued functionson the
off diagonal
orbit.Ordinarily,
there is noguarantee
that group orbits arepreserved by
thenon-invertible maps in the
category.
Forexample
in thecategory
of all mapson finite ordinal
sets,
the invertible maps are the finitesymmetric
groups.So the group orbits are the
diagonal
and theoff-diagonal.
But thispartition
is not
preserved by non-injective
maps: an orderedpair (i, j)
with i7~ ~
may be carried to an orderedpair
for whichcp(i)
=cp( j )
. On the otherhand, diagonal
elements cannot be sent tooff-diagonal elements,
so thediagonal
is a
sink,
not an orbit.2.5.
Sub-representation
Let cp: m -~ n be a
generic injective
map, and letnn2
-~nm2
be theimage
map in the standardrepresentation.
Within thisrepresentation
theremay exist a sequence of
subspaces Vn
CRn2
suchthat,
for each p: m ~ n, the linear transformation -~ also satisfiesp*Vn
=Vm .
Thenthe restriction of
p*
to thesubspaces Vn
also constitutes arepresentation
ofI
by surjective
linear maps on vector spaces.By
aslight
abuse of termino-logy,
we say that V is asub-representation
in the standardrepresentation,
the maps
being
determinedby
restriction.Several
sub-representations
havepreviously
been indicated. Theparti-
tion of
n2 by
orbitsimplies
that the second-order standardrepresentation
contains two
complementary sub-representations,
the real-valued functionstaking
the value zero on thediagonal orbit,
and the real-valued functionstaking
the value zero on theoff-diagonal
orbit. We now introduce a model-formula notation
capable
ofdescribing
these and some others in a modera-tely
convenient manner.The definition for A is to be read in the
following
manner. To eachinteger
nthere
corresponds
a vectorsubspace An
Cnn2 consisting
of n x n matricesf
such thatfij
= ai for some vector a E The asterisk denotes restriction to theoff-diagonal orbit; diag()
denotes restriction to thediagonal
orbit. Foreach
77 ~ 0,
the vector spacesAn
andBn
have dimension n. For each 77 ~2,
the vector spaces
An
andBn
also have dimension n. Therepresentations
are
isomorphic,
as are A* ^-_’ B* . Therepresentations A
and A* arealmost,
but notquite, isomorphic.
2.6.
Group representations
The set of invertible maps n -t n in I is the
permutation
group, and therepresentation theory
for finite groups is wellunderstood;
see, forexample,
James and Liebeck
(1993)
or Diaconis(1988).
Since the restriction of eachI-representation
to n isnecessarily
a grouprepresentation,
webegin
ourstudy
ofI-representations by studying
the grouprepresentations
that occurin the standard
representation.
Thegreat majority
of grouprepresentations
are not inherited under
restriction,
and thus do not extendnaturally
toI-representations.
Our task is toidentify
those that do extendnaturally.
A
representation
ofSn
associates with eachpermutation
7r a linear trans- formation 7r* : in such a way thatidentity
andcomposition
are pre- served. In the first-order standardrepresentation, V
= TZn and 7r* is theusual n x n matrix
representation by
linear transformations thatpermute
coordinates. Thisrepresentation decomposes
into two irreducible represen-tations,
the trivial one-dimensionalrepresentation In
C TZnby
constantfunctions,
and acomplementary representation 1 n by
vectors whose com-ponents
add to zero. The restriction of1 ~
to a subset of m ncomponents
isequal
to1m,
so the sequenceIn
C is asub-representation
of l.However,
the restriction of
1~-
isequal
to not to1~. Thus,
the first-order stan- dardI-representation
containsexactly
onesub-representation,
but thereis no
complementary sub-representation.
In thisrespect,
the structure ofsub-representations
of I isfundamentally
different from therepresentation theory
for finite groups: it is notsemi-simple.
In the second-order standard
representation
V = and the linear transformation 7r* actsby composition (7r* f)ij
= The groupprod-
uct 03C003C6 is carried to
(03C003C6)
* =reversing
the order ofcomposition.
Character
theory
for finite groups(James
and Liebeck1993)
enablesus to
identify
irreduciblesub-representations
that occur in agiven
repre- sentation. In the case of thesymmetric
group to eachpartition
of thenumber n there
corresponds
a distinct irreduciblerepresentation,
and con-versely,
so each irreduciblesub-representation
is associated with apartition
such as
(n), (n - 1,1 ), (n - 2, 2), (n - 1,1,1 )
and so on. Thedecomposition
of the first-order
representation
is(n)
(B(n - 1,1)
in which(n)
is the tri-vial
representation
and(n - 1,1 )
is the(n - 1 )-dimensional representation.
For the standard second-order
representation
ofSn by
squarematrices,
thedecomposition by sub-representations
is as follows.The first line states that the vector space of square matrices can be
expressed
as the direct sum of three
subspaces,
thediagonal matrices,
the symme- tricoff-diagonal matrices,
and theskew-symmetric
matrices.Further,
thesethree
subspaces
arepreserved
under coordinatepermutation,
which is to say that each is asub-representation
of,S’n .
None of thesesub-representations
is irreducible. The
diagonal sub-representation
contains a one-dimensionalsub-representation by
constant functionsplus
acomplementary (n - 1)-
dimensional
sub-representation consisting
of functions whosecomponents
sum to zero. The
symmetric off-diagonal representation
contains a one- dimensionalrepresentation by
constantfunctions,
asub-representation
con-sisting
of functionssuch that
L
exi =0,
and acomplementary sub-representation
The dimensions are
1, n -1
andn(n - 3)/2.
Thealternating representation
contains two
sub-representations,
oneconsisting
of functionsand a
complementary sub-representation
The dimensions are and
(n -1 ) (n - 2 ) /2.
All dimensions arenecessarily non-negative,
so theexpression n(n - 3)/2
isinterpreted
as zerofor n
3.It is
straightforward
toverify
that each of thesesubspaces
is closed under coordinatepermutation.
The marvel of charactertheory
is that it enablesus to determine whether or not a
given representation
is reducible. These calculations are standard andclassical,
so details are omitted. For?~ ~ 3,
the standard second-orderSn-representation
contains sevenirreducibles,
in which the trivial
representation (n)
hasmultiplicity two, (n - l,1 )
hasmultiplicity three,
and theremaining
two havemultiplicity
one each.2.7. Inheritance and
I-representations
A
sub-representation
of I in the standardrepresentation
is a sequence ofsubspaces Vn
Cnn 2
such that.
(i) Vn
is arepresentation
of thesymmetric
groupS’n,
closed under coordinatepermutation;
.
(ii)
The restriction of functionsf
inVn
to theleading
m x msub-array
is
equal
toVm.
Thus,
eachcomponent
of anI-representation
is also a group represen-tation,
butonly
certain grouprepresentations
can occur as acomponent
inan
I-representation.
A
representation V
that contains nosub-representation
other than itself and the zerorepresentation
is called irreducible. For thecategory I
ofinjec-
tive maps, or for the
product category Ik,
suchrepresentations
are ratherrare, so this
concept
is notespecially
useful. Arepresentation
that is ex-pressible
as the direct sum 0 W of twocomplementary
non-zero sub-representations
is calleddecomposable. Conversely,
arepresentation
that isnot
expressible
in this form is calledindecomposable.
The standard first- orderI-representation
isindecomposable,
but it is not irreducible because it contains the trivial one-dimensionalsub-representation.
Arepresentation
that is
expressible
as thespan V
= U + W of twosub-representations,
notequal
to zero orV,
is calledweakly decomposable. Although
V n W is alsoa
sub-representation,
there need not exist a direct-sumdecomposition.
Thefactorial model ~4 + B is a
weakly decomposable I2-representation
in whichA n B is the trivial one-dimensional
sub-representation.
There is no direct-sum
decomposition,
so thisrepresentation
isI2-indecomposable.
It is evident that the direct-sum
decomposition
is
preserved
notonly by permutation
but alsoby
restriction to subsets. This observationimplies
that the standardI-representation decomposes
as thedirect sum of three
sub-representations.
Thisdecomposition
isunique only
in the sense of
isomorphism.
There exist alternativedirect-sum-decomposi-
tions in which
diag(Rn2)
isreplaced by
anisomorphic sub-representation
such as
A,
B orsymadd(A, B). Despite
this lack ofuniqueness,
we focus ini-tially
on thesub-representations
that occur in each of these threerepresenta-
tions. On account ofisomorphisms, however,
there existsub-representations
of the standard second order
representation
that are not in any of these threecomponents. Many
of these areindecomposable,
and some are irreducible.The method used to construct a
sub-representation proceeds
as follows.For some
large
n, choose a group irreducible C Then act onv(n) by
restriction to theleading
m x msub-array giving
a vector space which isnecessarily
asub-representation
of the groupSm
in 7Z"z .The sequence of vector spaces is a
sub-representation
of thecategory
ofinjective
maps on finite sets ofcardinality ~
n. To extend this finite sequence to an infinite sequence, it is necessary tobegin
at n = oo,whatever that
might
mean, and to define the sequenceby
restriction to finite squares. Therepresentation generated
in this manner may containa
sub-representation,
i.e. it may not beirreducible,
but it isnecessarily indecomposable,
notexpressible
as the vector span of two non-zero sub-representations.
To see how this
works,
letv(n)
=(n)
Cdiag(Rn2 )
be the trivial grouprepresentation by
constantdiagonal matrices, multiples
of the iden-tity.
The restriction ofv(n)
to theleading
m x msub-array
isequal
toC
diag(Rm2 ),
which is alsoSm-irreducible. Thus,
the constant dia-gonal
functionsld
constitute a trivialI-representation.
On the otherhand,
if we let
V~n> ~--" (n - 1,1)
C be thecomplementary
group ir- reducibleconsisting
ofdiagonal
matrices whose elements sum to zero, the restriction to a proper m x msub-array
isequal
to the set of all m x /?7diagonal
matrices. The zero-sum condition is notpreserved by
restriction to subsets. In otherwords,
the restriction of theSn-irreducible (n - 1,1)
isequal
to(m) ® (m -1,1 ),
which isclearly
not irreducible. That is to the sequence of vectorsubspaces diag n
C is the smallestZ-representation
that includes the group irreducibles
(n - 1, 1 )
C This represen-tation contains a one-dimensional trivial
sub-representation,
but there does not exist acomplementary sub-representation.
In the same way, we find that
sym*
and alt areindecomposable
I-representations containing sub-representations
as follows:For
?~ ~ 2, 1~
is the one-dimensional vector space of constantoff-diagonal
matrices.
Likewise, symadd~
is the vectorsubspace consisting
of matricesexpressible
in the formfor some a E
Equivalently, symadd~
is theimage
of the linear trans- formation RnRn 2
as defined above. The dimension is zerofor n 1,
one for n =
2,
and n forn
3.Finally, altaddn
is a vector space of di- mension n - 1consisting
of additiveskew-symmetric
matrices of the form a2 - Themappings
Rn -Rn2
definedby (gn03B1)ij
= cxi + 03B1j for i~ j
and = aj are the
components
of twohomomorphisms
of thefirst-order
representation
into the second-order standardrepresentation.
2.8. General
sub-representations
When
applied
to a groupirreducible, v(n),
theprojection
=is a
representation
ofSm
in The sequence~V~n~ ~
is arepresentation
of the
category by surjective
linear maps. Whenapplies
to the directsum of
Sn-irreducibles,
theprojection yields
in which the
image
spaces may have non-zero intersection. Since we even-tually
let n go toinfinity,
it is sufficient to consideronly
the sequencesgenerated by
restriction ofSn-irreducibles.
Suchrepresentations
are neces-sarily indecomposable
in both senses. All othersub-representations
of I inthe standard
representation
areexpressible
as the span ofindecomposables.
be
arbitrary
real numbers. Consider theSn-irreducible consisting
of vectors
f
suchthat,
for some ct E1 n
For ?~ ~
3,
and for each8, g5,
this irreducible has dimension n - 1 and isisomorphic
withln ^--’ (n - 1,1). If § =
0 and 8 =-Tr/4,
the restric- tion to m n elements is an irreducibleSm-representation consisting
ofskew-symmetric
matrices of the formand
this sequenceevidently
constitutes an irreducible
I-representation.
For every other value of8, 03C6,
the restriction is a
representation
of8m
of the same formexcept
that thezero-sum restriction on a is not conserved. These
representations
are almostisomorphic
with the first-order standardrepresentation,
and are indecom-posable
but not irreducible.The unusual
qualifier
’almost’ is inserted here for thefollowing
reason.Let I>
= 0 and 8 =7r/4,
so thatfij
= ai+ aj
for i# j.
. The dimension of thisrepresentation
is zero for n =1,
one for n = 2 and n for each n >3,
whereas the dimension of the first-order standardrepresentation
isn. On dimensional
grounds alone,
therepresentations
cannot beisomorphic
in the orthodox sense.
However,
if we restrict thecategory
I to sets of sizeat least
three,
the truncatedrepresentations
are indeedisomorphic.
Thephrase
’almostisomorphic’
is understood in this truncated sense.To each
real I>
and-1r /2 8 # Tr/2
therecorresponds
anindecompos-
able
I-representation V8,4>.
For(0, Ø) =1= (8’, 4/)
therepresentations V8,4>
andV9~~
mayoverlap.
Forexample, if ø
=1,
therepresentations corresponding
to 8 = 0 and 8 =
Tr/2
are thesubspaces fij
= ai andfij
= aj,conventionally
denoted
by
A and B. The intersection is the one-dimensional trivial repre- sentation.However,
therepresentation
A+ B
is in factI-decomposable
inthe
strong
sense because it can beexpressed
assymadd
0 altadd. The span of any subset of theV84>
isevidently
adecomposable representation,
butany subset of three distinct V’s is sufficient for a basis.
This
completes
thestory
for the threeisomorphic Sn-representations.
If we do the same
thing
for therepresentation (n - 1,1,1 ), with n >
3we find that the
projection
for m n is the space of allskew-symmetric
matrices. This
representation
isI-indecomposable,
but it contains the ad- ditivesub-representation
of dimension n - 1.Likewise,
theprojection
of thesymmetric representation (n - 2, 2)
is the space ofsymmetric off-diagonal matrices,
which contains twosub-representations. Finally,
the two trivialsub-representations
1 and 1 * arenon-overlapping
and almostisomorphic.
2.9.
Symmetry
It is
possible
to reduce the number ofrepresentations by adding
fur-ther
symmetry
conditions such as invariance under the two-element group52
that actsby switching
rows with columns. While this invariance mayseem
appealing
onpurely algebraic grounds,
I haverarely
seencompelling arguments
for it in statisticalapplications. Nonetheless,
it is worthwhileexplaining why
certainI-representations
in the standardrepresentation
donot extend to
representations
of I x~2.
The inversion in the group
62
carries(i, j )
to(jay, i),
so the orbitdecompo-
sition is unaffected. Since
fij
is carried tofji,
thedecomposition by diagonal, symmetric
andskew-symmetric
matrices is unaffected. The group62
actsas the
identity
on thesymmetric matrices, ( f T
=f ),
but the inversion actsas the
negative identity
onskew-symmetric matrices, ( f ~’
= -f).
. Conse-quently,
the additivesymmetric sub-representation, fij
= ctj+ aj
is carriedto itself element-wise. In the additive
skew-symmetric representation,
how-ever, the inversion carries
fij
= ctj - ajto - f,
so theserepresentations
areno
longer isomorphic.
Ineffect, 62 splits
the threepreviously isomorphic representations
into twoisomorphic representations
and one other. The in- version in62
carries thesubspace Ag
={/
= cxi COS () + ajsin 9~
to~r/2-~
so
Ao
is arepresentation only
for 8 =-1r / 4
and B =Tr/4.
Inparticular,
therow factor A =
Ao
and the column factor B =A7r /2
are notrepresentations ofTx?2.
However~1+jB ~ A7r /4
EBA-7t" / -1
is adecomposable representation.
Quasi-symmetry
andquasi-skew-symmetry
are bothdecomposable
sub-representations of I x 52 :
3.
Contingency
tables3.1. .
Aggregation
of levelsThe discussion in section 2 relates to
representation-theory
forinjective
maps, the idea
being
that the form of themodel,
i.e. the modelformula,
should be unaffected
by permutation
and selection of factor levels. This condition seemsfairly compelling
in certaincircumstances, particularly
inconnection with
explanatory
factorshaving
unordered levels. For contin- gencytables, however,
certain factors areexplanatory
whereas others areresponses. It is
extremely important
to make this distinction clear because the relevantalgebraic operations
on response levels are not the same as theoperations performed
on the levels of anexplanatory
factor. The levels of aresponse factor may be
aggregated,
but the sameoperation
does not makesense for
explanatory
factors unless the levelsbeing aggregated
areequiva-
lent in their effect. Selection of response levels may also be
appropriate
whenwe talk of conditional distributions. For these reasons, it is
appropriate
inconnection with
contingency
tables todevelop
therepresentation theory
forall maps,
including aggregation
andselection, along
the lines of section 2.To illustrate what is
involved,
consider a response factorinitially having
levels denoted
by SZ
={a, b, c},
in which levels{a, b~
areaggregated
to forma new