A NNALES DE LA FACULTÉ DES SCIENCES DE T OULOUSE
O LE E. B ARNDORFF -N IELSEN
P ETER E. J UPP
Statistics, yokes and symplectic geometry
Annales de la faculté des sciences de Toulouse 6
esérie, tome 6, n
o3 (1997), p. 389-427
<http://www.numdam.org/item?id=AFST_1997_6_6_3_389_0>
© Université Paul Sabatier, 1997, tous droits réservés.
L’accès aux archives de la revue « Annales de la faculté des sciences de Toulouse » (http://picard.ups-tlse.fr/~annales/) implique l’accord avec les conditions générales d’utilisation (http://www.numdam.org/conditions).
Toute utilisation commerciale ou impression systématique est constitu- tive d’une infraction pénale. Toute copie ou impression de ce fichier doit contenir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
- 389 -
Statistics, Yokes and Symplectic Geometry(*)
OLE E.
BARNDORFF-NIELSEN(1)
and PETER E.
JUPP(2)
Annales de la Faculté des Sciences de Toulouse Vol. VI, n° 3, 1997
RÉSUMÉ. - Nous établissons et étudions un lien entre jougs et formes symplectiques. Nous montrons que les jougs normalises correspondent a certaines formes symplectiques. Nous présentons une méthode pour construire de nouveaux jougs à partir de jougs donnés. Celle-ci est motivée en partie par la dualite entre la formulation hamiltonienne et la formulation lagrangienne de la mécanique conservative. Nous nous
proposons quelques variantes de cette construction.
MOTS-CLfS : Application moment, hamiltonien, joug de vraisemblance
esperee, joug de vraisemblance observée, lagrangien, sous-variété lagran- gienne, tenseurs.
ABSTRACT. - A relationship between yokes and symplectic forms is
established and explored. It is shown that normalised yokes correspond
to certain symplectic forms. A method of obtaining new yokes from old is given, motivated partly by the duality between the Hamiltonian and
Lagrangian formulations of conservative mechanics. Some variants of this construction are suggested.
KEY-WORDS : expected likelihood yoke, Hamiltonian, Lagrangian sub- manifold, momentum map, observed likelihood yoke, tensors.
AMS Classiflcation : 58F05, 62E20
(*) Reçu le 10 juillet 1995
(1) Department of Mathematical Sciences, Aarhus University, Ny Munkegade, DK-
8000 Aarhus C (Denmark)
(2) School of Mathematical and Computational Sciences, University of St Andrews, North Haugh, St Andrews KY16 9SS (United Kingdom)
1. Introduction
In the
differential-geometric approach
to statisticalasymptotics
a centralconcept
is that of ayoke, key examples being
the observed andexpected
likelihood
yokes.
A
yoke
on a manifold M is a real-valued function on the"square"
M x Mof
M, satisfying
conditions(2.1)
and(2.2)
below(Barndorff-Nielsen [7]).
Yokes which are restricted to be
non-negative
and are zeroonly
on thediagonal
are known as contrast functions. For uses of contrast functions in statistics see, e.g.,Eguchi [16]
andSkovgaard [24].
Inapplications
ofyokes
to statistical
asymptotics
we areparticularly
interested in the values of theyoke
near thediagonal 0~
of M xM, ( c~
A suitable
neighbourhood
ofAj~
in M x M can beregarded
as the totalspace of the normal bundle of
0~
in M x M and this normal bundle isisomorphic
to thetangent
bundle TM of M.Furthermore,
ayoke
on Mdetermines a
(possibly indefinite)
Riemannianmetric,
which can be used toidentify
thetangent
bundle TM with thecotangent
bundle T*M. .One of the main contexts in which
cotangent
bundles occur is in conser- vative mechanics(see
e.g., Abraham and Marsden[I],
and Marsden[19]),
where
they
are known as thephase
spaces. Thegeometrical concept
isthat of a
symplectic
structure. A survey ofsymplectic geometry
and itsapplications
isgiven by
Arnol’d and Givental’[3].
The role of Hamiltonians in conservative mechanics and their relation to
exponential
familiessuggest
thatsymplectic geometry
may have a natural role toplay
in statistics. It is relevant here also to mention the work of Combet[13]
which discusses certain connections betweensymplectic geometry
andLaplace’s
method forexponential integrals.
In view of the results to be described below we think it fair to say that there exists a natural
link,
via the concept ofyokes,
betweenstatistics
and
symplectic geometry.
How useful this link may be seems difficult toassess at present. Other links have been discussed
by
T. Friedrich andY. Nakamura. Friedrich
[17]
established some connections betweenexpected (Fisher)
information andsymplectic
structures.However,
as indicatedin Remark
3.4,
hisapproach
and results arequite
different from those considered here. Nakamura([22], [23])
has shown that certainparametric
statistical models in which the parameter space M is an even-dimensional vector space
(and
so has thesymplectic
structure of thecotangent
space ofa vector
space) give
rise tocompletely integrable
Hamiltoniansystems
onM. In contrast to
these,
the Hamiltonian systems considered here arise in the moregeneral
context ofyokes
onarbitrary
manifolds M and are definedon some
neighbourhood
of0~
in M x M.Section 2 reviews the necessary
background
onyokes, symplectic
struc-tures and mechanics. In Section
3,
we show that everyyoke
on a manifoldM
gives
rise to asymplectic form,
at least on someneighbourhood
of thediagonal
of M xM,
and in someimportant
cases on all of M x M.By passing
to germs,i.e., by identifying yokes
orsymplectic
forms which agree in someneighbourhood
of thediagonal,
we show that normalisedyokes
arealmost
equivalent
tosymplectic
forms of a certaintype. (However,
fromthe local
viewpoint
there isnothing special
about thosesymplectic
formswhich are
given by yokes,
since Darboux’s Theorem shows thatlocally
ev-ery
symplectic
form can be obtained from someyoke;
seeExample 3.1.)
InSection
4,
we consider some uses of thesymplectic
formgiven by
ayoke.
There is an
analogy
between conservative mechanics and thegeometry
ofyokes
in thatsymplectic
formsplay
animportant part
in both areas. In Section5,
thisanalogy
is used to transfer theduality
between the Hamilto- nian andLagrangian
versions of conservative mechanics to a construction forobtaining
from any normalisedyoke g
anotheryoke g,
called the La-grangian
of g, which has the same metric as g andis,
in some sense, close to the dualyoke g*
of g.Section 6 considers Lie group actions on the manifold and the associated momentum map, which takes values in the dual of the
appropriate
Liealgebra.
An outline version of some of the material
presented
in this paper isgiven
in Barndorff-Nielsen and
Jupp [11].
.2. Preliminaries
2.1 Yokes
Let M be a smooth manifold. We shall sometimes use local coordinates
(cv 1,
... ,wd)
on Mand, correspondingly,
local coordinates ... ,wd;
;w’ l ,
..., on M x M.Furthermore,
we use the notationsFor a function on M x
M,
thecorresponding gothic
letter will indicate restriction of that function to thediagonal,
sothat,
e.g.,In coordinate terms a
yoke
on M is defined as a smooth function g : M x M -~ M such that for every w in M:The coordinate-free definition of a
yoke
is as follows. For a vector fieldX on
M, define
the vector fields X and X ’ on M x Mby
X =(X, 0)
andX ~ _
(o, X), i.e.,
>where pk : M x M - M is the
projection
onto thekth
factor.Then,
forvector fields X and Y on
M,
we define~ Y)
M --~ Rby
A
yoke
on M may now be characterised as a smoothfunction g :
M x lV~ -~ Rsuch that:
(i) Xg(v, c~)
= 0 for all w inM,
(ii)
the(0, 2)-tensor (X, Y) ~
isnon-singular.
An alternative way of
expressing (i)
and(ii)
is that ona~:
(i)
(ii) d1 d2g
isnon-singular,
where
d1
andd2
denote exterior differentiationalong
the first and second factor in M xM, respectively.
Example
The
simplest example
of ayoke
is the function g definedby
For all
03C9’,
the function 03C9 ~g(03C9,03C9’)
has a(unique)
maximum =03C9’,
so that
(2.1)
holds. Since~gi~~ (w)~
is theidentity matrix, (2.2)
holds. Somegeneralisations
of thisexample
aregiven
inExamples
5.1-5.3.Two of the most
important geometric objects given by
ayoke
are a(possibly indefinite)
Riemannian metric and a one-parameterfamily
oftorsion-zero affine connections. The metric is the
(0,2)-tensor (X, Y) ~ ( ~’), given
in coordinate formby
the matrix(gi~~~.
For a in?,
thea
a-connection ~ is defined
by
0
where V is the Levi-Civita connection of the metric and T is
given by g(TxY ( Z)(W )
=X Y’ Z’g(c~, c~)
- Y ZX ’g(cv, w)
.a
Note that T is a
( 1, 2)-tensor.
The lowered Christoffelsymbols
of V at ware
given by
and the
(0,3)-tensor corresponding
to T is the "skewness" tensor with elementsFor
applications
to statistics of the metric and the a-connections ofexpected
and observed likelihood
yokes,
see Amari[2],
Barndorff-Nielsen([6], [8])
andMurray
and Rice([21]).
A normalised
yoke
is ayoke satisfying
the additional conditionExcept
whereexplicitly
statedotherwise,
we shall consideronly
normalisedyokes.
For anyyoke
g, thecorresponding
normalisedyoke
is theyoke 9
defined
by
and the dual
yoke
is theyoke g*
definedby
Note that
g*
isnecessarily
a normalisedyoke.
In the statistical context the two
important examples
of normalisedyokes
are the
expected
likelihoodyoke
and the observed likelihoodyoke.
For aparametric
statistical model withparameter
spaceM, sample space X
andlog-likelihood
function £ : M x X -~ M theexpected
likelihoodyoke
on Mis the
function g given by
Given an
auxiliary
statistic a such that the function .c ~a)
isbijective,
the observed likelihood
yoke
on M is defined as thefunction g given by
For more about
expected
and observed likelihoodyokes
see Barndorff-Nielsen
([7], [8]), Chapter
5 of Barndorff-Nielsen and Cox[9]
and Blaesild[12].
Differentiation of
(2.1) yields
theimportant
resultThis shows that the matrix is
symmetric,
a property which is not obvious at firstsight. Similarly,
differentiation of(2.5)
shows that for anormalised
yoke g
we haveDifferentiation of
(2.7) yields
Similarly,
for a normalisedyoke
g, we haveDifferentiation of
(2.10)
and(2.11) yields
Formulae
(2.7)
and(2.10)-(2.13)
arespecial
cases of thegeneral
balancerelations for
yokes.
See formula(5.91)
of Barndorff-Nielsen and Cox[9].
For the
applications
so far made ofyokes
to statisticalasymptotics
theimportant part
of ayoke
is its germ round thediagonal,
or evenjust
itsoo-jet
at thediagonal.
Inparticular,
thisoo jet
has been used to construct the associated tensors and affine connections.See,
forinstance,
Blaesild[12]
and references
given
there.(Recall
that two functions on M x M have thesame germ at
A~-
ifthey
agree on someneighbourhood
ofA~-
and thattwo such functions have the same
oo jet
atA~-
if at eachpoint
of0~
theirderivatives of any
given
orderagree.)
By using partial maximisation,
it ispossible
to define aconcept
of"profile" yoke.
In Remark 3.3 we shall discussbriefly
how thisconcept
fitstogether
with the idea of thesymplectic
form of ayoke,
anotherconcept
to be introduced in Section 3.
2.2
Symplectic
structuresLet N be a manifold of dimension k. A
symplectic
structure orsymplectic form
?y on N is anon-singular
closed 2-form onN,
i.e.:(i)
?y is a 2-form(skew-symmetric (0, 2)-tensor)
onN;
(ii)
at eachpoint n
ofN,
the mapTnN
~Tn N given by
X ~ X~is
nonsingular (where
X~ denotes the 1-form definedby
=Y)
for Y inTnN);
(iii) d??
= 0.The standard
example
of asymplectic
form ison
where
... ,xd
are the standard linear coordinates obtained from a base of and y1, ... yd are the coordinates on obtained fromthe dual base.
Locally,
this is theonly example
of asymplectic form,
becauseDarboux’s Theorem
(e.g.
Abraham and Marsden[1,
p.175])
states that a2-form ~
on N is asymplectic
form if andonly
if round eachpoint
of Nthere is some coordinate
system
...,xd,
...,yd)
in which(2.14)
holds. Note that this
implies
that k isnecessarily even, k
=2d,
and thatthe d-fold exterior
product
is nowhere zero and so is a volume on N.
Formula
(2.14)
can be used on manifolds moregeneral than Rd.
Ifx1,
...,xd
are coordinates on a manifold M and y1, ..., yd arecorrespond- ing
dual coordinates on the fibres of T*M then(2.14)
defines a canonicalsymplectic
form r~ on the total space T*M of thecotangent
bundle of M.The
symplectic form q
can also be derived from the canonical 1-form00
onT*M
given by
for every v in TT*M. Here TT* M : TT*M -~ T*M denotes the
projection
map of the
tangent
bundle of T*M and : TT*M - T M denotes thetangent
map ofTM
: T * M ~ M. . In terms of the coordinates...,
xd,
y1, ...,yd)
onT*M,
the canonical 1-form isRemark 2.1.2014
Aspects of symplectic
structures such as:(i) existence, i.e., determining
when agiven
closed 2-form can be deformed into asymplectic structure,
(ii) classification, i.e., determining
when twosymplectic
forms areequiv- alent,
(iii)
thestudy
ofLagrangian
manifolds(submanifolds
of maximal dimen- sion on which the restrictionof ~
iszero),
appear to be
mainly
ofmathematical/mechanical
interest and not of any direct statistical relevance(in
thisconnection,
see e.g. McDuff[20]
andWeinstein
[25]-[26]).
2.3 Hamiltonians and
Lagrangians
There are two formulations of conservative mechanics: the Hamiltonian formulation takes
place
on thecotangent
bundle and uses the canonicalsymplectic
form whereas theLagrangian
formulation takesplace
on the
tangent
bundle and leads to the use of second order differentialequations.
It ispossible
to pass from one formulation to the otherby
means of fibrewise
Legendre transformation, i.e., performing Legendre
transformation in each
cotangent
space ofM. Moreprecisely,
let H bea Hamiltonian on
M, by
which in the present context we mean a real-valued function on T*M. . Then the fibre derivative FH of His just
the derivative of Halong
the fibre. In terms of the coordinates(~ 1,
...,xd,
, ...Yd)
on T* M used in
(2.16)
to define?o,
where ... ,
xd
arekept
fixed.Suppose first,
forsimplicity,
that H ishyperregular, i.e.,
FH : T*M - TM is adiffeomorphism.
Then thefibrewise
Legendre
transformation of H is thefunction ~ :
TM 2014~ M definedby
where a is defined
by
= v. In coordinate terms,where
The
function if
isusually
called theLagrangian corresponding
to theHamiltonian H.
In the context of
yokes
we need to relax the condition ofhyper-regularity
and assume
just
that FH is adiffeomorphism
from someneighbourhood
Uof the zero section of T*M to a
neighbourhood FH(U)
of the zero sectionof T M . Then we can define
H
onF H ( U ) by (2 .17) .
3. The
symplectic
formgiven by
ayoke
A
yoke yields
asymplectic
form. Letyoke
on M.Then differentiation
of g along
the first copy of Myields
amapping
given
in coordinate termsby
If g
is the observed likelihoodyoke (2.6)
of aparametric
statistical modelthen 03C6
isessentially
the score. For ageneral yoke
g, the2-form ~
on M x Mis defined
by
i.e.)
as thepull-back
to M x Mby p
of the canonical 2-form onT*M,
,so that for
tangent
vectorsX, Y
at some commonpoint
of M xM,
where
Tp
denotes the derivative of p.Let
(c,~ 1,
... ,w d )
be local coordinates on M. .Then, taking (c,v 1,
... ,c~ d )
as coordinates on the first factor in M x M and
(c~‘ 1, ... , w‘d)
as coordinateson the second
factor,
the coordinateexpression for q
isIt follows from
(2.2)
that there is aneighbourhood W
of thediagonal
ofM x M on which ~ is
non-singular. Thus ~
is asymplectic
form on W. .In
fact,
for alarge
class of statistical models theexpected
and observedlikelihood
yokes g
have matrices[9i;j]
of mixedpartial
derivatives which arenon-singular everywhere
on M xM,
so that thecorresponding symplectic forms ~
are defined on all of M x M. Note from(3.2) that ~
isspecial
in
containing
no termsinvolving dW2
A or dw~ A . Thisspecial
feature is the basis of the
characterisation,
in Theorem 3.1below,
of thosesymplectic
forms which arise fromyokes.
Example
3.1For the
yoke
considered inExample 2.1,
thecorresponding symplectic
form r~ is the canonical
symplectic
form ongiven by (2.14).
It followsfrom Darboux’s Theorem that every
symplectic
form ariseslocally
fromsome
yoke.
Because,
ingeneral,
r~ isnon-singular only
on someneighbourhood
ofrather than on all of M x
M,
it isappropriate
to consider germs ofsymplectic
forms. Twosymplectic forms,
each defined on aneighbourhood
have the same germ
at 0394M
ifthey
agree in someneighbourhood
ofSimilarly,
twoyokes
on M have the same germ at0~
ifthey
agree in someneighbourhood
of It follows from(3.2)
that the germ~r~~
of r~depends only
on the germ[g]
of g. Thus we have a functionfrom the space of germs of
yokes
on M to the space of germs ofsymplectic
forms around
As mentioned in Remark
2.1,
a submanifold L of asymplectic
manifoldN is called
Lagrangian
if(i)
=0,
where z : : L ~ N isthe
inclusionand ~
is thesymplectic
form on
N,
(ii)
dim N = 2 dim L .It is useful to call a
symplectic
form on aneighbourhood
W ofAM
in M x M
hv-Lagrangian (or
horizontal and verticalLagrangian)
if all thesubmanifolds
~~w~
xM)
n Wand(M
x~w~~)
n WareLagrangian, i.e.,
if theprojections
pi : : W -~ M and p2 : W - M areLagrangian
foliations in thesense of Arnol’d and Givental’
[3,
p.36].
We shall call asymplectic
formr~ on W
dhv-Lagrangian (or diagonally-symmetric
horizontal and verticalLagrangian)
if it ishv-Lagrangian
and satisfies the symmetry conditionfor all
tangent
vectorsX, Y
at the samepoint
of M.THEOREM 3.1.- Let
[Y](M),
and[dhv L](M)
denote the spacesof
germsof yokes
onM, of
germsof
normalisedyokes
onM,
andof
germs
of dhv-Lagrangian symplectic forms
aroundrespectively.
Thenthe
function given by (3.3)
has thefollowing properties:
(i)
~ :[yJ (M)
--~[dhv L](M);
~ [9]
_ ~ ~(iii )
the restriction to[Ny]{M)
is a one-to-one mapfrom [NyJ{M)
to
[dhv L](M);
(iv ) if
M issimply-connected
then the restrictionof
~ to maps onto[dhv L](M)
and so is abijection from
to[dhv L](M).
Proof
(i)
If c is the inclusion(~w~
xM)
n W -~ W or(M
x~c~~~)
n W -~ Wthen
dcv2
A = 0.Property (3.4)
follows fromsymmetry
of gi;j.(ii)
This follows from gi; j =g2; j
.(iii)
Let g andg
~ be normalisedyokes
whichgive
rise to the samesymplectic
form. Thenso that
for some functions ..., ad. It follows from
(2.1)
that a1 = ... = ad = 0.Then
for some function
Q.
From(2.5)
it follows that/3
= 0 andso g = g’.
(iv) Any 2-form ~
on an open set in M x M can beexpressed locally
asIf ~
isdhv-Lagrangian
then aij = Cij = 0 and soSince a
symplectic
form isclosed,
we haved1J
= 0 and soGiven 03C9 and w’ in
M,
choosepaths 03BE and (
in M such that03BE(0) = 03C9’,
~(1)
= w,~(o)
= W and~(1)
= c,~’. Define aiby
It follows from
(3.7)
and thesimple-connectivity
of M that ai does notdepend
on the choice of(. Also,
Define g by
It follows from
(3.8), (3.4), (3.6)
and thesimple-connectivity
of M thatg(w, c,~’)
does notdepend
on the choice of~,
so that g is well-defined. It issimple
toverify that g
is a normalisedyoke
with r~ as itssymplectic
form.Remark 3.1. - Yokes and 2-forms
satisfying (3.5)
are related topreferred point geometries.
Thesegeometries
were introducedby Critchley
et al.([14], [15])
in order toprovide
ageometrical
structure which reflects the statistical considerations that(i)
the distributiongenerating
the data has adistinguished
role and(ii)
this distribution need notbelong
to the statistical model used. Preferredpoint geometries
are of statistical interest both because of their relevance tomis-specified
models and because variousgeometrical objects
which arise in statistics arepreferred point
metrics.It is convenient here to define a
preferred point
metric on a manifold M tobe a map from M to
(0,2)-tensors
onM,
which isnon-degenerate
on thediagonal. (This
is aslight weakening
of the definition ofCritchley
etal.,
who
require
the tensor to besymmetric
and to bepositive-definite
on thediagonal.)
In terms of coordinates(c,~ 1,
... ,w d ),
apreferred point
metric bon a manifold M has the form
A
yoke g
on M determines apreferred point
metric on Mby
Note that the map from
yokes
on M topreferred point
metrics on M is notonto
(even
at the level ofgerms),
because ageneral preferred point
metricb does not
satisfy
theintegrability
conditions(3.6)
and(3.7). However,
itfollows as in the
proof
of Theorem3.1(iii)
that the restriction of this map to the set of normalisedyokes
is one-to-one.A
preferred point
metric b on M determines a 2-form on M x Msatisfying
(3.5) by
....Since this 2-form need not be
closed,
it is not ingeneral
asymplectic
form.It is a consequence of the
proof
of Theorem3.1(iv)
that if M issimply
connected then the
preferred point
metrics for which thecorresponding
2-form is
symplectic
and satisfies(3.4)
areprecisely
those which come fromyokes.
Remark 3.2. . - The construction of the
symplectic
form of ayoke
behavesnicely
under inclusion of submanifolds. Let : N ~ M be anembedding (or
moregenerally,
animmersion)
andlet g
be ayoke
on M. If thepull-
back to N
by
t of the(0,2)-tensor
9i;j on M isnon-singular (as happens,
in
particular,
if the metricof g
ispositive-definite) then g pulls
back to ayoke g
o(t
xt)
on N. It is easy toverify
that thecorresponding symplectic
form r~~ satisfies r~~ =
(t
xc~
.Remark 3.3. - The construction of the
symplectic
form behavesnicely
under
taking profile yokes.
Let p : : M - N be asurjective
submersion(=
fibredmanifold)
andlet g
be ayoke
on Msatisfying
0(so that -g
is a contrastfunction).
Theprofile yoke
of g is the function9
N x definedby
We can choose
implicitly
a sections : N x N ~ M x M of p x p (i.e.,
afunction s with
(p
xp)
o s theidentity
of N xN)
such thatThen the
symplectic
form r~~of 9
satisfiesFor
example,
let M be the parameter space of acomposite
transformation model with groupG,
let p be thequotient
map p : : M -M/G,
and letg be the
expected
likelihoodyoke.
Then thecorresponding profile yoke
g
is-7,
, where I is the Kullback-Leiblerprofile
discrimination consideredby
Barndorff-Nielsen andJupp [10].
The tensors and onM/G
obtained
by applying
Blaesild’s[12] general
construction to theyoke g
arethe transferred Fisher information
p!i
and the transferred skewness tensorpiD
of Barndorff-Nielsen andJupp ~10~.
Remark 3.4.- A rather different connection between statistics and
symplectic
structures isgiven by
Friedrich[17].
His constructionrequires
a manifold
M,
a vector field X on M and an X-invariant volume form Aon M. These
give
rise to a 2-form r~ onP(M, A),
the space ofprobability
measures on M which are
absolutely
continuous withrespect
to A. If X hasa dense orbit
then ~
is asymplectic
form onP(M, a).
.4. Some uses of the
symplectic
form of ayoke
In
symplectic geometry important
ways in which asymplectic
structurer~ is used are:
(i)
to raise and lowertensors;
(ii)
to transform(the
derivativesof)
real-valued functions into vectorfields;
(iii)
toprovide
a volumeH,
thusenabling integration
over the manifold(in mechanics,
over thephase
space,i.e.,
thecotangent space).
We now
apply
these to thesymplectic
form r~ of ayoke g
on M. Recallthat ~ is defined on some
neighbourhood
Wof 0394M
in M x M.4.1
Raising
andlowering
tensorsA
symplectic form q
enablesraising
andlowering
of tensors in thesame way that a Riemannian metric does. In
particular,
the "musicalisomorphism" {(,
which raises 1-forms to vectorfields,
and its inverse D(which
lowers vector fields to
1-forms)
are definedby
for 1-forms a and vector fields X and Y. For the
symplectic
form(3.2)
ofa
yoke,
the coordinateexpressions of #
and Darewhere is the
(2.0)-tensor
inverse to The construction in Section 5 of theLagrangian
of ayoke
can beexpressed
in terms of}t;
see(5.2).
4.2 The vector field of a
yoke
A
yoke g
on Mgives
rise as follows to a vector fieldXg
on W. . Thederivative
dg of g
is a 1-form.Raising
thisusing )} yields
the vector field=
Xg
. In coordinate terms,Xg (v, w~)
isgiven by
By
Liouville’s Theorem onlocally
Hamiltonian flows(see,
e.g., Abraham and Marsden[1,
pp.188-189]),
the flowof Xg
preserves the volume Q on W definedby (2.15). If g
is an observed likelihoodyoke,
then the vector fieldXg
describes ajoint
evolution of theparameter
and the observation.The vector field
Xg
can be used to differentiate functions. Let h bea real-valued function on M x M. Then the derivative of h
along Xg
isdh(Xg ).
An alternativeexpression
fordh(Xg )
iswhere
~h, g~
is the Poisson bracket(with
respect to thesymplectic
form r~of
g )
definedby
(See,
e.g., Abraham and Marsden[1,
p.192].)
It is clear from(4.3)
that{g, g~
= 0. Thissuggests
that one way ofcomparing
two normalisedyokes
h and g
isby
their Poisson bracket. In coordinate terms we haveRemark 4.1.2014 The Poisson bracket with
respect
to a fixedsymplectic
structure is
skew-symmetric. However,
because thesymplectic structure q
used in
(4.3) depends
on g, theskew-symmetry
propertydoes not hold in
general
here.The behaviour of
~h, g~
near thediagonal
of M x M isexplored
inProposition
4.1. It shows that the3-jet
of~1~, g~
at thediagonal
cannotdetect differences in metrics between two normalised
yokes hand
g but that some differences in skewness tensors can be detected.PROPOSITION 4.1.- Let
h and g
be normalisedyokes
on M withskewness tensors
Then:
~ t=0;
~ ~r;
_~;r
=~ i
_ = =
~i
=
Wrs;t
= = =g),
whereg)
is thetensor
defined by
[3] indicating
a sumof
three terms obtainedby appropriate
permu- tationof
indices.Furthermore, if
then
Proof.
- Part(o)
is immediate from(4.4)
and(2.8). Differentiating (4.4)
withrespect
to 03C9 andapplying (2.8) yields (i). Repeated
differentiation of(i)
then shows that k satisfies(2.9)-(2.11). Differentiating (4.4)
twiceand three times with
respect
to c~,using
Leibniz’ rule for differentiation ofproducts.
andapplying (2.8), (2.9)
and(2.10),
we obtain(ii)
andDifferentiation of
(ii)
nowyields (iii).
Differentiation of
(4.5) together
with(4.6)
and(2.9)-(2.11) yields
from which
(4.7)
follows.Differentiating (4.4)
four times with respect to w,using
Leibniz’ rule for differentiation ofproducts,
andapplying (4.10),
we obtainDifferentiation of
(4.7)
nowyields (4.8).
DExample 4.1
Let g
be theexpected
likelihoodyoke
of thefamily
ofsamples
of size nfrom a normal distribution with unknown mean p and unknown variance
0"2.
ThenConsider a location-scale model with
probability density
functionsfor some function q.
Suppose
that forsamples
of size n from a distribution in thisfamily
there is no non-trivial sufficient statistic(as happens,
forexample,
in afamily
ofhyperbolic
distributions withgiven shape).
Take asan
ancillary
statistic ... ,an ) ,
withwhere
and ? denote the maximum likelihood estimatesof p
and c-. Let hbe the
corresponding
observed likelihoodyoke.
Thenwhere
with
Another way in which vector fields can arise from statistical models is
given by
Nakamura([22], [23]).
He considersparametric
statistical models(e.g., regular exponential models)
in which theparameter
space M is an even- dimensional vector space and the Fisher information metric i isgiven by
for some
potential
function~.
The metric can be used to raise the derivativeof 03C8
to a vector field(d’lj;)ij
onM, given
in coordinatesby
where is the inverse of the matrix . If M has dimension 2k then it can be
given
thesymplectic
structure of thecotangent
space T*V ofa k-dimensional vector space V. Nakamura shows that the vector field
(d~)~
is Hamiltonian and iscompletely integrable, i.e.,
there are functionsf1,
... ,fk on M
which are constant under the flow of the vectorfield, satisfy
= 0 for 1i, j k,
and such thatdf1,
...,d fk
arelinearly independent
almosteverywhere (see
Abraham and Marsden[1,
pp. 392-393]).
We have found no non-trivialanalogue
in oursymplectic
context ofNakamura’s
complete integrability
result.4.3 The volume of a
yoke
Recall from
(2.15)
that ayoke g
on Mgives
rise to a volume Q on someneighbourhood
W of thediagonal
in M x M. For manyyokes
of statistical interest W can be taken to be all of M x M.However,
for ageneral yoke,
W will be a proper subset of M x M. The main way in which volumes are
used is as
objects
to beintegrated
over the manifolds on whichthey live,
so in such cases there is no obvious choice of manifold over which Q should be
integrated. (One possibility
is tointegrate
S2 over the maximal such W but theinterpretation
of the value of thisintegral
is notclear.)
An alternative way of
using
the volume Q is to compare it with other volumes onW,
, inparticular
with the restriction to W of thegeometric
measure obtained from the metric of g. More
precisely,
theyoke g gives
a(possibly indefinite)
Riemannian metric on M and so a volume on Mgiven
in coordinate terms as
where [ . denotes
the absolute value of the determinant. Thecorresponding product
measure on M x M is the volume with coordinate formIt follows from
(2.9)
that(4.11)
can also be written asSince Q is
given
in coordinate form as d!w~ ) I volume (4.11 )
can bewritten as d!
M,
where the function h : W ~ R isgiven
in coordinate formby
Note
that,
ingeneral, h
is not ayoke. However, h
can be used to obtainfurther
yokes
from g. . For any real A define M x M - Rby
Since h = 1 on the
diagonal,
thefollowing proposition
shows that is anormalised
yoke
with the same metric as g. Note that the germs at0~
ofhand
depend only
on the germ of g.PROPOSITION 4.2. - Let g be a normalised
yoke
on M and h : M x M --~II8 be any
function
such that h = 1 on thediagonal 0~.
Then(i)
theproduct hg
is a normalisedyoke
on Mhaving
the same metricas g;
(ii)
thefunction
satisfies
Proof.
- These aresimple
calculations. DRemark 4.2. - Note that if k in
Proposition
4.2 satisfies alsothe matrix + is
non-singular {4.14)
then k is rather like a
yoke
but need notsatisfy (2.1).
Note also that conditions(4.13)
and(4.14) provide
ageneralisation
of theconcept
of anormalised
yoke. Also,
in contrast to the definition of ayoke, (4.13)
and(4.14)
involve the twoarguments
in asymmetrical
way.Remark 4.3. - Let k be a function on M x M
satisfying (4.14).
Definegk : M x M -~ 1I8
by
Then gk
is a normalisedyoke
on M. .Remark 4.4. - One context in which the
"adjustment
factor" h of(4.12)
occurs in statistics is in a variant of the
p*-formula
for the distribution of the score, _ ~ _ ~ ,
A suitable
starting point
for this is thep*-formula
for the distribution of the maximum likelihood estimator.
Here [ )[
is thedeterminant of the observed information
matrix j
=ili, a)
evaluatedat ili. For an extensive discussion of this formula and its
applications
see Barndorff-Nielsen and Cox
[9].
Theexpression
v| a) d1
...represents a volume on M.
Changing
the variable in(4.15)
from ~ to thescore s*
gives
thep*-formula
for the distribution
of s*.
Theexpression p* (s*; w a) dsl ~ ~ ~ dsd represents
a volume on the
cotangent
space to M at w .Now j
definedby
= is an
inner-product
onTWM,
so thatj-1
is aninner-product
on The
geometric
measureof j -1
is a measure onrepresented
Then
_u..
is a ratio of measures on
TWM.
Letj-1~2
be any square root ofj-1
anddefine the standardised score s*
by
Barndorff-Nielsen
[8,
sect.7.3]
derivedas an
approximation
to thedensity ofs*.
. Note thatp* (s*; W a)
is(4.16).
Define h M x l~l -~ R
by
Since the observed likelihood
yoke (2.6)
is g,given by
we
have, neglecting
an additive constant,where h is the
"adjustment
factor" defined in(4.12).
FromProposition 4.2,
k satisfies