A NNALES DE L ’I. H. P., SECTION A
N. H ADJISAVVAS
Distance between states and statistical inference in quantum theory
Annales de l’I. H. P., section A, tome 35, n
o4 (1981), p. 287-309
<http://www.numdam.org/item?id=AIHPA_1981__35_4_287_0>
© Gauthier-Villars, 1981, tous droits réservés.
L’accès aux archives de la revue « Annales de l’I. H. P., section A » implique l’accord avec les conditions générales d’utilisation (http://www.numdam.
org/conditions). Toute utilisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit contenir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
p. 287
Distance between states
and statistical inference in quantum theory
N. HADJISAVVAS
Laboratoire de Mecanique Quantique, BP 347, 51062 Reims, France Ann. Inst. Henri Poincaré,
Vol. XXXV, n° 4, 1981,
Section A :
Physique théorique.
ABSTRACT. - We define the distance between two
quantal
states as aglobal
measure of the difference of theirrespective
statisticalpredictions.
The
expression
of this distance in the usual Hilbert space formalism is found. This concept is then used tostudy
theprojection postulate
and thestatistical inference
problem
inQuantum Theory.
RESUME. - Nous définissons la distance entre deux etats
quantiques
comme une mesure
globale
de la difference de leurspredictions statistiques respectives. L’expression
de cette distance dans le formalisme Hilbertien habituel est determine. Par lasuite,
ce concept est utilise dans l’étude dupostulat
deprojection
et duprobleme
de Finferencestatistique
enMécanique Quantique.
I. INTRODUCTION
Quantum Mechanics,
as it standstoday,
is aphysical theory describing
the
microsystems exclusively
from the observationalpoint
of view. Itwould be thus natural for it to be connected with other theories
treating
the observation on a
general
level and inpurely
abstract terms, as Estima- tionTheory
and Informationtheory. However,
if numerous works on thissubject appeared lately [1 ] [2 ] [3 ],
the elaboration of this connection is far from been achieved. Inparticular, only
few concrete results have been found[4 J [5] ] [6]
§ other results have no clearphysical interpretation,
even if
they
are sometimes verysuggestive [7].
Annales de l’lnstitut Henri Poincaré-Section A-Vol. XXXV, 0020-2339/1981/287/$ 5,00/
© Gauthier-Villars
Much of tiiis effort is
principally
concentrated on theproblem
ofspecifying (or estimating)
the stateof a quantal
systemby using
the availableinformation about it. This is a crusial
question,
forQuantum
Mechanicsgives precise
statisticalpredictions
on the results of future measurements when the state isknown,
but on the other hand it says little about the determination of that state. This isespecially
true for thequantum
statis- ticalmechanics,
where ingeneral
the available information is not sufficient to determine the state.The
study
of thisquestion
wasoriginated by Jaynes [8 ].
Heproposed
the so-called « maximum entropy
principle »
as a way tospecify
the statewhich best represents some statistical data. Later the
study
of a similarsubject
was undertakenby
the present author[9 ],
andGudder,
Marchandand
Wyss [1 D ],
i. e. that offinding
a stateusing
the available information and an initial state.The scope of our paper is to advance the
study
towards this later direc- tion. We divide the work in two parts : in thefirst,
we define a concept of distance between states, we discuss itsphysical interpretation,
and deriveits mathematical
properties.
In the second part, we use this concept for thestudy
of theproblem
of thechange
of the stateprovoqued by
a new infor-mation on the system. We consider two cases,
depending
on the natureof this information. As a
result,
we derive the so-called «projection
postu- late » fromanother, physically
much moreappealing postulate.
II. THE DISTANCE BETWEEN TWO STATES OF A SYSTEM
1. The definition of the distance.
Let us first
specify
some constraints to which anappropriate
distanceshould
obey.
From a mathematicalpoint
ofview,
the distanced(pl, p2)
of two states pi and p~ should
satisfy
thefollowing
conditions :It would be also desirable for the distance to
satisfy
thetriangle inequality :
i. e. to be a metric. We note however that there exist some « distances », as the information discrimination of Kullback
[11],
which are notmetrics
[12 ].
Several different distances
satisfying
conditionsa), b)
andc)
have been defined up to now[13-17]. Among these,
there are two which shallAnnales de l’lnstitut Henri Poincaré-Section A
289
DISTANCE BETWEEN STATES AND STATISTICAL INFERENCE IN QUANTUM THEORY
retain our attention because
they
are theonly
ones defined on the basisof
physical considerations, namely :
the distancep2)
of Gudder[13] ]
and the distance defined
by Jauch, Misra,
Gibson[18] ] [19],
and
independently, by
Kronfli[20] ]
andHadjisavvas [9].
Gudder’s distance is introduced as follows. Let E be the set of all « statis- tical »
(or
«density »)
operators(i.
e.positive
operators of unittrace)
inthe Hilbert space
corresponding
to aphysical
system. As well known, the statistical operators represent the states of thesystem. E
is known to be convex : If 0 a 1 andWi, W2
E L then the « mixture »aWl
+( 1- a)W2
also
belongs
to E. Now Gudderqualified
two statesrepresented by
the statistical operatorsWl, W2
as « close » if there exists a mixturecontaining mostly W1 equal
to a mixturecontaining mostly W~:
thatis,
if there exist
W3, W4
~ 03A3 and a small À such thatAfterwards,
based on thisconception
ofcloseness,
he defined the distancebetween the two states
by
The property of the
quantal
states to form mixturescorresponds
to acertain
physical
fact. But it seemshighly improbable
apriori
that a distance which takes into accountonly
thissecondary
property whileneglecting
all the other aspects of the concept of state, can
properly
express the diffe-rence between two states. That is
why
we shall rather be concerned with another distance put forwardby Jauch,
Misra and Gibson. The definition of this second distance makes use of the von Neumann-Jauch[27] ] [22] ]
formalism of « quantum
logic »
which webriefly
recall:To any
physical
systemcorresponds
a set L of «propositions
». If the system does not possess anysuperselection rules,
one admits that thisset L is
isomorphic
to the set of allprojectors
in acomplex separable
Hilbert space. In this
formalism,
the states areprobability
measures on L and arerepresented,
via the Gleason’s theorem[23 ], by
statistical operators.The relation
connecting
thephysical
concepts ofproposition
and statewith their mathematical
representatives
is thefollowing:
if to the propo- sition « a » and to the state p therecorrespond, respectively,
theprojection
operator E and the statistical operator
W,
then theprobability
tofind, by making
a suitable measurement, that theproposition
« a » is true, isgiven by
the formula :We now have :
DEFINITION 1. - The distance between any two states pi, p~ is
given
bv the expression:Vol. XXXV, n° 4-1981.
The justification
of this definition can be based on thefollowing argument which,
if notunshakable,
is much stronger than the usual arguments(when they exist)
in favour of other definitions. As wealready pointed
out in the
introduction, Quantum Theory
describes themicrosystems exclusively through
their behaviour upon measurement. Inparticular,
aquantal
state does notonly yield
statisticalpredictions,
but it can also becompletely
definedby
thesepredictions (this
is one of the aspects of Gleason’stheorem). Accordingly,
the distance between two states pi, p2can be measured
by
the difference between the statisticalpredictions yielded by
pi, p2. Now for agiven proposition a
EL,
the difference of the statisticalpredictions of pi,
p2 about ais
.Accordingly,
an
appropriate
distance should be based on thestudy
of the whole set of values of the functionWe shall prove in the
following
subsection that the set of values of this function is an interval of the form[0, d ].
Now if d issmall,
then the diffe-rence of
predictions p2(a) I
is small for any a and thus the two states pl, p2 should be considered as close to each other. If on the contrary d isbig, then pl(a) - p2(a) | is
close to d for an infinite number ofproposi- tions a,
so that the distance between should bebig.
That iswhy d
seems to be an
appropriate
measure of thisdistance,
and thisexplains
thechoise of the definition 1.
Jauch,
Misra and Gibson put forward this distance for the sole purpose ofdefining
atopology
in the set of states, in order to takecorrectly
limitsin a
rigorous study
of thescattering
process.They
determined thetopology
introduced
by p2),
butthey
did notgive
theexpression
of the distance in the Hilbert space formalism. Our scope here is to use the distance in the statistical inferenceproblem
inQuantum Theory.
We need for this theexpression of d( p i, p2)
for anypair
of states, pure or mixtures. This expres-sion, together
with some otherproperties,
isgiven
in thefollowing
subsection.
2.
Explicit
form of the Hilbert sp ace formalism.We start
by briefly recalling
some definitions and results of tracetheory [24 ].
An operator A in acomplex separable
Hilbert space H is said tobelong
to the trace class iff theseries A fn) converges
for at least one
basis { In
In that case, theexpression
Tr A= 03A3(fn,
n= IAf")
is
independent
of theparticular basis { fn
The trace class is denotedAnnales de l’lnstitut Henri Poincaré-Section A
DISTANCE BETWEEN STATES AND STATISTICAL INFERENCE IN QUANTUM THEORY 291
by Li(H)
and is known to be a Banach space with respect to the norm) A III
=Tr
. The statistical operators areby definition,
thepositive
elements with unit norm in
Li(H).
If a
self-adjoint
operatorbelongs
toLI (H),
then itsspectrum
is discrete :~ ~n
and one has : Tr A= ~,n and ) ) A = ~ Furthermore,
n n
the usual bound norm of operators is
given
in this caseby II
=sup
.neN
Some conventions on notations :
L(H)
is the Banach space of all conti-nuous operators on H. For any
subset,
r cH,
we denoteby [F] ] the alge-
braic
subspace generated by F-that is,
the set of all finite linear combinations of elements of r - andby
F thetopological
closure of r. For any vector g,we denote
by
theprojector
on the one dimensionalsubspace
contain-ing
g. For any operatorA, R(A)
will be his range. If A is selfadjoint,
then A +and A - are his
positive
andnegative
parts.We now
give
theexplicit
form ofd(pl, p2)
in the mostgeneral
case :THEOREM 1.
Let pl, p2
be any two states,represented by
the statistical operatorsWl, W2 respectively.
ThenProof
- SinceWt, W2
EL 1 (H)
one also hasWl - W2
ELi(H).
LetWi 2014 W2 =
be thespectral decomposition
ofWi 2014 W2, where 03BBn
n
are its
eigenvalues complete
orthonormal set ofeigen-
vectors.
Then, by
the definition one has :On the other
hand,
and Thus :
Vol. XXXV, n° 4-1981.
Since Tr
WI
= TrWz
=I,
we 03A303BBn = Tr(WI - Wz)
= 0 so thatSubstitution in
(4) gives
Using (3), (5), (6)
we obtain :But
"WI - W211I 1 = [ = 2 Substitution in (7)
then gives
the
desired
result. /
We note a
remarquable
fact : the distance which seemed to us as the most « natural » among all distances defined up to now isgiven,
up to a constant,by
the distance introducedby
the natural normof the mathematical space
Li(H)
associated to states.However,
inspite
of the fact that the
expression II WI - W2 111/2
isgeneral
andelegant,
it is hard to calculate in
practice. Fortunately,
when at least one of thestates is pure, the
expression
of the distance becomessimpler
as shownby
thefollowing
result :PROPOSITION 1. - Let pi be a pure state
represented by
a normalizedvector
g and p2
any staterepresented by
a statistical operator W. Then :a)
The spaceR(P(f] - W)+
is one-dimensional.b)
Proo, f:
Let F be theprojector
onW)
+. ThenNow
= F f ~ ~ F f ) I
has one-dimensional range, and the sameAnnales de l’lnstitut Henri Poincaré-Section A
DISTANCE BETWEEN STATES AND STATISTICAL INFERENCE IN QUANTUM THEORY 293
must be true for the
right member
of(8).
Nowby corollary 1,
ref.[25 ]
one has
and thus
W)+
has also a 1-dimensional range. This provesa).
Now
b)
is a obvious consequence ofa)
and of relation(7). )t
When the states pi, p2 are both pure, their mutual distance
acquires
a very
simple
form :COROLLARY 1. are two pure states
represented by
the vectorsj’
and g, then
Proof
- Wejust
have to calculate theeigenvalues
of -p(g]’
Thishas been
made,
forexemple,
in ref.[18 ]
andgives
the result :which,
combined with relation(7),
proves thecorollary. /
We
finally
prove theproposition concerning
the valuesof I
mentionned in the
preceding
subsection :PROPOSITION 2. - For any
pair
of states p~ the range of the func- tionf
defined on the set of allpropositions
Lby
is the entire interval
10, ~ ~ ~~ ~ ~ ~ ~
.Proof.
- Wealready know, by
virtue of theorem1,
thatNow be a basis basis of
R(Wi - W2) - .
We denoteby Ro
the one between these twosubspaces
that has the smallest
dimension,
sayRo
=R(WI - W2) + .
Then we cansuppose that I c J. The idea of the
proof
is to « turn »continuously Ro
from its initial
position
until it falls onR(WB 2014 W~)’ .
For any i E I and t E
[0,
1] define :
Obviously
Vol. XXXV, n° 4-1981. 1 1
and also
Now
corollary
1 andproposition
1imply
the relationInserting
this in relation(10)
andusing (9),
we getVt, t’
E[0, 1 ] :
.which shows that the function t -
E(t)
is continuous. Hence the functiont -
h(t) :=
Tr((Wi - W2)E(t))
is also continuousand, consequently,
itsrange is an interval. For t =
0, E(0)
is theprojector
onRo
=R(WI - W2)+,
thus
For t =
1, E(l)
is theprojector
onR1
cR(Wi - W2)-,
thush(1)
0.If at
is theproposition corresponding
toE(t)
thenf(at) = I h(t) so
thatf
takes on all the values from 0
to IWI - 2 W2111
as was to beproved, t)
With
proposition
2 we concluded ourstudy
of the distancep2)
and turn now to
applications.
III. STATISTICAL INFERENCE IN
QUANTUM
THEORYThe purpose of this section is to
give
anapplication
of the defined distanceto the
problem
of statistical inference inQuantum Theory.
Beforedoing
so, we shall try to elucidate further what we mean
by
the denomination« state of the system ». This will
help
us tojustify
in thefollowing
sub-sections our choices of 4definitions and methods. However, we think that these choices and methods may be
justified
and useful even for an under-standing
of the concept of state different from ours, or in different contexts.Their use is not
necessarily
bound to ourpersonal conceptions
on theinterpretation
ofQuantum Theory.
Annales de l’Institut Henri Poincaré-Section A
DISTANCE BETWEEN STATES AND STATISTICAL INFERENCE IN QUANTUM THEORY 295
1. The «
subjective »
and the «objective »
state.There are many different concepts which have been called
by
the communname « state of the system »
by
quantum theorists. These concepts differby
theorganisation
level to whichthey apply-level
of an individualsystem
or of a statistical ensemble of systems- but also
by
the different concep- tions thatthey
express.Throughout
this work we shall follow the conven-tional
Quantum Theory.
The states are
usually
seen as a result of a certainpreparation
of the system. Let usreproduce
J. M. Jauch’s definition[21 ]
« A state is theresult of a series of
physical manipulations
which constitute thepreparation
of the state. Two states are identical if the relevant conditions in the prepa- ration are identical ». And Jauch
explains
furthermore that the relevanceor not of a condition is not a
priori known,
but is aquestion
ofphysics.
It is clear that the concept of state so defined has an
objective
character :a
preparation
defines a stateindependently
of what the observer thinksor knows about it. In the framework of
Quantum Theory,
this state isrepresented,
as wellknown, by
a statistical operator.Accordingly,
thefundamental
problem
which aphysicist
has to solve if he wants topredict
the future
behaviour
of systemsproduced by
agiven preparation,
is todetermine the
corresponding
statistical operator. In order to dothis,
hecan make use of all the information available about the
preparation.
Thisinformation is
usually
of two kinds :a)
It may consist of statistical results of measurements made on alarge
set of systems
produced by
thatpreparation.
In that case,using induction,
he assumes that these results will also be observed in future measure-
ments, i. e. characterize the state
produced by
thepreparation
and not theparticular
ensemble of systems which was used for the’ measurement.Such information will be called indirect.
b)
It may concern thepreparation
itself(assembling
of theinstruments, boundary conditions, etc.),
i. e. have a direct character.The ensemble of all available
information,
direct orindirect,
will bedenoted
by
J.In
practice,
the available information on thepreparation
israrely
sufficient to determine
uniquely
thecorresponding
state.If,
forexemple,
it is indirect and consists in the
knowledge
of meanvalues, dispersions,
etc.of some
observables,
it is well known that there exist ingeneral infinitely
many states in which these observables have the
given
mean values anddispersions.
We shall call these states « statescompatible
with J » andnote the set of
corresponding
statistical operatorsby S J.
The available information J on a
preparation
will be called «complete »
if it determines the state
uniquely,
i. e. if the setS J
containsonly
one element.When J is
incomplete,
as itusually happens,
the statecorresponding
toVol. XXXV, n° 4-198 1 .
the
preparation
is unknown and one can beonly
sure of itsbelonging
to
Sj.
In this case, it is ingeneral
notpossible
togive objective predictions concerning
the results of measures on systems soprepared,
i. e.specify
the relative
frequencies
which will beactually
realized.However,
inpractice
onealways
tries to make use of the availableinformation,
evenif it is
incomplete,
in order togive predictions
whocorrespond
to a « reaso-nable
degree
of belief »[8 ] [26 ] even
if it is not certain thatthey
will beverified
by
theexperience.
In otherwords,
one tries to find between all elements ofSj,
the « mostprobable
» or the one that constitutes the « best estimation » with respect to the available information. This research is in the classicalprobability theory,
thesubject
of theproblem
of statistical inference. But one cannotsimply transpose
to quantum mechanics the methods used inprobability theory,
for two distinct reasons.First,
thequantum-theoretical algorithms
are often different from those of thetheory
ofprobability. Second,
quantum mechanics is aphysical theory,
so that our available information may have a non statistical character
(direct
information about the aparata used in thepreparation, etc.).
At this
point
we think that it would be useful toenlarge
ourlanguage.
Given a
preparation,
we shall say that our information J about this prepara- tion defines a certain «subjective
state » of thephysicist
or a state of know-ledge. By opposition
to thesubjective
state, we shall call from now on the state of the system definedby
thepreparation
itself «objective
state ». Wesaw that each
objective
state has a mathematicalrepresentative (i.
e. astatistical
operator)
with thehelp
of which allobjective
statisticalpredic-
tions
concerning
the system may be obtained. Now wepostulate
that thesubjective
state also has a mathematicalrepresentative
which can beused to obtain statistical
predictions,
but in this case thesepredictions
will be
subjective
andthey
willsimply
reflect ourknowledge
about the system in the bestpossible
way. Thus thisrepresentative
will be the statis-tical operator described earlier as the « most
probable »
between theelements of
Sj,
or the « best estimation » of theobjective
state.We summarize the content of this
’subjection
in thefollowing
mnemonicdefinitions :
DEFINITION 2
(Direct
and indirectinformation).
- An informationconcerning
thepreparation
of a system is called indirect if it consists of statistical results of measurementsperformed
on alarge
number of systemsproduced by
thepreparation.
It is called direct if it concerns thepreparation
itself. The total amount of the available information will be denoted
by
J.DEFINITION 3
(Objective
andsubjective state).
- We shall say that apreparation
defines theobjective
state of the system, and that an amountof information J about the
given preparation
defines the «subjective
state » or « state of
knowledge »
of the observer. Both of these states arerepresented by
statistical operators.Annales de l’lnstitut Henri Poincaré-Section A
297 DISTANCE BETWEEN STATES AND STATISTICAL INFERENCE IN QUANTUM THEORY
DEFINITION 4
(Complete information).
- An amount of information Jconcerning
agiven preparation
is calledcomplete
if it determinesuniquely
the
objective
state. In this case thesubjective
and theobjective
state arerepresented by
the same statistical operator.In this new
language,
the statistical inferenceproblem
is translated as follows : for agiven incomplete
amount of informationJ,
find the statistical operator that represents our state ofknowledge.
After these remarks on the concept of state, we turn now to the
applica-
tion of the distance to the statistical inference
problem.
From now onstates and statistical operators will be denoted
by
the samesymbol.
Sincewe shall
primarily
be concerned withsubjective
states, we shall sometimesdrop
theadjective
«subjective
».2. Modification of the
subjective
state due to additional information.Jaynes
was one of the first tostudy
aparticular
case of the statistical inferenceproblem
inQuantum theory, namely
the case of information ofpurely
statistical character(i.
e.indirect).
He defined a new concept, the entropy of aquantal
state, andproposed
thefollowing
solution to theproblem :
for agiven
amount of informationJ,
let S be the set of all statistical operatorscompatible
with J in the sense definedpreviously.
Then thesubjective
state isrepresented by
the element of S which has the greatest entropy. This solution isnowadays
considered as correct. Infact,
as wehave shown elsewhere
[27 ],
this « maximum entropyprinciple » proposed by Jaynes
isimplied by
theprinciple
ofLaplace,
at least in the classicalcase. Yet it has been understood that it is
applicable exclusively
on thecase of indirect information
(i.
e. mean values of some observables aregiven).
Itsapplication
to other casesproduces paradoxes [28 ].
We shall
study
a different case of the statistical inferenceproblem
whichcan be stated as
follows ;
suppose that our state ofknowledge
on aphysical
system is
Wo.
At agiven
time weacquire
new data whichchange
our stateof
knowledge.
Theproblem
is to determine the final stateWl
in terms ofthe state
Wo
and of the new data. In whatfollows,
we shall use the Heisen-berg picture,
so that there will be nodynamical
evolution of the state in time. States canchange only
when our information on the systemchanges.
We shall consider two different cases,
according
to the nature of thenew information which
changes
the state, direct or indirect(Cf.
def.3).
2.1. The case of indirect information.
2.1.1.
Physical investigation.
When the new information is
indirect,
we can state ourproblem
asfollows. For a
given preparation,
suppose that our information about itVol. XXXV, n° 4-1981.
permitted
to choose in any way(which
we do notspecify)
a statistical operatorWo representing
thisparticular
state ofknowledge.
In order totest whether or not
Wo
represents also theobjective
state which this prepa- rationdefines,
we can mesure some observables on a great number of systemsproduced by
thepreparation.
Thisprovides
some statistical data which may be identical or not to thosepredicted by Wo.
Ifthey
areidentical,
then we have an indication that
Wo
is close to the statistical operatorrepresenting
theobjective
state, or even that itmight
beequal
to it. There-fore,
we maintainWo
as arepresentative
of thesubjective
state. If thestatistical data differ from those
predicted by Wo,
then we areobliged
tochoose another
WI
asrepresenting
our state ofknowledge,
since this later ischanged.
Theproblem
of the statistical inference is here to find thisWI
on the basis of
Wo
and of the new statistical data.The solution we propose to this
problem
is based on thefollowing qualitative
argument. Let S be the set of alldensity
operators whogive predictions
in accordance with the new statistical data.Obviously Wi
mustbe one of them. But on the other hand
Wi
has to take into account, as faras
possible,
theknowledge
contained inWo.
This condition can be fulfilled ifWi,
whilebelonging
toS,
is chosen so as togive
the closestpossible
statistical
predictions
to thosegiven by Wo,
and this for allimaginable
measurements.
By
definition 1 of the distance between states, this isequi-
valent to
saying
thatd(Wo, WJ
should be as small aspossible.
In otherwords, Wi
should minimize the distanced(Wo, W)
taken betweenWo
and an element WeS.
It may be that this argument, because of its
qualitative character,
is notdecisively convincing. However,
it isquite
in thespirit
of the arguments used in classical statistical inference. We thus raise its conclusion to the status of apostulate.
POSTULATE 1. - Let
Wo, WI
represent thesubjective
statesbefore
andafter
theacquiring of
aninformation consisting of
some statistical data.Let
furthermore
S be the setof
alldensity
operatorsgiving predictions
inaccordance with these statistical data. Then
WI
is an elementof
S whichsatisfies
the relation :This is the translation of the
physical problem
into mathematicallanguage.
Indeed now we have to
specify
S and find between its elements the one which satisfies relation(11).
The first part of the
problem,
i. e. the determination of S in the case ofan indirect
information,
issimple. Indeed, quantitative
statistical dataare
generally
of three kinds :,
a)
Meanvalues, dispersions,
etc. of some observables aregiven.
Annales de l’Institut Henri Poincaré-Section A
299 DISTANCE BETWEEN STATES AND STATISTICAL INFERENCE IN QUANTUM THEORY
b)
Theprobabilities
of the answer « yes » are determined for somepropositions.
c)
One may determinecompletely
theprobability density
of the valuesof some observables.
Now all these results can be
expressed
as mean values of some self-adjoint
operators. In casea)
this isevident,
sincedispersions
areexpressed by
the mean values of squares of operators. For caseb),
it is sufficient to remember thatpropositions
are alsorepresented by
operators(projectors).
As for the case
c),
let us note that if C =f ÀdEÀ
is thespectral decomposition
of the operator
C,
then theprobability
distribution of the values of C in the state W isgiven by
the formula :1(2)
= Tr But Tr(AW)
forany self
adjoint
operator Ais, by definition,
the mean value of A in thestate W. Thus the
giving
of theprobability
distribution of C isequivalent
to the
giving
of all the mean values of theprojectors E~.
We conclude
that, indeed,
any indirect information can beexpressed by
thegiving
of the mean valueski,
i E I of some observablesCi. Accordingly,
the set S of all states
compatible
with an indirect information has the form:where E is the set of all
density
operators andCi
are observables defined for the studied system.The first part of the mathematical
problem
raisedby
thepostulate
i. e.the determination of the set S for the case of indirect information is thus solved. The second
part the finding
of the operatorW1
in S thatsatisfies rel
(11) is
much more difficult. Infact,
we haveonly
found somepartial
results. In order to facilitate the mathematical treatment, we shall suppose from now on thatCi
are continuous operators. If this is the case, then the structure of S isgiven by
thefollowing proposition.
PROPOSITION 3. - The set S defined
by (12)
is convex and closed(in Proof - a)
For anyW’,
W" E S and any a, b E !R+ such that a + b =1,
one has :
Consequently,
aW’ + bW" ES,
so that S is convex.b)
The functionsbeing
continuous[29] ]
it follows that the sets are closed. ButS so that S is
closed..
I
Vol. XXXV, n° 4-1981. >
The result of the
proposition
allows us toapply
thetheory
of bestapproxi-
mation of an element of a Banach space
by
elements of aclosed,
convex subset[30] ] [31 ].
In ourproblem,
the Banach space will beLi(H),
andwe shall have to
find, according
to ourpostulate,
the bestapproximation of Wo by
an elementW1 of S,
with respect to thenorm 11’111 i (cf.
Theorem1).
The
theory
of bestapproximation
for convex sets is still far frombeing sufficiently developed;
yet ityields general
criteriacharacterizing Wi,
which are much
simpler
that condition(11).
These criteria aregiven
inthe mathematical section 2 .1. 2
by
thefollowing
theorems : for ageneral
S(cf.
relation(12)) by
the theorems 3 and 4. For thespecial
case of an Sdefined
by
the mean value of one observableC,
cf.Corollary
2.Finally,
these criteria enable us, in some
special
cases, tocompletely
determineWi
1(cf.
Theorem6).
-2.1.2.
Approximations in the
spaceL1(H).
Throughout
thissubsection,
S will be a closed convex subset of E andWo
a
density
operator notbelonging
to S. Our scope is togive
criteria charact-erizing
the « element of bestapproximation
», i. e. the unknown statistical operatorWi
which satisfies rel.(11).
We shall make use of the basic result of the
approximation theory
ofconvex sets in normed spaces, due to I.
Singer [31 ],
which runs as follows :THEOREM 2
(1. Singer).
- Let E be a normed space, G a convex subset ofE,
xeE""’Ö
and go E G.Then
= g=Ginf ~
x - if andonly
if there exists an
fEE * (the topological
dual ofE)
such thatIn the case we
study, Singer’s
theoremimplies
thefollowing
result :THEOREM 3. - If
Wi
ES,
then a necessary and sufficient condition in order to haveis the
following.
There exists a hermitian operator A such that andProof
- Weapply Singer’s
theorem to the case E = G = G =S,
x =
Wo, go
=Wi.
It is well known[29] ] that
the dualofLi(H)
isisomorphic
to
L(H).
Theduality
is established via the continuous bilinear form whereAnnales de l’lnstitut Henri Poincaré-Section A
301 DISTANCE BETWEEN STATES AND STATISTICAL INFERENCE IN QUANTUM THEORY
Thus,
if relation(13)
is true, then an element B EL(H)
exists such that :and VW E S : Re
[Tr (B(WI - W))]
O.Setting
A= B +
B* we find:2
And also
= 1. N
The
problem
offinding
the element of bestapproximation
is thus trans-formed
by
Theorem 3 to apair
ofsimpler problems.
We should first find thegeneral
solution of thepair
ofequations (14)
and then examine if there exists one between these solutions that satisfies the condition(15).
Weestablish now a theorem which solves the first of these two
problems.
THEOREM 4. - The
general
solution of the system ofequations (24)
with respect to the hermitian operator A is the
following :
where
F,
F’ are,respectively,
theprojectors
onand B is a
hermitian
operator the range of which isorthogonal
toR(Wo - Wl)
and such that
II B II
1.Proof
- We first show that if B is as describedabove,
then A definedby (16)
satisfies the relations(14).
If x E
R(F)
then Bx = 0 and thus(x, Ax) = (x, Fx) = ) ) ( x ~ ~ 2
whichshows
that II A
1. On the otherhand,
for any x E H writewhere xi e
R(F),
x2 ER(F’),
x3 ER(B) and x4 | R(F)
E8R(F’)
E8R(B).
Thenthus
proving that II
= 1.Besides,
if we set forsimplicity
W =Wo - Wi
then
Vol. XXXV, n° 4-1981.
Now
obviously W
= (F - F’)I W
I is thepolar decomposition
ofW
so that
which
together
with the formerrelation,
shows that conditions(14)
aresatisfied.
Let us now prove that any solution of
(14)
has the form(16).
Ifis the
spectral decomposition
ofW,
then(14) implies
Since A 1,
we deduce thatNow if A =
r 03BBdE03BB is the spectral decomposition
of A one has :
and
analogously,
Therefore,
if we setby
definition B = A -(F - F’),
we haveBt/1n
= 0whenever 0. This
implies
thatR(B)
isorthogonal
toR(F - F’)
=R(W).
Since A = B +
(F - F’) and II A II = 1,
thisorthogonality implies
in itsturn
that II
BII
x 1 as was to be demonstrated. -The
following proposition
is useful forproving
theunicity
of the elementof best
approximation
for the cases of interest.PROPOSITION 4. - If the
inequality (13) characterizing
the elements of bestapproximation
has two solutions with respect toWi,
sayWi, W~
thenany A
obeying
conditions(14)
and(15) obeys
also the same conditionswith
W~
in theplace
ofWi.
Proof - Suppose
thatWi, W~
are solutions of(13)
and that Aobeys
conditions
(14), (15).
Then one hasand, by
virtue of(15),
Annales de l’Institut Henri Poincaré-Section A