HAL Id: hal-02056897
https://hal.archives-ouvertes.fr/hal-02056897v2
Preprint submitted on 3 Sep 2019
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Geometry of tangent and cotangent bundles on statistical manifolds
Tat Dat Tô, Stéphane Puechmorel
To cite this version:
Tat Dat Tô, Stéphane Puechmorel. Geometry of tangent and cotangent bundles on statistical mani-
folds. 2019. �hal-02056897v2�
statistical manifolds
Tat Dat Tˆ o and St´ ephane Puechmorel
Ecole Nationale de l’Aviation Civile, Unversit´ e F´ ed´ erale de Toulouse 7, Avenue Edouard Belin, FR-31055 Toulouse, France
{tat-dat.to, stephane.puechmorel}@enac.fr
Abstract. This note surveys some results on the geometric structure on the tangent bundle and cotangent bundle of statistical manifolds. In particular we give a description for the tangent bundle of exponential models parametrized on convex domains.
Keywords: Statistical manifolds · Hessian manifolds · almost K¨ ahler structure.
1 Statistical manifolds
We recall here the definition of statistical manifolds following [10, 2, 1, 3]. Let (M, g) be a Riemannian manifold. Denote Γ (T M ) the space of vector fields on M . A connection ∇ on M is a bilinear map
∇ : Γ (T M) × Γ (T M ) → Γ (T M ) (1)
(X, Y ) 7→ ∇ X Y (2)
satisfying for all f ∈ C
∞(M ), X, Y, Z ∈ Γ (T M )
1. ∇ f X Y = f ∇ X Y , i.e, ∇ is C
∞(M, R)-linear in the first variable;
2. ∇ X (f Y ) = X(f )Y + f ∇ X Y , i.e, ∇ satisfies Leibniz rule in the second vari- able.
A connection ∇
∗is called dual connection of ∇ with respect to g if
∇ Z g(X, Y ) = g(∇ Z X, Y ) + g(X, ∇
∗Z Y )
for all X, Y, Z ∈ Γ (T M). It is straightforward that ∇
∗is defined uniquely by the last identity.
Definition 1. A triple (M, g, ∇) is called a statistical manifold if ∇ and ∇
∗are torsion free. We also call (g, ∇, ∇
∗) a dualistic structure on M .
Assume that (g, ∇, ∇
∗) is a dualistic structure on M . Then the tensor T defined by
T (X, Y, Z) = g(∇
∗X Y − ∇ X Y, Z), ∀X, Y, Z ∈ Γ (T M )
is symmetric on all three entries. The tensor T has been called the skewness
tensor by Lauritzen [10]. Conversely, it follows from Lauritzen [10] that:
2 Tˆ o et al.
Theorem 1. A metric g and a symmetric 3-tensor T yields a dualistic structure with torsion-free connections.
Denote ˆ ∇ the Levi-Civita connection with respect to g, then we have
∇ ˆ = 1
2 (∇ + ∇
∗) and T = 2( ˆ ∇ − ∇).
From now we have another way to define a statistical manifold by a triple (M, g, T ), where g is a Riemannian metric on M and T is a symmetric 3-tensor.
Definition 2. A smooth map φ from a statistical manifold (M 1 , g 1 , T 1 ) to a statistical manifold (M 2 , g 2 , T 2 ) is called a statistical isomorphism if φ is an diffeomorphism of M 1 into M 2 such that g 1 = φ
∗(g 2 ) and T 1 = φ
∗(T 2 ).
2 Tangent and Cotangent bundles of statistical manifolds
2.1 Almost K¨ ahler structure on tangent bundles
We recall here the work of Dombrowski [6] on the geometry of the tangent bundle. As a corollary, the tangent bundles of a statisticcal manifolds admit a canonical almost K¨ ahler structure.
Let (M, g) be a Riemannian manifold and ∇ be a connection on M . Denote by π : T M → M the canonical projection, then π
∗:= dπ : T T M → T M is a C
∞map. We denote by V ξ T M the vector space ker(π
∗|TξT M
). Then the connection
∇ induces a direct sum decomposition on T T M at any ξ ∈ T M :
T ξ T M = V ξ T M ⊕ H ξ T M (3) where V ξ T M (reps. H ξ T M ) is called the vertical (reps. horizontal) space of T ξ T M . Indeed, for any a normal neighborhood U (with respect to ∇) of p ∈ M , there is a canonical map
τ : π
−1(U ) → T p M
defined as follows: for ξ ∈ π
−1(U ), τ(ξ) is the parallel translation with respect to ∇ of ξ along the unique geodesic arc in U joining q = π(ξ) and p. We define the connection map K : T T M → T M as follows: for any ξ ∈ T M , A ∈ T ξ T M , and c : t → c(t) a path in T M with ˙ c(0) = A, then
KA = lim
t→0
τ(c(t)) − ξ
t .
Then we define H ξ T M := ker(K
|TξT M).
Let u 1 , . . . , u n ∈ C
∞(M ) be a coordinate system on M . Define then a coor- dianate system v 1 , . . . , v 2n ∈ C
∞(T M ) on T M as follows:
v i := u i ◦ π ∀i = 1, . . . , n (4)
v n+i (X ) := du i (X) ∀i = 1, . . . , n, and ∀X ∈ T M. (5)
We also denote
X i = ∂
∂u i and A j = ∂
∂v j , (6)
for i = 1, . . . , n and j = 1, . . . , 2n. Then for any ξ = P n
i=1 x i X i ∈ T M with x i ∈ C
∞(M ), and A = P 2n
j=1 a j A j ∈ Γ (T M ) with a j ∈ C
∞(T M ), we have (π
∗A) ξ =
n
X
i=1
a i (ξ)X i
and
(KA) ξ =
n
X
i,j,k=1
(a n+i (ξ) + Γ i jk a j (ξ)x i )X i , (7) where Γ i jk = (∇ X
jX k )u i .
We now define an almost complex structure J on T M as follows. For ξ ∈ T M , the exists for A ∈ T ξ T M a unique element in T ξ T M denoted by J A such that π
∗(J A) = −KA and K(J A) = π
∗A. The map J : T T M → T M is thus an almost complex structure for T M characterized by
π
∗◦ J = −K, K ◦ J = π
∗. (8) It follows from [6] that for any X ∈ Γ (T M ), there exist unique X h ∈ Γ (T T M ), called horizontal lift and unique X v ∈ Γ (T T M ), called vertical lift satisfying
π
∗(X ξ h ) = X π(ξ) , π
∗(X ξ v ) = O π(ξ) , KX ξ h = O π(ξ) , KX ξ v = X π(ξ) . By the definition we have J X h = X v and J X v = −X h for ant X ∈ Γ (T M ).
We also have a natural Riemannian metric ˜ g on T M , namely Sasaki metric (cf. [13, 6]) induced from (g, ∇):
˜
g(A, B) := g(π
∗A, π
∗B) + g(KA, KB), ∀A, B ∈ T ξ T M. (9) Observe J is g-compatible, i.e, ˜ g(J A, J B) = ˜ g(A, B), therefore we can define a hermitian metric ˜ h and its K¨ ahler form ˜ ω on T T M by
˜ h(A, B) = ˜ g(A, B) + i˜ g(A, J B), ∀A, B ∈ T ξ T M, (10) and
˜
ω(A, B) = ˜ h(A, J B), ∀A, B ∈ T ξ T M. (11) Then it follows from Dombrowski [6] and Satoh [14] that:
Theorem 2. Let (M, g) be a Riemannian manifold with a torsion-free connec-
tion ∇, then (T M, ˜ h, J) is an almost-Hermitian manifold. It is almost-K¨ ahlerian
if and only if ∇
∗is torsion-free (so that (M, g, ∇) is statistical). Furthermore,
(T M, ˜ h, J) is a K¨ ahler manifold if and only if ∇ is flat, i.e the Riemannian
curvature of ∇ vanishes.
4 Tˆ o et al.
This theorem gives a bridge between information geometry and complex geome- try. As consequence we have the following characterization of statistical isomor- phism.
Theorem 3. A smooth map φ : (M 1 , g 1 , T 1 ) → (M 2 , g 2 , T 2 ) on statistical man- ifolds is a statistical isomorphism if and only if ϕ := dφ : (T M 1 , h ˜ 1 , J 1 ) → (T M 2 , ˜ h 2 , J 2 ) is an isomorphism of almost-K¨ ahler manifolds, i.e, ϕ is a diffeo- morphism satisfying ϕ
∗˜ h 2 = ˜ h 1 and ϕ
∗J 2 = J 1 .
Proof. The complex structure on T M 1 is defined by g 1 and ∇ 1 = ˆ ∇ 1 − 1 2 T 1 , where ˆ ∇ 1 is the Levi-Civita connection of g 1 . The complex structure on T M 2 is defined in the same way.
Suppose that φ : (M 1 , g 1 , T 1 ) → (M 2 , g 2 , T 2 ) is a statistical isomorphism.
Then it follows from the definition that g 1 = φ
∗g 2 and T 1 = φ
∗T 2 , hence we get ϕ
∗˜ h 2 = ˜ h 1 and ϕ
∗J 2 = J 1 .
Conversely, if ϕ := dφ : (T M 1 , ˜ h 1 , J 1 ) → (T M 2 , ˜ h 2 , J 2 ) is an isomorphism of almost-K¨ ahler manifolds, then φ is diffeomorphic. Since ϕ
∗˜ h 2 = ˜ h 1 , the formulas (10) and (9) imply that φ
∗(g 2 ) = g 1 . Finally, the identities (8) and ϕ
∗J 2 = J 1 imply φ
∗K 2 = K 1 , where K 1 , K 2 are the connections maps of (M 1 , g 1 , T 1 ), (M 2 , g 2 , T 2 ). Since the connection ∇ 1 , ∇ 2 can be defined by K 1 , K 2
(see (7)), so do T 1 and T 2 . This show that φ
∗T 2 = T 1 , hence φ is a statistical isomorphism as required.
For more results on tangent space of statistical manifold we refer to [11, 4].
2.2 Cotangent bundle
Let M be a manifold, then its cotangent bundle T
∗M has a natural symplectic structure defined as follows. Denote by ¯ π : T
∗M → M the canonical projection which assigns to each form p ∈ T q
∗M its base point q ∈ M . Define the Liouville 1-form θ on T
∗M by
hθ, Xi = hp, π ¯
∗X i, ∀X ∈ T p T
∗M. (12) Then ω = dθ is a symplectic form on M . Take a local coordinate system (p 1 , . . . , p n , q 1 , . . . , q n ) on T
∗M , where (q 1 , . . . , q n ) is a local coordinate on M and (p 1 , . . . , p n ) are the coefficients of forms. Then θ = P p j dq j and ω = −dθ = dq j ∧ dp j .
Now let g be a metric on M then, we have the following isomorphism
φ g : T M → T
∗M (13)
ξ 7→ g(·, ξ) (14)
The following result is straightforward from Delano¨ e [5, Th´ eor` eme 2] and Satoh [14].
Theorem 4. Let (M, g) be a Riemannian manifold with a torsion-free connec-
tion ∇. The following are equivalent:
1. (M, g, ∇) is a statistical manifold;
2. (φ g )
∗(HM) is a Lagrangian subspace with respect to the symplectic form ω on T
∗M , i.e, ω vanishes on (φ g )
∗(HM );
3. φ g : (T M, ω) ˜ → (T
∗M, ω) satisfies φ
∗g ω = ˜ ω, where ω ˜ defined in (11).
2.3 K¨ aher and dual flat correspondence
Definition 3. A statistical manifold (M, g, ∇) is said to be ∇-flat if ∇ is a flat connection.
Since ∇ being flat implies that ∇
∗is flat as well, we call a ∇-flat statistical manifold is dual flat. It follows from [2] that any dual flat statistical manifold (M, g, ∇) carries a Hessian structure, i.e g can be locally expressed by g = ∇dϕ, that is,
g ij = ∂ 2 ϕ
∂x i ∂x j
where {x 1 , . . . , x n } is an affine coordinate system with respect to ∇. By Theorem 2, (T M, g, J) is a K¨ ˜ ahler manifold, where ˜ g and J are defined in Section 2.1.
Denote ˆ ∇ the Levi-Civita of (M, g), T = 1 2 ( ˆ ∇ − ∇). Then the (1,3) tensor Q = ∇T is called the Hessian curvature tensor for (g, ∇).
Definition 4. We define first Koszul form α and the the second Koszul form β for (g, ∇) (cf. [9]) by
∇ X vol g = α(X)vol g and β = −∇α.
We recall here some properties of the Hessian curvature (cf. [15])
Proposition 1. Let R ˆ be the Riemannian curvature of g, R ˜ be the Riemannian curvature on the K¨ ahler manifold (T M, g, J). Then ˜
R ˆ ijkl = 1
2 (Q ijkl − Q jikl ), R ˜ = Q ◦ π, R ˜ j k ¯ = β jk ◦ π, where R ˜ j k ¯ is the Ricci tensor on (T M, g, J). ˜
These properties above infer that the second Koszul form β plays a similar role as the Ricci tensor in K¨ ahler geometry.
3 Completeness to tangent bundles of statistical manifolds
In this section we study the completeness to the tangent bundle (T M, ˜ g) of a statistical manifold (M, g, ∇). We first recall the Riemannian submersion.
Definition 5. Let ( ˜ M , g) and (M, h) be two Riemannian manifolds and π :
M ˜ → M be a submersion. Then π is call Riemannian submersion if the isomor-
phism π
∗: H M ˜ → T M is an isometry, where H M ˜ is the horizontal distribution.
6 Tˆ o et al.
We now have the following lemma (see for example [12]).
Lemma 1. Let π : ˜ M → M be a Riemannian submersion. Then we have that (i) for any gedesic γ of M , p ˜ = π
−1(γ(0)), there exists a unique locally defined
horizontal lift ˜ γ of γ with ˜ γ(0) = ˜ p, and ˜ γ is a geodesics of M ˜ .
(ii) for any geodesic γ ˜ of M ˜ . If γ ˜
0(0) is a horizontal vector, then ˜ γ
0(t) is hor- izontal for every t in the domain of γ ˜ and π ◦ γ ˜ is a geodesic of M of the same length as ˜ γ.
(iii) If M ˜ is complete, so is M .
We now prove that a necessary condition so that (T M, ˜ g) is complete is that (M, g) is complete.
Proposition 2. If (T M, g) ˜ is complete, so is (M, g).
Proof. We only need to prove that the canonical map π : (T M, g) ˜ → (M g) is a Riemannian submersion. Indeed, it follow from the definition of Sasaki metric ˜ g that
˜
g(A, B) := g(π
∗A, π
∗B) + g(KA, KB), ∀A, B ∈ T ξ T M.
Therefore, for any A, B ∈ H ξ T M we have
˜
g(A, B) := g(π
∗A, π
∗B)
this implies that π is a Riemannian submersion. Lemma 1 now implies that (M, g) is complete.
Theorem 5. Suppose (M, g, ∇) is a complete statistical manifold, then (T M, g) ˜ is also complete.
Proof. For any p ∈ T M and any ξ ∈ T p T M , the Cauchy-Lipschitz theorem implies the existence of the geodesic ˜ γ on an open interval [0, t 0 ), starting from p with the tangent vector ξ on (T M, ˜ g). We can assume ˜ γ : [0, t 0 ) → T M is a unit-speed geodesic. In order to prove that (T M, g) is complete, we need to ˜ prove that the geodesic is defined on the interval [0, t 0 ] as well.
We now prove that, ˜ γ can be extended beyond t 0 . Indeed, take γ = π
◦γ, then ˜ γ is Lipschitz since ˜ γ has bounded speed. In addition, (M, g) is complete, hence there exists p = lim t→t
−0
γ(t). Take a neighborhood U of p, then π
−1(U ) = U ×R n and ˜ γ(t) = (γ(t), c(t)) ∈ π
−1(U ) for t ∈ (t 0 − , t 0 ) for some > 0, hence c((t 0 − , t 0 )) ⊂ R n . Since the speed of c(t) is also bounded and the restriction of ˜ g on fibers is complete, there exists lim t→t
−0