versus De Donder–Weyl

(1)

Covariant Hamiltonian formalism for the calculus of variations with several variables: Lepage–Dedecker

versus De Donder–Weyl

Fr´ ed´ eric H´ elein

^a

and Joseph Kouneiher

^b

a

Institut de Math´ ematiques de Jussieu, UMR 7586, Universit´ e Denis Diderot – Paris 7, Site de Chevaleret, 16 rue Clisson, 75013 Paris, France

b

LUTH, Laboratoire de l’Univers et de ses Th´ eories, CNRS UMR 8102, Observatoire de Paris, 92195, Meudon Cedex, France

[email protected], [email protected]

Abstract

The main purpose in the present paper is to build a Hamiltonian theory for ﬁelds which is consistent with the principles of relativity.

For this we consider detailed geometric pictures of Lepage theories in the spirit of Dedecker and try to stress out the interplay between the Lepage-Dedecker (LP) description and the (more usual) De Donder- Weyl (DDW) one. One of the main points is the fact that the Legen- dre transform in the DDW approach is replaced by a Legendre correspondence in the LP theory (this correspondence behaves diﬀerently:

ignoring the singularities whenever the Lagrangian is degenerate).

e-print archive: http://lanl.arXiv.org/abs/math-ph/0401046

(2)

1 Introduction

1.1 Presentation

Multisymplectic formalisms are ﬁnite dimensional descriptions of variational problems with several variables (or ﬁeld theories for physicists) analogue to the well-known Hamiltonian theory of point mechanics. For example consider on the set of maps u : R

ⁿ

−→ R a Lagrangian action of the type

L [u] =

Rⁿ

L(x, u(x), ∇ u(x))dx

¹

· · · dx

ⁿ

.

Then it is well-known that the maps which are critical points of L are char- acterized by the Euler–Lagrange equation

_∂x^∂_µ

∂L

∂(∂µu)

=

^∂L_∂u

. By analogy with the Hamiltonian theory we can do the change of variables p

^µ

:=

_∂(∂^∂L

µu)

and deﬁne the Hamiltonian function H(x, u, p) := p

^µ

∂u

∂x

^µ

− L(x, u, ∇ u), where here ∇ u =

_∂u

∂x^µ

is a function of (x, u, p) deﬁned implicitly by p

^µ

:=

∂(∂∂Lµu)

(x, u, ∇ u). Then the Euler-Lagrange equation is equivalent to the generalized Hamilton system of equations

⎧ ⎪

⎪ ⎨

⎪ ⎪

⎩

∂u

∂x

^µ

= ∂H

∂p

^µ

(x, u, p)

µ

∂p

^µ

∂x

^µ

= − ∂H

∂u (x, u, p). (1)

This simple observation is the basis of a theory discovered by T. De Donder [3] and H. Weyl [20] independently in 1935. This theory can be formu- lated in a geometric setting, an analogue of the symplectic geometry, which is governed by the Poincar´ e–Cartan n-form θ := eω + p

^µ

du ∧ ω

_µ

(where ω := dx

¹

∧ · · · ∧ dx

ⁿ

and ω

_µ

:= ∂

_µ

ω) and its diﬀerential Ω := dθ, often called multisymplectic (or polysymplectic form).

Although similar to mechanics this theory shows up deep diﬀerences. In

particular there exist other theories which are analogues of Hamilton’s one

as for instance the ﬁrst historical one, constructed by C. Carath´ eodory in

1929 [2]. In fact, as realized by T. Lepage in 1936 [16], there are inﬁnitely

many theories, due to the fact that one could ﬁx arbitrary the value of some

tensor in the Legendre transform (see also [18], [6]). Much later on, in 1953,

P. Dedecker [4] built a geometrical framework in which all Lepage theories

(3)

are embedded. The present paper, which is a continuation of [9], is devoted to the study of the Lepage–Dedecker theory. We also want to compare this formalism with the more popular De Donder–Weyl theory.

First recall that the range of application of the De Donder–Weyl theory is restricted in principle to variational problems on sections of a bundle F . The right framework for it, as expounded e.g. in [8], consists in using the ﬁrst jet bundle J

¹

F and its aﬃne dual

J

¹

_∗

F as analogues of the tangent and the cotangent bundles for mechanics respectively. For non degenerate variational problems the Legendre transform induces an immersion of J

¹

F in

J

¹

_∗

F . In contrast the Lepage theories can be applied to more general situations but involve, in general, many more variables and so are more complicated to deal with, as noticed in [15]. This is probably the reason why most papers on the subject focus on the De Donder–Weyl theory, e.g. [14], [8]. The general idea of Dedecker in [4] for describing Lepage’s theories is the following: if we view variational problems as being deﬁned on n-dimensional submanifolds embedded in a (n + k)-dimensional manifold N , then what plays the role of the (projective) tangent bundle to space-time in mechanics is the Grassmann bundle Gr

ⁿ

N of oriented n-dimensional subspaces of tangent spaces to N . The analogue of the cotangent bundle in mechanics is Λ

ⁿ

T

^∗

N . Note that dimGr

ⁿ

N = n + k + nk so that dimΛ

ⁿ

T

^∗

N = n + k +

^(n+k)!_n!k!

is strictly larger than dimGr

ⁿ

N + 1 unless n = 1 (classical mechanics) or k = 1 (submanifolds are hypersurfaces). This diﬀerence between the dimensions explains the multiplicity of Lepage theories: as shown in [4], we substitute to the Legendre transform a Legendre correspondence which associates to each n-subspace T ∈ Gr

_qⁿ

N (a “generalized velocity”) an aﬃne subspace of Λ

ⁿ

T

_q^∗

N called pseudofibre by Dedecker. Then two points in the same pseudofiber do actually represent the same physical (infinitesimal) state, so that the coordinates on Λ

ⁿ

T

^∗

N , called momento¨ıdes by Dedecker do not represent physically observable quantities. In this picture any choice of a Lepage theory corresponds to a selection of a submanifold of Λ

ⁿ

T

^∗

N , which

— when the induced Legendre transform is a well-deﬁned map — intersects transversally each pseudoﬁber at one point (see Figure 1.1): so the Legendre correspondence specializes to a Legendre transform. For instance the De Donder–Weyl theory can be recovered in this setting by the restriction to some submanifold of Λ

ⁿ

T

^∗

N (see Section 2.2).

In [9] and in the present paper we consider a geometric pictures of Lepage theories in the spirit of Dedecker and we try to stress out the interplay between the Lepage–Dedecker description and the De Donder–Weyl one.

Roughly speaking a comparison between these two points of view shows up

some analogy with some aspects of the projective geometry, for which there

(4)

pseudofibres

a choice of a Lepage theory

Figure 1:

Pseudoﬁbers which intersect a submanifold corresponding to the choice of a Lepage theory

is no perfect system of coordinates, but basically two: the homogeneous ones, more symmetric but redundant (analogue to the Dedecker description) and the local ones (analogue to the choice of a particular Lepage theory like e.g. the De Donder–Weyl one). Note that both points of view are based on the same geometrical framework, a multisymplectic manifold:

Definition 1.1. Let M be a diﬀerential manifold. Let n ∈ N be some positive integer. A smooth (n + 1)-form Ω on M is a multisymplectic form if and only if

(i) Ω is non degenerate, i.e. ∀ m ∈ M , ∀ ξ ∈ T

_m

M , if ξ Ω

_m

= 0, then ξ = 0

(ii) Ω is closed, i.e. dΩ = 0.

Any manifold M equipped with a multisymplectic form Ω will be called a multisymplectic manifold.

For the De Donder–Weyl theory we choose M to be J

¹

_∗

F and for the Lepage–Dedecker theory M is Λ

ⁿ

T

^∗

N . In both descriptions solutions of the variational problem correspond to n-dimensional submanifolds Γ (analogues of Hamiltonian trajectories: we call them Hamiltonian n-curves) and are characterized by the Hamilton equation X Ω = ( − 1)

ⁿ

d H , where X is a n-multivector tangent to Γ, H is a (Hamiltonian) function deﬁned on M and by “ ” we mean the interior product.

We may insist on the point that many contributions on the De Donder–Weyl theory are devoted to the construction of multisymplectic manifolds having the same dimension as the Lagrangian formulation conﬁguration space, i.e.

J

¹

F , either by pulling back the multisymplectic form by the Legendre map

(5)

as in [8], or by working on a quotient or a submanifold of (J

¹

)

^∗

F as for in- stance in [7] (see [5] for a comparaison between the diﬀerent points of view).

However when dealing with Lepage–Dedecker theories, one is forced to aban- don these points of view and to work with multisymplectic manifolds whose dimension is larger than the number of physical variables. The advantage is however is that we do not need for any extra structure, like connections, and in particular in our setting the Hamiltonian function is thought as a global function on M .

Consequently, in Section 2 we present a complete derivation of the (Dedecker) Legendre correspondence and of the generalized Hamilton equations, using a method that does not rely on any trivialization or connection on the Grass- mannian bundle. A remarkable property, which is illustrated in this paper through the examples given in Paragraph 2.2.2, is that when n and k are greater than 2, the Legendre correspondence is generically never degenerate.

The more spectacular example is when the Lagrangian density is a constant function — the most degenerate situation one can think about — then the Legendre correspondence is well-deﬁned almost everywhere except precisely along the De Donder–Weyl submanifold. We believe that such a phenomenon was not noticed before; it however may be useful when one deals for example with the bosonic string theory with a skewsymmetric 2-form on the target manifold (a “B-ﬁeld”, as discussed in [9] and in subsection 2.2, example 5) or with the Yang–Mills action in 4 dimensions with a topological term in the Lagrangian: then the De Donder–Weyl formalism may fail but one can cure this degenerateness by using another Lepage theory or by working in the full Dedecker setting.

In this paper we also stress out another aspect of the (Dedecker) Legen- dre correspondence: one expects that the resulting Hamiltonian function on Λ

ⁿ

T

^∗

N should satisfy some condition expressing the “projective” invariance along each pseudoﬁber. This is indeed the case. On the one hand we observe in Section 2.1 that any smoothly continuous deformation of a Hamiltonian n-curve along directions tangent to the pseudoﬁbers remains a Hamiltonian n-curve

¹

(Corollary 2.1). On the other hand we give in Section 4.3 an intrin- sic characterization of the subspaces tangent to pseudofibers. This motivates the definition given in Section 3.3 of the generalized pseudofiber directions on any multisymplectic manifold.

1A property quite similar to a gauge theory behavior although of diﬀerent meaning.

Here we are interested by desingularizing the theory and avoid the problems related to the presence of a constraints.

(6)

Beside these properties in this paper and in its companion paper [11] we wish to address other kind of questions related to the physical gain of these theories: the main advantage of multisymplectic formalisms is to offer us a Hamiltonian theory which is consistent with the principles of Relativity, i.e. being covariant. Recall for instance that for all the multisymplectic for- malisms which have been proposed one does not need to use a privilege time coordinate. One of our ambitions in this paper was to try to extend this democracy between space and time coordinates to the coordinates on fiber manifolds (i.e. along the fields themselves). This is quite in the spirit of the Kaluza–Klein theory and its modern avatars: 11-dimensional supergrav- ity, string theory and M-theory. This concern leads us naturally to replace De Donder–Weyl by the Dedecker theory. In particular we do not need in our formalism to split the variables into the horizontal (i.e. corresponding to space-time coordinates) and vertical (i.e. non horizontal) categories.

Moreover we may think that we start from a (hypothetical) geometrical model where space-time and fields variables would not be distinguished a priori and then ask how to make sense of a space-time coordinate function (that we call a “r-regular” in Section 3.2). A variant of this question would be how to define a constant time hypersurface (that we call a “slice” in Sec- tion 3.2) without referring to a given space-time background. We propose in Section 3.2 a definition of r-regular functions and of slices which, roughly speaking, requires a slice to be transversal to all Hamiltonian n-curves. Here the idea is that the dynamics only (i.e. the Hamiltonian equation) should de- termine what are the slices. We give in Section 4.2 a characterization of these slices in the case where the multisymplectic manifold is Λ

ⁿ

T

^∗

N .

These questions are connected to the concept of observable functionals over the set of solutions of the Hamilton equation. First because by using a codi- mension r slice Σ and an (n − r)-form F on the multisymplectic manifold one can deﬁne such a functional by integrating F over the the intersection of Σ with a Hamiltonian curve. And second because one is then led to impose conditions on F in such a way that the resulting functional carries only dy- namical information. The analysis of these conditions is the subject of our companion paper [11]. And we believe that the conditions required on these forms are connected with the deﬁnitions of r-regular functions given in this paper, although we have not completely elucidated this point.

Lastly in a future paper [12] we investigate gauge theories, addressing the

question of how to formulate a fully covariant multisymplectic for them.

(7)

Note that the Lepage–Dedecker theory expounded here does not answer this question completely, because a connection cannot be seen as a submanifold.

We will show there that it is possible to adapt this theory and that a conve- nient covariant framework consists in looking at gauge fields as equivariant submanifolds over the principal bundle of the theory, i.e. satisfying some suitable zeroth and first order differential constraints.

1.2 Notations

The Kronecker symbol δ

^µ_ν

is equal to 1 if µ = ν and equal to 0 otherwise.

We shall also set

δ

^µ_ν₁¹_···ν^···µ_p^p

:=

δ

_ν^µ₁¹

. . . δ

^µ_ν_p¹

.. . .. . δ

_ν^µ₁^p

. . . δ

^µ_ν_p^p

.

In most examples, η

_µν

is a constant metric tensor on R

ⁿ

(which may be Euclidean or Minkowskian). The metric on his dual space his η

^µν

. Also, ω will often denote a volume form on some space-time: in local coordinates ω = dx

¹

∧· · ·∧ dx

ⁿ

and we will use several times the notation ω

_µ

:=

_∂x^∂µ

ω, ω

_µν

:=

_∂x^∂µ

∧

_∂x^∂ν

ω, etc. Partial derivatives

_∂x^∂µ

and

_∂p ^∂

α1···αn

will be some- time abbreviated by ∂

_µ

and ∂

^α¹^···αⁿ

respectively.

When an index or a symbol is omitted in the middle of a sequence of in- dices or symbols, we denote this omission by . For example a

_i

1···ip···in

:=

a

_i₁_···i_p−1_i_p+1_···i_n

, dx

^α¹

∧· · ·∧ dx

^α^µ

∧· · ·∧ dx

^αⁿ

:= dx

^α¹

∧· · ·∧ dx

^α^µ−1

∧ dx

^α^µ+1

∧

· · · ∧ dx

^αⁿ

.

If N is a manifold and FN a ﬁber bundle over N , we denote by Γ( N , FN )

the set of smooth sections of FN . Lastly we use the following notations

concerning the exterior algebra of multivectors and diﬀerential forms. If

N is a diﬀerential N -dimensional manifold and 0 ≤ k ≤ N, Λ

^k

T N is

the bundle over N of k-multivectors (k-vectors in short) and Λ

^k

T

^∗

N is

the bundle of diﬀerential forms of degree k (k-forms in short). Setting

ΛT N := ⊕

^N_k=0

Λ

^k

T N and ΛT

^∗

N := ⊕

^N_k=0

Λ

^k

T

^∗

N , there exists a unique

duality evaluation map between ΛT N and ΛT

^∗

N such that for every decom-

posable k-vector ﬁeld X, i.e. of the form X = X

₁

∧ · · · ∧ X

_k

, and for every

l-form µ, then X, µ = µ(X

₁

, · · · , X

_k

) if k = l and = 0 otherwise. Then

interior products and are operations deﬁned as follows. If k ≤ l, the

(8)

product : Γ( N , Λ

^k

T N ) × Γ( N , Λ

^l

T

^∗

N ) −→ Γ( N , Λ

^l−k

T

^∗

N ) is given by Y, X µ = X ∧ Y, µ , ∀ (l − k)-vector Y.

And if k ≥ l, the product : Γ( N , Λ

^k

T N ) × Γ( N , Λ

^l

T

^∗

N ) −→ Γ( N , Λ

^k−l

T N ) is given by

X µ, ν = X, µ ∧ ν , ∀ (k − l)-form ν.

2 The Lepage–Dedecker theory

We expound here a Hamiltonian formulation of a large class of ﬁrst order variational problems in an intrinsic way. Details and computations in coor- dinates can be found in [14], [9].

2.1 Hamiltonian formulation of variational problems with several variables

2.1.1 Lagrangian formulation

The category of Lagrangian variational problems we start with is described as follows. We consider n, k ∈ N

^∗

and a smooth manifold N of dimension n + k; N will be equipped with a closed nowhere vanishing “space-time volume” n-form ω. We deﬁne

• the Grassmannian bundle Gr

ⁿ

N , it is the ﬁber bundle over N whose ﬁber over q ∈ N is Gr

_qⁿ

N , the set of all oriented n-dimensional vector subspaces of T

_q

N .

• the subbundle Gr

^ω

N := {(q, T ) ∈ Gr

ⁿ

N /ω

_q|T

> 0}.

• the set G

^ω

, it is the set of all oriented n-dimensional submanifolds G ⊂ N , such that ∀ q ∈ G, T

_q

G ∈ Gr

_q^ω

N (i.e. the restriction of ω on G is positive everywhere).

Lastly we consider any Lagrangian function L, i.e. a smooth function L : Gr

^ω

N −→ R . Then the Lagrangian of any G ∈ G

^ω

is the integral

L [G] :=

G

L (q, T

_q

G) ω (2)

We say that a submanifold G ∈ G

^ω

is a critical point of L if and only if, for any compact K ⊂ N , G∩K is a critical point of L

_K

[G] :=

G∩K

L (q, T

_q

G) ω

(9)

with respect to variations with support in K.

It will be useful to represent Gr

ⁿ

N diﬀerently, by means of n-vectors. For any q ∈ N , we deﬁne D

_qⁿ

N to be the set of decomposable n-vectors

²

, i.e. elements z ∈ Λ

ⁿ

T

_q

N such that there exist n vectors z

₁

,...,z

_n

∈ T

_q

N satisfying z = z

₁

∧ · · · ∧ z

_n

. Then D

ⁿ

N is the ﬁber bundle whose ﬁber at each q ∈ N is D

_qⁿ

N . Moreover the map

D

_qⁿ

N −→ Gr

_qⁿ

N z

₁

∧ · · · ∧ z

_n

−→ T (z

₁

, · · · , z

_n

),

where T (z

₁

, · · · , z

_n

) is the vector space spanned and oriented by (z

₁

, · · · , z

_n

), induces a diﬀeomorphism between

D

ⁿ_q

N \ { 0 }

/ R

^∗₊

and Gr

_qⁿ

N . If we set also D

^ω_q

N := { (q, z) ∈ D

ⁿ_q

N /ω

_q

(z) = 1 } , the same map allow us also to identify Gr

_q^ω

N with D

^ω_q

N .

This framework includes a large variety of situations as illustrated below.

Example 1 — Classical point mechanics — The motion of a point mov- ing in a manifold Y can be represented by its graph G ⊂ N := R × Y. If π : N −→ R is the canonical projection and t is the time coordinate on R , then ω := π

^∗

dt.

Example 2 — Maps between manifolds — We consider maps u : X −→ Y, where X and Y are manifolds of dimension n and k respectively and X is equipped with some non vanishing volume form ω. A ﬁrst order La- grangian density can represented as a function l : T Y ⊗

X ×Y

T

^∗

X −→ R , where T Y ⊗

_{X ×Y}

T

^∗

X := { (x, y, v)/(x, y) ∈ X × Y , v ∈ T

_y

Y ⊗ T

_x^∗

X } . (We use here a notation which exploits the canonical identiﬁcation of T

_y

Y ⊗ T

_x^∗

X with the set of linear mappings from T

_x

X to T

_y

Y ; note that the bundle T Y ⊗

_{X ×Y}

T

^∗

X −→ X ×Y is diﬀeomorphic to the ﬁrst jet bundle J

¹

F −→ F , where F = X × Y is a trivial bundle over X ). The action of a map u is

[u] :=

X

l(x, u(x), du(x))ω.

In local coordinates x

^µ

such that ω = dx

¹

∧· · ·∧ dx

ⁿ

, critical points of satisfy the Euler-Lagrange equation

_n

µ=1 ∂

∂x^µ

∂l

∂vⁱµ

(x, u(x), du(x))

=

_∂y^∂l_i

(x, u(x), du(x)),

∀i = 1, · · · , k..

Then we set N := X × Y and denoting by π : N −→ X the canonical pro- jection, we use the volume form ω π

^∗

ω. Any map u can be represented by

2another notation for this set would beDΛⁿTqN, for it reminds that it is a subset of ΛⁿTqN, but we have chosen to lighten the notation.

(10)

its graph G

_u

:= { (x, u(x))/ x ∈ X } ∈ G

^ω

, (and conversely if G ∈ G

^ω

then the condition ω

_|G

> 0 forces G to be the graph of some map). For all (x, y) ∈ N we also have a diﬀeomorphism

T

_y

Y ⊗ T

_x^∗

X −→ Gr

_(x,y)^ω

N D

^ω_(x,y)

N

v −→ T (v),

where T (v) is the graph of the linear map v : T

_x

X −→ T

_y

Y . Then if we set L(x, y, T (v)) := l(x, y, v), the action deﬁned by (2) coincides with .

Example 3 — Sections of a ﬁber bundle — This is a particular case of our setting, where N is the total space of a ﬁber bundle with base manifold X . The set G

^ω

is then just the set of smooth sections.

2.1.2 The Legendre correspondence

Now we consider the manifold Λ

ⁿ

T

^∗

N and the projection mapping Π : Λ

ⁿ

T

^∗

N −→ N . We shall denote by p an n-form in the ﬁber Λ

ⁿ

T

_q^∗

N . There is a canonical n-form θ called the Poincar´ e–Cartan form deﬁned on Λ

ⁿ

T

^∗

N as follows: ∀(q, p) ∈ Λ

ⁿ

T

^∗

N , ∀X

₁

, · · · , X

_n

∈ T

_(q,p)

(Λ

ⁿ

T

^∗

N ),

θ

_(q,p)

(X

₁

, · · · , X

_n

) := p (Π

_∗

X

₁

, · · · , Π

_∗

X

_n

) = Π

_∗

X

₁

∧ · · · ∧ Π

_∗

n, p , where Π

_∗

X

_µ

:= dΠ

_(q,p)

(X

_µ

). If we use local coordinates (q

^α

)

_1≤α≤n+k

on N , then a basis of Λ

ⁿ

T

_q^∗

N is the family (dq

^α¹

∧ · · · ∧ dq

^αⁿ

)

_1≤α₁<···<αn≤n+k

and we denote by p

_α₁_···α_n

the coordinates on Λ

ⁿ

T

_q^∗

N in this basis. Then θ writes

θ :=

1≤α1<···<αn≤n+k

p

_α₁_···α_n

dq

^α¹

∧ · · · ∧ dq

^αⁿ

. (3) Its diﬀerential is the multisymplectic form Ω := dθ and will play the role of generalized symplectic form.

In order to build the analogue of the Legendre transform we consider the ﬁber bundle Gr

^ω

N ×

_N

Λ

ⁿ

T

^∗

N := {(q, z, p)/q ∈ N , z ∈ Gr

_q^ω

N D

^ω_q

N , p ∈ Λ

ⁿ

T

_q^∗

N } and we denote by Π : Gr

^ω

N ×

_N

Λ

ⁿ

T

^∗

N −→ N the canonical projection. To summarize:

Gr

^ω

N ×

N

Λ

ⁿ

T

^∗

N

Π^L

Π^H //

Π

((R

RR RR RR RR RR RR

RR

Λ

ⁿ

T

^∗

N

Π

ı

M

oo

Π|M

zzuuuuuuuuu

Gr

ⁿ

N

^oo _ı

Gr

^ω

N

^//

N

(11)

We deﬁne on Gr

^ω

N ×

_N

Λ

ⁿ

T

^∗

N the function W (q, z, p) := z, p − L(q, z).

Note that for each (q, z, p) there a vertical subspace V

_(q,z,p)

⊂ T

_(q,z,p)

(Gr

^ω

N ×

N

Λ

ⁿ

T

^∗

N ), which is canonically deﬁned as the kernel of

d Π

_(q,z,p)

: T

_(q,z,p)

(Gr

^ω

N ×

_N

Λ

ⁿ

T

^∗

N ) −→ T

_q

N .

We can further split V

_(q,z,p)

T

_z

D

_q^ω

N ⊕ T

_p

Λ

ⁿ

T

_q^∗

N , where T

_z

D

^ω_q

N KerdΠ

^H_(q,z,p)

and T

_p

Λ

ⁿ

T

_q^∗

N KerdΠ

^L_(q,z,p)

. Then, for any function F deﬁned on Gr

^ω

N ×

_N

Λ

ⁿ

T

^∗

N , we denote respectively by ∂F/∂z(q, z, p) and ∂F/∂p(q, z, p) the re- strictions of the diﬀerential

³

dF

_(q,z,p)

on respectively T

_z

D

^ω_q

N and T

_p

Λ

ⁿ

T

_q^∗

N . Instead of a Legendre transform we shall rather use a Legendre correspon- dence: we write

(q, z) ←→ (q, p) if and only if ∂W

∂z (q, z, p) = 0. (4) Let us try to picture geometrically the situation (see ﬁgure 2.1.2): D

_q^ω

N is

Tq qN

Λn

Dω DqN

z

T z ω

N

Figure 2:

TzDq^ωN is a vector subspace of ΛⁿTqN

a smooth submanifold of dimension nk of the vector space Λ

ⁿ

T

_q

N , which is of dimension

^(n+k)!_n!k!

; T

_z

D

_q^ω

N is thus a vector subspace of Λ

ⁿ

T

_q

N . And

∂L∂z

(q, z) or

^∂W_∂z

(q, z, p) can be understood as linear forms on T

_z

D

_q^ω

N whereas p ∈ Λ

ⁿ

T

_q^∗

N as a linear form on Λ

ⁿ

T

_q

N . So the meaning of the right hand side of (4) is that the restriction of p at T

_z

D

_q^ω

N coincides with

^∂L_∂z

(q, z, p):

p

_|T_z_Dω

qN

= ∂L

∂z (q, z). (5)

3However in order to make sense of “∂F/∂q(q, z, p)” we would need to deﬁne a “horizontal” subspace ofT(q,z,p)(Gr^ωN ×NΛⁿT^∗N), which requires for instance the use of a connection on the bundleGr^ωN ×N ΛⁿT^∗N −→ N. Indeed such a horizontal subspace prescribes a inertial law on N, such a law would have a sense on a Galilee or Minkowski space-time but not in general relativity.

(12)

Given (q, z) ∈ Gr

^ω

N we deﬁne the enlarged pseudofiber in q to be:

P

_q

(z) := { p ∈ Λ

ⁿ

T

_q^∗

N / ∂W

∂z (q, z, p) = 0 } .

In other words, p ∈ P

_q

(z) if it is a solution of (5). Obviously P

_q

(z) is not empty; moreover given some p

₀

∈ P

_q

(z),

p

₁

∈ P

_q

(z), ⇐⇒ p

₁

− p

₀

∈

T

_z

D

_q^ω

N

_⊥

:= { p ∈ Λ

ⁿ

T

_q^∗

N / ∀ ζ ∈ T

_z

D

_q^ω

N , p(ζ ) = 0 } . (6) So P

_q

(z) is an aﬃne subspace of Λ

ⁿ

T

_q^∗

N of dimension

^(n+k)!_n!k!

− nk. Note that in case where n = 1 (the classical mechanics of point) then dim P

_q

(z) = 1:

this is due to the fact that we are still free to ﬁx arbitrarily the momentum component dual to the time (i.e. the energy)

⁴

.

We now deﬁne

P

_q

:=

z∈Dq^ωN

P

_q

(z) ⊂ Λ

ⁿ

T

_q^∗

N , ∀ q ∈ N

and we denote by P := ∪

_q∈N

P

_q

the associated bundle over N . We also let, for all (q, p) ∈ Λ

ⁿ

T

^∗

N ,

Z

_q

(p) := { z ∈ Gr

_q^ω

N /p ∈ P

_q

(z) } .

It is clear that Z

_q

(p) = ∅ ⇐⇒ p ∈ P

_q

. Now in order to go further we need to choose some submanifold M

q

⊂ P

q

, its dimension is not ﬁxed a priori.

Legendre Correspondence Hypothesis — We assume that there exists a subbundle manifold M ⊂ P ⊂ Λ

ⁿ

T

^∗

N over N where dim M =: M such that,

• for all q ∈ N the ﬁber M

_q

is a smooth submanifold, possibly with boundary, of dimension 1 ≤ M − n − k ≤

^(n+k)!_n!k!

• for any (q, p) ∈ M, Z

_q

(p) is a non empty smooth connected submani- fold of Gr

^ω_q

N

4a simple but more interesting example is provided by variational problems on maps u:R²−→R². Then one is led to the multisymplectic manifold Λ²T^∗R⁴. And given any (q, z) ∈ Gr^ωR⁴ the enlarged pseudoﬁber Pq(z) ⊂Λ²T^∗R⁴ is an aﬃne plane parallel to R

v1¹v²2−v1²v¹2

dx¹∧dx²−ijv^jνdyⁱ∧dx^ν+dy¹∧dy²

⊕Rdx¹∧dx², where (using the notations of Example 2)T(v) =z. For details see Paragraph 2.2.2.

(13)

• if z

₀

∈ Z

_q

(p), then we have Z

_q

(p) = { z ∈ D

_q^ω

N / ∀ p ˙ ∈ T

_p

M

_q

, z − z

₀

, p ˙ = 0 } .

Remark — In the case where M =

^(n+k)!_n!k!

+ n + k, then M

_q

is an open subset of Λ

ⁿ

T

_q^∗

N and so T

_p

M

_q

Λ

ⁿ

T

_q^∗

N . Hence the last assumption of the Legendre Correspondence Hypothesis means that Z

_q

(p) is reduced to a point. In general this condition will imply that the inverse correspondence can be rebuild by using the Hamiltonian function (see Lemma 2.2 below).

Lemma 2.1. Assume that the Legendre correspondence hypothesis is true.

Then for all (q, p) ∈ M , the restriction of W to { q }× Z

_q

(p) ×{ p } is constant.

Proof — Since Z

_q

(p) is smooth and connected, it suﬃces to prove that W is constant along any smooth path inside { (q, z, p)/q, p ﬁxed , z ∈ Z

_q

(p) } . Let s −→ z(s) be a smooth path with values into Z

_q

(p), then

d

ds (W (q, z(s), p)) = ∂W

∂z (q, z(s), p) dz

ds

= 0,

because of (4).

A straightforward consequence of Lemma 2.1 is that we can deﬁne the Hamiltonian function H : M −→ R by

H (q, p) := W (q, z, p), where z ∈ Z

_q

(p), i.e. ∂W

∂z (q, z, p) = 0.

Any function f constructed this way will be called Legendre Image Hamiltonian function. In the following, for all (q, p) ∈ M and for all z ∈ D

_qⁿ

N we denote by

z

_|T_p_M_q

: T

_p

M

q

−→ R

˙

p −→ z, p ˙ the linear map induced by z on T

_p

M

_q

. Then:

Lemma 2.2. Assume that the Legendre Correspondence Hypothesis is true.

Then

⁵

5The advised Reader may expect to have also the relation “^∂H_∂q(q, p) =−^∂L_∂q(q, z)”. But as remarked above the meaning of ^∂H_∂q and ^∂L_∂q is not clearly deﬁned, because we did not introduce a connection on the bundle Gr^ωN ×N ΛⁿT^∗N. This does not matter and we shall make the economy of this relation later ! (cf footnote 2)

(14)

(i) ∀ (q, p) ∈ M and ∀ z ∈ Z

_q

(p),

∂ H

∂p (q, p) = z

_|T_p_M_q

. (7) As a corollary of the above formula, z

_|T_p_M_q

does not depend on the choice of z ∈ Z

_q

(p).

(ii) Conversely if (q, p) ∈ M and z ∈ D

_q^ω

N satisfy condition (7), then z ∈ Z

_q

(p) or equivalently p ∈ P

_q

(z).

Proof — Let (q, p) ∈ M and (0, p) ˙ ∈ T

_(q,p)

M , where ˙ p ∈ T

_p

M

_q

. In order to compute d H

_(q,p)

(0, p), we consider a smooth path ˙ s −→ (q, p(s)) with values into M

_q

whose derivative at s = 0 coincides with (0, p). We can further lift ˙ this path into another one s −→ (q, z(s), p(s)) with values into Gr

_q^ω

N ×M

_q

, in such a way that z(s) ∈ Z

_q

(p(s)), ∀ s. Then using (5) we obtain

d

ds ( H (q, p(s)))

_|s=0

= d

ds ( z(s), p(s) − L(q, p(s)) )

_|s=0

= z, p ˙ + z, p ˙ − ∂L

∂z (q, z)( ˙ z) = z, p ˙ , from which (7) follows. This proves (i).

The proof of (ii) uses the Legendre Correspondence Hypothesis: consider z, z

₀

∈ D

ⁿ_q

N and assume that z

₀

∈ Z

_q

(p) and that z satisﬁes (7). Then by applying the conclusion (i) of the Lemma to z

₀

we deduce that ∂ H /∂p(q, p) = z

_0|T_p_M_q

and thus (z − z

₀

)

_|T_p_M_q

= 0. Hence by the Legendre Correspondence

Hypothesis we deduce that z ∈ Z

_q

(p).

A further property is that, given (q, z) ∈ D

^ω

N , it is possible to ﬁnd a p ∈ P

_q

(z) and to choose the value of H(q, p) simultaneously. This property will be useful in the following in order to simplify the Hamilton equations.

For that purpose we deﬁne, for all h ∈ R , the pseudofiber:

P

_q^h

(z) := { p ∈ P

_q

(z)/ H (q, p) = h } . We then have:

Lemma 2.3. For all (q, z) ∈ Gr

^ω

N the pseudoﬁber P

_q^h

(z) is a aﬃne sub- space

⁶

of Λ

ⁿ

T

_q^∗

N parallel to

T

_z

D

_qⁿ

N

_⊥

. Hence dim P

_q^h

(z) = dim P

_q

(z) − 1 =

^(n+k)!_n!k!

− nk − 1.

6again in the instance of variational problems on maps u:R² −→R² and the multisymplectic manifold Λ²T^∗R⁴, for any (q, z)∈Gr^ωR⁴ the pseudoﬁberPq^h(z)⊂Λ²T^∗R⁴ is an aﬃne line parallel toR

v¹1v²2−v1²v¹2

dx¹∧dx²−ijvν^jdyⁱ∧dx^ν+dy¹∧dy² , where T(v) =z. (See also Paragraph 2.2.2.)

(15)

Proof — We ﬁrst remark that, ∀ q ∈ N and ∀ z ∈ D

^ω_q

N , ω

_q

belongs to T

_z

D

_q^ω

N

_⊥

, because of the deﬁnition of D

^ω_q

N . So ∀ λ ∈ R , ∀ p ∈ P

_q

(z), we deduce from (6) that p + λω

_q

∈ P

_q

(z) and thus

H(q, p + λω

_q

) = z, p + λω

_q

− L(q, z)

= H (q, p) + λ z, ω

_q

= H (q, p) + λ.

Hence we deduce that ∀ h ∈ R , ∀ p ∈ P

_q

(z), ∃ !λ ∈ R such that H (q, p + λω

_q

) = h,

so that P

_q^h

(z) is non empty. Moreover if p

₀

∈ P

_q^h

(z) then p

₁

∈ P

_q^h

(z) if and only if p

₁

− p

₀

∈

T

_z

D

_q^ω

N

_⊥

∩ z

^⊥

, where z

^⊥

:= { p ∈ Λ

ⁿ

T

_q^∗

N / z, p = 0 } . In order to conclude observe that

T

_z

D

^ω_q

N

_⊥

∩ z

^⊥

=

T

_z

D

_qⁿ

N

_⊥

.

2.1.3 Critical points

We now look at critical points of the Lagrangian functional using the above framework. Instead of the usual approach using jet bundles and contact structure, we shall derive Hamilton equations directly, without writing the Euler–Lagrange equation.

First we extend the form ω on M by setting ω Π

^∗

ω, where Π : M −→ N is the bundle projection, and we deﬁne G

^ω

to be the set of oriented n- dimensional submanifolds Γ of M , such that ω

_|Γ

> 0 everywhere. A conse- quence of this inequality is that the restriction of the projection Π to any Γ ∈ G

^ω

is an embedding into N : we denote by Π(Γ) its image. It is clear that Π(Γ) ∈ G

^ω

. Then we can view Γ as (the graph of) a section q −→ p(q) of the pull-back of the bundle M −→ N by the inclusion Π(Γ) ⊂ N . Second, we deﬁne the subclass p G

^ω

⊂ G

^ω

as the set of Γ ∈ G

^ω

such that,

∀ (q, p) ∈ Γ, p ∈ P

_q

(T

_q

Π(Γ)) (a contact condition). [As we will see later it can be viewed as the subset of Γ ∈ G

^ω

which satisfy half of the Hamilton equations.] And given some G ∈ G

^ω

, we denote by p G ⊂ p G

^ω

the family of submanifolds Γ ∈ p G

^ω

such that Π(Γ) = G and we say that p G is the set of Legendre lifts of G. We hence have p G

^ω

= ∪

G∈G^ω

p G.

Lastly, we deﬁne the functional on G

^ω

I [Γ] :=

Γ