
WARNING: UNCORRECTED NOTES

1. Day 4: Solvable and nilpotent Lie algebras

• (a) Definitions

• (b) Lie’s theorem

• (c) Engel’s theorem

• (d) Characterization by the Killing form

1.1. Definitions. Note that if h, h′ ⊂ g are ideals, then so is [h, h′], by the Jacobi identity. We define two important sequences of ideals:

The descending central series (suite centrale), defined inductively: C⁰(g) = g; Cⁱ(g) = [g, Cⁱ⁻¹(g)].

The derived series, again defined inductively: D⁰(g) = g, Dⁱ(g) = [Dⁱ⁻¹(g), Dⁱ⁻¹(g)].

Definition 1.1. The Lie algebra g is nilpotent (resp. solvable) if Cⁱ(g) = 0 for i ≫ 0 (resp. if Dⁱ(g) = 0 for i ≫ 0).

Obviously, Cⁱ(g) ⊃ Dⁱ(g), so any nilpotent Lie algebra is solvable.

Exercise 1.2. Let b, n ⊂ gl(n) be the upper triangular, resp. strictly upper triangular, subalgebras. Show that b is solvable but not nilpotent, and n is nilpotent.
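As a numerical sanity check (not a proof) of Exercise 1.2, the following sketch computes the dimensions of the derived series and of the descending central series for the two subalgebras of gl(3). The helper names `unit`, `bracket`, `span_basis` and `series_dims` are hypothetical and chosen only for this illustration; exact rational arithmetic is used so that ranks are computed without rounding.

```python
from fractions import Fraction

def unit(n, i, j):
    """Elementary matrix E_ij (0-based indices) as a tuple of rows."""
    return tuple(tuple(Fraction(int((r, c) == (i, j))) for c in range(n))
                 for r in range(n))

def bracket(a, b):
    """Commutator [a, b] = ab - ba of two square matrices."""
    n = len(a)
    return tuple(
        tuple(sum(a[r][k] * b[k][c] - b[r][k] * a[k][c] for k in range(n))
              for c in range(n))
        for r in range(n))

def span_basis(mats):
    """Basis of the linear span of `mats`, found by Gaussian elimination."""
    n = len(mats[0]) if mats else 0
    rows = []  # pairs (pivot index, flattened vector vanishing at earlier pivots)
    for m in mats:
        v = [x for row in m for x in row]
        for p, r in rows:
            if v[p] != 0:
                f = v[p] / r[p]
                v = [x - f * y for x, y in zip(v, r)]
        if any(v):
            p = next(i for i, x in enumerate(v) if x != 0)
            rows.append((p, v))
    return [tuple(tuple(v[i * n:(i + 1) * n]) for i in range(n)) for _, v in rows]

def series_dims(gens, central, max_steps=20):
    """Dimensions of C^i(g) (central=True) or D^i(g) (central=False), i = 0, 1, ..."""
    g = span_basis(gens)
    current, dims = g, [len(g)]
    for _ in range(max_steps):
        if not current:
            break  # the series has reached 0
        left = g if central else current
        current = span_basis([bracket(x, y) for x in left for y in current])
        dims.append(len(current))
        if dims[-1] == dims[-2]:
            break  # the series has stabilized at a non-zero ideal
    return dims

# b: upper triangular, n: strictly upper triangular, both inside gl(3).
b_gens = [unit(3, i, j) for i in range(3) for j in range(3) if i <= j]
n_gens = [unit(3, i, j) for i in range(3) for j in range(3) if i < j]

print(series_dims(b_gens, central=False))  # derived series of b
print(series_dims(b_gens, central=True))   # central series of b
print(series_dims(n_gens, central=True))   # central series of n
```

In this 3 × 3 case the derived series of b reaches 0, the central series of b stabilizes at a non-zero ideal, and the central series of n reaches 0, which is consistent with the claim of the exercise.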

Lemma 1.3. Let h ⊂ g be an ideal. Then g is solvable if and only if h and g/h are solvable.

More generally, any subalgebra of a solvable Lie algebra is solvable.

Finally, if h and h′ are solvable ideals of g, then so is h + h′.

Proof. Since Dⁱ(h) ⊂ Dⁱ(g) and since Dⁱ(g) maps onto Dⁱ(g/h), one direction is clear; the statement about subalgebras also follows. Now suppose h and g/h are solvable, and say Dⁿ(g/h) = 0. Then Dⁿ(g) ⊂ h, and Dⁿ⁺ⁱ(g) ⊂ Dⁱ(h) for i ≥ 0, which vanishes for i ≫ 0.

For the final statement, we recall the isomorphism (h + h′)/h′ ≅ h/(h ∩ h′).

The right-hand side is a quotient of a solvable Lie algebra, hence solvable; but h′ is solvable, hence the first statement implies that so is h + h′. This completes the proof.

This shows that the following definition is meaningful.

Definition 1.4. The radical of a Lie algebra g is the maximal solvable ideal.

A Lie algebra g is semisimple if its radical is trivial.

Lemma 1.5. Let h⊂g be an ideal, with g nilpotent. Then h and g/h are nilpotent.

Moreover, the center of any non-zero nilpotent Lie algebra is non-trivial.

Proof. The first part follows as in the solvable case. Suppose g is nilpotent and non-zero. Then there exists i such that Cⁱ(g) ≠ 0, Cⁱ⁺¹(g) = 0. But 0 = Cⁱ⁺¹(g) = [g, Cⁱ(g)] implies that Cⁱ(g) is contained in the center of g.

1.2. Lie’s theorem. In what follows, K is an algebraically closed field of characteristic zero, and g is a (finite-dimensional) Lie algebra over K.

Theorem 1.6. The following are equivalent:

(i) g is solvable.

(ii) Every irreducible representation of g is 1-dimensional.

(ii’) Every representation of g has a (non-zero) eigenvector.

(iii) Every representation (r, V) of g has a g-stable complete flag, i.e. r(g) is contained in the algebra of upper-triangular matrices of End(V) (for some basis). [triangularizable]

Proof. Conditions (ii) and (ii’) are clearly equivalent. Moreover, (ii’) implies (iii) by induction. (Also (iii) implies (ii’).)

To prove that (iii) implies (i), we apply (iii) to the adjoint representation of g on itself. Thus g contains a g-stable complete flag; if dim g = N, say

g = g₀ ⊃ g₁ ⊃ · · · ⊃ g_N = {0}

with dim g_i/g_{i+1} = 1 for all i, and each g_i is stable for the adjoint representation of g. The condition of being ad(g)-stable is equivalent to the condition that each g_i is an ideal. Thus each g_i/g_{i+1} is a 1-dimensional Lie algebra, and is therefore abelian. It follows that g has a filtration by Lie subalgebras g_i such that the successive quotients are abelian, which is one of the equivalent characterizations of solvability: indeed, [g_i, g_i] ⊂ g_{i+1} gives Dⁱ(g) ⊂ g_i by induction on i, so that D^N(g) = 0.

The hard step is the implication (i) ⇒ (ii’). We prove this by induction on dim g. Suppose dim g = 1; then (ii’) follows because K is algebraically closed. Now suppose dim g > 1. Since g is solvable, [g, g] ≠ g, so dim g/[g, g] ≥ 1. Let (r, V) be a representation of g. Let h ⊃ [g, g] be a subspace of codimension 1 in g. Any subspace of g containing [g, g] is an ideal, in particular a subalgebra, and h is solvable by Lemma 1.3; so by induction, the representation r restricted to h has a non-zero eigenvector v ∈ V. Let λ : h → K be the eigenvalue of v, and let V_λ ⊂ V be the λ-eigenspace of h. The main step is to prove that V_λ is a g-stable subspace of V. Assume this for the moment, and let Y ∈ g, Y ∉ h. Then Y has an eigenvector, say v′, in V_λ, because K is algebraically closed. Since g = KY + h, v′ is an eigenvector for all of g.

It remains to show that, if v ∈ V_λ, then Yv ∈ V_λ, in other words, that XYv = λ(X)Yv for all X ∈ h. But

XYv = YXv + [X, Y]v = λ(X)Yv + [X, Y]v

so we need to show that [X, Y]v = 0. Moreover, h is an ideal in g, so [X, Y] ∈ h, and [X, Y]v = λ([X, Y])v. Thus we have to show that

λ([X, Y]) = 0, ∀ X ∈ h, Y ∈ g.

Let 0 ≠ w ∈ V_λ and let W_k be the span of w, Yw, . . . , Y^{k−1}w. Let n > 0 be the smallest integer such that w, Yw, . . . , Yⁿw are linearly dependent; then

dim W_n = n; W_{n+i} = W_n ∀ i ≥ 0.

Write W = W_n; obviously YW ⊂ W. We show by induction on k that, for all X ∈ h, X stabilizes W_k and has upper triangular matrix relative to the basis w, Yw, . . . , Y^{k−1}w, with diagonal entries λ(X). For k = 1 this is clear; assume it’s true for k. Use the formula

XY^k w = XY·Y^{k−1}w = YX·Y^{k−1}w − [Y, X]Y^{k−1}w ≡ YX·Y^{k−1}w (mod W_k),

where [Y, X]Y^{k−1}w ∈ W_k because [Y, X] ∈ h stabilizes W_k. By induction,

X·Y^{k−1}w ≡ λ(X)Y^{k−1}w (mod W_{k−1})

so that

XY^k w ∈ λ(X)Y^k w + Y(W_{k−1}) + W_k, i.e. XY^k w ≡ λ(X)Y^k w (mod W_k).

Thus Tr_W(X) = n·λ(X) for all X ∈ h. In particular Tr_W([X, Y]) = n·λ([X, Y])

for X ∈ h, Y ∈ g. But both X and Y stabilize W, so [X, Y] is the commutator of two endomorphisms of W, and therefore has trace 0. It follows that

nλ([X, Y]) = 0

and since char(K) = 0, this implies λ([X, Y]) = 0, as required.
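The last step of the proof uses char(K) = 0 to pass from nλ([X, Y]) = 0 to λ([X, Y]) = 0. The sketch below illustrates, with a standard example that is not taken from these notes, what can go wrong in characteristic p: on F_p[x]/(x^p), multiplication by x and differentiation d/dx satisfy [d/dx, x] = 1, while the trace of the identity is p = 0. The function names are hypothetical, and matrices are stored so that column c holds the image of x^c.

```python
def matmul_mod(a, b, p):
    """Product of two square integer matrices, reduced mod p."""
    n = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(n)) % p for c in range(n)]
            for r in range(n)]

def commutator_mod(a, b, p):
    """[a, b] = ab - ba, reduced mod p."""
    ab, ba = matmul_mod(a, b, p), matmul_mod(b, a, p)
    n = len(a)
    return [[(ab[r][c] - ba[r][c]) % p for c in range(n)] for r in range(n)]

def x_and_ddx(p):
    """Matrices of multiplication by x and of d/dx on F_p[x]/(x^p),
    in the basis 1, x, ..., x^(p-1)."""
    X = [[0] * p for _ in range(p)]
    D = [[0] * p for _ in range(p)]
    for c in range(p):
        if c + 1 < p:
            X[c + 1][c] = 1       # x * x^c = x^(c+1), and x * x^(p-1) = 0
        if c >= 1:
            D[c - 1][c] = c % p   # d/dx x^c = c * x^(c-1)
    return X, D

def identity(n):
    return [[int(r == c) for c in range(n)] for r in range(n)]

p = 5
X, D = x_and_ddx(p)
print(commutator_mod(D, X, p) == identity(p))  # [d/dx, x] acts as the identity mod p
```

The span of X, D and the identity is then a nilpotent (hence solvable) Lie algebra of p × p matrices over F_p. A common eigenvector v of X and D would satisfy [D, X]v = 0, whereas [D, X]v = v, so condition (ii’) fails in this setting.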

1.3. Engel’s theorem.

Lemma 1.7. Let g be a nilpotent Lie algebra. Then for any X ∈ g, ad_X is a nilpotent endomorphism of g: there exists n > 0 such that (ad_X)ⁿ = 0.

Proof. This is easy. Say Cⁿ(g) = 0. Then for all X_i ∈ g, i = 1, . . . , n, the product ad_{X_1} ad_{X_2} · · · ad_{X_n} is 0. Indeed, ad_{X_{n−i}}(Cⁱ(g)) ⊂ Cⁱ⁺¹(g) by definition. Taking all X_i = X gives (ad_X)ⁿ = 0.

Engel’s theorem is the converse.

Theorem 1.8 (Engel’s theorem). Let g be a (finite-dimensional) Lie algebra. Suppose, for all X ∈ g, there exists n = n(X) such that (ad_X)ⁿ = 0. Then g is a nilpotent Lie algebra.

Corollary 1.9. The subalgebra n ⊂ gl(n) is nilpotent.

Proof. It suffices to show that for every X ∈ n, ad_X is nilpotent. More generally, we can show that, if X is any nilpotent endomorphism of V (finite-dimensional) then ad_X : gl(V) → gl(V) is nilpotent.

Define the endomorphisms r_X and ℓ_X of gl(V) by r_X(Y) = YX, ℓ_X(Y) = XY.

These obviously commute and both r_X and ℓ_X are nilpotent. The sum and difference of two commuting nilpotent endomorphisms of a vector space is again nilpotent (use the binomial theorem). Thus ad_X = ℓ_X − r_X is nilpotent.
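The binomial argument shows a little more: if X is nilpotent with Xⁿ = 0, then every term of (ℓ_X − r_X)^{2n−1} contains either ℓ_X or r_X to a power at least n, so (ad_X)^{2n−1} = 0. The following sketch checks this on a single example, the 3 × 3 nilpotent Jordan block; the function name `ad_nilpotency_index` and its `max_power` cutoff are assumptions made for the illustration.

```python
def unit(n, i, j):
    """Elementary matrix E_ij (0-based indices) with integer entries."""
    return [[int((r, c) == (i, j)) for c in range(n)] for r in range(n)]

def bracket(a, b):
    """Commutator [a, b] = ab - ba."""
    n = len(a)
    return [[sum(a[r][k] * b[k][c] - b[r][k] * a[k][c] for k in range(n))
             for c in range(n)] for r in range(n)]

def is_zero(m):
    return all(x == 0 for row in m for x in row)

def ad_nilpotency_index(x, max_power=50):
    """Smallest k >= 1 with (ad_x)^k = 0 on gl(n), or None if no k <= max_power works.

    (ad_x)^k vanishes iff it kills every elementary matrix E_ij.
    """
    n = len(x)
    images = [unit(n, i, j) for i in range(n) for j in range(n)]
    for k in range(1, max_power + 1):
        images = [bracket(x, y) for y in images]
        if all(is_zero(y) for y in images):
            return k
    return None

jordan = [[0, 1, 0],
          [0, 0, 1],
          [0, 0, 0]]       # nilpotent, jordan^3 = 0
diagonal = [[1, 0, 0],
            [0, 2, 0],
            [0, 0, 3]]     # semisimple, not nilpotent

print(ad_nilpotency_index(jordan))    # 5 = 2*3 - 1
print(ad_nilpotency_index(diagonal))  # None
```

For the Jordan block the bound 2n − 1 = 5 is attained, since (ad_X)⁴(E_31) = 6·E_13 ≠ 0.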

The proof of Engel’s theorem is based on the following result that is a strengthening of Lie’s theorem for nilpotent Lie algebras. Note that we do not require K to be algebraically closed.

Theorem 1.10. Let g ⊂ gl(V) be a Lie subalgebra consisting of nilpotent endomorphisms, with V ≠ 0. Then there exists a nonzero v ∈ V such that gv = 0.

Proof. Induct on N = dim g. When N = 1 this is clear. Let h ⊂ g be a proper subalgebra. It follows from the proof of Corollary 1.9 that h acts on gl(V) by nilpotent endomorphisms via ad, and thus also on the vector space g/h. By induction, there is a nonzero Y ∈ g/h such that ad_X(Y) = 0 for all X ∈ h. In other words, any lift of Y to g (again denoted Y) satisfies [X, Y] ∈ h for all X ∈ h, but Y ∉ h. Thus the normalizer of h in g is strictly bigger than h.

In the preceding argument, we can take h to be a maximal proper subalgebra of g. Its normalizer is strictly bigger than h, and therefore must equal g. Thus any maximal proper subalgebra of g is an ideal. Suppose dim g/h > 1. Any nonzero Z ∈ g/h generates a 1-dimensional subalgebra of g/h, and its inverse image in g is a proper subalgebra strictly containing h. This contradicts the maximality of h, so h is of codimension 1. Choose Z ∈ g, Z ∉ h, so that g = KZ + h.

Let V^h = {v ∈ V | hv = 0}. By induction, V^h ⊂ V is non-zero. Since h is an ideal in g, V^h is stable under g: if v ∈ V^h then for all Y ∈ h

YZv = ZYv + [Y, Z]v = 0

because [Y, Z] ∈ h. Now Z is nilpotent on V and stabilizes V^h, hence is nilpotent on V^h and has a non-zero kernel there. Any nonzero v ∈ V^h with Zv = 0 satisfies gv = 0. This completes the proof.

Corollary 1.11. Let g ⊂ gl(V) be a Lie subalgebra consisting of nilpotent endomorphisms, with dim V = M > 0. Then there is a complete flag

V = V₀ ⊃ V₁ ⊃ · · · ⊃ V_M = {0}

with dim V_i/V_{i+1} = 1 for i ≤ M − 1 such that, for all i, g(V_i) ⊂ V_{i+1}.

Proof. By induction again. Take V_{M−1} = Kv for any nonzero v ∈ V^g (which exists by Theorem 1.10) and let V¹ = V/V_{M−1}. Then dim V¹ = dim V − 1, and by induction V¹ contains a complete flag of the required kind. Pull back to V.

Proof of Engel’s theorem. We induct on dim g; the case dim g = 1 is trivial. By hypothesis, ad(g) ⊂ gl(g) consists of nilpotent endomorphisms. It follows from Theorem 1.10 that there is a nonzero Z ∈ g such that [X, Z] = ad_X(Z) = 0 for all X ∈ g. In other words, the center z_g of g is non-zero. On the other hand, the adjoint action of g/z_g on itself is induced from that of g, so it again consists of nilpotent endomorphisms. Since dim g/z_g < dim g, we can apply induction and conclude that g/z_g is a nilpotent Lie algebra. It follows that there exists n such that Cⁿ(g) ⊂ z_g, and so Cⁿ⁺¹(g) ⊂ [g, z_g] = 0.

1.4. Characterization by the Killing form. We recall the statement of the Jordan decomposition for endomorphisms. A semisimple endomorphism is a diagonalizable endomorphism.

Theorem 1.12. Let K be any algebraically closed field (of arbitrary characteristic).

(a) Let V be a finite-dimensional vector space over K, X ∈ End(V). Then there is a unique pair of elements X_s, X_n ∈ End(V) such that X = X_s + X_n, X_s is semisimple, X_n is nilpotent, and [X_s, X_n] = 0.

(b) There are polynomials p_s, p_n ∈ K[T] (depending on X), with p_s(0) = p_n(0) = 0, such that X_s = p_s(X), X_n = p_n(X). In particular, X_s and X_n commute with any endomorphism Y such that [X, Y] = 0.

(c) If A ⊂ B ⊂ V are subspaces and X(B) ⊂ A, then X_s(B) ⊂ A, X_n(B) ⊂ A.

Corollary 1.13. Let X ∈ gl(V). Suppose X is semisimple (resp. nilpotent).

Then so is ad_X ∈ End(gl(V)). In particular, for any X, ad_X = ad_{X_s} + ad_{X_n} is the Jordan decomposition for ad_X.

Proof. We have seen this for nilpotent X. If X is semisimple, say X = diag(a₁, . . . , a_n) for some basis, then an easy (and important) computation shows that

ad_X(E_ij) = (a_i − a_j)E_ij

where E_ij is the elementary matrix with 1 at the (ij) place and zero elsewhere (in the chosen basis).

Thus for any X, ad_X = ad_{X_s} + ad_{X_n} is a decomposition as a sum of a semisimple and a nilpotent endomorphism. The two commute, since [ad_{X_s}, ad_{X_n}] = ad_{[X_s, X_n]} = 0; hence, by uniqueness, they are the Jordan components.
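The computation ad_X(E_ij) = (a_i − a_j)E_ij used in the proof can be checked directly. The sketch below does this for one arbitrarily chosen diagonal matrix; the sample values 2, 5, 11 and the helper names are illustrative assumptions, not part of the notes.

```python
def unit(n, i, j):
    """Elementary matrix E_ij (0-based indices)."""
    return [[int((r, c) == (i, j)) for c in range(n)] for r in range(n)]

def bracket(a, b):
    """Commutator [a, b] = ab - ba."""
    n = len(a)
    return [[sum(a[r][k] * b[k][c] - b[r][k] * a[k][c] for k in range(n))
             for c in range(n)] for r in range(n)]

def scale(t, m):
    return [[t * x for x in row] for row in m]

def diag(values):
    n = len(values)
    return [[values[r] if r == c else 0 for c in range(n)] for r in range(n)]

def ad_is_diagonal_on_units(values):
    """Check that [diag(a), E_ij] = (a_i - a_j) E_ij for all i, j."""
    n, x = len(values), diag(values)
    return all(
        bracket(x, unit(n, i, j)) == scale(values[i] - values[j], unit(n, i, j))
        for i in range(n) for j in range(n))

a = (2, 5, 11)
print(ad_is_diagonal_on_units(a))
# The eigenvalues of ad_X are the differences a_i - a_j:
print(sorted(a[i] - a[j] for i in range(3) for j in range(3)))
```

In particular the E_ij form a basis of eigenvectors for ad_X, which is the semisimplicity claimed in the corollary.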

Henceforward, K is assumed of characteristic zero.

Theorem 1.14 (Cartan’s criterion for solvability). Let g ⊂ gl(V) be a subalgebra with V finite-dimensional. Suppose that Tr(XY) = 0 for all X ∈ [g, g] and all Y ∈ g. Then g is solvable.

In particular, suppose the Killing form B_ad(X, Y) = 0 for all X ∈ [g, g], Y ∈ g. Then g is solvable.
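To make the hypothesis on the Killing form concrete, here is a small sketch that computes B_ad(X, Y) = Tr(ad_X ad_Y) from structure constants. Two standard examples, neither of which appears in these notes, are used: the 2-dimensional non-abelian algebra with [X, Y] = Y, and sl(2) with basis e, h, f. The encoding of brackets as a dictionary and the function names are assumptions of this illustration.

```python
def ad_matrices(dim, brackets):
    """ad[i][k][j] = coefficient of e_k in [e_i, e_j].

    `brackets` maps a pair (i, j) to {k: c}, meaning [e_i, e_j] = sum of c * e_k;
    the opposite bracket [e_j, e_i] is filled in by antisymmetry.
    """
    ad = [[[0] * dim for _ in range(dim)] for _ in range(dim)]
    for (i, j), coeffs in brackets.items():
        for k, c in coeffs.items():
            ad[i][k][j] += c
            ad[j][k][i] -= c
    return ad

def killing_form(dim, brackets):
    """Gram matrix B[i][j] = Tr(ad_{e_i} ad_{e_j})."""
    ad = ad_matrices(dim, brackets)
    return [[sum(ad[i][r][k] * ad[j][k][r] for r in range(dim) for k in range(dim))
             for j in range(dim)] for i in range(dim)]

# Basis (X, Y) with [X, Y] = Y; here [g, g] is spanned by Y.
affine = killing_form(2, {(0, 1): {1: 1}})

# sl(2) with basis (e, h, f): [h, e] = 2e, [h, f] = -2f, [e, f] = h.
sl2 = killing_form(3, {(1, 0): {0: 2}, (1, 2): {2: -2}, (0, 2): {1: 1}})

print(affine)  # row of Y vanishes: B(Y, .) = 0
print(sl2)     # B(e, f) and B(h, h) are non-zero
```

In the first example B vanishes on [g, g] × g, so the second statement of the theorem applies and g is solvable. For sl(2) one has [g, g] = g and B(e, f) ≠ 0, so the hypothesis fails, as it must for a non-solvable algebra.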

The proof of the first statement requires a lemma. First I derive the second statement. Indeed, let g′ = ad(g) ⊂ gl(g). It follows from the hypothesis and the first part that g′ is solvable. But g′ = g/Z where Z is the center of g; since Z is abelian, hence solvable, Lemma 1.3 shows that g is also solvable.

To prove the theorem, we use the following lemma.

Lemma 1.15. Let A ⊂ B ⊂ gl(V) be subspaces, with dim V finite. Let

M = {X ∈ gl(V) | [X, B] ⊂ A}.

Suppose X ∈ M satisfies Tr(XY) = 0 for all Y ∈ M. Then X is nilpotent.

Proof. Let X = X_s + X_n be the Jordan decomposition, with X_s = diag(a₁, . . . , a_n) for some basis. Let E denote the Q-vector subspace of K spanned by the eigenvalues a_i; it is a finite-dimensional Q-vector space. We will show that Hom(E, Q) = 0; this implies that E = 0.

So let f ∈ Hom(E, Q), and consider Y = diag(f(a₁), . . . , f(a_n)) (in the same basis). Then ad_Y(E_ij) = (f(a_i) − f(a_j))E_ij. We claim there exists a polynomial g ∈ K[T] such that, for all i, j,

g(a_i − a_j) = f(a_i) − f(a_j).

(Taking i = j shows that such a g satisfies g(0) = 0.) One can certainly find such a polynomial (by Lagrange interpolation) provided

a_i − a_j = a_k − a_ℓ ⇒ f(a_i) − f(a_j) = f(a_k) − f(a_ℓ).

But this is clear because f is linear. Then

ad_Y = g(ad_{X_s}) = (g ∘ p_s)(ad_X)

where p_s is the polynomial of Theorem 1.12(b) for ad_X, so that ad_{X_s} = p_s(ad_X) and p_s(0) = 0, because ad_{X_s} is the semisimple part of ad_X. Thus ad_Y is a polynomial in ad_X without constant term. Since ad_X(B) ⊂ A ⊂ B, it follows that ad_Y(B) ⊂ A, so that Y ∈ M. Thus

0 = Tr(XY) = Σ f(a_i)a_i.

Recall that f(a_i) ∈ Q for all i, so that Σ a_i f(a_i) is a Q-linear combination of elements of E, and hence is in E. Apply f to both sides: we get

0 = f(0) = Σ f(a_i)f(a_i),

which is a sum of squares of rational numbers; hence each f(a_i) = 0. Since the a_i span E, f ≡ 0. Thus Hom(E, Q) = 0 and E = 0, i.e. all a_i = 0; hence X_s = 0 and X = X_n is nilpotent.

Proof of Cartan’s criterion. In order to prove that g is solvable, it suffices to prove that [g, g] is nilpotent; and by Engel’s theorem it suffices to show that all X ∈ [g, g] are nilpotent endomorphisms. We apply the lemma with A = [g, g], B = g, so that

M = {X ∈ gl(V) | [X, g] ⊂ [g, g]}.

Clearly B ⊂ M, and in particular [g, g] ⊂ M. Our hypothesis is that Tr(XY) = 0 for all X ∈ [g, g], Y ∈ g. The lemma implies that X is nilpotent provided Tr(XZ) = 0 for all X ∈ [g, g], Z ∈ M. However, if U, V ∈ g, Z ∈ M, a simple calculation using the invariance of the trace shows that

Tr([U, V]Z) = Tr(U[V, Z]) = Tr([V, Z]U).

Since Z ∈ M, [V, Z] ∈ [g, g] and by the hypothesis, Tr([V, Z]U) = 0 for any U ∈ g. Since [g, g] is spanned by the commutators [U, V], it follows that Tr([g, g]M) = 0, and the lemma applies to show that any X ∈ [g, g] is nilpotent.

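Corollary 1.16. More generally, let φ : g → End(V) be a representation, let B_φ(X, Y) = Tr(φ(X) ∘ φ(Y)), and let h be the radical of B_φ. Then φ(h) is solvable.

Proof. The radical h of B_φ is an ideal of g: if X, Z ∈ g, Y ∈ h, then

−Tr(φ(ad_X(Y)) ∘ φ(Z)) = Tr(φ([Y, X]) ∘ φ(Z)) = Tr([φ(Y), φ(X)] ∘ φ(Z)) = Tr(φ(Y) ∘ [φ(X), φ(Z)]) = 0,

where the last equality holds because [φ(X), φ(Z)] = φ([X, Z]) and Y lies in the radical. It follows that B_φ|_h = B_{φ|_h}, and this form vanishes identically on h because h is the radical of B_φ. It now follows from Cartan’s criterion, applied to φ(h) ⊂ gl(V), that φ(h) is solvable.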