MA1112: Linear Algebra II

(1)

MA1112: Linear Algebra II

Dr. Vladimir Dotsenko (Vlad) Lecture 16

In the first half of this module, matrices were used to represent linear maps. We shall temporarily take the out- look which views symmetric matrices in the spirit of one of the proofs from last week, and looks at the associated bilinear forms (and quadratic forms). Let us start with a motivating example.

Motivation

Example 1. Consider a functionf(x1, . . . ,xn) ofnscalar real arguments. Letx⁰=x⁰₁e1+ · · · +x_n⁰enbe a point inRⁿ, and assume that the functionf is smooth enough to consider its Taylor series to order two near the pointx⁰:

f(x)=f(x⁰)+

k

X

i=1

(xi−x_i⁰)∂f

∂x_i(x⁰)+1 2

n

X

i,j=1

(xi−x⁰_i)(xj−x⁰_j) ∂²f

∂x_ix_j(x⁰)+o(|x−x⁰|²).

Suppose that we would like to know whetherf attains its locally minimal/maximal value atx⁰. Then, since in the first order of magnitude we have

f(x)=f(x⁰)+

k

X

i=1

(x_i−x_i⁰)∂f

∂x_i(x⁰)+o(|x−x⁰|), we conclude that a necessary condition is_∂^∂_x^f

i(x⁰)=0 for alli, that is thegradientoff vanishes atx⁰. In this case, we have

f(x)=f(x⁰)+1 2

n

X

i,j=1

(x_i−x_i⁰)(x_j−x⁰_j) ∂²f

∂x_ix_j(x⁰)+o(|x−x⁰|²),

so the difference betweenf(x) andf(x⁰), whenxis close tox⁰, is approximately equal to¹₂q(x−x⁰), where q(y)=a11y²₁+2a12y1y2+ · · · +2a1ny1yn+y₂²+2a23y2y3+ · · · +anny_n²,

where for brevity we denotea_{i j}= _∂^∂_x²_i^f_x_j(x⁰); we havea_{i j}=a_{j i} whenever the function f is smooth enough. The functionq(y) is a very typical example of aquadratic form.

Bilinear and quadratic forms

Definition 1. LetV be a vector space. A functionq:V→Ris said to be aquadratic formif for some basise₁, . . . ,e_n ofV we have

q(x₁e₁+ · · · +x_ne_n)= X

1Éi,jÉn

a_{i j}x_ix_j,

wherea_{i j}are some scalars. In other words, the value ofqon a vector is a quadratic polynomial in coordinates of a vector.

Remark 1. It is easy to see that if the condition from the definition holds for some basis, then it holds for any basis, since coordinates relative to different bases are related by transition matrices in a linear way.

1

(2)

One simple example of a quadratic form is

q(x)=x₁²+ · · · +x_n².

In general, ifV is a Euclidean vector space thenq(x)=(x,x) is certainly a quadratic form. This can be generalised:

everybilinear formgives rise to a quadratic form.

Definition 2. LetV be a vector space. A functionV×V →R,v₁,v₂7→b(v₁,v₂) is called a bilinear form if for all vectorsv,v₁,v₂the following conditions are satisfied:

b(c₁v₁+c₂v₂,v)=c₁b(v₁,v)+c₂b(v₂,v) and b(v,c₁v₁+c₂v₂)=c₁b(v,v₁)+c₂b(v,v₂).

A bilinear form is said to besymmetricifb(v₁,v₂)=b(v₂,v₁) for allv₁,v₂. A symmetric bilinear form is said to be positive definite, ifb(v,v)Ê0 for allv, andb(v,v)=0 only forv=0.

In these words, a function of two vector arguments is a scalar product if and only if it is bilinear, symmetric, and positive definite.

Another important class of bilinear forms is given by skew-symmetric ones: a bilinear form is said to beskew- symmetricifb(v1,v2)= −b(v2,v1) for allv1,v2. Those appear very frequently in differential geometry and advanced classical mechanics.

Remark 2. Generalising what we proved about scalar products, for every bilinear formband every basise₁, . . . ,e_n ofV, we have

b(x₁e₁+. . .+x_ne_n,y₁e₁+. . .+y_ne_n)=

n

X

i,j=1

b_{i j}x_iy_j, wherebi j=b(ei,ej). Moreover, writing this sum as

n

X

i=1

xi n

X

j=1

bi jyj,

we see that

b(x1e1+. . .+xnen,y1e1+. . .+ynen)=x^TB y,

where then×n-matrixBhas entriesbi j. (Strictly speaking,x^TB yis a 1×1-matrix, but that is essentially the same as a single number.)

Every bilinear formbgives rise to a quadratic form by puttingq(x)=b(x,x), for example, the bilinear form b(x₁e₁+x₂e₂,y₁e₁+y₂e₂)=2x₁y₂

gives rise to a quadratic form 2x₁x₂, and the bilinear form

b(x₁e₁+x₂e₂,y₁e₁+y₂e₂)=x₁y₂+x₂y₁

gives rise to the same quadratic form. It turns out that the reconstruction ofbfromqis unique if we assume that bis symmetric; in this case the reconstruction formula is

b(v,w) :=1

2(q(v+w)−q(v)−q(w)).

One celebrated example of a quadratic form isq(x₁,x₂,x₃,t)=x₁²+x²₂+x₃²−t²on the Minkowski spaceR⁴, it is used in special relativity theory. This (sort of ) motivates the next few results. We shall now formulate them (in order to use them in the next homework), and next week we shall discuss their proofs in detail.

2

(3)

Four theorems about signed sums of squares

Theorem 1(Canonical form). Let q be a quadratic form on a vector space V . There exists a basis e₁, . . . ,e_nof V for which the quadratic form q becomes a signed sum of squares:

q(x₁e₁+ · · · +x_ne_n)=

n

X

i=1

εix²_i,

where all numbersεiare either1or−1or0.

Theorem 2(Law of inertia). In the previous theorem, the triple(n₊,n₋,n₀), where n_±is the number ofεi equal to

±1, and n₀is the number ofεi equal to0, does not depend on the choice of the basis e₁, . . . ,e_n. This triple is often referred to as thesignatureof the quadratic form q.

LetB=(b_{i j}) be the matrix of a given symmetric bilinear formbonV. We shall now discuss some methods of computing the signature ofbvia the matrix elements ofB. We denote byB_kthek×k-matrix whose entries areb_{i j} with 1Éi,jÉk, that is the top left corner submatrix ofB, and put∆0=1 and∆k:=det(B_k) for 1ÉkÉn.

Theorem 3(Jacobi theorem). Suppose that for all i=1, . . . ,n we have∆i 6=0. Then the number of1’s,0’s and−1’s in the canonical form is equal to the number of positive numbers, zeros, and negative numbers among the numbers

∆k−1

∆k , k=1, . . . ,n.

Theorem 4(Sylvester theorem). The given symmetric bilinear form is positive definite if and only if

∆k>0 for all k=1, . . . ,n.

3