HAL Id: hal-00811793
https://hal-upec-upem.archives-ouvertes.fr/hal-00811793
Preprint submitted on 11 Apr 2013
HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
Restricted invertibility and the Banach-Mazur distance to the cube
Pierre Youssef
To cite this version:
Pierre Youssef. Restricted invertibility and the Banach-Mazur distance to the cube. 2012. hal-00811793
RESTRICTED INVERTIBILITY AND THE BANACH-MAZUR DISTANCE TO THE CUBE
PIERRE YOUSSEF
ABSTRACT. We prove a normalized version of the restricted invertibility principle obtained by Spielman-Srivastava in [15]. Applying this result, we get a new proof of the proportional Dvoretzky-Rogers factorization theorem, recovering the best current estimate in the symmetric setting while improving the best known result in the nonsymmetric case. As a consequence, we slightly improve the estimate for the Banach-Mazur distance to the cube: the distance of every $n$-dimensional normed space from $\ell_\infty^n$ is at most $(2n)^{5/6}$. Finally, using tools from the work of Batson-Spielman-Srivastava in [2], we give a new proof of a theorem of Kashin-Tzafriri [11] on the norm of restricted matrices.
1. INTRODUCTION
Given an $n \times m$ matrix $U$, viewed as an operator from $\ell_2^m$ to $\ell_2^n$, the restricted invertibility problem asks whether we can extract a large number of linearly independent columns of $U$ and provide an estimate for the norm of the restricted inverse. If we write $U_\sigma$ for the restriction of $U$ to the columns $U e_i$, $i \in \sigma \subset \{1, \dots, m\}$, we want to find a subset $\sigma$, of cardinality $k$ as large as possible, such that $\|U_\sigma x\|_2 \ge c \|x\|_2$ for all $x \in \mathbb{R}^\sigma$, and to estimate the constant $c$ (which will depend on the operator $U$). This question was studied by Bourgain-Tzafriri [4], who obtained a result for square matrices:
Given an $n \times n$ matrix $T$ (viewed as an operator on $\ell_2^n$) whose columns are of norm one, there exists $\sigma \subset \{1, \dots, n\}$ with $|\sigma| \ge \frac{d\, n}{\|T\|^2}$ such that $\|T_\sigma x\|_2 \ge c \|x\|_2$ for all $x \in \mathbb{R}^\sigma$, where $d, c > 0$ are absolute constants.
Here and in the rest of the paper, $\|\cdot\|_2$ denotes the Euclidean norm. For any matrix $A$, $\|A\|$ denotes its operator norm as an operator on $\ell_2$, and $\|A\|_{HS}$ denotes the Hilbert-Schmidt norm, i.e.
$$\|A\|_{HS} = \sqrt{\mathrm{Tr}(A \cdot A^*)} = \Big( \sum_i \|C_i\|_2^2 \Big)^{1/2},$$
where the $C_i$ are the columns of $A$.
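As a quick numerical sanity check (our illustration, not part of the paper), both expressions for the Hilbert-Schmidt norm agree with the Frobenius norm computed by standard libraries, and the operator norm is dominated by it:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 6))  # an arbitrary 4 x 6 real matrix

# ||A||_HS via the trace formula sqrt(Tr(A A^*))
hs_trace = np.sqrt(np.trace(A @ A.T))

# ||A||_HS via the squared Euclidean norms of the columns C_i
hs_columns = np.sqrt(sum(np.linalg.norm(A[:, i])**2 for i in range(A.shape[1])))

assert np.isclose(hs_trace, np.linalg.norm(A, 'fro'))
assert np.isclose(hs_columns, np.linalg.norm(A, 'fro'))
# The operator norm never exceeds the Hilbert-Schmidt norm
assert np.linalg.norm(A, 2) <= hs_trace + 1e-12
```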
Since the identity operator can be decomposed in the form $\mathrm{Id} = \sum_{j\le n} e_j e_j^t$, where $(e_j)_{j\le n}$ is the canonical basis of $\mathbb{R}^n$, the previous result states that one can find a large part of this basis (of cardinality greater than $\frac{d\, n}{\|T\|^2}$) on the span of which the operator $T$ is invertible and the norm of its inverse is controlled by an absolute constant.
In [21], Vershynin generalized this result to any decomposition of the identity and improved the estimate for the size of the subset. Using a technical iteration scheme based on the previous result of Bourgain-Tzafriri, combined with a theorem of Kashin-Tzafriri which we will discuss in the last section, he obtained the following:

Let $\mathrm{Id} = \sum_{j\le m} x_j x_j^t$ and let $T$ be a linear operator on $\ell_2^n$. For any $\varepsilon \in (0,1)$ one can find $\sigma \subset \{1, \dots, m\}$ with
$$|\sigma| \ge (1-\varepsilon)\, \frac{\|T\|_{HS}^2}{\|T\|^2}$$
such that
$$\Big\| \sum_{j\in\sigma} a_j\, \frac{T x_j}{\|T x_j\|_2} \Big\|_2 \ge c(\varepsilon) \Big( \sum_{j\in\sigma} a_j^2 \Big)^{1/2}$$
for all scalars $(a_j)$.
One can easily check that, in the case of the canonical decomposition, this generalizes the Bourgain-Tzafriri theorem, which was previously proved only for a fixed value of $\varepsilon$.
The constant $c(\varepsilon)$ plays a crucial role in applications, and finding the right dependence is an important problem. Let us mention that Vershynin also obtained a nontrivial upper bound, which we do not state here; we refer to [21] for the full statement.
Back to the original restricted invertibility problem, a recent work of Spielman-Srivastava [15] provides the best known estimate for the norm of the inverse matrix. Their proof uses a new deterministic method based on linear algebra, while the previous works on the subject employed probabilistic, combinatorial and functional-analytic arguments.
More precisely, Spielman-Srivastava proved the following:
Theorem A (Spielman-Srivastava). Let $x_1, \dots, x_m \in \mathbb{R}^n$ be such that $\mathrm{Id} = \sum_i x_i x_i^t$, and let $0 < \varepsilon < 1$. For every linear operator $T : \ell_2^n \to \ell_2^n$ there exists a subset $\sigma \subset \{1, \dots, m\}$ of size
$$|\sigma| \ge (1-\varepsilon)^2\, \frac{\|T\|_{HS}^2}{\|T\|^2}$$
for which $\{T x_i\}_{i\in\sigma}$ is linearly independent and
$$\lambda_{\min}\Big( \sum_{i\in\sigma} (T x_i)(T x_i)^t \Big) \ge \frac{\varepsilon^2 \|T\|_{HS}^2}{m},$$
where $\lambda_{\min}$ is computed on $\mathrm{span}\{T x_i\}_{i\in\sigma}$; in other words, $\lambda_{\min}$ denotes the smallest nonzero eigenvalue of the corresponding operator.
One can view the previous result as an invertibility theorem for rectangular matrices. Given, as above, a decomposition of the identity and a linear operator $T$, we can associate to these an $n \times m$ matrix $U$ whose columns are the vectors $(T x_j)_{j\le m}$. Since $\mathrm{Id} = \sum_j x_j x_j^t$, one can easily check that
$$U \cdot U^t = T \cdot T^t = \sum_j (T x_j) \cdot (T x_j)^t.$$
Hence
$$\|U\|_{HS} = \|T\|_{HS} \qquad\text{and}\qquad \|U\| = \|T\|,$$
and thus the previous result can be written in terms of the rectangular matrix $U$.
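These identities are easy to verify numerically. The following sketch (our illustration, using an arbitrary decomposition of the identity built from the rows of a matrix with orthonormal columns) checks them on a random instance:

```python
import numpy as np

rng = np.random.default_rng(1)
n, m = 3, 7

# Build a decomposition Id = sum_j x_j x_j^t: take the rows of an m x n
# matrix V with orthonormal columns (then V^t V = Id_n).
V, _ = np.linalg.qr(rng.standard_normal((m, n)))
xs = [V[j] for j in range(m)]                    # x_j in R^n
assert np.allclose(sum(np.outer(x, x) for x in xs), np.eye(n))

T = rng.standard_normal((n, n))                  # an arbitrary operator on l_2^n
U = np.column_stack([T @ x for x in xs])         # n x m matrix with columns T x_j

# U U^t = T T^t = sum_j (T x_j)(T x_j)^t, hence equal HS and operator norms
assert np.allclose(U @ U.T, T @ T.T)
assert np.isclose(np.linalg.norm(U, 'fro'), np.linalg.norm(T, 'fro'))
assert np.isclose(np.linalg.norm(U, 2), np.linalg.norm(T, 2))
```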
In applications, one might need to extract multiples of the columns of the matrix. Adapting the proof of Spielman-Srivastava, we will generalize the restricted invertibility theorem to any rectangular matrix and, under some conditions, to any choice of multiples.
If $D$ is an $m \times m$ diagonal matrix with diagonal entries $(\alpha_j)_{j\le m}$, we set $I_D := \{j \le m \mid \alpha_j \neq 0\}$ and write $D_\sigma^{-1}$ for the restricted inverse of $D$, i.e. the diagonal matrix whose diagonal entries are the inverses of the respective entries of $D$ for indices in $\sigma$, and zero elsewhere. The main result of this paper is the following:
Theorem 1.1. Let $U$ be an $n \times m$ matrix and let $D$ be an $m \times m$ diagonal matrix with $(\alpha_j)_{j\le m}$ on its diagonal, with the property that $\mathrm{Ker}(D) \subset \mathrm{Ker}(U)$. Then for any $\varepsilon \in (0,1)$ there exists $\sigma \subset I_D$ with
$$|\sigma| \ge \bigg[ (1-\varepsilon)^2\, \frac{\|U\|_{HS}^2}{\|U\|^2} \bigg]$$
such that
$$s_{\min}\big( U_\sigma D_\sigma^{-1} \big) \ge \varepsilon\, \frac{\|U\|_{HS}}{\|D\|_{HS}},$$
where $s_{\min}$ denotes the smallest singular value.
Note that if we apply this result to the matrix $U$ which we associated with a linear operator $T$ and a decomposition of the identity, and take $D$ to be the identity, we recover the restricted invertibility theorem of Spielman-Srivastava. Taking $D$ to be the diagonal matrix with diagonal entries $\|T x_j\|_2$, it is easy to see that we recover the "normalized" restricted invertibility principle stated by Vershynin, with $c(\varepsilon) = \varepsilon$.
In Section 2, we give the proof of the main result. In Section 3, we use Theorem 1.1 to give an alternative proof of the proportional Dvoretzky-Rogers factorization: in the symmetric case, we recover the best known dependence and improve the constants involved, which allows us to improve the estimate of the Banach-Mazur distance to the cube, while in the nonsymmetric case we improve the best known dependence for the proportional Dvoretzky-Rogers factorization.
Finally, in Section 4 we give a new proof of a theorem due to Kashin-Tzafriri [11] which deals with the norm of coordinate projections of a matrix; our proof slightly improves the result of Kashin-Tzafriri and has the advantage of producing a deterministic algorithm.
2. PROOF OF THEOREM 1.1
Since the rank and the nonzero eigenvalues of $(U_\sigma D_\sigma^{-1})^t \cdot (U_\sigma D_\sigma^{-1})$ and $(U_\sigma D_\sigma^{-1}) \cdot (U_\sigma D_\sigma^{-1})^t$ are the same, it suffices to prove that $(U_\sigma D_\sigma^{-1}) \cdot (U_\sigma D_\sigma^{-1})^t$ has rank equal to $k = |\sigma|$ and that its smallest positive eigenvalue is greater than $\varepsilon^2\, \frac{\|U\|_{HS}^2}{\|D\|_{HS}^2}$. Note that
$$(U_\sigma D_\sigma^{-1}) \cdot (U_\sigma D_\sigma^{-1})^t = \sum_{j\in\sigma} \big( U D_\sigma^{-1} e_j \big) \cdot \big( U D_\sigma^{-1} e_j \big)^t = \sum_{j\in\sigma} \frac{U e_j}{\alpha_j} \cdot \Big( \frac{U e_j}{\alpha_j} \Big)^t.$$
We are going to construct the matrix
$$A_k = \sum_{j\in\sigma} \big( U D_\sigma^{-1} e_j \big) \cdot \big( U D_\sigma^{-1} e_j \big)^t$$
by iteration. We begin by setting $A_0 = 0$, and at each step we add a rank one matrix $\frac{U e_j}{\alpha_j} \cdot \big( \frac{U e_j}{\alpha_j} \big)^t$ for a suitable $j$, which gives a new positive eigenvalue. This guarantees that the vector $U D_\sigma^{-1} e_j$ chosen at each step is linearly independent from the previous ones.
If $A$ and $B$ are symmetric matrices, we write $A \preceq B$ if $B - A$ is a positive semidefinite matrix. Recall the Sherman-Morrison formula, which will be needed in the proof: for any invertible matrix $A$ and any vector $v$ we have
$$(A + v \cdot v^t)^{-1} = A^{-1} - \frac{A^{-1} v \cdot v^t A^{-1}}{1 + v^t A^{-1} v}.$$
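A quick numerical check of the formula (our illustration, on a generically invertible matrix):

```python
import numpy as np

rng = np.random.default_rng(2)
n = 5
A = rng.standard_normal((n, n)) + n * np.eye(n)   # generically invertible
v = rng.standard_normal(n)

Ainv = np.linalg.inv(A)
denom = 1 + v @ Ainv @ v                          # assumed nonzero here
sm = Ainv - np.outer(Ainv @ v, v @ Ainv) / denom  # Sherman-Morrison right-hand side

# Agrees with directly inverting the rank-one update A + v v^t
assert np.allclose(sm, np.linalg.inv(A + np.outer(v, v)))
```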
We will also apply the following lemma, which appears as Lemma 6.3 in [16]:

Lemma 2.1. Suppose that $A \succeq 0$ has $q$ nonzero eigenvalues, all greater than $b_0 > 0$. If $v \neq 0$ and
$$(1)\qquad v^t (A - b_0 I)^{-1} v < -1,$$
then $A + v v^t$ has $q+1$ nonzero eigenvalues, all greater than $b_0$.
Proof. The proof of the lemma is simple and makes use of the Sherman-Morrison formula. Denote by $\lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_q > b_0$ the eigenvalues of $A$, and by $\lambda_1' \ge \dots \ge \lambda_q' \ge \lambda_{q+1}' \ge 0$ the eigenvalues of $A + v v^t$. By the Cauchy interlacing theorem we have
$$\lambda_1' \ge \lambda_1 \ge \lambda_2' \ge \dots \ge \lambda_q' \ge \lambda_q \ge \lambda_{q+1}'.$$
Since $\lambda_q > b_0$, we have $\lambda_q' > b_0$, and it remains to prove that $\lambda_{q+1}' > b_0$.
$$\mathrm{Tr}(A + v v^t - b_0 I)^{-1} = \sum_{j\le q+1} \frac{1}{\lambda_j' - b_0} - \sum_{j > q+1} \frac{1}{b_0},$$
$$\mathrm{Tr}(A - b_0 I)^{-1} = \sum_{j\le q} \frac{1}{\lambda_j - b_0} - \sum_{j > q} \frac{1}{b_0},$$
$$\mathrm{Tr}(A + v v^t - b_0 I)^{-1} - \mathrm{Tr}(A - b_0 I)^{-1} = \sum_{j\le q} \Big[ \frac{1}{\lambda_j' - b_0} - \frac{1}{\lambda_j - b_0} \Big] + \frac{1}{\lambda_{q+1}' - b_0} + \frac{1}{b_0} \le \frac{1}{\lambda_{q+1}' - b_0} + \frac{1}{b_0},$$
since $\lambda_j' \ge \lambda_j$ for every $j \le q$. By the Sherman-Morrison formula we have:
$$(A + v v^t - b_0 I)^{-1} = (A - b_0 I)^{-1} - \frac{(A - b_0 I)^{-1} v v^t (A - b_0 I)^{-1}}{1 + v^t (A - b_0 I)^{-1} v}.$$
Taking the trace, we get:
$$\mathrm{Tr}(A + v v^t - b_0 I)^{-1} - \mathrm{Tr}(A - b_0 I)^{-1} = -\frac{v^t (A - b_0 I)^{-2} v}{1 + v^t (A - b_0 I)^{-1} v}.$$
Since $(A - b_0 I)^{-2}$ is positive definite, the hypothesis implies that the right-hand side of the previous equality is positive. Therefore we get
$$\frac{1}{\lambda_{q+1}' - b_0} + \frac{1}{b_0} > 0,$$
which means that $\lambda_{q+1}' > b_0$. $\square$
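The mechanism of the lemma can be seen numerically. In the sketch below (our example, with hypothetical values), we take $v$ in the kernel of $A$ with $\|v\|_2^2 > b_0$, so that $v^t (A - b_0 I)^{-1} v = -\|v\|_2^2 / b_0 < -1$ and condition (1) holds:

```python
import numpy as np

n, q, b0 = 6, 3, 1.0
A = np.diag([5.0, 4.0, 3.0, 0.0, 0.0, 0.0])      # q = 3 nonzero eigenvalues > b0

# v in Ker(A) with ||v||^2 = 3 > b0, so condition (1) is satisfied
v = np.array([0.0, 0.0, 0.0, 1.0, 1.0, 1.0])
assert v @ np.linalg.inv(A - b0 * np.eye(n)) @ v < -1

eigs = np.linalg.eigvalsh(A + np.outer(v, v))
nonzero = eigs[np.abs(eigs) > 1e-10]
assert len(nonzero) == q + 1                      # one new nonzero eigenvalue ...
assert np.all(nonzero > b0)                       # ... and all of them exceed b0
```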
For any symmetric matrix $A$ and any $b > 0$, we define
$$\phi(A, b) = \mathrm{Tr}\big( U^t (A - b I)^{-1} U \big)$$
as the potential corresponding to the barrier $b$.
At each step $l$, the matrix already constructed is denoted by $A_l$ and the barrier by $b_l$. Suppose that $A_l$ has $l$ nonzero eigenvalues, all greater than $b_l$. As mentioned before, we will try to construct $A_{l+1}$ by adding a rank one matrix $v \cdot v^t$ to $A_l$, so that $A_{l+1}$ has $l+1$ nonzero eigenvalues all greater than $b_{l+1} = b_l - \delta$ and $\phi(A_{l+1}, b_{l+1}) \le \phi(A_l, b_l)$. Note that
$$\phi(A_{l+1}, b_{l+1}) = \mathrm{Tr}\big( U^t (A_l + v v^t - b_{l+1} I)^{-1} U \big) = \mathrm{Tr}\big( U^t (A_l - b_{l+1} I)^{-1} U \big) - \mathrm{Tr}\bigg( \frac{U^t (A_l - b_{l+1} I)^{-1} v v^t (A_l - b_{l+1} I)^{-1} U}{1 + v^t (A_l - b_{l+1} I)^{-1} v} \bigg)$$
$$= \phi(A_l, b_{l+1}) - \frac{v^t (A_l - b_{l+1} I)^{-1} U U^t (A_l - b_{l+1} I)^{-1} v}{1 + v^t (A_l - b_{l+1} I)^{-1} v}.$$
So, in order to have $\phi(A_{l+1}, b_{l+1}) \le \phi(A_l, b_l)$, we must choose a vector $v$ satisfying
$$(2)\qquad -\frac{v^t (A_l - b_{l+1} I)^{-1} U U^t (A_l - b_{l+1} I)^{-1} v}{1 + v^t (A_l - b_{l+1} I)^{-1} v} \le \phi(A_l, b_l) - \phi(A_l, b_{l+1}).$$
Since $v^t (A_l - b_{l+1} I)^{-1} U U^t (A_l - b_{l+1} I)^{-1} v$ and $\phi(A_l, b_l) - \phi(A_l, b_{l+1})$ are positive, choosing $v$ satisfying conditions (1) and (2) is equivalent to choosing $v$ which satisfies the following:
$$v^t (A_l - b_{l+1} I)^{-1} U U^t (A_l - b_{l+1} I)^{-1} v \le \big( \phi(A_l, b_l) - \phi(A_l, b_{l+1}) \big) \big( -1 - v^t (A_l - b_{l+1} I)^{-1} v \big).$$
Since $U U^t \preceq \|U\|^2\, \mathrm{Id}$ and $(A_l - b_{l+1} I)^{-1}$ is symmetric, it is sufficient to choose $v$ so that
$$(3)\qquad v^t (A_l - b_{l+1} I)^{-2} v \le \frac{\phi(A_l, b_l) - \phi(A_l, b_{l+1})}{\|U\|^2} \big( -1 - v^t (A_l - b_{l+1} I)^{-1} v \big).$$
Recall the notation $I_D := \{j \le m \mid \alpha_j \neq 0\}$, where $(\alpha_j)_{j\le m}$ are the diagonal entries of $D$. Since we have assumed that $\mathrm{Ker}(D) \subset \mathrm{Ker}(U)$, we have
$$\|U\|_{HS}^2 = \sum_{j\le m} \|U e_j\|_2^2 = \sum_{j\in I_D} \|U e_j\|_2^2 \le |I_D| \cdot \|U\|^2,$$
and thus $|I_D| \ge \frac{\|U\|_{HS}^2}{\|U\|^2}$. At each step, we will select a vector $v$ satisfying (3) among $\big( \frac{U e_j}{\alpha_j} \big)_{j\in I_D}$. Our task therefore is to find $j \in I_D$ such that
$$(4)\qquad (U e_j)^t (A_l - b_{l+1} I)^{-2}\, U e_j \le \frac{\phi(A_l, b_l) - \phi(A_l, b_{l+1})}{\|U\|^2} \Big( -\alpha_j^2 - (U e_j)^t (A_l - b_{l+1} I)^{-1} U e_j \Big).$$
The existence of such a $j \in I_D$ is guaranteed by the fact that condition (4) holds if we take the sum over all $\big( \frac{U e_j}{\alpha_j} \big)_{j\in I_D}$. The hypothesis $\mathrm{Ker}(D) \subset \mathrm{Ker}(U)$ implies that:
• $\sum_{j\in I_D} (U e_j)^t (A_l - b_{l+1} I)^{-2}\, U e_j = \mathrm{Tr}\big( U^t (A_l - b_{l+1} I)^{-2} U \big)$,
• $\sum_{j\in I_D} (U e_j)^t (A_l - b_{l+1} I)^{-1}\, U e_j = \mathrm{Tr}\big( U^t (A_l - b_{l+1} I)^{-1} U \big)$.
Therefore it is enough to prove that, at each step, one has
$$(5)\qquad \mathrm{Tr}\big( U^t (A_l - b_{l+1} I)^{-2} U \big) \le \frac{\phi(A_l, b_l) - \phi(A_l, b_{l+1})}{\|U\|^2} \Big( -\|D\|_{HS}^2 - \phi(A_l, b_{l+1}) \Big).$$
The rest of the proof is similar to the one in [16]; one just needs to replace $m$ by $\|D\|_{HS}^2$. For the sake of completeness, we include the proof. The next lemma determines the conditions required at each step in order to prove (5).
Lemma 2.2. Suppose that $A_l$ has $l$ nonzero eigenvalues, all greater than $b_l$, and write $Z$ for the orthogonal projection onto the kernel of $A_l$. If
$$(6)\qquad \phi(A_l, b_l) \le -\|D\|_{HS}^2 - \frac{\|U\|^2}{\delta}$$
and
$$(7)\qquad 0 < \delta < b_l \le \delta\, \frac{\|Z U\|_{HS}^2}{\|U\|^2},$$
then there exists $i \in I_D$ such that $A_{l+1} := A_l + \frac{U e_i}{\alpha_i} \cdot \big( \frac{U e_i}{\alpha_i} \big)^t$ has $l+1$ nonzero eigenvalues, all greater than $b_{l+1} := b_l - \delta$, and $\phi(A_{l+1}, b_{l+1}) \le \phi(A_l, b_l)$.
Proof. As mentioned before, it is enough to prove inequality (5). We set $\Delta_l := \phi(A_l, b_l) - \phi(A_l, b_{l+1})$. By (6), we get
$$\phi(A_l, b_{l+1}) \le -\|D\|_{HS}^2 - \frac{\|U\|^2}{\delta} - \Delta_l.$$
Inserting this in (5), we see that it is sufficient to prove the following inequality:
$$(8)\qquad \mathrm{Tr}\big( U^t (A_l - b_{l+1} I)^{-2} U \big) \le \Delta_l \Big( \frac{\Delta_l}{\|U\|^2} + \frac{1}{\delta} \Big).$$
Now denote by $P$ the orthogonal projection onto the image of $A_l$. We set
$$\phi_P(A_l, b_l) := \mathrm{Tr}\big( U^t P (A_l - b_l I)^{-1} P U \big) \qquad\text{and}\qquad \Delta_l^P := \phi_P(A_l, b_l) - \phi_P(A_l, b_{l+1}),$$
and use similar notation for $Z$. Since $P$, $Z$ and $A_l$ commute, one can write
$$\Delta_l = \Delta_l^P + \Delta_l^Z \qquad\text{and}\qquad \phi(A_l, b_l) = \phi_P(A_l, b_l) + \phi_Z(A_l, b_l).$$
Note that
$$(A_l - b_l I)^{-1} - (A_l - b_{l+1} I)^{-1} = (A_l - b_l I)^{-1} \big( b_l I - A_l + A_l - b_{l+1} I \big) (A_l - b_{l+1} I)^{-1} = \delta (A_l - b_l I)^{-1} (A_l - b_{l+1} I)^{-1},$$
and since $P (A_l - b_l I)^{-1} P$ and $P (A_l - b_{l+1} I)^{-1} P$ are positive semidefinite, we have:
$$U^t P (A_l - b_l I)^{-1} P U - U^t P (A_l - b_{l+1} I)^{-1} P U \succeq \delta\, U^t P (A_l - b_{l+1} I)^{-2} P U.$$
Inserting this in (8), it is enough to prove that:
$$\mathrm{Tr}\big( U^t Z (A_l - b_{l+1} I)^{-2} Z U \big) \le \Delta_l \Big( \frac{\Delta_l}{\|U\|^2} + \frac{1}{\delta} \Big) - \frac{\Delta_l^P}{\delta}.$$
Since $A_l Z = 0$, we have:
• $\mathrm{Tr}\big( U^t Z (A_l - b_{l+1} I)^{-2} Z U \big) = \frac{\|Z U\|_{HS}^2}{b_{l+1}^2}$, and
• $\Delta_l^Z = -\frac{\|Z U\|_{HS}^2}{b_l} + \frac{\|Z U\|_{HS}^2}{b_{l+1}} = \delta\, \frac{\|Z U\|_{HS}^2}{b_l b_{l+1}}$,
so, taking into account the fact that $\Delta_l \ge \Delta_l^Z \ge 0$, it remains to prove the following:
$$(9)\qquad \frac{\|Z U\|_{HS}^2}{b_{l+1}^2} \le \delta^2\, \frac{\|Z U\|_{HS}^4}{\|U\|^2\, b_l^2\, b_{l+1}^2} + \frac{\|Z U\|_{HS}^2}{b_l b_{l+1}}.$$
By hypothesis (7), this last inequality follows from
$$(10)\qquad \frac{\|Z U\|_{HS}^2}{b_{l+1}^2} \le \delta\, \frac{\|Z U\|_{HS}^2}{b_l\, b_{l+1}^2} + \frac{\|Z U\|_{HS}^2}{b_l b_{l+1}},$$
which is trivially true since $b_{l+1} = b_l - \delta$. $\square$
We are now able to complete the proof of Theorem 1.1. To this end, we must verify that conditions (6) and (7) hold at each step. At the beginning we have $A_0 = 0$ and $Z = \mathrm{Id}$, so we must choose a barrier $b_0$ such that
$$(11)\qquad -\frac{\|U\|_{HS}^2}{b_0} \le -\|D\|_{HS}^2 - \frac{\|U\|^2}{\delta}$$
and
$$(12)\qquad b_0 \le \delta\, \frac{\|U\|_{HS}^2}{\|U\|^2}.$$
We choose
$$b_0 := \varepsilon\, \frac{\|U\|_{HS}^2}{\|D\|_{HS}^2} \qquad\text{and}\qquad \delta := \frac{\varepsilon}{1-\varepsilon} \cdot \frac{\|U\|^2}{\|D\|_{HS}^2},$$
and note that (11) and (12) are satisfied. Also, at each step (6) holds because $\phi(A_{l+1}, b_{l+1}) \le \phi(A_l, b_l)$. Since $\|Z U\|_{HS}^2$ decreases at each step by at most $\|U\|^2$, the right-hand side of (7) decreases by at most $\delta$, and therefore (7) continues to hold once we replace $b_l$ by $b_l - \delta$. Finally, note that after $k = (1-\varepsilon)^2\, \frac{\|U\|_{HS}^2}{\|U\|^2}$ steps the barrier will be
$$b_k = b_0 - k\delta = \varepsilon^2\, \frac{\|U\|_{HS}^2}{\|D\|_{HS}^2}.$$
This completes the proof. $\square$
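Since Theorem 1.1 asserts the existence of a good subset $\sigma$, its guarantee can be checked by brute force on a small instance. The sketch below (our illustration, with $D = \mathrm{Id}$, so that $\mathrm{Ker}(D) \subset \mathrm{Ker}(U)$ holds trivially, and $U$ chosen with orthonormal rows so that $\|U\| = 1$) does exactly that:

```python
import numpy as np
from itertools import combinations

rng = np.random.default_rng(3)
n, m, eps = 3, 6, 0.15

Q, _ = np.linalg.qr(rng.standard_normal((m, n)))
U = Q.T                                   # 3 x 6 with orthonormal rows: ||U|| = 1
D = np.eye(m)

# Guaranteed subset size and lower bound on the smallest singular value
k = int((1 - eps)**2 * np.linalg.norm(U, 'fro')**2 / np.linalg.norm(U, 2)**2)
bound = eps * np.linalg.norm(U, 'fro') / np.linalg.norm(D, 'fro')

# Theorem 1.1 asserts some sigma of size k achieves s_min(U_sigma) >= bound,
# so an exhaustive search over all subsets of size k must succeed.
found = any(np.linalg.svd(U[:, list(s)], compute_uv=False)[-1] >= bound
            for s in combinations(range(m), k))
assert found
```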
3. PROPORTIONAL DVORETZKY-ROGERS FACTORIZATION
Let $BM_n$ denote the space of all $n$-dimensional normed spaces $X$, known as the Banach-Mazur compactum. If $X, Y$ are in $BM_n$, we define the Banach-Mazur distance between $X$ and $Y$ as follows:
$$d(X, Y) = \inf \{ \|T\| \cdot \|T^{-1}\| \mid T \text{ is an isomorphism between } X \text{ and } Y \}.$$
Remark 3.1. For two symmetric convex bodies $K, L$ in $\mathbb{R}^n$, one can define the Banach-Mazur distance between $K$ and $L$ as
$$d(K, L) = \inf \{ \alpha/\beta \mid \beta L \subset T(K) \subset \alpha L \}.$$
One can easily check that this distance is consistent with the previous one, as $d(X, Y) = d(B_X, B_Y)$.
The classical Dvoretzky-Rogers lemma [6] shows that if $X$ is an $n$-dimensional Banach space, then there exist $x_1, \dots, x_m \in X$ with $m = \sqrt{n}$ such that for all scalars $(a_j)_{j\le m}$
$$\max_{j\le m} |a_j| \le \Big\| \sum_{j\le m} a_j x_j \Big\|_X \le c \Big( \sum_{j\le m} a_j^2 \Big)^{1/2},$$
where $c$ is a universal constant. Bourgain-Szarek [3] proved that the previous statement holds for $m$ proportional to $n$, and called the result "the proportional Dvoretzky-Rogers factorization":
Theorem B (Proportional Dvoretzky-Rogers factorization). Let $X$ be an $n$-dimensional Banach space. For every $\varepsilon \in (0,1)$, there exist $x_1, \dots, x_k \in X$ with $k \ge [(1-\varepsilon)n]$ such that for all scalars $(a_j)_{j\le k}$
$$\max_{j\le k} |a_j| \le \Big\| \sum_{j\le k} a_j x_j \Big\|_X \le c(\varepsilon) \Big( \sum_{j\le k} a_j^2 \Big)^{1/2},$$
where $c(\varepsilon)$ is a constant depending only on $\varepsilon$. Equivalently, the identity operator $i_{2,\infty} : \ell_2^k \to \ell_\infty^k$ can be written as $i_{2,\infty} = \alpha \circ \beta$ with $\beta : \ell_2^k \to X$, $\alpha : X \to \ell_\infty^k$ and $\|\alpha\| \cdot \|\beta\| \le c(\varepsilon)$.
Finding the right dependence on $\varepsilon$ is an important problem, and the optimal result is not yet known. In [17], Szarek showed that one cannot hope for a dependence better than $c\varepsilon^{-1/10}$. Szarek-Talagrand [18] proved that the previous result holds with $c(\varepsilon) = c\varepsilon^{-2}$, and in [7] and [8] Giannopoulos improved the dependence to $c\varepsilon^{-3/2}$ and $c\varepsilon^{-1}$. In all these results, a factorization for the identity operator $i_{1,2} : \ell_1^k \to \ell_2^k$ was proven, and by duality the factorization for $i_{2,\infty}$ was deduced. The previous proofs used geometric results, technical combinatorics and Grothendieck's factorization theorem. Here we present a direct proof using Theorem 1.1, which allows us to recover the best known dependence on $\varepsilon$ and improve the universal constant involved.
Note that Theorem B can be formulated with symmetric convex bodies. In [13], Litvak and Tomczak-Jaegermann proved a nonsymmetric version of the proportional Dvoretzky-Rogers factorization:

Theorem C (Litvak-Tomczak-Jaegermann). Let $K \subset \mathbb{R}^n$ be a convex body such that $B_2^n$ is the ellipsoid of minimal volume containing $K$. Let $\varepsilon \in (0,1)$ and set $k = [(1-\varepsilon)n]$. There exist vectors $y_1, y_2, \dots, y_k$ in $K$ and an orthogonal projection $P$ in $\mathbb{R}^n$ with $\mathrm{rank}\, P \ge k$ such that for all scalars $t_1, \dots, t_k$
$$c\varepsilon^3 \Big( \sum_{j=1}^k |t_j|^2 \Big)^{1/2} \le \Big\| \sum_{j=1}^k t_j P y_j \Big\|_{PK} \le \frac{6}{\varepsilon} \sum_{j=1}^k |t_j|,$$
where $c > 0$ is a universal constant.
Using Theorem 1.1 again, combined with some tools developed in [3] and [13], we will be able to improve the dependence on $\varepsilon$ in the previous statement.
3.1. The symmetric case. Let us start with the original proportional Dvoretzky-Rogers factorization. We will prove the following:

Theorem 3.2. Let $X$ be an $n$-dimensional Banach space. For every $\varepsilon \in (0,1)$, there exist $x_1, \dots, x_k \in X$ with $k \ge [(1-\varepsilon)^2 n]$ such that for all scalars $(a_j)_{j\le k}$
$$\varepsilon \Big( \sum_{j\le k} a_j^2 \Big)^{1/2} \le \Big\| \sum_{j\le k} a_j x_j \Big\|_X \le \sum_{j\le k} |a_j|.$$
Equivalently, the identity operator $i_{1,2} : \ell_1^k \to \ell_2^k$ can be written as $i_{1,2} = \alpha \circ \beta$, where $\beta : \ell_1^k \to X$, $\alpha : X \to \ell_2^k$ and $\|\alpha\| \cdot \|\beta\| \le \varepsilon^{-1}$.
Proof. Without loss of generality, we may assume that $X = (\mathbb{R}^n, \|\cdot\|_X)$ and that $B_2^n$ is the ellipsoid of minimal volume containing $B_X$. By John's theorem [10] there exist contact points $x_1, \dots, x_m$ of $B_X$ with $B_2^n$ (so that $\|x_j\|_X = \|x_j\|_{X^*} = \|x_j\|_2 = 1$) and positive scalars $c_1, \dots, c_m$ such that
$$\mathrm{Id} = \sum_{j\le m} c_j x_j x_j^t \qquad\text{and}\qquad \|\cdot\|_2 \le \|\cdot\|_X.$$
Let $U = (\sqrt{c_1}\, x_1, \dots, \sqrt{c_m}\, x_m)$ be the $n \times m$ rectangular matrix whose columns are $\sqrt{c_j}\, x_j$, and let $D = \mathrm{diag}(\sqrt{c_1}, \dots, \sqrt{c_m})$ be the $m \times m$ diagonal matrix with $\sqrt{c_j}$ on its diagonal.
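The quantities appearing at this step can be checked on a concrete identity decomposition. The sketch below (our illustration; it uses the union of two orthonormal bases with weights $c_j = 1/2$, an assumed example rather than the John decomposition of a particular body) verifies that $U U^t = \mathrm{Id}$, so $\|U\| = 1$ and $\|U\|_{HS}^2 = \|D\|_{HS}^2 = n$, which is exactly what makes Theorem 1.1 give $s_{\min} \ge \varepsilon$ in the next step:

```python
import numpy as np

rng = np.random.default_rng(4)
n = 4
B1 = np.eye(n)
B2, _ = np.linalg.qr(rng.standard_normal((n, n)))
xs = np.column_stack([B1, B2])            # m = 2n unit columns x_j
cs = np.full(2 * n, 0.5)                  # weights: sum_j c_j x_j x_j^t = Id

assert np.allclose((cs * xs) @ xs.T, np.eye(n))

U = xs * np.sqrt(cs)                      # columns sqrt(c_j) x_j
D = np.diag(np.sqrt(cs))

assert np.allclose(U @ U.T, np.eye(n))                # hence ||U|| = 1
assert np.isclose(np.linalg.norm(U, 'fro')**2, n)     # ||U||_HS^2 = n
assert np.isclose(np.linalg.norm(D, 'fro')**2, n)     # ||D||_HS^2 = n
```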
Let $\varepsilon < 1$. Applying Theorem 1.1 to $U$ and $D$, we find $\sigma \subset \{1, \dots, m\}$ such that $k = |\sigma| \ge \big[ (1-\varepsilon)^2 n \big]$ and for all $a = (a_j)_{j\le m}$
$$(13)\qquad \big\| U_\sigma D_\sigma^{-1} a \big\|_2 = \Big\| \sum_{j\in\sigma} a_j x_j \Big\|_2 \ge \varepsilon \Big( \sum_{j\in\sigma} |a_j|^2 \Big)^{1/2}.$$
To simplify the notation, we may assume that $\sigma = \{1, \dots, k\}$. Denote by $P$ the orthogonal projection of $X$ onto $Y = \mathrm{span}\{(x_j)_{j\le k}\}$. Note that (13) guarantees that the $(x_j)_{j\le k}$ are linearly independent, and therefore that $P$ has rank $k$. Define $T$ and $\beta$ as follows:
$$\beta : \ell_1^k \to X, \quad e_j \mapsto x_j \ (j \le k) \qquad\text{and}\qquad T : Y \to \ell_2^k, \quad x_j \mapsto e_j \ (j \le k),$$
and write $\alpha = T P$. For $a = (a_j)_{j\le k} \in \mathbb{R}^k$, by the triangle inequality we have
$$\|\beta(a)\|_X = \Big\| \sum_{j\le k} a_j x_j \Big\|_X \le \sum_{j\le k} |a_j| \cdot \|x_j\|_X = \sum_{j\le k} |a_j|,$$
and therefore $\|\beta\| \le 1$. Now let $x \in X$; then $P x \in Y$ and one can write $P x = \sum_{j\le k} a_j x_j$. Using (13) we get
$$\|\alpha(x)\|_2 = \Big( \sum_{j\le k} a_j^2 \Big)^{1/2} \le \frac{1}{\varepsilon} \Big\| \sum_{j\le k} a_j x_j \Big\|_2 = \frac{1}{\varepsilon} \|P x\|_2 \le \frac{1}{\varepsilon} \|x\|_2 \le \frac{1}{\varepsilon} \|x\|_X,$$
and therefore $\|\alpha\| \le \varepsilon^{-1}$, which finishes the proof. $\square$
As a direct application of the previous result, we have:

Corollary 3.3. Let $X$ be an $n$-dimensional Banach space. For any $\varepsilon \in (0,1)$, there exists a subspace $Y$ of $X$ of dimension $k \ge [(1-\varepsilon)^2 n]$ such that $d(Y, \ell_1^k) \le \frac{\sqrt{n}}{\varepsilon}$.
3.2. The nonsymmetric case. Let us now turn to the nonsymmetric version of Theorem 3.2. We will prove the following:

Theorem 3.4. Let $K \subset \mathbb{R}^n$ be a convex body such that $B_2^n$ is the ellipsoid of minimal volume containing $K$. For every $\varepsilon \in (0,1)$, there exist contact points $x_1, \dots, x_k$ with $k \ge [(1-\varepsilon)n]$ and an orthogonal projection $P$ of rank $\ge k$ such that for all $(a_j)_{j\le k}$
$$\frac{\varepsilon^2}{16} \Big( \sum_{j=1}^k |a_j|^2 \Big)^{1/2} \le \Big\| \sum_{j=1}^k a_j P x_j \Big\|_{PK} \le \frac{4}{\varepsilon} \sum_{j=1}^k |a_j|.$$
Proof. By John's theorem [10], we get an identity decomposition in $\mathbb{R}^n$:
$$\mathrm{Id} = \sum_{j\le m} c_j x_j x_j^t \qquad\text{and}\qquad \sum_{j\le m} c_j x_j = 0,$$
where $x_1, \dots, x_m$ are contact points of $K$ and $B_2^n$ and $(c_j)_{j\le m}$ are positive scalars. Note that we will not use the second assertion, i.e. the fact that $\sum_{j\le m} c_j x_j = 0$.
As before, take $U = (\sqrt{c_1}\, x_1, \dots, \sqrt{c_m}\, x_m)$, the $n \times m$ rectangular matrix whose columns are $\sqrt{c_j}\, x_j$, and let $D = \mathrm{diag}(\sqrt{c_1}, \dots, \sqrt{c_m})$ be the $m \times m$ diagonal matrix with $\sqrt{c_j}$ on its diagonal. Applying Theorem 1.1 to $U$ and $D$ with parameter $\frac{\varepsilon}{4}$, we find $\sigma_1 \subset \{1, \dots, m\}$ such that
$$s = |\sigma_1| \ge \Big( 1 - \frac{\varepsilon}{4} \Big)^2 n \ge \Big( 1 - \frac{\varepsilon}{2} \Big) n$$
and for all $a = (a_j)_{j\le m}$
$$(14)\qquad \big\| U_{\sigma_1} D_{\sigma_1}^{-1} a \big\|_2 = \Big\| \sum_{j\in\sigma_1} a_j x_j \Big\|_2 \ge \frac{\varepsilon}{4} \Big( \sum_{j\in\sigma_1} |a_j|^2 \Big)^{1/2}.$$
Define $Y = \mathrm{span}\{x_j\}_{j\in\sigma_1}$. We will now use the argument of Litvak and Tomczak-Jaegermann to construct the projection $P$. First, partition $\sigma_1$ into $\big[ \frac{\varepsilon s}{2} \big]$ disjoint subsets $A_l$ of equal size. Clearly
$$|A_l| \le \bigg[ \frac{s}{\big[\frac{\varepsilon s}{2}\big]} \bigg] + 1 \le \bigg[ \frac{2}{\varepsilon} \cdot \frac{\frac{\varepsilon s}{2}}{\big[\frac{\varepsilon s}{2}\big]} \bigg] + 1 \le \frac{4}{\varepsilon} + 1.$$
Let $z_l = \sum_{i\in A_l} x_i$ and take $P : Y \to Y$ the orthogonal projection onto $\mathrm{span}\{z_l\}^\perp$. For every $l$ we have $P z_l = 0$, so that for $j \in A_l$ we can write
$$-P x_j = \sum_{i\in A_l,\, i\neq j} P x_i = (|A_l| - 1) \cdot \frac{1}{|A_l| - 1} \sum_{i\in A_l,\, i\neq j} P x_i.$$
We deduce that for every $l$ and every $j \in A_l$ we have $-P x_j \in (|A_l| - 1)\, P K \subset \frac{4}{\varepsilon}\, P K$.
Let $T : \mathbb{R}^{|\sigma_1|} \to Y$ be the linear operator defined by $T e_j = x_j$ for all $j \in \sigma_1$, where $(e_j)_{j\in\sigma_1}$ denotes the canonical basis of $\mathbb{R}^{|\sigma_1|}$ and $Y$ is equipped with the Euclidean norm. Since the $(x_j)_{j\in\sigma_1}$ are linearly independent, $T$ is an isomorphism. Moreover, by (14), we have $\|T^{-1}\| \le \frac{4}{\varepsilon}$. Take $P' = T^{-1} P T$ and let $P''$ be the orthogonal projection onto $\mathrm{Im}\, P'$. It is easy to check that $P'' P' = P'$ and
$$k = \mathrm{rank}\, P'' = \mathrm{rank}\, P \ge \Big( 1 - \frac{\varepsilon}{2} \Big) s \ge (1 - \varepsilon)\, n.$$
For all scalars $(a_j)_{j\in\sigma_1}$,
$$\Big\| \sum_{j\in\sigma_1} a_j P x_j \Big\|_2 = \Big\| \sum_{j\in\sigma_1} a_j P T e_j \Big\|_2 = \Big\| \sum_{j\in\sigma_1} T\big( a_j P' e_j \big) \Big\|_2 \ge \frac{1}{\|T^{-1}\|} \Big\| \sum_{j\in\sigma_1} a_j P' e_j \Big\|_2 \ge \frac{\varepsilon}{4} \Big\| \sum_{j\in\sigma_1} a_j P'' e_j \Big\|_2.$$
Now take $U' = (P'' e_1, \dots, P'' e_s)$, the $s \times s$ matrix whose columns are the $P'' e_j$. Applying Theorem 1.1 to $U'$ with $\mathrm{Id}$ as the diagonal matrix and $\frac{\varepsilon}{4}$ as parameter, there exists $\sigma \subset \sigma_1$ of size
$$|\sigma| \ge \Big( 1 - \frac{\varepsilon}{4} \Big)^2 s \ge (1 - \varepsilon)\, n$$
such that for all scalars $(a_j)_{j\in\sigma}$,
$$\Big\| \sum_{j\in\sigma} a_j P'' e_j \Big\|_2 \ge \frac{\varepsilon}{4} \Big( \sum_{j\in\sigma} |a_j|^2 \Big)^{1/2}.$$
This gives us the following:
$$\Big\| \sum_{j\in\sigma} a_j P x_j \Big\|_2 \ge \frac{\varepsilon}{4} \Big\| \sum_{j\in\sigma} a_j P'' e_j \Big\|_2 \ge \frac{\varepsilon^2}{16} \Big( \sum_{j\in\sigma} |a_j|^2 \Big)^{1/2}.$$
On the other hand, since $K \subset B_2^n$ we have $P K \subset B_2^k$ and therefore
$$\Big\| \sum_{j\in\sigma} a_j P x_j \Big\|_2 \le \Big\| \sum_{j\in\sigma} a_j P x_j \Big\|_{PK}.$$
Denoting by $A = -P K \cap P K$, which is a centrally symmetric convex body, and using the fact that $-P x_j \in \frac{4}{\varepsilon}\, P K$ together with the triangle inequality, one can write
$$\Big\| \sum_{j\in\sigma} a_j P x_j \Big\|_A \le \frac{4}{\varepsilon} \sum_{j\in\sigma} |a_j|.$$
Finally, we have
$$\frac{\varepsilon^2}{16} \Big( \sum_{j\in\sigma} |a_j|^2 \Big)^{1/2} \le \Big\| \sum_{j\in\sigma} a_j P x_j \Big\|_2 \le \Big\| \sum_{j\in\sigma} a_j P x_j \Big\|_{PK} \le \Big\| \sum_{j\in\sigma} a_j P x_j \Big\|_A \le \frac{4}{\varepsilon} \sum_{j\in\sigma} |a_j|. \qquad \square$$
One can interpret the previous result geometrically as follows:

Corollary 3.5. Let $K \subset \mathbb{R}^n$ be a convex body such that $B_2^n$ is the ellipsoid of minimal volume containing $K$. For every $\varepsilon \in (0,1)$, there exists an orthogonal projection $P$ of rank $k \ge [(1-\varepsilon)n]$ such that
$$\frac{\varepsilon}{4}\, B_1^k \subset P K \subset \frac{16}{\varepsilon^2}\, B_2^k.$$
Moreover, $d(P K, B_1^k) \le \frac{64\sqrt{n}}{\varepsilon^3}$. By duality, this means that there exists a subspace $E \subset \mathbb{R}^n$ of dimension $k \ge [(1-\varepsilon)n]$ such that
$$\frac{\varepsilon^2}{16}\, B_2^k \subset K \cap E \subset \frac{4}{\varepsilon}\, B_\infty^k.$$
Moreover, $d(K \cap E, B_\infty^k) \le \frac{64\sqrt{n}}{\varepsilon^3}$.
3.3. Estimate of the Banach-Mazur distance to the cube. In [3], Bourgain-Szarek showed how to estimate the Banach-Mazur distance to the cube once a proportional Dvoretzky-Rogers factorization is available. This technique was used again in [7] and [18]. Since we obtain a proportional Dvoretzky-Rogers factorization with a better constant, the same argument allows us to recover the best known asymptotics for the Banach-Mazur distance to the cube and to improve the constants involved. Let us start by defining
$$R_\infty^n = \max \{ d(X, \ell_\infty^n) \mid X \in BM_n \}.$$
Similarly one can define $R_1^n$, and since the Banach-Mazur distance is invariant under duality, $R_1^n = R_\infty^n$. It follows from John's theorem [10] that the diameter of $BM_n$ is at most $n$, and therefore a trivial estimate is $R_\infty^n \le n$. In [17], Szarek showed the existence of an $n$-dimensional Banach space $X$ such that $d(X, \ell_\infty^n) \ge c\sqrt{n \log(n)}$. Bourgain-Szarek proved in [3] that $R_\infty^n = o(n)$, while Szarek-Talagrand [18] and Giannopoulos [7] improved this upper bound to $c\, n^{7/8}$ and $c\, n^{5/6}$ respectively. Here, we will prove the following estimate:
Theorem 3.6. Let $X$ be an $n$-dimensional Banach space. Then
$$d(X, \ell_1^n) \le 2^{5/6}\, \sqrt{n} \cdot d(X, \ell_2^n)^{2/3}.$$
Since by John's theorem [10] we have $d(X, \ell_2^n) \le \sqrt{n}$ for any $X \in BM_n$, we get the following:

Corollary 3.7. $R_1^n = R_\infty^n \le (2n)^{5/6}$.
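The exponent arithmetic behind the corollary is elementary: plugging $d(X, \ell_2^n) \le \sqrt{n}$ into Theorem 3.6 gives $2^{5/6}\sqrt{n}\,(\sqrt{n})^{2/3} = 2^{5/6}\, n^{5/6} = (2n)^{5/6}$, which can be checked directly:

```python
import math

# Verify 2^(5/6) * sqrt(n) * sqrt(n)^(2/3) == (2n)^(5/6) for several n
for n in [2, 10, 100, 10**6]:
    lhs = 2**(5/6) * math.sqrt(n) * math.sqrt(n)**(2/3)
    assert math.isclose(lhs, (2 * n)**(5/6), rel_tol=1e-12)
```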
Proof of Theorem 3.6. We denote $d_X = d(X, \ell_2^n)$. In order to bound $d(X, \ell_1^n)$, we need to define an isomorphism $T : \ell_1^n \to X$ and estimate $\|T\| \cdot \|T^{-1}\|$. A natural way is to find a basis of $X$ and let $T$ be the operator which sends the canonical basis of $\mathbb{R}^n$ to this basis of $X$. The main idea is to find a "large" subspace $Y$ of $X$ which is "not too far" from $\ell_1$ (actually more is needed), and then complete the basis of $Y$ to obtain a basis of $X$. Finding the "large" subspace is the heart of the method and is given by the proportional Dvoretzky-Rogers factorization. The proof is mainly divided into four steps.

- First step: Place $B_X$ in a "good" position. Since the Banach-Mazur distance is invariant under linear transformations, we may change the position of $B_X$. Therefore, without loss of generality, we may assume that
$$\|\cdot\|_2 \le \|\cdot\|_X \le d_X\, \|\cdot\|_2.$$

- Second step: Let $\varepsilon > 0$ and set $k = (1 - 2\varepsilon)n$. Apply Theorem 3.2 to find $x_1, \dots, x_k$ in $X$ such that for all scalars $(a_j)_{j\le k}$
$$(15)\qquad \varepsilon \Big( \sum_{j\le k} a_j^2 \Big)^{1/2} \le \Big\| \sum_{j\le k} a_j x_j \Big\|_X \le \sum_{j\le k}