Article
Projection Methods for Uniformly Convex Expandable Sets
Stéphane Chrétien 1,2,3,* and Pascal Bondon 4
1 Laboratoire ERIC, Université Lyon 2, 69500 Bron, France
2 The Alan Turing Institute, London NW1 2DB, UK
3 Data Science Division, The National Physical Laboratory, Teddington TW11 0LW, UK
4 Laboratoire des Signaux et Systèmes, CentraleSupélec, CNRS, Université Paris-Saclay, 91190 Gif-sur-Yvette, France; pascal.bondon@l2s.centralesupelec.fr
* Correspondence: stephane.chretien@univ-lyon2.fr
Received: 21 February 2020; Accepted: 22 June 2020; Published: 6 July 2020
Abstract: Many problems in medical image reconstruction and machine learning can be formulated as nonconvex set theoretic feasibility problems. Among efficient methods that can be put to work in practice, successive projection algorithms have received a lot of attention in the case of convex constraint sets. In the present work, we provide a theoretical study of a general projection method in the case where the constraint sets are nonconvex and satisfy some other structural properties. We apply our algorithm to image recovery in magnetic resonance imaging (MRI) and to signal denoising in the spirit of Cadzow's method.
Keywords: cyclic projections; nonconvex sets; uniformly convex sets; strong convergence
1. Introduction
1.1. Background and Goal of the Paper
Many problems in applied mathematics, engineering, statistics and machine learning can be reduced to finding a point in the intersection of some subsets of a real separable Hilbert space $H$. In mathematical terms, let $(S_i)_{i \in I}$ be a finite family of proximinal subsets (i.e., sets $S$ such that any $x \in H$ admits a closest point in $S$) [1] of $H$ with a non-empty intersection. We address the problem of finding a point in the intersection of the sets $(S_i)_{i \in I}$ using a successive projection method.
When the sets are closed and convex, the problem is known as the convex feasibility problem, and is the subject of extensive literature; see [2–5] and references therein. In this paper, we take one step further into the scarcely investigated topic of finding a common point in the intersection of nonconvex sets. Convex feasibility problems have been applied to an extremely wide range of topics in systems analysis and control, signal and image processing. Important examples are: model order reduction [6], controller design [7], tensor analysis [8], image recovery [9], electrical capacitance tomography [10], MRI [11,12], and stabilisation of quantum systems and application to quantum computation [13,14].
Extending this problem to the nonconvex setting also has many applications, related to sparse estimation and, more specifically, low rank constraints, such as in control theory [15], signal denoising [16], phase retrieval [17] and structured matrix estimation [18,19], and it has great potential impact on the design of scalable algorithms for many machine learning problems such as deep neural networks [20]. Studies of projection methods in the nonconvex setting have been quite scarce in the literature [21–23], and a lot of work still remains to be done in order to understand the behavior of such methods for these difficult problems.
Mathematics2020,8, 1108; doi:10.3390/math8071108 www.mdpi.com/journal/mathematics
In the present paper, our goal is to investigate how the results in [22] can be improved in such a way that strong convergence can be obtained. The results proved in [22] assume that the sets involved in the feasibility problem can be written as a possibly uncountable union of closed convex sets, and that one of the sets is boundedly compact. Our study will be based on the assumption that the convex sets in the family are uniformly convex, which allows us to remove the bounded compactness assumption. As will be seen in Section 4, uniform convexity can be obtained by simply modifying the algorithm, even in the case where the sets $S_i$ are not expandable into a family of uniformly convex sets.
1.2. Preliminary on Projections and Expansion into Convex Sets
The notation Card will denote the cardinality of a set. $H$ will denote a real separable Hilbert space with scalar product $\langle \cdot \mid \cdot \rangle$, norm $|\cdot|$, and distance $d$. Let $S$ be a subset of $H$. $\overline{S}$ denotes the closure of $S$. $S$ is proximinal if each point in $H$ has at least one projection onto $S$. If $S$ is proximinal, then $S$ is closed. When $S$ is proximinal, $P_S$ is the point-to-set projection mapping onto $S$ defined by
$$(\forall x \in H) \quad P_S(x) = \{ y \in S \mid |x - y| = \inf_{z \in S} |x - z| \}.$$
$p_S(x)$ denotes an arbitrary element of $P_S(x)$ and $d(x, S) = |x - p_S(x)|$. $B(x, \rho)$ denotes the closed ball of center $x$ and radius $\rho$. In the case of a projection onto a convex set $C$, Kolmogorov's criterion is:
$$\langle x' - P_C(x) \mid x - P_C(x) \rangle \le 0 \quad (1)$$
for all $x' \in C$. Our work is a follow-on project to the work in [22], which introduced the notion of convex expandable sets in order to tackle certain feasibility problems involving rank-constrained matrices as described in [23]. We recall what it means for a nonconvex set to be expandable into a family of convex sets.
Definition 1. A subset $S$ of $H$ is said to be expandable into a family of convex sets when it is the union of non-trivial (i.e., not reduced to a single point) convex subsets $C_j$ of $H$, i.e.,
$$S = \bigcup_{j \in J} C_j \quad (2)$$
where $J$ is a possibly uncountable index set. Any family $(C_j)_{j \in J}$ satisfying (2) is called a convex expansion of $S$.
Remark 1. Uncountable unions often appear in practice. An important example is the set of matrices of rank $r$ in $\mathbb{R}^{n \times m}$ for $0 < r < \min\{n, m\}$. This set is the union of the rays passing through the null matrix and any matrix of rank $r$ different from zero. This constraint often appears in signal processing problems such as signal denoising [16] and, more generally, structured matrix estimation problems [19].
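To make the rank constraint of Remark 1 concrete: the metric projection of a matrix onto the set of matrices of rank at most $r$ is given by the truncated SVD (Eckart–Young), and for a generic matrix this also yields a closest point of the rank-$r$ set. A minimal NumPy sketch (the matrix sizes and the target rank are illustrative choices, not taken from the paper):

```python
import numpy as np

def project_rank(X, r):
    """Closest matrix of rank <= r in Frobenius norm (Eckart-Young),
    obtained by zeroing all but the r largest singular values of X."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    s[r:] = 0.0  # keep only the r leading singular values
    return U @ np.diag(s) @ Vt

rng = np.random.default_rng(0)
X = rng.standard_normal((6, 5))
Y = project_rank(X, 2)
print(np.linalg.matrix_rank(Y))  # 2
```

Projecting again leaves the result unchanged, as expected of a projection onto a closed set.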
Uniform convexity is an important concept, which makes it possible to prove many strong convergence results in the framework of successive projection methods ([4], Section 5.3).
Definition 2. A convex subset $C$ of $H$ is uniformly convex with respect to a positive-valued nondecreasing function $b(t)$ if
$$\forall (x, y) \in C^2, \quad B\big((x + y)/2,\ b(|x - y|)\big) \subset C.$$
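For instance, a Euclidean ball of radius $R$ is uniformly convex with respect to $b(t) = t^2/(8R)$: by the parallelogram law, the midpoint of a chord of length $t$ lies at distance at most $\sqrt{R^2 - t^2/4} \le R - t^2/(8R)$ from the center. A quick numerical check of Definition 2 for the disk (the radius and sample size below are arbitrary):

```python
import numpy as np

R = 2.0
b = lambda t: t**2 / (8 * R)  # claimed modulus of uniform convexity for B(0, R)

rng = np.random.default_rng(1)
ok = True
for _ in range(10_000):
    # draw two random points of the disk B(0, R)
    x, y = rng.standard_normal((2, 2))
    x *= R * rng.random() ** 0.5 / np.linalg.norm(x)
    y *= R * rng.random() ** 0.5 / np.linalg.norm(y)
    m = (x + y) / 2
    # B(m, b(|x-y|)) fits inside B(0, R) iff |m| + b(|x-y|) <= R
    ok &= np.linalg.norm(m) + b(np.linalg.norm(x - y)) <= R + 1e-12
print(ok)  # True
```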
Uniform convexity will be instrumental in our analysis. We define the following condition which is stronger than the condition of being expandable into a family of convex sets.
Definition 3. A subset $S$ of $H$ is said to be expandable in uniformly convex sets when the sets $C_j$ in (2) are moreover uniformly convex with the same function $b(\cdot)$. Any family $(C_j)_{j \in J}$ of uniformly convex sets satisfying (2) is called a uniformly convex expansion of $S$.
1.3. The Projection Algorithm
For any point $x$ in $H$, for the sake of simplicity, $p_{S_i}(x)$ will be denoted by $p_i(x)$. We will assume throughout that the family $\{S_i\}_{i \in I}$ is finite.
Definition 4. (Method of Projection onto Uniformly Convex Expandable Sets)
Given a point $x_0$ in $H$, two real numbers $\alpha$ and $\lambda$ in $(0, 1]$, a nonnegative integer $M$ and a sequence $(I_n)$ of subsets of $I$ satisfying the following conditions
C1. $(\forall n \in \mathbb{N})\ I_n \subset I$, $I = \bigcup_{j=n}^{n+M} I_j$ and $\mathrm{Card}(I_n) > 1$
C2. $(\forall n \in \mathbb{N})\ (\forall i \in I_n)\ \alpha \le \alpha_{i,n}$ and $\sum_{j \in I_n} \alpha_{j,n} = 1$
the projection-like method considered in this paper is iteratively defined by $(\forall n \in \mathbb{N})$
1. If $\exists j \in I_n$ such that $p_j(x_n) \in \bigcap_{i \in I_n} S_i$ then
   - If $I_n \neq I$, set $I_n = I$ and go to 1
   - If $I_n = I$, set $\lambda_n = 1$, $\alpha_{j,n} = 1$ and go to 2
   Else go to 2
2. $x_{n+1} = x_n + \lambda_n \sum_{i \in I_n} \alpha_{i,n} \big(p_i(x_n) - x_n\big) \quad (3)$
with $\lambda_n$ satisfying
C3. $(\forall n \in \mathbb{N})\ \lambda \le \lambda_n \le \mu_n$ where
$$\mu_n = \begin{cases} \dfrac{\sum_{i \in I_n} \alpha_{i,n}\, |x_n - p_i(x_n)|^2}{\big|x_n - \sum_{i \in I_n} \alpha_{i,n}\, p_i(x_n)\big|^2} & \text{if } x_n \notin \bigcap_{i \in I_n} S_i \\ 1 & \text{if } x_n \in \bigcap_{i \in I_n} S_i. \end{cases}$$
Remark 2. Using the assumption $\mathrm{Card}(I_n) > 1$, the coefficients $\alpha_{i,n}$, $i \in I_n$, can be chosen in such a way that the denominator in the definition of $\mu_n$ is not equal to zero. Note that one can relax the constraint $\mathrm{Card}(I_n) > 1$ in the case where $\mu_n$ is well defined at every iteration, thus allowing one to recover the cyclic projection method.
The numbers $\alpha$ and $\lambda$ ensure that the steps taken at each iteration are sufficiently large as compared to the distance of the iterates to their projections onto each $S_i$, $i \in I$. The use of the integer $M$ ensures that each $S_i$ is involved at least every $M$ iterations. The "If" condition in Step 1 of the algorithm ensures that $x_n$ will move at each iteration by enforcing that it does not yet belong to all the selected sets $S_i$, $i \in I_n$. Our method is an extension of Ottavy's method [4] to the case of nonconvex sets. The idea of using a potentially variable index set $I_n$ at each iteration allows us to recover several variants, such as cyclic projections [2,21,22,24] or parallel projections [2,23,24]. Notice that, due to the finiteness of the cardinality of $I$, finding an index $j$ such that $p_j(x_n) \in \bigcap_{i \in I_n} S_i$ is easy if such an index exists.
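As an illustration, the following sketch runs the iteration of Definition 4 in the simplest convex configuration: two half-planes of $\mathbb{R}^2$, $I_n = I$ at every iteration (so the index-set switch in Step 1 never triggers), uniform weights, and $\lambda_n = 1$, which is admissible since $\mu_n \ge 1$ by convexity. The sets, starting point and tolerances are illustrative choices, not taken from the paper:

```python
import numpy as np

# Two convex sets in R^2 with explicit projections:
# S1 = {y : y[0] <= 0} and S2 = {y : y[1] <= 0}.
projections = [
    lambda y: np.array([min(y[0], 0.0), y[1]]),
    lambda y: np.array([y[0], min(y[1], 0.0)]),
]
members = [
    lambda y: y[0] <= 1e-12,
    lambda y: y[1] <= 1e-12,
]

def projection_step(x):
    """One iteration of (3) with I_n = I, alpha_{i,n} = 1/Card(I), lambda_n = 1."""
    p = [proj(x) for proj in projections]
    # Step 1: if some projection already lies in the intersection, jump to it
    for pj in p:
        if all(m(pj) for m in members):
            return pj
    # Step 2: relaxed average of the projections
    return x + 1.0 * (np.mean(p, axis=0) - x)

x = np.array([2.0, 1.0])
for _ in range(60):
    x = projection_step(x)
print(max(np.linalg.norm(x - proj(x)) for proj in projections))  # below 1e-9
```

On this instance the averaged step halves the distance to the intersection at each iteration until the Step 1 test fires, after which the iterate is (numerically) feasible.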
1.4. Our Contributions
The main contributions of our work are a proof that strong convergence holds in the proposed
setting, and a new constructive modification of the projection method in order to accommodate the case
of non-necessarily uniformly convex expandable sets. Based on our findings, we will then weaken our
assumptions and provide a new algorithm which does not need uniform convexity, while preserving
strong convergence to a feasible solution. Applications to an infinite dimensional MRI problem and to
low rank matrix estimation for signal denoising are presented in Section 5.
2. Ottavy’s Framework
2.1. Successive Projection Point-to-Set Mapping
Let $\alpha$ and $\lambda$ be two real numbers chosen in $(0, 1)$. Define the point-to-set mapping
$$D : x \in H \to D(x) = \{\tilde{x} \in H ;\ \tilde{x} = x + \lambda(x)(\bar{x} - x)\}$$
where $\bar{x}$ is a weighted average,
$$\bar{x} = \sum_{i \in I(x)} \alpha_i(x)\, p_i(x);$$
$I(x)$ is a subset of selected constraints such that $I(x) \subset I$ and $I(x) \neq \emptyset$; $\alpha_i(x)$, $i \in I$, is a set of normalized nonvanishing weights such that
$$\alpha_i(x) \in [0, 1], \quad \sum_{i \in I(x)} \alpha_i(x) = 1, \quad \text{and} \quad p_i(x) \neq x \Rightarrow \alpha_i(x) \ge \alpha;$$
and $\lambda(x)$ is the relaxation parameter such that
$$\lambda(x) \in [\lambda, +\infty) \quad \text{and} \quad x \neq \bar{x} \Rightarrow \lambda(x) \in [\lambda, \mu(\bar{x})]$$
with
$$\mu(\bar{x}) = \sum_{i \in I(x)} \alpha_i(x)\, |x - p_i(x)|^2 \,\big/\, |x - \bar{x}|^2.$$
2.2. Ottavy's Lemma
In [4], Ottavy proved the following very useful lemma.
Lemma 1. For any $x \in H$, $\tilde{x} \in D(x)$, and $z \in \bigcap_{i \in I} S_i$, the following results hold:
(i) $|\tilde{x} - x|^2 \le \dfrac{2}{\lambda}\big(|x - z|^2 - |\tilde{x} - z|^2\big)$
(ii) $2\,\langle x - z \mid x - \tilde{x}\rangle \le \Big(1 + \dfrac{2}{\lambda}\Big)\big(|x - z|^2 - |\tilde{x} - z|^2\big)$
(iii) $\max_{i \in I(x)} |p_i(x) - x|^2 \le \dfrac{(2 + \lambda)^2}{\alpha \lambda^2}\big(|x - z|^2 - |\tilde{x} - z|^2\big)$
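The inequalities of Lemma 1 can be checked numerically on a toy instance: two half-planes of $\mathbb{R}^2$, uniform weights, $\lambda(x) = 1$ (always admissible since $\mu(\bar{x}) \ge 1$ by convexity of the squared norm) and a point $z$ in the intersection. The constants $\alpha = \lambda = 0.5$ and the points below are arbitrary choices for the check:

```python
import numpy as np

alpha = lam = 0.5                      # weight and relaxation lower bounds
x = np.array([2.0, 1.0])
z = np.array([-1.0, -1.0])             # a point of S1 ∩ S2 (the third quadrant)
p = [np.array([0.0, 1.0]),             # projection of x onto S1 = {y : y[0] <= 0}
     np.array([2.0, 0.0])]             # projection of x onto S2 = {y : y[1] <= 0}
w = [0.5, 0.5]

xbar = sum(wi * pi for wi, pi in zip(w, p))
xt = x + 1.0 * (xbar - x)              # x~ in D(x) with lambda(x) = 1

gap = np.dot(x - z, x - z) - np.dot(xt - z, xt - z)   # |x-z|^2 - |x~-z|^2
assert np.dot(xt - x, xt - x) <= (2 / lam) * gap                      # (i)
assert 2 * np.dot(x - z, x - xt) <= (1 + 2 / lam) * gap               # (ii)
assert max(np.dot(pi - x, pi - x) for pi in p) \
       <= (2 + lam) ** 2 / (alpha * lam ** 2) * gap                   # (iii)
print("Lemma 1 inequalities hold on this instance")
```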
3. A Strong Convergence Result
Let $(x_n)_{n \in \mathbb{N}}$ be the sequence of iterates of the projection method defined by (3). In the present section, we show strong convergence of this sequence to a point in $\bigcap_{i \in I} S_i$. For every $n$ in $\mathbb{N}$ and for each $i \in I$, define
$$T_{i,n} = \bigcup \{C \subset S_i \mid C \text{ is convex and } p_i(x_n) \in C\}, \qquad T_n = \bigcap_{i \in I_n} T_{i,n}.$$
When the (finite) family $(S_i)_{i \in I}$ is expandable into a family of convex sets, $T_{i,n} \neq \emptyset$ for every $n$ in $\mathbb{N}$ and for each $i \in I$.
Lemma 2. If the following conditions are satisfied
C4. $(\forall n \in \mathbb{N})\ T_n \neq \emptyset$
C5. The finite family $(S_i)_{i \in I}$ is expandable in convex sets
then, for every $n$ in $\mathbb{N}$ and for every $z_n$ in $T_n$, we have
(i) $|x_{n+1} - x_n|^2 \le \dfrac{2}{\lambda}\big(|x_n - z_n|^2 - |x_{n+1} - z_n|^2\big)$
(ii) $2\,\langle x_n - z_n \mid x_n - x_{n+1}\rangle \le \Big(1 + \dfrac{2}{\lambda}\Big)\big(|x_n - z_n|^2 - |x_{n+1} - z_n|^2\big)$
(iii) $\sup_{i \in I_n} |p_i(x_n) - x_n|^2 \le \dfrac{(2 + \lambda)^2}{\alpha \lambda^2}\big(|x_n - z_n|^2 - |x_{n+1} - z_n|^2\big)$
Proof. Fix $n$ in $\mathbb{N}$ and $z_n$ in $T_n$. According to C5, each point $p_i(x_n)$ belongs to a set $C_{i,j_n} \subset S_i$, a convex set in a convex expansion of $S_i$, indexed by $j_n$. Hence $p_i(x_n)$ is also the projection of $x_n$ onto $C_{i,j_n}$, and $z_n \in C_{i,j_n}$. Replacing $x$, $\tilde{x}$ and $z$ respectively by $x_n$, $x_{n+1}$ and $z_n$ in Ottavy's Lemma (Lemma 1, or [4], Lemma 7.1), we obtain the desired results.
Lemma 3. If the conditions C4, C5, and
C6. $(\exists (z_n)_{n \in \mathbb{N}})$ such that $z_n \in T_n$ for all $n$ in $\mathbb{N}$ and $\sum_{n \in \mathbb{N}} |z_{n+1} - z_n| < +\infty$
are satisfied, then
(i) The sequences $(|x_n - z_n|)_{n \in \mathbb{N}}$ and $(|x_{n+1} - z_n|)_{n \in \mathbb{N}}$ are bounded.
(ii) The series $\sum_{n \in \mathbb{N}} \big(|x_n - z_n|^2 - |x_{n+1} - z_n|^2\big)$ is convergent.
(iii) For each $i$ in $I$, the sequence $(|p_i(x_n) - x_n|)_{n \in \mathbb{N}}$ converges to zero.
(iv) The sequence $(|x_n - z_n|)_{n \in \mathbb{N}}$ is convergent.
(v) The sequence $(z_n)_{n \in \mathbb{N}}$ converges strongly to a point in $\bigcap_{i \in I} S_i$.
Proof. (i) Let $(z_n)_{n \in \mathbb{N}}$ be a sequence satisfying C6, and let $k$ in $\mathbb{N}$. We deduce from Lemma 2(i) that $|x_{k+1} - z_k| \le |x_k - z_k|$. Then we have
$$|x_{k+1} - z_{k+1}| \le |x_{k+1} - z_k| + |z_{k+1} - z_k| \le |x_k - z_k| + |z_{k+1} - z_k|.$$
Therefore
$$|x_{n+1} - z_{n+1}| \le |x_0 - z_0| + \sum_{k=0}^{n} |z_{k+1} - z_k|$$
and then C6 ensures that the sequences $(|x_n - z_n|)_{n \in \mathbb{N}}$ and $(|x_{n+1} - z_n|)_{n \in \mathbb{N}}$ are bounded.
(ii) Now, the cosine law, followed by the Cauchy–Schwarz inequality, give
$$|x_{k+1} - z_{k+1}|^2 = |x_{k+1} - z_k|^2 + 2\langle x_{k+1} - z_k \mid z_k - z_{k+1}\rangle + |z_k - z_{k+1}|^2 \le |x_{k+1} - z_k|^2 + 2|x_{k+1} - z_k|\,|z_{k+1} - z_k| + |z_{k+1} - z_k|^2. \quad (4)$$
Using Lemma 2(i), we then obtain that
$$0 \le |x_k - z_k|^2 - |x_{k+1} - z_k|^2 \le |x_k - z_k|^2 - |x_{k+1} - z_{k+1}|^2 + 2|x_{k+1} - z_k|\,|z_{k+1} - z_k| + |z_{k+1} - z_k|^2.$$
Thus
$$0 \le \sum_{k=0}^{n} \big(|x_k - z_k|^2 - |x_{k+1} - z_k|^2\big) \le |x_0 - z_0|^2 + 2\sum_{k=0}^{n} |x_{k+1} - z_k|\,|z_{k+1} - z_k| + \sum_{k=0}^{n} |z_{k+1} - z_k|^2.$$
Since the sequence $(|x_{n+1} - z_n|)_{n \in \mathbb{N}}$ is bounded and the series $\sum_{n \in \mathbb{N}} |z_{n+1} - z_n|$ is convergent, the result (ii) follows at once.
(iii) Lemma 3(ii) and Lemma 2(iii) imply that $\lim_{n \to \infty} \sup_{i \in I_n} |p_i(x_n) - x_n|^2 = 0$. Thus for any $k \in [0, M]$,
$$\lim_{n \to \infty} \sup_{i \in I_{n+k}} |p_i(x_n) - x_n| = 0.$$
Then we deduce from C1 that $\lim_{n \to \infty} \sup_{i \in I} |p_i(x_n) - x_n| = 0$, which completes the proof of (iii).
(iv) Squaring the triangle inequality gives
$$|x_{k+1} - z_k|^2 - |x_{k+1} - z_{k+1}|^2 \le 2|x_{k+1} - z_{k+1}|\,|z_{k+1} - z_k| + |z_{k+1} - z_k|^2 \le 2|x_{k+1} - z_k|\,|z_{k+1} - z_k| + 3|z_{k+1} - z_k|^2.$$
Using (4), we deduce that
$$|x_{k+1} - z_{k+1}|^2 - |x_{k+1} - z_k|^2 \le 2|x_{k+1} - z_k|\,|z_{k+1} - z_k| + 3|z_{k+1} - z_k|^2.$$
Since the sequence $(|x_{n+1} - z_n|)_{n \in \mathbb{N}}$ is bounded, and $\sum_{n \in \mathbb{N}} |z_{n+1} - z_n|$ is convergent, we deduce from the last inequality that $\sum_{n \in \mathbb{N}} \big(|x_{n+1} - z_{n+1}|^2 - |x_{n+1} - z_n|^2\big)$ is convergent. Then using Lemma 3(ii), we obtain that the series $\sum_{n \in \mathbb{N}} \big(|x_{n+1} - z_{n+1}|^2 - |x_n - z_n|^2\big)$ is convergent, which is equivalent to the convergence of the sequence $(|x_n - z_n|)_{n \in \mathbb{N}}$.
(v) Since, by C6, $(z_n)_{n \in \mathbb{N}}$ is a Cauchy sequence, it is strongly convergent to a point $z$ in the Hilbert space $H$. For every $n$ in $\mathbb{N}$, $z_n \in T_n = \bigcap_{i \in I_n} T_{i,n} \subset \bigcap_{i \in I_n} S_i$. Fix $i$ in $I$. Due to condition C1, there exists a subsequence $(z_{\sigma(n)})_{n \in \mathbb{N}}$ of $(z_n)_{n \in \mathbb{N}}$ satisfying $z_{\sigma(n)} \in S_i$ for all $n \in \mathbb{N}$. Therefore, $z \in S_i$. Since this assertion is true for all $i$ in $I$, and each $S_i$ is closed, we obtain that $z \in \bigcap_{i \in I} S_i$.
Remark 3. In the convex case, Lemma 3(iii) is known in the simpler case where $(z_n)_{n \in \mathbb{N}}$ is chosen to be a constant sequence in $\bigcap_{i \in I} S_i$, and is a consequence of Fejér monotonicity [5].
We now state the main result of this section, which is strong convergence under the assumption that the sets $S_i$, $i \in I$, are expandable in uniformly convex sets.
Theorem 1. If the conditions C4, C6, and
C7. The finite family $(S_i)_{i \in I}$ is expandable in uniformly convex sets with respect to a function $b$
are satisfied, then the sequence $(x_n)_{n \in \mathbb{N}}$ converges strongly to a point in $\bigcap_{i \in I} S_i$.
Proof. Notice that C7 implies C5. Let $(z_n)_{n \in \mathbb{N}}$ be a sequence satisfying C6. According to C7, each point $p_i(x_n)$ belongs to a set $C_{i,j_n} \subset S_i$, a convex set in the uniformly convex expansion of $S_i$, indexed by $j_n$. Hence $p_i(x_n)$ is also the projection of $x_n$ onto $C_{i,j_n}$, and $z_n \in C_{i,j_n}$. Fix $n$ in $\mathbb{N}$, and $i$ in $I$. Define
$$H(x_n, p_i(x_n)) = \{x \in H \mid \langle x - p_i(x_n) \mid p_i(x_n) - x_n\rangle \ge 0\}.$$
According to C7, $p_i(x_n)$ and $z_n$ belong to the uniformly convex set $C_{i,j_n} \subset S_i$. Thus
$$B\big((z_n + p_i(x_n))/2,\ \rho_{i,n}\big) \subset C_{i,j_n},$$
where $\rho_{i,n} = b\big(|z_n - p_i(x_n)|\big)$. Since $p_i(x_n)$ is also the projection of $x_n$ onto $C_{i,j_n}$, it results from the Kolmogorov criterion (1) that $C_{i,j_n} \subset H(x_n, p_i(x_n))$. Then
$$B\big((z_n + p_i(x_n))/2,\ \rho_{i,n}\big) \subset C_{i,j_n} \subset H(x_n, p_i(x_n)). \quad (5)$$
Now, consider the translation $T : x \mapsto y = x - (p_i(x_n) - z_n)/2$. Clearly,
$$x \in B\big((z_n + p_i(x_n))/2,\ \rho_{i,n}\big) \text{ if and only if } y \in B(z_n, \rho_{i,n}). \quad (6)$$
On the other hand, using $z_n \in C_{i,j_n}$, we get that $x \in H(x_n, p_i(x_n))$ implies $y \in H(x_n, p_i(x_n))$. Indeed,
$$\langle y - p_i(x_n) \mid p_i(x_n) - x_n\rangle = \big\langle x - (p_i(x_n) - z_n)/2 - p_i(x_n) \,\big|\, p_i(x_n) - x_n\big\rangle = \langle x - p_i(x_n) \mid p_i(x_n) - x_n\rangle - \tfrac{1}{2}\langle p_i(x_n) - z_n \mid p_i(x_n) - x_n\rangle$$
and since $z_n \in C_{i,j_n}$, Kolmogorov's criterion gives
$$\tfrac{1}{2}\langle p_i(x_n) - z_n \mid p_i(x_n) - x_n\rangle \le 0,$$
which implies
$$\langle y - p_i(x_n) \mid p_i(x_n) - x_n\rangle \ge \langle x - p_i(x_n) \mid p_i(x_n) - x_n\rangle$$
and, after recalling that $x \in H(x_n, p_i(x_n))$,
$$\langle y - p_i(x_n) \mid p_i(x_n) - x_n\rangle \ge 0,$$
which implies that $y \in H(x_n, p_i(x_n))$, as desired. Hence (5), together with (6), imply
$$B(z_n, \rho_{i,n}) \subset H(x_n, p_i(x_n)).$$
Now, assume $x_n \notin S_i$, and define
$$y_n = z_n + \big(p_i(x_n) - x_n\big)/\big|p_i(x_n) - x_n\big|, \qquad u_n = z_n + \rho_{i,n}(z_n - y_n).$$
One easily checks that $|u_n - z_n| = \rho_{i,n}$, i.e., $u_n \in B(z_n, \rho_{i,n})$. Thus, using the inclusion $B(z_n, \rho_{i,n}) \subset H(x_n, p_i(x_n))$ established above, we get $u_n \in H(x_n, p_i(x_n))$, i.e., $\langle u_n - p_i(x_n) \mid p_i(x_n) - x_n\rangle \ge 0$. Therefore
$$\langle z_n - p_i(x_n) \mid p_i(x_n) - x_n\rangle \ge \langle z_n - u_n \mid p_i(x_n) - x_n\rangle = \rho_{i,n}\,\big|p_i(x_n) - x_n\big|. \quad (7)$$
Since this inequality is obviously satisfied when $x_n \in S_i$, it holds whether $x_n \notin S_i$ or not. Now, let
$$\rho_n = \min_{i \in I_n} \rho_{i,n} \quad \text{and} \quad \rho = \inf_{n \in \mathbb{N}} \rho_n.$$
The case $\rho = 0$. In the case where $\rho = 0$, we have
• either there exist $n_0$ in $\mathbb{N}$ and $i_0$ in $I_{n_0}$ such that $\rho_{i_0,n_0} = 0$,
• or $\liminf_{n \to \infty} \rho_n = 0$.
In the first case, since $b(\cdot)$ vanishes only at zero, we have $z_{n_0} = p_{i_0}(x_{n_0})$ and $p_{i_0}(x_{n_0}) \in T_{n_0} \subset \bigcap_{i \in I_{n_0}} S_i$. According to (3), $p_{i_0}(x_{n_0}) \in \bigcap_{i \in I} S_i$, and for all $n > n_0$, $x_n = p_{i_0}(x_{n_0})$. Therefore, the sequence $(x_n)_{n \in \mathbb{N}}$ has converged to a solution in a finite number of steps. In the second case, there exists a subsequence $(\rho_{\sigma(n)})_{n \in \mathbb{N}}$ which converges to zero. Fix $i$ in $I$. Since $b(\cdot)$ is nondecreasing,
$$\rho_{\sigma(n)} \ge \min_{j \in I_{\sigma(n)}} b\Big(\big|z_{\sigma(n)} - p_i(x_{\sigma(n)})\big| - \big|p_i(x_{\sigma(n)}) - p_j(x_{\sigma(n)})\big|\Big). \quad (8)$$
According to Lemma 3(iii) and (iv), $(|p_i(x_n) - p_j(x_n)|)_{n \in \mathbb{N}}$ converges to zero for all $j$ in $I$, and $(|z_n - p_i(x_n)|)_{n \in \mathbb{N}}$ converges to a limit $c$ independent of $i$. Hence, for all $\epsilon > 0$ there exists $N$ in $\mathbb{N}$ such that, for all $n \ge N$, $|z_{\sigma(n)} - p_i(x_{\sigma(n)})| \ge c - \epsilon/2$ and $|p_i(x_n) - p_j(x_n)| \le \epsilon/2$ for all $j$ in $I$. Thus, for all $n \ge N$ and for all $j$ in $I$,
$$\big|z_{\sigma(n)} - p_i(x_{\sigma(n)})\big| - \big|p_i(x_{\sigma(n)}) - p_j(x_{\sigma(n)})\big| \ge c - \epsilon.$$
Assume $c > 0$, and take $\epsilon = c/2$. Then, since $b(\cdot)$ is nondecreasing and vanishes only at zero, we deduce from (8) that for all $n \ge N$, $\rho_{\sigma(n)} \ge b(c/2) > 0$, which contradicts $\lim_{n \to \infty} \rho_{\sigma(n)} = 0$. Hence $c = 0$, and $\lim_{n \to \infty} |z_n - p_i(x_n)| = 0$ for all $i$ in $I$. Using Lemma 3(iii), we deduce that $\lim_{n \to \infty} |x_n - z_n| = 0$, and it then results from Lemma 3(v) that $(x_n)_{n \in \mathbb{N}}$ converges strongly to a point in $\bigcap_{i \in I} S_i$.
The case $\rho \neq 0$. In this case, we deduce from (7) that
$$\langle z_n - p_i(x_n) \mid p_i(x_n) - x_n\rangle \ge \rho\,\big|p_i(x_n) - x_n\big|$$
and since $\langle p_i(x_n) - x_n \mid p_i(x_n) - x_n\rangle = |p_i(x_n) - x_n|^2 \ge 0$,
$$\langle z_n - x_n \mid p_i(x_n) - x_n\rangle \ge \rho\,\big|p_i(x_n) - x_n\big|. \quad (9)$$
Therefore, combining the definition of $x_{n+1}$ and (9), we have
$$\lambda_n \sum_{i \in I_n} \alpha_{i,n}\,\big|p_i(x_n) - x_n\big| \le \frac{\lambda_n}{\rho} \sum_{i \in I_n} \alpha_{i,n}\,\langle z_n - x_n \mid p_i(x_n) - x_n\rangle = \frac{1}{\rho}\Big\langle z_n - x_n \;\Big|\; \lambda_n \sum_{i \in I_n} \alpha_{i,n}\big(p_i(x_n) - x_n\big)\Big\rangle = \frac{1}{\rho}\,\langle z_n - x_n \mid x_{n+1} - x_n\rangle.$$
Using Lemma 2(ii) and Lemma 3(ii), we deduce that the series $\sum_{n \in \mathbb{N}} |x_{n+1} - x_n|$ is convergent. Then $(x_n)_{n \in \mathbb{N}}$ is a Cauchy sequence, and is therefore strongly convergent to a point $x^*$ in $H$. Using Lemma 3(iii), we deduce that $(p_i(x_n))_{n \in \mathbb{N}}$ is also strongly convergent to $x^*$ for any $i$ in $I$. Since each set $S_i$ is closed, we immediately conclude that $x^* \in \bigcap_{i \in I} S_i$, which completes the proof.
4. Projections onto Stepwise Generated Uniformly Convex Sets
In this section, we show how the results of Section 3 may be used to define a strongly convergent cyclic projection algorithm in the case where the sets $S_i$ are only expandable in convex sets, but not into uniformly convex sets. To our knowledge, this method is new, even in the convex case. The underlying idea of the algorithm is as follows. First, note that the condition C4 is realistic for the type of nonconvex sets considered in this paper, and may be interpreted as a strong consistency condition. On the other hand, in many cases, a trivial sequence $(a_n)_{n \in \mathbb{N}}$ where $a_n \in T_n$ for all $n$ in $\mathbb{N}$ is already known, as in the applications presented in Section 5. Using this sequence, we define a projection method which converges strongly to a point in $\bigcap_{i \in I} S_i$. For every $n$ in $\mathbb{N}$, $i$ in $I$, and $a_n$ in $T_n$, $B_{i,n}$ will denote the closed ellipsoid with main axis $[a_n, p_i(x_n)]$ and center $\frac{1}{2}\big(a_n + p_i(x_n)\big)$, which is rotationally invariant around this axis, with maximal radius equal to $|a_n - p_i(x_n)|/2$ and minimal radius equal to $\gamma\,|a_n - p_i(x_n)|/2$ for some $\gamma \in (0, 1)$. We denote by $p'_i(x_n)$ the projection of $x_n$ onto $B_{i,n}$; see Figure 1.
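To fix ideas, an ellipsoid of this kind can be realized numerically from $a_n$, $p_i(x_n)$ and $\gamma$: rotate so that the main axis is aligned with the first coordinate, then project onto the resulting axis-aligned ellipsoid by solving the one-dimensional Lagrange equation by bisection. This is only an illustrative sketch in $\mathbb{R}^2$ (the points and $\gamma$ are arbitrary), not the implementation used in the experiments of Section 5:

```python
import numpy as np

def project_ellipsoid(x, a, p, gamma=0.5, iters=200):
    """Projection of x onto the closed ellipsoid with main axis [a, p],
    center (a+p)/2, main radius |a-p|/2 and transverse radii gamma*|a-p|/2."""
    c = (a + p) / 2
    r = np.full(len(x), gamma * np.linalg.norm(p - a) / 2)
    r[0] = np.linalg.norm(p - a) / 2
    # Householder reflection sending the unit main-axis direction onto e1
    u = (p - a) / np.linalg.norm(p - a)
    w = u - np.eye(len(x))[0]
    H = np.eye(len(x)) if np.linalg.norm(w) < 1e-12 else \
        np.eye(len(x)) - 2 * np.outer(w, w) / np.dot(w, w)
    v = H @ (x - c)                       # local (axis-aligned) coordinates
    if np.sum((v / r) ** 2) <= 1:
        return x                          # x already lies in the ellipsoid
    # Solve sum_k (v_k / (1 + mu / r_k^2))^2 / r_k^2 = 1 for mu >= 0 by bisection
    lo, hi = 0.0, 1.0
    while np.sum((v / (1 + hi / r**2) / r) ** 2) > 1:
        hi *= 2
    for _ in range(iters):
        mid = (lo + hi) / 2
        if np.sum((v / (1 + mid / r**2) / r) ** 2) > 1:
            lo = mid
        else:
            hi = mid
    wloc = v / (1 + hi / r**2)
    return c + H @ wloc                   # a Householder matrix is its own inverse

a, p = np.array([0.0, 0.0]), np.array([4.0, 0.0])
q = project_ellipsoid(np.array([2.0, 3.0]), a, p, gamma=0.5)
print(q)  # -> [2. 1.]
```

On this instance the ellipse is centered at $(2,0)$ with radii $(2,1)$, and by symmetry the projection of $(2,3)$ is its topmost point $(2,1)$.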
Definition 5. (Method of Double Projection onto Convex Expandable Sets)
Assume C4 is satisfied. Given a point $x_0$ in $H$, two real numbers $\alpha$ and $\lambda$ in $(0, 1]$, a nonnegative integer $M$, a sequence $(I_n)$ of subsets of $I$ and a sequence $(a_n)_{n \in \mathbb{N}}$ where $a_n \in T_n$ for all $n$ in $\mathbb{N}$, the projection method onto stepwise generated uniformly convex sets is iteratively defined by $(\forall n \in \mathbb{N})$
1. $(\forall i \in I_n)\ p'_i(x_n) = p_{B_{i,n}}(x_n)$
2. If $\exists j \in I_n$ such that $p'_j(x_n) \in \bigcap_{i \in I_n} S_i$ then
   - If $I_n \neq I$, set $I_n = I$ and go to 2
   - If $I_n = I$, set $\lambda_n = 1$, $\alpha_{j,n} = 1$ and go to 3
   Else go to 3
3. $x_{n+1} = x_n + \lambda_n \sum_{i \in I_n} \alpha_{i,n} \big(p'_i(x_n) - x_n\big)$
under the conditions C1, C2, and
C3'. $(\forall n \in \mathbb{N})\ \lambda \le \lambda_n \le \mu_n$ where
$$\mu_n = \begin{cases} \dfrac{\sum_{i \in I_n} \alpha_{i,n}\, |x_n - p'_i(x_n)|^2}{\big|x_n - \sum_{i \in I_n} \alpha_{i,n}\, p'_i(x_n)\big|^2} & \text{if } x_n \notin \bigcap_{i \in I_n} B_{i,n} \\ 1 & \text{if } x_n \in \bigcap_{i \in I_n} B_{i,n}. \end{cases}$$
In the remainder of this section, the sequence $(x_n)_{n \in \mathbb{N}}$ is defined by the algorithm of Definition 5. The main result is the following.
Theorem 2. If the condition C5 is satisfied, and the sequence $(a_n)_{n \in \mathbb{N}}$ introduced in Definition 5 satisfies C6, then the sequence $(x_n)_{n \in \mathbb{N}}$ converges strongly to a point in $\bigcap_{i \in I} S_i$.
Proof. The method introduced in Definition 5 is a general projection algorithm onto specific uniformly convex sets constructed at each iteration. For every $n$ in $\mathbb{N}$ and each $i$ in $I$, $a_n$ and $p'_i(x_n)$ belong to $B_{i,n}$. Then Lemma 2 and Lemma 3, where $p_i(x_n)$ and $(z_n)_{n \in \mathbb{N}}$ are respectively replaced with $p'_i(x_n)$ and $(a_n)_{n \in \mathbb{N}}$, hold true. In the same way, taking $(z_n)_{n \in \mathbb{N}} = (a_n)_{n \in \mathbb{N}}$ in the proof of Theorem 1, we deduce that the sequence $(x_n)_{n \in \mathbb{N}}$ converges strongly to some point $x^*$ in $H$. Nevertheless, since $p_i(x_n)$ is replaced with $p'_i(x_n)$, it is still not clear why $x^* \in \bigcap_{i \in I} S_i$. Let us now address this specific point. Fix $n$ in $\mathbb{N}$, $i$ in $I$, and $a_n$ in $T_n$. According to C5, $a_n$ and $p_i(x_n)$ belong to a convex set $C_{i,j_n} \subset S_i$, and $p_i(x_n)$ is also the projection of $x_n$ onto $C_{i,j_n}$. Hence the Kolmogorov criterion (1) gives $\langle p_i(x_n) - a_n \mid x_n - p_i(x_n)\rangle \ge 0$. Therefore
$$\langle p_i(x_n) - a_n \mid x_n - p'_i(x_n)\rangle + \langle p_i(x_n) - a_n \mid p'_i(x_n) - p_i(x_n)\rangle \ge 0$$
which implies
$$\langle p_i(x_n) - a_n \mid x_n - p'_i(x_n)\rangle \ge \langle p_i(x_n) - a_n \mid p_i(x_n) - p'_i(x_n)\rangle. \quad (10)$$
Moreover, since the segment $[a_n, p_i(x_n)]$ is the main axis of the ellipsoid $B_{i,n}$ and $p'_i(x_n) \in B_{i,n}$, we have $\langle a_n - p_i(x_n) \mid p'_i(x_n) - p_i(x_n)\rangle \ge 0$. Combining this with (10), we get
$$0 \le \langle p_i(x_n) - a_n \mid p_i(x_n) - p'_i(x_n)\rangle \le \langle p_i(x_n) - a_n \mid x_n - p'_i(x_n)\rangle \le \big|p_i(x_n) - a_n\big|\,\big|x_n - p'_i(x_n)\big|.$$
We deduce from Lemma 3(iii), where $p_i(x_n)$ is replaced with $p'_i(x_n)$, that $\lim_{n \to \infty} |p'_i(x_n) - x_n| = 0$.
On the other hand, as $p_i(x_n)$ is the projection of $x_n$ onto $C_{i,j_n}$, and $a_n \in C_{i,j_n}$, we have $|p_i(x_n) - a_n| \le |x_n - a_n|$. Moreover, following the same steps as in the proof of Lemma 3(i) gives that $(|x_n - a_n|)_{n \in \mathbb{N}}$ is bounded, from which we deduce that $(|p_i(x_n) - a_n|)_{n \in \mathbb{N}}$ is bounded as well. Therefore we deduce from the last inequality that
$$\lim_{n \to \infty} \langle a_n - p_i(x_n) \mid p'_i(x_n) - p_i(x_n)\rangle = 0. \quad (11)$$
Let $x = a_n - p_i(x_n)$ and $y = p'_i(x_n) - p_i(x_n)$. Then, we have
$$x - y = a_n - p'_i(x_n)$$
and
$$x - \omega y = \omega\Big(\frac{1}{\omega}\big(a_n + (\omega - 1)\,p_i(x_n)\big) - p'_i(x_n)\Big) \quad (12)$$
for all $\omega \in \mathbb{R}$.
Assume that $p'_i(x_n) \neq p_i(x_n)$. Using Claim 1, we get that
$$\Big\langle a_n - p'_i(x_n) \;\Big|\; \frac{1}{\omega_{i,n}}\big(a_n + (\omega_{i,n} - 1)\,p_i(x_n)\big) - p'_i(x_n)\Big\rangle \le 0$$
as long as we enforce
$$\omega_{i,n} \ge \frac{\langle p'_i(x_n) - a_n \mid p_i(x_n) - a_n\rangle}{\langle p'_i(x_n) - a_n \mid p_i(x_n) - p'_i(x_n)\rangle}. \quad (13)$$
In particular, we can take
$$\omega_{i,n} = \max\left(\frac{\langle p'_i(x_n) - a_n \mid p_i(x_n) - a_n\rangle}{\langle p'_i(x_n) - a_n \mid p_i(x_n) - p'_i(x_n)\rangle},\ \epsilon\right) \quad (14)$$
for any appropriately chosen $\epsilon > 0$. Using Niculescu's Lemma 4 in Appendix B, we obtain that
$$\left(\sqrt{\frac{1}{\omega_{i,n}}} + \sqrt{\omega_{i,n}}\right) \frac{\langle a_n - p_i(x_n) \mid p'_i(x_n) - p_i(x_n)\rangle}{2\,\big|a_n - p_i(x_n)\big|} \ge \big|p'_i(x_n) - p_i(x_n)\big|. \quad (15)$$
Note that this inequality also holds without resorting to Claim 1 if $p'_i(x_n) = p_i(x_n)$. According to Theorem 1, $(x_n)_{n \in \mathbb{N}}$ and $(p'_i(x_n))_{n \in \mathbb{N}}$ are strongly convergent to a point $x^*$ in $H$. We now split the remainder of the argument into the following complementary cases involving an increasing function $\sigma : \mathbb{N} \to \mathbb{N}$ which parametrises possible subsequences.
• If the sequence $(|a_n - p'_i(x_n)|)_{n \in \mathbb{N}}$ converges to zero, then using Claim 2, $(|p'_i(x_n) - p_i(x_n)|)_{n \in \mathbb{N}}$ also converges to zero. Therefore, $(p_i(x_n))_{n \in \mathbb{N}}$ is also strongly convergent to $x^*$ for each $i$ in $I$. Since each set $S_i$ is closed, we again conclude that $x^* \in \bigcap_{i \in I} S_i$.
• If the sequence $(|a_n - p'_i(x_n)|)_{n \in \mathbb{N}}$ does not converge to zero, and the sequence $(\langle p'_i(x_n) - a_n \mid p_i(x_n) - p'_i(x_n)\rangle)_{n \in \mathbb{N}}$ converges to zero, then we must have $|a_{\sigma(n)} - p'_i(x_{\sigma(n)})| > \epsilon$ for some $\epsilon > 0$ and some subsequence indexed by an appropriately chosen increasing function $\sigma$. This implies in particular that
$$\lim_{n \to +\infty} \big|p_i(x_{\sigma(n)}) - p'_i(x_{\sigma(n)})\big|\,\Big|\cos\Big(p'_i(x_{\sigma(n)}) - a_{\sigma(n)},\ p_i(x_{\sigma(n)}) - p'_i(x_{\sigma(n)})\Big)\Big| = 0.$$
As a result, either
$$\lim_{n \to +\infty} \cos\Big(p'_i(x_{\sigma(n)}) - a_{\sigma(n)},\ p_i(x_{\sigma(n)}) - p'_i(x_{\sigma(n)})\Big) = 0 \quad (16)$$
or
$$\lim_{n \to +\infty} \big|p_i(x_{\sigma(n)}) - p'_i(x_{\sigma(n)})\big| = 0.$$
Notice further that (16) holds only if $(|p_i(x_{\sigma(n)}) - p'_i(x_{\sigma(n)})|)_{n \in \mathbb{N}}$ converges to zero. Thus, both cases simplify into the conclusion that $(|p_i(x_{\sigma(n)}) - p'_i(x_{\sigma(n)})|)_{n \in \mathbb{N}}$ converges to zero. Therefore $(p_i(x_{\sigma(n)}))_{n \in \mathbb{N}}$ is also strongly convergent to $x^*$ for each $i$ in $I$. Since each set $S_i$ is closed, we again conclude that $x^* \in \bigcap_{i \in I} S_i$.
• If both sequences $(|a_n - p'_i(x_n)|)_{n \in \mathbb{N}}$ and $(\langle p'_i(x_n) - a_n \mid p_i(x_n) - p'_i(x_n)\rangle)_{n \in \mathbb{N}}$ do not converge to zero, then $|a_{\sigma(n)} - p'_i(x_{\sigma(n)})| > \epsilon$ and $\langle p'_i(x_{\sigma(n)}) - a_{\sigma(n)} \mid p_i(x_{\sigma(n)}) - p'_i(x_{\sigma(n)})\rangle > \epsilon$ for some $\epsilon > 0$ and for some subsequence indexed by an appropriately chosen increasing function $\sigma$. Since $|a_n - p_i(x_n)|$ is the length of the main axis of $B_{i,n}$, and, as such, bounds from above the distance between any two points in $B_{i,n}$, we deduce that $|a_n - p'_i(x_n)| \le |a_n - p_i(x_n)|$. Furthermore, since the sequences $(|a_n - p_i(x_n)|)_{n \in \mathbb{N}}$ and $(|a_n - x_n|)_{n \in \mathbb{N}}$ are bounded, we deduce that
$$|a_n - p_i(x_n)| \le |a_n - x_n| \le B$$
for some $B > 0$. Then using the Cauchy–Schwarz inequality in the numerator in (14), we get
$$\omega_{i,\sigma(n)} \in \Big[\epsilon,\ \frac{B^2}{\epsilon}\Big]$$
for some $\epsilon \in (0, B)$. Since $|a_{\sigma(n)} - p_i(x_{\sigma(n)})| > \epsilon$, we deduce from (11) and (15) that $(|p'_i(x_{\sigma(n)}) - p_i(x_{\sigma(n)})|)_{n \in \mathbb{N}}$ converges to zero. Therefore, $(p_i(x_{\sigma(n)}))_{n \in \mathbb{N}}$ is also strongly convergent to $x^*$ for each $i$ in $I$. Since each set $S_i$ is closed, we again conclude that $x^* \in \bigcap_{i \in I} S_i$.
The previous cases cover all possibilities, and all lead to the conclusion that $x^* \in \bigcap_{i \in I} S_i$. The proof is thus complete.
The following two claims were instrumental in the proof of Theorem 2. We now provide their proofs.
Claim 1. Assume that $p'_i(x_n) \neq p_i(x_n)$. Then
$$\frac{\langle p'_i(x_n) - a_n \mid p_i(x_n) - a_n\rangle}{\langle p'_i(x_n) - a_n \mid p_i(x_n) - p'_i(x_n)\rangle} \quad (17)$$
is a well-defined nonnegative real number. Moreover, we have
$$\Big\langle a_n - p'_i(x_n) \;\Big|\; \frac{1}{\omega_{i,n}}\big(a_n + (\omega_{i,n} - 1)\,p_i(x_n)\big) - p'_i(x_n)\Big\rangle \le 0 \quad (18)$$
for all
$$\omega_{i,n} \ge \frac{\langle p'_i(x_n) - a_n \mid p_i(x_n) - a_n\rangle}{\langle p'_i(x_n) - a_n \mid p_i(x_n) - p'_i(x_n)\rangle}.$$
Proof. We have $p'_i(x_n) \notin [a_n, p_i(x_n)]$. Indeed, assume for contradiction that $p'_i(x_n) \in [a_n, p_i(x_n)]$. This would imply that $p'_i(x_n)$ lies in the interior of $B_{i,n}$, and therefore $x_n = p'_i(x_n)$. However, since, by convexity of $C_{i,j_n}$, $[a_n, p_i(x_n)] \subset C_{i,j_n}$, this would imply that $x_n \in C_{i,j_n}$, and therefore $p'_i(x_n) = x_n = p_i(x_n)$, the sought-after contradiction.
Consider now the 2D plane containing $a_n$, $p_i(x_n)$ and $p'_i(x_n)$. Set $g_n$ to be at the intersection of the line perpendicular at $p'_i(x_n)$ to $[a_n, p'_i(x_n)]$ with the segment $[a_n, p_i(x_n)]$ (see Figure 1). First, we have
$$\langle p'_i(x_n) - a_n \mid p_i(x_n) - a_n\rangle = \cos(\alpha_{i,n})\,\big|p'_i(x_n) - a_n\big|\,\big|p_i(x_n) - a_n\big|$$
and
$$\big|p'_i(x_n) - a_n\big|^2 = \langle g_n - a_n \mid p'_i(x_n) - a_n\rangle = \cos(\alpha_{i,n})\,\big|g_n - a_n\big|\,\big|p'_i(x_n) - a_n\big|$$
from which we obtain
$$|g_n - a_n| = \frac{\big|p'_i(x_n) - a_n\big|^2\,\big|p_i(x_n) - a_n\big|}{\langle p'_i(x_n) - a_n \mid p_i(x_n) - a_n\rangle}. \quad (19)$$
Since $B_{i,n}$ is an ellipsoid with maximal axis $[a_n, p_i(x_n)]$, our next step is to express $g_n$ as the convex combination
$$g_n = \theta_{i,n}\, a_n + (1 - \theta_{i,n})\, p_i(x_n) \quad (20)$$
of $a_n$ and $p_i(x_n)$ for some $\theta_{i,n} \in [0, 1]$. Notice that (20) implies
$$|g_n - a_n| = (1 - \theta_{i,n})\,\big|p_i(x_n) - a_n\big|,$$
which itself gives
$$\theta_{i,n} = 1 - \frac{|g_n - a_n|}{|p_i(x_n) - a_n|}.$$
Using (19), we get
$$\theta_{i,n} = 1 - \frac{\big|p'_i(x_n) - a_n\big|^2}{\langle p'_i(x_n) - a_n \mid p_i(x_n) - a_n\rangle} = \frac{\langle p'_i(x_n) - a_n \mid p_i(x_n) - p'_i(x_n)\rangle}{\langle p'_i(x_n) - a_n \mid p_i(x_n) - a_n\rangle}.$$
Notice further that, by construction, since $[a_n, p_i(x_n)]$ is the main axis of $B_{i,n}$ and since, by assumption, $p'_i(x_n) \neq p_i(x_n)$, we have (see Figure 1)
$$\langle p'_i(x_n) - a_n \mid p_i(x_n) - p'_i(x_n)\rangle > 0. \quad (21)$$
Then, setting
$$d_n = \frac{1}{\omega_{i,n}}\big(a_n + (\omega_{i,n} - 1)\,p_i(x_n)\big),$$
we see that we need to take $\omega_{i,n} \ge 1/\theta_{i,n}$, i.e.,
$$\omega_{i,n} \ge \frac{\langle p'_i(x_n) - a_n \mid p_i(x_n) - a_n\rangle}{\langle p'_i(x_n) - a_n \mid p_i(x_n) - p'_i(x_n)\rangle}$$
(which, owing to (21), is well defined), in order to ensure that
$$\langle a_n - p'_i(x_n) \mid d_n - p'_i(x_n)\rangle \le 0,$$
i.e., (18) holds. Finally, (19) and (21) together ensure that (17) is nonnegative.
Claim 2. We have
$$\big|a_n - p'_i(x_n)\big| \ge \big|p'_i(x_n) - p_i(x_n)\big|.$$
Proof. By definition of $p_i(x_n)$, the fact that $a_n \in S_i$ implies that $|p_i(x_n) - x_n| \le |a_n - x_n|$. Thus $x_n$ belongs to the half space
$$\mathcal{H} = \Big\{x \in H ;\ \Big\langle x - \tfrac{1}{2}\big(a_n + p_i(x_n)\big) \,\Big|\, p_i(x_n) - \tfrac{1}{2}\big(a_n + p_i(x_n)\big)\Big\rangle \ge 0\Big\}$$
which is the set of points in $H$ which are closer to $p_i(x_n)$ than to $a_n$. Clearly, $p'_i(x_n) \in \mathcal{H}$ as well, which completes the proof.
Figure 1. Section of the ellipsoid $B_{i,n}$.
5. Applications
5.1. MRI Image Reconstruction: The Infinite-Dimensional Hilbert Space Setting
The field of inverse problems has been extensively investigated over the last five decades, fueled in particular by the many challenges in medical imaging. Projection methods have played an important role in the development of efficient reconstruction techniques [25], a well studied example being the method of projection onto convex sets (POCS) [2]. In recent years, techniques from penalised estimation have gained increasing popularity since the discovery of the compressed sensing paradigm, allowing fewer measurements to be collected whilst achieving remarkable reconstruction performance.
One penalisation approach which has been a center of focus since its introduction is the Rudin–Osher–Fatemi functional [26], which can be described as follows: the reconstructed image $u$ is the solution of the minimization problem
$$\arg\min_{u \in BV(\Omega)} \|u\|_{TV(\Omega)} + \frac{\lambda}{2} \int_{\Omega} \big(y_{obs}(x) - A[u](x)\big)^2\,dx$$
where $\lambda$ is a positive relaxation parameter and the term $\|u\|_{TV(\Omega)}$ is defined in Appendix A. The term $\int_{\Omega} (y_{obs}(x) - A[u](x))^2\,dx$ is called the fidelity term and the term $\|u\|_{TV(\Omega)}$ is called the regularisation term. This infinite-dimensional minimization problem enforces the solution, which is a priori in $L^2(\Omega)$, to satisfy a finite bounded variation (BV) norm constraint.
One important remark is that the optimisation problem can be turned into a pure feasibility problem by requiring the TV norm and the fidelity term each to be less than or equal to a certain tolerance. Using this approach, one can define an alternating projection procedure as in Algorithm 1 below, where we will use the forward operator $A = F_{partial}$, the partial Fourier transform, which plays a central role in MRI. The usual Fourier transform will be denoted by $F$.
In this example, we will use the projections onto the sets
$$S_1 = \Big\{u \in L^2(\Omega) \;\Big|\; \int_{\Omega} \big(\lambda y_{obs} - A[u](x)\big)^2\,dx \le \lambda \eta_{fid},\ \lambda \in \mathbb{R}_+\Big\},$$
$$S_2 = \Big\{u \in L^2(\Omega) \;\Big|\; \|u\|_{p,TV(\Omega)} \le \eta_{TV}\Big\},$$
$$S_3 = \Big\{u \in L^2(\Omega) \;\Big|\; F_{partial}[u] = \lambda y_{obs},\ \lambda \in \mathbb{R}_+\Big\}.$$
The set $S_2$ is not convex for $p \in (0, 1)$, but, since the epigraph of $\|\cdot\|_{p,TV(\Omega)}$ is a cone, the set $S_2$ is expandable into convex sets, as discussed in [22,23]. Notice that when the scaling factor $\lambda$ is dropped in the definitions of $S_1$ and $S_3$, the problem is equivalent to the standard TV-penalised reconstruction approach. Introducing this scaling factor allows one to recover a solution up to a scaling, which can easily be undone by rescaling the solution using the constraint $F_{partial}[u] = y_{obs}$ as a post-processing step. The introduction of the scaling factor $\lambda$ in these definitions allows the null function to belong to the intersection of the sets $S_1$, $S_2$ and $S_3$. Thus, we will set $a_n = 0$, $n \in \mathbb{N}$, in the sequel. In what follows, $\tilde{y}_{obs}$ will be the function with value equal to zero except for the observed frequencies, which are taken as the observed values.
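As an illustration, dropping the scaling factor $\lambda$, the projection onto a discrete counterpart of $S_3$ is explicit: with the orthonormal DFT (which is unitary), the constraint fixes the observed Fourier coefficients, so projecting amounts to overwriting them with the observed values and inverse transforming. A minimal 1D NumPy sketch (the mask and signals are arbitrary, and the result is complex in general):

```python
import numpy as np

def project_fourier_constraint(u, y_obs, mask):
    """Projection of u onto {v : F[v] = y_obs on the observed frequencies},
    where F is the orthonormal DFT (norm="ortho" keeps it unitary)."""
    U = np.fft.fft(u, norm="ortho")
    U[mask] = y_obs[mask]            # enforce the observed coefficients
    return np.fft.ifft(U, norm="ortho")

rng = np.random.default_rng(2)
n = 64
mask = rng.random(n) < 0.3           # the observed frequencies
y_obs = np.fft.fft(rng.standard_normal(n), norm="ortho")

u = rng.standard_normal(n)
v = project_fourier_constraint(u, y_obs, mask)
# the constraint is satisfied exactly on the observed frequencies
print(np.allclose(np.fft.fft(v, norm="ortho")[mask], y_obs[mask]))  # True
```

Because the transform is unitary, overwriting coordinates in the Fourier domain is indeed the nearest-point map onto this affine set in the $L^2$ sense.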
Our implementation is described in Algorithm 1. Experiments were made after incorporating the projection onto the TV ball into a freely available code provided in [27]. The fidelity tolerance was set to $\eta_{fid} = 10^{-3}$ and the regularisation tolerance $\eta_{TV}$ was tuned using cross validation. The projection onto the TV ball was computed using the method described in [28]. In order to avoid numerical instabilities, the iterates were rescaled every 10 iterations, which does not change the convergence analysis due to the conic invariance of our formulation. We chose very thin ellipsoids $B_{i,n}$ by setting $\gamma = 10^{-6}$. A numerical illustration is given in Figure 2 below. Extensive numerical simulations will be presented in a forthcoming paper devoted to a thorough study of projection methods for MRI and X-ray computed tomography (XCT).
Algorithm 1: Alternating projection method for MRI for p = 1.
Data: $y = F_{partial}[u_0]$ observed data; $\eta_{TV}$ bound on the TV norm; $\eta_{fid}$ bound on the fidelity term; $e$ convergence tolerance;
Compute $\tilde{y}_{obs}$ from $y$ by filling in the unobserved frequencies with zeros;
Set $n = 1$, $u_1 = F^{-1}(\tilde{y}_{obs})$;
while the difference between two successive iterates is larger than $e$ do
  Compute $p'_1(u_n) = P_{B_{1,n}}(u_n)$;
  Compute $p'_2(u_n) = P_{B_{2,n}}(u_n)$;
  Compute $p'_3(u_n) = P_{B_{3,n}}(u_n)$;
  Set $u_{n+1} = u_n + \lambda_n \sum_{i=1}^{3} \alpha_{i,n}\big(p'_i(u_n) - u_n\big)$;
end