
A new method for interpolating in a convex subset of a Hilbert space

Xavier Bay, Laurence Grammont, Hassan Maatouk

To cite this version:

Xavier Bay, Laurence Grammont, Hassan Maatouk. A new method for interpolating in a convex subset of a Hilbert space. Computational Optimization and Applications, Springer Verlag, 2017, 68 (1), pp. 95-120. DOI: 10.1007/s10589-017-9906-9. hal-01136466v2.


A New Method For Interpolating In A Convex Subset Of A Hilbert Space

Xavier Bay · Laurence Grammont · Hassan Maatouk


Abstract In this paper, interpolating a curve or a surface under linear inequality constraints is considered as a general convex optimization problem in a Reproducing Kernel Hilbert Space. We propose a new approximation method based on a discretized optimization problem in a finite-dimensional Hilbert space under the same set of constraints. We prove that the approximate solution converges uniformly to the optimal constrained interpolating function. An algorithm is derived and numerical examples with boundedness and monotonicity constraints in one and two dimensions are given.

Keywords Optimization · RKHS · Interpolation · Inequality constraints

1 Introduction

Let $X$ be a nonempty subset of $\mathbb{R}^d$ ($d \geq 1$) and $E = C^0(X)$ the linear (topological) space of real-valued continuous functions on $X$. Given $n$ distinct points $x^{(1)}, \ldots, x^{(n)} \in X$ and $y_1, \ldots, y_n \in \mathbb{R}$, we define the set $I$ of interpolating functions by
$$I := \left\{ f \in E, \; f\big(x^{(i)}\big) = y_i, \; i = 1, \ldots, n \right\}.$$

X. Bay
Mines Saint-Étienne, UMR CNRS 6158, LIMOS, F-42023 Saint-Étienne, France.
E-mail: bay@emse.fr

L. Grammont
Université de Lyon, Institut Camille Jordan, UMR 5208, 23 rue du Dr Paul Michelon, 42023 Saint-Étienne Cedex 2, France.
E-mail: laurence.grammont@univ-st-etienne.fr

H. Maatouk
Mines Saint-Étienne, UMR CNRS 6158, LIMOS, F-42023 Saint-Étienne, France.
E-mail: hassan.maatouk@mines-stetienne.fr

Let $C$ be a closed convex set of $E$. We consider the following problem
$$\min\left\{ \|h\|_H^2, \; h \in H \cap C \cap I \right\}, \tag{P}$$
where $H$ is a Reproducing Kernel Hilbert Space (RKHS) continuously included in $E$. Notice that $H \cap C \cap I$ is a closed convex subset of $H$ and that the (unique) solution of (P) is the projection in $H$ of the null function onto this convex set (assumed to be nonempty). The reproducing kernel (r.k.) $K$ of $H$ is a continuous symmetric positive definite function:
$$K : (x, y) \in \mathbb{R}^d \times \mathbb{R}^d \longmapsto K(x, y) := \big( K(\cdot, y), K(\cdot, x) \big)_H \in \mathbb{R}.$$

By choosing different kernels, the norm of the corresponding RKHS defines different notions of smoothness, i.e. different regularization criteria for scattered data interpolation.

In terms of the reproducing property (see [3]), the interpolation conditions can be formulated in the Hilbert space $H$ as
$$\forall h \in H \cap I, \quad h\big(x^{(i)}\big) = \big( h, K(\cdot, x^{(i)}) \big)_H = y_i, \quad i = 1, \ldots, n. \tag{1}$$
Very often in practice, the convex set $C$ is an infinite set of linear inequality constraints. The following interpolation problem without such constraints (case $C = E$)

$$\min\left\{ \|h\|_H^2, \; h \in H \cap I \right\} \tag{Q}$$
has been solved so far. It is easy to prove that, if $H \cap I \neq \emptyset$, then (Q) has a unique solution. Let $\mathcal{I}$ be the interpolation operator from $H$ into $\mathbb{R}^n$ defined as
$$\mathcal{I}(h) := \big( h(x^{(1)}), \ldots, h(x^{(n)}) \big).$$
From Equation (1), $\mathcal{I}$ is a bounded linear operator whose range is included in the usual Euclidean space $\mathbb{R}^n$. The kernel $\operatorname{Ker}(\mathcal{I})$ of $\mathcal{I}$ is closed in $H$, so that, for any $y \in \mathbb{R}^n$, $\check{h} = \mathcal{I}^\dagger(y)$ is the unique solution of (Q), where $\mathcal{I}^\dagger$ is the generalized inverse, or Moore-Penrose inverse, of $\mathcal{I}$ (see [21]). If the matrix $\mathbb{K} = \big( K(x^{(i)}, x^{(j)}) \big)_{1 \leq i,j \leq n}$ is invertible, $\check{h} = \mathcal{I}^\dagger(y)$ can be expressed as (see Proposition 2, Sect. 2.1)
$$\check{h}(x) = k(x)^\top \mathbb{K}^{-1} y, \tag{2}$$
where $k(x) = \big( K(x, x^{(1)}), \ldots, K(x, x^{(n)}) \big)^\top$ and $y = (y_1, \ldots, y_n)^\top$.
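To make formula (2) concrete, here is a minimal numerical sketch in Python; the Gaussian kernel, its length-scale, the small jitter added for numerical stability and the toy data are all illustrative choices, not taken from the paper.

```python
import numpy as np

def gaussian_kernel(x, y, theta=0.2):
    """Squared exponential kernel K(x, y) = exp(-(x - y)^2 / (2 theta^2))."""
    return np.exp(-(x[:, None] - y[None, :]) ** 2 / (2.0 * theta ** 2))

def unconstrained_interpolant(x_data, y_data, x_eval, theta=0.2):
    """Evaluate h_check(x) = k(x)^T K^{-1} y (formula (2)) at the points x_eval."""
    K = gaussian_kernel(x_data, x_data, theta)           # n x n Gram matrix of the data
    k = gaussian_kernel(x_eval, x_data, theta)           # cross-kernel matrix k(x)
    coeffs = np.linalg.solve(K + 1e-10 * np.eye(len(x_data)), y_data)  # jitter for stability
    return k @ coeffs

# Example usage on a few scattered points
x_data = np.array([0.1, 0.3, 0.5, 0.8])
y_data = np.array([0.2, 0.9, 0.3, 0.7])
x_eval = np.linspace(0.0, 1.0, 5)
print(unconstrained_interpolant(x_data, y_data, x_eval))
```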

In many applications from science to engineering, there is a priori information on the shape of the solution, such as a boundedness or monotonicity property. The shape constraints restrict the reconstruction to some closed convex subset of the relevant function space. The general approach is based on a minimization principle: the so-called smoothing spline principle (see [2], [24]).

The starting point is a characterization of the solution of the problem (P) as the orthogonal projection onto the convex set $C$ of a finite linear combination (with unknown coefficients) of certain basis functions. The coefficients are defined from the interpolation conditions, which lead to a set of nonlinear equations that can be solved by Newton's method (see [2]).

If $\operatorname{int}(H \cap C) \cap \mathcal{I}^{-1}(\{y\}) \neq \emptyset$ (the interior being taken in $H$), then (P) has a unique solution of the form
$$\hat{h} = P_C\big(\mathcal{I}^*(\alpha)\big) = P_C\left( \sum_{i=1}^{n} \alpha_i K(\cdot, x^{(i)}) \right),$$
where $\mathcal{I}^* : \mathbb{R}^n \to H$ is the adjoint of $\mathcal{I}$, $\alpha = (\alpha_1, \ldots, \alpha_n)^\top$ is a vector in $\mathbb{R}^n$ and $P_C$ is the orthogonal projection onto the convex set $H \cap C$ (see [20], Theorem 3.2, p. 739). Conversely, if for some vector $\alpha$, $\hat{h} = P_C\big(\sum_i \alpha_i K(\cdot, x^{(i)})\big)$ satisfies the condition $\mathcal{I}(\hat{h}) = y$, then $\hat{h}$ is the solution of (P) (see [2], Theorem 2.1, p. 304). Under particular assumptions (see [20]), if $\hat{\alpha}$ is the solution of the dual problem
$$\min\left\{ \tfrac{1}{2} \big\| P_C\big(\mathcal{I}^*(\alpha)\big) \big\|_H^2 - (\alpha, y), \; \alpha \in \mathbb{R}^n \right\},$$
then $\hat{h} = P_C\big(\mathcal{I}^*(\hat{\alpha})\big)$ is the solution of (P). In the general case (see [20], Theorem 3.2), $\hat{\alpha}$ is the solution of the dual problem
$$\min\left\{ \tfrac{1}{2} \big\| \mathcal{I}^*(\alpha) \big\|_H^2 - \tfrac{1}{2} \big\| \mathcal{I}^*(\alpha) - P_C\big(\mathcal{I}^*(\alpha)\big) \big\|_H^2 - (\alpha, y), \; \alpha \in \mathbb{R}^n \right\}.$$
This last problem is not easy to solve. As Andersson and Elfving write in their paper [2], to transform this result into a numerical algorithm it is necessary to compute the orthogonal projection $P_C$, and the difficulty lies in that calculation. Andersson and Elfving [2] investigate the structure of the projection operator $P_C$ for particular constraints defining the convex set $C$. Laurent [17] proposed an algorithm to solve this kind of minimization problem; this algorithm, based on iterations using the Kuhn and Tucker theorem, was applied by Utreras and Varas in [23] for the K-Monotone Thin Plate Spline (K-M.T.P.S.). Moreover, as the authors of [23] note, the computational cost of this dual-type algorithm remains high.

In this paper, we propose a new method to solve (P). We define a discretized optimization problem (P$_N$) in a finite-dimensional space $H_N$ under the same interpolation conditions and inequality constraints:
$$\min\left\{ \|h\|_{H_N}^2, \; h \in H_N \cap C \cap I \right\}. \tag{P$_N$}$$
The main step of the method is the construction of the finite-dimensional Hilbert space $H_N$ in the bigger space $E = C^0(X)$, using a much more flexible set of basis functions in $E$ to incorporate inequality constraints. In a particular framework, we prove that the problems (P) and (P$_N$) have a unique solution and that the solution of the discretized problem (P$_N$) tends to the solution of (P) for the convergence in the space $E$ (uniform convergence).

The article is organized as follows: in Sect. 2, the new method to approximate (P) is described and its convergence property is proved. In order to illustrate the proposed approach, some numerical examples with boundedness and monotonicity constraints in one and two dimensions are given in Sect. 3. The algorithm is applied to classic spline cases with inequality constraints.

2 A new method based on a discretized optimization problem

Consider the infinite-dimensional convex optimization problem
$$\min\{ J(h), \; h \in H \cap C \}, \tag{O}$$
where $J$ is a real-valued criterion defined on a Hilbert space $H$ and $C$ is a closed convex set of $H$.

By analogy with the Finite Element Method (see the Ritz-Galerkin method [11]), a discretized optimization problem is of the form
$$\min\{ J(\rho_N h_N), \; h_N \in H_N \cap C_N \}, \tag{O$_N$}$$
where $H_N = \pi_N(H)$ is a finite-dimensional space, $\pi_N$ is a linear operator from $H$ to $H_N$ (projection or restriction operator), $\rho_N$ is an extension operator from $H_N$ to $H$, and $C_N := \{ h_N \in H_N \text{ s.t. } \rho_N(h_N) \in C \}$ (see [5] and [18]). If $\pi_N(C) \subset C_N$ and under some stability and consistency properties of $\pi_N$ and $\rho_N$, one can expect that $J(u_N) \longrightarrow J(u)$ and $u_N \rightharpoonup u$ weakly in $H$, where $u$ is the solution of (O) and $u_N$ is the solution of (O$_N$).

Our approach is different: we do not discretize the constraint set but we discretize the criterion:
$$\min\{ J_N(h_N), \; h_N \in H_N \cap C \}.$$
Nevertheless, the analysis of this discretized optimization problem involves a triple $(H_N, \pi_N, \rho_N)$, where $\pi_N$ is a linear operator from $E$ to $H_N \subset E$ and $\rho_N$ is an operator from $H_N$ to $H$. In this section, the space $E = C^0(X)$ is the Banach space of continuous functions equipped with the uniform norm $\|\cdot\|_\infty$, where $X$ is a compact subspace of $\mathbb{R}^d$. Let $\hat{h}$ be the solution of (P) and $\hat{h}_N$ the solution of (P$_N$); we will prove that $\hat{h}_N \longrightarrow \hat{h}$ in the space $E$ as $N \to +\infty$.

2.1 The approximating subspaces $H_N$ and the operators $\pi_N$ and $\rho_N$

For simplicity, $X$ is assumed to be the unit interval $[0,1]$. Let $\Delta_N$ be a subdivision of $[0,1]$, assumed to be a graded mesh:
$$\Delta_N : \; 0 = t_{N,0} < t_{N,1} < \ldots < t_{N,N} = 1, \qquad \Delta_N \subset \Delta_{N+1},$$
and $h_N = \max\{ |t_{N,i+1} - t_{N,i}|, \; i = 0, \ldots, N-1 \} \longrightarrow 0$ as $N \to +\infty$. For each $N$, we define the approximating subspace $H_N$ of $E = C^0(X)$ as the subspace of piecewise linear continuous functions associated with $\Delta_N$. The canonical basis of $H_N$ is formed by the so-called hat functions $[\varphi_{N,0}, \ldots, \varphi_{N,N}]$:

for $j = 1, \ldots, N-1$,
$$\varphi_{N,j}(t) := \begin{cases} \dfrac{t - t_{N,j-1}}{t_{N,j} - t_{N,j-1}}, & t \in [t_{N,j-1}, t_{N,j}], \\[2mm] \dfrac{t_{N,j+1} - t}{t_{N,j+1} - t_{N,j}}, & t \in [t_{N,j}, t_{N,j+1}], \\[2mm] 0 & \text{otherwise,} \end{cases} \tag{3}$$
and
$$\varphi_{N,0}(t) := \begin{cases} \dfrac{t_{N,1} - t}{t_{N,1} - t_{N,0}}, & t \in [t_{N,0}, t_{N,1}], \\[2mm] 0 & \text{otherwise,} \end{cases} \qquad \varphi_{N,N}(t) := \begin{cases} \dfrac{t - t_{N,N-1}}{t_{N,N} - t_{N,N-1}}, & t \in [t_{N,N-1}, t_{N,N}], \\[2mm] 0 & \text{otherwise.} \end{cases}$$

Next, we define the linear operators $\pi_N : E \longrightarrow H_N$ and $\rho_N : H_N \longrightarrow H$, and a norm $\|\cdot\|_{H_N}$, such that $\pi_N$ and $\rho_N$ are stable, i.e.
$$\forall h \in H, \quad \|\pi_N(h)\|_{H_N} \leq \|h\|_H,$$
$$\forall h_N \in H_N, \quad \|\rho_N(h_N)\|_H \leq \|h_N\|_{H_N},$$
and $\pi_N$ and $\rho_N$ are consistent, i.e.
$$\forall h \in H, \quad \rho_N \circ \pi_N(h) \longrightarrow h \quad \text{as } N \to +\infty.$$

Proposition 1 Let $\pi_N$ be the linear operator defined from $E$ onto $H_N$ by
$$\forall f \in E, \quad \pi_N(f) = \sum_{j=0}^{N} f(t_{N,j})\, \varphi_{N,j}.$$
Then $\pi_N \circ \pi_N = \pi_N$ and
$$\pi_N(f) \longrightarrow f \quad \text{in } E \text{ as } N \to +\infty.$$

Proof It is a classical result related to the usual Schauder basis of the Banach space $E$ (see [19]).
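As a quick illustration of the hat basis (3) and of the operator $\pi_N$ of Proposition 1, here is a small Python sketch; the uniform mesh and the test function are illustrative choices only.

```python
import numpy as np

def hat_basis(t_knots, x):
    """Evaluate the hat functions phi_{N,j}(x), j = 0..N, at the points x.

    Returns an array of shape (len(x), N+1); row sums equal 1 on [t_0, t_N]."""
    x = np.atleast_1d(x)
    N = len(t_knots) - 1
    phi = np.zeros((len(x), N + 1))
    for j in range(N + 1):
        if j > 0:   # increasing part on [t_{j-1}, t_j]
            left = (x - t_knots[j - 1]) / (t_knots[j] - t_knots[j - 1])
            phi[:, j] = np.where((x >= t_knots[j - 1]) & (x <= t_knots[j]), left, phi[:, j])
        if j < N:   # decreasing part on [t_j, t_{j+1}]
            right = (t_knots[j + 1] - x) / (t_knots[j + 1] - t_knots[j])
            phi[:, j] = np.where((x >= t_knots[j]) & (x <= t_knots[j + 1]), right, phi[:, j])
    return phi

def pi_N(f, t_knots, x):
    """pi_N(f)(x) = sum_j f(t_{N,j}) phi_{N,j}(x): piecewise linear interpolation of f."""
    return hat_basis(t_knots, x) @ f(t_knots)

# Illustration: pi_N(f) converges uniformly to f as the mesh is refined
f = np.sin
x = np.linspace(0.0, 1.0, 1000)
for N in (10, 50, 200):
    t = np.linspace(0.0, 1.0, N + 1)
    print(N, np.max(np.abs(pi_N(f, t, x) - f(x))))
```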

Remark 1 Take $J(h) = \|h\|_H^2$ in (O). If $\|\rho_N h_N\|_H = \|h_N\|_{H_N}$, then (O$_N$) becomes a finite-dimensional problem that is easy to handle. So, it would be nice to construct the operator $\rho_N$ and the norm on $H_N$ satisfying this last equality.

For this, we consider the interpolation operator $\mathcal{I}_N : H \longrightarrow \mathbb{R}^{N+1}$ defined by $\mathcal{I}_N(h) := (h(t_{N,0}), \ldots, h(t_{N,N}))$. By the reproducing property, we also have
$$\mathcal{I}_N(h) = \big( (h, K(\cdot, t_{N,0}))_H, \ldots, (h, K(\cdot, t_{N,N}))_H \big). \tag{4}$$
Hence, $\mathcal{I}_N$ is a bounded operator and, for all $y \in \mathbb{R}^{N+1}$, the following optimization problem
$$\min_{h \in H} \left\{ \|h\|_H^2, \; h(t_{N,j}) = y_j, \; j = 0, \ldots, N \right\}$$
has a unique solution $\tilde{h} = \mathcal{I}_N^\dagger(y)$, where $\mathcal{I}_N^\dagger$ denotes the Moore-Penrose inverse of $\mathcal{I}_N$. Let us define the operator $\rho_N : H_N \longrightarrow H$ as follows:
$$\forall h_N \in H_N, \quad \rho_N(h_N) = \mathcal{I}_N^\dagger(c_{h_N}), \quad \text{where } c_{h_N} := (h_N(t_{N,0}), \ldots, h_N(t_{N,N}))^\top.$$
We assume in the following that the Gram matrix $\Gamma_N := (K(t_{N,i}, t_{N,j}))_{0 \leq i,j \leq N}$ is invertible for all $N$.

Proposition 2 For all $h_N \in H_N$, we have
$$\rho_N(h_N) = k(\cdot)^\top \Gamma_N^{-1} c_{h_N}, \tag{5}$$
where $k(\cdot) = (K(\cdot, t_{N,0}), \ldots, K(\cdot, t_{N,N}))^\top$. Moreover,
$$\|\rho_N(h_N)\|_H^2 = c_{h_N}^\top \Gamma_N^{-1} c_{h_N}. \tag{6}$$

Proof By definition, $\rho_N(h_N) \in \operatorname{Ker}(\mathcal{I}_N)^\perp$. Additionally, from relation (4), we obtain
$$\operatorname{Ker}(\mathcal{I}_N)^\perp = \operatorname{span}\big( K(\cdot, t_{N,0}), \ldots, K(\cdot, t_{N,N}) \big),$$
so we can write, for some coefficients $\alpha_j$:
$$\rho_N(h_N) = \sum_{j=0}^{N} \alpha_j K(\cdot, t_{N,j}). \tag{7}$$
As $\rho_N(h_N)(t_{N,i}) = h_N(t_{N,i})$ for $i = 0, \ldots, N$, we have $\alpha := (\alpha_0, \ldots, \alpha_N)^\top = \Gamma_N^{-1} c_{h_N}$, which leads to Equation (5). Using Equation (7), one gets
$$\|\rho_N(h_N)\|_H^2 = (\rho_N(h_N), \rho_N(h_N))_H = \sum_{i=0}^{N} \sum_{j=0}^{N} \alpha_j \alpha_i \big( K(\cdot, t_{N,i}), K(\cdot, t_{N,j}) \big)_H.$$
Since $(K(\cdot, t_{N,i}), K(\cdot, t_{N,j}))_H = K(t_{N,i}, t_{N,j})$, we obtain
$$\|\rho_N(h_N)\|_H^2 = \sum_{i=0}^{N} \sum_{j=0}^{N} \alpha_j \alpha_i K(t_{N,i}, t_{N,j}) = \alpha^\top \Gamma_N \alpha,$$
with $\alpha = \Gamma_N^{-1} c_{h_N}$, which completes the proof of the proposition.

In view of Proposition 2, let us construct an inner product in $H_N$ so that $\|\rho_N(h_N)\|_H = \|h_N\|_{H_N}$.

Theorem 1 Define the scalar product in $H_N$ by
$$(f, g)_{H_N} := c_f^\top \Gamma_N^{-1} c_g, \tag{8}$$
with $c_f := (f(t_{N,0}), \ldots, f(t_{N,N}))^\top$ and $c_g := (g(t_{N,0}), \ldots, g(t_{N,N}))^\top$. Then the space $H_N$ is a RKHS with r.k. $K_N$ given by
$$\forall x', x \in [0,1], \quad K_N(x', x) = \sum_{i,j=0}^{N} K(t_{N,i}, t_{N,j})\, \varphi_{N,j}(x)\, \varphi_{N,i}(x').$$

Proof Clearly, $H_N$ is a finite-dimensional Hilbert space. Let $x$ be in $[0,1]$. We have
$$K_N(\cdot, x) = \sum_{j=0}^{N} \lambda_{j,x}\, \varphi_{N,j} \in H_N, \tag{9}$$
where $\lambda_{j,x} = \sum_{k=0}^{N} K(t_{N,j}, t_{N,k})\, \varphi_{N,k}(x) = (\Gamma_N \varphi(x))_j$, with $\varphi(x) := (\varphi_{N,0}(x), \ldots, \varphi_{N,N}(x))^\top$. Let $h := \sum_{i=0}^{N} \alpha_i \varphi_{N,i} \in H_N$. Using Equation (8), we obtain
$$(h, K_N(\cdot, x))_{H_N} = \alpha^\top \Gamma_N^{-1} (\Gamma_N \varphi(x)) = \alpha^\top \varphi(x) = h(x),$$
which is the reproducing property in $H_N$.
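To see Theorem 1 at work numerically, the following self-contained sketch builds $\Gamma_N$, the scalar product (8) and the kernel $K_N$, and checks the reproducing property at one point; the Gaussian kernel, its length-scale and the mesh are illustrative choices.

```python
import numpy as np

# Illustrative choices: Gaussian kernel with theta = 0.1 and a uniform mesh with N = 10
theta = 0.1
t = np.linspace(0.0, 1.0, 11)                          # mesh t_{N,0}, ..., t_{N,N}
kern = lambda a, b: np.exp(-(a[:, None] - b[None, :])**2 / (2 * theta**2))
Gamma = kern(t, t)                                     # Gram matrix Gamma_N = (K(t_i, t_j))

def phi(x):
    """Vector (phi_{N,0}(x), ..., phi_{N,N}(x)); each hat function interpolates a unit vector."""
    return np.array([np.interp(x, t, row) for row in np.eye(len(t))])

def inner_HN(cf, cg):
    """Scalar product (8): (f, g)_{H_N} = c_f^T Gamma_N^{-1} c_g."""
    return cf @ np.linalg.solve(Gamma, cg)

def K_N(x_prime, x):
    """Kernel K_N(x', x) = sum_{i,j} K(t_i, t_j) phi_j(x) phi_i(x') of Theorem 1."""
    return phi(x_prime) @ Gamma @ phi(x)

# Reproducing property check: (h, K_N(., x0))_{H_N} = h(x0) for h = sum_j alpha_j phi_j
alpha = np.sin(3 * t)                                  # nodal values c_h of some h in H_N
x0 = 0.37
print(alpha @ phi(x0))                                 # h(x0)
print(inner_HN(alpha, Gamma @ phi(x0)))                # (h, K_N(., x0))_{H_N}, equal up to rounding
```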

Proposition 3 The operator $\rho_N$ is stable. Indeed, $\rho_N$ is an isometry from $H_N$ into $H$, i.e.
$$\forall h_N \in H_N, \quad \|\rho_N(h_N)\|_H^2 = \|h_N\|_{H_N}^2.$$
Furthermore,
$$\forall x \in X, \quad \rho_N(K_N(\cdot, x)) = \sum_{j=0}^{N} \varphi_{N,j}(x)\, K(\cdot, t_{N,j}). \tag{10}$$

Proof Let $h_N$ be in $H_N$; then $h_N(x) = c_{h_N}^\top \varphi(x)$. According to the definition of the inner product in $H_N$, we have
$$\|h_N\|_{H_N}^2 = (h_N, h_N)_{H_N} = c_{h_N}^\top \Gamma_N^{-1} c_{h_N}.$$
Using (6), we obtain $\|\rho_N(h_N)\|_H^2 = \|h_N\|_{H_N}^2$. Since $c_{K_N(\cdot, x)} = \Gamma_N \varphi(x)$ (see Equation (9)), we deduce relation (10) from Proposition 2 and Equation (5).

Proposition 4 For all $f$ in $E$,
$$\|\pi_N(f)\|_{H_N}^2 = c_f^\top \Gamma_N^{-1} c_f,$$
with $c_f = (f(t_{N,0}), \ldots, f(t_{N,N}))^\top$. Moreover, $\pi_N$ is stable, i.e.
$$\forall h \in H, \quad \|\pi_N(h)\|_{H_N} \leq \|h\|_H.$$

Proof The first part is a direct consequence of Theorem 1. Now, consider the orthogonal decomposition in $H$: $H = H_0^N \oplus H_1^N$, with
$$H_0^N = \{ h \in H : \; h(t_{N,j}) = 0, \; j = 0, \ldots, N \}, \qquad H_1^N = \operatorname{span}\{ K(\cdot, t_{N,j}), \; j = 0, \ldots, N \}.$$
For all $h \in H$, there exist a unique $h_0 \in H_0^N$ and a unique $h_1 \in H_1^N$ such that $h = h_0 + h_1$. Thus,
$$\|h_1\|_H^2 \leq \|h\|_H^2.$$
Additionally, every $h_1 \in H_1^N$ can be expressed as $h_1(\cdot) = \sum_{j=0}^{N} \alpha_j K(\cdot, t_{N,j})$. From the reproducing property $(K(\cdot, t_{N,j}), K(\cdot, t_{N,i}))_H = K(t_{N,i}, t_{N,j})$, we get
$$\|h_1\|_H^2 = (h_1, h_1)_H = \sum_{i,j=0}^{N} \alpha_i \alpha_j K(t_{N,i}, t_{N,j}) = \alpha^\top \Gamma_N \alpha.$$
As $h_1(t_{N,i}) = \sum_{j=0}^{N} \alpha_j K(t_{N,i}, t_{N,j})$ for $i = 0, \ldots, N$, we have $\alpha = \Gamma_N^{-1} c_{h_1}$ and
$$\|h_1\|_H^2 = c_{h_1}^\top \Gamma_N^{-1} \Gamma_N \Gamma_N^{-1} c_{h_1} = c_{h_1}^\top \Gamma_N^{-1} c_{h_1}.$$
Since $h_0 \in H_0^N$, $c_{h_1} = c_h$ and $\|h_1\|_H^2 = c_h^\top \Gamma_N^{-1} c_h = \|\pi_N(h)\|_{H_N}^2$, which completes the proof of the proposition.

Proposition 5 Let $Q_N$ be the orthogonal projection from $H$ onto $(H_0^N)^\perp = H_1^N$. For all $h \in H$, we have
$$\rho_N \circ \pi_N(h) = Q_N(h).$$
Moreover, $(H_N, \pi_N, \rho_N)$ is consistent, i.e.
$$\rho_N(\pi_N(h)) \longrightarrow h \quad \text{in } H \text{ as } N \to +\infty.$$

Proof According to the proof of Proposition 4, we have $Q_N(h) = k(\cdot)^\top \Gamma_N^{-1} c_h$. On the other hand, we know that $\rho_N(\pi_N(h)) = k(\cdot)^\top \Gamma_N^{-1} c_h$ from Proposition 2. Hence, $\rho_N \circ \pi_N$ is the orthogonal projection from $H$ onto $H_1^N$. To complete the proof of the proposition, it is sufficient to show that the subspace $\cup_N H_1^N$ is dense in $H$. Let $h$ be in $\big( \cup_N H_1^N \big)^\perp$. By the reproducing property, we have $h(t_{N,i}) = 0$ for all $N \in \mathbb{N}$ and $i = 0, \ldots, N$. By continuity, $h = 0$ and $\big( \cup_N H_1^N \big)^\perp = \{0\}$.

2.2 Existence and uniqueness of the solution of (P$_N$)

We recall the problems (P) and (P$_N$), respectively:
$$\min\left\{ \|h\|_H^2, \; h \in H \cap C \cap I \right\} \quad \text{and} \quad \min\left\{ \|h\|_{H_N}^2, \; h \in H_N \cap C \cap I \right\}.$$

Assumptions.
(H1) $\operatorname{int}(H \cap C) \cap I \neq \emptyset$;
(H2) $\forall N, \; \pi_N(C) \subset C$.

By the first hypothesis, the closed convex set $H \cap C \cap I$ is nonempty, so the initial problem (P) admits a unique solution $\hat{h}$.

Now, if $g \in \operatorname{int}(H \cap C) \cap I$, we will construct a sequence $g_N \in H$ such that $\lim_{N \to +\infty} g_N = g$ in $H$ and $\pi_N(g_N) \in I$. Using $g \in \operatorname{int}(H \cap C)$ and (H2), this result will prove that $\pi_N(g_N)$ is in $H_N \cap C \cap I$ for $N$ large enough. Thus, the problem (P$_N$) also admits a unique solution.

Let us now construct the sequence $(g_N)_N$ associated to $g \in \operatorname{int}(H \cap C) \cap I$. If $x^{(k)}$ is a data point, let $[a_{N,k}, b_{N,k}]$ be the smallest interval of the mesh $\Delta_N$ containing $x^{(k)}$; then we can write $x^{(k)} = \lambda_{N,k} a_{N,k} + (1 - \lambda_{N,k}) b_{N,k}$, where $\lambda_{N,k} \in [0,1]$. Now we define the set
$$F_N := \{ h \in H : \; \lambda_{N,k} h(a_{N,k}) + (1 - \lambda_{N,k}) h(b_{N,k}) = y_k, \; k = 1, \ldots, n \}.$$
We consider the following optimization problem:
$$\min_{h \in F_N} \|h - g\|_H^2. \tag{R$_N$}$$
According to the classical projection theorem, the problem (R$_N$) has a unique solution denoted by $g_N$.

Figure 1 shows the projection $\pi_N(g_N)$ (black dashed line) of the solution of the problem (R$_N$). Notice that the function $\pi_N(g)$ (red line) does not respect the interpolation condition.

Fig. 1: The function $\pi_N(g_N)$ in the neighborhood of the data point $x^{(k)}$.

Lemma 1 If $g \in H \cap I$, then $g_N \longrightarrow g$ in $H$ as $N \to +\infty$.

Proof We define the spaces $G_0^N$ and $G_1^N$ respectively as
$$G_0^N := \{ h \in H : \; \lambda_{N,k} h(a_{N,k}) + (1 - \lambda_{N,k}) h(b_{N,k}) = 0, \; k = 1, \ldots, n \},$$
$$G_1^N := \operatorname{span}\{ \lambda_{N,k} K(\cdot, a_{N,k}) + (1 - \lambda_{N,k}) K(\cdot, b_{N,k}), \; k = 1, \ldots, n \}.$$
For arbitrary $f$ in $F_N$ ($\neq \emptyset$), we have $F_N = f + G_0^N$ and $g_N = f + P_{G_0^N}(g - f)$, where $P_{G_0^N}$ is the orthogonal projection onto $G_0^N$. Therefore, $g - g_N = g - f - P_{G_0^N}(g - f) \in (G_0^N)^\perp = G_1^N$. Then, there exists $(\beta_1^N, \ldots, \beta_n^N)^\top$ in $\mathbb{R}^n$ such that
$$g - g_N = \sum_{k=1}^{n} \beta_k^N u_k^N, \tag{11}$$
where $u_k^N := \lambda_{N,k} K(\cdot, a_{N,k}) + (1 - \lambda_{N,k}) K(\cdot, b_{N,k})$. The vector $\beta^N = (\beta_1^N, \ldots, \beta_n^N)^\top$ can be seen as the solution of the following linear system
$$A^N \beta^N = b^N, \tag{12}$$
where $A_{k,l}^N := \big( u_k^N, u_l^N \big)_H$ and $b^N := (b_1^N, \ldots, b_n^N)^\top$, with $b_k^N := \lambda_{N,k} g(a_{N,k}) + (1 - \lambda_{N,k}) g(b_{N,k}) - y_k$. Now, each dot product
$$\big( u_k^N, u_l^N \big)_H = \big( \lambda_{N,k} K(\cdot, a_{N,k}) + (1 - \lambda_{N,k}) K(\cdot, b_{N,k}), \; \lambda_{N,l} K(\cdot, a_{N,l}) + (1 - \lambda_{N,l}) K(\cdot, b_{N,l}) \big)_H$$
converges to $\big( K(\cdot, x^{(k)}), K(\cdot, x^{(l)}) \big)_H = K(x^{(k)}, x^{(l)})$ by the continuity of $K(\cdot, \cdot)$. On the other hand, the right-hand side vector in (12) converges to zero by continuity of the function $g$. Since the matrix $\big( K(x^{(k)}, x^{(l)}) \big)_{1 \leq k,l \leq n}$ is invertible, Lemma 1 is deduced from relations (11) and (12).

Theorem 2 Under the assumptions (H1) and (H2), the discretized optimization problem (P$_N$) has a unique solution $\hat{h}_N$ (for $N$ large enough).

Proof $H_N \cap C \cap I$ is a nonempty closed convex subset of $H_N$ for $N$ large enough.

2.3 Convergence analysis

In this section, we state some technical lemmas and prove the convergence of the proposed method.

Lemma 2 Let $h_1 \in H \cap C \cap I$ and $h_0 \in \operatorname{int}(H \cap C) \cap I$. Define $h_t := (1-t) h_0 + t h_1 \in H$, $t \in [0,1]$. Then

– $h_t$ converges to $h_1$ when $t$ tends to $1$;
– $\forall t < 1$, $h_t \in \operatorname{int}(H \cap C) \cap I$.

Proof The first property is straightforward. Now, since $h_0 \in \operatorname{int}(H \cap C)$, there exists $r > 0$ such that the open ball $B(h_0, r)$ is contained in $H \cap C$. For $t \in [0,1]$, we define $\phi_t$ as:
$$\phi_t : \; h \in H \longmapsto (1-t) h + t h_1.$$
We have $\phi_t(H \cap C) \subset H \cap C$, so that $\phi_t(B(h_0, r)) = B(h_t, (1-t) r) \subset H \cap C$. Thus $h_t \in \operatorname{int}(H \cap C)$ if $t < 1$. The proof of the lemma is done since $h_0 \in I$, $h_1 \in I$ and $I$ is convex.

Lemma 3 Let $\epsilon > 0$ be arbitrarily small. There exists $g \in \operatorname{int}(H \cap C) \cap I$ such that $\|g\|_H \leq \|\hat{h}\|_H + \epsilon$, where $\hat{h}$ is the solution of the problem (P).

Proof By assumption (H1), there exists $g \in \operatorname{int}(H \cap C) \cap I$. Using Lemma 2 with $h_0 = g$ and $h_1 = \hat{h} \in H \cap C \cap I$, we choose $t$ such that $h_t \in \operatorname{int}(H \cap C) \cap I$ and $\|h_t - \hat{h}\|_H \leq \epsilon$, which implies that $\|h_t\|_H \leq \|\hat{h}\|_H + \epsilon$; the function $h_t$ then has the required properties.

Lemma 4 Let $\epsilon > 0$. For $N$ large enough, we have $\|\hat{h}_N\|_{H_N} \leq \|\hat{h}\|_H + 2\epsilon$.

Proof Let $g \in \operatorname{int}(H \cap C) \cap I$ be such that $\|g\|_H \leq \|\hat{h}\|_H + \epsilon$ (see Lemma 3). Let $g_N$ be the solution of (R$_N$) associated to $g$. By Lemma 1, we have, for $N$ large enough,
$$\|g_N\|_H \leq \|g\|_H + \epsilon \leq \|\hat{h}\|_H + 2\epsilon.$$
Since $\pi_N(g_N) \in H_N \cap C \cap I$ and $\pi_N$ is stable, we have $\|\hat{h}_N\|_{H_N} \leq \|\pi_N(g_N)\|_{H_N} \leq \|g_N\|_H$, which completes the proof of the lemma.

Lemma 5 For all $x$ in $X$, $\rho_N(K_N(\cdot, x)) \longrightarrow K(\cdot, x)$ in $H$ as $N \to +\infty$. Furthermore,
$$\sup_{x \in X} \|\rho_N(K_N(\cdot, x)) - K(\cdot, x)\|_H \longrightarrow 0 \quad \text{as } N \to +\infty.$$

Proof From Proposition 3, we have
$$\begin{aligned}
\|\rho_N(K_N(\cdot, x)) - K(\cdot, x)\|_H^2 &= \|\rho_N(K_N(\cdot, x))\|_H^2 + \|K(\cdot, x)\|_H^2 - 2\,\big( \rho_N(K_N(\cdot, x)), K(\cdot, x) \big)_H \\
&= \|K_N(\cdot, x)\|_{H_N}^2 + \|K(\cdot, x)\|_H^2 - 2 \sum_{j=0}^{N} \varphi_{N,j}(x)\, K(x, t_{N,j}) \\
&= K_N(x, x) + K(x, x) - 2 \sum_{j=0}^{N} \varphi_{N,j}(x)\, K(x, t_{N,j}).
\end{aligned}$$
By uniform continuity of $K(\cdot, \cdot)$ on the compact set $X \times X$, we deduce that both $K_N(x, x) = \sum_{i,j=0}^{N} K(t_{N,i}, t_{N,j})\, \varphi_{N,i}(x)\, \varphi_{N,j}(x)$ and $\sum_{j=0}^{N} \varphi_{N,j}(x)\, K(x, t_{N,j})$ converge uniformly to the function $K(x, x)$, which completes the proof of the lemma.

Proposition 6 Let $\hat{h}_N$ and $\hat{h}$ be respectively the solutions of (P$_N$) and (P). Then

1. $\|\hat{h}_N\|_{H_N}^2 \longrightarrow \|\hat{h}\|_H^2$ as $N \to +\infty$;
2. $\rho_N(\hat{h}_N) \longrightarrow \hat{h}$ in $H$ as $N \to +\infty$.

Proof From Lemma 4, we have
$$\limsup_{N \to +\infty} \|\hat{h}_N\|_{H_N} \leq \|\hat{h}\|_H.$$
Additionally, by Proposition 3, $\|\rho_N(\hat{h}_N)\|_H = \|\hat{h}_N\|_{H_N}$, therefore $\limsup_{N \to +\infty} \|\rho_N(\hat{h}_N)\|_H \leq \|\hat{h}\|_H < +\infty$. By weak compactness in Hilbert spaces, there exists a subsequence $\big(\rho_{N_k}(\hat{h}_{N_k})\big)_{k \in \mathbb{N}}$ which is weakly convergent:
$$\rho_{N_k}(\hat{h}_{N_k}) \rightharpoonup l \quad \text{in } H \text{ as } k \to +\infty. \tag{13}$$
Let us prove that the limit function $l$ is in $C \cap I$.

– For all $k \in \mathbb{N}$ and $i = 1, \ldots, n$,
$$y_i = \hat{h}_{N_k}\big(x^{(i)}\big) = \big( \hat{h}_{N_k}, K_{N_k}(\cdot, x^{(i)}) \big)_{H_{N_k}} = \big( \rho_{N_k}(\hat{h}_{N_k}), \rho_{N_k}(K_{N_k}(\cdot, x^{(i)})) \big)_H.$$
Since $\rho_{N_k}(K_{N_k}(\cdot, x^{(i)})) \longrightarrow K(\cdot, x^{(i)})$ strongly in $H$ (see Lemma 5) and using (13), we have
$$\big( \rho_{N_k}(\hat{h}_{N_k}), \rho_{N_k}(K_{N_k}(\cdot, x^{(i)})) \big)_H \longrightarrow \big( l, K(\cdot, x^{(i)}) \big)_H = l\big(x^{(i)}\big),$$
which implies that $y_i = l(x^{(i)})$. Hence $l \in I$.

– Fix $N \geq 1$. We have $\pi_N(\hat{h}_{N_k}) \longrightarrow \pi_N(l)$ in the finite-dimensional space $H_N$, because $\hat{h}_{N_k}(t_{N,j}) \longrightarrow l(t_{N,j})$ by the same argument as the one showing $l \in I$. Since $\hat{h}_{N_k} \in C$, $\pi_N(C) \subset C$ and $C \cap H_N$ is closed in $H_N$, we have $\pi_N(l) \in C$. As $\pi_N(l)$ converges to $l$ in $E$ (see Proposition 1) and $C$ is closed for the topology of $E$, one gets $l \in C$.

Since $l \in H \cap C \cap I$, we have $\|\hat{h}\|_H^2 \leq \|l\|_H^2$. As $\rho_{N_k}(\hat{h}_{N_k}) \rightharpoonup l$ in $H$, we know that $\|l\|_H \leq \liminf_{k \to +\infty} \|\rho_{N_k}(\hat{h}_{N_k})\|_H$. So,
$$\limsup_{k \to +\infty} \|\rho_{N_k}(\hat{h}_{N_k})\|_H \leq \|\hat{h}\|_H \leq \|l\|_H \leq \liminf_{k \to +\infty} \|\rho_{N_k}(\hat{h}_{N_k})\|_H. \tag{14}$$
Hence $\|l\|_H = \|\hat{h}\|_H$ and $l = \hat{h}$ by unicity of the solution of the problem (P). From the inequalities (14), we deduce that $\|\rho_{N_k}(\hat{h}_{N_k})\|_H \longrightarrow \|\hat{h}\|_H$. But weak convergence (see (13)) together with convergence of the sequence of norms implies strong convergence. Hence the subsequence $\rho_{N_k}(\hat{h}_{N_k})$ converges strongly to $\hat{h}$, and so does the whole sequence $(\rho_N(\hat{h}_N))_N$.

The first part of Proposition 6 is a crucial step in the convergence analysis of the sequence of minimizers $(\hat{h}_N)_N$. The following theorem summarizes the main results of this paper.

Theorem 3 Under assumptions (H1) and (H2), the discretized optimization problem (P$_N$) has a unique solution $\hat{h}_N \in H_N \subset E = C^0(X)$ (for $N$ large enough). Let $\hat{h}$ be the unique solution in the RKHS $H \subset E$ of the constrained interpolation problem (P). Then we have
$$\hat{h}_N \longrightarrow \hat{h} \quad \text{in } E \text{ as } N \to +\infty,$$
and
$$\|\hat{h}_N\|_{H_N}^2 \longrightarrow \|\hat{h}\|_H^2, \qquad \rho_N(\hat{h}_N) \longrightarrow \hat{h} \quad \text{in } H.$$

Proof Let $\hat{h}$ and $\hat{h}_N$ be the solutions of (P) and (P$_N$) respectively. Then
$$\begin{aligned}
\hat{h}_N(x) - \hat{h}(x) &= \big( \hat{h}_N, K_N(\cdot, x) \big)_{H_N} - \big( \hat{h}, K(\cdot, x) \big)_H \\
&= \big( \rho_N(\hat{h}_N), \rho_N(K_N(\cdot, x)) \big)_H - \big( \hat{h}, K(\cdot, x) \big)_H \\
&= \big( \rho_N(\hat{h}_N), \rho_N(K_N(\cdot, x)) - K(\cdot, x) \big)_H + \big( \rho_N(\hat{h}_N) - \hat{h}, K(\cdot, x) \big)_H.
\end{aligned}$$
The proof of the theorem is done by applying Proposition 6 and Lemma 5 to the following inequality
$$\sup_{x \in X} \big|\hat{h}_N(x) - \hat{h}(x)\big| \leq \|\rho_N(\hat{h}_N)\|_H \, \sup_{x \in X} \|\rho_N(K_N(\cdot, x)) - K(\cdot, x)\|_H + \|\rho_N(\hat{h}_N) - \hat{h}\|_H \, \sup_{x \in X} \|K(\cdot, x)\|_H,$$
since $\sup_{x \in X} \|K(\cdot, x)\|_H = \sup_{x \in X} \sqrt{K(x, x)} < +\infty$ ($K$ is a continuous function on $X^2$).

2.4 Implementation of (P$_N$)

The aim of this section is to show that the discretized optimization problem (P$_N$) is equivalent to a Quadratic Program (QP). To do this, we define the application $\psi$ from $\mathbb{R}^{N+1}$ to $H_N$ as follows:
$$\psi : \; \alpha \in \mathbb{R}^{N+1} \longmapsto \psi(\alpha) = \sum_{j=0}^{N} \alpha_{N,j}\, \varphi_{N,j}(\cdot) \in H_N,$$
where the $\varphi_{N,j}$, $j = 0, \ldots, N$, are defined in (3). Define a new scalar product on $\mathbb{R}^{N+1}$ as
$$(\alpha, \beta)_{\mathbb{R}^{N+1}} := \alpha^\top \Gamma_N^{-1} \beta.$$
The application $\psi$ is then a norm-preserving isomorphism: for all $f \in H_N$ such that $f(x) = \sum_{j=0}^{N} \alpha_{N,j}\, \varphi_{N,j}(x)$, we have
$$\|f\|_{H_N}^2 = \alpha^\top \Gamma_N^{-1} \alpha = \|\alpha\|_{\mathbb{R}^{N+1}}^2.$$
Using the isomorphism $\psi$, we define the closed convex subset $\tilde{C} := \psi^{-1}(C)$ of $\mathbb{R}^{N+1}$ and the affine subspace $\tilde{I} := \psi^{-1}(I)$ of $\mathbb{R}^{N+1}$:
$$\tilde{I} = \left\{ \alpha \in \mathbb{R}^{N+1} \text{ such that } \sum_{j=0}^{N} \alpha_{N,j}\, \varphi_{N,j}\big(x^{(i)}\big) = y_i, \; i = 1, \ldots, n \right\}.$$
Consider now the QP problem
$$\arg\min_{\alpha \in \mathbb{R}^{N+1}, \; \alpha \in \tilde{I} \cap \tilde{C}} \; \alpha^\top \Gamma_N^{-1} \alpha, \tag{$\tilde{\mathrm{P}}_N$}$$
where $\tilde{I}$ and $\tilde{C}$ represent respectively the interpolation conditions and the inequality constraints in the Euclidean space $\mathbb{R}^{N+1}$. The numerical solution of ($\tilde{P}_N$) is a classical problem in the optimization of positive quadratic forms, see e.g. [4] and [12].

Proposition 7 The solution of the discretized optimization problem (P$_N$) is
$$\hat{h}_N = \sum_{j=0}^{N} (\alpha_{\mathrm{opt}})_j\, \varphi_{N,j},$$
where $\alpha_{\mathrm{opt}} \in \mathbb{R}^{N+1}$ is the unique solution of problem ($\tilde{P}_N$).
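As a hedged illustration of Proposition 7, the sketch below solves ($\tilde{P}_N$) with a general-purpose solver (SLSQP from scipy); the helper name, the starting point and the encoding of $\tilde{C}$ as linear inequalities $G\alpha \leq h$ are our own choices, and a dedicated QP solver would typically be used instead.

```python
import numpy as np
from scipy.optimize import minimize

def solve_PN_tilde(Gamma, A_eq, y, G=None, h=None):
    """Minimize alpha^T Gamma_N^{-1} alpha over alpha in I-tilde and C-tilde.

    Gamma : (N+1, N+1) Gram matrix Gamma_N of the kernel at the knots,
    A_eq  : (n, N+1) matrix with rows (phi_{N,0}(x^(i)), ..., phi_{N,N}(x^(i))),
    y     : interpolation values, so that I-tilde = {alpha : A_eq alpha = y},
    G, h  : optional inequalities G alpha <= h encoding the convex set C-tilde."""
    Gamma_inv = np.linalg.inv(Gamma)
    cons = [{"type": "eq", "fun": lambda a: A_eq @ a - y,
             "jac": lambda a: A_eq}]
    if G is not None:
        cons.append({"type": "ineq", "fun": lambda a: h - G @ a,
                     "jac": lambda a: -G})
    alpha0 = np.linalg.lstsq(A_eq, y, rcond=None)[0]    # minimum-norm point satisfying A_eq alpha = y
    res = minimize(lambda a: a @ Gamma_inv @ a, alpha0,
                   jac=lambda a: 2.0 * Gamma_inv @ a,
                   constraints=cons, method="SLSQP")
    return res.x
```

In practice, one may prefer to factor $\Gamma_N$ once (e.g. by a Cholesky decomposition) rather than forming its inverse explicitly; the sketch keeps the inverse only to mirror the objective of ($\tilde{P}_N$).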

3 Numerical Illustration

This section is devoted to numerical examples to illustrate the approximation method in various situations.


3.1 Boundedness constraints

Let us recall that $E = C^0([0,1])$. Let $H$ be the RKHS whose reproducing kernel is the commonly used squared exponential (or Gaussian) kernel
$$K(x, y) = \exp\left( -\frac{(x - y)^2}{2\theta^2} \right),$$
where the parameter $\theta$ defines a characteristic length-scale [22]. The norm on the induced RKHS defines a strong smoothness criterion for data interpolation. The set $C$ of inequality constraints is of the form
$$C = \{ f \in E : \; -\infty \leq a \leq f(x) \leq b \leq +\infty, \; x \in [0,1] \},$$
where the lower and upper bounds $a$ and $b$ are assumed to be known. Notice that $C$ is a closed convex set of $E$.

In the following proposition, we give a characterization of the functions belonging to both $H_N$ and $C$. This characterization is easy to use in practice.

Proposition 8 Let $h_N \in H_N$. Then $h_N := \sum_{j=0}^{N} \alpha_{N,j}\, \varphi_{N,j} \in C$ if and only if the coefficients $\alpha_{N,j} \in [a, b]$, $j = 0, \ldots, N$.

Proof Observe that $\forall x \in [0,1]$, $\sum_{j=0}^{N} \varphi_{N,j}(x) = 1$. Now, if the coefficients $\alpha_{N,j}$ lie in the interval $[a, b]$, then $h_N$ is in $C$, since $h_N(x)$ is a convex combination of values in $[a, b]$ at every $x$. Conversely, if $h_N \in C$, then
$$h_N(t_{N,i}) = \sum_{j=0}^{N} \alpha_{N,j}\, \varphi_{N,j}(t_{N,i}) = \sum_{j=0}^{N} \alpha_{N,j}\, \delta_{i,j} = \alpha_{N,i} \in [a, b],$$
which completes the proof of the proposition.

From Proposition 8, we immediately see that hypothesis (H2) holds, i.e. $\pi_N(C) \subset C$ for all $N$. Let $\hat{h}_N$ be the solution of the finite-dimensional optimization problem (P$_N$). From Proposition 7 and Proposition 8, $\hat{h}_N$ can be expressed as
$$\forall x \in [0,1], \quad \hat{h}_N(x) = \sum_{j=0}^{N} (\alpha_{\mathrm{opt}})_j\, \varphi_{N,j}(x),$$
where $\alpha_{\mathrm{opt}} \in \mathbb{R}^{N+1}$ is the solution of the following QP:
$$\arg\min_{\alpha \in \mathbb{R}^{N+1}, \; \alpha \in \tilde{I} \cap \tilde{C}} \|\alpha\|_{\mathbb{R}^{N+1}}^2,$$
where
$$\tilde{I} = \left\{ \alpha \in \mathbb{R}^{N+1} : \; \sum_{j=0}^{N} \alpha_{N,j}\, \varphi_{N,j}\big(x^{(i)}\big) = y_i, \; i = 1, \ldots, n \right\}, \qquad \tilde{C} = \left\{ \alpha \in \mathbb{R}^{N+1} : \; a \leq \alpha_{N,j} \leq b, \; j = 0, \ldots, N \right\}.$$
To ensure hypothesis (H1), we need to suppose $a < y_i < b$, $i = 1, \ldots, n$. In the illustration example (see Figure 2), $a = 0$, $b = 1$ and $n = 6$. The value of the parameter $\theta$ is fixed to $0.18$.
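Putting the pieces together for this boundedness example, a possible end-to-end sketch is given below; it reuses the `solve_PN_tilde` helper sketched in Sect. 2.4, takes $a = 0$, $b = 1$ and $\theta = 0.18$ from the text, and uses six illustrative data points (the paper's data are only shown in Figure 2), an arbitrary mesh size and a small jitter on $\Gamma_N$ for numerical stability.

```python
import numpy as np

theta, a, b, N = 0.18, 0.0, 1.0, 30
t = np.linspace(0.0, 1.0, N + 1)                          # mesh t_{N,0}, ..., t_{N,N}
kern = lambda u, v: np.exp(-(u[:, None] - v[None, :])**2 / (2 * theta**2))
Gamma = kern(t, t) + 1e-8 * np.eye(N + 1)                 # jitter: a practical choice, not from the paper

x_data = np.array([0.05, 0.2, 0.4, 0.6, 0.8, 0.95])       # illustrative data with a < y_i < b
y_data = np.array([0.1, 0.7, 0.9, 0.4, 0.3, 0.8])

phi = lambda x: np.array([np.interp(x, t, row) for row in np.eye(N + 1)]).T  # hat functions
A_eq = phi(x_data)                                        # interpolation conditions A_eq alpha = y
G = np.vstack([np.eye(N + 1), -np.eye(N + 1)])            # a <= alpha_j <= b  written as  G alpha <= h
h = np.concatenate([np.full(N + 1, b), np.full(N + 1, -a)])

alpha_opt = solve_PN_tilde(Gamma, A_eq, y_data, G, h)     # QP helper sketched in Sect. 2.4
h_hat_N = lambda x: phi(x) @ alpha_opt                    # bounded, interpolating approximation
print(h_hat_N(x_data))                                    # should (approximately) reproduce y_data
```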

Fig. 2: Unconstrained and constrained interpolating functions ($\hat{h}_N$ with $N = 500$) (Figure 2a). Convergence of the discretized solution $\hat{h}_N$, $N = 10$ and $N = 50$ (Figure 2b).

In order to investigate the convergence of the proposed method, we plot in Figure 2a the solution of problem (P) without boundedness constraints (formula (2), black line) and the solution $\hat{h}_N$ of the discretized optimization problem (P$_N$) for $N = 500$ (red line), which is assumed to be very close to the function $\hat{h}$. Unlike the former, the latter respects both the interpolation conditions and the boundedness constraints. Figure 2b shows the convergence of the proposed approximate solution. The red line is the function $\hat{h}_N$ for $N = 500$; the blue and the green dashed lines represent respectively the function $\hat{h}_N$ for $N = 10$ and $N = 50$.

3.2 Monotonicity in one dimension

$E$ and $H$ are the spaces defined in the previous Sect. 3.1. The convex set $C$ is the set of monotone non-decreasing functions, defined as
$$C := \left\{ f \in C^0([0,1]) : \; f(x) \leq f(x') \text{ if } x \leq x' \right\}.$$
Using the notation introduced before, we have the following result:

Proposition 9 Let $h_N \in H_N$. Then $h_N(x) := \sum_{j=0}^{N} \alpha_{N,j}\, \varphi_{N,j}(x)$ is non-decreasing if and only if the sequence $(\alpha_{N,j})_{j=0,\ldots,N}$ is non-decreasing (i.e. $\alpha_{N,j-1} \leq \alpha_{N,j}$, $j = 1, \ldots, N$).

Proof If the sequence $(\alpha_{N,j})_{j=0,\ldots,N}$ is non-decreasing, then $h_N$ is clearly non-decreasing, since it is piecewise linear with non-decreasing values at the knots. Conversely, if $h_N$ is non-decreasing, then so is the sequence $(\alpha_{N,j})_j = (h_N(t_{N,j}))_j$, $j = 0, \ldots, N$.

From this proposition, we see again that hypothesis (H2) holds, i.e. $\pi_N(C) \subset C$ for all $N$. Moreover, the interpolation conditions and the inequality constraints in $\mathbb{R}^{N+1}$ can be expressed as follows:
$$\tilde{I} = \left\{ \alpha \in \mathbb{R}^{N+1} : \; \sum_{j=0}^{N} \alpha_{N,j}\, \varphi_{N,j}\big(x^{(i)}\big) = y_i, \; i = 1, \ldots, n \right\}, \tag{15}$$
$$\tilde{C} = \left\{ \alpha \in \mathbb{R}^{N+1} : \; \alpha_{N,j-1} \leq \alpha_{N,j}, \; j = 1, \ldots, N \right\}. \tag{16}$$
To ensure hypothesis (H1), we suppose $y_{i-1} < y_i$, $i = 2, \ldots, n$. From Proposition 7, the solution of problem (P$_N$) is equal to
$$\forall x \in [0,1], \quad \hat{h}_N(x) = \sum_{j=0}^{N} (\alpha_{\mathrm{opt}})_j\, \varphi_{N,j}(x),$$
where $\alpha_{\mathrm{opt}} \in \mathbb{R}^{N+1}$ is the solution of the problem ($\tilde{P}_N$) with $\tilde{I}$ and $\tilde{C}$ defined in (15) and (16) respectively. Figure 3 shows the convergence of the proposed algorithm. In Figure 3a, the black line is the solution of problem (P) without monotonicity constraints and the red line is the solution of the discretized optimization problem (P$_N$) for $N = 500$. Notice that only the latter respects both the interpolation conditions and the monotonicity constraints. In Figure 3b, the convergence of different approximations is illustrated. The red line represents the function $\hat{h}_N$ for $N = 500$, and the blue line (resp. the green line) corresponds to the function $\hat{h}_N$ for $N = 5$ (resp. $N = 20$).

Fig. 3: Unconstrained and constrained interpolating functions ($\hat{h}_N$ with $N = 500$) (Figure 3a). Convergence of the discretized solutions $\hat{h}_N$, $N = 5$ and $N = 20$ (Figure 3b).
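For the monotonicity constraints (16), the set $\tilde{C}$ is again polyhedral, so the same QP machinery as in Sect. 2.4 applies; a possible construction of the corresponding inequality matrix (the helper name is ours) is:

```python
import numpy as np

def monotonicity_constraints_1d(N):
    """Return (G, h) such that C-tilde = {alpha : G alpha <= h}, i.e. alpha_{j-1} <= alpha_j."""
    # row j encodes alpha_{j-1} - alpha_j <= 0, for j = 1, ..., N
    G = np.zeros((N, N + 1))
    G[np.arange(N), np.arange(N)] = 1.0
    G[np.arange(N), np.arange(N) + 1] = -1.0
    return G, np.zeros(N)

G, h = monotonicity_constraints_1d(5)
print(G)   # each row reads (..., 1, -1, ...): alpha_{j-1} - alpha_j <= 0
```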

3.3 Case of a finite number of constraints

The so-called constrained interpolation splines are defined as solutions of problem (P) where the general norm (or semi-norm) is defined from a differential operator. In the framework of spline theory, the problem of interpolation under a finite number of inequality constraints has been solved (see e.g. [8], [9] in $\mathbb{R}^2$ and [16] in $\mathbb{R}$). Our aim in this section is only to assess the convergence of the proposed method by comparing it with the analytical solution. To this end, let us point out that our method can easily be applied to a finite number of inequality constraints. In the following, we recall the main results given in [8].


Firstly, the interpolation conditions are defined as $f(x^{(i)}) = y_i$, $i = 1, \ldots, n$. Secondly, the finite number of inequality constraints, referred to respectively as lower and upper inequality constraints, are defined as in [8]:
$$f\big(x^{(i)}\big) \geq y_i, \quad i = n+1, \ldots, n+p_1, \tag{17}$$
$$f\big(x^{(i)}\big) \leq y_i, \quad i = n+p_1+1, \ldots, n+p_1+p_2. \tag{18}$$
In this case, we have $n$ interpolation conditions and $p$ inequality constraints with $p = p_1 + p_2$. The analytical form of the constrained interpolation spline is given by Dubrule and Kostov in [8]:
$$\sigma(x) = \sum_{i=1}^{n+p} b_i\, K\big(x, x^{(i)}\big), \tag{19}$$
where the function $K$ is the underlying reproducing kernel of the RKHS $H$. The $(n+p)$ coefficients $b = (b_1, \ldots, b_{n+p})^\top$ are obtained by solving the following quadratic optimization problem:
$$\arg\min_{b} \; \sum_{i=1}^{n+p} \sum_{j=1}^{n+p} b_i b_j\, K\big(x^{(i)}, x^{(j)}\big),$$
under the $n$ interpolation conditions ($f(x^{(i)}) = y_i$, $i = 1, \ldots, n$) and the $p$ inequality constraints given in (17) and (18). This form can be generalized to any kernel or semi-kernel, stationary covariance function or generalized covariance function, see [8] and [15].
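For comparison purposes, the coefficients $b$ of (19) can themselves be obtained from a small QP, since $\sigma(x^{(j)}) = (\mathbb{K}_{\mathrm{all}} b)_j$ with $\mathbb{K}_{\mathrm{all}}$ the kernel matrix at all $n + p$ points; the sketch below uses a generic solver, the function name is ours, and it assumes $p_1, p_2 \geq 1$.

```python
import numpy as np
from scipy.optimize import minimize

def constrained_spline_coeffs(K_all, y, n, p1, p2):
    """Coefficients b of the constrained interpolation spline (19).

    K_all : (n+p, n+p) kernel matrix at all points x^(1), ..., x^(n+p);
    y     : vector of length n+p (equality values, then lower, then upper thresholds);
    the first n points carry equalities, the next p1 lower bounds, the last p2 upper bounds."""
    sigma_at_points = lambda b: K_all @ b            # sigma(x^(j)) = sum_i b_i K(x^(j), x^(i))
    cons = [{"type": "eq",   "fun": lambda b: sigma_at_points(b)[:n] - y[:n]},
            {"type": "ineq", "fun": lambda b: sigma_at_points(b)[n:n + p1] - y[n:n + p1]},
            {"type": "ineq", "fun": lambda b: y[n + p1:] - sigma_at_points(b)[n + p1:]}]
    res = minimize(lambda b: b @ K_all @ b, np.zeros(len(y)),
                   jac=lambda b: 2.0 * K_all @ b, constraints=cons, method="SLSQP")
    return res.x
```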

The convex set $C$ is the subset of functions that satisfy (17) and (18). In that case, the solution of the discretized optimization problem (P$_N$) is expressed as follows:
$$\hat{h}_N(x) = \sum_{j=0}^{N} (\alpha_{\mathrm{opt}})_j\, \varphi_{N,j}(x), \tag{20}$$
where the $(N+1)$ coefficients $\big((\alpha_{\mathrm{opt}})_0, \ldots, (\alpha_{\mathrm{opt}})_N\big)$ are the solution of the following quadratic optimization problem
$$\arg\min_{\alpha \in \mathbb{R}^{N+1}, \; \alpha \in \tilde{I} \cap \tilde{C}} \|\alpha\|_{\mathbb{R}^{N+1}}^2,$$
where
$$\tilde{I} = \left\{ \alpha \in \mathbb{R}^{N+1} \text{ such that } \hat{h}_N\big(x^{(i)}\big) = y_i, \; i = 1, \ldots, n \right\}, \qquad \tilde{C} = \left\{ \alpha \in \mathbb{R}^{N+1} \text{ such that } \hat{h}_N \text{ verifies (17) and (18)} \right\}.$$

In Figure 4, the kernel is the Matérn 3/2 covariance kernel, defined as follows:
$$K_{m3/2}(x, y) = \left( 1 + \frac{\sqrt{3}\,|x - y|}{\theta} \right) \exp\left( -\frac{\sqrt{3}\,|x - y|}{\theta} \right),$$
where $\theta$ is a smoothing parameter of value $0.3$. We choose $n = 6$ interpolation conditions and $p = 3$ inequality constraints ($p_1 = 2$ and $p_2 = 1$). The black line represents the constrained interpolation spline given by (19). In Figures 4a and 4b, we plot respectively the function $\hat{h}_N$ given in (20) for $N = 10$ and $N = 40$. Notice that $\hat{h}_N$ respects both the interpolation conditions and the inequality constraints, and coincides with the constrained interpolation spline when $N$ is large enough.

In Figure 5, the Gaussian kernel is used, where the parameter $\theta$ is also equal to $0.3$. The black line represents the constrained interpolation spline using Dubrule's algorithm. The red dashed line is the function $\hat{h}_N$ defined as the solution of problem (P$_N$) for $N = 10$ (Figure 5a) and $N = 40$ (Figure 5b).

Fig. 4: The black line represents the constrained interpolation spline using the Matérn 3/2 kernel. The red dashed line corresponds to the function $\hat{h}_N$ for $N = 10$ (Figure 4a) and $N = 40$ (Figure 4b).

Fig. 5: The black line represents the constrained interpolation spline using the Gaussian kernel. The red dashed line corresponds to the function $\hat{h}_N$ for $N = 10$ (Figure 5a) and $N = 40$ (Figure 5b).
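A minimal sketch of the Matérn 3/2 kernel used for Figure 4 ($\theta = 0.3$ as in the text; the evaluation points are arbitrary):

```python
import numpy as np

def matern32_kernel(x, y, theta=0.3):
    """Matern 3/2 kernel: (1 + sqrt(3)|x - y| / theta) * exp(-sqrt(3)|x - y| / theta)."""
    r = np.sqrt(3.0) * np.abs(x[:, None] - y[None, :]) / theta
    return (1.0 + r) * np.exp(-r)

t = np.linspace(0.0, 1.0, 5)
print(matern32_kernel(t, t))   # Gram matrix at a few knots
```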

3.4 Monotonicity in multidimensional cases

Let us begin with the two-dimensional case, where $x = (x_1, x_2) \in \mathbb{R}^2$. The unknown function $f$ is assumed to be continuous and monotone non-decreasing on the unit square $X = [0,1] \times [0,1]$:
$$x_1 \leq x_1' \; \text{ and } \; x_2 \leq x_2' \; \Longrightarrow \; f(x_1, x_2) \leq f(x_1', x_2').$$
As in the one-dimensional case, we construct the basis functions such that the monotonicity constraints are equivalent to constraints on the coefficients. First, we discretize the unit square, e.g. uniformly with $(N+1)^2$ knots (see Figure 6 for $N = 7$). Then, the basis function at the knot $(t_{N,i}, t_{N,j})$ is defined as
$$\varphi_{i,j}(x) := \varphi_{N,i}(x_1)\, \varphi_{N,j}(x_2),$$


where the $\varphi_{N,j}$, $j = 0, \ldots, N$, are given in (3). Now, we have
$$\varphi_{i,j}(t_{N,k}, t_{N,l}) = \varphi_{N,i}(t_{N,k})\, \varphi_{N,j}(t_{N,l}) = \delta_{i,k}\, \delta_{j,l}, \quad k, l = 0, \ldots, N.$$

Fig. 6: Data for the monotone 2D interpolation problem (black points) and knots $(t_{N,i}, t_{N,j})_{0 \leq i,j \leq 7}$ used to define the basis functions.

Proposition 10 Let $h_N \in H_N$. Then $h_N(x) := \sum_{i,j=0}^{N} \alpha_{i,j}\, \varphi_{N,i}(x_1)\, \varphi_{N,j}(x_2)$ is non-decreasing with respect to the two input variables if and only if the $(N+1)^2$ coefficients $\alpha_{i,j}$, $i, j = 0, \ldots, N$, verify the following linear constraints:
(i) $\alpha_{i-1,j} \leq \alpha_{i,j}$ and $\alpha_{i,j-1} \leq \alpha_{i,j}$, $i, j = 1, \ldots, N$;
(ii) $\alpha_{i-1,0} \leq \alpha_{i,0}$, $i = 1, \ldots, N$;
(iii) $\alpha_{0,j-1} \leq \alpha_{0,j}$, $j = 1, \ldots, N$.

Proof If the $(N+1)^2$ coefficients $\alpha_{i,j}$, $i, j = 0, \ldots, N$, verify the linear constraints (i), (ii) and (iii), then $h_N$ is non-decreasing in each of the $x_1$ and $x_2$ directions, as a piecewise linear function with non-decreasing nodal values. Conversely, the relations $h_N(t_{N,i}, t_{N,j}) = \alpha_{i,j}$, $i, j = 0, \ldots, N$, complete the proof of the proposition.

Consequently, the solution of the problem (P$_N$) can be expressed as
$$\hat{h}_N(x) = \sum_{i,j=0}^{N} (\alpha_{\mathrm{opt}})_{i,j}\, \varphi_{N,i}(x_1)\, \varphi_{N,j}(x_2),$$
where $\alpha_{\mathrm{opt}} = ((\alpha_{\mathrm{opt}})_{i,j})$ is the solution of the following QP:
$$\arg\min_{\alpha \in \mathbb{R}^{(N+1)^2}, \; \alpha \in \tilde{I} \cap \tilde{C}} \|\alpha\|_{\mathbb{R}^{(N+1)^2}}^2,$$
with $\tilde{C} = \big\{ \alpha \in \mathbb{R}^{(N+1)^2} : \text{(i), (ii) and (iii) of Proposition 10 are satisfied} \big\}$ and $\tilde{I}$ defined as in (15).
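In practice, the constraints (i)-(iii) of Proposition 10 again form a sparse set of pairwise differences on the vectorized coefficients; one possible construction (the row-wise vectorization $\alpha_{i,j} \mapsto \alpha[i(N+1)+j]$ and the helper name are conventions chosen here) is:

```python
import numpy as np

def monotonicity_constraints_2d(N):
    """Rows of G encode alpha_{i-1,j} - alpha_{i,j} <= 0 and alpha_{i,j-1} - alpha_{i,j} <= 0.

    Coefficients are vectorized row-wise: alpha[i, j] corresponds to index i * (N + 1) + j."""
    idx = lambda i, j: i * (N + 1) + j
    rows = []
    for i in range(N + 1):
        for j in range(N + 1):
            if i >= 1:                      # non-decreasing in the x1 direction
                row = np.zeros((N + 1) ** 2)
                row[idx(i - 1, j)], row[idx(i, j)] = 1.0, -1.0
                rows.append(row)
            if j >= 1:                      # non-decreasing in the x2 direction
                row = np.zeros((N + 1) ** 2)
                row[idx(i, j - 1)], row[idx(i, j)] = 1.0, -1.0
                rows.append(row)
    G = np.vstack(rows)
    return G, np.zeros(len(rows))

G, h = monotonicity_constraints_2d(3)
print(G.shape)   # 2 * N * (N + 1) constraints on (N + 1)^2 coefficients
```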

In Figure 7, we take the kernel to be the two-dimensional Gaussian kernel
$$K(x, y) = \exp\left( -\frac{(x_1 - y_1)^2}{2\theta_1^2} \right) \times \exp\left( -\frac{(x_2 - y_2)^2}{2\theta_2^2} \right),$$
with smoothing parameters $(\theta_1, \theta_2) = (0.4, 0.4)$. In Figures 7a and 7b, we plot respectively the solution $\hat{h}_N$ of the discretized optimization problem (P$_N$) for $N = 20$ and the associated contour levels. Notice that $\hat{h}_N$ satisfies both the interpolation conditions and the monotonicity (non-decreasing) constraints with respect to the two input variables. Figure 8 shows the case where the true function is known to be monotone (non-decreasing) with respect to the second variable only (see Remark 2 below). In this case, the set of constraints is $C := \{ f \in E : \; f(x_1, x_2) \leq f(x_1, x_2') \text{ if } x_2 \leq x_2' \}$, where $E = C^0(X)$.

Fig. 7: The solution $\hat{h}_N$ of the discretized optimization problem (P$_N$) (Figure 7a) and the associated contour levels (Figure 7b).

Remark 2 Proposition 10 can easily be extended to monotonicity with respect to only one of the input variables. For instance, the function $h_N$ defined in Proposition 10 is non-decreasing with respect to the second variable if and only if the $(N+1)^2$ coefficients verify $\alpha_{i,j-1} \leq \alpha_{i,j}$, $j = 1, \ldots, N$ and $i = 0, \ldots, N$.

Remark 3 Monotonicity in the general multidimensional case is a simple extension of the two-dimensional case. Remark 2 can be extended as well to monotonicity with respect to any subset of the variables $x_1, \ldots, x_d$.

4 Splines case

The aim of this section is to compare the method described in this paper with existing algorithms. We focus on cubic spline interpolation with inequality constraints.
