Enhanced electrical impedance tomography via the Mumford-Shah functional

(1)

URL:http://www.emath.fr/cocv/

ENHANCED ELECTRICAL IMPEDANCE TOMOGRAPHY VIA THE MUMFORD–SHAH FUNCTIONAL

Luca Rondi

¹

and Fadil Santosa

²

Abstract. We consider the problem of electrical impedance tomography where conductivity distribution in a domain is to be reconstructed from boundary measurements of voltage and currents. It is well-known that this problem is highly illposed. In this work, we propose the use of the Mumford–Shah functional, developed for segmentation and denoising of images, as a regularization. After establishing existence properties of the resulting variational problem, we proceed by demonstrating the approach in several numerical examples. Our results indicate that this is an effective approach for overcoming the illposedness. Moreover, it has the capability of enhancing the reconstruction while at the same time segmenting the conductivity image.

Mathematics Subject Classification. 35R25, 35R30, 35J25, 68U10.

Received August 16, 2000. Revised April 12, 2001.

1. Introduction and formulation of the problem

The purpose of this work is to demonstrate that the Mumford–Shah functional from image processing can be used effectively to regularize the classical problem of electrical impedance tomography. In electrical impedance tomography the objective is to determine the conductivity distribution in a domain from measurements collected at the boundary. Such a problem is often referred to as the inverse conductivity problem. The underlying physical phenomena, that of electrostatics, is modeled by an elliptic partial differential equations. The data available from measurement amounts to (limited) information about the Neumann-to-Dirichlet map.

In this work, we will consider conductivity distribution that is discontinuous. The Mumford–Shah functional, which is used in image processing as a method for segmentation and denoising, can be shown to regularize this otherwise illposed inverse problem. Moreover, as we demonstrate in the paper, it allows one not only to obtain a good image of the conductivity distribution, but also to determine the jump set of the conductivity.

We begin by giving a precise formulation of the inverse conductivity problem. Consider a bounded domain Ω in Rⁿ, n≥2, with sufficiently smooth boundary, namely we assume∂Ω to be Lipschitz. LetH¹(Ω) ={u∈ L²(Ω) : ∇u∈L²(Ω,Rⁿ)}, where ∇udenotes the gradient in the distribution sense. By H^1/2(∂Ω) we denote the space of traces ofH¹(Ω) on∂Ω. We recall thatH^−1/2(∂Ω) is the dual space toH^1/2(∂Ω). With₀H^1/2(∂Ω),

0H^−1/2(∂Ω) and₀L²(∂Ω) we denote the corresponding subspaces of elements with zero means. We note that

0H^1/2(∂Ω) and₀H^−1/2(∂Ω) are dual to each other, whereas the dual of₀L²(∂Ω) is the space itself. We recall

Keywords and phrases: Electrical impedance tomography, inverse problems for elliptic equations, regularization of illposed problem, image enhancement.

1 Institut f¨ur Industriemathematik/SFB 013, Johannes Kepler Universit¨at Linz, Austria. Present address: School of Mathe- matics, University of Minnesota, Minneapolis, MN 55455, U.S.A.

2School of Mathematics, University of Minnesota, Minneapolis, MN 55455, U.S.A.; e-mail:[email protected]

c EDP Sciences, SMAI 2001

(2)

also that if X and Y are two Banach spaces then B(X, Y) will be the space of all bounded linear operators fromX toY, with the usual operator norm. IfX=Y we setB(X, Y) =B(X).

Let σ be the conductivity of the medium occupying the region Ω. We make the assumption that σ is a measurable function on Ω satisfying

0< λ≤σ(z)≤λ⁻¹ for a.e. z∈Ω (1.1) where λis a positive constant less than 1. We leturepresent the electrostatic potential in Ω. The potential is created by a current distributionf on the boundary; we assumef ∈₀H^−1/2(∂Ω). Then the potentialusatisfies the folllowing Neumann type boundary value problem





div(σ∇u) = 0 in Ω σ∇u·ν=f on∂Ω.

u|_∂Ω∈₀H^1/2(∂Ω)

(1.2)

The boundary value problem (1.2) admits a unique weak solution. That is there exists a unique function u∈H¹(Ω) whose trace on∂Ω has zero mean such that

Z

Ωσ∇u· ∇φ=f[φ|∂Ω] for everyφ∈H¹(Ω). (1.2_w) We note that if f ∈0L²(∂Ω), thenf[φ|∂Ω] =R

∂Ωf φfor anyφ∈H¹(Ω).

For any σsatisfying (1.1) for someλ, 0 < λ <1, the so-called Neumann-to-Dirichlet map associated toσ, Λ(σ,·) :0H^−1/2(∂Ω)7→0H^1/2(∂Ω), is defined in the following way

Λ(σ, f) =u|_∂Ω for anyf ∈₀H^−1/2(∂Ω)

where uis the weak solution to (1.2). We have that Λ(σ,·) is a bounded linear operator from 0H^−1/2(∂Ω) to

0H^1/2(∂Ω) whose norm depends upon Ω andλonly. In the sequel we shall often consider Λ(σ,·) as a bounded linear operator from ₀L²(∂Ω) into itself. We remark that obviously

kΛ(σ,·)k_B(₀_L²_(∂Ω))≤CkΛ(σ,·)k_B(₀_H^−1/2_(∂_Ω),₀_H^1/2_(∂Ω)) holds with a constantC >0 depending on Ω only.

The inverse conductivity problem, which has been formulated for the first time by Calder´on in [C], consists in determining if the conductivityσis uniquely determined by the associated Neumann-to-Dirichlet map. The pioneering work of Kohn and Vogelius [Ko-V], and later, the work of Sylvester and Uhlmann [Sy-U], provided the definitive uniqueness results on this problem. For an overview discussion on uniqueness we refer to a recent monograph by Isakov [I].

In this paper we shall deal with the reconstruction issue, that is, we are looking for a procedure which allows us to recover the unknown conductivity σ from the knowledge, possibly partial, of the Neumann-to-Dirichlet map Λ(σ,·). We are interested in the case when the unknown conductivityσmay present some discontinuities, and our aim is, in particular, to recover the set where such discontinuities occur. In fact we shall suppose to know a priori thatσ, besides satisfying (1.1) for a fixed constantλ, either ispiecewise H¹, that is

σ∈P H¹(Ω) =

σ∈L^∞(Ω) : σ∈H¹(Ω\K)

K closed in Ω, Hⁿ⁻¹(K)<∞

or is piecewise constant, that is σ∈P C(Ω) =

σ∈L^∞(Ω) : σ∈H¹(Ω\K), ∇σ= 0 a.e. in Ω\K Kclosed in Ω, Hⁿ⁻¹(K)<∞

·

(3)

We recall thatHⁿ⁻¹denotes the (n−1)-dimensional Hausdorff measure. We remark that, in an equivalent way, we say thatσbelongs toP C(Ω) if it is constant on any connected component of Ω\K,Kbeing a closed set in Ω with finite (n−1)-dimensional Hausdorff measure.

Since the inverse conductivity problem is severely illposed, a regularization procedure is needed to stabilize any reconstruction process. It is known that standard regularization techniques, such as penalizing theH₁-norm stabilizes the problem at the cost of loss of resolution. In this work, we explore the use of techniques from image processing, which stabilizes and enhances the reconstruction. An earlier work of Dobson and Santosa [D-S]

employed minimal total variation (TV) criterion, which is popular in image processing, to stabilize and enhance the reconstruction for electrical impedance tomography.

In the present work, we also borrow another idea from the image processing literature, namely the use of the Mumford–Shah functional, as a regularization procedure. Mumford and Shah [Mu-Sh1, Mu-Sh2] devised the functional as a way to denoise and segment a black-and-white image. In order to consider the Mumford–Shah functional in the context of electrical impedance tomography, letF(σ) be the followingL²data-fitting functional F(σ) =kΛ(σ,·)−Gk²_B(₀_L2(∂Ω)). (1.3) The data G being a linear, bounded operator from ₀L²(∂Ω) into itself representing the measurements of the electrostatic potentials corresponding to any current density belonging to ₀L²(∂Ω) applied to the conductor Ω. From the point of view of applications, we never have the entire Neumann-to-Dirichlet map but rather N different measurements g_i ∈0L²(∂Ω), corresponding to current densitiesf_i ∈0L²(∂Ω), i= 1,· · ·, N, and we shall substitute the term described in (1.3) with

XN i=1

Z

∂Ω|Λ(σ, f_i)−g_i|². (1.4)

The variational problem we consider is

(σ,K)min (

α Z

Ω\K|∇σ|²+βHⁿ⁻¹(K) +γF(σ) )

(1.5) where K is closed in Ω andσ ∈H¹(Ω\K). Here α, β andγ are positive tuning parameter. Without loss of generality we can always impose α= 1. The first term of the functional above is a smoothing term implying that outside the discontinuity set K the image should be smooth. The second is a penalization term on the length of the discontinuity set K, and therefore prevents the creation of spurious discontinuities due to noise.

The third term, the fitting term, represents the faithfulness of the reconstruction with respect to the input data.

In the context of image processing,σwould represent a grey-level image, and the third term in (1.5) would be replaced by

Z

Ω|σ−g|²

wheregis the raw input image. Also the so-calledminimal partition problem has been considered

(σ,K)min

βHⁿ⁻¹(K) +γ Z

Ω|σ−g|²

(1.6) where K is closed in Ω,σ∈H¹(Ω\K) and∇σ= 0 almost everywhere in Ω\K.

The existence of a solution to the Mumford–Shah minimization problem has been obtained in the framework of free-discontinuity problem by [DG-Ca-L] introducing a relaxed functional in a space of functions of bounded variation. By the direct method in the Calculus of Variations a minimum of such a relaxed functional does

(4)

exist and, by a regularity theory argument, it produces also a minimum for the original functional. Moreover they showed that if (σ, K) solves the variational problem, thenσis indeed piecewiseC¹, that isσ∈C¹(Ω\K).

In a similar framework in [Co-Ta] the existence of a solution has been proved also for the minimal partition problem (1.6).

The computational problem associated with these functionals is an active area of research. A main difficulty is presented by the penalty term involving the Hausdorff measure of the setK. One approach is to approximate these functionals in the sense of Γ-convergence by functionals defined on spaces of smooth functions. We refer to the monograph of Braides [Br], for a survey of such results. In this work, we employ the method of Γ-convergence approximation of the Mumford–Shah functional by elliptic, although non-convex, functionals defined on Sobolev spaces due to Ambrosio and Tortorelli [A-T1, A-T2].

The plan of the paper is as follows. In Section 2 we shall study the properties of the Neumann-to-Dirichlet map with respect to the first argument, that is the conductivity. We shall prove continuity and differentiability with respect to suitableL^pnorms trying to keep at a minimum the regularity assumptions of the conductivities involved, given our interest in the recovery of discontinuous functions. The minimization associated with the inverse conductivity problem is considered in Section 3. The main result pertaining to this problem is stated in Theorem 3.7. In addition to the problem stated in (1.5), we also consider the linearized inverse problem, where the Neumann-to-Dirichlet map is linearized about a given conductivityσ₀. Our result for this problem is given in Theorem 3.8. We also treat a more general linear inverse problem where the fitting term in (1.5) is replaced by

F(σ) = Z

Ω|A[σ]−g|² (1.7)

whereAis a bounded linear operator fromL^p(Ω) into itself for anyp, 1≤p≤ ∞. We shall focus our attention upon the case in whichAsatisfies some compactness properties, for instance whenAis a blurring convolution.

Existence results for this case are stated in Theorem 3.9. Section 3 ends with a review of the Ambrosio and Tortorelli approximation of the Mumford–Shah functional where we shall note that it may be easily applied to our problems. In Section 4 some numerical examples will be presented in order to show the potential of the method. We shall limit ourselves to the linearized inverse conductivity problem and, concerning the other inverse problems, to the case when the operator Ais a convolution. A short discussion section ends the paper.

2. Regularity of the Neumann-to-Dirichlet map

Some regularity results on the Neumann-to-Dirichlet map as a function of the conductivityσwill be needed in the sequel to establish existence results for the minimization problems stated above. We follow some of the techniques developed in [D] where it is shown that higher integrability properties of the solutions to problem (1.2) are decisive for proving differentiability properties of the Neumann-to-Dirichlet map. In order to lower as much as possible the regularity of the conductivities involved, we shall make use of the following regularity results for elliptic equations in divergence form.

We look for conditions upon which weak solutions to elliptic equations in divergence form in a domain Ω belong to H_loc^1,q(Ω) withq >2. Let us consider the following definition.

Definition 2.1. Let Ω be a bounded domain contained in Rⁿ, n ≥2, and let σ ∈L^∞(Ω) satisfy (1.1) for a fixedλ, 0< λ <1. We say thatσsatisfies theQ-property, 2< Q≤ ∞, if for any 2< q < Qthe following holds.

Iff ∈L^q(Ω,Rⁿ),h∈L^q(Ω) andu∈H¹(Ω) is a weak solution to div(σ∇u) = div(f) +h in Ω

(5)

then u∈H_loc^1,q(Ω) and for any Ω₁⊂⊂Ω the following estimate holds

kukH^1,q(Ω1)≤C kukH¹(Ω)+kfkL^q(Ω)+khkL^q(Ω)

(2.1) where the constantCdepends onλ,n,q, Ω₁, Ω andσ.

The following result, due to Meyers [M], states that any σ satisfying (1.1) with a constant λ, 0< λ < 1, satisfies theQ-property for a constantQ >2 depending onλandnonly.

Theorem 2.2 (Meyers). Let Ω be a bounded Lipschitz domain contained in Rⁿ, n≥2. Fixed λ, 0 < λ <1, there exists a constantQ,2< Q <∞, depending on λand on nonly,Q→2 asλ→0 andQ→ ∞asλ→1, such that any σ∈L^∞(Ω) satisfying (1.1)with constantλsatisfies theQ-property.

Moreover the constant C in (2.1)depends onλ,n,q,Ω₁ andΩonly.

We note that in the previous theorem no regularity has been assumed onσ. Omitting the dependence upon n, we shall denote by Q(λ) the number Q defined in Meyers’s theorem. For anyσ satisfying (1.1), we shall denote by Q(σ) the supremum of all the numbersQso thatσ satisfies theQ-property. The previous theorem ensures that 2< Q(λ)≤Q(σ)≤ ∞. Some regularity properties ofσ may imply thatQ(σ) is strictly greater than Q(λ). In this case the constant in (2.1) will depend upon σ not only through the ellipticity constantλ but also from these regularity properties. Let us recall the following result ([Tr], Th. 3.7), stating that if σis H¨older continuous thenQ(σ) =∞.

Theorem 2.3. Let Ωbe a bounded Lipschitz domain contained in Rⁿ,n≥2, and letσ∈L^∞(Ω) satisfy (1.1) for a given positive constant λ,0< λ <1. Moreover let us assume thatσ∈C^0,δ(Ω),0< δ <1.

Then σsatisfy the∞-property and the constantC in (2.1)depends onλ,δ,kσk_C0,δ(Ω),n,q,Ω₁ andΩ.

It might be interesting to find a characterization of Q(σ) for σ belonging to the class of functions which are used in this papers, namely piecewise H¹, piecewise constant orSBV functions (which we shall introduce later on in Sect. 3). In fact these functions have a richer structure than those in L^∞ although they might be discontinuous, thus preventing the application of Theorem 2.3. With a somewhat different motivation, some studies in this directions have been developed recently in [Bo-V, Li-V].

We recall that, given a bounded and Lipschitz domain Ω, for anyσsatisfying (1.1) in Ω for a given positive constantλwe have defined Λ(σ,·) :0H^−1/2(∂Ω)7→0H^1/2(∂Ω) as a bounded linear operator given by

Λ(σ, f) =u|_∂Ω

whereuis the solution to (1.2) andf is any element of ₀H^−1/2(∂Ω). First of all we notice that there exists a constantC depending onλand Ω only such that ifusolves (1.2) then

kuk_H¹_(Ω)≤Ckfk₀_H^−1/2_(∂Ω). (2.2)

Therefore, as we have already noticed in Section 1, the norm of Λ(σ,·) either inB(₀H^−1/2(∂Ω),₀H^1/2(∂Ω)) or in B(₀L²(∂Ω)) depends onλand Ω only.

Let us consider the regularity properties of Λ with respect toσon the following set of admissible conductivities. We shall fix a constant λ >0 and set

D={σ∈L^∞(Ω) : σsatisfies (1.1) and supp(σ−τ₀)⊂Ω⁰} (2.3) where Ω⁰ is a smooth subset compactly contained in Ω and τ₀ is a given conductivity satisfying (1.1) as well.

We shall endowDwith anL^p-norm, 1≤p≤ ∞, usually withp= 2.

Let us call F:D 7→B(₀L²(∂Ω)) the function so defined

F(σ) = Λ(σ,·) for everyσ∈ D.

(6)

First of all we show the uniform continuity ofF in Dwith respect to theL^p-norm, for any 1≤p≤ ∞. In fact letσ₀ andσ₁be two conductivities belonging toD. Let us fixf ∈0L²(∂Ω) and letu₀andu₁be the weak solutions to (1.2) whereσis replaced byσ₀andσ₁respectively. By the weak formulation of (1.2) we infer that for anyφ∈H¹(Ω) Z

Ωσ₀∇u₀· ∇φ= Z

Ωσ₁∇u₁· ∇φ and so we have that, still for anyφ∈H¹(Ω),

Z

Ωσ₀∇(u₀−u₁)· ∇φ= Z

Ω(σ₁−σ₀)∇u₁· ∇φ.

Ifw∈H¹(Ω) is the weak solution to





div(σ₀∇w) = 0 in Ω σ₀∇w·ν = (u₀−u₁)|_∂Ω on∂Ω w|∂Ω∈0H^1/2(∂Ω)

(2.4)

we have that Z

∂Ω|u₀−u₁|²= Z

Ωσ₀∇w· ∇(u₀−u₁) = Z

Ω(σ₁−σ₀)∇u₁· ∇w.

We take qso that 2< q < Q(λ) andp=q/(q−2). Then by H¨older’s inequality and by the definition ofD ku0−u₁k²_L2(∂Ω)≤ kσ1−σ₀k_L^p_(Ω⁰₎k∇u1k_L^q_(Ω⁰₎k∇wk_L^q_(Ω⁰₎.

By Meyers’s theorem and (2.2) there exist constantsC andC₁ depending onλ,n, q, Ω⁰ and Ω only such that k∇u₁k_L^q_(Ω⁰₎≤Cku₁k_H¹_(Ω)≤C₁kfk_L²_(∂Ω).

By the same reasoning we obtain that

k∇wk_L^q_(Ω⁰₎≤C₁ku0−u₁k_L²_(∂Ω).

By collecting these last equations we immediately obtain the following proposition:

Proposition 2.4. LetF :D 7→B(₀L²(∂Ω)) be defined as

F(σ) = Λ(σ,·) for every σ∈ D

and let Dbe as in (2.3). Then for anyp,Q(λ)/(Q(λ)−2)< p≤ ∞, and any σ₀,σ₁ inDwe have kF(σ₀)− F(σ₁)k_B(₀_L²_(∂Ω))≤Ckσ₁−σ₀k_L^p_(Ω⁰₎

where C depends on λ, n, p, Ω⁰ and Ω only. Therefore F is Lipschitz continuous in D with respect to the L^p-norm,Q(λ)/(Q(λ)−2)< p≤ ∞, with a Lipschitz constant depending onλ,n,p,Ω⁰ andΩ only.

Since we havekσ1−σ₀k_L^∞_(Ω)≤λ⁻¹, by interpolation we deduce immediately as a corollary to the previous proposition that for anyp, 1≤p≤Q(λ)/(Q(λ)−2),F is H¨older continuous inDwith respect to theL^p-norm with constants depending onλ,n,p, Ω⁰ and Ω only.

Let us proceed now to the differentiability properties of F. We fix σ∈ D. For any f ∈0L²(∂Ω) we call u the solution to (1.2). Letδσbe a perturbation toσbelonging toL^∞(Ω⁰) and extended to zero outside Ω⁰. We

(7)

define δu∈H¹(Ω) as the weak solution to the following linearized problem





div(σ∇δu) =−div(δσ∇u) in Ω

σ∇δu·ν= 0 on∂Ω

δu|_∂Ω∈₀H^1/2(∂Ω)

(2.5)

and we shall callDF(σ) :L^∞(Ω⁰)7→B(₀L²(∂Ω)) the map so defined DF(σ)[δσ][f] =δu|_∂Ω

for anyδσ∈L^∞(Ω⁰) wheref ∈0L²(∂Ω) andδusolves (2.5). It is immediate to show that, for eachδσ∈L^∞(Ω⁰), DF(σ)[δσ] is a well defined bounded linear operator from 0L²(∂Ω) into itself and that the following estimate holds

kDF(σ)[δσ]k_B(₀_L²_(∂Ω))≤Ckδσk_L^∞_(Ω⁰₎

where the constantC depends onλand Ω only. Since it is clear from the definition thatDF(σ) is linear with respect toδσwe immediately infer thatDF(σ) is a bounded linear operator fromL^∞(Ω⁰) inB(₀L²(∂Ω)) with norm depending onλand Ω only.

We claim that for any p,Q(σ)/(Q(σ)−2)< p < ∞, we may extend DF(σ) to a bounded linear operator from L^p(Ω⁰) inB(₀L²(∂Ω)). We shall still denote this operator byDF(σ).

We fix δσ ∈L^∞(Ω⁰) andf ∈0L²(∂Ω). Letδu=DF(σ)[δσ][f]. As before we introduce w∈H¹(Ω) as the weak solution to





div(σ∇w) = 0 in Ω σ∇w·ν =δu|_∂Ω on∂Ω w|∂Ω∈0H^1/2(∂Ω)

(2.6)

and we observe that

Z

∂Ω|δu|²= Z

Ωσ∇w· ∇δu=− Z

Ωδσ∇u· ∇w.

Take qso that 2< q < Q(σ) andp=q/(q−2). Then by H¨older’s inequality kδuk²_L2(∂Ω)≤ kδσk_L^p_(Ω⁰₎k∇uk_L^q_(Ω⁰₎k∇wk_L^q_(Ω⁰₎.

Then sinceσsatisfies theQ(σ)-property we have a constantC depending onλ,n,p, Ω⁰, Ω andσsuch that kDF(σ)[δσ][f]k_L²_(∂Ω)≤Ckδσk_L^p_(Ω⁰₎kfk_L²_(∂Ω)

for anyδσ∈L^∞(Ω⁰) andf ∈₀L²(∂Ω). So the following proposition holds true.

Proposition 2.5. The linear operator DF(σ) : L^p(Ω⁰) 7→ B(₀L²(∂Ω)), such that DF(σ)[δσ][f] = δu|_∂Ω for any δσ∈L^∞(Ω⁰)and anyf ∈₀L²(∂Ω),δusolving (2.5), is bounded for anyp,Q(σ)/(Q(σ)−2)< p≤ ∞, and its norm depends on λ,n,p,Ω⁰,Ω andσ.

The linear operatorDF(σ) represents the differential ofF at the pointσin the sense of the following definition. Let a and b be two constants, −∞ < a < b < +∞, and an open set Ω we denote by X_a^b(Ω) the set of measurable functions σ on Ω such that a ≤σ ≤ b almost everywhere in Ω. We say that a functional

(8)

F : X_a^b(Ω) 7→X,X being a Banach space, isdifferentiable in X_a^b(Ω) at the point σ₀ ∈X_a^b(Ω) with respect to the L^p-norm, 1≤p≤ ∞, if there exists a linear and bounded operatorDF(σ₀) :L^p(Ω)7→X so that

kF(σ)−F(σ₀)−DF(σ₀)[σ−σ₀]kX

kσ−σ₀k_L^p_(Ω) →0 uniformly asσ∈X_a^b(Ω) converges toσ₀ inL^p(Ω).

Letσandσ+δσbelong toD. Iff ∈0L²(∂Ω) andδu=DF(σ)[δσ][f] we denote byR the weak solution to





div((σ+δσ)∇R) =−div(δσ∇δu) in Ω (σ+δσ)∇R·ν = 0 on∂Ω R|∂Ω∈0H^1/2(∂Ω)

then, by taking linear combinations, we notice that

R|∂Ω=F(σ+δσ)[f]− F(σ)[f]−DF(σ)[δσ][f].

In order to evaluatekRk_L²_(∂Ω) we introduce, as usual, the auxiliary functionw, weak solution to





div((σ+δσ)∇w) = 0 in Ω (σ+δσ)∇w·ν=R|∂Ω on∂Ω w|_∂Ω∈₀H^1/2(∂Ω)

and we infer that Z

∂Ω|R|²= Z

Ω(σ+δσ)∇w· ∇R=− Z

Ωδσ∇δu· ∇w.

We takep,qandrso that 2< r < Q(λ), 2< q < Q(σ) andp⁻¹+q⁻¹+r⁻¹= 1. Again by H¨older’s inequality kRk²_L2(∂Ω)≤ kδσk_L^p_(Ω⁰₎k∇δuk_L^q_(Ω⁰₎k∇wk_L^r_(Ω⁰₎.

First of all, by Meyers’s theorem, we notice that there exists a constantC depending onλ,n,r, Ω⁰ and Ω such that

k∇wk_L^r_(Ω⁰₎≤CkRk_L²_(∂_Ω).

By the Q(σ)-property ofσand sinceδσ is zero outside Ω⁰, we obtain, for a constantC₁ depending onλ, n,q, Ω⁰, Ω andσ,

k∇δuk_L^q_(Ω⁰₎≤C₁kδσ∇uk_L^q_(Ω⁰₎.

We choose ε >0 such thatq+ε < Q(σ) and we have, again by theQ-property, kδσ∇uk_L^q_(Ω⁰₎≤C₂kδσk_Lq(q+ε)/ε(Ω⁰)kfk_L²_(∂Ω) whereC₂depends onλ, n,q,ε, Ω⁰, Ω andσ.

Collecting all these estimates and by recalling thatσ+δσ∈ Dand hencekδσk_L^∞_(Ω⁰₎≤λ⁻¹, we deduce the following differentiability properties of F.

Proposition 2.6. LetF :D 7→B(₀L²(∂Ω)) be defined as

F(σ) = Λ(σ,·) for every σ∈ D,

let Dbe as in (2.3)and let σbelong to D. Then for everypsuch that p > Q(λ)Q(σ)−Q(σ)−Q(λ)^Q(λ)Q(σ) we may finds, 0 < s≤1, depending on p, Q(σ)and Q(λ) only and a constantC depending on λ, n,Ω⁰,Ω andσ only such

(9)

that

kF(σ+δσ)− F(σ)−DF(σ)[δσ]k_B(₀_L²_(∂Ω))≤Ckδσk^1+s_Lp(Ω⁰) (2.7) for any σ+δσ belonging toD.

Therefore F is differentiable inX_λ(Ω) =X_λ^λ⁻¹(Ω) at the point σ∈X_λ(Ω) with respect to the L^p-norm for any p > Q(λ)Q(σ)−Q(σ)−Q(λ)^Q(λ)Q(σ) .

We remark also that, since Q(λ)≤Q(σ) we have Q(λ)Q(σ)−Q(σ)−Q(λ)^Q(λ)Q(σ) ≥Q(σ)/(Q(σ)−2).

3. Existence results and Γ -convergence approximation

In this section we shall prove existence results for the solution of the minimization problems we have outlined in Section 1. We shall introduce a relaxed functional defined on a suitable space of functions of bounded variation. We shall show the existence of a minimum for the relaxed functional and, through a regularity argument, that such a minimum provides also a minimum for the original problem.

At the end of the section we shall recall an approximation, in the sense of Γ-convergence, of the relaxed functional by elliptic, even if not convex, functionals defined on spaces of smooth functions, for instance Sobolev spaces. We have used such an approximation, which has been developed by Ambrosio and Tortorelli in [A-T2], to perform some numerical simulations whose results will be the content of the next section.

We begin by briefly recalling some basic notations and properties of functions of bounded variation. For a more comprehensive treatment of the subject see, for instance [A-F-P, E-G] and [Gi].

Given a Borel functionσ: Ω7→R, Ω being an open and bounded subset ofRⁿ,n≥2, and a pointx∈Ω we define σ⁺(x), theapproximate upper limit ofσatx, as follows

σ⁺(x) = ap-lim sup

y→x σ(y) = inf

t∈R: lim

ρ→0⁺

|{y∈B_ρ∩Ω : σ(y)> t}|

ρⁿ = 0

·

In the same fashion we define σ⁻(x), theapproximate lower limit ofσatx, as σ⁻(x) = ap-lim inf

y→x σ(y) = sup

t∈R: lim

ρ→0⁺

|{y∈B_ρ∩Ω : σ(y)< t}|

ρⁿ = 0

·

Ifσ⁺(x) =σ⁻(x) then the common value will be called theapproximate limit ofσatxand will be denoted by

˜

σ(x) = ap-lim_y→xσ(y). If the approximate limit ofσatxdoes exist we say thatσisapproximately continuous at x. We define the jump set of σ as the subset of Ω where the approximate limit of σ does not exist. We denote the jump set of σbyS_σ and we notice that S_σ is a Borel set whose Lebesque measure is zero.

Given an open bounded set Ω⊂Rⁿ,n≥2, we denote byBV(Ω) the Banach space offunctions of bounded variation. We recall thatσ∈BV(Ω) if and only ifσ∈L¹(Ω) and its distributional derivativeDσ is a bounded vector measure. We endow BV(Ω) with the standard norm as follows. Givenσ∈BV(Ω), we denote by|Dσ|

the total variation of its distributional derivative and we set kσk_BV_(Ω)=kσk_L¹_(Ω)+|Dσ|(Ω).

For any σ∈ BV(Ω), by the Lebesque decomposition, we haveDσ =D^aσ+D^sσ where D^aσ is absolutely continuous with respect to the Lebesque measure whereasD^sσis singular with respect to the Lebesque measure.

We characterize the absolutely continuous part of Dσ as follows. We denote by ∇σ the density of D^aσ with respect to the Lebesque measure and we recall that, for almost every x ∈ Ω, ∇σ(x) coincides with the approximate gradient ofσatx, that is we have

ap- lim

y→x

σ(y)−σ(x)˜ − ∇σ·(y−x)

|y−x| = 0.

(10)

Ifσ∈BV(Ω) then S_σ is countably (Hⁿ⁻¹, n−1)-rectifiable that is there exists a countable family {Γ_i}^∞_i=1 of compact sets each of them contained in aC¹ hypersurface so thatHⁿ⁻¹(S_σ\ ∪^∞_i=1Γ_i) = 0. We characterize the singular part of Dσ as follows. The restriction ofD^sσ to S_σ will be called the jump part ofDσ and will be denoted by D^jσ. The remaining part ofD^sσ, that is the restriction ofD^sσto Ω\S_σ, will be called theCantor part of Dσ and will be denoted by D^cσ. Summing up we have that Dσ =Dâσ+D^sσ=Dâσ+D^jσ+D^cσ where Dâσ is absolutely continuous with respect to the Lebesque measure, D^sσ, D^jσ and D^cσ are singular with respect to the Lebesque measure andD^jσis concentrated on S_σ whereasD^cσis concentrated on Ω\S_σ.

We may characterize the jump part as follows

D^jσ= (σ⁺−σ⁻)ν_σHⁿ⁻¹|_S_σ

whereσ⁺andσ⁻are the approximate upper and lower limit ofσrespectively andν_σis given byD^jσ=ν_σ|D^jσ|, thereforeD^sσrestricted toS_σis absolutely continuous with respect to the (n−1)-dimensional Hausdorff measure.

We denote by SBV(Ω) the space of special functions of bounded variation that is the space of functions σ∈BV(Ω) whose singular part ofDσis concentrated on S_σ,S_σ being the jump set ofσ. Equivalently we say that σ∈BV belongs toSBV(Ω) if and only if the Cantor part of Dσis zero.

The special functions of bounded variation satisfy the following compactness and semicontinuity theorem due to Ambrosio [A1, A2].

Theorem 3.1 (SBV Compactness and Semicontinuity). Letting p >1, if {σh}^∞_h=1 is a sequence of functions belonging toSBV(Ω)satisfying for a given constant C >0

kσhk_BV_(Ω)≤C for any h

and Z

Ω|∇σh|^p+Hⁿ⁻¹(S_σ_h)≤C for anyh

then we may extract a subsequence, which we relabel {σ_k}^∞_k=1, such that σ_k converges in L¹(Ω) to a function σ∈SBV(Ω) and the following lower semicontinuity properties holds

Hⁿ⁻¹(S_σ)≤lim inf

k Hⁿ⁻¹(S_σ_k);

Z

Ω|∇σ|^p≤lim inf

k

Z

Ω|∇σk|^p.

The following remarks will be useful. Letσ∈BV(Ω). Ifa,bare two real numbers so thata < band we denote τ = (σ∧b)∨athenτ∈BV and

|∇τ| ≤ |∇σ| a.e. in Ω; Hⁿ⁻¹(S_τ\S_σ) = 0; |Dτ|(Ω)≤ |Dσ|(Ω).

Note that if σ∈SBV(Ω) then alsoτ∈SBV(Ω).

We notice also that if σ ∈ P H¹(Ω), that is σ ∈ L^∞(Ω) so that σ ∈ H¹(Ω\K) for some K closed in Ω, Hⁿ⁻¹(K)<∞, thenσ∈SBV(Ω) andHⁿ⁻¹(S_σ\K) = 0, see [DG-Ca-L].

On the other hand we shall need some conditions in order to ensure that an SBV(Ω) function belongs to P H¹(Ω). For anyσ∈SBV(Ω), anyA⊂Ω and any constantβ >0 we define

M S(σ, β, A) = Z

A|∇σ|²+βHⁿ⁻¹(S_σ∩A).

If A = Ω we shall simply writeM S(σ, β,Ω) = M S(σ, β). Also the dependence upon the constantβ will be suppressed when its role is clear from the context.

Let us assume that σ ∈ SBV(Ω)∩L^∞(Ω). Moreover let a, b be such that a≤ σ ≤b almost everywhere in Ω. We also assume that M S(σ, β,Ω) is finite. In order to ensure that σ∈P H¹(Ω) we only needS_σ to be essentially closed, that isHⁿ⁻¹((S_σ∩Ω)\S_σ) = 0. In this case, settingK=S_σ∩Ω, we have thatσ∈P H¹(Ω).

We have the following result which is an immediate consequence of Proposition 4.12 in [DG-Ca-L].

(11)

Proposition 3.2. Let σ ∈SBV(Ω) be such that a≤σ≤b almost everywhere in Ω, wherea, b are two real numbers so that a < b. We assume that, for a constantβ >0,M S(σ, β,Ω)is finite.

If for every compact set A⊂Ωwe have a constantC >0and a constant p,1≤p < n/(n−1), so that M S(σ, β, A)≤M S(τ, β, A) +Ckσ−τk_L^p_(A)

for anyτ ∈SBV(Ω) satisfyinga≤τ ≤b almost everywhere in Ωand such that τ =σ outsideA, then S_σ is essentially closed.

Let us now consider the case of piecewise constant functions. Clearly a piecewise constant function σ, that is a function σ∈L^∞(Ω) such thatσ∈H¹(Ω\K) with ∇σ= 0 almost everywhere in Ω\K,K being closed in Ω,Hⁿ⁻¹(K)<∞, is anSBV(Ω) function such thatHⁿ⁻¹(S_σ\K) = 0 and∇σ= 0 almost everywhere in Ω.

Let us look at some properties of SBV(Ω) functions whose approximate gradient ∇σ is zero almost everywhere. If σ satisfies these assumptions and we have also thatHⁿ⁻¹(S_σ)<∞then, see ([Co-Ta], Lem. 1.11), there exists a Borel partition {U_i}^∞_i=1 of Ω and a sequence{t_i}^∞_i=1in Rwitht_i6=t_j ifi6=j so that

σ= X∞ i=1

t_iχ_U_i a.e. in Ω

and X∞

i=1

P(U_i,Ω)<∞

where with P(U_i,Ω) we denote theperimeter ofU_i in Ω that is|Dχ_U_i|(Ω).

Again if σ ∈SBV(Ω)∩L^∞(Ω) is such that∇σ = 0 almost everywhere in Ω andHⁿ⁻¹(S_σ)<∞we have that σis piecewise constant ifS_σ is essentially closed. In fact in this case, by takingK=S_σ∩Ω we have that usatisfies the definition of piecewise constant functions.

A result analogous to Proposition 3.2 may be obtained also for this case. Here we refer to ([Co-Ta], Th. 2.4).

Proposition 3.3. Let σ ∈SBV(Ω) be such that a≤σ≤b almost everywhere in Ω, wherea, b are two real numbers so that a < b. Let us assume that ∇σ is zero almost everywhere in Ωand let T be a countable subset of [a, b] so thatσ(x)∈T for almost everyx∈Ω. We also assume thatHⁿ⁻¹(S_σ)<∞.

If for every compact set A⊂Ωwe have a constantC >0and a constant p,1≤p < n/(n−1), so that Hⁿ⁻¹(S_σ∩A)≤ Hⁿ⁻¹(S_τ∩A) +Ckσ−τk_L^p_(A)

for any τ ∈ SBV(Ω) such that τ(x)∈ T almost everywhere in Ω and such that τ =σ outsideA, thenS_σ is essentially closed.

With these results we can prove the existence of a solution to the following kinds of minimization problems.

First we consider

(σ,K)min

G(σ, K) =R

Ω\K|∇σ|²+βHⁿ⁻¹(K) +γF(σ) K closed in Ω; σ∈H¹(Ω\K); a≤σ≤b a.e. in Ω\K

(3.1) then we consider also the minimal partition version

(σ,K)min





G₁(σ, K) =βHⁿ⁻¹(K) +γF(σ) K closed in Ω; σ∈H¹(Ω\K)

∇σ= 0 a.e. in Ω\K; a≤σ≤b a.e. in Ω\K



· (3.2)

Here β and γ are positive parameters whereasa and b satisfy−∞< a < b < +∞. We remark that we may impose,without loss of generality,Hⁿ⁻¹(K)<∞, thereforeσwill be defined almost everywhere in Ω.

(12)

The following theorem may be proved:

Theorem 3.4. If the faithfulness term F(σ) is Lipschitz continuous from L^p(Ω) in R for some p, 1 ≤ p <

n/(n−1), then the minimization problems (3.1)and (3.2)admit a solution.

Proof. We begin by proving the existence of a solution to problem (3.1). For any σ ∈ SBV(Ω) such that a≤σ≤b almost everywhere in Ω we define the following relaxed functional

G(σ) =˜ Z

Ω|∇σ|²+βHⁿ⁻¹(S_σ) +γF(σ).

We notice that on X_a^b(Ω), the set of measurable functionsσon Ω satisfyinga≤σ≤balmost everywhere in Ω, the faithfulness functionalF is continuous with respect to the L¹(Ω) norm (indeed with respect to anyL^q(Ω) norm, 1≤q≤ ∞).

We remark also that for any admissible couple (σ, K) in (3.1) we have thatσ∈SBV(Ω) and

G(σ, K) = ˜G(σ). (3.3)

Then we prove existence of a solution for the following minimization problem

min{G(σ) :˜ σ∈SBV(Ω) anda≤σ≤b} · (3.4) The SBV Compactness and Semicontinuity Theorem 3.1 and the direct method in the Calculus of Variations allow us to prove immediately that such a minimization problem admits a solution, which we denote byσ.

Then for any admissibleτ∈SBV(Ω) we infer

M S(σ) +γF(σ)≤M S(τ) +γF(τ) and then

M S(σ)≤M S(τ) +γ(F(τ)−F(σ))≤M S(τ) +Ckτ−σk_L^p_(Ω).

So, by applying Proposition 3.2, we obtain thatS_σis essentially closed and this in turn implies that (σ, S_σ∩Ω) is an admissible couple in (3.1). By (3.3) we also immediately deduce that (σ, S_σ∩Ω) solves (3.1).

With a completely analogous reasoning also the existence of a solution to (3.2) may be proved. We introduce the relaxed functional

G˜₁(σ) =βHⁿ⁻¹(S_σ) +γF(σ)

for anyσ∈SBV(Ω) such that a≤σ≤b and∇σ= 0 almost everywhere in Ω. We minimize ˜G₁ on this class of functions. The SBV Compactness and Semicontinuity Theorem 3.1 ensures that any minimizing sequence converges in L¹ to a function of the same class which is therefore a minimum for ˜G₁. The same regularity argument, using this time Proposition 3.3, shows the existence of a couple which solves (3.2).

Remark 3.5. In the classical formulation of the Mumford–Shah problem, that is whenF(σ) =R

Ω|σ−g|²with g∈L^∞(Ω), wea priori know that a minimumσof the relaxed functional ˜G(or ˜G₁respectively) has to satisfy kσk_L^∞_(Ω) ≤ kgk_L^∞_(Ω). Therefore, in that case, no a priori bound on the ess-inf_Ωσ and on the ess-sup_Ωσis required.

Here, however, we have replaced the usual faithfulness term of the Mumford–Shah with a more general functionalFwhich may have some compactness properties which produce a lack of coercivity for the functionals G˜ and ˜G₁. Then we have enforced ana priori lower and upper bound on the values of the admissible functions.

This allows us to have coercivity and hence the existence results. Moreover we would like to apply this method to develop reconstruction procedures in inverse problems. Usually these inverse problems are illposed anda priori bounds on the admissible solutions are needed in order to ensure stability. We may see our bounds on the L^∞ norm of the admissible functionsσas such a kind ofa priori bound.

(13)

Such bounds arise in a natural way when we deal with digitized images, since we have natural bounds on the grey levels of the picture used, and also in the inverse conductivity problem where we have to ensure that σ satisfies (1.1) for a positive constantλso that the Neumann-to-Dirichlet map is well defined.

We may also impose further regularity conditions on the faithfulness term F so that the solutionσto (3.1) outsideK=S_σ∩Ω is more regular than simply belonging toH¹, for instance it isC¹. For example, this is the case for the classical formulation of the Mumford–Shah problem, see [DG-Ca-L]. This result would allow us to formulate our minimization problems in a class of more regular functions, for instance piecewise C¹ functions, and, nevertheless, to obtain existence results.

Proposition 3.6. Assume that the hypotheses of Theorem 3.4 are satisfied. We also assume that for any σ₀∈X_a^b(Ω)there existsp,1≤p < n/(n−1), such thatF is differentiable inX_a^b(Ω)at the pointσ₀ with respect to the L^p-norm.

Then if (σ, K) is a solution to the minimization problem (3.1), we have thatσ∈C¹(Ω\K).

Proof. We already know that there exists a solution (σ₀, K) to (3.1). We consider a point x₀ ∈Ω\K and we want to prove that locally in a neighbourhood ofx₀the functionσ₀isC¹. First of all we considerDF(σ₀). We may identify it with an L^q(Ω) functionf, withq > n.

Our argument is based on regularity theory for variational inequalities and obstacle problems. We refer to [K-St].

We introduce the following notation. If σ∈H¹(Ω), Ω being an open set, we say thatσ(x)>0 atx∈Ω (in the sense of H¹(Ω)) if there exist B_r(x) andφ∈H₀^1,∞(B_r(x)),φ≥0,φ(x)>0, so that u−φ≥0 onB_r(x) (in H¹). Remark that the set {x∈Ω : σ(x)>0} is open.

Let us assume thatx₀is such thata < σ₀(x₀)< b. Then we have that locallyσ₀solves the following equation

∆σ₀=f in B_r(x₀).

Then, sincef ∈L^q withq > n, standard regularity arguments shows thatσ₀is C¹ in a neighbourhood ofx₀. Some care is needed in treating the case when σ₀(x₀) is either equal toaor tob. We consider only the first case since the second one may be treated in a completely analogous way.

Without loss of generality we may assume that x₀ = 0, that B_r = B_r(0) is contained in Ω and that σ₀<(a+b)/2 inB_r. We fix

K={τ∈H¹(B_r) : τ =σ₀ on∂B_randτ ≥ain B_r} and we notice that if we denote by τ₀ the solution to

minτ∈K

1 2 Z

Br

|∇τ|²+ Z

Br

f τ

we have thatτ₀∈C¹(B_r), see [K-St].

Therefore it will be enough to prove that σ₀ =τ₀ to obtain our result. This may be achieved through the following procedure. First of all we notice that, sinceσ₀ inB_r is a supersolution in the following sense

Z

Br

∇σ₀· ∇φ≥ − Z

Br

f φ for anyφ∈H₀¹(B_r), φ≥0

and clearly belongs to K we have that τ₀ ≤ σ₀ in B_r. We assume, by contradiction, that the open set D ={x∈ B_r : σ₀(x) > τ₀(x)} is not empty. We notice the following facts. Since we have that σ₀ =τ₀ on

∂B_r we infer thatσ₀=τ₀ on∂D. Furthermore we have that ∆σ₀=f in D and this, in turn, implies thatσ₀ coincides withτ₀ onD. This contradiction allows us to conclude the proof.