Anisotropic Diffusion in ITK

(1)

HAL Id: hal-01121511

https://hal.archives-ouvertes.fr/hal-01121511

Preprint submitted on 3 Mar 2015

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Anisotropic Diffusion in ITK

Jean-Marie Mirebeau, Jérôme Fehrenbach, Laurent Risser, Shaza Tobji

To cite this version:

Jean-Marie Mirebeau, Jérôme Fehrenbach, Laurent Risser, Shaza Tobji. Anisotropic Diffusion in ITK. 2015. �hal-01121511�

(2)

Anisotropic Diffusion in ITK

Jean-Marie Mirebeau1, J´erˆome Fehrenbach2, Laurent Risser2, Shaza Tobji2 March 3, 2015

Abstract

Anisotropic Non-Linear Diffusion is a powerful image processing technique, which allows to simultaneously remove the noise and enhance sharp features in two or three dimensional images. Anisotropic Diffusion is understood here in the sense of Weickert, meaning that diffusion tensors are anisotropic and reflect the local orientation of image features. This is in contrast with the non-linear diffusion filter of Perona and Malik, which only involves scalar diffusion coefficients, in other words isotropic diffusion tensors. In this paper, we present an anisotropic non-linear diffusion technique we implemented in ITK. This technique is based on a recent adaptive scheme making the diffusion stable and requiring limited numerical resources.

1 Introduction

A digital image is usually a large two or three dimensional array of pixel values (typically scalars or vectors). Image processing methods based on Partial Differential Equations (PDEs) regard images as approximations of continuous objects, namely functions from an image domain to a pixel space, to which physics-inspired evolution rules can be applied. Among them, Non-Linear Anisotropic Diffusion (NLAD) is a variant of the heat equation, generalized in two regards: Non-Linearity and Anisotropy.

Anisotropy in diffusion means that the smoothing induced by the PDE can be favored in some directions and prevented in others. This is specified by local eigenvectors and eigenvalues of the diffusion tensor field (see §2). Diffusion coefficients are thus location and direction dependent, generalizing the approach of Perona and Malik [2] which is only location dependent. Importantly, efficient schemes for anisotropic diffusion have been recently made possible by the breakthrough in [1]. This has motivated the development of the ITK module presented in this paper.

1_{University Paris-Dauphine, Laboratory Ceremade, CNRS.} 2

Institut de Math´ematiques de Toulouse, Universit´e Paul Sabatier, CNRS.

(3)

Non-Linearity in diffusion means that diffusion tensors are automatically generated from the processed image. We implemented the strategies of Weickert [3] and we give a simple framework for designing extensions and variants, see §3. The implemented filters and their parameters are described in §4. Figures2and3illustrate their effect on 2D images; Figures4and5on 3D images; Figures6 and7 on color and vector images.

A possible application of NLAD is to enhance a fingerprint image by smoothing tangentially to the lines. Evidence is also plentiful for NLAD relevance in many other image processing ap-plications, but its use has been limited by technical aspects so far. We intend to alleviate such limitations with the present contribution.

Notations: Let d ∈ {2, 3} denote the image dimension, let Ω ⊂ Rd_{be the image domain, and let}

V be the pixel space (e.g. [0, 1] for grayscale, [0, 1]3for color, Rdfor vectors). Throughout the paper, we informally consider an idealized cartoon image model, involving a set Γ ⊂ Ω of image contours of dimension d − 1. The processed image u : Ω → V is smooth on Ω \ Γ, but has discontinuities across Γ, and is overall corrupted by e.g. by additive white noise. A key feature of NLAD is its ability to detect the set Γ and smoothen tangentially to it. Finally, let S_d+ denote the collection of symmetric positive definite d × d matrices, and let Id be the identity matrix. To each D ∈ S_d+ we associate the norm kekD :=phe, Dei, e ∈ Rd, where h·, ·i denotes the standard scalar product on

Rd.

2 Linear Anisotropic diffusion

Linear Anisotropic Diffusion1 (LAD), in divergence form, is an elliptic PDE which reads

∂tu = div(D ∇u), (1)

where D : Ω → S+_d is a given field of symmetric positive definite diffusion tensors. Eigenvectors of these tensors define preferential diffusion directions, and the eigenvalues their corresponding coefficients. Evolution rule (1) is complemented with an initial condition u(0, ·) = u0 at time

t = 0. If u0 has pixels of vector type, then their components are treated independently. We use

Neumann2 conditions on the domain boundary ∂Ω, as is common in image processing. LAD is formally a continuous gradient descent for the elliptic energy

E(u) := Z

Ω

k∇u(x)k2_D(x)dx. (2)

Qualitative effects of LAD strongly depend on the chosen field D of diffusion tensors. Choosing D = Id identically on Ω yields the standard heat equation, which qualitative properties are well known in image analysis: any noise present in the image u is quickly eliminated, but in the meanwhile all image sharp features are blurred.

This undesirable side effect can be limited with a proper choice of diffusion tensors D : Ω → S_d+. Indeed LAD smoothes primarily the image features which contribute strongly to the energy (2). In the spirit of Perona and Malik, one can introduce an isotropic but variable conductivity D(x) = c(x) Id, with c(x) 1 for all x close to the image contours Γ, see introduction. Smoothing is prevented in the neighborhood of Γ, which preserves the contours sharpness, but also traps some

1

We use here the terminology of Weickert [3]. Perona-Malik diffusion, which uses an adaptive scalar tensor field similar to (4), is in contrast a Non-Linear Isotropic Diffusion equation.

2

(4)

noise along them, see Figure 2 (IV). A more elaborate approach is to construct anisotropic diffu-sion tensors D(x), which favor diffudiffu-sion tangentially to the contours curves Γ, but simultaneously prevent diffusion transversally to these curves and between different image regions. All noise is eliminated, yet image discontinuities are preserved, see Figure2 (II).

Numerical schemes for LAD are in general non-trivial due to interaction between the anisotropic geometry of the diffusion tensors, and the cartesian structure of the pixel grid. The authors recently developed [1] a numerical scheme which handles this interaction using special tools from discrete geometry, named Lattice Basis Reduction (LBR). It provides strong mathematical guarantees (con-sistency, stability, maximum-principle) for a limited numerical cost.

3 Non-Linear Anisotropic Diffusion.

Linear Anisotropic Diffusion, discussed in §2, requires two main inputs: an image u0 serving as

an initial condition, and a field of diffusion tensors. In order to reduce user input, the diffusion tensors D can be defined in terms of the filtered image u. The resulting PDE is called non-linear anisotropic diffusion

∂tu = div(Du∇u), (3)

complemented, again, with Neumann boundary conditions. Perona and Malik [2] suggested to use the following non-linear isotropic (i.e. proportional to the identity matrix) tensors

Du(x, t) := cu(x, t) Id, with cu(x, t) :=

1

p1 + k∇u(x, t)k2_/λ2, (4)

where λ > 0 is a user specified constant. Diffusion is prevented where the conductivity cu(x, t)

is small, in other words where k∇u(x, t)k is large, such as along the image contours Γ. Perona-Malik diffusion is already available3 in ITK. It has been the subject of considerable academic and industrial interest revealing that, in spite of its numerous qualities, it is mathematically ill posed, unstable, often leads to unsightly “staircasing” visual artifacts, and is not adequate for oscillating patterns as in Figure3.

We describe in the following Coherence Enhancing Diffusion (CED) and Edge Enhancing Dif-fusion (EED), which are based on more complex tensor constructions introduced by Weickert [3]. Our first ingredient is the Gaussian convolution kernel: given a standard deviation σ > 0

Kσ(x) := 1 σdK1 x σ , where K1(x) := 1 (2π)dexp −kxk2 2 . (5)

The structure tensor Su : Ω → S_d+, defined below, is a robust estimator of the gradient direction in

an image u, even if this image has oscillating textures. It depends on two small positive parameters: the noise scale σ, and the feature scale ρ. We denote by ∗ the convolution operator, and by v ⊗ v = vvT the self outer product, which yields a semi-definite symmetric matrix.

Su := Kρ∗ (∇uσ⊗ ∇uσ), where uσ := Kσ∗ u. (6)

If u is an image with vector pixels, then Su is the sum of the structure tensors associated to the

components of u. Assume that u is a scalar image, fix a time t and a point x ∈ Ω, and denote S := Su(x, t), v := ∇u(x, t). Let also λ1 ≤ · · · ≤ λddenote the eigenvalues of S, sorted by increasing

magnitude, and e1, · · · , ed the corresponding unit eigenvectors. If u is sufficiently smooth, then the

3

(5)

largest eigenvalue approximates the gradient squared norm: λd ≈ kvk2, while the corresponding

eigenvector approximates the gradient direction: ed≈ ±kvkv .

Weickert’s diffusion tensors D := Du(x, t), are defined in terms of this eigen-analysis of the

structure tensor S := Su(x, t): D = X 1≤i≤d µiei⊗ ei, where S = X 1≤i≤d λiei⊗ ei. (7)

Smoothing is promoted in the direction ei if µi is large, and prevented if µi is small, for any

1 ≤ i ≤ d. Weickert’s classical constructions are presented in (8) and (11). One should not shy away of designing more complex and application dependent variants; for instance one may want to enhance filaments and tubular structures in 3D data. Three very simple variants (9), (10) and (12) are presented for illustration. All depend on three parameters λ, m, α. The main one, λ > 0, is an edge detection threshold. The exponent m is typically 2 or 4. The small parameter α, typically 1/100, determines the condition number of the diffusion tensors.

• Edge Enhancing Diffusion (EED) aims to avoid significant diffusion across the set Γ of image contours, but to allow it anywhere else. Note that for 3D images discontinuity planes will be enhanced, rather than edges. The first diffusion tensor eigenvalue is µ1 = 1, because the

eigenvector e1 is orthogonal to the image (approximate) gradient direction ed, hence never

transverse to Γ. Other eigenvalues satisfy µi ≈ α 1 if λi− λ1 & λ, and µi ≈ 1 otherwise.

The condition λi− λ1 & λ indeed suggests that the eigenvector ei points through an image

contour. Precisely4: (note that µ1 = 1)

µi := 1 − (1 − α) exp − λ λi− λ1 m . (8)

The choice of Weickert, to set µ1 = 1, may lead to undesired effects: one always performs

diffusion in at least one direction. An undesirable side effect is that the image is blurred close to the angles of its contour set Γ. We believe that such salient features should be preserved, hence we introduce a Conservative variant of EED (cEED) for which µ1 can be small, when

appropriate, so as to prevent diffusion around the angles of Γ, see Figure8 for a comparison. Precisely µi:= 1 − (1 − α) exp − λ λi m . (9)

If all eigenvalues are set equal µ1 = · · · = µd, then the diffusion tensors are isotropic, in other

words scalar multiples of the identity. The following isotropic variant of EED is close in spirit to the Perona-Malik model: diffusion is prevented in the neighborhood of the image contours Γ, regardless of direction. This construction is implemented purely for comparison with the anisotropic ones, and does not take advantage of the innovative numerical scheme developed by the authors µi:= 1 − (1 − α) exp − λ λd m . (10)

• Coherence Enhancing Diffusion (CED) prevents diffusion except along local image structures which have a coherent direction. The diffusion tensor eigenvalues satisfy µi ≈ α 1, unless

4

Actually, Weickert uses µ1= 1 together with (9) for i > 1. This results in discontinuous diffusion tensors, which

(6)

Figure 1: The provided filters expect image pixels of scalar or vector pixel type. A preliminary cast is required for integral pixel types (left), or non-Vector pixel types such as RGB (right). The provided executable (or “.cxx” file) performs these casts automatically.

if λd− λi & λ in which case µi≈ 1. The condition λd− λi& λ indeed suggests that ed points

through an image feature, and that ei points tangentially to it. Precisely: (note that µd= α)

µi := α + (1 − α) exp − λ λd− λi m . (11)

The above formula often leads to false positives: at a position with large gradients, but without a clear preferred direction, one may very well have λd− λi λ (for instance if

λd = 100λ and λi = 95λ). A more reliable coherence detector is λm − λi λ + λi, which

leads to a Conservative variant of CED (cCED), see Figure 8 for a comparison. Precisely: (note that µd= α) µi := α + (1 − α) exp − λ + λi λd− λi m . (12)

We emphasize that the distinction between EED and its conservative variant cEED (resp. CED and cCED) is rather subtle, and mostly located around image contour corners, as evidenced on Figure 8. In other illustrations, we only show the conservative variant, which is slightly better at preserving detail.

4 Implemented filters and their parameters

Our contribution to ITK consists of the following four image filters, which implement the mathemat-ical notions presented in §2and §3. The figures are produced with the last filter, which implements CED, EED and their variants described §3. The first three filters are its building blocks, but they may be of independent interest for other applications. All filters are multithreaded.

For each filter, the processed image pixels can be of scalar or vector type. In the latter case, the underlying floating point type needs to be specified via the second template parameter of the filter, see Figure 1 (bottom right). Pixels of integral type (resp. RGB pixels) must be cast to floating point (resp. Vector ) types, see Figure 1. The image dimension must be 2 or 3.

Linear Anisotropic Diffusion (LAD). The filter LinearAnisotropicDiffusionLBRImage-Filter , requires two inputs: a processed image and a tensor image. Note that non-linear diffusion is achieved through successive linear diffusions, over multiple small time intervals, with regularly updated diffusion tensors. Parameters:

• MaxDiffusionTime specifies the target physical time for the LAD evolution PDE (1). The filter has early abort options, as discussed in the next point, hence one should check the EffectiveDiffusionTime at termination.

(7)

• MaxNumberOfTimeSteps, RatioToMaxStableTimeStep. Explicit numerical schemes for diffusion are subject to a Courant-Friedrichs-Levy (CFL) condition, which limits the largest stable time step. This time step, which depends on the input diffusion tensors, is automatically computed by the filter, and the requested time interval [0, MaxDiffusionTime] is split accordingly. Early abort occurs if this splitting exceeds the specified MaxNum-berOfTimeSteps.

Structure tensor. Filter StructureTensorImageFilter . Parameters:

• NoiseScale σ, and FeatureScale ρ, see (6). Suggested ranges: σ ∈ [0.5, 3], λ ∈ [2, 10], assuming a unit pixel spacing. The lower bounds of these intervals are recommended, unless noise is extremely strong.

• RescaleForUnitMaximumTrace. If on, the structure tensors are rescaled: S_u0 = αSu,

where α > 0 is the largest constant such that Trace(S0_u(x)) ≤ 1 for all x ∈ Ω. (The trace is the sum of the eigenvalues λ1+ · · · + λd, see (7).) This option is meant to ease

the choice of the edge detection threshold λ in CED, EED. One may want to check the variable PostRescaling = α at termination.

Non-Linear Anisotropic Diffusion (NLAD). Filter AnisotropicDiffusionLBRImageFilter . This class (I) computes structure tensors by invoking the previous filter, (II) performs their eigen-analysis, (III) changes them into diffusion tensors via (7), (IV) runs linear diffusion with the constructed tensors by invoking the first filter. After a limited number of time-steps of linear diffusion, the steps (I-II-III-IV) are repeated so as to update the diffusion tensors, until exhaustion of the prescribed diffusion time. Parameters:

• DiffusionTime for which the evolution rule (3) is applied.

• Adimensionize. If on, the filter ignores the image pixel spacing information, and sets on the RescaleForMaximumUnitTrace option for structure tensor generation. This is intended to ease the choice of DiffusionTime, and of the edge detection threshold λ in CED, EED, and variants.

• MaxTimeStepsBetweenTensorUpdates is self descriptive. NoiseScale and FeatureScale are passed for structure tensor generation.

• EigenValuesTransform is a virtual method used to construct the diffusion tensor eigen-values (µi)di=i from those of the structure tensors (λi)di=1, which are sorted increasingly

for convenience. The method must be redefined in a subclass, as in the next filter, else it triggers an exception.

Coherence-Enhancing diffusion and Edge-Enhancing diffusion. The two PDEs, and their variants, are implemented in the same filter CoherenceEnhancingDiffusionFilter , which subclasses the Non-Linear Anisotropic Diffusion filter. Parameters:

• Enhancement allows to switch between EED, cEED, CED, cCED and Isotropic (10) tensor constructions, by redefining the superclass virtual method EigenValuesTransform. The relevant choice depends on the type of image structures that one wants to enhance. • Lambda= λ, Exponent = m, Alpha= α are the parameters involved in tensor design (8)

- (12).

• Suggested parameter ranges. Adimensionize=True. DiffusionTime ∈ [0.5, 5], although larger values can be relevant for very strong noise or artistic effects. Edge detection

(8)

Figure 2: I: Source image. II: cEED is the best pick. III: cCED only affects the directionally coherent parts, the image region contours, which is clearly inadequate. IV: Isotropic (10, Perona-Malik like) diffusion leaves some noise trapped along the image contours. Parameters: T = 20, λ = 0.05, σ = 3, others default.

Figure 3: I: Source image. II: cEED is not advised, since it blurs the junctions of the fingerprint lines. III: cCED is the best pick, since it enhances the fingerprint lines, and does not blur the more complex regions. IV: Isotropic (10, Perona-Malik like) diffusion either fails to remove noise, or blurs the fingerprint lines, depending on the image region. Parameters: T = 20, λ = 0.02, others default.

threshold λ ∈ [10−3, 5 × 10−2], small for complex images with detail, large for sim-ple “cartoon” like images. Finally m ∈ [2, 4], α = 0.01, though these parameters are secondary and have little impact.

References

[1] J´erˆome Fehrenbach and Jean-Marie Mirebeau. Sparse Non-negative Stencils for Anisotropic Diffusion. Journal of Mathematical Imaging and Vision, 2013. 1,2

[2] P Perona and J Malik. Scale-space and edge detection using anisotropic diffusion. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1990. 1,3

(9)

Figure 4: Left: Volume plot and slice of an FMRI of the human skull, with artificially added Gaussian noise of variance 0.01 (data range: [0, 1]). In the volume plot, values below 0.08 are shown transparent. Right: Effect of cEED. Parameters: T = 5, λ = 0.003.

Figure 5: Synthetic function cos(kxk3), x ∈ [0, 1]3, corrupted with Gaussian noise of variance 0.5. Left: level lines, and slice {x = 0.1}. Right: effect of CED. By construction, CED barely affects the neighborhood of the origin (0, 0, 0), where no specific coherent direction can be determined. Parameters: T = 10, λ = 0.02, σ = 4, ρ = 10.

Figure 6: I: Detail of the Lena image. II: cEED removes noise and preserves most image detail. III: cCED enhances the flow structure of the hat plumes, without much affecting the rest of the image. In particular it does not remove noise. IV: Isotropic diffusion (10, Perona Malik like) removes noise in the interior of the image regions, but leaves some noise close to image contours such as the border of Lena’s cheek. Parameters: T = 2, λ = 0.003, m = 4, others default.

(10)

Figure 7: Left: Directions and norms (black=0, white=1) of the vector field v(x) := sign(kxk − 1) x⊥/kxk on [−1.3, 1.3]2, degraded by gaussian noise of variance 2, on a 50 × 50 grid. Norms of these vectors (black=0, white=1). Right: Effect of cEED. Note that the streamlines are better reconstructed, and that the norms vanish along the (approximate) circle kxk = 1 where v(x) changes sign so that there is a cancellation effect (and likewise close to the vector field singularity at the center). Parameters: T = 10, λ = 0.05, others default.

Figure 8: Left: (I) source image, (II) effect of cEED, (III) effect of EED. Note that the triangle corners are better preserved with cEED. Parameters: T = 5, λ = 0.03. Right: (I) source image, (II) effect of cCED, (III) effect of CED. Note that cCED better preserves contrast along where the two fronts meet. Parameters: T = 20, λ = 0.05.

Anisotropic Diffusion in ITK

HAL Id: hal-01121511

https://hal.archives-ouvertes.fr/hal-01121511

Anisotropic Diffusion in ITK

Jean-Marie Mirebeau, Jérôme Fehrenbach, Laurent Risser, Shaza Tobji

To cite this version:

Anisotropic Diffusion in ITK

Contents

1

Introduction

2

Linear Anisotropic diffusion

3

Non-Linear Anisotropic Diffusion.

4

Implemented filters and their parameters

References