MULTI-SPECTRAL IMAGE ANALYSIS FOR SKIN PIGMENTATION CLASSIFICATION

(1)

HAL Id: inria-00499492

https://hal.inria.fr/inria-00499492

Submitted on 9 Jul 2010

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of

sci-entific research documents, whether they are

pub-lished or not. The documents may come from

teaching and research institutions in France or

abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est

destinée au dépôt et à la diffusion de documents

scientifiques de niveau recherche, publiés ou non,

émanant des établissements d’enseignement et de

recherche français ou étrangers, des laboratoires

publics ou privés.

MULTI-SPECTRAL IMAGE ANALYSIS FOR SKIN

PIGMENTATION CLASSIFICATION

Sylvain Prigent, Xavier Descombes, Didier Zugaj, Philippe Martel, Josiane

Zerubia

To cite this version:

Sylvain Prigent, Xavier Descombes, Didier Zugaj, Philippe Martel, Josiane Zerubia.

MULTI-SPECTRAL IMAGE ANALYSIS FOR SKIN PIGMENTATION CLASSIFICATION. IEEE

Inter-national Conference on Image Processing (ICIP), Sep 2010, Hong Kong, China. �inria-00499492�

(2)

MULTI-SPECTRAL IMAGE ANALYSIS FOR SKIN PIGMENTATION CLASSIFICATION

Sylvain Prigent

1

, Xavier Descombes

1

, Didier Zugaj

2

, Philippe Martel

2

, Josiane Zerubia

1

_{EPI Ariana INRIA/I3S, 2004 route des Lucioles, BP93 06902 Sophia Antipolis, Cedex. France}

email: Sylvain.Prigent@inria.fr, Xavier.Descombes@inria.fr, Josiane.Zerubia@inria.fr

2

_{Galderma R&D, 2400 Route des Colles, BP87, 06902 Sophia-Antipolis, Cedex. France}

email: Didier.Zugaj@galderma.com, Philippe.Martel@galderma.com

ABSTRACT

In this paper, we compare two different approaches for semi-automatic detection of skin hyper-pigmentation on multi-spectral images. These two methods are support vector ma-chine (SVM) and blind source separation. To apply SVM, a dimension reduction method adapted to data classification is proposed. It allows to improve the quality of SVM clas-sification as well as to have reasonable computation time. For the blind source separation approach we show that, using independent component analysis, it is possible to extract a relevant cartography of skin pigmentation.

Index Terms— skin hyper-pigmentation, multi-spectral images, support vector machine, independent component analysis, data reduction.

1. INTRODUCTION

An accurate quantification of skin diseases is important both for quantifying the degree of the pathology, and evaluating as early as possible, the efficiency of a treatment. In this pa-per, we propose to use multi-spectral and hyper-spectral im-ages to analyse hyper-pigmentation of the skin. The interest of working with multi-spectral images is to have a more ac-curate information on skin properties than those obtained on conventional cameras. Several methods based on absorbance spectrum in visible and near infra-red have already been pre-sented in the literature for quantifying the melanin in the skin [1, 2]. The main idea of these methods is to select specific spectral bands in the data in order to extract information on skin pigmentation. One of the most famous algorithms has been proposed by Stamatas in [1]. This algorithm is based on the analysis of the skin chromophores absorbance spectrum. An affine model of the melanin absorbance is proposed in the spectrum range of 630-700 nm:

Am(λ) = aλ + b,

where Amrepresents melanin absorbance, a and b are linear

coefficients and λ the wavelength. The spectrum can be cor-rected by substracting the influence of melanin:

Ac(λ) = A(λ) − Am(λ).

Concentrations of oxy-hemoglobin and deoxy-hemoglobin are calculated by solving the system given by the Beer-Lamber law applied to two specified wavelengths (λ1 = 560

nm , λ2= 580 nm):

Ac(λ) = [oxy Hb]∗oxy Hb(λ)+[deoxy Hb]∗deoxy Hb(λ),

where oxy Hband deoxy Hbrepresent extinction coefficients

which have tabulated values. Finally, corrected melanin con-centration cartography can be extracted by correcting the affine estimation of melanin concentration taking into ac-count the deoxy-hemoglobin influence in the 630-700 nm range:

[M elanin]c= [M elanin]

− ([deoxy Hb] ∗ a[deoxy Hb] + b[deoxy Hb]) ,

where a[deoxy Hb] and b[deoxy Hb] are coefficients in the

lin-ear model of the deoxy-hemoglobin absorbance in the range 630-700 nm.

In this paper, we propose to extract melanin component con-sidering the whole spectrum. Two methods are proposed. The first one is based on support vector machines (SVMs) with a preprocessing dimension reduction step. The second method quantifies the melanin component assuming that the multi-spectral image is a linear mixture of components constituting the skin. SVMs are considered as one of the most efficient methods to classify multi and hyper-spectral images. SVM computs a class separator from learning pixels in order to classify all the data. For hyper-spectral images, SVMs are of-ten associated with dimension reduction algorithms [3], such as principal component analysis (PCA) or projection pursuit [4], in order to avoid the Hughes phenomenon [5]. In this paper, a projection pursuit method has been chosen since it permits dimension reduction based on an index I optimizing the classification. As an unmixing method we propose to use independent component analysis (ICA).

The paper is organized as follows. In section 2 we present the projection pursuit algorithm and classification by SVM. In section 3 we compare several ICA methods. In section 4 we report obtained results. In section 5 conclusions are drawn.

(3)

2. PROJECTION PURSUIT WITH SVM 2.1. Projection pursuit

The common way proposed in the literature to select spectral bands for skin analysis is to do it manually. Here, we pro-pose to employ projection pursuit. Projection pursuit, orig-inally proposed in [4], can provide physics spectral analysis involving the whole spectrum. It consists of two steps. First the spectrum is partitioned into groups of adjacent bands, and then each group is projected in a unique vector. Thus, a re-duced space is obtained and its size is equal to the number of groups. Each group projection vector has to be simultane-ously representative of its corresponding group and as “far” as possible from the other vectors to assure band separability in reduced data space. For this purpose, an index I is de-fined to optimize the projection vector basis. The choice of I defines the optimal subspace and can be adapted to targeted applications. Along the classical projection pursuit process, the index is iteratively maximized on the projection vectors. In our application, projection pursuit is used as a preprocess-ing step for classification. That is why, as proposed in [6], we prefer to calculate the index I on the training classes. This is also significantly faster than to consider the whole volume.

2.1.1. Choice of the projection index

In the literature, the most common projection pursuit indices are Bhattacharyya and Kullback-Leibler distances [7]. For classification purpose more specific indices can be used. For example, an index based on histogram analysis is proposed in [8]. On the other hand, such indices require manual parame-ter specification, and don’t improve significantly the classifi-cation accuracy. That is why we prefer to use the Kullback-Leibler distance D(i, j) = 0.5(H(i, j) + H(j, i)), where:

H(i, j) = Z fi(x)ln fi(x) fj(x) dx,

and fi, fjare the distributions of the two training classes.

2.1.2. Spectrum partitioning

The second important step in the dimension reduction is to choose a reasonable partitioning of the spectrum into groups. In the original projection pursuit [4], the spectrum is sepa-rated in K groups having the same number of bands. Since the purpose is to study information about skin components, it seems to be judicious to select a partitioning adapted to the skin properties. In [7], a partitioning method based on the index I is proposed. If K is a specification parameter, the method partitions the spectrum as follows: first, the spectrum is partitioned into K groups with a constant number of bands, then borders of each group are iteratively reestimated in order

to minimize the internal variance σ_I2of each group:

σ_I2(k) = 1 K K−1 X k=0 I2(zk, zk+1) − 1 K K−1 X k=0 I(zk, zk+1) !2 ,

where zk represents the position of the upper boundary of

the kth_{group. In skin images various objects like hair, lips,}

or eyes can be present. The presence of these objects influ-ences the physics of the spectrum and, thus, the classification. Therefore, an algorithm that permits a spectral partitioning with a variable number of groups K is needed. The solution proposed below is deduced from the discrete function FI

defined as follows:

FI(k) = I(bk, bk+1), k = 1, ..., Nb− 1,

where bk represent spectral bands and Nb the number of

spectral bands in the initial data. Local maxima of FI give

possible borders zk for the spectral partitioning. To choose

only significant maxima one can use, e.g., multi-scale analy-sis or the integral of FI between consecutive maxima. Such

an analysis of FI introduces a new parameter which can be

the scale step for the multi-scale analysis or a threshold for the FI integral analysis.

2.2. SVM

SVM, originally proposed in [9], is a supervised method for data classification. It can be decomposed in two steps. The first one, called the learning step, consists in computing an optimal linear separator between learning classes. This is done by quadratic optimization. When data are not linearly separable, the introduction of a kernel transforms the data space into a higher dimensional space where a linear separa-tor can be computed. The second step consists in classifying the whole data set by evaluating for each pixel the distance to the separator. Methods and algorithms for SVMs are avail-able in [3, 10]. For the skin analysis, we use a two-class non linear SVM with a Gaussian kernel defined as follows:

K(xi, xj) = exp(−

||xi− xj||2

2σ2 ),

where xiand xjare pixels from the training set.

3. UNMIXING METHOD

Unmixing or blind source-separation is a well known tech-nique for analyzing multichannel data. It assumes that the data consists of pure signals, called components, that are lin-early mixed. In the skin hyper-pigmentation analysis, one looks for a component (or source) as representative as pos-sible of the melanin concentration. If Xi,j denotes the initial

data at position (i, j), A the mixture matrix, Si,jthe weight of

(4)

Fig. 1. Comparaison between dermatologist evaluation of pathological area and detected pathological area by ICA source classification (a) and Stamatas algorithm (b).

additive noise, then the mixing model can be written [11, 12] as:

Xi,j= ASi,j+ Ni,j.

This equation reveals two kinds of ambiguities: when columns of A are transposed or multiplied by different scalars, a new component decomposition is obtained. To estimate sources or independent components, several methods have been pro-posed in the literature. Here we consider FastICA [12] and Joint Approximate Diagonalization of Eigenmatrices (JADE) [11]. FastICA iteratively computes each component max-imizing the negentropy J which is the difference between the entropy H of the source s and that of a Gaussian ran-dom variable g with the same correlation matrix as s, i.e. J (s) = H(g) − H(s). JADE algorithm performs simul-taneous component estimation by diagonalizing matrices of second and fourth order cumulants [11]. Although these two methods are formulated differently, they are close from a mathematical point of view [11, 12]. That explains why, on our data set, they report comparable results. For JADE algo-rithm, the more components are estimated, the more memory is needed. FastICA is very sensitive to the model ambigui-ties, and reports computation time varying from ten seconds to several minutes on different images of the data set. In our case, we look for a small number of sources and prefer to use JADE which has a stable computation time about 20 seconds. Once the most representative source of skin pigmentation is chosen, thresholding enables classification between healthy and hyper-pigmented skin.

4. EXPERIMENTS

In this section we compare the developed algorithms with a reference (i.e Stamatas algorithm [1]). These algorithms need manual parametric specification. Projection pursuit combined with SVM (PP-SVM) classification is initialized by choosing training pixels on each image, and melanin concentration car-tographies estimated by ICA and Stamatas algorithm are

man-ually thresholded to obtain classification. These thresholds are chosen on each image to optimize the correspondence of detected pathological area with a dermatologist analysis.

The two methods, PP-SVM and thresholding a selected ICA component, have been tested on a data set of 96 images taken from 48 patients. These multi-spectral images were provided by Galderma R&D. These images contain 18 spec-tral bands in a range from 405nm to 970nm with an average step of 25 nm. The spatial area is about 900*1200 pixels. On the whole dataset, the algorithms (PP-SVM and ICA) and the ground truth provided by dermatologists report the same pathological area in about 75% of the images. When algo-rithms and dermatologist analysis disagreed, it is due to false detections caused by facial features, like lips or eyes, and hair. To compare the accuracy of the skin analysis algorithms, we reduce the data set to 34 images to keep only images that contain as few facial features and hair as possible. In figure 2, we compare results of PP-SVM, thresholding of a selected ICA component and the reference Stamatas algorithm. One can see that on the two images, the three algorithms provide quite similar results. But the Stamatas algorithm fails to de-tect pathological areas when the contrast between healthy and hyper-pigmented skin is weak. PP-SVM and ICA methods provide better results since they succeed in detecting hyper-pigmentation in weak contrasted areas. For example, on the image 1 (left column) of figure 2, ICA based algorithm de-tects more accurately the weak hyper-pigmented area. Since the PP-SVM initialisation is operator dependent, we demon-strate here only the results for ICA based method. In figure 1(a), we show the correspondence between the dermatologist estimation (quantized into 6 categories by the percentage of hyper-pigmentation), and the detected pathological area by the ICA approch; in figure 1(b), we provide the correspon-dence with the Stamatas algorithm. Dermatologist analysis and algorithms coincide when dots are on the diagonal line. Comparing with the classification into the 6 classes provided by dermatologists, we obtain 100% of good classification with the ICA approach whereas Stamatas approach gives 71% of accurate classification.

5. CONCLUSION

In this paper, we have shown that using the whole spectrum instead of only few bands of interest can improve the skin hyper-pigmentation classification on multispectral images. We notice that the developed algorithms have an average computation time of 15 seconds on an image of 900 by 1200 by 18 pixels (with a Matlab code on a 2.2GHz core processor system) which is a reasonable time for a medical application. In future work, the goal will be to continue using multi/hyper-spectral images to provide precise local as well as global skin pigmentation evaluation. Methods to compensate artifacts caused by facial features and hair will be studied.

(5)

Fig. 2. Detected pathological areas by Stamatas algorithm, by SVM with Projection Pursuit (PP-SVM), and by thresholding the selected ICA component.

6. ACKNOWLEDGEMENT

The authors would like to thank Prof. Jocelyn Chanussot, Dr. Laurent Petit and Anne-Sophie Dugaret for interesting dis-cussions, and Galderma R&D early development for partial funding and for providing the data.

7. REFERENCES

[1] G. N. Stamatas, B. Z. Zmudzka, N. Kollias, and J. Z. Beer, “Non-invasive measurements of skin pigmenta-tion in situ.,” Pigment cell res, vol. 17, pp. 618–626, 2004.

[2] G. N. Stamatas, B. Z. Zmudzka, N. Kollias, and J. Z. Beer, “In vivo measurement of skin erythema and pig-mentation: new means of implementation of diffuse re-flectance spectroscopy with a commercial instrument,” British Journal of Dermatology, vol. 159, pp. 683–690, 2008.

[3] M. Fauvel, “Spectral and spatial methods for the classi-fication of urban remote sensing data,” in PhD Thesis. Institut National Polytechnique de Grenoble. France, 2007.

[4] S. Mallat and Z. Zhang, “Matching pursuit with time-frequency dictionaries,” IEEE Trans. on Signal Process-ing, vol. 41, pp. 3397–3415, 1993.

[5] G. F. Hughes, “On the mean accuracy of statistical pat-tern recognizers,” IEEE Trans. on Information Theory, vol. 14, pp. 55–63, 1968.

[6] L.O. Jimenez and D.A Landgrebe, “Hyperspectral data analysis and supervised feature reduction via projection pursuit,” IEEE Trans. on Geoscience and Remote Sens-ing, vol. 37, pp. 2653–2667, 1999.

[7] G. Rellier, X. Descombes, F. Falzon, and J. Zerubia, “Texture feature analysis using a gauss-markov model in hyperspectral image classification,” IEEE Trans. on Geoscience and Remote Sensing, vol. 42, pp. 1543– 1551, 2004.

[8] D. Gomez, L. K. Harder, B. Ersbll, and J. M. Carstensen, “Precise acquisition and unsupervised segmentation of multi-spectral images,” Computer Vision and Image Un-derstanding, vol. 106, pp. 183–193, 2007.

[9] V. Vapnik, “Statistical learning theory,” John Wiley and sons, inc., 1998.

[10] O. Zammit, X. Descombes, and J. Zerubia, “Burnt area mapping using support vector machines,” International Conference on Forest Fire Research, Figueira da Foz, Portugal, 2006.

[11] J.F. Cardoso, “High-order contrasts for independent component analysis,” Neural Computation, vol. 11, pp. 157–192, 1999.

[12] A. Hyvarinen, “Fast and robust fixed-point algorithms for independent component analysis,” IEEE Trans. on Neural Networks, vol. 10, pp. 626–634, 1999.