HAL Id: hal-00872889
https://hal.archives-ouvertes.fr/hal-00872889
Submitted on 14 Oct 2013
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
A Model-Based Approach for Compound Leaves Understanding and Identification
Guillaume Cerutti, Laure Tougne, Julien Mille, Antoine Vacavant, Didier Coquin
To cite this version:
Guillaume Cerutti, Laure Tougne, Julien Mille, Antoine Vacavant, Didier Coquin. A Model-Based
Approach for Compound Leaves Understanding and Identification. International Conference on Image
Processing (ICIP), Sep 2013, Melbourne, Australia. pp.1471-1475. �hal-00872889�
A MODEL-BASED APPROACH FOR COMPOUND LEAVES UNDERSTANDING AND IDENTIFICATION
Guillaume Cerutti
1,2Laure Tougne
1,2Julien Mille
1,3Antoine Vacavant
4Didier Coquin
51
Universit´e de Lyon, CNRS
2
Universit´e Lyon 2, LIRIS, UMR5205, F-69676, France
3
Universit´e Lyon 1, LIRIS, UMR5205, F-69622, France
4
Clermont Universit´e , Universit´e d’Auvergne, ISIT, F-63001, Clermont-Ferrand
5
LISTIC, Domaine Universitaire, F-74944, Annecy le Vieux
ABSTRACT
In this paper, we propose a specific method for the identifi- cation of compound-leaved tree species, with the aim of inte- grating it in an educational smartphone application. Our work is based on dedicated shape models for compound leaves, designed to estimate the number and shape of leaflets. A deformable template approach is used to fit these models and produce a high-level interpretation of the image content. The resulting models are later used for the segmentation of leaves in both plain and natural background images, by the use of multiple region-based active contours. Combined with other botany-inspired descriptors accounting for the morphological properties of the leaves, we propose a classification method that makes a semantic interpretation possible. Results are presented over a set of more than 1000 images from 17 Eu- ropean tree species, and an integration in the existing mobile application Folia
1is considered.
Index Terms— plant recognition, compound leaf, de- formable templates, image segmentation, active contours, classification
1. INTRODUCTION
When considering a tree identification mobile application, leaves are an obvious choice for recognition. They can be found all year long, are plane enough to be easily pho- tographed, and show properties that make the identification possible. However, leaves are natural objects whose mor- phological diversity makes modelling complicated, not to say impossible, and distinguishing between leaf types is necessary. This work concerns only compound leaves, for which a single leaf is actually divided in many leaflets.
The proposed method aims at modelling explicitly the dis- position, the global shape and the local specificities of the leaflets in order to classify photographs of leaves in a natural environment into a list of species. The use of high-level, in- dependent descriptors offers the possibility of an explanatory process, putting semantic concepts on what is recognized, which may be of great interest for the user. Figure 1 gives an insight of the proposed process.
This work has been supported by the French National Agency for
Fig. 1. Overview of the compound leaf recognition process Section 2 presents other works addressing close topics.
The specific models used to represent and segment compound leaves are introduced in Section 3. Section 4 expounds the descriptors and classification we use, as well as some results, and perspectives of future work are given in Section 5.
2. RELATED WORK
Plant recognition is a topic of interest in image processing, mostly in the context of leaf image retrieval. Some authors [1] even share our goal of conceiving a mobile application with great success
2, though being designed essentially for white-background images. The problem of segmenting leaves from a natural environment appears indeed to be much more challenging and is addressed by few other authors, using very complex methods [2, 3].
Segmentation of natural objects is a context where the introduction of prior shape knowledge could be very bene- ficial. Deformable models, which can be traced back to active contours [4], are a popular way to achieve this. Deformable templates modeling complex objects [5], strongly constrained
Research with the reference ANR-10-CORD-005 (REVES project).
1https://itunes.apple.com/app/folia/id547650203
2http://leafsnap.com: developed by researchers from Columbia University, the University of Maryland, and the Smithsonian Institution
templates, such as active shape models [6] or level-set active contours with shape priors [7] constitute ways of including prior knowledge in the segmentation. However, their shapes generally lack the necessary flexibility and expressiveness to capture the diversity of leaf shapes.
Concerning leaf shape description, many works apply es- tablished shape descriptors such as the Inner-Distance Shape Context [1], moments [3], Centroid-Contour Distance curves [2] or Curvature-Scale Space transform [8]. Such descriptors were not designed to take into account the nature of the ob- ject, even if they fit quite well with its specificities. On the other hand, some explicit geometric descriptions of the leaf morphology have been proposed [9, 10].
3. DEFORMABLE COMPOUND LEAF MODELS Similarly to what we have done in the case of simple leaves [11], the segmentation method we propose relies on the prior evaluation of a flexible leaf model, designed to cover the vari- ety of leaf shapes. Such a model constitutes a way of provid- ing a first segmentation as well as a description of the leaf’s global shape that can later be used for recognition.
3.1. Deformable Compound Leaf Model
A first model that tries to estimate the number and disposition of the leaflets was introduced in [12] and was modified here to achieve better robustness. It is crucial to point out that very often, the number of leaflets is not the number of connected components one would find in the segmented image, given that the overlap between leaflets is a constant risk. Estimating the actual number location of those leaflets beforehand would therefore be a guarantee that the resulting shapes will be cor- rectly described.
This model makes assumptions about the axial symme- try of the leaves, and the regularity of the leaflets in size and orientation, that are not strictly speaking always true, but are satisfying for a first approximation. As shown in Figure 2, the model represents leaflets by a variable number n
Lof pairs of circles (C
2l, C
2l+1)
l=1..nLsymmetrically positioned on either side of a curved axis defined by two points T (for the top leaflet C
1) and B (for the base of the petiole) and a curvature parameter k. The additional parameters used to build the model are the following :
• (p
l)
l=1..nL
, the position of pairs of circles on the axis
• d, the distance of all the circles to the axis
• r, the radius of all the circles
The estimation of the optimal model M
?on the actual image is performed through the minimization of an energy function by successive variations of the parameters. This en- ergy term is based on a color dissimilarity map estimated beforehand (Figure 3(b)) that accounts for each pixel’s like- lihood of being part of the leaf, given only its color. It is
Fig. 2. Construction of the compound leaf model
based on a simple leaf color model computed after a rough coloring of the three top leaflets (Figure 3(a)) by the user. This intuitive phase corresponds to what will be asked to an user of the mobile application, and turns out to be very handy to place the model accurately in its initialization.
In the context of compound leaves, we model the color by a single Gaussian (µ, Σ) in the L*a*b* colorspace, which is a perceptually more accurate representation than RGB. The dissimilarity of a pixel p to this color model is then simply given by the Mahalanobis distance d(p, µ, Σ) with respect to the computed Gaussian parameters.
The energy function the model minimizes during its evo- lution is simply the sum of the dissimilarity over the region defined by the model, minus a maximal dissimilarity that acts as a balloon force:
E(M ) = X
p∈S2nL+1 i=1 Ci
d(p, µ, Σ) − d
max(1)
The number of leaflets n
Lhas to be optimized separately, since the changes it produces in the shape of the model are too important to consider it in a gradient-descent like approach.
To overcome this difficulty, the model is initialized with an excessive number of leaflets. Following a process close to simulated annealing, a temperature variable is slowly decreas- ing through the evolution, and brutally raised in regular cy- cles. At the end of each of those cycles, circles that have grouped in actual leaflets are simply suppressed, a decision made comparing the distance between the centers of two con- secutive pairs of circles and the radius r. This way, unlike what was done in [12] where the number of leaflets was ap- proximated a posteriori, the convergence of the model ideally shows one single circle per leaflet (Figure 3(c)).
3.2. Deformable Joint Polygonal Leaflet Models
To capture the global shape of the leaflets, we rely on the polygonal leaf model introduced in [13], making the assump- tion that all the leaflets share the same shape. This is generally true (and species are described in the literature by the shape of their leaflets anyway) even if little exceptions (appreciably different shapes for the top leaflets) might cause some prob- lems.
Consequently we propose a novel joint approach where
we place one model Π
ifor each of the 2n
L+ 1 leaflets
obtained after the evaluation of the leaf model, and constrain
them to have the same shape parameters. Only the points defining the base and apex of each leaflet vary independently.
This new model {Π
i}
2ni=1L+1evolves the same way as the previous one, minimizing the same energy function, but with no suppression of overlapping leaflets. Constraints are added throughout the evolution, on the shape parameters so that leaflets keep leaf-like shapes, and on the points so that pairs of leaflets remain locally symmetrical, under the form of an internal energy term.
The process of fitting a model to a single leaflet, which may overlap with its neighbours is risky, but the fact that all the leaflet models are evaluated simultaneously ensures that they self-constrain (Figure 3(d)) The parameters of the opti- mal models {Π
∗i}
2ni=1L+1we obtain are therefore more robust than if they were computed independently on each leaflet.
(a) (b) (c) (d) (e)
Fig. 3. Example of model-fitting and segmentation of com- pound leaves
3.3. Multiple Active Contour Segmentation
To obtain different interpretable contours corresponding to each leaflet, the natural choice is to deform the polygonal shapes resulting from the previous step towards the actual contours. Once again, it is very interesting to perform this step in a joint fashion, so that the contours we are evaluating act as a constraint on each other.
The contour of each leaflet is represented by a region- based active contour model, using an extension of the level- set framework to the case of multiple regions [14] and an im- plicit definition approximating the original level-set evolution [15]. This model have the limitation that a pixel can only be part of one region, so that the contours do not interpenetrate.
The 2n
L+ 1 regions {Ω
i}
2ni=1L+1evolve simultaneously, minimizing the energy functional :
E({Ω
i}
2ni=1L+1) =
2nL+1
X
i=1