• Aucun résultat trouvé

C A C R C M R CHAHINIAN B COMMANDRÉ ,D EN-NEJJARY ,L PIBRE ,M CHAUMONT ,C DELENNE ,N M

N/A
N/A
Protected

Academic year: 2022

Partager "C A C R C M R CHAHINIAN B COMMANDRÉ ,D EN-NEJJARY ,L PIBRE ,M CHAUMONT ,C DELENNE ,N M"

Copied!
1
0
0

Texte intégral

(1)

MANHOLE COVER LOCALIZATION IN AERIAL IMAGES WITH A DEEP LEARNING APPROACH

BENJAMIN COMMANDRÉ1,3, DRISS EN-NEJJARY2, LIONEL PIBRE2, MARC CHAUMONT2, CAROLE DELENNE1,3, NANÉE CHAHINIAN1

CONTEXT

Urban expansion leads to more buried wastewater net- works, often poorly documented.

Very high reolution areal images may be used to identify and pinpoint the aerial elements of these networks

Deep Learning, Convolutional Neural Network

Challenge: detect small objects i.e manhole covers (80 cm); in low contrast settings and cluttered back- grounds

Objective: An automatic recognition and localization method for manhole covers.

Figure 1: Extract of the 5cm resolution image used for validation.

CONTACT

benjamin.commandre@umontpellier.fr

REFERENCES

[1] François Chollet. Keras. https://github.com/fchollet/keras, 2015.

[2] Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. Imagenet classification with deep convo- lutional neural networks. in Advances in neural information processing systems, pages 1097–1105, 2012.

[3] Xuchun Li, Lei Wang, and E. Sung. A study of adaboost with svm based weak learners. In Proceedings.

2005 IEEE International Joint Conference on Neural Networks, 2005., volume 1, pages 196–201 vol. 1, July 2005.

[4] M. Everingham, C. K. I. Van Gool, L.and Williams, J. Winn, and A Zisserman. The pascal visual object classes (voc) challenge. International Journal of Computer Vision, 88(19):303–338, 2010.

[5] O. Bartoli, N. Chahinian, A. Allard, J.-S. Bailly, K. Chancibault, F. Rodriguez, C. Salles, M.-G.

Tournoud, and C. Delenne. Manhole cover detection using a geometrical filter on very high resolution aerial and satellite images. In Join Urban Remote Sensing Event, Lausanne, Switzerland, 30 March - 1 April 2015.

ACKNOWLEDGMENT

This study is part of the "Cart’Eaux" project funded by the French Languedoc-Roussillon region and the European Re- gional Development Fund (ERDF).

MATERIALS AND METHODS

The method is developed and applied on two towns located in the south of France: Gigean and Prades-Le-Lez.

Data:

2 RGB Images, 5cm/pixel:

- Training dataset: 605 manhole covers from Prades Le Lez - Validation dataset: 101 manhole covers from Gigean

The thumbnails are 40*40 pixels size (Figure 2)

Classification into 2 categories: "Manhole covers" and "others"

Data augmentation with the Keras library [1]

Figure 2: Example of thumbnails: up, manhole covers, down, others.

Method:

Convolutionnal Neural Network Customized Alexnet [2] (Figure 3)

Extract thumbnails from images using a sliding window:

Boosting the network:

After application on Prades-Le-Lez:

Add all false positives to the other objects’ category and train the network again. [3]

Cleaning the database:

Remove all the thumbnails that have a dominant feature that is not related to manhole covers from training database.

Classification:

A thumbnail is retained if the probability of representing a manhole is greater than 90%.

Validation:

Comparison with ground truth data [4]:

a0 = area(Bp Bgt) area(Bp Bgt) Bp = Bounding box detected by the network Bgt = Ground truth bounding box

True detection if a0 > 50%

Input Thumbnail

40x40

Convolution 1 96 (7x7), s=0, p=1

Convolution 2 256 (5x5), s=1, p=0

Convolution 3 256 (5x5), s=1, p=1

Convolution 4 256 (5x5), s=1, p=1

Convolution 5 256 (3x3), s=3, p=0

Max Pooling (3x3), s=1, p=0 ReLU Norm

Max Pooling (3x3), s=1, p=0 ReLU Norm

Max Pooling (3x3), s=1, p=0 ReLU Norm

Max Pooling (3x3), s=1, p=0 ReLU Norm

Max Pooling (3x3), s=1, p=0 ReLU Norm

Feature Learning

Classifi cation

Fully connected

Softmax

Manhole Other

Figure 3: Customized AlexNet architecture

Application:

Four networks tested:

1. Original Alexnet network

2. Fifth iteration boosted network

3. Fifth iteration boosted network with cleaned database 4. Customized network with cleaned database

The results are assessed in terms of precision and recall:

Precision = TP

TP + FP, Recall = TP

TP + FN TP = Number of correctly classified manhole covers FP = Number of thumbnails wrongly classified

FN = Number of undetected manhole covers

RESULTS

Figure 4: Sample of the results obtained with the customized net- work and the cleaned database.

Green square: correctly detected manhole covers Red square: false detection

Blue square: undetected manhole covers

Customized network yields better results as shown by the ROC curves:

CONCLUSION AND PERSPECTIVES

An automated procedure was put forward to identify and localize manhole covers using aerial RGB images.

Preliminary results: recall is higher than 50% for an average precision of 60%.

Perspectives:

Combine a circular filter [5] and convolutionnal neural network to reduce false positives.

Add a third category - inlet gates - to the classification in order to optimize the learning phase and extract more precise features.

Références

Documents relatifs

Thus, herein, we measured the frequency of total CD4 + CD45RA − CXCR5 + cTfhs and activated CXCR5 + PD1 + , CXCR5 + PD1 + ICOS + and CXCR5 + PD1 + CXCR3 − subsets at the time

We report data recorded at enrolment in an ongoing international NP-C registry initiated in September 2009 to describe disease natural history, clinical course and treatment

3) Simulated user annotation: In the experiments, the ground truth for unlabeled regions are used to simulate the user annotations. A similar strategy was adopted in [7, 30], as well

In this work, we investigate various methods to deal with semantic label- ing of very high resolution multi-modal remote sensing data. Especially, we study how deep fully

لصفلا لولأا : عادبلإا يرادلإا ميهافم سسأو بلطملا يناثلا : صئاصخ عادبلإا هعاونأو لولأا عرفلا : عادبلإا صئاصخ ارظن ةناكملل تيلا

Pour les occlusives, en parole modale, bien que la consonne non voisée /p/ soit produite avec une résistance articulatoire deux fois supérieure à la consonne voisée /b/

The members of the Notch signalling pathway that were expressed within the chick diencephalon from HH11 (Figure 1) mapped to the rostral end of the hypothalamus primordium

Ainsi, tandis que pour les énoncés 49 à 53, la construction formellement identifiable comme une focalisation reprend du fait de l’insertion de l’ensemble ∂âm∑ la