• Aucun résultat trouvé

Missing data completion in wastewater network databases: the added-value of Graph Convolutional Neural Networks.

N/A
N/A
Protected

Academic year: 2021

Partager "Missing data completion in wastewater network databases: the added-value of Graph Convolutional Neural Networks."

Copied!
3
0
0

Texte intégral

(1)

HAL Id: hal-03264958

https://hal.archives-ouvertes.fr/hal-03264958

Submitted on 18 Jun 2021

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Missing data completion in wastewater network

databases: the added-value of Graph Convolutional

Neural Networks.

Yassine Belghaddar, Carole Delenne, Nanée Chahinian, Ahlame Begdouri,

Abderrahmane Seriai

To cite this version:

Yassine Belghaddar, Carole Delenne, Nanée Chahinian, Ahlame Begdouri, Abderrahmane Seriai. Miss-ing data completion in wastewater network databases: the added-value of Graph Convolutional Neural Networks.. EGU General Assembly 2021, Apr 2021, Virtual, France. �10.5194/egusphere-egu21-8350�. �hal-03264958�

(2)

EGU21-8350

https://doi.org/10.5194/egusphere-egu21-8350 EGU General Assembly 2021

© Author(s) 2021. This work is distributed under the Creative Commons Attribution 4.0 License.

Missing data completion in wastewater network databases: the

added-value of Graph Convolutional Neural Networks.

Yassine Belghaddar1,2,3, Carole Delenne1, Nanée Chahinian1, Ahlame Begdouri2, and

Abderrahmane Seriai3

1HSM, Univ Montpellier, CNRS, IRD, Montpellier, France (yassine.belghaddar@usmba.ac.ma) 2LSIA, Univ. Sidi Mohamed Ben Abdellah, Fez, Morocco

3Berger-Levrault, Pérols, France

Wastewater networks are mandatory for urbanization. Their management, which includes reparation and expansion operations, requires precise information about their underground components, mainly pipes. For hydraulic modelling purposes, the characteristics of the nodes and pipes in the model must be fully known via specific, complete and consistent attribute tables. However, due to years of service and interventions by different actors, information about the attributes and characteristics associated with the various objects constituting a network are not properly tracked and reported. Therefore, databases related to wastewater networks, when available, still suffer from a large amount of missing data.

A wastewater network constitutes a graph composed of nodes and edges. Nodes represent manholes, equipment, repairs, etc. while edges represent pipes. Each of the nodes and edges has a set of properties in the form of attributes such as diameters of the pipes. In this work, we seek to complete the missing attributes of wastewater networks using machine learning techniques. The main goal is to make use of the graph structures in the learning process, taking into consideration the topology and the relationships between their components (nodes and edges) to predict missing attribute values.

Graph Convolutional Network models (GCN) have gained a lot of attention in recent years and achieved state of the art in many applications such as chemistry. These models are applied directly on graphs to perform diverse machine learning tasks. We present here the use of GCN models such as ChebConv to complete the missing attribute values of two datasets (1239 and 754 elements) extracted from the wastewater networks of Montpellier and Angers Metropolis in France. To emphasize the importance of the graph structure in the learning process and thus on the quality of the predictions, GCNs' results are benchmarked against non-topological neural networks. The application on diameter value completion, indicates that using the structure of the wastewater network in the learning process has a significant impact on the prediction results especially for minority classes. Indeed, the diameter classes are very heterogeneous in terms of number of elements with a highly majority class and several classes with few elements. Non-topological neural networks always fail to predict these classes and affect the majority class value

(3)

to every missing diameter, yielding a perfect precision for this class but a null one for all the others. On the contrary, the ChebConv model precision is slightly lower (0.93) for the majority class but much higher (increases from 0.3 to 0.81) for other classes, using only the structure of the graphs. The use of other available information in the learning process may enhance these results.

Références

Documents relatifs

 pushing forward to weakly supervised learning the problem Medical Image Segmentation of Thoracic Organs at Risk in CT-images..

Figure 5.8 – Proposal network performance on ASMP for different input image sizes and number of PEs.. Figure 5.9 shows the same results as reported in Table 5.8 in a

Data Augmentation for Time Series Classification using Convolutional Neural Networks.. Arthur Le Guennec, Simon Malinowski,

Figure 6: Accuracy stability on normal dataset (for class ”mountain bike”), showing a gradual increase of accuracy with respect to decreasing neurons’ criticality in case of

[10] Christopher Morris, Martin Ritzert, Matthias Fey, William L Hamilton, Jan Eric Lenssen, Gaurav Rattan, and Martin Grohe. Weisfeiler and leman go neural: Higher-order graph

To classify the data of the two other samples, using an equal number of classes of training and control samples, as well as using groups of semantically similar classes

The policy-based solutions parameterize the policy function directly by a neural architecture and try to find the optimal policy using gradient ascent, while the value-based

During training process, in order to make every forward unit responsible to the classification, for every forward unit there are temporary fully connected layers applied to train