• Aucun résultat trouvé

Detecting Anomalies over Message Streams in Railway Communication Systems

N/A
N/A
Protected

Academic year: 2021

Partager "Detecting Anomalies over Message Streams in Railway Communication Systems"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-02357927

https://hal.archives-ouvertes.fr/hal-02357927

Submitted on 4 Dec 2019

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of

sci-entific research documents, whether they are

pub-lished or not. The documents may come from

teaching and research institutions in France or

abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est

destinée au dépôt et à la diffusion de documents

scientifiques de niveau recherche, publiés ou non,

émanant des établissements d’enseignement et de

recherche français ou étrangers, des laboratoires

publics ou privés.

Detecting Anomalies over Message Streams in Railway

Communication Systems

Lucas Foulon, Serge Fenet, Christophe Rigotti, Denis Jouvin

To cite this version:

(2)

OUR DATA

DETECTING ANOMALIES OVER MESSAGE STREAMS IN RAILWAY

COMMUNICATION SYSTEMS

GOALS

•  Monitor on real-time the proper functionning of the

information system

•  Support high volume of streaming data

•  Warn when an anomaly occurs

4th AALTD@ECML/PKDD 2019 – 20/09/2019

Lucas Foulon1,3, Serge Fenet1, Christophe

Rigotti2, Denis Jouvin3

•  Traces containing information about messages flowing in

the information system: number of messages, latency between differents checkpoints, …

•  Built by analyzing the content of the data stream: Sent/

Received timestamp, type of device/service, ...

•  Interfaced with the central platform of the SNCF IS

(CanalTrain) through ELK open source products

RESULTS

IN PROGRESS

METHOD

Use of CFOF anomaly measure [Angiulli, ECML PKDD 2017]

•  Unsupervised

•  Based on the stucture of the local

neighborhood

•  Adapted to high dimension data

•  But not adapted to data streams,

Use of the iSAX indexation tree [Shieh & Keogh, DAMI 2009]

•  Based on a modification of the SAX

discretization

•  Suited for times series indexation and

similarity search

•  Efficient access using distance boundings

•  Support Dynamic Time Warping,

weighting, and very high volumes (billion time series)

Proposition : exploit the properties of the iSAX

tree to accelerate the computing of the CFOF score in order to apply it to voluminous data streams

•  Reduced complexity allowing the efficient use of the

CFOF score on high volume data streams

•  High quality of the estimated score

•  Real time detection of IS anomalies

•  One parameter controlling the detection

•  Incremental update of the tree

•  From tree to forest to reduce dimensions and

accelerate the computing

•  Multi-scale and multi-indicators anomaly detection

•  Testing the robustness to regime changes

1Université Claude Bernard Lyon 1, CNRS, LIRIS, UMR5205

2Université Lyon, INSA Lyon, CNRS, INRIA, LIRIS, UMR5205

Références

Documents relatifs

For a two-dimensional neighborhood structure, we consider successively 3 SOM algorithms with a fixed number of neighbors (SOM5, SOM9 and SOM25, the suffix being the number

The test results show that by dynamic clustering the HTTP log streams and the network connections in KDD’99 data, our autonomic method is better than three other traditional

State-of-the-art methods on VCDA mostly have focused on appropriate visual content descriptors and are still less efficient on automatic textual annotation or capable of

Neural networks provide multi-class classification regardless of the linear time-separability of classes. In addition, neural networks are able to determine the presence

Besides, we present an example for publishing the Global Database of Events, Language and Tone (GDELT) as a Streaming Linked Data resource.. We open-sourced the code of our resource

If we can explain the relationship of information obtained on the basis of registration data with some patterns of behavior in the city, then tracking the changes in the data on

This paper is an exploratory qualitative study of the gamification possibilities offered by Twitch.tv and the community to the streamers by studying three popular chatbots

Again, the utility value for Dynamic Sieve-Streaming is consistently larger than for pure and reduced Sieve- Streaming explaining the increased recall performance.. Both algorithms