• Aucun résultat trouvé

ActiveData, un modèle de programmation pour la gestion des cycles de vie des

N/A
N/A
Protected

Academic year: 2022

Partager "ActiveData, un modèle de programmation pour la gestion des cycles de vie des"

Copied!
37
0
0

Texte intégral

(1)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

ActiveData, un mod `ele de programmation pour la gestion des cycles de vie des

donn ´ees

A. Simonet1, G. Fedak1, M. Ripeanu2

(1){Anthony.Simonet, Gilles.Fedak}@inria.fr INRIA, University of Lyon, France

(2)matei@ece.ubc.ca University of British Columbia, Canada

Calcul intensif et Sciences des donn ´ees VICHY 2014

(2)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Plan

1 Introduction

2 Active Data

Principes & fonctionnalit ´es Impl ´ementation

3 Etude de cas : Framework de surveillance des donn ´ees pour APS

4 Discussion

5 Conclusion

G. Fedak(INRIA/UBC) Active Data Vichy 2014 2/28

(3)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Outline

1 Introduction

2 Active Data

Principes & fonctionnalit ´es Impl ´ementation

3 Etude de cas : Framework de surveillance des donn ´ees pour APS

4 Discussion

5 Conclusion

(4)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Big Data ...

I Contexte: un d ´eluge de donn ´ees de sources multiples

!"#$%&'("$#) *+'&,-./"#)

0+1)*2+("2() !"$(%"($) 34(")5-$-)

I Quelques exemples:

I Les grands instruments scientifiques (LSST 30TB/nuit, LHC 1.5PB/week, OOOI), mais pas seulement (sequencing machine)

I Internet et les r ´eseaux sociaux (Google, Facebook, Twitter, etc.)

I Open Data (Biblioth `eques ouvertes, gouvernementales, code source)

→impacte le processus de d ´ecouverte scientifique (4thparadigme de la science)

G. Fedak(INRIA/UBC) Active Data Vichy 2014 4/28

(5)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

... ou Big Bottleneck ?

I La science des donn ´ees produit de nouveaux d ´efis :

I comment construire les infrastructures ad ´equates ?

I infra complexes et h ´et ´erog `enes, am ´elioration des performances de bout-en-bout, optimisations inter-syst `emes.

I comment am ´eliorer la productivit ´e des scientifiques confront ´es au Big Data ?

I outils de gestion orient ´es donn ´ees, qualit ´e des jeux de donn ´ees (provenance).

I comment impl ´ementer une science des donn ´ees collaborative ?

I motivation pour la publications et le partage des jeux de donn ´ees, r ´eseaux sociaux et applications Web.

I De nouveaux outils conceptuels et logiciels sont n ´ecessaires pour manipuler les jeux de donn ´ees scientifiques `a travers des infrastructures distribu ´ees et h ´et ´erog `enes.

(6)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Le cycle de vie des donn ´ees

Data Life Cycle (DLC):La succession de phases op ´erationnelles par laquelle passent les donn ´ees entre le moment o `u elles entrent dans un ensemble de syst `emes jusqu’au moment o `u elles le quittent.

!"#$%&%'()* +,-.,("-&&%)/* 01(,2/-*

!)234&%&*

!)234&%&*

G. Fedak(INRIA/UBC) Active Data Vichy 2014 6/28

(7)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Cas d’ ´etude : l’exp ´erience APS

Exemple:Advanced Photon Source (APS) `a Argonne National Lab

I 100TB de donn ´ees brutes par jour

I Les donn ´ees brutes sont pr ´e-trait ´ees et enregistr ´ees dans un catalog Globus.

I Les donn ´ees sont analys ´ees par diverses applications

I Les r ´esultats sont partag ´es entre les utilisateurs

Instrument (Beamline)

Local Storage Transfer

Extract &

Remote Data Center

Transf er

Academic Cluster Analysis

More analysis Upload result

result metadata

(8)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Nos objectifs

Un syst `eme parfait devrait :

I capturer les ´etapes et les propri ´et ´ees essentielles du cycle de vie : cr ´eation, destruction, fautes, r ´eplication, erreurs checking . . .

I Permettre aux syst `emes existants d’exposer leur DLC intrins `eque

I Raisonner sur des jeux de donn ´ees r ´epartis sur des ensembles d’infrastructures et de syst `emes h ´et ´erog `enes.

I Simplifier la programmation d’applications DLC.

G. Fedak(INRIA/UBC) Active Data Vichy 2014 8/28

(9)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Outline

1 Introduction

2 Active Data

Principes & fonctionnalit ´es Impl ´ementation

3 Etude de cas : Framework de surveillance des donn ´ees pour APS

4 Discussion

5 Conclusion

(10)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Active Data principles

Le cycle de vie des donn ´ees d’un syst `eme est repr ´esent ´e par un mod `ele inspir ´e des r ´eseaux de Petri :

il contient desPlaces( ´etats des donn ´ees), desTransitions(actions sur les donn ´ees) et desTokens(une donn ´ee).

CREATED

t1

WRITTEN

t2

READ

t3

t4

TERMINATED

public void handler() { computeMD5(); }

Chaque token a un identifiant unique qui correspond `a la donn ´ee dans le syst `eme.

G. Fedak(INRIA/UBC) Active Data Vichy 2014 10/28

(11)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Active Data principles

Le cycle de vie des donn ´ees d’un syst `eme est repr ´esent ´e par un mod `ele inspir ´e des r ´eseaux de Petri :

il contient desPlaces( ´etats des donn ´ees), desTransitions(actions sur les donn ´ees) et desTokens(une donn ´ee).

CREATED

t1

WRITTEN

t2

READ

t3

t4

TERMINATED

public void handler() { computeMD5(); }

Une transition est d ´eclench ´ee lorsqu’une donn ´ee change d’ ´etat.

(12)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Active Data principles

Le cycle de vie des donn ´ees d’un syst `eme est repr ´esent ´e par un mod `ele inspir ´e des r ´eseaux de Petri :

il contient desPlaces( ´etats des donn ´ees), desTransitions(actions sur les donn ´ees) et desTokens(une donn ´ee).

CREATED

t1

WRITTEN

t2

READ

t3

t4

TERMINATED

public void handler() { computeMD5();

}

Du code peut- ˆetre associ ´e aux transitions par le programmeur. Il est ex ´ecut ´e lorsque la transition est d ´eclench ´ee.

G. Fedak(INRIA/UBC) Active Data Vichy 2014 10/28

(13)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Active Data features

Le mod `ele de programmation Active Data et l’environnement d’ex ´ecution permettent :

I de r ´eagir `a la progression des donn ´ees dans leur cycle de vie

I de repr ´esenter des jeux de donn ´ees distribu ´es de fac¸on transparente

I peut ˆetre int ´egr ´e dans des syst `emes existants de fac¸on non-intrusive

I performance “scalable” et surco ˆut minimal

(14)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Integration with Data Management Systems

Createdt1 To Place

t2

Deleted t3

Placed t4

Loop t5

2 t6

t7

Lost

t8 t9

(a) Bitdew Scheduler

Created t1 Ready

t2

Started t3 Completed

t4 Deleted t5Invalid

t6

t7 t8

(b) Bitdew File Transfer

IN CREATE t1 IN MOVED FROM

t2

t3 IN MOVED TO

t4

IN CLOSE WRITE t5

t6

t7 t8

t9 t10

t11

DELETED t14

t13 CREATED t12

(c) inotify

Get Put

Created t5

t6 t7

t8 Deleted

(d) iRODS

Created

t1 t2

Succeeded Failed

t3 t4

Deleted

(e) Globus Online Figure 3: Data life cycle models for four data management system.

structed from its documentation.

Reading the source code of BitDew, we observe that data items are managed by instances of theDataclass, and this class has thestatusvariable which holds the data item state. Therefore, we simply deduce from the enumeration of the possible value ofstatusthe set of corresponding places in the Petri Net (see Figure 2a and 2b). By further analyz- ing the source code, we construct the model and summarize how high level DLM features are modelized using Active Data model:

Scheduling and replicationPart of the complexity of the data life cycle in BitDew comes from the Data Scheduler that schedules data on nodes. Whenever a data item is scheduled on a node, a new replica is created. We represent replicas with a loop on thescheduledstate that creates an additional token every time a token passes through it.

Fault toleranceBecause one of BitDew’s target archi- tectures is Desktop Grids, it must deal with frequent faults, i.e. nodes going offline. When a data item is scheduled to a node, and the node disappears from the system, it is marked with aLoststate and will be scheduled to an other node.

This is represented by the loopPlaced,Lost,tosched- ule.

Composition of File Transfer and Data Scheduler In BitDew, the Data Scheduler and File Transfer Service are closely related, and so are their life cycles. A file transfer cannot exist without an associated data item, and a deleted data item cannot be transferred. To connect the two Petri Nets we need to define the start and stop places as explained in section 3. In BitDew, a new file transfer can be started for aDataobject in any state, exceptDeleted,Lostand Loop. To represent this, we define all the places but the three mentioned above as start places and connect them to the transfer life cycle model.

5. EXPERIMENTAL EVALUATION

In this section we report on experimental evaluation of the prototype using micro-benchmarks and use-case scenar- ios. Experiments are run on a cluster of the Grid’5000 ex- perimental platform [5] composed of 92 2-CPU nodes. Each CPU is an Intel Xeon L5420 with 4 cores running at 2.5Ghz.

Each node is equipped with 16GB of RAM and a 320GB

Figure 4: Average number of transitions per second handled by the Active Data Service

Sata II hard drive. Nodes are interconnected with Gigabit Ethernet and are running Linux 2.6.32.

5.1 Performance Evaluation

To evaluate the performance in terms of throughput, la- tency and overhead, we conduct a set of benchmarks based on the version 0.1.2 of the prototype1.

Throughput is measured as the number of transitions that Active Data is able to handle per second. In order to stress the system we run one Active Data server and a varying number of clients (one per cluster node), which publish 10,000 transitions in a loop without a pause between the iterations.

Figure 3 plots the average number of transitions processed per second against the number of clients. Once saturated, the Active Data Service can handle up to 32,000 transitions per second.

We evaluate the latency, i.e the time to create of a new life cycle and to publish a transition, and the overhead, i.e the additional time incurred by the Active Data runtime en- vironment. We use the BitDew file transfer operation as a reference to measure the overhead. The experiment consists in creating and uploading 1,000 files (1KB to stress the sys-

1Active Data is free software, available under the GPL li- cence http://active-data.gforge.inria.fr

I BitDew (INRIA), programmable environment for data management.

I inotify Linux kernel subsystem: notification system for file creation, modification, write, movement and deletion.

I iRODS (DICE, Univ. North Carolina), rule-oriented data management system.

I Globus Online (ANL) offers fast, simple and reliable service to transfer large volumes of data.

G. Fedak(INRIA/UBC) Active Data Vichy 2014 12/28

(15)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Impl ´ementation

I Prototype d ´evelopp ´e en Java

I communications bas ´ees Publish/Subscribe

I 2 types of souscriptions:

I Toutes les transitions pour une donn ´ee

I Toutes les donn ´ees pour une transition

Active Data

Service Client

Client

subscr

Client ibe

subscribe

(16)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Impl ´ementation

I Plusieurs possibilit ´es pour publier une transition

I Instrumentation du code

I Analyse des logs

I Syst `eme de notification existant

I The service ordonne les transitions par leur heures d’arriv ´ee

Active Data

Service Client

publish transition

Client

subscr

Client subscribe ibe

publishtransition

G. Fedak(INRIA/UBC) Active Data Vichy 2014 13/28

(17)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Impl ´ementation

I Clients ex ´ecute le code des transitions localement

I les codes des transitions sont ex ´ecut ´es

I s ´equentiellement

I de fac¸on bloquante

I dans l’ordre de publication des transitions

Active Data

Service Client

publish transition

Client

subscr ibe notify

Client subscribe notify

publishtransition

(18)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Evaluation de performances

10 50 100 200 300 400 450 500 550

# clients 5,000

10,000 15,000 20,000 25,000 30,000 35,000

Transitions per second

Figure: Average number of transitions per second handled by the Active Data Service

I Le client publie une s ´erie de 10K transitions

I Le serveur plafonne `a 30K transitions/sec.

G. Fedak(INRIA/UBC) Active Data Vichy 2014 14/28

(19)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Applications

I D ´evelopement d’applications distibu ´ees en se basant sur un paradigme de programmation orient ´e transition.

I Optimisation ou coordination de plusieurs syst `emes entre eux.

I Rendre des applications “DLC-aware”

I Monitorer le cycle de vie et reconstituer la provenance des donn ´ees

(20)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Outline

1 Introduction

2 Active Data

Principes & fonctionnalit ´es Impl ´ementation

3 Etude de cas : Framework de surveillance des donn ´ees pour APS

4 Discussion

5 Conclusion

G. Fedak(INRIA/UBC) Active Data Vichy 2014 16/28

(21)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Etude de cas : Framework de surveillance des donn ´ees pour APS

Objectifs :

I monitorer la progression dans le DLC

I meilleure automatisation

I Notification et partage

I D ´etection et reprise sur erreurs

(22)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Extension `a Active Data

I Associer desTagsaux donn ´ees

I Installer desTaggerssur les transitions

I Transitions guard ´ees: ne s’ex ´ecutent que pour les tokens poss ´edant certains tags.

I Handlers de notification:

Push.co, Twitter, gdoc, ifttt etc...

A. Progress Monitoring

Scientists require mechanisms to monitor their workflows from a high level, generate reports on progress, and identify potential errors without painstakingly auditing every dataset.

Monitoring is not limited to estimating completion time, but also:i)receiving a single relevant notification when several related events occurred in different systems;ii)quickly notic- ing that an operation failed within the mass of operations that completed normally;iii)identifying steps that take longer to run than usual, backtracking the chain of causality, fixing the problem at runtime and optimizing the workflow for future executions;iv)accelerating data sharing with the community by pushing notifications to collaborators and colleagues.

B. Automation

The APS workflow, like many scientific workflows, re- quires explicit human intervention to progress between stages and to recover from unexpected events. Such interventions include running scripts on generated datasets on the shared cluster, registering datasets in the Globus catalog, and execut- ing Swift analysis scripts on the compute cluster Such inter- ventions cannot be easily integrated in a traditional workflow system, because they reside a level of abstraction above the workflow system. In fact, they are the operations thatstartthe workflow systems.

C. Sharing and Notification

Scientific sharing can be made more efficient by allowing other scientists to be notified of new datasets availability (in the catalog) with powerful filters to extract only the notifications they need, and even to start processes as soon as files are available. We believe the best way for scientists to automatically integrate new datasets in their workflows is to rely on widely used dissemination mechanisms—such as Twitter. Twitter is an efficient notification mechanism that is commonly used by scientists. It also can be simply integrated via its APIs.

D. Error Discovery and Recovery

Each system participating in the APS experiment has only a partial picture of the entire process, which impairs their ability to recover from unexpected events. Thus, when such events occur, systems often fail ungracefully, leaving the scientists as the only one able to resolve the problem through costly manipulations. For example, if a Swift script fails when processing a file, it will simply report the error in a log file and return. Later, the when inspecting the output files, a scientist may see that some results are missing, they will then read the log and discover the faulty file. A quick look at the file will tell them it was corrupted and their only measure will be to again transfer the file to the computing cluster and re-start the analysis. In cases where entire datasets are left unprocessed the process for error identification is less clear. This requirement for low level human intervention and manipulation delays workflow completion, places a burden on scientists and wastes valuable computing resources. Moreover, such approaches are also prone to human error. Automating these steps would improve each of these aspects.

Life Cycle View File transfer

File Dataset Metadata

Guard

Code Execution

}

Tagged Tokens

Notification

Fig. 2: Data surveillance framework design

IV. SYSTEMDESIGN

We next present the data surveillance framework that we designed to satisfy the APS users’ needs presented above. We also elaborate on the design of specific features.

A. Data surveillance framework

Fig. 2 shows the main features of the data surveillance framework; the framework is able to track any data object (such as files and sets of files) as well as elements related to data (such as file transfers and metadata). The frameworks receives events from different systems and integrates the identifiers of data and related elements from them in a single global namespace. Data replicas and their current state are exposed astokensin a unified identifier used across systems.

Tokens provide a model for linking together related data in different systems. The framework allows users to be notified of the progress of their data and to automatically run custom code at many operational stages of the life cycle. execution on a subset of events, according to user choice.

B. Active Data

The complexity of implementing the surveillance frame- work comes from a lack ofintegration, impairing tight coor- dination between loosely coupled systems. Such coordination is incredibly challenging because, in order to be efficient, it must be as noninvasive as possible. Additional challenges stem from the lack of feedback derived from the systems, that are mostly regarded as black boxes from the user perspective.

G. Fedak(INRIA/UBC) Active Data Vichy 2014 18/28

(23)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Mod `ele du cycle de vie des donn ´ees de APS

Avalon Daniel Arnaud Anthony Vincent

Use-case: APS data life cycle model

Created Start transfer

Terminated End

Detector

Created

Success Failure

Succeeded Failed

End End

Terminated

End transfer

Globus transfer

Created

End

Terminated Start transfer

Shared storage

Created

Success Failure

Succeeded Failed

End End

Terminated

Start Swift

Globus transfer Created Extract

Update

Terminated Remove Globus Catalog

Created

Initialize

Set

End Failure

Terminated Derive

Swift

Data life cycle model composed of 6 systems.

(24)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Exemple: Data Provenance

Definition

Historique compl `ete des d ´erivations et des op ´erations sur une donn ´ee

I Estimer la qualit ´e du jeu de donn ´ees

I Garder les conditions d’acquisition et de transformation des donn ´ees

I PASS: Provenance Aware Storage Systems

−→What about heterogeneous systems?

Example withGlobus OnlineandiRODS

File transfer service Data store and metadata catalog

G. Fedak(INRIA/UBC) Active Data Vichy 2014 20/28

(25)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Exemple: Data Provenance

Definition

Historique compl `ete des d ´erivations et des op ´erations sur une donn ´ee

I Estimer la qualit ´e du jeu de donn ´ees

I Garder les conditions d’acquisition et de transformation des donn ´ees

I PASS: Provenance Aware Storage Systems

−→What about heterogeneous systems?

Example withGlobus OnlineandiRODS

File transfer service Data store and metadata catalog

(26)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Exemple: Data Provenance

Definition

Historique compl `ete des d ´erivations et des op ´erations sur une donn ´ee

I Estimer la qualit ´e du jeu de donn ´ees

I Garder les conditions d’acquisition et de transformation des donn ´ees

I PASS: Provenance Aware Storage Systems

−→What about heterogeneous systems?

Example withGlobus OnlineandiRODS

File transfer service Data store and metadata catalog

G. Fedak(INRIA/UBC) Active Data Vichy 2014 20/28

(27)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Data events coming from Globus Online and iRODS

CREATED GET

PUT TERMINATED

t5

t9

t6

t7

t8 t10

Id:{GO: 7b9e02c4-925d-11e2, iRODS: 10032}

public void handler() { annotate(); }

CREATED

t1 t2

SUCCEEDED FAILED

t3 t4

TERMINATED

Id:{GO: 7b9e02c4-925d-11e2}

public void handler() { iput(...); }

(28)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

G. Fedak(INRIA/UBC) Active Data Vichy 2014 21/28

Data events coming from Globus Online and iRODS

CREATED GET

PUT TERMINATED

t5

t9

t6

t7

t8 t10

iRODS

Id:{GO: 7b9e02c4-925d-11e2, iRODS: 10032}

public void handler() { annotate(); }

CREATED

t1 t2

SUCCEEDED FAILED

t3 t4

TERMINATED

Globus Online

Id:{GO: 7b9e02c4-925d-11e2}

public void handler() { iput(...);

}

(29)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Data events coming from Globus Online and iRODS

CREATED GET

PUT TERMINATED

t5

t9

t6

t7

t8 t10

Id:{GO: 7b9e02c4-925d-11e2, iRODS: 10032}

public void handler() { annotate(); }

CREATED

t1 t2

SUCCEEDED FAILED

t3 t4

TERMINATED

Id:{GO: 7b9e02c4-925d-11e2}

public void handler() { iput(...); }

(30)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

G. Fedak(INRIA/UBC) Active Data Vichy 2014 21/28

Data events coming from Globus Online and iRODS

CREATED GET

PUT TERMINATED

t5

t9

t6

t7

t8 t10

iRODS

Id:{GO: 7b9e02c4-925d-11e2, iRODS: 10032}

public void handler() { annotate();

}

CREATED

t1 t2

SUCCEEDED FAILED

t3 t4

TERMINATED

Globus Online

Id:{GO: 7b9e02c4-925d-11e2}

public void handler() { iput(...); }

(31)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

$ imeta ls -d test/out_test_4628

AVUs defined for dataObj test/out_test_4628:

attribute: GO_FAULTS value: 0

----

attribute: GO_COMPLETION_TIME value: 2013-03-21 19:28:41Z ----

attribute: GO_REQUEST_TIME value: 2013-03-21 19:28:17Z ----

attribute: GO_TASK_ID

value: 7b9e02c4-925d-11e2-97ce-123139404f2e ----

attribute: GO_SOURCE value: go#ep1/˜/test ----

attribute: GO_DESTINATION

value: asimonet#fraise/˜/out_test_4628

(32)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Outline

1 Introduction

2 Active Data

Principes & fonctionnalit ´es Impl ´ementation

3 Etude de cas : Framework de surveillance des donn ´ees pour APS

4 Discussion

5 Conclusion

G. Fedak(INRIA/UBC) Active Data Vichy 2014 23/28

(33)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

I Fac¸on simple et graphique de programmer des applications DLC

I Permets de v ´erifier formellement des propri ´et ´es du DLC

I Coordination entre les syst `emes

I Facile `a monitorer

I Facilite la tol ´erance aux pannes

I Int ´eraction fine avec le DLC

(34)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

I Complexit ´e `a raisonner en terme d’ ´evennements

I Manque de standardisation

G. Fedak(INRIA/UBC) Active Data Vichy 2014 25/28

(35)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Outline

1 Introduction

2 Active Data

Principes & fonctionnalit ´es Impl ´ementation

3 Etude de cas : Framework de surveillance des donn ´ees pour APS

4 Discussion

5 Conclusion

(36)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Active Data c’est. . .

I Data-centric & Event-driven

I Int ´egration des donn ´ees au niveau du syst `eme La suite

I Active Data comme sp ´ecification de composition de services

I Collection et flux de donn ´ees

G. Fedak(INRIA/UBC) Active Data Vichy 2014 27/28

(37)

Outline Introduction Active Data Etude de cas : Framework de surveillance des donn ´ees pour APS Discussion Conclusion

Merci!

Questions?

Références

Documents relatifs

Perdre ses photos de vacances : ¸ca n’a pas de

Programme des enseignements – Syst` emes d’information et conception d’entrepˆ ots de donn´ ees – Les principes et la d´ emarche du Data Mining (fouille de donn´ ees)

Main goal: Integrate Web data and reason about it using provenance to estimate the reliability of results Reasoning and reliability go hand in hand:. • Data from one source

[r]

Reformuler et d ´ecomposer une requ ˆete utilisateur sur le sch ´ema global en des requ ˆetes sur le sch ´ema local qui sont ´evalu ´ees sur les sources de donn ´ees. Combiner les

Reformuler et d ´ecomposer une requ ˆete utilisateur sur le sch ´ema global en des requ ˆetes sur le sch ´ema local qui sont ´evalu ´ees sur les sources de donn ´ees. Combiner les

Une exp´ erience par coloration a montr´ e qu’une rivi` ere souterraine alimente une r´ esurgence dans la vall´ ee. La rivi` ere souterraine a un d´ ebit tr` es sensible aux

Ecrire en Java la gestion d’un tas, repr´esent´e par un tableau : cr´eer, ins´erer, minimum, supprimer (le minimum), modifier une valeur.. Vous programmerez ceci en TP, et