• Aucun résultat trouvé

Toward Autonomic Grids: Analyzing the Job Flow with Affinity Streaming

N/A
N/A
Protected

Academic year: 2022

Partager "Toward Autonomic Grids: Analyzing the Job Flow with Affinity Streaming"

Copied!
19
0
0

Texte intégral

(1)

Toward Autonomic Grids: Analyzing the Job Flow with Affinity Streaming

Xiangliang Zhang, Cyril Furtlehner

Julien Perez, Cécile Germain-Renaud, Michèle Sebag

TAO teamINRIA CNRS

Université de Paris-Sud, F-91405 Orsay Cedex, France

29 June 2009

(2)

Motivation: Autonomic Computing G-StrAP: Data Streaming for Jobs Results: Multi-scale Monitoring Conclusion and Perspective

Contents

1 Motivation: Autonomic Computing

2 G-StrAP: Data Streaming for Jobs

3 Results: Multi-scale Monitoring

4 Conclusion and Perspective

X. Zhang, C. Furtlehner, J. Perez, C. Germain, M. Sebag Toward Autonomic Grids: Analyzing the Job Flow byStrAP

(3)

Motivations of Autonomic Computing

(4)

Goals of Autonomic Computing

AUTONOMIC VISION & MANIFESTO http://www.research.ibm.com/autonomic/manifesto/

Self-managing system with the ability of Self-healing: detect, diagnose and repair problems Self-configuring: automatically incorporate and configure components

Self-optimizing: ensure the optimal functioning wrt defined requirements

Self-protecting: anticipate and defend against security breaches

Data Mining for Autonomic Computing

(5)

Autonomic Grid Computing System

(6)

Motivation: Autonomic Computing G-StrAP: Data Streaming for Jobs Results: Multi-scale Monitoring Conclusion and Perspective

Contents

1 Motivation: Autonomic Computing

2 G-StrAP: Data Streaming for Jobs

3 Results: Multi-scale Monitoring

4 Conclusion and Perspective

X. Zhang, C. Furtlehner, J. Perez, C. Germain, M. Sebag Toward Autonomic Grids: Analyzing the Job Flow byStrAP

(7)

G-StrAP: relies on Affinity Propagation (AP)

Affinity Propagation (AP) [Frey2007]

statistic physics algorithm for clustering (based on messaging passing)

a cluster = an exemplar (akin k-centers)

the model = set of {exemplar, frequency}

Why AP ?

Traceability: real jobs as exemplars

because of categorical variables, e.g., userid, queue name etc No prior knowledge of K, number of clusters

quasi optimality wrt. information loss

—> stability [Meila2006]

(8)

From AP to Large-scale Data Streaming 1 SCALABILITY

: fromO(N2logN) toO(Nh+h+21) Hierarchical Affinity Propagation

negligible infromation loss (proof in the paper)

(9)

From AP to Large-scale Data Streaming

2 Non stationary distribution

various Virtual Organization number and expertise of users Streaming AP (StrAP)

(10)

Non stationary distribution, continue

Page-Hinkley statistic

(Cumulative-Sum-like test) [Page54,Hinkley70]

0 100 200 300 400 500 600 700 800 900 1000

−5 0 5 10 15 20 25 30 35 40

time t

pt p¯t mt Mt

ptchanging distribution

¯ pt= 1tPt

=1p

mt=Pt

=1(p¯p+δ) Mt=max{m} PHt=Mtmt

ifPHt> λ, changed detected

How to set λ ???

(11)

Self-adaptive change detection test

Self-adapt λ

An optimization problem BIC: Fλ= |C|1 P|C|

i=1 1 ni

P

ej∈Ci d(ej,ei)

ρ2logN+ηOt

∝loss + size of model + percentage of outlier OPTIMIZATION:

ǫ-greedy searchfrom afinite set ofλvalues λ=argmin{E(Fλ}),

λ1 λ2 λ3 λ4 ...

E(Fλ1) E(Fλ2) E(Fλ3) E(Fλ4) ...

Gaussian Process Regression based on{λi,Fλi} continuous value ofλis generated

(12)

Motivation: Autonomic Computing G-StrAP: Data Streaming for Jobs Results: Multi-scale Monitoring Conclusion and Perspective

Contents

1 Motivation: Autonomic Computing

2 G-StrAP: Data Streaming for Jobs

3 Results: Multi-scale Monitoring

4 Conclusion and Perspective

X. Zhang, C. Furtlehner, J. Perez, C. Germain, M. Sebag Toward Autonomic Grids: Analyzing the Job Flow byStrAP

(13)

G-StrAP : Multi-scale Realtime Monitor

Dashboard for Monitoring

(14)

G-StrAP Dashboard for Grid Monitoring

Online Monitoring

1 2 3 4 5

0 20 40 60 80 100

Reservoir 7 0 0 0 0 0

10 47 54 129 0 0

8 18 24 30 595 139

7 13 14 24 9728 19190

Clusters

Percentage of jobs assigned (%)

exemplar shown as a job vector

1 2 3 4 5 6 7 8

0 20 40 60 80 100

Reservoir 0 0 0 0 0 0

7 0 0 0 0 0

10 47 54 129 0 0

9 18 25 20110 0 0

8 18 24 30 595 139

6 5 10 14 127 10854

10 18 29 20091 395 276

LogMonitor is getting clogged

(15)

G-StrAP Dashboard for Grid Monitoring Online Monitoring

1 2 3 4 5

0 20 40 60 80 100

Reservoir 7 0 0 0 0 0

10 47 54 129 0 0

8 18 24 30 595 139

7 13 14 24 9728 19190

Clusters

Percentage of jobs assigned (%)

exemplar shown as a job vector

1 2 3 4 5 6 7 8

0 20 40 60 80 100

Reservoir 0 0 0 0 0 0

7 0 0 0 0 0

10 47 54 129 0 0

9 18 25 20110 0 0

8 18 24 30 595 139

6 5 10 14 127 10854

10 18 29 20091 395 276

LogMonitor is getting clogged

Off-line Analysis

no prior knowledge about failure patterns

summarizing Terabyte data

(16)

Motivation: Autonomic Computing G-StrAP: Data Streaming for Jobs Results: Multi-scale Monitoring Conclusion and Perspective

Contents

1 Motivation: Autonomic Computing

2 G-StrAP: Data Streaming for Jobs

3 Results: Multi-scale Monitoring

4 Conclusion and Perspective

X. Zhang, C. Furtlehner, J. Perez, C. Germain, M. Sebag Toward Autonomic Grids: Analyzing the Job Flow byStrAP

(17)

Discussion and Conclusion

G-StrAP:

linear computational complexityO(N1+ǫ)with bounded information loss

dealing with non stationary distribution data with self-adaptive change detectiontest

G-StrAP for Grid Monitoring

Traceability of failures

providing multi-scale models to describing the status of Grid online description of different type of job patterns

offline globally analysis

good quality clustering wrt. supervised learning

(18)

Perspectives

Theoretical Perspectives:

limitation 1: AP single parameter s∗

—> self-adapts∗

limitation 2: better coping with flowing patterns

—> Hierarchical G-StrAP, absorb new patterns from reservoir more efficiently

Applicative Perspectives:

profiling users

—> a user = a histogram of job exemplars customized interface

characterize evolution of users or virtual organizations build crisis scenari to test Middleware

(19)

Xiangliang ZHANG [email protected] http://www.lri.fr/∼xlzhang

Références

Documents relatifs

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des

5 6 3D imaging of a trout ovary sample after C-Eci and 3D analysis of 7 follicular content 8 The C-Eci procedure for 3D ovarian imaging was applied to the rainbow trout that

Liu, “A haptic shared control approach to teleoperation of mobile robots,” in 2015 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems

Le sulfate de calcium et l'oxalate de manganèse à deux molécules d'eau peuvent conduire par chauffage sous atmo~phère contrôlée à des phases solides

We are given (i) a streaming application whose dependence graph is a linear chain; (ii) a one-to-many mapping of the application onto a fully heterogeneous target platform, where

In the previous section, we have shown how to compute the throughput when all communication times and all processing times are exponential variables (and this even in polynomial

Ce dernier énoncé confirme de manière explicite la distance entre le narrateur et sa religiosité censée être sans reproche. La déclaration en question n’est

In this paper, we propose an architecture inspired by Human Autonomic Nervous System (HANS) which embodies AI planning in order to enable auto- nomic properties such