• Aucun résultat trouvé

Scaling Smart Appliances for Spatial Data Synthesis

N/A
N/A
Protected

Academic year: 2021

Partager "Scaling Smart Appliances for Spatial Data Synthesis"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-01241718

https://hal.inria.fr/hal-01241718

Submitted on 10 Dec 2015

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Scaling Smart Appliances for Spatial Data Synthesis

Luis Pineda-Morales, Balaji Subramaniam, Kate Keahey, Gabriel Antoniu,

Alexandru Costan, Shaowen Wang, Anand Padmanabhan, Aiman Soliman

To cite this version:

Luis Pineda-Morales, Balaji Subramaniam, Kate Keahey, Gabriel Antoniu, Alexandru Costan, et al.. Scaling Smart Appliances for Spatial Data Synthesis. SC15 - ACM/IEEE International Conference in Supercomputing, Nov 2015, Austin, United States. 2015. �hal-01241718�

(2)

1

Microsoft Research – Inria Joint Centre, FR

2

KerData Project-Team, Inria, FR

3

Argonne National Laboratory, US

4

IRISA / INSA Rennes, FR

5

CyberGIS Center for Advanced Digital and Spatial Studies, UIUC, US

6

National Center for Supercomputing Applications, UIUC, US

7

Department of Geography and Geographic Information Science, UIUC, US

Luis Pineda-Morales

1,2

, Balaji Subramaniam

3

, Kate Keahey

3

, Gabriel Antoniu

2

, Alexandru Costan

2,4

Shaowen Wang

5,6,7

, Anand Padmanabhan

5,6,7

, and Aiman Soliman

5,6

• Spatial data streams have grown

considerably in size and complexity

• Volatility and growing number of user

requests through CyberGIS Gateway [1]

• Software integration for sustained geospatial

innovation

• Achieve scalability for

large geospatial problems

and number of users

• Use Case: Spatial index for estimating home

and work relocation, and unemployment rates

using geo-located twitter data

• 70%+ improvement with larger block size,

compared to the default 128 MB

• “Sweet spot” more evident in larger datasets

• Large dataset: disk saturation

• 2 bare-metal Chameleon nodes: 24 cores (48 VCPUs), 256 GB disk, 128 GB RAM

• Hadoop 2.7 (YARN): 128 MB default HDFS block size, up to 192 containers

• Up to 20 days of geo-located tweets (~1.9GB/day)

FILTER

CLASSIFY

CLUSTER

VISUALIZE

(GIS)

• How to deal with data

and user requests

volatility?

• How can we provision

appliances quickly and

efficiently?

[1] CyberGIS Software Integration for Sustained

Geospatial Innovation http://cybergis.cigi.uiuc.edu/

[2] Infrastructure Outsourcing in Multi-Cloud

Environment, Keahey, K., et al. Workshop on Cloud

Services, Federation, and the 8th Open Cirrus

Summit. 2012

• Scale out to months of geo-located tweets

• Test in other [virtualized] infrastructures

• Define models for elastic appliances

• Support for new job releases of the use case

involving Phantom elastic provisioner [2]

• Luis Pineda-Morales’ internship at Argonne National Laboratory was funded by the

Data@Exascale Inria-ANL Associate Team and by the NextGN Inria-ANL PUF project

• This material is based in part upon work supported by the U.S. National Science

Foundation under grant numbers: 1047916, 1429699, and 1443080. Insightful comments

and feedbacks received from the following members of the CyberGIS Center: Junjun Yin

and Kiumars Soltani are greatly appreciated

• Scale the computer resources in order to meet

volatility

• Smart Appliances

Social

Networks

Sensors

Scanners

Data

Stream

Data Storage

Elastic

Resources

User requests

Provisioner

Analytics

Algorithms

Portal

Appliance

Application

Smart

Appliance

Environment

Application

Environment

Model

.

.

.

.

.

.

Appliance

Repository

Références

Documents relatifs

The neutrosophic fuzzy goal programming is constructed to minimize the group tolerance of a satisfactory degrees, and to attain the highest degree for truth (unity),

Analyse du plutonium en solution par spectrophotométrie d’absorption: vue d’ensemble des techniques mises en oeuvre au Laboratoire de Conversion des Actinides et

Transpiration et comportement vis-à-vis de la sécheresse de jeunes plants forestiers (Abies alba Mill., Picea abies (L.) Karsten., Pinus nigra Arn. laricio Poir., Pinus strobus

Et ce, d'autant plus que ces savoir- faire ne sont pas toujours aisément explicitables par les opérateurs (cf. Bounot et al., 1996) et que les activités de maintenance

Afin mettre en œuvre la politique de tri et de recyclage des emballages ménagers de la collectivité et d’assurer le suivi avec l’éco-organisme en charge de la

Ces tendances se retrouvent chez le chien, pour lesquels les antiparasitaires représentent également à eux seuls plus de la moitié des cas déclarés d’usages hors AMM, avec des

This study aimed to identify DPs according to generation and sex using reduced-rank regression (RRR) with metabolic syndrome components as intermediate markers and assess

Unlike the winter demonstrations, who were ca rried out by the inhabitants of Bucharest, the on es taking place on the stadium, on the 23rd of August, were carried out by