• Aucun résultat trouvé

Design and Evaluation of Topology-aware Scatter and AllGather Algorithms for Dragonfly Networks

N/A
N/A
Protected

Academic year: 2021

Partager "Design and Evaluation of Topology-aware Scatter and AllGather Algorithms for Dragonfly Networks"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-01400271

https://hal.inria.fr/hal-01400271

Submitted on 21 Nov 2016

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Design and Evaluation of Topology-aware Scatter and AllGather Algorithms for Dragonfly Networks

Nathanaël Cheriere, Matthieu Dorier

To cite this version:

Nathanaël Cheriere, Matthieu Dorier. Design and Evaluation of Topology-aware Scatter and All-Gather Algorithms for Dragonfly Networks. Supercomputing 2016, Nov 2016, Salt Lake City, United States. Supercomputing 2016. �hal-01400271�

(2)

Design and Evaluation of Topology-aware Scatter and AllGather Algorithms

for Dragonfly Networks

Nathana¨el Cheriere1, Matthieu Dorier2.

1 ENS Rennes, 2 Argonne National Laboratory

New efficient communication algorithms for Dragonfly

- Cost of traditional topologies too high at scale

- Dragonfly: a new topology

- Opportunity for new efficient communication algorithms

g g g g g r r r r r Router Local link t t t t t t t t t t g Group Global link Terminal Simple Representation

Global link Only local link

Challenge: Global links are potential bottlenecks

AllGather

1 2 3 4 1 2 3 4 time

Existing algorithms

RING

Default in MPICH for 80 KiB and more

BRUCK - Bruck’s algorithm

0: 1: 2: 3:

Default in MPICH for less than 80 KiB

Our contribution

TAR - Topology Aware Ring

Idea: mimimize global link utilization 1. Build a smart ring

2. Send data

Minimize data transfers on global links

Evaluation

- CODES based on ROSS, an event driven simulator

- 5256 terminals: 73 groups, 12 routers per group, and 6 terminals per router - Bandwidth:

* Global links at 4.7 GiB/s, * Local links at 5.25 GiB/s,

* Terminal links at 5.25 GiB/s

1 KiB per terminal

0 2 4 6 8 10 12 128 256 512 1024 2048 3072 4096 Runtime (ms) Allocation size

RING TAR BRUCK

0 0.2 0.4 0.6 0.8 1 128 256 512

Conclusion

- Topology awareness matters: TAR wins! - BRUCK still better for small transfers

- Background traffic does not change the behavior

Scatter

1 2 3 4

1 2 3 4

time

Existing Algorithms

TREE - Binomial Tree

Default in MPICH

Simple approach

LIN

Send data to terminals one by one Minimize link utilization

Our contribution

GLF - Global Link First

Using TREE:

1. Send data to groups 2. Send data to routers 3. Send data to terminals

Use global links only during the first phase

LLF - Local Link First

Using TREE:

1. Send data to routers in root group

2. Send data to groups 3. Send data to routers 4. Send data to terminals

Minimize data transfers on global links

LIN twice faster than the others!

Sending 1 KiB per terminal

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 128 256 512 1024 2048 3072 4096 Runtime (ms) Allocation size TREE GLF LLF LIN 0 0.1 0.2 0.3 0.4 0.5 128 256 512

With uniform background traffic

0 0.5 1 1.5 2 128 256 512 1024 2048 3072 4096 Runtime (ms) Allocation size TREE GLF LLF LIN/LIN+ 0 0.1 0.2 0.3 0.4 0.5 128 256 512

Major impact factor: the buffer size!

For TREE, GLF, and LLF:

Sender Input buffer Router

Output buffer Slowdown DATA Receiver FULL FULL

- Large amounts of data transferred - Fills buffers, slows down transfer

For LIN:

Sender Input buffer Router

Output

buffer

Receiver Receiver Receiver Receiver

DATA

- 1 KiB transferred each time

- No buffer saturated, no slowdown

Conclusion

- Topology and hardware characteristics impact the performance of algorithms - LIN is twice faster than LLF, GLF, and TREE

- No significant differences between LLF, GLF, and TREE

Discussion

Why is it complex to make AllGather hardware-aware?

- A lot more data sent over the network creating more contention

- Overall run time determined by the slowest link due to the ring structure

- Terminals transfer on the same path (TAR, RING), no balance across buffers

Ongoing work

- Broadcast as Scatter followed by AllGather

- Computation and communication algorithms such as Reduce - Energy efficiency of communication algorithms

Références

Documents relatifs

To test whether the vesicular pool of Atat1 promotes the acetyl- ation of -tubulin in MTs, we isolated subcellular fractions from newborn mouse cortices and then assessed

Néanmoins, la dualité des acides (Lewis et Bronsted) est un système dispendieux, dont le recyclage est une opération complexe et par conséquent difficilement applicable à

Cette mutation familiale du gène MME est une substitution d’une base guanine par une base adenine sur le chromosome 3q25.2, ce qui induit un remplacement d’un acide aminé cystéine

En ouvrant cette page avec Netscape composer, vous verrez que le cadre prévu pour accueillir le panoramique a une taille déterminée, choisie par les concepteurs des hyperpaysages

Chaque séance durera deux heures, mais dans la seconde, seule la première heure sera consacrée à l'expérimentation décrite ici ; durant la seconde, les élèves travailleront sur

A time-varying respiratory elastance model is developed with a negative elastic component (E demand ), to describe the driving pressure generated during a patient initiated

The aim of this study was to assess, in three experimental fields representative of the various topoclimatological zones of Luxembourg, the impact of timing of fungicide

Attention to a relation ontology [...] refocuses security discourses to better reflect and appreciate three forms of interconnection that are not sufficiently attended to