Optimised allgatherv, reduce_scatter and allreduce communication in message-passing systems

06/23/2020
by   Andreas Jocksch, et al.
0

Collective communications, namely the patterns allgatherv, reduce_scatter, and allreduce in message-passing systems are optimised based on measurements at the installation time of the library. The algorithms used are set up in an initialisation phase of the communication, similar to the method used in so-called persistent collective communication introduced in the literature. For allgatherv and reduce_scatter the existing algorithms, recursive multiply/divide and cyclic shift (Bruck's algorithm) are applied with a flexible number of communication ports per node. The algorithms for equal message sizes are used with non-equal message sizes together with a heuristic for rank reordering. The two communication patterns are applied in a plasma physics application that uses a specialised matrix-vector multiplication. For the allreduce pattern the cyclic shift algorithm is applied with a prefix operation. The data is gathered and scattered by the cores within the node and the communication algorithms are applied across the nodes. In general our routines outperform the non-persistent counterparts in established MPI libraries by up to one order of magnitude or show equal performance, with a few exceptions of number of nodes and message sizes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2023

MPI Advance : Open-Source Message Passing Optimizations

The large variety of production implementations of the message passing i...
research
04/20/2020

A Generalization of the Allreduce Operation

Allreduce is one of the most frequently used MPI collective operations, ...
research
06/02/2023

Local Message Passing on Frustrated Systems

Message passing on factor graphs is a powerful framework for probabilist...
research
11/14/2018

Communication-Optimal Distributed Dynamic Graph Clustering

We consider the problem of clustering graph nodes over large-scale dynam...
research
06/27/2023

Exploring Fully Offloaded GPU Stream-Aware Message Passing

Modern heterogeneous supercomputing systems are comprised of CPUs, GPUs,...
research
12/06/2018

Local Conditioning: Exact Message Passing for Cyclic Undirected Distributed Networks

This paper addresses practical implementation of summing out, expanding,...
research
09/17/2021

Sparbit: a new logarithmic-cost and data locality-aware MPI Allgather algorithm

The collective operations are considered critical for improving the perf...

Please sign up or login with your details

Forgot password? Click here to reset