Talks and Poster Presentations (with Proceedings-Entry):
"Micro-benchmarking MPI Neighborhood Collective Operations";
Talk: Euro-Par 2017 23rd International European Conference on Parallel and Distributed Computing,
Santiago de Compostela, Spain;
- 2017-09-01; in: "Euro-Par 2017: Parallel Processing, 23rd International Conference on Parallel and Distributed Computing, Proceedings",
F. Rivera, T. Pena, J. Cabaleiro (ed.);
In this article, performance expectations for MPI neighborhood collective operations are formulated as self-consistent performance guidelines. A microbenchmark and an experimental methodology are presented to assess these guidelines. Measurement results from a large, InfiniBand-based cluster, the Vienna Scientific Cluster (VSC), as well as from a small commodity cluster computer are shown and discussed to illustrate the methodology and to gain first insights into the performance of current MPI implementations. Results show that the examined libraries seem to be sensitive to the order in which topological neighbors are specified, and that in some cases Cartesian topologies can be outperformed by simulating them with distributed graph topologies.
MPI · Process topology · Neighborhood collectives · Performance guidelines · Benchmarking
"Official" electronic version of the publication (accessed through its Digital Object Identifier - DOI)
Project Head Jesper Larsson Träff:
Created from the Publication Database of the Vienna University of Technology.