Optimization of MPI-Process Mapping for Clusters with Angara Interconnect


Cite item

Full Text

Open Access Open Access
Restricted Access Access granted
Restricted Access Subscription Access

Abstract

An algorithm of MPI processes mapping optimization is adapted for supercomputers with interconnect Angara. The mapping algorithm is based on partitioning of parallel program communication pattern. It is performed in such a way that the processes between which the most intensive exchanges take place are tied to the nodes/processors with the highest bandwidth. The algorithm finds a near-optimal distribution of its processes for processor cores to minimize the total execution time of exchanges between MPI processes. The analysis of results of optimized placement of processes using proposed method on small supercomputers is shown. The analysis of the dependence of the MPI program execution time on supercomputer parameters and task parameters is performed. A theoretical model is proposed for estimation of effect of mapping optimization on the execution time for several types of supercomputer topologies. The prospect of using implemented optimization library for large-scale supercomputers with the interconnect Angara is discussed.

About the authors

M. R. Khalilov

National Research University Higher School of Economics

Author for correspondence.
Email: mkhalilov@hse.ru
Russian Federation, ul. Myasnitskaya 20, Moscow, 101000

A. V. Timofeev

National Research University Higher School of Economics; Joint Institute for High Temperatures of the Russian Academy of Sciences

Email: mkhalilov@hse.ru
Russian Federation, ul. Myasnitskaya 20, Moscow, 101000; ul. Izhorskaya 20, str. 2, Moscow, 125412


Copyright (c) 2018 Pleiades Publishing, Ltd.

This website uses cookies

You consent to our cookies if you continue to use our website.

About Cookies