Optimization of MPI-Process Mapping for Clusters with Angara Interconnect


如何引用文章

全文:

开放存取 开放存取
受限制的访问 ##reader.subscriptionAccessGranted##
受限制的访问 订阅存取

详细

An algorithm of MPI processes mapping optimization is adapted for supercomputers with interconnect Angara. The mapping algorithm is based on partitioning of parallel program communication pattern. It is performed in such a way that the processes between which the most intensive exchanges take place are tied to the nodes/processors with the highest bandwidth. The algorithm finds a near-optimal distribution of its processes for processor cores to minimize the total execution time of exchanges between MPI processes. The analysis of results of optimized placement of processes using proposed method on small supercomputers is shown. The analysis of the dependence of the MPI program execution time on supercomputer parameters and task parameters is performed. A theoretical model is proposed for estimation of effect of mapping optimization on the execution time for several types of supercomputer topologies. The prospect of using implemented optimization library for large-scale supercomputers with the interconnect Angara is discussed.

作者简介

M. Khalilov

National Research University Higher School of Economics

编辑信件的主要联系方式.
Email: mkhalilov@hse.ru
俄罗斯联邦, ul. Myasnitskaya 20, Moscow, 101000

A. Timofeev

National Research University Higher School of Economics; Joint Institute for High Temperatures of the Russian Academy of Sciences

Email: mkhalilov@hse.ru
俄罗斯联邦, ul. Myasnitskaya 20, Moscow, 101000; ul. Izhorskaya 20, str. 2, Moscow, 125412


版权所有 © Pleiades Publishing, Ltd., 2018
##common.cookie##