Optimization of MPI-Process Mapping for Clusters with Angara Interconnect


Цитировать

Полный текст

Открытый доступ Открытый доступ
Доступ закрыт Доступ предоставлен
Доступ закрыт Только для подписчиков

Аннотация

An algorithm of MPI processes mapping optimization is adapted for supercomputers with interconnect Angara. The mapping algorithm is based on partitioning of parallel program communication pattern. It is performed in such a way that the processes between which the most intensive exchanges take place are tied to the nodes/processors with the highest bandwidth. The algorithm finds a near-optimal distribution of its processes for processor cores to minimize the total execution time of exchanges between MPI processes. The analysis of results of optimized placement of processes using proposed method on small supercomputers is shown. The analysis of the dependence of the MPI program execution time on supercomputer parameters and task parameters is performed. A theoretical model is proposed for estimation of effect of mapping optimization on the execution time for several types of supercomputer topologies. The prospect of using implemented optimization library for large-scale supercomputers with the interconnect Angara is discussed.

Ключевые слова

Об авторах

M. Khalilov

National Research University Higher School of Economics

Автор, ответственный за переписку.
Email: mkhalilov@hse.ru
Россия, ul. Myasnitskaya 20, Moscow, 101000

A. Timofeev

National Research University Higher School of Economics; Joint Institute for High Temperatures of the Russian Academy of Sciences

Email: mkhalilov@hse.ru
Россия, ul. Myasnitskaya 20, Moscow, 101000; ul. Izhorskaya 20, str. 2, Moscow, 125412


© Pleiades Publishing, Ltd., 2018

Данный сайт использует cookie-файлы

Продолжая использовать наш сайт, вы даете согласие на обработку файлов cookie, которые обеспечивают правильную работу сайта.

О куки-файлах