Driving a Petascale HPC Center with Octoshell Management System

D. A. Nikitenko; Vad. V. Voevodin; S. A. Zhumatiy

doi:10.1134/S1995080219110192

Authors: Nikitenko D.A.¹, Voevodin V.V.¹, Zhumatiy S.A.¹
Affiliations:
1. Research Computing Center
Issue: Vol. 40, No. 11 (2019)
Pages: 1817-1830
Section: Article
URL: https://journals.rcsi.science/1995-0802/article/view/206064
DOI: https://doi.org/10.1134/S1995080219110192
ID: 206064

Abstract

Running any computing center is a complex task, and as scale and cost grow, it becomes a serious challenge. The top supercomputer sites have therefore always required special approaches to management, control, and maintenance. Today, a large HPC center can host a variety of very different systems containing up to millions of components, with thousands of users worldwide running a full range of complex applications. Large volumes of data must be managed in a coordinated way to keep such an information factory functioning. This paper shares the design principles, some implementation details, and the roadmap for the Octoshell HPC center management system, which has been developed and is currently used in the everyday practice of the Moscow State University supercomputer center. This open source system currently manages the Lomonosov and Lomonosov-2 supercomputers, with a combined peak performance of over 5 PFlops, and provides multiple tools that address the most typical workflow tasks of both regular users and system administrators in a single shell.

Keywords

large-scale system administering, automation of administering routines, managing HPC systems, user support, fault-tolerant administering, HPC center workflow
