Numerical algorithms and fault tolerance of hyperexascale computer systems
- Authors: Chetverushkin B.N.1, Yakobovskiy M.V.1
-
Affiliations:
- Federal Research Center Keldysh Institute of Applied Mathematics
- Issue: Vol 95, No 1 (2017)
- Pages: 7-11
- Section: Mathematics
- URL: https://journals.rcsi.science/1064-5624/article/view/224688
- DOI: https://doi.org/10.1134/S1064562417010021
- ID: 224688
Cite item
Abstract
A new method is discussed which provides the possibility of long-term continuous calculations on a computing systems consisting of millions of operating devices, some of which may suffer failures in the course of calculation. The method relies on the properties of hyperbolized systems of partial differential equations, for which the domain of influence on the solution is localized in space. As a result, the necessary part of the solution can be rapidly recalculated without restarting the whole calculation process. The number of additional processors required for executing the recalculation is estimated.
About the authors
B. N. Chetverushkin
Federal Research Center Keldysh Institute of Applied Mathematics
Author for correspondence.
Email: chetver@imamod.ru
Russian Federation, Moscow, 125047
M. V. Yakobovskiy
Federal Research Center Keldysh Institute of Applied Mathematics
Email: chetver@imamod.ru
Russian Federation, Moscow, 125047