Numerical algorithms and fault tolerance of hyperexascale computer systems


Cite item

Full Text

Open Access Open Access
Restricted Access Access granted
Restricted Access Subscription Access

Abstract

A new method is discussed which provides the possibility of long-term continuous calculations on a computing systems consisting of millions of operating devices, some of which may suffer failures in the course of calculation. The method relies on the properties of hyperbolized systems of partial differential equations, for which the domain of influence on the solution is localized in space. As a result, the necessary part of the solution can be rapidly recalculated without restarting the whole calculation process. The number of additional processors required for executing the recalculation is estimated.

About the authors

B. N. Chetverushkin

Federal Research Center Keldysh Institute of Applied Mathematics

Author for correspondence.
Email: chetver@imamod.ru
Russian Federation, Moscow, 125047

M. V. Yakobovskiy

Federal Research Center Keldysh Institute of Applied Mathematics

Email: chetver@imamod.ru
Russian Federation, Moscow, 125047


Copyright (c) 2017 Pleiades Publishing, Ltd.

This website uses cookies

You consent to our cookies if you continue to use our website.

About Cookies