The DEEP-ER project: I/O and resiliency extensions for the Cluster-Booster architecture

04/15/2019
by   Anke Kreuzer, et al.
0

The recently completed research project DEEP-ER has developed a variety of hardware and software technologies to improve the I/O capabilities of next generation high-performance computers, and to enable applications recovering from the larger hardware failure rates expected on these machines. The heterogeneous Cluster-Booster architecture –first introduced in the predecessor DEEP project– has been extended by a multi-level memory hierarchy employing non-volatile and network-attached memory devices. Based on this hardware infrastructure, an I/O and resiliency software stack has been implemented combining and extending well established libraries and software tools, and sticking to standard user-interfaces. Real-world scientific codes have tested the projects' developments and demonstrated the improvements achieved without compromising the portability of the applications.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset