Crossing the Architectural Barrier: Evaluating Representative Regions of Parallel HPC Applications

03/20/2018
by   Alexandra Ferreron, et al.
0

Exascale computing will get mankind closer to solving important social, scientific and engineering problems. Due to high prototyping costs, High Performance Computing (HPC) system architects make use of simulation models for design space exploration and hardware-software co-design. However, as HPC systems reach exascale proportions, the cost of simulation increases, since simulators themselves are largely single-threaded. Tools for selecting representative parts of parallel applications to reduce running costs are widespread, e.g., BarrierPoint achieves this by analysing, in simulation, abstract characteristics such as basic blocks and reuse distances. However, architectures new to HPC have a limited set of tools available. In this work, we provide an independent cross-architectural evaluation on real hardware - across Intel and ARM - of the BarrierPoint methodology, when applied to parallel HPC proxy applications. We present both cases: when the methodology can be applied and when it cannot. In the former case, results show that we can predict the performance of full application execution by running shorter representative sections. In the latter case, we dive into the underlying issues and suggest improvements. We demonstrate a total simulation time reduction of up to 178x, whilst keeping the error below 2.3 cycles and instructions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/05/2020

Simulation-Based Performance Prediction of HPC Applications: A Case Study of HPL

We propose a simulation-based approach for performance modeling of paral...
research
11/14/2018

Applying the swept rule for explicit partial differential equation solutions on heterogeneous computing systems

Applications that exploit the architectural details of high performance ...
research
03/15/2022

Towards an HPC Complementary Computing Facility

This Letter considers the design for computing facilities that are compl...
research
10/23/2018

LincoSim: a web based HPC-cloud platform for automatic virtual towing tank analysis

In this work, we present a new web based HPC-cloud platform for automati...
research
02/15/2019

Studying the Impact of Power Capping on MapReduce-based, Data-intensive Mini-applications on Intel KNL and KNM Architectures

In this poster, we quantitatively measure the impacts of data movement o...
research
06/19/2019

Collecting and Presenting Reproducible Intranode Stencil Performance: INSPECT

Stencil algorithms have been receiving considerable interest in HPC rese...
research
11/14/2018

Applying the swept rule for solving explicit partial differential equations on heterogeneous computing systems

Applications that exploit the architectural details of high-performance ...

Please sign up or login with your details

Forgot password? Click here to reset