Performance of an Astrophysical Radiation Hydrodynamics Code under Scalable Vector Extension Optimization

07/27/2022
by   Dennis C. Smolarski, et al.
0

We present results of a performance study of an astrophysical radiation hydrodynamics code, V2D, on the Arm-based A64FX processor developed by Fujitsu. The code solves sparse linear systems, a task for which the A64FX architecture should be well suited. We performed the performance analysis study on Ookami, an Apollo 80 platform utilizing the A64FX processor. We explored several compilers and performance analysis packages and found the code did not perform as expected under scalable vector extension optimization, suggesting that a "deeper dive" into analyzing the code is worthwhile. However, a simple driver program that exercised basic sparse linear algebra routines used by V2D did show significant speedup with the use of the scalable vector extension optimization. We present the initial results from the study which used V2D on a relatively simple test problem that emphasized the repeated solution of sparse linear systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2021

A Case Study of LLVM-Based Analysis for Optimizing SIMD Code Generation

This paper presents a methodology for using LLVM-based tools to tune the...
research
10/10/2018

Performance analysis and optimization of the JOREK code for many-core CPUs

This report investigates the performance of the JOREK code on the Intel ...
research
06/02/2019

Ara: A 1 GHz+ Scalable and Energy-Efficient RISC-V Vector Processor with Multi-Precision Floating Point Support in 22 nm FD-SOI

In this paper, we present Ara, a 64-bit vector processor based on the ve...
research
03/29/2013

A problem dependent analysis of SOCP algorithms in noisy compressed sensing

Under-determined systems of linear equations with sparse solutions have ...
research
03/16/2018

The ARM Scalable Vector Extension

This article describes the ARM Scalable Vector Extension (SVE). Several ...
research
01/22/2019

SVE-enabling Lattice QCD Codes

Optimization of applications for supercomputers of the highest performan...
research
04/10/2019

Performance Analysis of Linear Algebraic Functions using Reconfigurable Computing

This paper introduces a new mapping of geometrical transformation on the...

Please sign up or login with your details

Forgot password? Click here to reset