Wilson matrix kernel for lattice QCD on A64FX architecture

03/15/2023
by   Issaku Kanamori, et al.
0

We study the implementation of the even-odd Wilson fermion matrix for lattice QCD simulations on the A64FX architecture. Efficient coding of the stencil operation is investigated for two-dimensional packing to SIMD vectors. We measure the sustained performance on the supercomputer Fugaku at RIKEN R-CCS and show the profiler result of our code, which may signal an unexpected source of slow-down in addition to the detailed efficiency of each part of the code.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset