Demand for low-latency and high-bandwidth data transfer between GPUs has...
We present a new strategy for automatically exploring the design space o...
MPI derived datatypes are an abstraction that simplifies handling of
non...
This paper presents GPU performance optimization and scaling results for...
This report presents the design of the Scope infrastructure for extensib...