This work proposes a Momentum-Enabled Kronecker-Factor-Based Optimizer U...
Machine Learning (ML) models contain highly-parallel computations, such ...
Sparse computations frequently appear in scientific simulations and the
...
Sparse fusion is a compile-time loop transformation and runtime scheduli...
This work proposes a distributed algorithm for solving empirical risk
mi...
This work proposes a time-efficient Natural Gradient Descent method, cal...
ASYNC is a framework that supports the implementation of asynchronous ma...
We present MatRox, a novel model-based algorithm and implementation of
H...
Analyzing array-based computations to determine data dependences is usef...
The fast iterative soft thresholding algorithm (FISTA) is used to solve
...
Sparse tensors appear in many large-scale applications with multidimensi...
Sympiler is a domain-specific code generator that optimizes sparse matri...