Despite decades of efforts to resolve, memory safety violations are stil...
Deep Learning (DL) acceleration support in CPUs has recently gained a lo...
GPUs offer massive compute parallelism and high-bandwidth memory accesse...
CUDA is one of the most popular choices for GPU programming, but it can ...
As CUDA programs become the de facto program among data parallel applica...
The importance of open-source hardware and software has been increasing....
As AI-based applications become pervasive, CPU vendors are starting to
i...
With the rapid development of scientific computation, more and more
rese...
Intelligent mobile robots are critical in several scenarios. However, as...
The increasing interest in serverless computation and ubiquitous wireles...
To efficiently process visual data at scale, researchers have proposed t...
Sparse matrices are the key ingredients of several application domains, ...
Intelligent transportation systems (ITS) are expected to effectively cre...
Satisfying the high computation demand of modern deep learning architect...
The rise of deep neural networks (DNNs) is inspiring new studies in myri...
The current challenges in technology scaling are pushing the semiconduct...
The sequence-to-sequence (seq2seq) model for neural machine translation ...
With recent advancements in deep neural networks (DNNs), we are able to ...
The prevalence of Internet of things (IoT) devices and abundance of sens...
Fence instructions are fundamental primitives that ensure consistency in...
Recent studies have demonstrated that near-data processing (NDP) is an
e...
Memories that exploit three-dimensional (3D)-stacking technology, which
...
Three-dimensional (3D)-stacking technology, which enables the integratio...