Web applications are increasingly becoming the primary platform for AI
s...
Large language models (LLMs) based on transformers have made significant...
Deploying pre-trained transformer models like BERT on downstream tasks i...
Running out of GPU memory has become a main bottleneck for large-scale D...
Automatic Speech Recognition (ASR) has seen remarkable advancements with...
Efficient deployment of large language models (LLMs) necessitates low-bi...
Neural Architecture Search (NAS) has shown promising performance in the
...
Stencil computation is one of the most important kernels in various
scie...
The combination of Neural Architecture Search (NAS) and quantization has...
DNN inference requires huge effort of system development and resource co...
Ad relevance modeling plays a critical role in online advertising system...
Edge computing is being widely used for video analytics. To alleviate th...
DNNs are ubiquitous on edge devices nowadays. With its increasing import...
Medical applications have benefited from the rapid advancement in comput...
The CSSI 2019 workshop was held on October 28-29, 2019, in Austin, Texas...