Large Language Models (LLMs) have demonstrated impressive performance in...
Pre-trained language models have achieved promising success in code retr...
Question Answering (QA) has been a long-standing research topic in AI an...
The explosive growth in video streaming requires video understanding at ...
In multi-modal reasoning tasks, such as visual question answering (VQA),...
Model quantization is a widely used technique to compress and accelerate...
We present APQ for efficient deep learning inference on resource-constra...
Automatic transistor sizing is a challenging problem in circuit design d...
Efficient deep learning computing requires algorithm and hardware co-des...
Model quantization is a widely used technique to compress and accelerate...