Neural Network Quantization for Efficient Inference: A Survey

12/08/2021
by   Olivia Weng, et al.
0

As neural networks have become more powerful, there has been a rising desire to deploy them in the real world; however, the power and accuracy of neural networks is largely due to their depth and complexity, making them difficult to deploy, especially in resource-constrained devices. Neural network quantization has recently arisen to meet this demand of reducing the size and complexity of neural networks by reducing the precision of a network. With smaller and simpler networks, it becomes possible to run neural networks within the constraints of their target hardware. This paper surveys the many neural network quantization techniques that have been developed in the last decade. Based on this survey and comparison of neural network quantization techniques, we propose future directions of research in the area.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/24/2022

Hyperspherical Quantization: Toward Smaller and More Accurate Models

Model quantization enables the deployment of deep neural networks under ...
research
04/06/2021

Binary Neural Network for Speaker Verification

Although deep neural networks are successful for many tasks in the speec...
research
07/09/2022

CEG4N: Counter-Example Guided Neural Network Quantization Refinement

Neural networks are essential components of learning-based software syst...
research
03/31/2020

Binary Neural Networks: A Survey

The binary neural network, largely saving the storage and computation, s...
research
08/18/2021

Verifying Low-dimensional Input Neural Networks via Input Quantization

Deep neural networks are an attractive tool for compressing the control ...
research
08/29/2019

Smaller Models, Better Generalization

Reducing network complexity has been a major research focus in recent ye...
research
02/12/2021

Confounding Tradeoffs for Neural Network Quantization

Many neural network quantization techniques have been developed to decre...

Please sign up or login with your details

Forgot password? Click here to reset