Improved Projection Learning for Lower Dimensional Feature Maps

10/27/2022
by   Ilan Price, et al.
0

The requirement to repeatedly move large feature maps off- and on-chip during inference with convolutional neural networks (CNNs) imposes high costs in terms of both energy and time. In this work we explore an improved method for compressing all feature maps of pre-trained CNNs to below a specified limit. This is done by means of learned projections trained via end-to-end finetuning, which can then be folded and fused into the pre-trained network. We also introduce a new `ceiling compression' framework in which evaluate such techniques in view of the future goal of performing inference fully on-chip.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset