Learning in the Frequency Domain

02/27/2020
by   Kai Xu, et al.
9

Deep neural networks have achieved remarkable success in computer vision tasks. Existing neural networks mainly operate in the spatial domain with fixed input sizes. For practical applications, images are usually large and have to be downsampled to the predetermined input size of neural networks. Even though the downsampling operations reduce computation and the required communication bandwidth, it removes both redundant and salient information obliviously, which results in accuracy degradation. Inspired by digital signal processing theories, we analyze the spectral bias from the frequency perspective and propose a learning-based frequency selection method to identify the trivial frequency components which can be removed without accuracy loss. The proposed method of learning in the frequency domain leverages identical structures of the well-known neural networks, such as ResNet-50, MobileNetV2, and Mask R-CNN, while accepting the frequency-domain information as the input. Experiment results show that learning in the frequency domain with static channel selection can achieve higher accuracy than the conventional spatial downsampling approach and meanwhile further reduce the input data size. Specifically for ImageNet classification with the same input size, the proposed method achieves 1.41 MobileNetV2, respectively. Even with half input size, the proposed method still improves the top-1 accuracy on ResNet-50 by 1 average precision improvement on Mask R-CNN for instance segmentation on the COCO dataset.

READ FULL TEXT

page 2

page 3

page 6

page 8

page 12

research
02/16/2023

Frequency-domain Learning for Volumetric-based 3D Data Perception

Frequency-domain learning draws attention due to its superior tradeoff b...
research
03/09/2021

MWQ: Multiscale Wavelet Quantized Neural Networks

Model quantization can reduce the model size and computational latency, ...
research
07/16/2018

Backward Reduction of CNN Models with Information Flow Analysis

This paper proposes backward reduction, an algorithm that explores the c...
research
04/17/2023

Frequency Regularization: Restricting Information Redundancy of Convolutional Neural Networks

Convolutional neural networks have demonstrated impressive results in ma...
research
12/22/2020

FcaNet: Frequency Channel Attention Networks

Attention mechanism, especially channel attention, has gained great succ...
research
02/18/2022

Joint Learning of Frequency and Spatial Domains for Dense Predictions

Current artificial neural networks mainly conduct the learning process i...
research
03/29/2019

Using Structured Input and Modularity for Improved Learning

We describe a method for utilizing the known structure of input data to ...

Please sign up or login with your details

Forgot password? Click here to reset