The Untapped Potential of Off-the-Shelf Convolutional Neural Networks

03/17/2021
by Matthew Inkawhich, et al.

In recent years, a myriad of novel convolutional network architectures have been developed to advance state-of-the-art performance on challenging recognition tasks. As computational resources improve, a great deal of effort has gone into efficiently scaling up existing designs and generating new architectures with Neural Architecture Search (NAS) algorithms. While network topology has proven to be a critical factor for model performance, we show that significant gains are being left on the table by keeping topology static at inference time. Due to challenges such as scale variation, we should not expect static models configured to perform well across a training dataset to be optimally configured to handle all test data. In this work, we seek to expose the exciting potential of inference-time-dynamic models. By allowing just four layers to dynamically change configuration at inference time, we show that existing off-the-shelf models like ResNet-50 are capable of over 95% top-1 accuracy on ImageNet. This level of performance currently exceeds that of models with over 20x more parameters and significantly more complex training procedures.
