Optimizing Convolutional Neural Network Architecture via Information Field

09/11/2020
by   Yuke Wang, et al.
0

CNN architecture design has attracted tremendous attention of improving model accuracy or reducing model complexity. However, existing works either introduce repeated training overhead in the search process or lack an interpretable metric to guide the design. To clear the hurdles, we propose Information Field (IF), an explainable and easy-to-compute metric, to estimate the quality of a CNN architecture and guide the search process of designs. To validate the effectiveness of IF, we build a static optimizer to improve the CNN architectures at both the stage level and the kernel level. Our optimizer not only provides a clear and reproducible procedure but also mitigates unnecessary training efforts in the architecture search process. Experiments show that the models generated by our optimizer can achieve up to 5.47 and up to 65.38 structures like MobileNet and ResNet.

READ FULL TEXT

page 4

page 5

research
11/16/2019

S2DNAS: Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search

Recently, dynamic inference has emerged as a promising way to reduce the...
research
08/30/2022

Cardinal Optimizer (COPT) User Guide

Cardinal Optimizer is a high-performance mathematical programming solver...
research
11/26/2018

GP-CNAS: Convolutional Neural Network Architecture Search with Genetic Programming

Convolutional neural networks (CNNs) are effective at solving difficult ...
research
09/19/2020

ENAS4D: Efficient Multi-stage CNN Architecture Search for Dynamic Inference

Dynamic inference is a feasible way to reduce the computational cost of ...
research
05/07/2020

AutoSpeech: Neural Architecture Search for Speaker Recognition

Speaker recognition systems based on Convolutional Neural Networks (CNNs...
research
07/30/2018

ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design

Currently, the neural network architecture design is mostly guided by th...
research
08/03/2023

Deep Maxout Network-based Feature Fusion and Political Tangent Search Optimizer enabled Transfer Learning for Thalassemia Detection

Thalassemia is a heritable blood disorder which is the outcome of a gene...

Please sign up or login with your details

Forgot password? Click here to reset