On Efficient Training of Large-Scale Deep Learning Models: A Literature Review

04/07/2023
by   Li Shen, et al.
0

The field of deep learning has witnessed significant progress, particularly in computer vision (CV), natural language processing (NLP), and speech. The use of large-scale models trained on vast amounts of data holds immense promise for practical applications, enhancing industrial productivity and facilitating social development. With the increasing demands on computational capacity, though numerous studies have explored the efficient training, a comprehensive summarization on acceleration techniques of training deep learning models is still much anticipated. In this survey, we present a detailed review for training acceleration. We consider the fundamental update formulation and split its basic components into five main perspectives: (1) data-centric: including dataset regularization, data sampling, and data-centric curriculum learning techniques, which can significantly reduce the computational complexity of the data samples; (2) model-centric, including acceleration of basic modules, compression training, model initialization and model-centric curriculum learning techniques, which focus on accelerating the training via reducing the calculations on parameters; (3) optimization-centric, including the selection of learning rate, the employment of large batchsize, the designs of efficient objectives, and model average techniques, which pay attention to the training policy and improving the generality for the large-scale models; (4) budgeted training, including some distinctive acceleration methods on source-constrained situations; (5) system-centric, including some efficient open-source distributed libraries/systems which provide adequate hardware support for the implementation of acceleration algorithms. By presenting this comprehensive taxonomy, our survey presents a comprehensive review to understand the general mechanisms within each component and their joint interaction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/07/2023

Efficient XAI Techniques: A Taxonomic Survey

Recently, there has been a growing demand for the deployment of Explaina...
research
12/02/2022

CLIP: Train Faster with Less Data

Deep learning models require an enormous amount of data for training. Ho...
research
08/04/2022

Vision-Centric BEV Perception: A Survey

Vision-centric BEV perception has recently received increased attention ...
research
10/25/2020

A Comprehensive Survey on Curriculum Learning

Curriculum learning (CL) is a training strategy that trains a machine le...
research
06/16/2021

Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better

Deep Learning has revolutionized the fields of computer vision, natural ...
research
12/30/2021

Resource-Efficient Deep Learning: A Survey on Model-, Arithmetic-, and Implementation-Level Techniques

Deep learning is pervasive in our daily life, including self-driving car...
research
05/03/2022

A Comprehensive Survey of Image Augmentation Techniques for Deep Learning

Deep learning has been achieving decent performance in computer vision r...

Please sign up or login with your details

Forgot password? Click here to reset