LIRS: Enabling efficient machine learning on NVM-based storage via a lightweight implementation of random shuffling

10/10/2018
by Zhi-Lin Ke, et al.

Machine learning algorithms, such as Support Vector Machine (SVM) and Deep Neural Network (DNN), have gained a lot of interest recently. When training a machine learning algorithm, randomly shuffling all the training data can improve the testing accuracy and boost the convergence rate. Nevertheless, realizing random shuffling of training data in a real system is not straightforward, because random accesses on a hard disk drive (HDD) are slow. To avoid frequent random disk accesses, existing approaches often limit the effect of random shuffling. With emerging non-volatile memory-based storage devices, such as Intel Optane SSD, which provide fast random accesses, we propose a lightweight implementation of random shuffling (LIRS) that randomly shuffles the indexes of the entire training dataset; the selected training instances are then accessed directly from storage and packed into batches. Experimental results show that LIRS can reduce the total training time of SVM and DNN by 49.9 on DNN by 1.01
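
The idea described in the abstract (shuffle the indexes, then fetch each selected instance directly from storage and pack it into a batch) can be illustrated with a minimal sketch. This is not the authors' implementation: the fixed-size record layout, the `RECORD_FORMAT` constant, and the helper names `write_dataset` and `shuffled_batches` are all hypothetical, chosen only so that an instance's byte offset can be computed from its index.

```python
import os
import random
import struct

# Assumption: every training instance is a fixed-size record of four
# little-endian float32 values. The paper does not specify this layout.
RECORD_FORMAT = "<4f"
RECORD_SIZE = struct.calcsize(RECORD_FORMAT)


def write_dataset(path, instances):
    """Write instances sequentially as fixed-size binary records (hypothetical helper)."""
    with open(path, "wb") as f:
        for instance in instances:
            f.write(struct.pack(RECORD_FORMAT, *instance))


def shuffled_batches(path, batch_size, seed=None):
    """Yield batches in random order by shuffling indexes, not the data itself."""
    num_records = os.path.getsize(path) // RECORD_SIZE

    # Shuffle only the index list; the records on storage are never moved.
    indexes = list(range(num_records))
    random.Random(seed).shuffle(indexes)

    with open(path, "rb") as f:
        for start in range(0, num_records, batch_size):
            batch = []
            for idx in indexes[start:start + batch_size]:
                # One random access per instance; this is the step that is
                # cheap on fast random-access storage and slow on an HDD.
                f.seek(idx * RECORD_SIZE)
                batch.append(struct.unpack(RECORD_FORMAT, f.read(RECORD_SIZE)))
            yield batch
```

Calling `shuffled_batches(path, batch_size)` once per epoch, with a different seed each time, would give a fresh permutation of the whole dataset while keeping only the index list in memory. The last batch may be smaller than `batch_size` when the dataset size is not a multiple of it.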


