Asynch-SGBDT: Asynchronous Parallel Stochastic Gradient Boosting Decision Tree based on Parameters Server

04/12/2018
by Cheng Daning, et al.

Gradient Boosting Decision Tree (GBDT) has become one of the most important machine learning algorithms. However, training a GBDT model demands substantial computational resources and time, even with fork-join parallelism and sampling techniques. To accelerate GBDT training, this paper proposes an asynchronous parallel stochastic gradient boosting decision tree, abbreviated asynch-SGBDT. By changing the view of sampling, we recast the numerical optimization problem underlying traditional GBDT training as a stochastic optimization problem and apply asynchronous parallel stochastic gradient descent to accelerate training. Asynch-SGBDT is also highly compatible with the Parameters Server architecture. In addition, we provide a theoretical analysis of asynch-SGBDT. Experimental results show that asynch-SGBDT accelerates the GBDT training process, and our asynchronous parallel strategy achieves near-linear speedup, especially on high-dimensional sparse datasets.
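To make the idea concrete, here is a minimal sketch of the asynchronous, sampling-based boosting loop the abstract describes. It is not the authors' implementation: it assumes squared loss (so the negative gradient is the residual), models the parameters server as a lock-protected shared list of trees, uses scikit-learn's DecisionTreeRegressor as the weak learner, and uses Python threads to stand in for asynchronous workers. Each worker pulls a (possibly stale) snapshot of the ensemble, fits a tree to the negative gradient on a random sample, and pushes the tree back without waiting for the other workers.

```python
# Hypothetical sketch of the asynch-SGBDT worker loop (not the paper's code).
import threading
import numpy as np
from sklearn.tree import DecisionTreeRegressor

# Synthetic regression data; assumed here purely for illustration.
data_rng = np.random.default_rng(0)
X = data_rng.normal(size=(2000, 20))
y = X[:, 0] - 2.0 * X[:, 1] + data_rng.normal(scale=0.1, size=2000)

ensemble = []                 # shared "parameters server" state: the boosted trees
lock = threading.Lock()
LEARNING_RATE = 0.1
SAMPLE_SIZE = 256
ROUNDS_PER_WORKER = 50

def predict(trees, X_):
    # Ensemble prediction; an empty ensemble predicts zero.
    if not trees:
        return np.zeros(len(X_))
    return LEARNING_RATE * sum(t.predict(X_) for t in trees)

def worker(seed):
    rng = np.random.default_rng(seed)   # per-worker RNG for thread safety
    for _ in range(ROUNDS_PER_WORKER):
        with lock:                      # "pull": snapshot of the shared ensemble
            snapshot = list(ensemble)
        idx = rng.choice(len(X), size=SAMPLE_SIZE, replace=False)
        # Squared loss: the negative gradient is the residual on the sample.
        residual = y[idx] - predict(snapshot, X[idx])
        tree = DecisionTreeRegressor(max_depth=3).fit(X[idx], residual)
        with lock:                      # "push": may be based on a stale snapshot
            ensemble.append(tree)

threads = [threading.Thread(target=worker, args=(s,)) for s in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(f"{len(ensemble)} trees, training MSE = "
      f"{np.mean((y - predict(ensemble, X)) ** 2):.4f}")
```

The key point the sketch illustrates is that workers may fit trees against stale snapshots of the ensemble; the paper's theoretical analysis concerns when such staleness still permits convergence, and its claimed near-linear speedup comes from real distributed workers rather than the GIL-bound threads used here for simplicity.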

