ForestPrune: Compact Depth-Controlled Tree Ensembles

05/31/2022
by   Brian Liu, et al.
0

Tree ensembles are versatile supervised learning algorithms that achieve state-of-the-art performance. These models are extremely powerful but can grow to enormous sizes. As a result, tree ensembles are often post-processed to reduce memory footprint and improve interpretability. In this paper, we present ForestPrune, a novel optimization framework that can post-process tree ensembles by pruning depth layers from individual trees. We also develop a new block coordinate descent method to efficiently obtain high-quality solutions to optimization problems under this framework. The number of nodes in a decision tree increases exponentially with tree depth, so pruning deep trees can drastically improve model parsimony. ForestPrune can substantially reduce the space complexity of an ensemble for a minimal cost to performance. The framework supports various weighting schemes and contains just a single hyperparameter to tune. In our experiments, we observe that ForestPrune can reduce model size 20-fold with negligible performance loss.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/24/2020

Born-Again Tree Ensembles

The use of machine learning algorithms in finance, medicine, and crimina...
research
06/17/2016

Making Tree Ensembles Interpretable

Tree ensembles, such as random forest and boosted trees, are renowned fo...
research
01/24/2023

A Robust Hypothesis Test for Tree Ensemble Pruning

Gradient boosted decision trees are some of the most popular algorithms ...
research
05/06/2022

Green Accelerated Hoeffding Tree

State-of-the-art machine learning solutions mainly focus on creating hig...
research
05/25/2022

A Neural Tangent Kernel Formula for Ensembles of Soft Trees with Arbitrary Architectures

A soft tree is an actively studied variant of a decision tree that updat...
research
01/05/2016

Optimally Pruning Decision Tree Ensembles With Feature Cost

We consider the problem of learning decision rules for prediction with f...
research
05/19/2022

Flexible Modeling and Multitask Learning using Differentiable Tree Ensembles

Decision tree ensembles are widely used and competitive learning models....

Please sign up or login with your details

Forgot password? Click here to reset