Adaptive Histogram-Based Gradient Boosted Trees for Federated Learning

12/11/2020
by   Yuya Jeremy Ong, et al.
9

Federated Learning (FL) is an approach to collaboratively train a model across multiple parties without sharing data between parties or an aggregator. It is used both in the consumer domain to protect personal data as well as in enterprise settings, where dealing with data domicile regulation and the pragmatics of data silos are the main drivers. While gradient boosted tree implementations such as XGBoost have been very successful for many use cases, its federated learning adaptations tend to be very slow due to using cryptographic and privacy methods and have not experienced widespread use. We propose the Party-Adaptive XGBoost (PAX) for federated learning, a novel implementation of gradient boosting which utilizes a party adaptive histogram aggregation method, without the need for data encryption. It constructs a surrogate representation of the data distribution for finding splits of the decision tree. Our experimental results demonstrate strong model performance, especially on non-IID distributions, and significantly faster training run-time across different data sets than existing federated implementations. This approach makes the use of gradient boosted trees practical in enterprise federated learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/11/2019

Practical Federated Gradient Boosting Decision Trees

Gradient Boosting Decision Trees (GBDTs) have become very successful in ...
research
12/15/2021

FLoRA: Single-shot Hyper-parameter Optimization for Federated Learning

We address the relatively unexplored problem of hyper-parameter optimiza...
research
11/12/2021

Flatee: Federated Learning Across Trusted Execution Environments

Federated learning allows us to distributively train a machine learning ...
research
05/20/2021

Fed-EINI: An Efficient and Interpretable Inference Framework for Decision Tree Ensembles in Federated Learning

The increasing concerns about data privacy and security drives the emerg...
research
01/15/2021

Probabilistic Inference for Learning from Untrusted Sources

Federated learning brings potential benefits of faster learning, better ...
research
11/10/2020

Mitigating Leakage in Federated Learning with Trusted Hardware

In federated learning, multiple parties collaborate in order to train a ...
research
04/15/2023

Gradient-less Federated Gradient Boosting Trees with Learnable Learning Rates

The privacy-sensitive nature of decentralized datasets and the robustnes...

Please sign up or login with your details

Forgot password? Click here to reset