Efficient AutoML Pipeline Search with Matrix and Tensor Factorization

06/07/2020
by   Chengrun Yang, et al.

Data scientists seeking a good supervised learning model on a new dataset have many choices to make: they must preprocess the data, select features, possibly reduce the dimension, select an estimation algorithm, and choose hyperparameters for each of these pipeline components. With new pipeline components comes a combinatorial explosion in the number of choices! In this work, we design a new AutoML system to address this challenge: an automated system to design a supervised learning pipeline. Our system uses matrix and tensor factorization as surrogate models to model the combinatorial pipeline search space. Under these models, we develop greedy experiment design protocols to efficiently gather information about a new dataset. Experiments on large corpora of real-world classification problems demonstrate the effectiveness of our approach.
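To make the surrogate-model idea concrete, here is a minimal sketch of the matrix-factorization case only. It is not the authors' implementation. It assumes a hypothetical error matrix with one row per previously seen dataset and one column per pipeline, takes that matrix to be approximately low rank, and picks the pipelines to evaluate on the new dataset by hand rather than with the greedy experiment design the abstract mentions. The function names and the synthetic data are illustrative assumptions.

```python
import numpy as np

def fit_pipeline_factors(error_matrix, rank):
    """Truncated SVD so that error_matrix (datasets x pipelines) ~= X @ Y."""
    U, s, Vt = np.linalg.svd(error_matrix, full_matrices=False)
    X = U[:, :rank] * s[:rank]   # dataset embeddings, shape (n_datasets, rank)
    Y = Vt[:rank, :]             # pipeline embeddings, shape (rank, n_pipelines)
    return X, Y

def predict_new_dataset(Y, observed_idx, observed_errors):
    """Fit the new dataset's embedding by least squares on the few pipelines
    actually evaluated, then predict the error of every pipeline."""
    x, *_ = np.linalg.lstsq(Y[:, observed_idx].T, observed_errors, rcond=None)
    return x @ Y

# Synthetic, exactly rank-2 data (an assumption made purely for illustration).
rng = np.random.default_rng(0)
true_Y = rng.random((2, 8))                # 8 hypothetical pipelines
E = rng.random((20, 2)) @ true_Y           # 20 previously seen datasets
_, Y = fit_pipeline_factors(E, rank=2)

new_row = rng.random(2) @ true_Y           # true errors on a new dataset
observed_idx = [0, 3, 5]                   # pipelines we pretend to have run
pred = predict_new_dataset(Y, observed_idx, new_row[observed_idx])
best_pipeline = int(np.argmin(pred))       # candidate to evaluate next
```

Because the synthetic matrix is exactly rank 2, three observed pipelines are enough to recover the remaining entries. On real data the low-rank fit is only approximate, so the choice of which pipelines to evaluate matters much more.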

research
06/16/2022

Zero-Shot AutoML with Pretrained Models

Given a new dataset D and a low compute budget, how should we choose a p...
research
02/18/2022

SapientML: Synthesizing Machine Learning Pipelines by Learning from Human-Written Solutions

Automatic machine learning, or AutoML, holds the promise of truly democr...
research
07/12/2023

Efficient and Joint Hyperparameter and Architecture Search for Collaborative Filtering

Automated Machine Learning (AutoML) techniques have recently been introd...
research
01/29/2015

Tensor Factorization via Matrix Factorization

Tensor factorization arises in many machine learning applications, such ...
research
05/01/2021

Exploring Opportunistic Meta-knowledge to Reduce Search Spaces for Automated Machine Learning

Machine learning (ML) pipeline composition and optimisation have been st...
research
05/18/2022

Stochastic uncertainty analysis of gravity gradient tensor components and their combinations

Full tensor gravity (FTG) devices provide up to five independent compone...
research
11/25/2022

Underground Diagnosis Based on GPR and Learning in the Model Space

Ground Penetrating Radar (GPR) has been widely used in pipeline detectio...
