Minimum Viable Model Estimates for Machine Learning Projects

01/02/2021
by   John Hawkins, et al.
61

Prioritization of machine learning projects requires estimates of both the potential ROI of the business case and the technical difficulty of building a model with the required characteristics. In this work we present a technique for estimating the minimum required performance characteristics of a predictive model given a set of information about how it will be used. This technique will result in robust, objective comparisons between potential projects. The resulting estimates will allow data scientists and managers to evaluate whether a proposed machine learning project is likely to succeed before any modelling needs to be done. The technique has been implemented into the open source application MinViME (Minimum Viable Model Estimator) which can be installed via the PyPI python package management system, or downloaded directly from the GitHub repository. Available at https://github.com/john-hawkins/MinViME

READ FULL TEXT

page 8

page 9

research
10/04/2021

PyTorrent: A Python Library Corpus for Large-scale Language Models

A large scale collection of both semantic and natural language resources...
research
07/16/2021

LeanML: A Design Pattern To Slash Avoidable Wastes in Machine Learning Projects

We introduce the first application of the lean methodology to machine le...
research
11/03/2020

Brain Predictability toolbox: a Python library for neuroimaging based machine learning

Summary Brain Predictability toolbox (BPt) represents a unified framewor...
research
03/08/2021

Sampling Projects in GitHub for MSR Studies

Almost every Mining Software Repositories (MSR) study requires, as first...
research
12/26/2022

Studying the Characteristics of AIOps Projects on GitHub

Artificial Intelligence for IT Operations (AIOps) leverages AI approache...
research
08/18/2023

Image Processing and Machine Learning for Hyperspectral Unmixing: An Overview and the HySUPP Python Package

Spectral pixels are often a mixture of the pure spectra of the materials...
research
06/19/2023

Human Limits in Machine Learning: Prediction of Plant Phenotypes Using Soil Microbiome Data

The preservation of soil health has been identified as one of the main c...

Please sign up or login with your details

Forgot password? Click here to reset