Challenges and Pitfalls of Reproducing Machine Learning Artifacts

04/29/2019
by   Cheng Li, et al.
6

An increasingly complex and diverse collection of Machine Learning(ML) models as well as hardware/software stacks, collectively referred to as "ML artifacts", are being proposed - leading to a diverse landscape of ML. These ML innovations proposed have outpaced researchers' ability to analyze, study and adapt them. This is exacerbated by the complicated and sometimes non-reproducible procedures for ML evaluation. The current practice of sharing ML artifacts is through repositories where artifact authors post ad-hoc code and some documentation. The authors often fail to reveal critical information for others to reproduce their results. One often fails to reproduce artifact authors' claims, not to mention adapt the model to his/her own use. This article discusses the common challenges and pitfalls of reproducing ML artifacts, which can be used as a guideline for ML researchers when sharing or reproducing artifacts.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

research
04/29/2019

Challenges and Pitfalls of Machine Learning Evaluation and Benchmarking

An increasingly complex and diverse collection of Machine Learning (ML) ...
research
05/14/2021

RC2020 Report: Learning De-biased Representations with Biased Representations

As part of the ML Reproducibility Challenge 2020, we investigated the IC...
research
03/31/2019

SysML'19 demo: customizable and reusable Collective Knowledge pipelines to automate and reproduce machine learning experiments

Reproducing, comparing and reusing results from machine learning and sys...
research
06/10/2022

Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning Models

We study how networking corruptions–data corruptions caused by networkin...
research
01/12/2018

SwarmRob: A Toolkit for Reproducibility and Sharing of Experimental Artifacts in Robotics Research

Due to the complexity of robotics, the reproducibility of results and ex...
research
06/12/2020

The Collective Knowledge project: making ML models more portable and reproducible with open APIs, reusable best practices and MLOps

This article provides an overview of the Collective Knowledge technology...
research
06/06/2019

Gradio: Hassle-Free Sharing and Testing of ML Models in the Wild

Accessibility is a major challenge of machine learning (ML). Typical ML ...

Please sign up or login with your details

Forgot password? Click here to reset