MLDev: Data Science Experiment Automation and Reproducibility Software

07/26/2021
by   Anton Khritankov, et al.
0

In this paper we explore the challenges of automating experiments in data science. We propose an extensible experiment model as a foundation for integration of different open source tools for running research experiments. We implement our approach in a prototype open source MLDev software package and evaluate it in a series of experiments yielding promising results. Comparison with other state-of-the-art tools signifies novelty of our approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/24/2022

A Survey of Open Source Automation Tools for Data Science Predictions

We present an expository overview of technical and cultural challenges t...
research
08/12/2020

The Right Tools for the Job: The Case for Spatial Science Tool-Building

This paper was presented as the 8th annual Transactions in GIS plenary a...
research
08/03/2018

DataDeps.jl: Repeatable Data Setup for Replicable Data Science

We present DataDeps.jl: a julia package for the reproducible handling of...
research
02/05/2019

PUTWorkbench: Analysing Privacy in AI-intensive Systems

AI intensive systems that operate upon user data face the challenge of b...
research
11/08/2022

Caching and Reproducibility: Making Data Science experiments faster and FAIRer

Small to medium-scale data science experiments often rely on research so...
research
03/05/2017

Doing Things Twice: Strategies to Identify Studies for Targeted Validation

The "reproducibility crisis" has been a highly visible source of scienti...
research
02/26/2021

GraphSense: A General-Purpose Cryptoasset Analytics Platform

There is currently an increasing demand for cryptoasset analysis tools a...

Please sign up or login with your details

Forgot password? Click here to reset