DMOps: Data Management Operation and Recipes

01/02/2023
by   Eujeong Choi, et al.
0

Data-centric AI has shed light on the significance of data within the machine learning (ML) pipeline. Recognizing its significance, academia, industry, and government departments have suggested various NLP data research initiatives. While the ability to utilize existing data is essential, the ability to build a dataset has become more critical than ever, especially in the industry. In consideration of this trend, we propose a "Data Management Operations and Recipes" to guide the industry in optimizing the building of datasets for NLP products. This paper presents the concept of DMOps which is derived from real-world experiences with NLP data management and aims to streamline data operations by offering a baseline.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/21/2023

Who should I Collaborate with? A Comparative Study of Academia and Industry Research Collaboration in NLP

The goal of our research was to investigate the effects of collaboration...
research
07/15/2023

Visual Analytics For Machine Learning: A Data Perspective Survey

The past decade has witnessed a plethora of works that leverage the powe...
research
11/09/2022

DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems

While there have been a number of remarkable breakthroughs in machine le...
research
01/29/2021

Facilitating Knowledge Sharing from Domain Experts to Data Scientists for Building NLP Models

Data scientists face a steep learning curve in understanding a new domai...
research
12/30/2021

Chatbot for fitness management using IBM Watson

Chatbots have revolutionized the way humans interact with computer syste...
research
07/22/2023

Exploring MLOps Dynamics: An Experimental Analysis in a Real-World Machine Learning Project

This article presents an experiment focused on optimizing the MLOps (Mac...
research
11/16/2021

DataCLUE: A Benchmark Suite for Data-centric NLP

Data-centric AI has recently proven to be more effective and high-perfor...

Please sign up or login with your details

Forgot password? Click here to reset