Visual Analytics For Machine Learning: A Data Perspective Survey

07/15/2023
by   Junpeng Wang, et al.

The past decade has seen a large body of work that uses visualization (VIS) to interpret machine learning (ML) models. The corresponding research topic, VIS4ML, continues to grow rapidly. To organize this body of work and shed light on how VIS4ML is developing, we provide a systematic review in this survey. Because data quality greatly affects the performance of ML models, the survey focuses on summarizing VIS4ML works from the data perspective. First, we categorize the common data handled by ML models into five types, explain the unique features of each type, and highlight the ML models that are good at learning from them. Second, from the large number of VIS4ML works, we identify six tasks that operate on these types of data (i.e., data-centric tasks) at different stages of the ML pipeline to understand, diagnose, and refine ML models. Lastly, by studying the distribution of 143 surveyed papers across the five data types, six data-centric tasks, and their intersections, we analyze prospective research directions and envision future research trends.


