Empirical Evaluation of Data Augmentations for Biobehavioral Time Series Data with Deep Learning

by   Huiyuan Yang, et al.

Deep learning has performed remarkably well on many tasks recently. However, the superior performance of deep models relies heavily on the availability of a large number of training data, which limits the wide adaptation of deep models on various clinical and affective computing tasks, as the labeled data are usually very limited. As an effective technique to increase the data variability and thus train deep models with better generalization, data augmentation (DA) is a critical step for the success of deep learning models on biobehavioral time series data. However, the effectiveness of various DAs for different datasets with different tasks and deep models is understudied for biobehavioral time series data. In this paper, we first systematically review eight basic DA methods for biobehavioral time series data, and evaluate the effects on seven datasets with three backbones. Next, we explore adapting more recent DA techniques (i.e., automatic augmentation, random augmentation) to biobehavioral time series data by designing a new policy architecture applicable to time series data. Last, we try to answer the question of why a DA is effective (or not) by first summarizing two desired attributes for augmentations (challenging and faithful), and then utilizing two metrics to quantitatively measure the corresponding attributes, which can guide us in the search for more effective DA for biobehavioral time series data by designing more challenging but still faithful transformations. Our code and results are available at Link.


page 1

page 2

page 3

page 4


Time Series Data Augmentation for Deep Learning: A Survey

Deep learning performs remarkably well on many time series analysis task...

FrAug: Frequency Domain Augmentation for Time Series Forecasting

Data augmentation (DA) has become a de facto solution to expand training...

Data Augmentation techniques in time series domain: A survey and taxonomy

With the latest advances in deep learning generative models, it has not ...

Data Encoding For Healthcare Data Democratisation and Information Leakage Prevention

The lack of data democratization and information leakage from trained mo...

MLFriend: Interactive Prediction Task Recommendation for Event-Driven Time-Series Data

Most automation in machine learning focuses on model selection and hyper...

Learning Optimal Data Augmentation Policies via Bayesian Optimization for Image Classification Tasks

In recent years, deep learning has achieved remarkable achievements in m...

Harvesting Time-Series Data from Service-Based Systems Hosted in MANETs

We are concerned with reliably harvesting data collected from service-ba...

Please sign up or login with your details

Forgot password? Click here to reset