Large Scale Radio Frequency Signal Classification

by   Luke Boegner, et al.

Existing datasets used to train deep learning models for narrowband radio frequency (RF) signal classification lack enough diversity in signal types and channel impairments to sufficiently assess model performance in the real world. We introduce the Sig53 dataset consisting of 5 million synthetically-generated samples from 53 different signal classes and expertly chosen impairments. We also introduce TorchSig, a signals processing machine learning toolkit that can be used to generate this dataset. TorchSig incorporates data handling principles that are common to the vision domain, and it is meant to serve as an open-source foundation for future signals machine learning research. Initial experiments using the Sig53 dataset are conducted using state of the art (SoTA) convolutional neural networks (ConvNets) and Transformers. These experiments reveal Transformers outperform ConvNets without the need for additional regularization or a ConvNet teacher, which is contrary to results from the vision domain. Additional experiments demonstrate that TorchSig's domain-specific data augmentations facilitate model training, which ultimately benefits model performance. Finally, TorchSig supports on-the-fly synthetic data creation at training time, thus enabling massive scale training sessions with virtually unlimited datasets.


page 9

page 16

page 17

page 22

page 23

page 24

page 25

page 26


RF Signal Classification with Synthetic Training Data and its Real-World Performance

Neural nets are a powerful method for the classification of radio signal...

Deep Learning Radio Frequency Signal Classification with Hybrid Images

In recent years, Deep Learning (DL) has been successfully applied to det...

Standardizing and Centralizing Datasets to Enable Efficient Training of Agricultural Deep Learning Models

In recent years, deep learning models have become the standard for agric...

Transmitter Classification With Supervised Deep Learning

Hardware imperfections in RF transmitters introduce features that can be...

Quantifying and Extrapolating Data Needs in Radio Frequency Machine Learning

Understanding the relationship between training data and a model's perfo...

Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs

Most NLP tasks are modeled as supervised learning and thus require label...

Optimizing the Procedure of CT Segmentation Labeling

In Computed Tomography, machine learning is often used for automated dat...

Please sign up or login with your details

Forgot password? Click here to reset