A Machine Learning-based Approach to Detect Threats in Bio-Cyber DNA Storage Systems

by Federico Tavella, et al.

Data storage is one of the main computing issues of this century. Not only are storage devices approaching strict physical limits, but the amount of data generated by users is also growing rapidly. To face these challenges, data centres have grown steadily over the past decades. However, this growth comes at a price, particularly from an environmental point of view. Among the alternative storage media being explored, DNA is one of the most fascinating candidates. In our previous work, we proposed an automated archival architecture that uses bioengineered bacteria to store and retrieve data previously encoded into DNA. This storage technique is one example of how biological media can deliver power-efficient storage solutions. The similarities between these biological media and classical ones can also be a drawback, as malicious parties might replicate traditional attacks on the DNA-based archival system using biological instruments and techniques. In this paper, we first analyse the main characteristics of our storage system and the different types of attacks that could be executed on it. Then, aiming to identify ongoing attacks, we propose and evaluate detection techniques that rely on traditional metrics and machine learning algorithms. We identify and adapt two suitable metrics for this purpose, namely generalized entropy and information distance. Moreover, our trained models achieve an AUROC over 0.99 and an AUPRC over 0.91.
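The abstract names generalized entropy and information distance but does not define them. As a rough illustration only, the sketch below assumes the common definitions from traffic-anomaly detection: generalized entropy as the Rényi entropy of order alpha, and information distance as the symmetrised Rényi divergence. It computes both over the nucleotide frequencies of two reads. The function names, the choice of single-nucleotide frequencies as the feature, and the example sequences are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def nucleotide_distribution(seq, alphabet="ACGT"):
    """Relative frequency of each base in seq (assumed feature, not the paper's)."""
    counts = Counter(seq)
    total = sum(counts[base] for base in alphabet)
    if total == 0:
        raise ValueError("sequence contains no bases from the alphabet")
    return [counts[base] / total for base in alphabet]


def renyi_entropy(p, alpha):
    """Renyi entropy of order alpha in bits; alpha == 1 gives Shannon entropy."""
    if alpha == 1:
        return -sum(x * math.log2(x) for x in p if x > 0)
    return math.log2(sum(x ** alpha for x in p if x > 0)) / (1 - alpha)


def renyi_divergence(p, q, alpha):
    """Renyi divergence D_alpha(p || q) in bits, for alpha != 1.

    Assumes q[i] > 0 wherever p[i] > 0; otherwise the divergence is unbounded.
    """
    total = sum(pi ** alpha * qi ** (1 - alpha) for pi, qi in zip(p, q) if pi > 0)
    return math.log2(total) / (alpha - 1)


def information_distance(p, q, alpha):
    """Symmetrised Renyi divergence (assumed definition of information distance)."""
    return renyi_divergence(p, q, alpha) + renyi_divergence(q, p, alpha)


# Hypothetical reads: a balanced reference and a GC-skewed sample.
reference = nucleotide_distribution("ACGT" * 5)
sample = nucleotide_distribution("GGGCCCGCGCACGTGGCC")

entropy_reference = renyi_entropy(reference, alpha=2)
entropy_sample = renyi_entropy(sample, alpha=2)
distance = information_distance(reference, sample, alpha=2)
```

In a detector built along these lines, a drop in entropy or a rise in distance relative to a known-good baseline would be the signal to investigate; where to set that threshold is left open here.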




MQ-Coder inspired arithmetic coder for synthetic DNA data storage

Over the past years, the ever-growing trend on data storage demand, more...

Image Storage on Synthetic DNA Using Autoencoders

Over the past years, the ever-growing trend on data storage demand, more...

A biologically constrained encoding solution for long-term storage of images onto synthetic DNA

Living in the age of the digital media explosion, the amount of data tha...

DNA data storage, sequencing data-carrying DNA

DNA is a leading candidate as the next archival storage media due to its...

Using Deep Learning to Detect Digitally Encoded DNA Trigger for Trojan Malware in Bio-Cyber Attacks

This article uses Deep Learning technologies to safeguard DNA sequencing...

Image processing in DNA

The main obstacles for the practical deployment of DNA-based data storag...

Securing Tag-based recommender systems against profile injection attacks: A comparative study

This work addresses challenges related to attacks on social tagging syst...
