XVir: A Transformer-Based Architecture for Identifying Viral Reads from Cancer Samples

08/28/2023
by   Shorya Consul, et al.
0

It is estimated that approximately 15 viral infections. The viruses that can cause or increase the risk of cancer include human papillomavirus, hepatitis B and C viruses, Epstein-Barr virus, and human immunodeficiency virus, to name a few. The computational analysis of the massive amounts of tumor DNA data, whose collection is enabled by the recent advancements in sequencing technologies, have allowed studies of the potential association between cancers and viral pathogens. However, the high diversity of oncoviral families makes reliable detection of viral DNA difficult and thus, renders such analysis challenging. In this paper, we introduce XVir, a data pipeline that relies on a transformer-based deep learning architecture to reliably identify viral DNA present in human tumors. In particular, XVir is trained on genomic sequencing reads from viral and human genomes and may be used with tumor sequence information to find evidence of viral DNA in human cancers. Results on semi-experimental data demonstrate that XVir is capable of achieving high detection accuracy, generally outperforming state-of-the-art competing methods while being more compact and less computationally demanding.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/02/2018

Deep Neural Network for Analysis of DNA Methylation Data

Many researches demonstrated that the DNA methylation, which occurs in t...
research
05/07/2019

Somatic mutations render human exome and pathogen DNA more similar

Immunotherapy has recently shown important clinical successes in a subst...
research
12/21/2018

Pan-Cancer Epigenetic Biomarker Selection from Blood Samples Using SAS

A key focus in current cancer research is the discovery of cancer biomar...
research
02/23/2022

Using Deep Learning to Detect Digitally Encoded DNA Trigger for Trojan Malware in Bio-Cyber Attacks

This article uses Deep Learning technologies to safeguard DNA sequencing...
research
02/17/2023

Learning models for classifying Raman spectra of genomic DNA from tumor subtypes

An early detection of different tumor subtypes is crucial for an effecti...
research
03/26/2022

AI-augmented histopathologic review using image analysis to optimize DNA yield and tumor purity from FFPE slides

To achieve minimum DNA input and tumor purity requirements for next-gene...
research
03/23/2023

Differential Co-Abundance Network Analyses for Microbiome Data Adjusted for Clinical Covariates Using Jackknife Pseudo-Values

A recent breakthrough in differential network (DN) analysis of microbiom...

Please sign up or login with your details

Forgot password? Click here to reset