Deep Learning and Random Forest-Based Augmentation of sRNA Expression Profiles

09/26/2019
by   Jelena Fiosina, et al.
0

The lack of well-structured annotations in a growing amount of RNA expression data complicates data interoperability and reusability. Commonly - used text mining methods extract annotations from existing unstructured data descriptions and often provide inaccurate output that requires manual curation. Automatic data-based augmentation (generation of annotations on the base of expression data) can considerably improve the annotation quality and has not been well-studied. We formulate an automatic augmentation of small RNA-seq expression data as a classification problem and investigate deep learning (DL) and random forest (RF) approaches to solve it. We generate tissue and sex annotations from small RNA-seq expression data for tissues and cell lines of homo sapiens. We validate our approach on 4243 annotated small RNA-seq samples from the Small RNA Expression Atlas (SEA) database. The average prediction accuracy for tissue groups is 98 77 83 RF, and considerably improves classification performance for 'unseen' datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/26/2019

Explainable Deep Learning for Augmentation of sRNA Expression Profiles

The lack of well-structured metadata annotations complicates there-usabi...
research
04/19/2018

A Dynamic Boosted Ensemble Learning Method Based on Random Forest

We propose a dynamic boosted ensemble learning method based on random fo...
research
08/02/2023

Evaluation of network-guided random forest for disease gene discovery

Gene network information is believed to be beneficial for disease module...
research
06/28/2018

Deep Semi Supervised Generative Learning for Automated PD-L1 Tumor Cell Scoring on NSCLC Tissue Needle Biopsies

The level of PD-L1 expression in immunohistochemistry (IHC) assays is a ...
research
06/23/2018

Disease Classification in Metagenomics with 2D Embeddings and Deep Learning

Deep learning (DL) techniques have shown unprecedented success when appl...
research
08/22/2022

MetaRF: Differentiable Random Forest for Reaction Yield Prediction with a Few Trails

Artificial intelligence has deeply revolutionized the field of medicinal...
research
06/07/2023

Improved statistical benchmarking of digital pathology models using pairwise frames evaluation

Nested pairwise frames is a method for relative benchmarking of cell or ...

Please sign up or login with your details

Forgot password? Click here to reset