A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep Learning

04/30/2020
by   Austin Clyde, et al.

By combining various cancer cell line (CCL) drug screening panels, the available data has grown large enough to begin examining how advances in deep learning can improve drug response predictions. In this paper we train >35,000 neural network models, sweeping over common featurization techniques. We found the RNA-seq features to be highly redundant, and informative even with subsets larger than 128 features. We found that including single nucleotide polymorphisms (SNPs) coded as count matrices improved model performance significantly, and that model performance did not differ substantially between molecular featurizations based on the common open-source Mordred descriptors and Dragon7 descriptors. Alongside this analysis, we outline data integration between CCL screening datasets and present evidence that new metrics and imbalanced data techniques, as well as advances in data standardization, need to be developed.
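As a rough illustration of what coding SNPs as a count matrix could look like, the sketch below tallies SNP calls per cell line and gene. The abstract does not specify the encoding, so the per-gene granularity, the function name `snp_count_matrix`, and the toy cell line and gene labels are all assumptions made for illustration, not the authors' pipeline.

```python
from collections import Counter


def snp_count_matrix(calls, cell_lines, genes):
    """Build a cell-line x gene matrix of SNP counts (hypothetical helper).

    calls: iterable of (cell_line, gene) pairs, one pair per observed SNP.
    Returns a list of rows ordered like cell_lines, with columns ordered
    like genes. Pairs that never occur get a count of 0.
    """
    counts = Counter(calls)
    return [[counts[(cl, g)] for g in genes] for cl in cell_lines]


# Toy data, invented for illustration only.
calls = [
    ("CCL_A", "TP53"),
    ("CCL_A", "TP53"),
    ("CCL_A", "KRAS"),
    ("CCL_B", "KRAS"),
]
cell_lines = ["CCL_A", "CCL_B", "CCL_C"]
genes = ["TP53", "KRAS", "EGFR"]

matrix = snp_count_matrix(calls, cell_lines, genes)
```

Each row of `matrix` could then be concatenated with that cell line's expression features before being fed to a model; whether counts are kept raw or scaled is a further design choice left open here.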

research
12/28/2018

Drug cell line interaction prediction

Understanding the phenotypic drug response on cancer cell lines plays a ...
research
10/28/2021

MOOMIN: Deep Molecular Omics Network for Anti-Cancer Drug Combination Therapy

We propose the molecular omics network (MOOMIN) a multimodal graph neura...
research
11/25/2020

Learning Curves for Drug Response Prediction in Cancer Cell Lines

Motivated by the size of cell line drug sensitivity data, researchers ha...
research
11/13/2019

AMPL: A Data-Driven Modeling Pipeline for Drug Discovery

One of the key requirements for incorporating machine learning into the ...
research
11/10/2016

Low Data Drug Discovery with One-shot Learning

Recent advances in machine learning have made significant contributions ...
research
12/30/2013

Identification of structural features in chemicals associated with cancer drug response: A systematic data-driven analysis

Motivation: Analysis of relationships of drug structure to biological re...
research
06/01/2020

Regression Enrichment Surfaces: a Simple Analysis Technique for Virtual Drug Screening Models

We present a new method for understanding the performance of a model in ...
