Are Learned Molecular Representations Ready For Prime Time?

by   Kevin Yang, et al.

Advancements in neural machinery have led to a wide range of algorithmic solutions for molecular property prediction. Two classes of models in particular have yielded promising results: neural networks applied to computed molecular fingerprints or expert-crafted descriptors, and graph convolutional neural networks that construct a learned molecular representation by operating on the graph structure of the molecule. However, recent literature has yet to clearly determine which of these two methods is superior when generalizing to new chemical space. Furthermore, prior research has rarely examined these new models in industry research settings in comparison to existing employed models. In this paper, we benchmark models extensively on 19 public and 15 proprietary industrial datasets spanning a wide variety of chemical endpoints. In addition, we introduce a graph convolutional model that consistently outperforms models using fixed molecular descriptors as well as previous graph neural architectures on both public and proprietary datasets. Our empirical findings indicate that while approaches based on these representations have yet to reach the level of experimental reproducibility, our proposed model nevertheless offers significant improvements over models currently used in industrial workflows.


Comparison of Atom Representations in Graph Neural Networks for Molecular Property Prediction

Graph neural networks have recently become a standard method for analysi...

Molecular structure prediction based on graph convolutional networks

Due to the important application of molecular structure in many fields, ...

ToxicBlend: Virtual Screening of Toxic Compounds with Ensemble Predictors

Timely assessment of compound toxicity is one of the biggest challenges ...

Multitask Learning On Graph Neural Networks Applied To Molecular Property Predictions

Prediction of molecular properties, including physico-chemical propertie...

ProGReST: Prototypical Graph Regression Soft Trees for Molecular Property Prediction

In this work, we propose the novel Prototypical Graph Regression Self-ex...

Taking a Respite from Representation Learning for Molecular Property Prediction

Artificial intelligence (AI) has been widely applied in drug discovery w...

Please sign up or login with your details

Forgot password? Click here to reset