Learning To Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning

by   Sai Krishna Gottipati, et al.

Over the last decade, there has been significant progress in the field of machine learning for de novo drug design, particularly in deep generative models. However, current generative approaches exhibit a significant challenge as they do not ensure that the proposed molecular structures can be feasibly synthesized nor do they provide the synthesis routes of the proposed small molecules, thereby seriously limiting their practical applicability. In this work, we propose a novel forward synthesis framework powered by reinforcement learning (RL) for de novo drug design, Policy Gradient for Forward Synthesis (PGFS), that addresses this challenge by embedding the concept of synthetic accessibility directly into the de novo drug design system. In this setup, the agent learns to navigate through the immense synthetically accessible chemical space by subjecting commercially available small molecule building blocks to valid chemical reactions at every time step of the iterative virtual multi-step synthesis process. The proposed environment for drug discovery provides a highly challenging test-bed for RL algorithms owing to the large state space and high-dimensional continuous action space with hierarchical actions. PGFS achieves state-of-the-art performance in generating structures with high QED and logP. Moreover, we validate PGFS in an in-silico proof-of-concept associated with three HIV targets, and the candidates generated with PGFS outperformed the existing benchmarks in optimizing the activity of the biological targets. Finally, we describe how the end-to-end training conceptualized in this study represents an important paradigm in radically expanding the synthesizable chemical space and automating the drug discovery process.


page 1

page 2

page 3

page 4


Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning

The fundamental goal of generative drug design is to propose optimized m...

An efficient graph generative model for navigating ultra-large combinatorial synthesis libraries

Virtual, make-on-demand chemical libraries have transformed early-stage ...

Deep Denerative Models for Drug Design and Response

Designing new chemical compounds with desired pharmaceutical properties ...

GraphAF: a Flow-based Autoregressive Model for Molecular Graph Generation

Molecular graph generation is a fundamental problem for drug discovery a...

Uni-RXN: A Unified Framework Bridging the Gap between Chemical Reaction Pretraining and Conditional Molecule Generation

Chemical reactions are the fundamental building blocks of drug design an...

Barking up the right tree: an approach to search over molecule synthesis DAGs

When designing new molecules with particular properties, it is not only ...

Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug Discovery

We developed Distilled Graph Attention Policy Networks (DGAPNs), a curio...

Please sign up or login with your details

Forgot password? Click here to reset