AI-driven Hypernetwork of Organic Chemistry: Network Statistics and Applications in Reaction Classification

by   Vipul Mann, et al.

Rapid discovery of new reactions and molecules in recent years has been facilitated by the advancements in high throughput screening, accessibility to a much more complex chemical design space, and the development of accurate molecular modeling frameworks. A holistic study of the growing chemistry literature is, therefore, required that focuses on understanding the recent trends and extrapolating them into possible future trajectories. To this end, several network theory-based studies have been reported that use a directed graph representation of chemical reactions. Here, we perform a study based on representing chemical reactions as hypergraphs where the hyperedges represent chemical reactions and nodes represent the participating molecules. We use a standard reactions dataset to construct a hypernetwork and report its statistics such as degree distributions, average path length, assortativity or degree correlations, PageRank centrality, and graph-based clusters (or communities). We also compute each statistic for an equivalent directed graph representation of reactions to draw parallels and highlight differences between the two. To demonstrate the AI applicability of hypergraph reaction representation, we generate dense hypergraph embeddings and use them in the reaction classification problem. We conclude that the hypernetwork representation is flexible, preserves reaction context, and uncovers hidden insights that are otherwise not apparent in a traditional directed graph representation of chemical reactions.


Scaffold Embeddings: Learning the Structure Spanned by Chemical Fragments, Scaffolds and Compounds

Molecules have seemed like a natural fit to deep learning's tendency to ...

Rxn Hypergraph: a Hypergraph Attention Model for Chemical Reaction Representation

It is fundamental for science and technology to be able to predict chemi...

Chemical Transformation Motifs - Modelling Pathways as Integer Hyperflows

We present an elaborate framework for formally modelling pathways in che...

Dynamic Complex Network Analysis of PM2.5 Concentrations in the UK using Hierarchical Directed Graphs

Worldwide exposure to fine atmospheric particles can exasperate the risk...

ChemiRise: a data-driven retrosynthesis engine

We have developed an end-to-end, retrosynthesis system, named ChemiRise,...

Chemical-Reaction-Aware Molecule Representation Learning

Molecule representation learning (MRL) methods aim to embed molecules in...

Efficient Modular Graph Transformation Rule Application

Graph transformation formalisms have proven to be suitable tools for the...

Please sign up or login with your details

Forgot password? Click here to reset