Heterogenous Ensemble of Models for Molecular Property Prediction

11/20/2022
by   Sajad Darabi, et al.

Previous work has demonstrated the importance of considering different modalities of molecules, each of which provides a different granularity of information for downstream property prediction tasks. Our method combines variants of the recent TransformerM architecture with Transformer, GNN, and ResNet backbone architectures. Models are trained on the 2D, 3D, and image modalities of molecular graphs, each on 4 different train/validation splits of the original train + valid datasets. We ensemble these models with a HuberRegressor. This yields a winning solution to the 2nd edition of the OGB Large-Scale Challenge (2022) on the PCQM4Mv2 molecular property prediction dataset. Our proposed method achieves a test-challenge MAE of 0.0723 and a validation MAE of 0.07145. Total inference time for our solution is less than 2 hours. We open-source our code at https://github.com/jfpuget/NVIDIA-PCQM4Mv2.
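
The ensembling step can be illustrated with a minimal sketch that assumes the blender is scikit-learn's `HuberRegressor` fit on per-model predictions. This is not the authors' pipeline: the data is synthetic, the four "models" are just the target plus noise at hypothetical noise levels, and the half/half fit-and-evaluate split stands in for the 4 train/validation splits described in the abstract.

```python
import numpy as np
from sklearn.linear_model import HuberRegressor

rng = np.random.default_rng(0)

# Synthetic standardized targets; NOT real PCQM4Mv2 labels.
n_samples, n_models = 2000, 4
y = rng.normal(size=n_samples)

# Hypothetical base-model predictions: target plus independent,
# model-specific noise (the noise levels are illustrative assumptions).
noise_scales = np.array([0.10, 0.12, 0.15, 0.20])
preds = y[:, None] + rng.normal(size=(n_samples, n_models)) * noise_scales

# Fit the blender on one half of the held-out predictions and
# evaluate it on the other half.
half = n_samples // 2
blender = HuberRegressor(max_iter=1000)
blender.fit(preds[:half], y[:half])


def mae(pred, target):
    """Mean absolute error, the metric reported for PCQM4Mv2."""
    return float(np.mean(np.abs(pred - target)))


blend_mae = mae(blender.predict(preds[half:]), y[half:])
single_maes = [mae(preds[half:, j], y[half:]) for j in range(n_models)]

print("per-model MAE:", [round(m, 4) for m in single_maes])
print("blended MAE:  ", round(blend_mae, 4))
print("blend weights:", np.round(blender.coef_, 3))
```

In this toy setting the blend is expected to beat every individual model because their errors are independent; real models trained on related modalities have correlated errors, so the gain there would typically be smaller.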
