HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Molecule3D: A Benchmark for Predicting 3D Geometries from Molecular Graphs

Zhao Xu; Youzhi Luo; Xuan Zhang; Xinyi Xu; Yaochen Xie; Meng Liu; Kaleb Dickerson; Cheng Deng; Maho Nakata; Shuiwang Ji

Molecule3D: A Benchmark for Predicting 3D Geometries from Molecular Graphs

Abstract

Graph neural networks are emerging as promising methods for modeling molecular graphs, in which nodes and edges correspond to atoms and chemical bonds, respectively. Recent studies show that when 3D molecular geometries, such as bond lengths and angles, are available, molecular property prediction tasks can be made more accurate. However, computing of 3D molecular geometries requires quantum calculations that are computationally prohibitive. For example, accurate calculation of 3D geometries of a small molecule requires hours of computing time using density functional theory (DFT). Here, we propose to predict the ground-state 3D geometries from molecular graphs using machine learning methods. To make this feasible, we develop a benchmark, known as Molecule3D, that includes a dataset with precise ground-state geometries of approximately 4 million molecules derived from DFT. We also provide a set of software tools for data processing, splitting, training, and evaluation, etc. Specifically, we propose to assess the error and validity of predicted geometries using four metrics. We implement two baseline methods that either predict the pairwise distance between atoms or atom coordinates in 3D space. Experimental results show that, compared with generating 3D geometries with RDKit, our method can achieve comparable prediction accuracy but with much smaller computational costs. Our Molecule3D is available as a module of the MoleculeX software library (https://github.com/divelab/MoleculeX).

Code Repositories

chao1224/geossl
pytorch
Mentioned in GitHub
divelab/MoleculeX
Official
pytorch
chao1224/se3ddm
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-geometry-prediction-on-molecule3d-testDeeperGCN-DAGNN + Coordinates
MAE: 0.571
RMSE: 0.961
Validity: 100
Validity3D: 100
3d-geometry-prediction-on-molecule3d-testDeeperGCN-DAGNN + Distance
MAE: 0.483
RMSE: 0.753
Validity: 1.69
Validity3D: 0.03
3d-geometry-prediction-on-molecule3d-valDeeperGCN-DAGNN + Distance
MAE: 0.482
RMSE: 0.749
Validity: 1.71
Validity3D: 0.02
3d-geometry-prediction-on-molecule3d-valDeeperGCN-DAGNN + Coordinates
MAE: 0.509
RMSE: 0.849
Validity: 100
Validity3D: 100

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Molecule3D: A Benchmark for Predicting 3D Geometries from Molecular Graphs | Papers | HyperAI