A Molecular Multimodal Foundation Model Associating Molecule Graphs with Natural Language

Bing Su; Dazhao Du; Zhao Yang; Yujie Zhou; Jiangmeng Li; Anyi Rao; Hao Sun; Zhiwu Lu; Ji-Rong Wen

Abstract

Although artificial intelligence (AI) has made significant progress in understanding molecules in a wide range of fields, existing models generally acquire a single cognitive ability from a single molecular modality. Because molecular knowledge is deeply hierarchical, even humans learn from different modalities, including both intuitive diagrams and professional texts, to assist their understanding. Inspired by this, we propose a molecular multimodal foundation model that is pretrained on molecular graphs and their semantically related textual data (crawled from published Science Citation Index papers) via contrastive learning. This AI model represents a critical attempt to directly bridge molecular graphs and natural language. Importantly, by capturing the specific and complementary information of the two modalities, our proposed model can better grasp molecular expertise. Experimental results show that our model not only exhibits promising performance in cross-modal tasks such as cross-modal retrieval and molecule captioning, but also enhances molecular property prediction and can generate meaningful molecular graphs from natural language descriptions. We believe that our model will have a broad impact on AI-empowered fields across disciplines such as biology, chemistry, materials, environment, and medicine, among others.
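
The abstract names contrastive learning over paired molecular graphs and texts but does not spell out the objective. As a rough illustration only, the sketch below shows one common formulation, a symmetric InfoNCE-style loss over a batch of paired embeddings, followed by graph-to-text retrieval by cosine similarity. It is not the authors' implementation: the function names, the temperature of 0.1, and the toy 2-dimensional embeddings are all assumptions made for the example, and a real model would obtain the embeddings from a graph encoder and a text encoder.

```python
import math

def l2_normalize(v):
    # Scale a vector to unit length so dot products become cosine similarities.
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def similarity_matrix(graph_embs, text_embs, temperature):
    # Entry [i][j] is cosine(graph_i, text_j) divided by the temperature.
    g = [l2_normalize(v) for v in graph_embs]
    t = [l2_normalize(v) for v in text_embs]
    return [[sum(a * b for a, b in zip(gi, tj)) / temperature for tj in t]
            for gi in g]

def cross_entropy_diagonal(logits):
    # Mean cross-entropy where the correct class for row i is column i,
    # i.e. the i-th graph is paired with the i-th text.
    total = 0.0
    for i, row in enumerate(logits):
        m = max(row)  # subtract the max for numerical stability
        log_z = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_z - row[i]
    return total / len(logits)

def symmetric_contrastive_loss(graph_embs, text_embs, temperature=0.1):
    # Average of the graph-to-text and text-to-graph losses.
    # The temperature value is an assumption, not taken from the paper.
    logits = similarity_matrix(graph_embs, text_embs, temperature)
    transposed = [list(col) for col in zip(*logits)]
    return 0.5 * (cross_entropy_diagonal(logits)
                  + cross_entropy_diagonal(transposed))

def retrieve_text(graph_emb, text_embs):
    # Cross-modal retrieval: index of the text most similar to the graph.
    scores = similarity_matrix([graph_emb], text_embs, 1.0)[0]
    return max(range(len(scores)), key=scores.__getitem__)

# Toy embeddings (hypothetical values) for two graph-text pairs.
graphs = [[1.0, 0.0], [0.0, 1.0]]
texts = [[0.9, 0.1], [0.1, 0.9]]
aligned_loss = symmetric_contrastive_loss(graphs, texts)
shuffled_loss = symmetric_contrastive_loss(graphs, texts[::-1])
best_text = retrieve_text(graphs[0], texts)
```

In this toy batch, the loss on correctly paired embeddings is lower than the loss on shuffled pairs, which is the signal that pulls matching graph and text representations together during pretraining.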

Code Repositories

ai-hpc-research-team/git-mol (PyTorch; mentioned in GitHub)
bingsu12/momu (PyTorch; official)
yangzhao1230/graphtextretrieval (PyTorch; mentioned in GitHub)
ai-hpc-research-team/slm4mol (PyTorch; mentioned in GitHub)

Benchmarks

Benchmark | Methodology | BLEU-2 | BLEU-4 | METEOR | Text2Mol
molecule-captioning-on-chebi-20 | MoMu+MolT5-Large | 59.9 | 51.5 | 59.7 | 58.2
molecule-captioning-on-chebi-20 | MoMu+MolT5-Base | 54.9 | 46.2 | 57.6 | 55.8
molecule-captioning-on-chebi-20 | MoMu+MolT5-Small | 53.2 | 44.5 | 55.7 | 55.3
