HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Open Graph Benchmark: Datasets for Machine Learning on Graphs

Weihua Hu Matthias Fey Marinka Zitnik Yuxiao Dong Hongyu Ren Bowen Liu Michele Catasta Jure Leskovec

Open Graph Benchmark: Datasets for Machine Learning on Graphs

Abstract

We present the Open Graph Benchmark (OGB), a diverse set of challenging and realistic benchmark datasets to facilitate scalable, robust, and reproducible graph machine learning (ML) research. OGB datasets are large-scale, encompass multiple important graph ML tasks, and cover a diverse range of domains, ranging from social and information networks to biological networks, molecular graphs, source code ASTs, and knowledge graphs. For each dataset, we provide a unified evaluation protocol using meaningful application-specific data splits and evaluation metrics. In addition to building the datasets, we also perform extensive benchmark experiments for each dataset. Our experiments suggest that OGB datasets present significant challenges of scalability to large-scale graphs and out-of-distribution generalization under realistic data splits, indicating fruitful opportunities for future research. Finally, OGB provides an automated end-to-end graph ML pipeline that simplifies and standardizes the process of graph data loading, experimental setup, and model evaluation. OGB will be regularly updated and welcomes inputs from the community. OGB datasets as well as data loaders, evaluation scripts, baseline code, and leaderboards are publicly available at https://ogb.stanford.edu .

Benchmarks

BenchmarkMethodologyMetrics
link-property-prediction-on-ogbl-citation2Matrix Factorization
Ext. data: No
Number of params: 281113505
Test MRR: 0.5186 ± 0.0443
Validation MRR: 0.5181 ± 0.0436
link-property-prediction-on-ogbl-collabMatrix Factorization
Ext. data: No
Number of params: 60514049
Test Hits@50: 0.3886 ± 0.0029
Validation Hits@50: 0.4896 ± 0.0029
link-property-prediction-on-ogbl-ddiMatrix Factorization
Ext. data: No
Number of params: 1224193
Test Hits@20: 0.1368 ± 0.0475
Validation Hits@20: 0.3370 ± 0.0264
link-property-prediction-on-ogbl-ppaMatrix Factorization
Ext. data: No
Number of params: 147662849
Test Hits@100: 0.3229 ± 0.0094
Validation Hits@100: 0.3228 ± 0.0428

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Open Graph Benchmark: Datasets for Machine Learning on Graphs | Papers | HyperAI