4 个月前

大规模非同质图上的学习:新基准和强大的简单方法

大规模非同质图上的学习:新基准和强大的简单方法

摘要

许多广泛用于图机器学习任务的数据集通常表现出同质性,即标签相似的节点相互连接。近期,新的图神经网络(G神经网络,GNNs)已经开发出来,突破了同质性的限制;然而,这些模型的评估往往是在规模较小且应用领域有限的图上进行的。我们收集并引入了来自不同应用领域的多样化非同质性数据集,这些数据集的节点数量最多比先前的数据集多384倍,边的数量最多多1398倍。我们进一步表明,现有的可扩展图学习和图小批量训练技术在这些非同质性数据集上会导致性能下降,从而突显了对可扩展非同质性方法进一步研究的需求。为了解决这些问题,我们提出了LINKX——一种强大的简单方法,支持直接的小批量训练和推理。通过在我们提出的多个数据集上与代表性简单方法和GNNs进行广泛的实验对比,结果表明LINKX在非同质性图上的学习性能达到了当前最佳水平。我们的代码和数据可在https://github.com/CUAI/Non-Homophily-Large-Scale 获取。

代码仓库

CUAI/Non-Homophily-Benchmarks
pytorch
GitHub 中提及
cuai/non-homophily-large-scale
官方
pytorch
GitHub 中提及
kkhuang81/AdaptKry
pytorch
GitHub 中提及
ivam-he/chebnetii
pytorch
GitHub 中提及
kkhuang81/UniFilter
pytorch
GitHub 中提及

基准测试

基准方法指标
node-classification-on-actorLINKX
Accuracy: 36.10 ± 1.55
node-classification-on-arxiv-yearLINKX
Accuracy: 56.00±1.34
node-classification-on-chameleonLINKX
Accuracy: 68.42 ± 1.38
node-classification-on-citeseer-48-32-20LINKX
1:1 Accuracy: 73.19 ± 0.99
node-classification-on-cora-48-32-20-fixedLINKX
1:1 Accuracy: 84.64 ± 1.13
node-classification-on-cornellLINKX
Accuracy: 77.84 ± 5.81
node-classification-on-geniusLINKX
Accuracy: 90.77 ± 0.27
node-classification-on-non-homophilic-10LINKX
1:1 Accuracy: 36.10 ± 1.55 
node-classification-on-non-homophilic-11LINKX
1:1 Accuracy: 68.42 ± 1.38 
node-classification-on-non-homophilic-12LINKX
1:1 Accuracy: 61.81 ± 1.80
node-classification-on-non-homophilic-13LINKX
1:1 Accuracy: 84.71 ± 0.52
node-classification-on-non-homophilic-13GCNJK
1:1 Accuracy: 81.63 ± 0.54
node-classification-on-non-homophilic-13L Prop 2-hop
1:1 Accuracy: 74.13 ± 0.46
node-classification-on-non-homophilic-13GATJK
1:1 Accuracy: 80.69 ± 0.36
node-classification-on-non-homophilic-13LINK 
1:1 Accuracy: 80.79 ± 0.49
node-classification-on-non-homophilic-13L Prop 1-hop
1:1 Accuracy: 63.21 ± 0.39
node-classification-on-non-homophilic-14GATJK
1:1 Accuracy: 56.70 ± 2.07
node-classification-on-non-homophilic-14MLP
1:1 Accuracy: 86.68 ± 0.09
node-classification-on-non-homophilic-14LINK 
1:1 Accuracy: 73.56 ± 0.14
node-classification-on-non-homophilic-14L Prop 2-hop
1:1 Accuracy: 67.04 ± 0.20
node-classification-on-non-homophilic-14GCNJK
1:1 Accuracy: 89.30 ± 0.19
node-classification-on-non-homophilic-14LINKX
1:1 Accuracy: 90.77 ± 0.27
node-classification-on-non-homophilic-14L Prop 1-hop
1:1 Accuracy: 66.02 ± 0.16
node-classification-on-non-homophilic-15MLP
1:1 Accuracy: 60.92 ± 0.07
node-classification-on-non-homophilic-15L Prop 1-hop
1:1 Accuracy: 62.77 ± 0.24
node-classification-on-non-homophilic-15GCNJK
1:1 Accuracy: 63.45 ± 0.22
node-classification-on-non-homophilic-15LINKX
1:1 Accuracy: 66.06 ± 0.19
node-classification-on-non-homophilic-15GATJK
1:1 Accuracy: 59.98 ± 2.87
node-classification-on-non-homophilic-15L Prop 2-hop
1:1 Accuracy: 63.88 ± 0.24
node-classification-on-non-homophilic-15LINK 
1:1 Accuracy: 64.85 ± 0.21
node-classification-on-non-homophilic-7LINKX
1:1 Accuracy:  77.84 ± 5.81 
node-classification-on-non-homophilic-8LINKX
1:1 Accuracy: 75.49 ± 5.72
node-classification-on-non-homophilic-9LINKX
1:1 Accuracy: 74.60 ± 8.37 
node-classification-on-penn94LINKX
Accuracy: 84.71 ± 0.52
node-classification-on-pokecLINKX
Accuracy: 82.04±0.07
node-classification-on-pubmed-48-32-20-fixedLINKX
1:1 Accuracy: 87.86 ± 0.77
node-classification-on-squirrelLINKX
Accuracy: 61.81 ± 1.80
node-classification-on-texasLINKX
Accuracy: 74.60 ± 8.37
node-classification-on-twitch-gamersLINKX
Accuracy: 66.06±0.19
node-classification-on-wiki-1LINKX
ACCURACY: 59.80±0.41
node-classification-on-wisconsinLINKX
Accuracy: 75.49 ± 5.72

用 AI 构建 AI

从想法到上线——通过免费 AI 协同编程、开箱即用的环境和市场最优价格的 GPU 加速您的 AI 开发

AI 协同编程
即用型 GPU
最优价格
立即开始

Hyper Newsletters

订阅我们的最新资讯
我们会在北京时间 每周一的上午九点 向您的邮箱投递本周内的最新更新
邮件发送服务由 MailChimp 提供
大规模非同质图上的学习:新基准和强大的简单方法 | 论文 | HyperAI超神经