Cleora: A Simple, Strong and Scalable Graph Embedding Scheme

Barbara Rychalska, Piotr Bąbel, Konrad Gołuchowski, Andrzej Michałowski, Jacek Dąbrowski

Abstract

The area of graph embeddings is currently dominated by contrastive learning methods, which demand formulation of an explicit objective function and sampling of positive and negative examples. This creates a conceptual and computational overhead. Simple, classic unsupervised approaches like Multidimensional Scaling (MDS) or the Laplacian eigenmap skip the necessity of tedious objective optimization, directly exploiting data geometry. Unfortunately, their reliance on very costly operations such as matrix eigendecomposition makes them unable to scale to the large graphs that are common in today's digital world. In this paper we present Cleora: an algorithm which gets the best of two worlds, being both unsupervised and highly scalable. We show that high-quality embeddings can be produced without the popular step-wise learning framework with example sampling. An intuitive learning objective of our algorithm is that a node should be similar to its neighbors, without explicitly pushing disconnected nodes apart. The objective is achieved by iterative weighted averaging of node neighbors' embeddings, followed by normalization across dimensions. Thanks to the averaging operation, the algorithm makes rapid strides across the embedding space and usually reaches optimal embeddings in just a few iterations. Cleora runs faster than other state-of-the-art CPU algorithms and produces embeddings of competitive quality as measured on downstream tasks: link prediction and node classification. We show that Cleora learns a data abstraction that is similar to contrastive methods, yet at much lower computational cost. We open-source Cleora under the MIT license, which allows commercial use, at https://github.com/Synerise/cleora.
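
The update described in the abstract (weighted averaging of neighbor embeddings, then normalization) can be illustrated with a minimal NumPy sketch. This is not the official Cleora implementation. The function name `cleora_like_embed`, the dense adjacency matrix, the uniform random initialization, and the reading of "normalization across dimensions" as L2-normalizing each node's vector are all assumptions made for illustration.

```python
import numpy as np


def cleora_like_embed(edges, num_nodes, dim=8, iterations=3, seed=0):
    """Hypothetical sketch: neighbor averaging followed by L2 normalization."""
    # Build a symmetric, weighted adjacency matrix from an undirected edge list.
    adj = np.zeros((num_nodes, num_nodes))
    for u, v in edges:
        adj[u, v] += 1.0
        adj[v, u] += 1.0

    # Divide each row by its sum so that multiplying by this matrix
    # computes a weighted average over each node's neighbors.
    degree = adj.sum(axis=1, keepdims=True)
    degree[degree == 0] = 1.0  # isolated nodes keep an all-zero row
    transition = adj / degree

    # Assumption: random initialization, uniform in [-1, 1].
    rng = np.random.default_rng(seed)
    emb = rng.uniform(-1.0, 1.0, size=(num_nodes, dim))

    for _ in range(iterations):
        # Replace each node's vector with the average of its neighbors' vectors.
        emb = transition @ emb
        # Assumption: rescale each node's vector to unit L2 norm.
        norms = np.linalg.norm(emb, axis=1, keepdims=True)
        norms[norms == 0] = 1.0  # avoid division by zero for all-zero rows
        emb = emb / norms
    return emb


# Two triangles joined by one edge (a toy graph, not from the paper).
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
embeddings = cleora_like_embed(edges, num_nodes=6)
print(embeddings.shape)
```

The dense matrix keeps the sketch short. A graph of realistic size would need a sparse representation, and the released code at https://github.com/Synerise/cleora should be consulted for the actual algorithm and its parameters.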

Code Repositories

Synerise/booking-challenge (pytorch; mentioned in GitHub)
Synerise/cleora (official; pytorch; mentioned in GitHub)

Benchmarks

Benchmark                          Methodology   Metrics
node-classification-on-citeseer    Cleora        Accuracy: 75.7%
node-classification-on-cora        Cleora        Accuracy: 86.80%
node-classification-on-pubmed      Cleora        Accuracy: 80.2%
node-classification-on-youtube     Cleora        Macro-F1@2%: 30.77; Micro-F1@2%: 38.59
