Command Palette
Search for a command to run...
Bryan Perozzi; Rami Al-Rfou; Steven Skiena

Abstract
We present DeepWalk, a novel approach for learning latent representations of vertices in a network. These latent representations encode social relations in a continuous vector space, which is easily exploited by statistical models. DeepWalk generalizes recent advancements in language modeling and unsupervised feature learning (or deep learning) from sequences of words to graphs. DeepWalk uses local information obtained from truncated random walks to learn latent representations by treating walks as the equivalent of sentences. We demonstrate DeepWalk's latent representations on several multi-label network classification tasks for social networks such as BlogCatalog, Flickr, and YouTube. Our results show that DeepWalk outperforms challenging baselines which are allowed a global view of the network, especially in the presence of missing information. DeepWalk's representations can provide $F_1$ scores up to 10% higher than competing methods when labeled data is sparse. In some experiments, DeepWalk's representations are able to outperform all baseline methods while using 60% less training data. DeepWalk is also scalable. It is an online learning algorithm which builds useful incremental results, and is trivially parallelizable. These qualities make it suitable for a broad class of real world applications such as network classification, and anomaly detection.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| document-classification-on-cora | DeepWalk | Accuracy: 67.2% |
| link-property-prediction-on-ogbl-collab | DeepWalk | Ext. data: No Number of params: 61390187 Test Hits@50: 0.5037 ± 0.0034 Validation Hits@50: Please tell us |
| link-property-prediction-on-ogbl-ddi | DeepWalk | Ext. data: No Number of params: 1543913 Test Hits@20: 0.2246 ± 0.0290 Validation Hits@20: Please tell us |
| link-property-prediction-on-ogbl-ppa | DeepWalk | Ext. data: No Number of params: 150138741 Test Hits@100: 0.2302 ± 0.0163 Validation Hits@100: Please tell us |
| node-classification-on-blogcatalog | DeepWalk | Accuracy: 22.5% Macro-F1: 0.214 |
| node-classification-on-wikipedia | DeepWalk | Accuracy: 19.4% Macro-F1: 0.183 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.