HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Infinite Recommendation Networks: A Data-Centric Approach

Noveen Sachdeva Mehak Preet Dhaliwal Carole-Jean Wu Julian McAuley

Infinite Recommendation Networks: A Data-Centric Approach

Abstract

We leverage the Neural Tangent Kernel and its equivalence to training infinitely-wide neural networks to devise $\infty$-AE: an autoencoder with infinitely-wide bottleneck layers. The outcome is a highly expressive yet simplistic recommendation model with a single hyper-parameter and a closed-form solution. Leveraging $\infty$-AE's simplicity, we also develop Distill-CF for synthesizing tiny, high-fidelity data summaries which distill the most important knowledge from the extremely large and sparse user-item interaction matrix for efficient and accurate subsequent data-usage like model training, inference, architecture search, etc. This takes a data-centric approach to recommendation, where we aim to improve the quality of logged user-feedback data for subsequent modeling, independent of the learning algorithm. We particularly utilize the concept of differentiable Gumbel-sampling to handle the inherent data heterogeneity, sparsity, and semi-structuredness, while being scalable to datasets with hundreds of millions of user-item interactions. Both of our proposed approaches significantly outperform their respective state-of-the-art and when used together, we observe 96-105% of $\infty$-AE's performance on the full dataset with as little as 0.1% of the original dataset size, leading us to explore the counter-intuitive question: Is more data what you need for better recommendation?

Code Repositories

noveens/infinite_ae_cf
Official
jax
Mentioned in GitHub
recsys-benchmark/daisyrec-v2.0
pytorch
Mentioned in GitHub
AmazingDD/daisyRec
pytorch
Mentioned in GitHub
noveens/distill_cf
Official
jax
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
collaborative-filtering-on-douban∞-AE
AUC: 0.9523
HR@10: 0.2356
HR@100: 0.2837
PSP@10: 0.0128
nDCG@10: 0.2494
nDCG@100: 0.2326
collaborative-filtering-on-movielens-1m∞-AE
HR@10: 0.3151
HR@100: 0.6005
PSP@10: 0.0322
nDCG@10: 0.3282
nDCG@100: 0.4253
collaborative-filtering-on-netflix∞-AE
AUC: 0.9728
PSP@10: 0.0375
Recall@10: 0.2969
Recall@100: 0.5088
nDCG@10: 0.3059
nDCG@100: 0.3659

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Infinite Recommendation Networks: A Data-Centric Approach | Papers | HyperAI