HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

OCD: Learning to Overfit with Conditional Diffusion Models

Shahar Lutati; Lior Wolf

OCD: Learning to Overfit with Conditional Diffusion Models

Abstract

We present a dynamic model in which the weights are conditioned on an input sample x and are learned to match those that would be obtained by finetuning a base model on x and its label y. This mapping between an input sample and network weights is approximated by a denoising diffusion model. The diffusion model we employ focuses on modifying a single layer of the base model and is conditioned on the input, activations, and output of this layer. Since the diffusion model is stochastic in nature, multiple initializations generate different networks, forming an ensemble, which leads to further improvements. Our experiments demonstrate the wide applicability of the method for image classification, 3D reconstruction, tabular data, speech separation, and natural language processing. Our code is available at https://github.com/ShaharLutatiPersonal/OCD

Code Repositories

shaharlutatipersonal/ocd
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
few-shot-text-classification-on-amazonSetFit + OCD
Accuracy: 0.41
few-shot-text-classification-on-average-onSetFit + OCD
Accuracy: 0.643
few-shot-text-classification-on-average-onT-few 3B
Accuracy: 0.633
few-shot-text-classification-on-average-onSetFit
Accuracy: 0.622
few-shot-text-classification-on-average-onSetFit + OCD(5)
Accuracy: 0.648
few-shot-text-classification-on-sst-5SetFit + OCD
Accuracy: 0.478
image-classification-on-tiny-imagenet-1DeiT-B/16-D + OCD
Validation Acc: 90.8%
image-classification-on-tiny-imagenet-1DeiT-B/16-D + OCD(5)
Validation Acc: 92.0%
speech-separation-on-libri5mixOCD
SI-SDRi: 13.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
OCD: Learning to Overfit with Conditional Diffusion Models | Papers | HyperAI