HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for High Precision Score Estimation

Dongjun Kim Seungjae Shin Kyungwoo Song Wanmo Kang Il-Chul Moon

Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for High Precision Score Estimation

Abstract

Recent advances in diffusion models bring state-of-the-art performance on image generation tasks. However, empirical results from previous research in diffusion models imply an inverse correlation between density estimation and sample generation performances. This paper investigates with sufficient empirical evidence that such inverse correlation happens because density estimation is significantly contributed by small diffusion time, whereas sample generation mainly depends on large diffusion time. However, training a score network well across the entire diffusion time is demanding because the loss scale is significantly imbalanced at each diffusion time. For successful training, therefore, we introduce Soft Truncation, a universally applicable training technique for diffusion models, that softens the fixed and static truncation hyperparameter into a random variable. In experiments, Soft Truncation achieves state-of-the-art performance on CIFAR-10, CelebA, CelebA-HQ 256x256, and STL-10 datasets.

Code Repositories

Kim-Dongjun/Soft-Truncation
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-generation-on-celeba-64x64DDPM++ (VP, NLL) + ST
FID: 2.9
bits/dimension: 1.96
image-generation-on-celeba-64x64UNCSN++ (RVE) + ST
bits/dimension: 1.97
image-generation-on-celeba-64x64DDPM++ (VP, FID) + ST
FID: 1.9
bits/dimension: 2.1
image-generation-on-celeba-hq-256x256UNCSN++ (RVE) + ST
FID: 7.16
image-generation-on-ffhq-256-x-256UDM (RVE) + ST
FID: 5.54
image-generation-on-imagenet-32x32DDPM++ (VP, NLL) + ST
FID: 8.42
Inception score: 11.82
bpd: 3.85
image-generation-on-lsun-bedroom-256-x-256UDM (RVE) + ST
FID: 4.57
image-generation-on-stl-10UNCSN++ (RVE) + ST
FID: 7.71
Inception score: 13.43

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for High Precision Score Estimation | Papers | HyperAI