Image Generation On Imagenet 512X512

评估指标

FID
Inception score

评测结果

各个模型在此基准测试上的表现结果

Paper TitleRepository
ADM-G7.72172.71Diffusion Models Beat GANs on Image Synthesis
MaskGIT7.32156.0MaskGIT: Masked Generative Image Transformer
simple diffusion (U-ViT, L)4.53205.3Simple diffusion: End-to-end diffusion for high resolution images
MaskGIT (a=0.05)4.46342.0MaskGIT: Masked Generative Image Transformer
simple diffusion (U-Net)4.28171Simple diffusion: End-to-end diffusion for high resolution images
ADM-G, ADM-U3.85221.72Diffusion Models Beat GANs on Image Synthesis
Poly-INR3.81-Polynomial Implicit Neural Representations For Large Diverse Datasets
Latent Diffusion (LDM-4-G)3.60247.67High-Resolution Image Synthesis with Latent Diffusion Models
DPC-U3.54350.2Discrete Predictor-Corrector Diffusion Models for Image Synthesis-
SiD-EDM2-XS (125M)3.353-Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step
MAGVIT-v2 (w/o guidance)3.07213.1Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
DiT-XL/23.04240.82Scalable Diffusion Models with Transformers
GIVT-Causal-L+A2.92-GIVT: Generative Infinite-Vocabulary Transformers
EDM2-XS2.91-Analyzing and Improving the Training Dynamics of Diffusion Models
DiMR-XL/3R2.89-Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
DiT-XL/2 with SA-Solver2.80-SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
SiD-EDM2-S (280M)2.707-Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step
DiffiT2.67252.12DiffiT: Diffusion Vision Transformers for Image Generation
TiTok-L-642.49-An Image is Worth 32 Tokens for Reconstruction and Generation
StyleGAN-XL2.40-StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets
0 of 48 row(s) selected.
Image Generation On Imagenet 512X512 | SOTA | HyperAI超神经