| VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | 6.59 | Taming Transformers for High-Resolution Image Synthesis | |
| VQGAN+Transformer (k=600, p=1.0, a=0.05) | 5.2 | Taming Transformers for High-Resolution Image Synthesis | |
| ADM-G + EDS + ECT (ED-DPM, classifier_scale=1.0) | 4.09 | Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation | |
| ADM-G + EDS (ED-DPM, classifier_scale=0.75) | 3.96 | Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation | |