Command Palette
Search for a command to run...
FineGAN: Unsupervised Hierarchical Disentanglement for Fine-Grained Object Generation and Discovery
Krishna Kumar Singh; Utkarsh Ojha; Yong Jae Lee

Abstract
We propose FineGAN, a novel unsupervised GAN framework, which disentangles the background, object shape, and object appearance to hierarchically generate images of fine-grained object categories. To disentangle the factors without supervision, our key idea is to use information theory to associate each factor to a latent code, and to condition the relationships between the codes in a specific way to induce the desired hierarchy. Through extensive experiments, we show that FineGAN achieves the desired disentanglement to generate realistic and diverse images belonging to fine-grained classes of birds, dogs, and cars. Using FineGAN's automatically learned features, we also cluster real images as a first attempt at solving the novel problem of unsupervised fine-grained object category discovery. Our code/models/demo can be found at https://github.com/kkanshul/finegan
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-clustering-on-cub-birds | FineGAN | Accuracy: 0.126 NMI: 0.403 |
| image-clustering-on-stanford-cars | FineGAN | Accuracy: 0.078 NMI: 0.354 |
| image-clustering-on-stanford-dogs | FineGAN | Accuracy: 0.079 NMI: 0.233 |
| image-generation-on-cub-128-x-128 | FineGAN | FID: 11.25 Inception score: 52.53 |
| image-generation-on-stanford-cars | FineGAN | FID: 16.03 Inception score: 32.62 |
| image-generation-on-stanford-dogs | FineGAN | FID: 25.66 Inception score: 46.92 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.