Photographic Image Synthesis with Cascaded Refinement Networks

Qifeng Chen; Vladlen Koltun

Abstract

We present an approach to synthesizing photographic images conditioned on semantic layouts. Given a semantic label map, our approach produces an image with photographic appearance that conforms to the input layout. The approach thus functions as a rendering engine that takes a two-dimensional semantic specification of the scene and produces a corresponding photographic image. Unlike recent and contemporaneous work, our approach does not rely on adversarial training. We show that photographic images can be synthesized from semantic layouts by a single feedforward network with appropriate structure, trained end-to-end with a direct regression objective. The presented approach scales seamlessly to high resolutions; we demonstrate this by synthesizing photographic images at 2-megapixel resolution, the full resolution of our training data. Extensive perceptual experiments on datasets of outdoor and indoor scenes demonstrate that images synthesized by the presented approach are considerably more realistic than alternative approaches. The results are shown in the supplementary video at https://youtu.be/0fhUJT21-bs
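The coarse-to-fine idea suggested by the title and abstract can be illustrated with a small NumPy sketch. Everything here is an assumption made for illustration: the function names, the random per-pixel linear maps standing in for learned convolutions, the number of levels, and the L1 loss standing in for the paper's regression objective. The abstract does not specify the architecture or loss, so this is not the authors' implementation.

```python
import numpy as np

def one_hot(label_map, num_classes):
    """Encode an (H, W) integer label map as an (H, W, C) float array."""
    return np.eye(num_classes, dtype=np.float32)[label_map]

def downsample(x, factor):
    """Nearest-neighbour downsampling of an (H, W, C) array by an integer factor."""
    return x[::factor, ::factor]

def upsample2x(x):
    """Nearest-neighbour 2x upsampling of an (H, W, C) array."""
    return x.repeat(2, axis=0).repeat(2, axis=1)

def leaky_relu(x, slope=0.2):
    return np.where(x > 0, x, slope * x)

def synthesize(label_map, num_classes, num_levels=3, width=8, seed=0):
    """Hypothetical cascade: start at the coarsest resolution and double the
    resolution at every step, re-injecting the layout at each level.
    H and W must be divisible by 2 ** (num_levels - 1)."""
    rng = np.random.default_rng(seed)
    layout = one_hot(label_map, num_classes)

    # Coarsest level sees only the downsampled layout.
    coarse = downsample(layout, 2 ** (num_levels - 1))
    w = rng.normal(0.0, 0.1, (num_classes, width)).astype(np.float32)
    feats = leaky_relu(coarse @ w)

    # Each later level combines upsampled features with the layout at its own resolution.
    for level in range(num_levels - 2, -1, -1):
        layout_l = downsample(layout, 2 ** level)
        x = np.concatenate([upsample2x(feats), layout_l], axis=-1)
        w = rng.normal(0.0, 0.1, (width + num_classes, width)).astype(np.float32)
        feats = leaky_relu(x @ w)

    # Final per-pixel linear map to three colour channels.
    w_rgb = rng.normal(0.0, 0.1, (width, 3)).astype(np.float32)
    return feats @ w_rgb

def l1_regression_loss(output, reference):
    """Stand-in regression objective (assumption): mean absolute error."""
    return float(np.abs(output - reference).mean())

labels = np.random.default_rng(1).integers(0, 4, size=(8, 16))
image = synthesize(labels, num_classes=4)
reference = np.zeros_like(image)
loss = l1_regression_loss(image, reference)
```

The weights here are random and untrained, so the output is not a meaningful image. The sketch only shows how a layout can condition every resolution of a feedforward cascade and how a regression loss compares the output directly with a reference image, with no discriminator involved.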

Benchmarks

| Benchmark | Methodology | Metrics |
|---|---|---|
| image-to-image-translation-on-ade20k-labels | CRN | Accuracy: 68.8%; FID: 73.3; mIoU: 22.4 |
| image-to-image-translation-on-ade20k-outdoor | CRN | Accuracy: 68.6%; FID: 99.0; mIoU: 16.5 |
| image-to-image-translation-on-cityscapes | CRN | FID: 104.7; Per-pixel Accuracy: 77.1%; mIoU: 52.4 |
| image-to-image-translation-on-coco-stuff | CRN | Accuracy: 40.4%; FID: 70.4; mIoU: 23.7 |
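For readers unfamiliar with the accuracy and mIoU columns, the sketch below shows how per-pixel accuracy and mean intersection-over-union are computed from a predicted label map and a reference label map. The page does not state the evaluation protocol; a common setup, assumed here, is to segment the synthesized image with a pretrained network and compare the result against the input layout. The function names are illustrative.

```python
import numpy as np

def confusion_matrix(pred, target, num_classes):
    """Rows index the reference class, columns index the predicted class."""
    idx = target.ravel() * num_classes + pred.ravel()
    counts = np.bincount(idx, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)

def pixel_accuracy(cm):
    """Fraction of pixels whose predicted class matches the reference."""
    return np.trace(cm) / cm.sum()

def mean_iou(cm):
    """Average of per-class intersection-over-union, skipping absent classes."""
    tp = np.diag(cm)
    union = cm.sum(axis=0) + cm.sum(axis=1) - tp
    valid = union > 0
    return (tp[valid] / union[valid]).mean()

target = np.array([[0, 0], [1, 1]])
pred = np.array([[0, 1], [1, 1]])
cm = confusion_matrix(pred, target, num_classes=2)
acc = pixel_accuracy(cm)   # 3 of 4 pixels agree: 0.75
miou = mean_iou(cm)        # (1/2 + 2/3) / 2
```

FID is a different kind of metric: it compares feature statistics of generated and real images rather than label maps, and lower values are better.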
