HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Adding Conditional Control to Text-to-Image Diffusion Models

Zhang Lvmin ; Rao Anyi ; Agrawala Maneesh

Adding Conditional Control to Text-to-Image Diffusion Models

Abstract

We present ControlNet, a neural network architecture to add spatialconditioning controls to large, pretrained text-to-image diffusion models.ControlNet locks the production-ready large diffusion models, and reuses theirdeep and robust encoding layers pretrained with billions of images as a strongbackbone to learn a diverse set of conditional controls. The neuralarchitecture is connected with "zero convolutions" (zero-initializedconvolution layers) that progressively grow the parameters from zero and ensurethat no harmful noise could affect the finetuning. We test various conditioningcontrols, eg, edges, depth, segmentation, human pose, etc, with StableDiffusion, using single or multiple conditions, with or without prompts. Weshow that the training of ControlNets is robust with small (<50k) and large(>1m) datasets. Extensive results show that ControlNet may facilitate widerapplications to control image diffusion models.

Code Repositories

explainingai-code/ControlNet-PyTorch
pytorch
Mentioned in GitHub
xuanliu2000/vgcdm
pytorch
Mentioned in GitHub
DingDing33/BPS-v1-1
pytorch
Mentioned in GitHub
lllyasviel/controlnet
Official
pytorch
Mentioned in GitHub
fhshen2022/prunerepaint
pytorch
Mentioned in GitHub
google/style-aligned
pytorch
Mentioned in GitHub
quereste/implicit-deepfake
pytorch
Mentioned in GitHub
KU-CVLAB/Perturbed-Attention-Guidance
pytorch
Mentioned in GitHub
bytedance/x-dyna
pytorch
Mentioned in GitHub
microsoft/visual-chatgpt
Mentioned in GitHub
Francis-Rings/MotionEditor
pytorch
Mentioned in GitHub

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Adding Conditional Control to Text-to-Image Diffusion Models | Papers | HyperAI