ControlNeXt: Powerful and Efficient Control for Image and Video Generation
Overview
TL;DR: This work proposes a light-weight controllable module for various base models (SD1.5, SDXL, SD3, SVD) and tasks (image / video generation with various conditions).
⚠️ For the webpage transfer, we compress the images and videos to reduce file size. Please refer to the original files for full quality.
Hover on the picture to see the overlayed condition, and click the picture to see the full view!
SVD + Pose
SDXL + Canny
SD1.5 + Canny
SD3 + SR
Video Generation via Stable Video Diffusion
User Input: First frame image & pose guidance sequence
If you can't load the videos beacause of the network problem, you can also view them through BiliBili
SVD + Pose
SVD + Pose
SVD + Pose
SVD + Pose
SVD + Pose
SVD + Pose
Image Generation with SDXL
We trained a canny adapter on SDXL model.
SDXL + Canny
SDXL + Canny
SDXL + Canny
SDXL + Canny
SDXL + Canny
SDXL + Canny
SDXL + Canny
SDXL + Canny
SDXL + Canny
SDXL + Canny
Various Stylization & Editing
Condition
SDXL + Canny
SDXL + Canny
SDXL + Canny
SDXL + Canny
Condition
SDXL + Canny
SDXL + Canny
Condition
SDXL + Canny
SDXL + Canny
SDXL + Canny
SDXL + Canny
Source Image
SDXL + Canny
SDXL + Canny
SDXL + Canny
Condition
SDXL + Canny
SDXL + Canny
SDXL + Canny
Image Generation with 1.5
Our method is also adaptable for community's LoRA weights
Pose Condition
SD1.5 + Warrior
SD1.5 + Genshin
SD1.5 + Chinese Painting
SD1.5 + Animation
We trained a multiple adapters on SD1.5 model.
SD1.5 + Pose + LoRA
SD1.5 + Pose
SD1.5 + Canny
SD1.5 + Mask
SD1.5 + Depth
Image Super-Resolution with SD3
We trained a Super-Resolution ControlNeXt on SD3 with degraded inputs.
SD3 + LR
SD3 + SR
SD3 + LR
SD3 + SR
SD3 + LR
SD3 + SR
SD3 + LR
SD3 + SR
Training convergence
Our method achieves significantly faster convergence during training.
It starts to learn the control abilities within hundreds of training steps.
Training of SD1.5
Contact Us
Feel free to contact Bohao Peng at bhpeng22@cse.cuhk.edu.hk for any question,cooperation, and communication.
If you find this work useful, please consider citing:
@article{peng2024controlnext, title={ControlNeXt: Powerful and Efficient Control for Image and Video Generation}, author={Peng, Bohao and Wang, Jian and Zhang, Yuechen and Li, Wenbo and Yang, Ming-Chang and Jia, Jiaya}, journal={arXiv preprint arXiv:2408.06070}, year={2024} }