DreamOmni2: Multimodal Instraction-Based Editing And Generation
Object Replace
Image 1
Image 2
Replace the lantern in the first image with the dog in the second image.
Result
Image 1
Image 2
Replace the man in the first image with the woman in the second image.
Result
Image 1
Image 2
Replace the suit in the first image with the clothes in the second image.
Result
Image 1
Image 2
Replace the person in the first image with the person in the second image.
Result
Lighting Render
Image 1
Image 2
Make the first image has the same light condition as the second image.
Result
Image 1
Image 2
Make the first image has the same light condition as the second image.
Result
Image 1
Image 2
Make the first image has the same light condition as the second image.
Result
Image 1
Image 2
Make the first image has the same light condition as the second image.
Result
Style Transfer
Image 1
Image 2
Replace the first image have the same image style as the second image.
Result
Image 1
Image 2
Replace the first image have the same image style as the second image.
Result
Image 1
Image 2
Replace the first image have the same image style as the second image.
Result
Image 1
Image 2
Replace the first image have the same image style as the second image.
Result
Pose Imitation
Image 1
Image 2
Make the person from the first image has the same pose as person from the second image.
Result
Image 1
Image 2
Make the person from the first image has the same pose as person from the second image.
Result
Image 1
Image 2
Make the person from the first image has the same pose as person from the second image.
Result
Image 1
Image 2
Make the person from the first image has the same pose as person from the second image.
Result
Face Expression
Image 1
Image 2
Make the person in the first image have the same expression as the person in the second image.
Result
Image 1
Image 2
Make the person in the first image have the same expression as the person in the second image.
Result
Image 1
Image 2
Make the person in the first image have the same expression as the person in the second image.
Result
Image 1
Image 2
Make the person in the first image have the same expression as the person in the second image.
Result
Hair Style
Image 1
Image 2
Make the person in the first image have the same hairstyle as the person in the second image.
Result
Image 1
Image 2
Make the person in the first image have the same hairstyle as the person in the second image.
Result
Font Imitation
Image 1
Image 2
Make the words in the first image have the same font as the words in the second image.
Result
Image 1
Image 2
Make the words in the first image have the same font as the words in the second image.
Result
Image 1
Image 2
Make the words in the first image have the same font as the words in the second image.
Result
Image 1
Image 2
Make the words in the first image have the same font as the words in the second image.
Result
Pattern Imitation
Image 1
Image 2
Make the bag in the first image have the same pattern as the machine in the second image.
Result
Image 1
Image 2
Make the car in the first image have the same pattern as the mouse in the second image.
Result
Image 1
Image 2
Make the tape in the first image have the same pattern as the bag in the second image.
Result
Image 1
Image 2
Make the bottle in the first image have the same pattern as the compass in the second image.
Result
Image 1
Image 2
Make the dress in the first image have the same pattern in the second image.
Result
Image 1
Image 2
Make the T-shirt in the first image have the same pattern in the second image.
Result
Background Replace
Image 1
Image 2
Make the bag in the first image have the same pattern as the machine in the second image.
Result
Image 1
Image 2
Make the car in the first image have the same pattern as the mouse in the second image.
Result
Image 1
Image 2
Make the dress in the first image have the same pattern in the second image.
Result
Image 1
Image 2
Make the T-shirt in the first image have the same pattern in the second image.
Result
In-context Generation
Image 1
Image 2
The character from the first image is holding the item from the second picture.
Result
Image 1
Image 2
The character from the second image is holding the item from the first image.
Result
Image 1
Image 2
The logo from the first image is printed on the object from the second image.
Result
Image 1
Image 2
The man from the first image is wearing the clothes from the second image and is sitting on a sofa.
Result
Three References Generation
Image 1
Image 2
Image 3
The parrot from Image 1 is wearing the hat from Image 2 and standing on the ground, with a forest in the background. The color tone of the image is the same as in Image 3.
Result
Image 1
Image 2
Image 3
The cat from Image 1 and the dog from Image 2 are sitting side by side, with the background inside a car. The style of the image is the same as in Image 3.
Result
Image 1
Image 2
Image 3
On a fighting stage, two people are engaged in combat. Their movements are shown in Figure 3.
Result
Image 1
Image 2
Image 3
Picture 1 is hung on the wall of a bedroom. The cup in Picture 2, made of the same material as the plate in Picture 3, is placed on the table.
Result
Four References Generation
Image 1
Image 3
Image 2
Image 4
The man from Image 1 stands next to the woman from Image 2. The woman is wearing the hat from Image 4, which has the logo from Image 3 on it. The background is by the lake.
Result
Image 1
Image 3
Image 2
Image 4
The woman from image 1 and the man from image 2 are standing in front of a mountain. The dog from image 3 is standing between them. The style of the image is the same as in image 4.
Result
More Examples
Video Introduction
Compare with the Alternatives
Inputs
Ours
Kontext
Qwen-Edit
GPT-4o
Nano-Banana
OmniGen2
Image 1
Image 2
Make the person from the first image has the same pose as person from the second image.
Image 1
Image 2
Make the person in the first image have the same hairstyle as the person in the second image.
Image 1
Image 2
Make the first image has the same light condition as the second image.
Image 1
Image 2
Make the words in the first image have the same font as the words in the second image.
Contact Us
Feel free to contact Bin Xia at zjbinxia@gmail.com for any question,cooperation, and communication.
If you find this work useful, please consider citing:
@article{Xia2025,
author = {Bin Xia, Bohao Peng, Yuechen Zhang, Junjia Huang, Jiyang Liu, Jingyao Li, Haoru Tan, Sitong Wu, Chengyao Wang, Yitong Wang, Xinglong Wu, Bei Yu, Jiaya Jia},
title = {DreamOmni2: Multimodal Instruction-Based Editing and Generation},
year = {2025},
}