For the best experience, it's better to use a desktop computer to view this website.

DreamOmni2: Multimodal Instraction-Based Editing And Generation

Bin Xia1, Bohao PENG1, Yuechen Zhang1, Junjia Huang3, Jiyang Liu3, Jingyao Li1, Haoru Tan, Sitong Wu1, Chengyao Wang1, Yitong Wang3, Xinglong Wu3, Bei Yu1, Jiaya Jia1,2

1 The Chinese University of Hong Kong, 2 The Hong Kong University of Science and Technology, 3 Bytedance

Contact Us

Feel free to contact Bin Xia at zjbinxia@gmail.com for any question,cooperation, and communication.

If you find this work useful, please consider citing:

@article{Xia2025,
    author = {Bin Xia, Bohao Peng, Yuechen Zhang, Junjia Huang, Jiyang Liu, Jingyao Li, Haoru Tan, Sitong Wu, Chengyao Wang, Yitong Wang, Xinglong Wu, Bei Yu, Jiaya Jia},
    title = {DreamOmni2: Multimodal Instruction-Based Editing and Generation},
    year = {2025},
}