Xintao Wang (original) (raw)

News

GFPGANGitHub stars

Practical face restoration

Real-ESRGANGitHub stars

Practical algorithms for image restoration

BasicSRGitHub stars

Open source image and video restoration toolbox

T2I-AdapterGitHub stars

Dig out controllable ability for text-to-image diffusion models

VideoCrafterGitHub stars

Open sourced large models for video generation

HandyViewGitHub stars

Handy image viewer

Publications[Full List]

(* equal contribution, # corresponding author)

Selected Preprint

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Jianhong Bai, Menghan Xia, Xiao Fu, Xintao Wang, Lianrui Mu, Jinwen Cao, Zuozhu Liu, Haoji Hu, Xiang Bai, Pengfei Wan, Di Zhang

arXiv preprint: 2503.11647.
Project Page Paper (arXiv) Codes GitHub stars

teaser

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers

Minglei Shi, Ziyang Yuan, Haotian Yang, Xintao Wang#, Mingwu Zheng, Xin Tao, Wenliang Zhao, Wenzhao Zheng, Jie Zhou, Jiwen Lu#, Pengfei Wan, Di Zhang, Kun Gai

arXiv preprint: 2503.14487.
Project Page Paper (arXiv) Codes GitHub stars

teaser

Improving Video Generation with Human Feedback

Jie Liu, Gongye Liu, Jiajun Liang, Ziyang Yuan, Xiaokun Liu, Mingwu Zheng, Xiele Wu, Qiulin Wang, Wenyu Qin, Menghan Xia, Xintao Wang, Xiaohong Liu, Fei Yang, Pengfei Wan, Di Zhang, Kun Gai, Yujiu Yang, Wanli Ouyang

arXiv preprint: 2501.13918.
Project Page Paper (arXiv) Codes GitHub stars

teaser

2024

teaser

teaser

3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Xiao Fu, Xian Liu, Xintao Wang#, Sida Peng, Menghan Xia, Xiaoyu Shi, Ziyang Yuan, Pengfei Wan, Di Zhang, Dahua Lin#

arXiv preprint: 2412.07759
ICLR, 2025. Project Page Paper (arXiv) Codes GitHub stars

teaser

teaser

teaser

teaser

teaser

teaser

2023

teaser

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

Yuzhou Huang, Liangbin Xie, Xintao Wang#, Ziyang Yuan, Xiaodong Cun, Yixiao Ge, Jiantao Zhou, Chao Dong, Rui Huang, Ruimao Zhang#, Ying Shan

arXiv preprint: 2312.06739
CVPR, 2024 (hilight). Project Page Paper (arXiv) Codes GitHub stars

teaser

teaser

teaser

teaser

teaser

teaser

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Jinbo Xing, Menghan Xia, Yong Zhang, Haoxin Chen, Wangbo Yu, Hanyuan Liu, Gongye Liu, Xintao Wang, Ying Shan, Tien-Tsin Wong

arXiv preprint: 2310.12190
ECCV, 2024 (oral). Project Page Paper (arXiv) Codes GitHub stars

teaser

EvalCrafter: Benchmarking and Evaluating Large Video Generation Models

Yaofang Liu, Xiaodong Cun, Xuebo Liu, Xintao Wang, Yong Zhang, Haoxin Chen, Yang Liu, Tieyong Zeng, Raymond H. Chan, Ying Shan

arXiv preprint: 2310.11440
CVPR, 2024. Project Page Paper (arXiv) Codes GitHub stars

teaser

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

Yingqing He, Shaoshu Yang, Haoxin Chen, Xiaodong Cun, Menghan Xia, Yong Zhang, Xintao Wang, Ran He, Qifeng Chen, Ying Shan

arXiv preprint: 2310.07702.
ICLR, 2024 (spotlight) Project Page Paper (arXiv) Codes GitHub stars

teaser

Making LLaMA SEE and Draw with SEED Tokenizer

Yuying Ge, Sijie Zhao, Ziyun Zeng, Yixiao Ge, Chen Li, Xintao Wang, Ying Shan Haonan Qiu, Menghan Xia, Yong Zhang, Yingqing He, , Ying Shan, Ziwei Liu

arXiv preprint: 2310.01218.
ICLR, 2024 Project Page Paper (arXiv) Codes GitHub stars

teaser

teaser

teaser

teaser

teaser

2022

teaser

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Jay Zhangjie Wu, Yixiao Ge, Xintao Wang, Weixian Lei, Yuchao Gu, Yufei Shi, Wynne Hsu, Ying Shan, Xiaohu Qie, Mike Zheng Shou

arXiv preprint: 2212.11565
ICCV, 2023. Project Page Paper (arXiv) Codes GitHub stars

teaser

teaser

teaser

teaser

teaser

teaser

teaser

teaser

teaser

teaser

2021

teaser

2020 and before

To be updated