Xudong Xu (original) (raw)

Xudong Xu 徐旭东 I am currently a researcher at Shanghai Artificial Intelligence Laboratory, working on Embodied AI. I recieved my Ph.D. degree from Multimedia Laboratory in the Chinese University of Hong Kong, advised by Prof. Dahua Lin. I recieved my B.E in Automation from Nanjing University in June 2018. My current research interests lie in Interactive Scene Generation and World Models in Embodied AI, especially focusing on compositional scene generation and interactive object generation. I am looking for highly motivated interns at Shanghai AI Laboratory. Shoot me an email (xuxudong@pjlab.org.cn) if you are interested. LinkedIn / Google Scholar / Github

CUHK

CUHK Aug. 2018 - Mar. 2023, Department of Information Engineering, the Chinese University of Hong Kong Ph.D. in Information Engineering
NJU Sept. 2014 - Jun. 2018 , School of Management and Engineering, Nanjing University Bachelor in Automation GPA: 92.4/100, Rank: 1 / 37
Meta June 2022 - Nov. 2022, Meta Reality Labs, Pittsburgh Research Scientist Intern. Pittsburgh, PA, USA
Selected Publications [full list]
Equal contribution +Corresponding author/Project lead
CaR Code-as-Room: Generating 3D Rooms from Top-Down View Images via Agentic Code Synthesis Yixuan Yang*, Zhen Luo*, Wanshui Gan*, ..., Zhaoyang Lyu, Xudong Xu+ Arxiv, 2026 [paper] [code] [project]
egosim EgoSim: Egocentric World Simulator for Embodied Interaction Generation Jinkun Hao*, Mingda Jia*, ..., Jiangmiao Pang, Xudong Xu+ Arxiv, 2026 [paper] [code] [project]
robovip RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation Boyang Wang*, Haoran Zhang*, Shujie Zhang*, ..., Xudong Xu+, Jiangmiao Pang Arxiv, 2026 [paper] [code] [project]
stable STABLE: Simulation-Ready Tabletop Layout Generation via a Semantics-Physics Dual System Zhen Luo*, Yixuan Yang*, Xudong Xu+, Jinkun Hao, Zhaoyang Lyu, Feng Zheng+, Jiangmiao Pang, Yanwei Fu ICML, 2026 [paper] [code] [project]
mesatask MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning Jinkun Hao*, Naifu Liang*, Zhen Luo*, Xudong Xu+, ..., Jiangmiao Pang NeurIPS, 2025 (Spotlight) [paper] [code] [project]
internscenes InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts Weipeng Zhong*, Peizhou Cao*, ..., Bo Dai, Xudong Xu+, Jiangmiao Pang+ NeurIPS, 2025 [paper] [code] [project]
anysplat AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views Lihan Jiang*, Yucheng Mao*, ..., Xudong Xu, ..., Dahua Lin, Bo Dai+ ACM Transactions on Graphics (Proc. SIGGRAPH Asia 2025) [paper] [code] [project]
infinite_mobility Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation Xinyu Lian, Zichao Yu, ..., Xudong Xu+, Zhaoyang Lyu+, Bo Dai, Jiangmiao Pang ArXiv, 2025 [paper] [code] [project]
BoostPBR Boosting 3D Object Generation through PBR Materials Yitong Wang, Xudong Xu+, Li Ma, Haoran Wang, Bo Dai SIGGRAPH Asia, 2024 [paper] [code] [project]
RoomTex RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting Qi Wang*, Ruijie Lu*, Xudong Xu+, ..., Bo Dai, Gang Zeng, Dan Xu ECCV, 2024 [paper] [code] [project]
matlaber MATLABER: Material-Aware Text-to-3D via LAtent BRDF auto-EncodeR Xudong Xu, Yitong Wang, Zhaoyang Lyu, Xingang Pan, Bo Dai ArXiv, 2023 [paper] [code] [project]
soundingbodies Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio Xudong Xu, Dejan Markovic, Jacob Sandakly, Todd Keebler, Steven Krenn, Alexander Richard NeurIPS, 2023 (Spotlight) [paper] [code] [dataset]