Jiangyong Huang (original) (raw)
- I am currently a Ph.D. student at Peking University (PKU), advised by Prof. Song-Chun Zhu. I major in artificial intelligence (AI) with a primary focus on computer vision (CV). Before Ph.D., I graduated from PKU and obtained my Bachelor’s degree in 2022.
- My research interests include multi-modal model, 3D scene understanding, and embodied AI. My research goal is to develop embodied generalist agents capable of (1) understanding the 3D world, and (2) following human instructions to interact with the 3D world.
News
Publications

LARA: Latent Action Representation Alignment for Vision-Language-Action Models
ICML 2026
3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding
Xiongkun Linghu*, Jiangyong Huang*, Baoxiong Jia, and Siyuan Huang
ICML 2026
Lifting Unlabeled Internet-level Data for 3D Scene Understanding
Yixin Chen, Yaowei Zhang, Huangyue Yu, Junchao He, Yan Wang, Jiangyong Huang, Hongyu Shen, Junfeng Ni, Shaofei Wang, Baoxiong Jia, Song-Chun Zhu, and Siyuan Huang
CVPR 2026
SceneCOT: Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes
Xiongkun Linghu, Jiangyong Huang, Ziyu Zhu, Baoxiong Jia, and Siyuan Huang
ICLR 2026
LEO-VL: Efficient Scene Representation for Scalable 3D Vision-Language Learning
Jiangyong Huang, Xiaojian Ma, Xiongkun Linghu, Junchao He, Qing Li, Song-Chun Zhu, Yixin Chen, Baoxiong Jia, and Siyuan Huang
Preprint
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
Jiangyong Huang*, Baoxiong Jia*, Yan Wang, Ziyu Zhu, Xiongkun Linghu, Qing Li, Song-Chun Zhu, and Siyuan Huang
CVPR 2025
Multi-modal Situated Reasoning in 3D Scenes
Xiongkun Linghu*, Jiangyong Huang*, Xuesong Niu*, Xiaojian Ma, Baoxiong Jia, and Siyuan Huang
NeurIPS 2024
An Embodied Generalist Agent in 3D World
Jiangyong Huang*, Silong Yong*, Xiaojian Ma*, Xiongkun Linghu*, Puhao Li, Yan Wang, Qing Li, Song-Chun Zhu, Baoxiong Jia, and Siyuan Huang
ICML 2024
ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes
Ran Gong*, Jiangyong Huang*, Yizhou Zhao, Haoran Geng, Xiaofeng Gao, Qingyang Wu, Wensi Ai, Ziheng Zhou, Demetri Terzopoulos, Song-Chun Zhu, Baoxiong Jia, and Siyuan Huang
ICCV 2023
Perceive, Ground, Reason, and Act: A Benchmark for General-purpose Visual Representation
Jiangyong Huang*, William Yicheng Zhu*, Baoxiong Jia, Zan Wang, Xiaojian Ma, Qing Li, and Siyuan Huang
Preprint
Experiences
| 2022 - Now | Reviewer for NeurIPS, CVPR, ICLR, ECCV, ICML, AAAI, RA-L |
|---|---|
| 2021 - Now | Research intern at BIGAI |
| Fall, 2022 & 2023 | TA for Statistical Vision at PKU |
| Summer, 2022 | TA for Directed Research in AI System at PKU |
| Spring, 2021 | Intern at AI Innovation Center, PKU |
| 2019 - 2020 | PKU annual scholarship and award |
| 2018 - 2019 | PKU annual scholarship and award |