Jiuxiang Gu
My name is Jiuxiang Gu (顾久祥). I am a Senior Research Scientist at Adobe Research in Seattle. I received my Ph.D. from Nanyang Technological University, Singapore (2016.1–2019.5), under the supervision of Prof. Jianfei Cai, Dr. Gang Wang, and Prof. Tsuhan Chen.
I currently serve as an Area Chair for ICLR 2025 and WACV 2024/2025, a Senior Program Committee Member for IJCAI 2021–2024, and a Program Committee Member for AAAI 2021–2023, NAACL 2021, and others.
My research journey began in hardware design. From 2010 to 2015, I worked as an ASIC design engineer. In 2015, I made the transition to Artificial Intelligence.
My current research interests include:
- Multimodal Foundation Models (LLM, MLLM, Diffusion LLM/MLLM, Text-to-Image/Video/3D Generation, Document Intelligence)
- Efficient Architecture & Scaling (Pruning, Quantization, KV Cache Optimization, Edge Deployment)
- Reasoning & Alignment (Chain-of-Thought, Hidden Thinking, Self-supervised Learning, Post-training)
- Impact & Production: Contributions to Adobe Firefly and Acrobat AI Assistant
I am open to collaborations and internships in the above areas.
📧 Feel free to reach out: jigu@adobe.com / gu.jiuxiang@gmail.com
Selected Publications
2026
- CVPR 2026
Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models
Shufan Li, Jiuxiang Gu, Kangning Liu, and others
- ICLR 2026
LaViDa-O: Elastic Large Masked Diffusion Models for Unified Multimodal Understanding and Generation
Shufan Li, Jiuxiang Gu, Kangning Liu, and 4 more authors
2025
- AAAI 2025
Numerical Pruning for Efficient Autoregressive Models
Xuan Shen, Zhao Song, Yufa Zhou, and 12 more authors
2024
- ICLR 2024 Oral
LRM: Large Reconstruction Model for Single Image to 3D
Yicong Hong, Kai Zhang, Jiuxiang Gu, and 7 more authors
- ICLR 2024
ADoPD: A Large-Scale Document Page Decomposition Dataset
Jiuxiang Gu, Xiangxi Shi, Jason Kuen, and 5 more authors
2021
- NeurIPS 2021
UniDoc: Unified Pretraining Framework for Document Understanding
Jiuxiang Gu, Jason Kuen, Vlad I Morariu, and 5 more authors
2018
- AAAI 2018 Oral
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
- CVPR 2018 Spotlight
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models
- Pattern Recognition, 2018
Recent Advances in Convolutional Neural Networks
Jiuxiang Gu, Zhenhua Wang, Jason Kuen, and 8 more authors