Jialu Li (original) (raw)

Jialu Li Hi, thanks for stopping by. I'm an Applied Scientist at Adobe, working on text-to-image and text-to-video foundation model training. I received my Ph.D. from The University of North Carolina at Chapel Hill, advised by Prof. Mohit Bansal. Before joining UNC-CH, I got my Master degree from Cornell University, where I was advised by Prof. Claire Cardie. I did my Bachelor degree at Shanghai JiaoTong University. Email / CV / Google Scholar / Twitter / Github

Research

I have a broad interest in Multimodal research, with a focus on text-to-image generation, Vision-and-Language Navigation, and multi-modal LLM.

News

We have a paper accepted to ICML 2026.
We have a paper accepted to AAAI 2026.
I will join Adobe as an Applied Scientist starting from Summer 2025.
We have two papers accepted to ICLR 2025.
We have a paper accepted to NeurIPS 2024.
I will intern at Google as Student Researcher for Summer 2024.
We have a paper accepted to AAAI 2024.
We have a paper accepted to NeurIPS 2023.
We have a paper accepted to ICCV 2023 and selected as Oral presentation.
I will intern at Apple as Machine Learning Research Intern for Summer 2023.
We have a paper accepted to CVPR 2023.
We have a paper accepted to Findings of NAACL 2022.
We have a paper accepted to CVPR 2022.
I will intern at Amazon as Applied Scientist for Summer 2022.
We have a paper accepted to EMNLP 2021.
We have a paper accepted to NAACL 2021.
We have a paper accepted to EMNLP 2020.
I will join UNC-CH as a new Ph.D. student in Fall 2020.

	Training-free guidance in text-to-video generation via multimodal planning and structured noise initialization Jialu Li, Shoubin Yu, Han Lin, Jaemin Cho, Jaehong Yoon, Mohit Bansal. Preprint* paper /code /bib /website
	Unbounded: A Generative Infinite Game of Character Life Simulation Jialu Li, Yuanzhen Li, Neal Wadhwa, Yael Pritch, David E. Jacobs, Michael Rubinstein, Mohit Bansal, Nataniel Ruiz. ICLR, 2025 paper /bib /website
	DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation Zun Wang, Jialu Li, Han Lin, Jaehong Yoon, Mohit Bansal. AAAI, 2026 paper /code /bib /website
	Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel Zun Wang, Jialu Li, Yicong Hong, Songze Li, Kunchang Li, Shoubin Yu, Yi Wang, Yu Qiao,Yali Wang, Mohit Bansal, Limin Wang. ICLR, 2025 paper /code /bib
	Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models Yue Zhang, Ziqiao Ma, Jialu Li, Yanyuan Qiao, Zun Wang, Joyce Chai, Qi Wu, Mohit Bansal, Parisa Kordjamshidi TMLR* paper
	SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data Jialu Li, Jaemin Cho, Yi-Lin Sung, Jaehong Yoon, Mohit Bansal. NeurIPS, 2024 paper /code /bib /website
	VLN-VIDEO: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme, Mohit Bansal. AAAI, 2024 paper
	Multimodal large language model for visual navigation Yao-Hung Hubert Tsai, Vansh Dhar, Hugues Thomas, Jialu Li, Bowen Zhang, Jian Zhang Preprint paper
	PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation Jialu Li, Mohit Bansal. NeurIPS, 2023 paper /code /bib /website
	Scaling Data Generation in Vision-and-Language Navigation Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal,Stephen Gould, Hao Tan, Yu Qiao. ICCV, 2023, Oral Presentation* paper /code /bib
	Improving Vision-and-Language Navigation by Generating Future-View Image Semantics Jialu Li, Mohit Bansal. CVPR, 2023 paper /code /bib /website
	CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment Agnostic Representations Jialu Li, Hao Tan, Mohit Bansal. Findings of NAACL, 2022 paper /code /bib
	EnvEdit: Environment Editing for Vision-and-Language Navigation Jialu Li, Hao Tan, Mohit Bansal. CVPR, 2022 paper /code /bib
	NDH-Full: Learning and Evaluating Navigational Agents on Full-Length Dialogue Hyounghun Kim, Jialu Li, Mohit Bansal. EMNLP, 2021 paper /code /bib
	Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic Information Jialu Li, Hao Tan, Mohit Bansal. NAACL, 2021 (short papers) paper /code /bib
	Exploring the Role of Argument Structure in Online Debate Persuasion Jialu Li, Esin Durmus, Claire Cardie. EMNLP, 2020 (short papers) paper /code /bib

Teaching

Introduction to Natural Language Processing, Cornell University. Fall 2019.

Professional Service

Reviewer for ARR, ACL, EMNLP, NAACL, EACL.
Reviewer for ACM MM, AAAI, CVPR, ICCV, ECCV, ICLR, NeurIPS, WACV, AISTATS.