Jim Fan

Hello there!

I am a Senior Research Scientist at NVIDIA and Lead of the AI Agents Initiative. My mission is to build generally capable agents across physical worlds (robotics) and virtual worlds (games, simulation). I share insights about AI research and industry extensively on Twitter/X and LinkedIn. Feel free to follow me!

My research explores the bleeding edge of multimodal foundation models, reinforcement learning, computer vision, and large-scale systems. I obtained my Ph.D. at the Stanford Vision Lab, advised by Prof. Fei-Fei Li. Previously, I interned at OpenAI (w/ Ilya Sutskever and Andrej Karpathy), Baidu AI Labs (w/ Andrew Ng and Dario Amodei), and MILA (w/ Yoshua Bengio). I graduated as the Valedictorian of the Class of 2016 and received the Illig Medal at Columbia University.

I spearheaded Voyager (the first AI agent that plays Minecraft proficiently and bootstraps its capabilities continuously), MineDojo (open-ended agent learning by watching hundreds of thousands of Minecraft YouTube videos), Eureka (a 5-finger robot hand doing extremely dexterous tasks like pen spinning), and VIMA (one of the earliest multimodal foundation models for robot manipulation). MineDojo won the Outstanding Paper Award at NeurIPS 2022. My work has been widely featured in news media, including the New York Times, Forbes, MIT Technology Review, TechCrunch, WIRED, and VentureBeat.

Fun fact: I was OpenAI’s very first intern in 2016. During that summer, I worked on World of Bits, an agent that perceives the web browser in pixels and outputs keyboard/mouse control. It was way before LLMs became a thing at OpenAI. Good old times!

Research Highlights

Eureka: GPT-4 writes reward functions to teach a 5-finger robot hand how to do extremely dexterous tasks like pen spinning.

Voyager: LLM-powered agent that masters Minecraft by in-context lifelong learning.

VIMA: Multimodal LLM for robot manipulation; unifies diverse robotics tasks in a single prompting framework.

MineDojo: NeurIPS Outstanding Paper Award✨. Large-scale open-ended agent learning framework in Minecraft.

Media

Coverage

Publications

Visit my Google Scholar page for a comprehensive listing!

Experience

NVIDIA

Research Scientist

Dec 2021 – Present California

NVIDIA

Research Intern

Jun 2020 – Sep 2020 California

Google Cloud AI

Research Intern

Jun 2018 – Sep 2018 California

Stanford Vision Lab

Ph.D. in Computer Science

Sep 2016 – Sep 2021 California

OpenAI

Research Intern

Jun 2016 – Mar 2017 California

Mila-Quebec AI Institute

Research Assistant

Sep 2015 – Mar 2016 Montréal, Quebec, Canada

Columbia University

Research Assistant

Sep 2013 – Dec 2014 New York City