RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation (original) (raw)

Yufei Wang*,1,Zhou Xian*,1,Feng Chen*, 2,Tsun-Hsuan Wang3,Yian Wang4,
Katerina Fragkiadaki1,Zackory Erickson1,David Held1,Chuang Gan4,5

1CMU, 2Tsinghua IIIS, 3MIT CSAIL, 4UMass Amherst, 5MIT-IBM AI Lab

*Equal Contribution

Abstract

We present RoboGen, a generative robotic agent that automatically learns diverse robotic skills at scale via generative simulation. RoboGen leverages the latest advancements in foundation and generative models. Instead of directly using or adapting these models to produce policies or low-level actions, we advocate for a generative scheme, which uses these models to automatically generate diversified tasks, scenes, and training supervisions, thereby scaling up robotic skill learning with minimal human supervision. Our approach equips a robotic agent with a self-guided propose-generate-learn cycle: the agent first proposes interesting tasks and skills to develop, and then generates corresponding simulation environments by populating pertinent objects and assets with proper spatial configurations. Afterwards, the agent decomposes the proposed high-level task into sub-tasks, selects the optimal learning approach (reinforcement learning, motion planning, or trajectory optimization), generates required training supervision, and then learns policies to acquire the proposed skill. Our work attempts to extract the extensive and versatile knowledge embedded in large-scale models and transfer them to the field of robotics. Our fully generative pipeline can be queried repeatedly, producing an endless stream of skill demonstrations associated with diverse tasks and environments.

RoboGen is simulated and rendered with Genesis, a multi-material multi-solver generative simulation engine for general-purpose robot learning.

RoboGen Pipeline

RoboGen is a fully automated pipeline for endless and diverse skill acquisition. RoboGen pipeline consists of 4 stages: A) task proposal, B) scene generation, C) training supervision generation, and D) skill learning with generated information.

Long-horizon task generation and learning


RoboGen is able to propose high-level tasks, generate corresponding environments, decompose the high-level goal into low-level subtasks, and then learn the sub-skills sequentially.

RoboGen generated tasks, scenes, training supervisions, and learned skills

Please select an image below to view the results.

Select an image above to show the RoboGen results:

RoboGen response shown within code block.

Change lamp light direction

Pull lever to start coffee brewing

Store the toy into the storage

Stand upright and walk forward using only hind legs

Bend noodle into a U-shape

Retrieve the gold bar from safe

Close the drawer of the table

Heat up a bowl of soup in microwave

Kick the soccer ball to the left

Spin clockwise without right hind foot touching the ground

Retrieve the toy car from box

Store the apple in refrigerator

Set the clock back by 5 minutes

Lie face down and cawl forward

Load dish into dishwasher

Stay balanced and move forward

Arrange three different cans in a row

Unload the milk from cart

Open the door of trash can

Take out a bottle of water from fridge

Press and rotate dispenser lid

Rotate globe horizontally

Store a toy inside the box

Put an toy into the storage

Direct and press dispenser

Kick the soccer ball to the left

Open washing machine door

Open fridges freezer door

Jump higher than 5 meters

Spin left without using right hind leg

Run backward fast using only front and left hind legs

Pull out the top drawer of the cabinet

Press the button to turn on the printer

Wrap the dumpling wrapper

Start dishwasher by pressing the start button

Opening both refrigerator doors

Shape the dough into a baguette

Slide down display screen

Put a fruit into a fruit bowl

Jump and kick the basketball

Turning on coffee machine using the knob

Turn rightward continuously

Kick the soccer ball to the right

Stand upright on two hind legs

Do a backflip and kick the soccer ball to the right

Put filling onto the wrapper

Turn on lamp by pressing the toggle button

Move clock ahead for daylight saving

BibTeX

@misc{wang2023robogen,
      title={RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation}, 
      author={Yufei Wang and Zhou Xian and Feng Chen and Tsun-Hsuan Wang and Yian Wang and Katerina Fragkiadaki and Zackory Erickson and David Held and Chuang Gan},
      year={2023},
      eprint={2311.01455},
      archivePrefix={arXiv},
      primaryClass={cs.RO}
}