ARNOLD Challenge (original) (raw)
CVPR 2026 Embodied AI Workshop
Hosted on CVPR 2026 Embodied AI Workshop, the ARNOLD Challenge is a robotic manipulation challenge that benchmarks language-grounded task learning in realistic interactive 3D scenes, featuring an understanding of continuous object states and systematic evaluation along various aspects. The ARNOLD Challenge comprises eight language-conditioned tasks covering the manipulation of rigid objects, articulated objects, and fluids. It also provides generalization tests regarding novel objects, novel scenes, and novel goal states. For more details on the ARNOLD benchmark, see the paper (ICCV 2023) and project page.
- Built upon Omniverse Isaac Sim, photo-realistic rendering and physics-realistic simulation
- 20 diverse scenes and 40 diverse objects, meticulously optimized materials and collisions, etc.
- 8 language-grounded manipulation tasks, covering the skills of manipulating rigid objects, articulated objects, and fluids
- Diverse language-grounded task instructions encompassing continuous object states
- Large-scale demonstrations (10k)
- Systematic evaluation, including in-domain test and generalization test for novel objects/scenes/states
We host the challenge on EvalAI. To participate in the current ongoing challenge, follow these steps:
- Refer to the Github codebase and Challenge Setup Guide to setup a workspace, prepare data, and reproduce the results. If building Docker fails, you can directly pull the released one (see here).
- Log into EvalAI (register if necessary) and go to the ARNOLD challenge page. Read the details and select a phase.
- Launch training and evaluation in the workspace with your own model.
- Make your own submissions. You need to create a team in order to make submissions. Register your team or join an existing team by using their invite code. Please make sure your submission is in public status.
EvalAI. We host the evaluation servers on EvalAI, which is developed by the CloudCV team. EvalAI is an open-source web platform designed for organizing and participating in challenges to push the state of the art on AI tasks.
Data splits. We divide the evaluation data into three splits, including dev, test-challenge, and test-reserve. Dev is used for debugging and validation, allowing for a maximum of 10 submissions per day. It does not contain generalization tests, i.e., novel objects, novel scenes, and novel goal states. Test-challenge is the default test data for the ARNOLD challenge, serving as the standard for comparison and providing a public leaderboard that is updated upon submission. Test-reserve is used to protect against possible overfitting. If there are substantial differences between a method's scores on test-standard and test-reserve, it will raise a red flag and prompt further investigation. Results on test-reserve will not be publicly revealed.
Phases. We maintain two phases for the ARNOLD challenge accordingly: Dev and Test. The Dev phase uses data from the dev split, and the Test phase uses data from test-challenge and test-reserve splits. We encourage participants to first submit to the Dev phase for sanity check and development feedback. Note that the submission procedures are identical for the two phases. All Public submissions to the Test phase will be considered participating in the challenge. Private submissions or submissions to the Dev phase will not be considered towards challenge participation.
- 03/16/2026: Submission opens
- 05/31/2026: Submission ends
- 06/04/2026: Winners announced at CVPR 2026 Embodied AI Workshop
Translate and rotate rigid objects
Open and close articulated objects (drawer, prismatic joint)
Open and close articulated objects (cabinet, revolute joint)
@inproceedings{gong2023arnold, title={ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes}, author={Gong, Ran and Huang, Jiangyong and Zhao, Yizhou and Geng, Haoran and Gao, Xiaofeng and Wu, Qingyang and Ai, Wensi and Zhou, Ziheng and Terzopoulos, Demetri and Zhu, Song-Chun and others}, booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, year={2023}}