OmniXtreme: Breaking the Generality Barrier in High-Dynamic Humanoid Control (original) (raw)

Behavior Montage

Behavior Montage

Unified policy control across diverse extreme behaviors.

B-boying with a rapid-fire string of back handsprings.

Skip back, pivot through a hand-to-headstand, drop, flip, and bounce back.

Long breaking dance.

Another long breaking dance.

Five consecutive webster flip.

Alternating pistol squats.

Back handspring, cartwheels, aerial cartwheel, bicycle kick flips.

Spinkicks, butterfly kicks.

Forward roll, backward roll, crawl forward.

Thomas flare, backspin, crawl forward and backflip.

More

Supplementary demos generated by a unified policy.

Attack combos.

Handstand walk.

Another attack combos.

Knocked out.

Push-ups.

Roll-flip combos.

Method

Method figure

Pre-training

A unified base policy is trained via DAgger-based Flow Matching to aggregate diverse motion priors from different motion tracking experts.

Post-training

The base policy is frozen while a residual policy is optimized under stringent motor constraints, extensive domain randomization, and power-safety regularization to bridge the sim-to-real gap.

Deployment

The whole inference pipeline is real- time and executed entirely onboard, facilitating robust and agile control in physical environments.

Ablation Study

UA ablation figure

Power-Safety Regularization explicitly penalizes excessive negative joint power to prevent unsafe energy absorption and transient braking loads during high-dynamic motions.

Citation

Use this arXiv BibTeX entry for reference.

@article{wang2026omnixtreme, title={OmniXtreme: Breaking the Generality Barrier in High-Dynamic Humanoid Control}, author={Yunshen Wang and Shaohang Zhu and Peiyuan Zhi and Yuhan Li and Jiaxin Li and Yong-Lu Li and Yuchen Xiao and Xingxing Wang and Baoxiong Jia and Siyuan Huang}, journal={arXiv preprint arXiv:2602.23843}, year={2026}, url={https://arxiv.org/abs/2602.23843} }