Bidirectional chain-of-thought for zero-shot object navigation (original) (raw)

Access this article

Log in via an institution

Subscribe and save

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

References

  1. Gireesh N, Kiran D A S, Banerjee S, Sridharan M, Bhowmick B, Krishna M. Object goal navigation using data regularized q-learning. In: Proceedings of the 18th IEEE International Conference on Automation Science and Engineering (CASE). 2022, 1092–1097
    Google Scholar
  2. Gadre S Y, Wortsman M, Ilharco G, Schmidt L, Song S. Cows on pasture: Baselines and benchmarks for language-driven zero-shot object navigation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023, 23171–23181
    Google Scholar
  3. Majumdar A, Aggarwal G, Devnani B, Hoffman J, Batra D. ZSON: Zero-shot object-goal navigation using multimodal goal embeddings. In: Proceedings of the 36th International Conference on Neural Information Processing Systems. 2022, 2343
    Google Scholar
  4. Zhou K, Zheng K, Pryor C, Shen Y, Jin H, Getoor L, Wang X E. ESC: Exploration with soft commonsense constraints for zero-shot object navigation. In: Proceedings of the 40th International Conference on Machine Learning. 2023, 1806
    Google Scholar
  5. Chang A, Dai A, Funkhouser T, Halber M, Niebner M, Savva M, Song S, Zeng A, Zhang Y. Matterport3D: learning from RGB-D data in indoor environments. In: Proceedings of 2017 International Conference on 3D Vision (3DV). 2017, 667–676
    Chapter Google Scholar
  6. Ramakrishnan S K, Gokaslan A, Wijmans E, Maksymets O, Clegg A, Turner J, Undersander E, Galuba W, Westbury A, Chang A X, Savva M, Zhao Y, Batra D. Habitat-matterport 3D dataset (HM3D): 1000 large-scale 3D environments for embodied AI. In: Proceedings of the 35th Conference on Neural Information Processing Systems Track on Datasets and Benchmarks. 2021
    Google Scholar
  7. Ramakrishnan S K, Chaplot D S, Al-Halah Z, Malik J, Grauman K. PONI: Potential functions for ObjectGoal navigation with interaction-free learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022, 18868–18878
    Google Scholar
  8. Ramrakhya R, Batra D, Wijmans E, Das A. PIRLNav: Pretraining with imitation and RL finetuning for OBJECTNAV. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023, 17896–17906
    Google Scholar

Download references

Acknowledgments

The authors express their gratitude to the National Natural Science Foundation of China (Grant No. 62306247), the China Postdoctoral Science Foundation (2022M722630), the Sichuan Science and Technology Program (2024NSFSC1474, 2024ZH CG0166).

Author information

Authors and Affiliations

  1. School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, 611756, China
    Haonan Luo, Sijia Li, Yijie Zeng & Xiruo Jiang
  2. School of Computer Science and Engineering, Beihang University, Beijing, 100191, China
    Zihang Wang & Botao Jiang

Authors

  1. Haonan Luo
  2. Sijia Li
  3. Yijie Zeng
  4. Zihang Wang
  5. Botao Jiang
  6. Xiruo Jiang

Corresponding author

Correspondence toXiruo Jiang.

Ethics declarations

The authors declare that they have no competing interests or financial conflicts to disclose.

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Luo, H., Li, S., Zeng, Y. et al. Bidirectional chain-of-thought for zero-shot object navigation.Front. Comput. Sci. 20, 2001317 (2026). https://doi.org/10.1007/s11704-025-41283-7

Download citation