Autonomous navigation via dual-priority experience replay with adaptive hybrid weighting (original) (raw)
References
Quinlan S, Khatib O (1993) Elastic bands: Connecting path planning and control. In: [1993] Proceedings IEEE International Conference on Robotics and Automation, pp. 802–807. IEEE
Fox D, Burgard W, Thrun S (1997) The dynamic window approach to collision avoidance. IEEE Robotics & Automation Magazine 4(1):23–33 Article Google Scholar
Xiao X, Liu B, Warnell G, Stone P (2022) Motion planning and control for mobile robot navigation using machine learning: a survey. Auton Robot 46(5):569–597 Article Google Scholar
Brunke L, Greeff M, Hall AW, Yuan Z, Zhou S, Panerati J, Schoellig AP (2022) Safe learning in robotics: From learning-based control to safe reinforcement learning. Annual Review of Control Robotics and Autonomous Systems 5(1):411–444 Article Google Scholar
Ibarz J, Tan J, Finn C, Kalakrishnan M, Pastor P, Levine S (2021) How to train your robot with deep reinforcement learning: lessons we have learned. The International Journal of Robotics Research 40(4–5):698–721 Article Google Scholar
Han D, Mulyana B, Stankovic V, Cheng S (2023) A survey on deep reinforcement learning algorithms for robotic manipulation. Sensors 23(7):3762 Article Google Scholar
Kiran BR, Sobh I, Talpaert V, Mannion P, Al Sallab AA, Yogamani S, Pérez P (2021) Deep reinforcement learning for autonomous driving: A survey. IEEE Trans Intell Transp Syst 23(6):4909–4926 Article Google Scholar
Wu J, Huang Z, Lv C (2022) Uncertainty-aware model-based reinforcement learning: Methodology and application in autonomous driving. IEEE Transactions on Intelligent Vehicles 8(1):194–203 Article Google Scholar
Boute RN, Gijsbrechts J, Van Jaarsveld W, Vanvuchelen N (2022) Deep reinforcement learning for inventory control: A roadmap. Eur J Oper Res 298(2):401–412 ArticleMathSciNet Google Scholar
Gijsbrechts J, Boute RN, Van Mieghem JA, Zhang DJ (2022) Can deep reinforcement learning improve inventory management? performance on lost sales, dual-sourcing, and multi-echelon problems. Manufacturing & Service Operations Management 24(3):1349–1368 Article Google Scholar
Rupp F, Eberhardinger M, Eckert K (2023) Balancing of competitive two-player game levels with reinforcement learning. In: 2023 IEEE Conference on Games (CoG), 1–8. IEEE
Souchleris K, Sidiropoulos GK, Papakostas GA (2023) Reinforcement learning in game industry\(-\)review, prospects and challenges. Appl Sci 13(4):2443 Article Google Scholar
Watkins CJCH, et al (1989) Learning from delayed rewards
Williams RJ (1992) Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach Learn 8:229–256 Article Google Scholar
Zhang Z, Chen J, Chen Z, Li W (2019) Asynchronous episodic deep deterministic policy gradient: Toward continuous control in computationally complex environments. IEEE transactions on cybernetics 51(2):604–613 Article Google Scholar
Mnih V, Kavukcuoglu K, Silver D, Graves A, Antonoglou I, Wierstra D, Riedmiller M (2013) Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602
Lillicrap TP, Hunt JJ, Pritzel A, Heess N, Erez T, Tassa Y, Silver D, Wierstra D (2015) Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971
Shi Z, Xie X, Lu H, Yang H, Kadoch M, Cheriet M (2020) Deep-reinforcement-learning-based spectrum resource management for industrial internet of things. IEEE Internet Things J 8(5):3476–3489 Article Google Scholar
Zhang Y, Rao X, Liu C, Zhang X, Zhou Y (2023) A cooperative ev charging scheduling strategy based on double deep q-network and prioritized experience replay. Eng Appl Artif Intell 118:105642 Article Google Scholar
Pang G, Wang X, Wang L, Hao F, Lin Y, Wan P, Min G (2022) Efficient deep reinforcement learning-enabled recommendation. IEEE Transactions on Network Science and Engineering 10(2):871–886 Article Google Scholar
Yu L, Huo S, Wang Z, Li K (2023) Hybrid attention-oriented experience replay for deep reinforcement learning and its application to a multi-robot cooperative hunting problem. Neurocomputing 523:44–57 Article Google Scholar
Ruan X, Ren D, Zhu X, Huang J (2019) Mobile robot navigation based on deep reinforcement learning. In: 2019 Chinese Control and Decision Conference (CCDC), 6174–6178. IEEE
Chen G, Pan L, Chen Y, Xu P, Wang Z, Wu P, Ji J, Chen X (2021) Deep reinforcement learning of map-based obstacle avoidance for mobile robot navigation. SN Computer Science 2:1–14 Article Google Scholar
Zhu W, Hayashibe M (2022) A hierarchical deep reinforcement learning framework with high efficiency and generalization for fast and safe navigation. IEEE Trans Industr Electron 70(5):4962–4971 Article Google Scholar
Bo L, Zhang T, Zhang H, Hong J, Liu M, Zhang C, Liu B (2024) 3d uav path planning in unknown environment: A transfer reinforcement learning method based on low-rank adaption. Adv Eng Inform 62:102920 Article Google Scholar
Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G, et al. (2015) Human-level control through deep reinforcement learning. nature 518(7540), 529–533
Lin L-J (1992) Self-improving reactive agents based on reinforcement learning, planning and teaching. Mach Learn 8:293–321 Article Google Scholar
Schaul T, Quan J, Antonoglou I, Silver D (2015) Prioritized experience replay. arXiv preprint arXiv:1511.05952
Andrychowicz M, Wolski F, Ray A, Schneider J, Fong R, Welinder P, McGrew B, Tobin J, Pieter Abbeel O, Zaremba W (2017) Hindsight experience replay. Advances in neural information processing systems 30
Liu H, Trott A, Socher R, Xiong C (2019) Competitive experience replay. arXiv preprint arXiv:1902.00528
Kang C, Rong C, Ren W, Huo F, Liu P (2021) Deep deterministic policy gradient based on double network prioritized experience replay. IEEE Access 9:60296–60308 Article Google Scholar
Cui J, Yuan L, He L, Xiao W, Ran T, Zhang J (2023) Multi-input autonomous driving based on deep reinforcement learning with double bias experience replay. IEEE Sens J 23(11):11253–11261 Article Google Scholar
Liu X, Yu M, Yang C, Zhou L, Wang H, Zhou H (2024) Value distribution ddpg with dual-prioritized experience replay for coordinated control of coal-fired power generation systems. IEEE Transactions on Industrial Informatics
Horgan D, Quan J, Budden D, Barth-Maron G, Hessel M, Hasselt HV, Silver D (2018) Distributed prioritized experience replay. arXiv: 1803.00933
Fujimoto S, Hoof H, Meger D (2018) Addressing function approximation error in actor-critic methods. In: International Conference on Machine Learning, pp. 1587–1596. PMLR
Lee K, Kim S, Choi J (2023) Adaptive and explainable deployment of navigation skills via hierarchical deep reinforcement learning. In: 2023 IEEE International Conference on Robotics and Automation (ICRA), pp. 1673–1679