A deep Q-learning network based active object detection model with a novel training algorithm for service robots (original) (raw)

Abstract

This paper focuses on the problem of active object detection (AOD). AOD is important for service robots to complete tasks in the family environment, and leads robots to approach the target object by taking appropriate moving actions. Most of the current AOD methods are based on reinforcement learning with low training efficiency and testing accuracy. Therefore, an AOD model based on a deep Q-learning network (DQN) with a novel training algorithm is proposed in this paper. The DQN model is designed to fit the Q-values of various actions, and includes state space, feature extraction, and a multilayer perceptron. In contrast to existing research, a novel training algorithm based on memory is designed for the proposed DQN model to improve training efficiency and testing accuracy. In addition, a method of generating the end state is presented to judge when to stop the AOD task during the training process. Sufficient comparison experiments and ablation studies are performed based on an AOD dataset, proving that the presented method has better performance than the comparable methods and that the proposed training algorithm is more effective than the raw training algorithm.

摘要

本文研究了主动物品检测(AOD)问题。AOD是服务机器人在家庭环境中完成服务任务的重要组成部分,通过适当的移动动作引导机器人接近目标物品。目前基于强化学习的AOD模型存在训练效率低和测试精度差的问题。因此,本文提出一种基于深度Q学习网络的AOD模型,并设计了一种新的模型训练算法。该模型旨在拟合各种动作Q值,包括状态空间、特征提取和多层感知机。与现有研究不同,本文针对所提AOD模型设计了一种基于记忆的训练算法,以提高模型训练效率和测试精度。此外,提出一种最终状态生成方法判断训练过程中AOD任务何时停止。本文所提方法在AOD数据集上进行了充分的对比实验和消融实验。实验结果表明所提方法优于其他同类方法,所设计的训练算法比原始训练算法更高效。

Similar content being viewed by others

References

Download references

Author information

Authors and Affiliations

  1. School of Control Science and Engineering, Shandong University, Jinan, 250061, China
    Shaopeng Liu (刘少鹏), Guohui Tian (田国会), Yongcheng Cui (崔永成) & Xuyang Shao (邵旭阳)

Authors

  1. Shaopeng Liu (刘少鹏)
  2. Guohui Tian (田国会)
  3. Yongcheng Cui (崔永成)
  4. Xuyang Shao (邵旭阳)

Contributions

Shaopeng LIU and Guohui TIAN designed the research. Shaopeng LIU addressed the problems, processed the data, and drafted the paper. Guohui TIAN, Yongcheng CUI, and Xuyang SHAO helped organize the paper. Shaopeng LIU and Guohui TIAN revised and finalized the paper.

Corresponding author

Correspondence toGuohui Tian (田国会).

Additional information

Compliance with ethics guidelines

Shaopeng LIU, Guohui TIAN, Yongcheng CUI, and Xuyang SHAO declare that they have no conflict of interest.

Project supported by the National Natural Science Foundation of China (Nos. U1813215 and 62273203)

Rights and permissions

About this article

Cite this article

Liu, S., Tian, G., Cui, Y. et al. A deep Q-learning network based active object detection model with a novel training algorithm for service robots.Front Inform Technol Electron Eng 23, 1673–1683 (2022). https://doi.org/10.1631/FITEE.2200109

Download citation

Key words

关键词

CLC number