A real-time recognition method of static gesture based on DSSD (original) (raw)
Abstract
Gesture recognition is of great significance for human-machine interaction and it has broad application prospects. In order to improve the detection accuracy and speed, a real-time recognition method of static gesture based on Deconvolutional Single Shot Detector (DSSD) is proposed in this paper. We have improved the original DSSD network and the deconvolution module, used the K-means clustering algorithm to select the aspect ratios of the prior boxes to improve the detection accuracy. The detection accuracy of small data set is improved by introducing transfer learning method, and the influences of three different base networks on the DSSD network model are discussed. In order to verify the effectiveness of the proposed method, we compared it with the gesture recognition methods based on SSD300, SSD321, YOLOV2 and DES in ASL dataset. The experimental results show that the proposed method has a recognition rate of 94.8%, which is 2.7%, 2.1% and 2.8%higher than SSD300, SSD321 and YOLOv2, respectively. The detection rate is close to the method of Single-Shot Object Detection with Enriched Semantics (DES), while still maintaining a reasonable detection speed of 27 FPS. In addition, since DSSD fuse the semantic information of each feature extraction layer, the proposed method also has good detection ability for small gesture objects.
Access this article
Subscribe and save
- Starting from 10 chapters or articles per month
- Access and download chapters and articles from more than 300k books and 2,500 journals
- Cancel anytime View plans
Buy Now
Price excludes VAT (USA)
Tax calculation will be finalised during checkout.
Instant access to the full article PDF.
Similar content being viewed by others
References
- Dai J, Li Y, He K (2016). R-FCN: object detection via region-based fully ConvolutionalNetworks. 2017 IEEE conference on computer vision and pattern recognition (CVPR)
- Fu CY, Liu W, Ranga A, Tyagi A, Berg AC (2017) DSSD : deconvolutional single shot detector. arXiv preprint arXiv:1701.06659
- He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. Proceedings of the 2016 IEEE conference on computer vision and pattern recognition (CVPR), Las Vegas: IEEE, pp:770–778
- Krizhevsky A, Sutskever I, Hinton GE (2012). Image net classification with deep convolutional neural networks. Proceedings of international conference on neural information processing systems. Curran associates Inc, pp: 1097–1105.
- Lin T Y, Dollar P, Girshick R (2017). Feature pyramid networks for object detection 2017 IEEE conference on computer vision and pattern recognition (CVPR).
- Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY et al (2016)SSD: single shot MultiBox detector. European conference on computer vision (ECCV)
- Ma GW, Xu ZH, Zhang W, Li SC (2015) An enriched k-means clustering method for grouping fractures with meliorated initial centers. Arab J Geosci 8(4):1881–1893
Article Google Scholar - Peng Z, Bingbing N, Cong G (2018) Scale-transferrable object detection. 2018 IEEE conference on computer vision and pattern recognition (CVPR)
- Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, Real-Time Object Detection IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Shrivastava A, Sukthankar R, Malik J (2017). Beyond Skip Connections: Top-Down Modulation for Object Detection. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Computer Science
- Zhang S, Wen L, Bian X (2018) Single-shot refinement neural network for object detection. 2018 IEEE conference on computer vision and pattern recognition (CVPR)
- Zhang Z, Qiao S, Xie C (2018). Single-Shot Object Detection with Enriched Semantics. 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Author information
Authors and Affiliations
- School of Computer and Information, Hefei University of Technology, No.193, Tunxi Road, HeFei, Anhui, People’s Republic of China
Yong Zhang, Wenjun Zhou, Yujie Wang & Linjia Xu
Authors
- Yong Zhang
- Wenjun Zhou
- Yujie Wang
- Linjia Xu
Corresponding author
Correspondence toYong Zhang.
Ethics declarations
Conflict of interest
Author Yong Zhang declares that he has no conflict of interest. Author Wenjun Zhou declares that he has no conflict of interest. Author Yujie Wang declares that she has no conflict of interest. Author LinjiaXu declares that he has no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhang, Y., Zhou, W., Wang, Y. et al. A real-time recognition method of static gesture based on DSSD.Multimed Tools Appl 79, 17445–17461 (2020). https://doi.org/10.1007/s11042-020-08725-9
- Received: 17 October 2018
- Revised: 13 January 2020
- Accepted: 01 February 2020
- Published: 18 February 2020
- Version of record: 18 February 2020
- Issue date: July 2020
- DOI: https://doi.org/10.1007/s11042-020-08725-9