Cascaded hierarchical CNN for 2D hand pose estimation from a single color image (original) (raw)

References

  1. Cai Y, Ge L, Cai J, Yuan J (2018) Weakly-supervised 3d hand pose estimation from monocular rgb images. In: Proceedings of the European conference on computer vision (ECCV), pp 666–682
  2. Chen Z, Du K, Sun Y, Lin X, Ma X (2020) Hierarchical neural network for hand pose estimation. Signal Process Image Commun 87(5):115909
    Article Google Scholar
  3. Chen Y, Ma H, Kong D, Yan X, Wu J, Fan W, Xie X (2020) Nonparametric structure regularization machine for 2d hand pose estimation. In: IEEE Winter Conference on applications of computer vision (WACV), pp 3258–3268
  4. Chen X, Wang G, Guo H, Zhang C (2020) Pose guided structured region ensemble network for cascaded hand pose estimation. Neurocomputing 395:138–149
    Article Google Scholar
  5. Clark E (2006) A multicamera system for gesture tracking with three dimensional hand pose estimation. Rochester Institute of Technology
  6. Du K, Lin X, Sun Y, Ma X (2019) Crossinfonet: multi-task information sharing based hand pose estimation. In: IEEE/CVF Conference on computer vision and pattern recognition (CVPR), pp 9888–9897
  7. Elboushaki A, Hannane R, Afdel K, Koutti L (2020) Improving articulated hand pose detection for static finger sign recognition in rgb-d images. Multimed Tools Appl 79(39):28925–28969
    Article Google Scholar
  8. EricLee (2021) https://codechina.csdn.net/ericlee/handpose_x
  9. Erol A, Bebis G, Nicolescu M, Boyle RD, Twombly X (2007) Vision-based hand pose estimation: a review. Comput Vis Image Underst 108(1-2):52–73
    Article Google Scholar
  10. Fan L, Rao H, Yang W (2021) 3d hand pose estimation based on five-layer ensemble cnn. Sensors 21(2):649–665
    Article Google Scholar
  11. Fan L, Zhang J, Dai S, Yang W, Liu W (2020) Cascaded hierarchical cnn for rgb-based 3d hand pose estimation. Math Probl Eng 2020(3):1–13
    Google Scholar
  12. Gomez-Donoso F, Orts-Escolano S, Cazorla M (2019) Large-scale multiview 3d hand pose dataset. Image Vis Comput 81:25–33
    Article Google Scholar
  13. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 770–778
  14. Huang XY, Tsai MS, Huang CC (2019) 3d virtual-reality interaction system. In: IEEE International conference on consumer electronics(ICCE), pp 1–2
  15. Jia D, Wei D, Socher R, Li LJ, Kai L, Li FF (2009) Imagenet: a large-scale hierarchical image database. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 248–255
  16. Joo H, Simon T, Li X, Liu H, Tan L, Gui L, Banerjee S, Godisart T, Nabbe B, Matthews I (2017) Panoptic studio: a massively multiview system for social interaction capture. IEEE Trans Pattern Anal Mach Intell 41 (1):190–204
    Article Google Scholar
  17. Kingma D, Ba J (2014) Adam: a method for stochastic optimization, arXiv:1412.6980
  18. Kourbane I, Genc Y (2021) Skeleton-aware multi-scale heatmap regression for 2d hand pose estimation, arXiv:2105.10904
  19. Kulon D, Güler R, Kokkinos I, Bronstein M, Zafeiriou S (2020) Weakly-supervised mesh-convolutional hand reconstruction in the wild. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp 4990–5000
  20. Madadi M, Escalera S, Baro X, Gonzalez J (2017) End-to-end global to local cnn learning for hand pose recovery in depth data, arXiv:1705.09606
  21. Mehta D, Sridhar S, Sotnychenko O, Rhodin H, Theobalt C (2017) Vnect: real-time 3d human pose estimation with a single rgb camera. ACM Trans Graph (TOG) 36(4):1–14
    Article Google Scholar
  22. Newell A, Yang K, Deng J (2016) Stacked hourglass networks for human pose estimation. In: European conference on computer vision (ECCV), pp 483–499
  23. Panteleris P, Oikonomidis I, Argyros A (2018) Using a single rgb frame for real time 3d hand pose estimation in the wild. In: IEEE Winter conference on applications of computer vision (WACV), pp 436–445
  24. Rosa-Pujazon A, Barbancho I, Tardon LJ, Barbancho AM (2016) Fast-gesture recognition and classification using kinect: an application for a virtual reality drumkit. Multimed Tools Appl 75(14):8137–8164
    Article Google Scholar
  25. Ruder S (2017) An overview of multi-task learning in deep neural networks, arXiv:1706.05098
  26. Simon T, Joo H, Matthews I, Sheikh Y (2017) Hand keypoint detection in single images using multiview bootstrapping. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1145–1153
  27. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition, arXiv:1409.1556
  28. Sun X, Wei Y, Liang S, Tang X, Sun J (2015) Cascaded hand pose regression. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 824–832
  29. Tang D, Jin Chang H, Tejani A, Kim TK (2014) Latent regression forest: Structured estimation of 3d articulated hand posture. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 3786–3793
  30. Tompson J, Stein M, Lecun Y, Perlin K (2014) Real-time continuous pose recovery of human hands using convolutional networks. ACM Trans Graph (ToG) 33(5):1–10
    Article Google Scholar
  31. Wan J (2021) Gesture recognition and information recommendation based on machine learning and virtual reality in distance education. Journal of Intelligent and Fuzzy Systems (Preprint), 1–11
  32. Wan C, Yao A, Van Gool L (2016) Hand pose estimation from local surface normals. In: European conference on computer vision (ECCV), pp 554–569
  33. Wang Y, Cong P, Liu Y (2018) Mask-pose cascaded cnn for 2d hand pose estimation from single color image. IEEE Trans Circuits Syst Video Technol 29(11):3258–3268
    Article Google Scholar
  34. Wang Y, Zhang B, Peng C (2019) Srhandnet: real-time 2d hand pose estimation with simultaneous region localization. IEEE Trans Image Process 29:2977–2986
    Article Google Scholar
  35. Wei SE, Ramakrishna V, Kanade T, Sheikh Y (2016) Convolutional pose machines. In: IEEE Conference on computer vision and pattern recognition (CVPR), pp 4724–4732
  36. Yi Y, Ramanan D (2013) Articulated human detection with flexible mixtures of parts. IEEE Trans Softw Eng 35(12):2878–2890
    Google Scholar
  37. Zhou Y, Jiang G, Lin Y (2016) A novel finger and hand pose estimation technique for real-time hand gesture recognition. Pattern Recogn 49:102–114
    Article Google Scholar
  38. Zimmermann C, Brox T (2017) Learning to estimate 3d hand pose from single rgb images. In: IEEE International conference on computer vision (ICCV), pp 4913–4921
  39. Zimmermann C, Ceylan D, Yang J, Russell B, Argus M, Brox T (2019) Freihand: a dataset for markerless capture of hand pose and shape from single rgb images. In: Proceedings of the IEEE/CVF international conference on computer vision (ICCV), pp 813–822

Download references