Cascaded hierarchical CNN for 2D hand pose estimation from a single color image (original) (raw)
References
Cai Y, Ge L, Cai J, Yuan J (2018) Weakly-supervised 3d hand pose estimation from monocular rgb images. In: Proceedings of the European conference on computer vision (ECCV), pp 666–682
Chen Z, Du K, Sun Y, Lin X, Ma X (2020) Hierarchical neural network for hand pose estimation. Signal Process Image Commun 87(5):115909 Article Google Scholar
Chen Y, Ma H, Kong D, Yan X, Wu J, Fan W, Xie X (2020) Nonparametric structure regularization machine for 2d hand pose estimation. In: IEEE Winter Conference on applications of computer vision (WACV), pp 3258–3268
Chen X, Wang G, Guo H, Zhang C (2020) Pose guided structured region ensemble network for cascaded hand pose estimation. Neurocomputing 395:138–149 Article Google Scholar
Clark E (2006) A multicamera system for gesture tracking with three dimensional hand pose estimation. Rochester Institute of Technology
Du K, Lin X, Sun Y, Ma X (2019) Crossinfonet: multi-task information sharing based hand pose estimation. In: IEEE/CVF Conference on computer vision and pattern recognition (CVPR), pp 9888–9897
Elboushaki A, Hannane R, Afdel K, Koutti L (2020) Improving articulated hand pose detection for static finger sign recognition in rgb-d images. Multimed Tools Appl 79(39):28925–28969 Article Google Scholar
Erol A, Bebis G, Nicolescu M, Boyle RD, Twombly X (2007) Vision-based hand pose estimation: a review. Comput Vis Image Underst 108(1-2):52–73 Article Google Scholar
Fan L, Rao H, Yang W (2021) 3d hand pose estimation based on five-layer ensemble cnn. Sensors 21(2):649–665 Article Google Scholar
Fan L, Zhang J, Dai S, Yang W, Liu W (2020) Cascaded hierarchical cnn for rgb-based 3d hand pose estimation. Math Probl Eng 2020(3):1–13 Google Scholar
Gomez-Donoso F, Orts-Escolano S, Cazorla M (2019) Large-scale multiview 3d hand pose dataset. Image Vis Comput 81:25–33 Article Google Scholar
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 770–778
Huang XY, Tsai MS, Huang CC (2019) 3d virtual-reality interaction system. In: IEEE International conference on consumer electronics(ICCE), pp 1–2
Jia D, Wei D, Socher R, Li LJ, Kai L, Li FF (2009) Imagenet: a large-scale hierarchical image database. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 248–255
Joo H, Simon T, Li X, Liu H, Tan L, Gui L, Banerjee S, Godisart T, Nabbe B, Matthews I (2017) Panoptic studio: a massively multiview system for social interaction capture. IEEE Trans Pattern Anal Mach Intell 41 (1):190–204 Article Google Scholar
Kingma D, Ba J (2014) Adam: a method for stochastic optimization, arXiv:1412.6980
Kourbane I, Genc Y (2021) Skeleton-aware multi-scale heatmap regression for 2d hand pose estimation, arXiv:2105.10904
Kulon D, Güler R, Kokkinos I, Bronstein M, Zafeiriou S (2020) Weakly-supervised mesh-convolutional hand reconstruction in the wild. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp 4990–5000
Madadi M, Escalera S, Baro X, Gonzalez J (2017) End-to-end global to local cnn learning for hand pose recovery in depth data, arXiv:1705.09606
Mehta D, Sridhar S, Sotnychenko O, Rhodin H, Theobalt C (2017) Vnect: real-time 3d human pose estimation with a single rgb camera. ACM Trans Graph (TOG) 36(4):1–14 Article Google Scholar
Newell A, Yang K, Deng J (2016) Stacked hourglass networks for human pose estimation. In: European conference on computer vision (ECCV), pp 483–499
Panteleris P, Oikonomidis I, Argyros A (2018) Using a single rgb frame for real time 3d hand pose estimation in the wild. In: IEEE Winter conference on applications of computer vision (WACV), pp 436–445
Rosa-Pujazon A, Barbancho I, Tardon LJ, Barbancho AM (2016) Fast-gesture recognition and classification using kinect: an application for a virtual reality drumkit. Multimed Tools Appl 75(14):8137–8164 Article Google Scholar
Ruder S (2017) An overview of multi-task learning in deep neural networks, arXiv:1706.05098
Simon T, Joo H, Matthews I, Sheikh Y (2017) Hand keypoint detection in single images using multiview bootstrapping. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1145–1153
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition, arXiv:1409.1556
Sun X, Wei Y, Liang S, Tang X, Sun J (2015) Cascaded hand pose regression. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 824–832
Tang D, Jin Chang H, Tejani A, Kim TK (2014) Latent regression forest: Structured estimation of 3d articulated hand posture. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 3786–3793
Tompson J, Stein M, Lecun Y, Perlin K (2014) Real-time continuous pose recovery of human hands using convolutional networks. ACM Trans Graph (ToG) 33(5):1–10 Article Google Scholar
Wan J (2021) Gesture recognition and information recommendation based on machine learning and virtual reality in distance education. Journal of Intelligent and Fuzzy Systems (Preprint), 1–11
Wan C, Yao A, Van Gool L (2016) Hand pose estimation from local surface normals. In: European conference on computer vision (ECCV), pp 554–569
Wang Y, Cong P, Liu Y (2018) Mask-pose cascaded cnn for 2d hand pose estimation from single color image. IEEE Trans Circuits Syst Video Technol 29(11):3258–3268 Article Google Scholar
Wang Y, Zhang B, Peng C (2019) Srhandnet: real-time 2d hand pose estimation with simultaneous region localization. IEEE Trans Image Process 29:2977–2986 Article Google Scholar
Wei SE, Ramakrishna V, Kanade T, Sheikh Y (2016) Convolutional pose machines. In: IEEE Conference on computer vision and pattern recognition (CVPR), pp 4724–4732
Yi Y, Ramanan D (2013) Articulated human detection with flexible mixtures of parts. IEEE Trans Softw Eng 35(12):2878–2890 Google Scholar
Zhou Y, Jiang G, Lin Y (2016) A novel finger and hand pose estimation technique for real-time hand gesture recognition. Pattern Recogn 49:102–114 Article Google Scholar
Zimmermann C, Brox T (2017) Learning to estimate 3d hand pose from single rgb images. In: IEEE International conference on computer vision (ICCV), pp 4913–4921
Zimmermann C, Ceylan D, Yang J, Russell B, Argus M, Brox T (2019) Freihand: a dataset for markerless capture of hand pose and shape from single rgb images. In: Proceedings of the IEEE/CVF international conference on computer vision (ICCV), pp 813–822