Share Email Print
cover

Proceedings Paper • new

Deep network based 3D hand keypoints prediction from single RGB images
Author(s): Jialong Wang; Nong Sang
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

3D hand keypoints prediction is an important and fundamental task in Human-Computer Interaction. In this paper, we present an approach to predict 3D hand keypoints from single RGB images. Single RGB images are very common in daily life. However, it is challenging to predict 3D hand keypoints using single RGB images, because of depth ambiguities and occlusions. To deal with these challenges, we exploit deep neural networks to predict 3D hand keypoints. So far, there are several methods which predict 3D hand keypoints from single RGB images. Most of them separate the task into three stages. i.e., hand detection, 2D hand keypoints estimation and 3D hand keypoints prediction. We follow the idea and focus on the 2D hand keypoints estimation and 3D hand keypoints prediction. We improve an existing deep-network-based technique and get better results. Specifically, we combine the convolution and deconvolution network to get the pixel-wise estimation of 2D hand keypoints, and propose a new loss function to predict 3D hand keypoints from 2D keypoints. We evaluate our network on several public datasets and get better results than several other methods. Besides, ablation studies demonstrate that our network is valid.

Paper Details

Date Published: 13 May 2019
PDF: 9 pages
Proc. SPIE 10995, Pattern Recognition and Tracking XXX, 109950V (13 May 2019); doi: 10.1117/12.2522496
Show Author Affiliations
Jialong Wang, Huazhong Univ. of Science and Technology (China)
Nong Sang, Huazhong Univ. of Science and Technology (China)


Published in SPIE Proceedings Vol. 10995:
Pattern Recognition and Tracking XXX
Mohammad S. Alam, Editor(s)

© SPIE. Terms of Use
Back to Top