Share Email Print

Proceedings Paper

GAN-based data augmentation for visual finger spelling recognition
Author(s): Bogdan Kwolek
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

In this work we extend WGAN-GP in order to achieve better generation of synthesized images for finger spelling classification. The main difference between the ordinary WGAN-GP and the proposed algorithm is that in the training we employ both training samples and training labels. These training labels are fed to the generator, that generates the synthetic images using both the randomized latent input and the input label. In ordinary WGAN-GP, latent input variables are usually sampled from an unconditional prior. In the proposed algorithm the latent input vector is a concatenation of random part, the class labels and additional variables that are drawn from Gaussian distributions representing hand poses or gesture attributes. The JSL dataset for Hiragana sign recognition has been balanced using the rendered samples on the basis of a 3D hand model as well as the extended WGAN-GP.

Paper Details

Date Published: 15 March 2019
PDF: 8 pages
Proc. SPIE 11041, Eleventh International Conference on Machine Vision (ICMV 2018), 110411U (15 March 2019); doi: 10.1117/12.2522935
Show Author Affiliations
Bogdan Kwolek, AGH Univ. of Science and Technology (Poland)

Published in SPIE Proceedings Vol. 11041:
Eleventh International Conference on Machine Vision (ICMV 2018)
Antanas Verikas; Dmitry P. Nikolaev; Petia Radeva; Jianhong Zhou, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?