Share Email Print
cover

Proceedings Paper

Generation method of synthetic training data for mobile OCR system
Author(s): Yulia S. Chernyshova; Alexander V. Gayer; Alexander V. Sheshkus
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

This paper addresses one of the fundamental problems of machine learning - training data acquiring. Obtaining enough natural training data is rather difficult and expensive. In last years usage of synthetic images has become more beneficial as it allows to save human time and also to provide a huge number of images which otherwise would be difficult to obtain. However, for successful learning on artificial dataset one should try to reduce the gap between natural and synthetic data distributions. In this paper we describe an algorithm which allows to create artificial training datasets for OCR systems using russian passport as a case study.

Paper Details

Date Published: 13 April 2018
PDF: 7 pages
Proc. SPIE 10696, Tenth International Conference on Machine Vision (ICMV 2017), 106962G (13 April 2018); doi: 10.1117/12.2310119
Show Author Affiliations
Yulia S. Chernyshova, National Univ. of Science and Technology "MISIS" (Russian Federation)
Smart Engines (Russian Federation)
Alexander V. Gayer, National Univ. of Science and Technology "MISIS" (Russian Federation)
Smart Engines (Russian Federation)
Alexander V. Sheshkus, Smart Engines (Russian Federation)


Published in SPIE Proceedings Vol. 10696:
Tenth International Conference on Machine Vision (ICMV 2017)
Antanas Verikas; Petia Radeva; Dmitry Nikolaev; Jianhong Zhou, Editor(s)

© SPIE. Terms of Use
Back to Top