Share Email Print
cover

Proceedings Paper • new

Document image recognition algorithm based on similarity metric robust to projective distortions for mobile devices
Author(s): Aleksander Lynchenko; Aleksander Sheshkus ; Vladimir L. Arlazarov
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

The paper presents an algorithm for document image recognition robust to projective distortions. This algorithm is based on a similarity metric, which is learned using Siamese architecture. The idea of training Siamese networks is to build a function of converting the image into a space where a distance function corresponding to a pre-defined metric approximates the similarity between objects of initial space. During learning the loss function tries to minimize the distance between pairs of object from the same class and maximize it between the ones from different classes. A convolutional network is used for mapping initial space to the target one. This network lets to construct a feature vector in target space for each class. Classification of objects is performed using the mapping function and finding the nearest feature vector. The proposed algorithm achieved recognition quality comparable to classifying convolutional network on an open dataset of document images MIDV-500 [1]. Another important advantage of this method is the possibility of one-shot learning that is also shown in the paper.

Paper Details

Date Published: 15 March 2019
PDF: 7 pages
Proc. SPIE 11041, Eleventh International Conference on Machine Vision (ICMV 2018), 110411K (15 March 2019); doi: 10.1117/12.2523152
Show Author Affiliations
Aleksander Lynchenko, Smart Engines Ltd. (Russian Federation)
Aleksander Sheshkus , Smart Engines Ltd. (Russian Federation)
Vladimir L. Arlazarov, Smart Engines Ltd. (Russian Federation)
Institute for Systems Analysis (Russian Federation)


Published in SPIE Proceedings Vol. 11041:
Eleventh International Conference on Machine Vision (ICMV 2018)
Antanas Verikas; Dmitry P. Nikolaev; Petia Radeva; Jianhong Zhou, Editor(s)

© SPIE. Terms of Use
Back to Top