Proceedings Paper

Methods for objective evaluation and improvement of text document images
Author(s): Valery S. Kot; Alexander V. Bondarenko

Paper Abstract

Any optical character recognition (OCR) system contains a preprocessing unit responsible for image binarization, and the overall recognition rate depends heavily on the accuracy of this unit. When image quality is poor, the user must spend considerable time searching for the best parameters for this unit, and the recognition rate may still remain unsatisfactory. Methods for objective evaluation and context-sensitive improvement of text document images are therefore required. In this paper, a parameter set is proposed as a tool for integral image description. This compact set makes it possible to select, automatically or semi-automatically, the optimal image processing sequence from the basic image processing (IP) functions. For all tested commercial OCR systems, the proposed methods reduce recognition errors by about 50 to 60% for text document images of average and poor quality, while requiring less than 1 minute of additional processing time per page.
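
The general idea of describing an image with a few numbers and using them to pick a preprocessing sequence can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract does not list the parameters or the IP functions, so the descriptors (`mean`, `contrast`), the `low_contrast` cutoff of 40, the step names, and the use of Otsu's threshold for binarization are all assumptions chosen for illustration.

```python
from statistics import mean, pstdev


def describe(image):
    """Hypothetical descriptor set for a grayscale image (rows of 0..255 ints)."""
    pixels = [p for row in image for p in row]
    return {"mean": mean(pixels), "contrast": pstdev(pixels)}


def contrast_stretch(image):
    """Linearly map the darkest pixel to 0 and the brightest to 255."""
    pixels = [p for row in image for p in row]
    lo, hi = min(pixels), max(pixels)
    if hi == lo:  # flat image: nothing to stretch
        return [row[:] for row in image]
    return [[(p - lo) * 255 // (hi - lo) for p in row] for row in image]


def otsu_threshold(image):
    """Otsu's method: pick the threshold maximizing between-class variance."""
    hist = [0] * 256
    for row in image:
        for p in row:
            hist[p] += 1
    total = sum(hist)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w_dark, sum_dark = 0, 0
    for t in range(256):
        w_dark += hist[t]
        sum_dark += t * hist[t]
        if w_dark == 0:
            continue
        w_bright = total - w_dark
        if w_bright == 0:
            break
        m_dark = sum_dark / w_dark
        m_bright = (sum_all - sum_dark) / w_bright
        var = w_dark * w_bright * (m_dark - m_bright) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def binarize(image):
    """Return 0 for ink (at or below the Otsu threshold), 1 for paper."""
    t = otsu_threshold(image)
    return [[1 if p > t else 0 for p in row] for row in image]


# Assumed pool of basic IP functions, keyed by step name.
STEPS = {"contrast_stretch": contrast_stretch, "binarize": binarize}


def choose_sequence(desc, low_contrast=40.0):
    """Select a processing sequence from the descriptors (assumed rule)."""
    if desc["contrast"] < low_contrast:
        return ["contrast_stretch", "binarize"]
    return ["binarize"]


def preprocess(image):
    """Describe the image, choose a sequence, and apply it step by step."""
    for name in choose_sequence(describe(image)):
        image = STEPS[name](image)
    return image


if __name__ == "__main__":
    faded_page = [[100, 100, 120], [100, 120, 120]]  # low-contrast toy image
    print(choose_sequence(describe(faded_page)))
    print(preprocess(faded_page))
```

In this sketch the descriptors drive a single rule. A fuller system would presumably use more parameters and a larger pool of functions, with the semi-automatic mode letting the user confirm or override the chosen sequence.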

Paper Details

Date Published: 1 October 1998
PDF: 9 pages
Proc. SPIE 3460, Applications of Digital Image Processing XXI, (1 October 1998); doi: 10.1117/12.323158
Valery S. Kot, Institute of Informatics Problems (Netherlands)
Alexander V. Bondarenko, State Research Institute of Aviation Systems (Russia)

Published in SPIE Proceedings Vol. 3460:
Applications of Digital Image Processing XXI
Andrew G. Tescher, Editor
