Share Email Print

Proceedings Paper

Identification and correction of rejection and substitution errors in optical character recognition systems
Author(s): Glenn S. Himes; Marty M. Scholl; Frank A. DeCosta
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

We examine the use of character image analysis coupled with contextual information in complex data gathering forms to identify and correct optical character recognition (OCR) system rejection and substitution errors. Segmented characters from a complex data gathering form are initially classified using an OCR engine based on a combination of Karhunen-Loeve transforms and a back-propagation neural network. Systems of equations are derived from the data gathering form to determine the values of characters rejected by the OCR engine and to verify the consistency of the data captured. If the OCR results for a single form are determined to be inconsistent with respect to the form's data relationships, a set of decision algorithms which incorporates a second neural network and uses additional character features is used to tag characters according to their likelihood of substitution error. Potential substitution errors are incrementally added to the set of OCR reject errors and are processed through dynamically selected systems of equations and search techniques which correct both error classes. We provide experimental results and determine the extent to which errors can be detected and corrected for various OCR error rates.

Paper Details

Date Published: 14 April 1993
PDF: 12 pages
Proc. SPIE 1906, Character Recognition Technologies, (14 April 1993); doi: 10.1117/12.143616
Show Author Affiliations
Glenn S. Himes, MITRE Corp. (United States)
Marty M. Scholl, MITRE Corp. (United States)
Frank A. DeCosta, MITRE Corp. (United States)

Published in SPIE Proceedings Vol. 1906:
Character Recognition Technologies
Donald P. D'Amato, Editor(s)

© SPIE. Terms of Use
Back to Top