Share Email Print
cover

Proceedings Paper

Word recognition in a segmentation-free approach to OCR
Author(s): Prasanna G. Mulgaonkar; Chien-Huei Chen; Jeff L. DeCurtins
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

Segmentation is a key step in current OCR systems. It has been estimated that half the errors in character recognition are due to segmentation. We have developed a novel approach that performs OCR without the segmentation step. The approach starts by extracting significant geometric features from the input document image of the page. Each feature then `votes' for the character that could have generated that feature. Thus, even if some of the features are occluded or lost due to degradation, the remaining features can successfully identify the character. In extreme case, the degradation may be severe enough to prevent recognition of some of the characters in a word. In such cases, we use a lexicon-based word recognition technique to resolve ambiguity. Inexact matching and probabilistic evaluation used in the technique allow us to identify the correct word, by detecting a partial set of characters. This paper first presents an overview of our segmentation-free OCR system and then focuses on the word-recognition technique. Preliminary experimental results show that this is a very promising approach.

Paper Details

Date Published: 25 February 1994
PDF: 7 pages
Proc. SPIE 2103, 22nd AIPR Workshop: Interdisciplinary Computer Vision: Applications and Changing Needs, (25 February 1994); doi: 10.1117/12.169464
Show Author Affiliations
Prasanna G. Mulgaonkar, SRI International (United States)
Chien-Huei Chen, SRI International (United States)
Jeff L. DeCurtins, SRI International (United States)


Published in SPIE Proceedings Vol. 2103:
22nd AIPR Workshop: Interdisciplinary Computer Vision: Applications and Changing Needs
J. Michael Selander, Editor(s)

© SPIE. Terms of Use
Back to Top