Share Email Print

Proceedings Paper

Applications of text-image editing
Author(s): Steven C. Bagley; Gary E. Kopec
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

The most common approach to processing text which originates as a scanned document image is format conversion, in which procedures such as page segmentation and character recognition are used to convert the scanned text into a structured symbolic description which can be manipulated by a conventional text editor. While this approach is attractive in many respects, there are situations in which complete recognition and format conversion is either unnecessary or very difficult to achieve with sufficient accuracy. This paper presents several applications illustrating an alternative approach to scanned text processing in which document processing operations are performed on image elements extracted from the scanned document image. The central and novel insight is that many document processing operations may be implemented directly by geometrical operations on image blobs, without explicit knowledge of the symbolic character labels (that is, without automatic character recognition). The applications are implemented as part of image EMACS, an editor for binary document images, and include editing multilingual documents, reformatting text to a new column width, differential comparison of two versions of a document, and preprocessing an image prior to character recognition.

Paper Details

Date Published: 1 August 1991
PDF: 9 pages
Proc. SPIE 1460, Image Handling and Reproduction Systems Integration, (1 August 1991); doi: 10.1117/12.44412
Show Author Affiliations
Steven C. Bagley, Xerox/Palo Alto Research Ctr. (United States)
Gary E. Kopec, Xerox/Palo Alto Research Ctr. (United States)

Published in SPIE Proceedings Vol. 1460:
Image Handling and Reproduction Systems Integration
Walter R. Bender; Wil Plouffe, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?