Share Email Print
cover

Proceedings Paper

Segmenting handwritten text lines into words using distance algorithms
Author(s): Giovanni Seni; Edward Cohen
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

This paper explores different distance algorithms that can group connected components of a handwritten text line into words. A binarized handwritten text image normally consists of many connected components, where each component is a character fragment, an isolated character, or a group of characters. When the writing style is unconstrained, recognition of individual components is unreliable so the components must be grouped into words before recognition algorithms (which may require dictionaries) can be used. Algorithms that compute the distance between connected components can indicate how the connected components should be clustered into words. We show that fast straightforward distance algorithms (such as using the horizontal distance between the component''s bounding boxes) have mediocre performance. Euclidean distance algorithms perform well but are computationally slow. This paper describes original methods of computing distances. These algorithms include combining a set of horizontal distances between components (applied to each pixel row) with the Euclidean and bounding box methods to achieve high performance and reasonable speed. We examine six distance algorithms and each is tested on unconstrained handwritten address images.

Paper Details

Date Published: 1 August 1992
PDF: 12 pages
Proc. SPIE 1661, Machine Vision Applications in Character Recognition and Industrial Inspection, (1 August 1992); doi: 10.1117/12.130274
Show Author Affiliations
Giovanni Seni, SUNY/Buffalo (United States)
Edward Cohen, SUNY/Buffalo (United States)


Published in SPIE Proceedings Vol. 1661:
Machine Vision Applications in Character Recognition and Industrial Inspection
Donald P. D'Amato; Wolf-Ekkehard Blanz; Byron E. Dom; Sargur N. Srihari, Editor(s)

© SPIE. Terms of Use
Back to Top