Proceedings Paper

Generalizing edit distance for handwritten text recognition
Author(s): Giovanni Seni; V. Kripasundar; Rohini K. Srihari

Paper Abstract

In this paper the Damerau-Levenshtein string difference metric is generalized in two ways to compensate more accurately for the types of errors that occur in the script recognition domain. First, the basic dynamic programming method for computing such a measure is extended to allow for merges, splits and two-letter substitutions. Second, edit operations are refined into categories according to the effect they have on the visual 'appearance' of words. A set of recognizer-independent constraints is developed to reflect the severity of the information lost due to each operation. These constraints are solved to assign specific costs to the operations. Experimental results on 2,335 corrupted strings and a lexicon of 21,299 words show higher correction rates than with the original form of the metric.
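As a rough illustration of the first generalization, the sketch below extends the usual dynamic programming table with merge, split and two-letter substitution steps. It is not the authors' implementation: the function name `generalized_edit_distance`, the cost keys and every default cost value are assumptions made for this example, and the constraint-derived costs and visual categories described in the abstract are not reproduced here.

```python
def generalized_edit_distance(source, target, costs=None):
    """Edit distance with merge, split and two-letter substitution steps.

    Illustrative sketch only. The default costs are placeholders (an
    assumption), not the values derived in the paper.
    """
    c = {
        "insert": 1.0,
        "delete": 1.0,
        "substitute": 1.0,
        "transpose": 1.0,        # adjacent swap, as in Damerau-Levenshtein
        "merge": 1.0,            # two source letters read as one target letter
        "split": 1.0,            # one source letter read as two target letters
        "pair_substitute": 1.5,  # two source letters read as two other letters
    }
    if costs:
        c.update(costs)

    m, n = len(source), len(target)
    # d[i][j] = cheapest way to turn source[:i] into target[:j]
    d = [[0.0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        d[i][0] = d[i - 1][0] + c["delete"]
    for j in range(1, n + 1):
        d[0][j] = d[0][j - 1] + c["insert"]

    for i in range(1, m + 1):
        for j in range(1, n + 1):
            same = source[i - 1] == target[j - 1]
            best = min(
                d[i - 1][j] + c["delete"],
                d[i][j - 1] + c["insert"],
                d[i - 1][j - 1] + (0.0 if same else c["substitute"]),
            )
            if i >= 2 and j >= 2:
                # Restricted (optimal string alignment) transposition.
                if source[i - 2] == target[j - 1] and source[i - 1] == target[j - 2]:
                    best = min(best, d[i - 2][j - 2] + c["transpose"])
                # Two-letter substitution: consume two letters on each side.
                best = min(best, d[i - 2][j - 2] + c["pair_substitute"])
            if i >= 2:
                # Merge: consume two source letters, one target letter.
                best = min(best, d[i - 2][j - 1] + c["merge"])
            if j >= 2:
                # Split: consume one source letter, two target letters.
                best = min(best, d[i - 1][j - 2] + c["split"])
            d[i][j] = best
    return d[m][n]
```

Under these placeholder costs, a string such as "clear" reaches "dear" with a single merge ("cl" read as "d"), whereas the classical metric needs a deletion plus a substitution. Raising the merge, split and pair costs above the price of the equivalent single-letter operations makes the function fall back to the classical behaviour.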

Paper Details

Date Published: 30 March 1995
PDF: 12 pages
Proc. SPIE 2422, Document Recognition II, (30 March 1995); doi: 10.1117/12.205841
Author Affiliations
Giovanni Seni, SUNY/Buffalo (United States)
V. Kripasundar, SUNY/Buffalo (United States)
Rohini K. Srihari, SUNY/Buffalo (United States)

Published in SPIE Proceedings Vol. 2422:
Document Recognition II
Luc M. Vincent; Henry S. Baird, Editor(s)

© SPIE. Terms of Use