Proceedings Paper

Transcript mapping for handwritten English documents

Paper Abstract

Transcript mapping, or text alignment with handwritten documents, is the automatic alignment of words in a text file with word images in a handwritten document. Such a mapping has applications in fields ranging from machine learning, where large quantities of truth data are required for evaluating handwriting recognition algorithms, to data mining, where word image indexes are used in ranked retrieval of scanned documents in a digital library. The alignment also aids "writer identity" verification algorithms. Interfaces that display scanned handwritten documents may use this alignment to highlight manuscript tokens when a person examines the corresponding transcript word. We propose an adaptation of the True DTW dynamic programming algorithm for English handwritten documents. Our primary contribution is a cost metric for the DTW algorithm that integrates the dissimilarity scores from a word-model word recognizer with the Levenshtein distance between the recognized word and the lexicon word, leading to a fast and accurate alignment. The results provided confirm the effectiveness of our approach.
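
To make the idea of a combined cost metric concrete, the following is a minimal sketch of a textbook DTW alignment between word images and transcript words. It is not the authors' True DTW adaptation, whose details the abstract does not give. The function names, the equal weighting (`weight`), the normalization of the Levenshtein distance by word length, and the sample recognizer outputs are all assumptions made for illustration; each word image is represented only by a hypothetical recognizer's best guess and its dissimilarity score.

```python
from typing import List, Tuple


def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit-cost insert, delete, and substitute."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                # deletion
                            curr[j - 1] + 1,            # insertion
                            prev[j - 1] + (ca != cb)))  # substitution
        prev = curr
    return prev[-1]


def word_cost(recognized: str, dissimilarity: float,
              transcript_word: str, weight: float = 0.5) -> float:
    """Assumed cost: weighted sum of the recognizer's dissimilarity score
    and the length-normalized Levenshtein distance (hypothetical weighting)."""
    longest = max(len(recognized), len(transcript_word), 1)
    edit = levenshtein(recognized.lower(), transcript_word.lower()) / longest
    return weight * dissimilarity + (1.0 - weight) * edit


def align(images: List[Tuple[str, float]],
          transcript: List[str]) -> List[Tuple[int, int]]:
    """Standard DTW over (word image, transcript word) pairs.
    Returns the warping path as (image_index, transcript_index) pairs."""
    n, m = len(images), len(transcript)
    inf = float("inf")
    table = [[inf] * (m + 1) for _ in range(n + 1)]
    table[0][0] = 0.0
    for i in range(1, n + 1):
        recognized, score = images[i - 1]
        for j in range(1, m + 1):
            cost = word_cost(recognized, score, transcript[j - 1])
            table[i][j] = cost + min(table[i - 1][j - 1],
                                     table[i - 1][j],
                                     table[i][j - 1])
    # Backtrack from the end of both sequences to recover the path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        moves = [(table[i - 1][j - 1], i - 1, j - 1),
                 (table[i - 1][j], i - 1, j),
                 (table[i][j - 1], i, j - 1)]
        _, i, j = min(moves, key=lambda t: t[0])
    path.reverse()
    return path


# Made-up recognizer output: (best guess, dissimilarity score in [0, 1]).
images = [("the", 0.1), ("quick", 0.2), ("brwn", 0.4), ("fox", 0.1)]
transcript = ["the", "quick", "brown", "fox"]
print(align(images, transcript))  # [(0, 0), (1, 1), (2, 2), (3, 3)]
```

In this sketch a misrecognized word such as "brwn" still aligns with "brown" because the edit-distance term stays small, which is the intuition behind combining the two scores. When a word is segmented into more images than there are transcript words, the path repeats a transcript index instead of failing.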

Paper Details

Date Published: 28 January 2008
PDF: 8 pages
Proc. SPIE 6815, Document Recognition and Retrieval XV, 68150M (28 January 2008); doi: 10.1117/12.766489
Damien Jose, Univ. at Buffalo (United States)
Anurag Bharadwaj, Univ. at Buffalo (United States)
Venu Govindaraju, Univ. at Buffalo (United States)

Published in SPIE Proceedings Vol. 6815:
Document Recognition and Retrieval XV
Berrin A. Yanikoglu; Kathrin Berkner, Editor(s)
