Share Email Print

Proceedings Paper

Interactive training for handwriting recognition in historical document collections
Author(s): Douglas J. Kennard; William A. Barrett
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

We present a method of interactive training for handwriting recognition in collections of documents. As the user transcribes (labels) the words in the training set, words are automatically skipped if they appear to match words that are already transcribed. By reducing the amount of redundant training, better coverage of the data is achieved, resulting in more accurate recognition. Using word-level features for training and recognition in a collection of George Washington's manuscripts, the recognition ratio is approximately 2%-8% higher after training with our interactive method than after training the same number of words sequentially. Using our approach, less training is required to achieve an equivalent recognition ratio. A slight improvement in recognition ratio is also observed when using our method on a second data set, which consists of several pages from a diary written by Jennie Leavitt Smith.

Paper Details

Date Published: 29 January 2007
PDF: 8 pages
Proc. SPIE 6500, Document Recognition and Retrieval XIV, 65000E (29 January 2007); doi: 10.1117/12.703378
Show Author Affiliations
Douglas J. Kennard, Brigham Young Univ. (United States)
William A. Barrett, Brigham Young Univ. (United States)

Published in SPIE Proceedings Vol. 6500:
Document Recognition and Retrieval XIV
Xiaofan Lin; Berrin A. Yanikoglu, Editor(s)

© SPIE. Terms of Use
Back to Top