Share Email Print
cover

Proceedings Paper

Min-cut segmentation of cursive handwriting in tabular documents
Author(s): Brian L. Davis; William A. Barrett; Scott D. Swingle
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

Handwritten tabular documents, such as census, birth, death and marriage records, contain a wealth of information vital to genealogical and related research. Much work has been done in segmenting freeform handwriting, however, segmentation of cursive handwriting in tabular documents is still an unsolved problem. Tabular documents present unique segmentation challenges caused by handwriting overlapping cell-boundaries and other words, both horizontally and vertically, as “ascenders” and “descenders” overlap into adjacent cells. This paper presents a method for segmenting handwriting in tabular documents using a min-cut/max-flow algorithm on a graph formed from a distance map and connected components of handwriting. Specifically, we focus on line, word and first letter segmentation. Additionally, we include the angles of strokes of the handwriting as a third dimension to our graph to enable the resulting segments to share pixels of overlapping letters. Word segmentation accuracy is 89.5% evaluating lines of the data set used in the ICDAR2013 Handwriting Segmentation Contest. Accuracy is 92.6% for a specific application of segmenting first and last names from noisy census records. Accuracy for segmenting lines of names from noisy census records is 80.7%. The 3D graph cutting shows promise in segmenting overlapping letters, although highly convoluted or overlapping handwriting remains an ongoing challenge.

Paper Details

Date Published: 8 February 2015
PDF: 12 pages
Proc. SPIE 9402, Document Recognition and Retrieval XXII, 940208 (8 February 2015); doi: 10.1117/12.2076228
Show Author Affiliations
Brian L. Davis, Brigham Young Univ. (United States)
William A. Barrett, Brigham Young Univ. (United States)
Scott D. Swingle, Brigham Young Univ. (United States)


Published in SPIE Proceedings Vol. 9402:
Document Recognition and Retrieval XXII
Eric K. Ringger; Bart Lamiroy, Editor(s)

© SPIE. Terms of Use
Back to Top