Share Email Print

Proceedings Paper

Form similarity via Levenshtein distance between ortho-filtered logarithmic ruling-gap ratios
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Geometric invariants are combined with edit distance to compare the ruling configuration of noisy filled-out forms. It is shown that gap-ratios used as features capture most of the ruling information of even low-resolution and poorly scanned form images, and that the edit distance is tolerant of missed and spurious rulings. No preprocessing is required and the potentially time-consuming string operations are performed on a sparse representation of the detected rulings. Based on edit distance, 158 Arabic forms are classified into 15 groups with 89% accuracy. Since the method was developed for an application that precludes public dissemination of the data, it is illustrated on public-domain death certificates.

Paper Details

Date Published: 24 March 2014
PDF: 8 pages
Proc. SPIE 9021, Document Recognition and Retrieval XXI, 902106 (24 March 2014); doi: 10.1117/12.2041956
Show Author Affiliations
George Nagy, Rensselaer Polytechnic Institute (United States)
Daniel Lopresti, Lehigh Univ. (United States)

Published in SPIE Proceedings Vol. 9021:
Document Recognition and Retrieval XXI
Bertrand Coüasnon; Eric K. Ringger, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?