Share Email Print

Proceedings Paper

Form similarity via Levenshtein distance between ortho-filtered logarithmic ruling-gap ratios
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

Geometric invariants are combined with edit distance to compare the ruling configuration of noisy filled-out forms. It is shown that gap-ratios used as features capture most of the ruling information of even low-resolution and poorly scanned form images, and that the edit distance is tolerant of missed and spurious rulings. No preprocessing is required and the potentially time-consuming string operations are performed on a sparse representation of the detected rulings. Based on edit distance, 158 Arabic forms are classified into 15 groups with 89% accuracy. Since the method was developed for an application that precludes public dissemination of the data, it is illustrated on public-domain death certificates.

Paper Details

Date Published: 24 March 2014
PDF: 8 pages
Proc. SPIE 9021, Document Recognition and Retrieval XXI, 902106 (24 March 2014); doi: 10.1117/12.2041956
Show Author Affiliations
George Nagy, Rensselaer Polytechnic Institute (United States)
Daniel Lopresti, Lehigh Univ. (United States)

Published in SPIE Proceedings Vol. 9021:
Document Recognition and Retrieval XXI
Bertrand Coüasnon; Eric K. Ringger, Editor(s)

© SPIE. Terms of Use
Back to Top