Share Email Print

Proceedings Paper

Document matching on CCITT Group 4 compressed images
Author(s): Jonathan J. Hull
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

A method is proposed for detecting whether tow CCITT group 4 images were scanned from the same document. Features are extracted from rectangular patches of text and compared with a modified Hausdorff distance measure. Two images are said to be 'equivalent' if the Hausdorff measure finds that a specified number of features are located within a given distance of one another in both images. This paper explains the technique and presents experimental results that demonstrate its effectiveness.It is shown that features extracted from a one-inch square patch of image data provide better than 95 percent correct retrieval accuracy with no false positives on a database of 800 documents.

Paper Details

Date Published: 3 April 1997
PDF: 6 pages
Proc. SPIE 3027, Document Recognition IV, (3 April 1997); doi: 10.1117/12.270061
Show Author Affiliations
Jonathan J. Hull, Ricoh California Research Ctr. (United States)

Published in SPIE Proceedings Vol. 3027:
Document Recognition IV
Luc M. Vincent; Jonathan J. Hull, Editor(s)

© SPIE. Terms of Use
Back to Top