Proceedings Paper

Software tools and test data for research and testing of page-reading OCR systems
Author(s): Thomas A. Nartker; Stephen V. Rice; Steven E. Lumos

Paper Abstract

We announce the availability of the UNLV/ISRI Analytic Tools for OCR Evaluation, together with a large and diverse collection of scanned document images and their associated ground-truth text. This combination of tools and test data allows anyone to conduct a meaningful test comparing the performance of competing page-reading algorithms. The value of the collection is enhanced by knowledge of the past performance of several systems on exactly these tools and data; these performance comparisons were published in previous ISRI Test Reports and are also provided. In addition, the tools can be used to test the character accuracy of any page-reading OCR system for any language included in the Unicode standard. The paper concludes with a summary of the programs, test data, and documentation that are available and gives the URL where they can be located.
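
To make the notion of character accuracy concrete, the following is a minimal sketch and not the ISRI tools' own implementation. It assumes the common definition in which errors are counted as the minimum number of character insertions, deletions, and substitutions needed to turn the OCR output into the ground-truth text, and accuracy is (n - errors) / n for a ground truth of n characters. The function names `edit_distance` and `character_accuracy` are hypothetical and chosen for illustration only.

```python
def edit_distance(truth: str, ocr: str) -> int:
    """Levenshtein distance between two strings, counted in Unicode code points."""
    # prev[j] holds the distance between the first i-1 characters of truth
    # and the first j characters of ocr.
    prev = list(range(len(ocr) + 1))
    for i, t in enumerate(truth, start=1):
        curr = [i]
        for j, o in enumerate(ocr, start=1):
            cost = 0 if t == o else 1
            curr.append(min(prev[j] + 1,          # character missing from OCR output
                            curr[j - 1] + 1,      # extra character in OCR output
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def character_accuracy(truth: str, ocr: str) -> float:
    """Assumed definition: (n - errors) / n, where n = len(truth)."""
    if not truth:
        raise ValueError("ground-truth text must not be empty")
    errors = edit_distance(truth, ocr)
    return (len(truth) - errors) / len(truth)


if __name__ == "__main__":
    # One dropped character in an 11-character ground truth.
    print(character_accuracy("recognition", "recogniton"))
```

Because Python strings are sequences of Unicode code points, the same sketch applies to text in any script, although it does not normalize equivalent encodings of the same character. Under this definition the result can be negative when the OCR output contains many spurious characters.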

Paper Details

Date Published: 17 January 2005
PDF: 11 pages
Proc. SPIE 5676, Document Recognition and Retrieval XII, (17 January 2005); doi: 10.1117/12.587293
Thomas A. Nartker, Univ. of Nevada/Las Vegas (United States)
Stephen V. Rice, Univ. of Nevada/Las Vegas (United States)
Steven E. Lumos, Univ. of Nevada/Las Vegas (United States)

Published in SPIE Proceedings Vol. 5676:
Document Recognition and Retrieval XII
Editor(s): Elisa H. Barney Smith; Kazem Taghva
