Proceedings Paper

Large scale parallel document image processing
Author(s): Tijn van der Zant; Lambert Schomaker; Edwin Valentijn

Paper Abstract

Building a system that can search a very large database of document images requires professionalization of hardware and software, e-science and web access. In astrophysics there is ample experience in dealing with large data sets, owing to an increasing number of measurement instruments. The digitization of historical documents of the Dutch cultural heritage poses a similar problem. This paper discusses the use of a system developed at the Kapteyn Institute of Astrophysics for the processing of large data sets, applied to the problem of creating a very large searchable archive of connected cursive handwritten texts. The system is adapted to the specific needs of processing document images. The work shows that interdisciplinary collaboration can be beneficial in the context of machine learning, data processing and professionalization of image processing and retrieval systems.

Paper Details

Date Published: 28 January 2008
PDF: 7 pages
Proc. SPIE 6815, Document Recognition and Retrieval XV, 68150S (28 January 2008); doi: 10.1117/12.765482
Author Affiliations
Tijn van der Zant, Univ. of Groningen (Netherlands)
Lambert Schomaker, Univ. of Groningen (Netherlands)
Edwin Valentijn, Univ. of Groningen (Netherlands)

Published in SPIE Proceedings Vol. 6815:
Document Recognition and Retrieval XV
Berrin A. Yanikoglu; Kathrin Berkner, Editor(s)
