Share Email Print

Proceedings Paper

Lexicon-supported OCR of eighteenth century Dutch books: a case study
Author(s): Jesse de Does; Katrien Depuydt
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

We report on a case study on OCR of eighteenth century books conducted in the IMPACT project. After introducing the IMPACT project and its approach to lexicon building and deployment, we zoom in to the application of IMPACT tools and data to the Dutch EDBO collection. The results are exemplified by detailed discussion of various practical options to improve text recognition beyond a baseline of running an uncustomized Finereader 10. In particular, we discuss improved recognition of long s.

Paper Details

Date Published: 4 February 2013
PDF: 14 pages
Proc. SPIE 8658, Document Recognition and Retrieval XX, 86580L (4 February 2013); doi: 10.1117/12.2008423
Show Author Affiliations
Jesse de Does, INL (Netherlands)
Katrien Depuydt, INL (Netherlands)

Published in SPIE Proceedings Vol. 8658:
Document Recognition and Retrieval XX
Richard Zanibbi; Bertrand Coüasnon, Editor(s)

© SPIE. Terms of Use
Back to Top