Share Email Print

Proceedings Paper

Automatic indexing of scanned documents: a layout-based approach
Author(s): Daniel Esser; Daniel Schuster; Klemens Muthmann; Michael Berger; Alexander Schill
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Archiving official written documents such as invoices, reminders and account statements in business and private area gets more and more important. Creating appropriate index entries for document archives like sender's name, creation date or document number is a tedious manual work. We present a novel approach to handle automatic indexing of documents based on generic positional extraction of index terms. For this purpose we apply the knowledge of document templates stored in a common full text search index to find index positions that were successfully extracted in the past.

Paper Details

Date Published: 23 January 2012
PDF: 8 pages
Proc. SPIE 8297, Document Recognition and Retrieval XIX, 82970H (23 January 2012); doi: 10.1117/12.908542
Show Author Affiliations
Daniel Esser, Technische Univ. Dresden (Germany)
Daniel Schuster, Technische Univ. Dresden (Germany)
Klemens Muthmann, Technische Univ. Dresden (Germany)
Michael Berger, DocuWare AG (Germany)
Alexander Schill, Technische Univ. Dresden (Germany)

Published in SPIE Proceedings Vol. 8297:
Document Recognition and Retrieval XIX
Christian Viard-Gaudin; Richard Zanibbi, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?