Proceedings Paper

A framework to improve digital corpus uses: image-mode navigation
Author(s): Loris Eynard; Vincent Malleron; Hubert Emptoz

Paper Abstract

In this paper, we propose a new system to enhance navigation inside digital corpora. The system is based on automatic indexing in image mode and gives the user intuitive navigation at interactive speed. Keywords and containers are extracted directly from the document images to create an Image Mode Index, which shows the keywords as cut-out images of their actual appearance. Our approach recreates a summary of the structured documents, following indications given by the creators of the documents themselves. We describe the system in the general case and present sample applications to a 19th-century handwritten corpus and an 18th-century machine-printed text corpus. This approach, developed for documents that cannot be reached otherwise, can be applied to any corpus in which keywords and containers can be identified.

Paper Details

Date Published: 24 January 2011
PDF: 8 pages
Proc. SPIE 7874, Document Recognition and Retrieval XVIII, 78740X (24 January 2011); doi: 10.1117/12.873389
Loris Eynard, CNRS, INSA-Lyon, LIRIS, Univ. Lyon (France)
Vincent Malleron, CNRS, INSA-Lyon, LIRIS, Univ. Lyon (France); CNRS, LIRE, Univ. Lyon 2 (France)
Hubert Emptoz, CNRS, INSA-Lyon, LIRIS, Univ. Lyon (France)

Published in SPIE Proceedings Vol. 7874:
Document Recognition and Retrieval XVIII
Gady Agam; Christian Viard-Gaudin, Editor(s)

© SPIE. Terms of Use