Share Email Print

Proceedings Paper

Fast structural matching for document image retrieval through spatial databases
Author(s): Hongxing Gao; Maçal Rusiñol; Dimosthenis Karatzas; Josep Lladós
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

The structure of document images plays a significant role in document analysis thus considerable efforts have been made towards extracting and understanding document structure, usually in the form of layout analysis approaches. In this paper, we first employ Distance Transform based MSER (DTMSER) to efficiently extract stable document structural elements in terms of a dendrogram of key-regions. Then a fast structural matching method is proposed to query the structure of document (dendrogram) based on a spatial database which facilitates the formulation of advanced spatial queries. The experiments demonstrate a significant improvement in a document retrieval scenario when compared to the use of typical Bag of Words (BoW) and pyramidal BoW descriptors.

Paper Details

Date Published: 24 March 2014
PDF: 10 pages
Proc. SPIE 9021, Document Recognition and Retrieval XXI, 90210N (24 March 2014); doi: 10.1117/12.2042458
Show Author Affiliations
Hongxing Gao, Univ. Autònoma de Barcelona (Spain)
Maçal Rusiñol, Univ. Autònoma de Barcelona (Spain)
Dimosthenis Karatzas, Univ. Autònoma de Barcelona (Spain)
Josep Lladós, Univ. Autònoma de Barcelona (Spain)

Published in SPIE Proceedings Vol. 9021:
Document Recognition and Retrieval XXI
Bertrand Coüasnon; Eric K. Ringger, Editor(s)

© SPIE. Terms of Use
Back to Top