Proceedings Paper

Introduction of statistical information in a syntactic analyzer for document image recognition
Author(s): André O. Maroneze; Bertrand Coüasnon; Aurélie Lemaitre

Paper Abstract

This paper presents an improvement to a document layout analysis system that offers a possible solution to Sayre's paradox ("a letter must be recognized before it can be segmented; and it must be segmented before it can be recognized"). The improvement, based on stochastic parsing, integrates statistical information obtained from recognizers into syntactic layout analysis. We show how this fusion of numeric and symbolic information in a feedback loop can be applied to syntactic methods to simplify document description. To limit combinatorial explosion during the exploration of solutions, we devised an operator that allows optional activation of the stochastic parsing mechanism. Our evaluation on 1250 handwritten business letters shows that this method improves global recognition scores.
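
The idea of optionally activating score-driven parsing can be illustrated with a toy sketch. This is not the authors' system or grammar formalism: the names `toy_recognizer` and `parse_field`, the lexicon, and the confidence values are all hypothetical, and multiplying per-segment confidences is only one assumed way to combine recognizer scores. With the stochastic flag off, the parser commits to the first segmentation hypothesis; with it on, every hypothesis is scored by the recognizer and the best one is kept, at the cost of exploring more alternatives.

```python
from math import prod
from typing import Callable, List, Optional, Sequence, Tuple

Segmentation = Sequence[str]


def toy_recognizer(segment: str) -> float:
    """Hypothetical recognizer: returns a confidence in [0, 1] for one segment."""
    # Assumed scores, for illustration only.
    lexicon = {"dear": 0.9, "sir": 0.8, "de": 0.3, "ar": 0.2, "dears": 0.1, "ir": 0.2}
    return lexicon.get(segment, 0.05)


def parse_field(
    alternatives: List[Segmentation],
    recognizer: Callable[[str], float],
    stochastic: bool,
) -> Optional[Tuple[Segmentation, float]]:
    """Choose one segmentation hypothesis for a field.

    stochastic=False: keep the first hypothesis (no exploration).
    stochastic=True:  score every hypothesis and keep the most confident one.
    """
    if not alternatives:
        return None

    def score(segmentation: Segmentation) -> float:
        # Assumed combination rule: product of per-segment confidences.
        return prod(recognizer(segment) for segment in segmentation)

    if not stochastic:
        first = alternatives[0]
        return first, score(first)

    # Recognition feeds back into segmentation: the recognizer's scores
    # decide which segmentation hypothesis survives.
    return max(((alt, score(alt)) for alt in alternatives), key=lambda pair: pair[1])


hypotheses = [["de", "ar", "sir"], ["dear", "sir"], ["dears", "ir"]]
deterministic_choice = parse_field(hypotheses, toy_recognizer, stochastic=False)
stochastic_choice = parse_field(hypotheses, toy_recognizer, stochastic=True)
```

In this sketch the flag is set per call, so the more expensive exploration can be limited to the parts of a document description where segmentation is ambiguous, which is one way to keep the number of explored solutions under control.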

Paper Details

Date Published: 24 January 2011
PDF: 11 pages
Proc. SPIE 7874, Document Recognition and Retrieval XVIII, 787404 (24 January 2011); doi: 10.1117/12.873393
André O. Maroneze, INSA (France)
Bertrand Coüasnon, INSA (France)
Aurélie Lemaitre, Univ. of Rennes 2 (France)

Published in SPIE Proceedings Vol. 7874:
Document Recognition and Retrieval XVIII
Gady Agam; Christian Viard-Gaudin, Editor(s)
