Share Email Print

Proceedings Paper

Document recognition: an attribute grammar approach
Author(s): Mahesh Viswanathan; Edward Green; Mukkai Krishnamoorthy
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

A formulation of a hierarchical page decomposition technique for technical journal pages using attribute grammars is presented. In this approach, block-grammars are recursively applied until a page is classified into its most significant sub-blocks. While a grammar devised for each block depends on its logical function, it is possible to formulate a generic description for all block grammars using attribute grammars. This attribute grammar formulation forms a generic framework on which this syntactic approach is based, while the attributes themselves are derived from publication-specific knowledge. The attribute extraction process and the formulation itself are covered in this paper. We discuss an application of attribute grammars to a document analysis problem, the extraction of logical, relational information from the image of tables.

Paper Details

Date Published: 7 March 1996
PDF: 11 pages
Proc. SPIE 2660, Document Recognition III, (7 March 1996); doi: 10.1117/12.234695
Show Author Affiliations
Mahesh Viswanathan, IBM T. J. Watson Research Ctr. (United States)
Edward Green, Rensselaer Polytechnic Institute (United States)
Mukkai Krishnamoorthy, Rensselaer Polytechnic Institute (United States)

Published in SPIE Proceedings Vol. 2660:
Document Recognition III
Luc M. Vincent; Jonathan J. Hull, Editor(s)

© SPIE. Terms of Use
Back to Top