Share Email Print

Proceedings Paper

Table recognition for automated document entry system
Author(s): Haruhiko Kojima; Teruo Akiyama
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Most documents include various layout objects such as headlines text lines charts and tables. In particular tables are powerful tools that allow large quantities of data to be easily understood. An automated document entry system is needed that can recognize the document layout objects and extract the information from tables. In this paper an effective table recognition method is described. The proposed method is composed of three steps: (1) document layout structure recognition (2) table layout structure recognition (3) table content recognition. To develop the table layout structure recognition step we first examined the layout structure of tables in existing documents and classified several common structures. As a result of the examination we created ten rules and designed a ruled line and box extraction algorithm based on these rules. The effectiveness of the proposed method has been confirmed in experiments. Accordingly the proposed method will greatly contribute to the creation of an automated document entry system to allow faster document recognition and permit the data in tables to be extracted.

Paper Details

Date Published: 1 February 1991
PDF: 8 pages
Proc. SPIE 1384, High-Speed Inspection Architectures, Barcoding, and Character Recognition, (1 February 1991); doi: 10.1117/12.25330
Show Author Affiliations
Haruhiko Kojima, NTT Human Interface Labs. (Japan)
Teruo Akiyama, NTT Intelligent Technology Co., Ltd. (Japan)

Published in SPIE Proceedings Vol. 1384:
High-Speed Inspection Architectures, Barcoding, and Character Recognition
Michael J. W. Chen, Editor(s)

© SPIE. Terms of Use
Back to Top