Share Email Print

Proceedings Paper

Document Image Analysis For Reading Books
Author(s): Yoshitake Tsuji; Jun Tsukumo; Ko Asai
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

A fundamental problem in machine vision is to detect and identify special objects in an image. In the field of machine-reading for existing printed matter and books, a very important technique allows extracting and recognizing characters in desired text lines from a document image. This paper describes a hierarchical image segmentation, which separates a document image into its entities. Furthermore, a character segmentation, with minimum variance criterion, and a character recognition, based on three improved loci feature, have been developed as two elemental methods for reading books. In these experimental results using different commercial Japanese pocket books, 99% of text lines were correctly extracted. Also, it was successful in reading 99.30% of the Japanese characters and Chinese ideographs, as used in printed text.

Paper Details

Date Published: 14 October 1987
PDF: 9 pages
Proc. SPIE 0804, Advances in Image Processing, (14 October 1987); doi: 10.1117/12.941320
Show Author Affiliations
Yoshitake Tsuji, NEC Corporation (Japan)
Jun Tsukumo, NEC Corporation (Japan)
Ko Asai, NEC Corporation (Japan)

Published in SPIE Proceedings Vol. 0804:
Advances in Image Processing
Andre J. Oosterlinck; Andrew G. Tescher, Editor(s)

© SPIE. Terms of Use
Back to Top