Proceedings Paper

Document text segmentation using multiband disc model
Author(s): Chew Lim Tan; Bo Yuan

Paper Abstract

This paper proposes a multi-band disc model for document page segmentation, separating text blocks from graphic images. We first introduce the basic disc model and then discuss its improved multi-band version. The disc model takes a bottom-up approach: it establishes the local neighborhood of each object on a page and then traces the propagation of these neighborhoods until all objects in a text block are reached. The significance of the disc model lies in the link it establishes between the sizes of objects and their positional, and thus logical, relationships. Furthermore, the disc model is rotationally symmetric. It can therefore be applied to text with mixed typefaces and arbitrary outline shapes, and it is tolerant of skew or misalignment of the objects in the input image.
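
The abstract does not give the model's details, so the following is only a rough sketch of the general idea (size-dependent circular neighborhoods, propagated bottom-up), not the authors' algorithm. Everything specific in it is an assumption made for illustration: the function names `disc_radius` and `group_by_discs`, the `scale` factor, the rule that the radius grows with the larger object dimension, and the rule that two objects are neighbors only when each center lies inside the other's disc. The multi-band extension is not modeled.

```python
from collections import deque
from math import hypot


def disc_radius(width, height, scale=2.5):
    # Hypothetical rule: the disc radius is proportional to the object's size.
    return scale * max(width, height)


def group_by_discs(objects, scale=2.5):
    """Group objects whose discs mutually contain each other's centers.

    objects: list of (cx, cy, width, height) tuples, one per page object
    (for example, a connected component). Returns a list of groups, each a
    sorted list of indices into `objects`.
    """
    radii = [disc_radius(w, h, scale) for _, _, w, h in objects]
    unvisited = set(range(len(objects)))
    groups = []
    while unvisited:
        seed = min(unvisited)
        unvisited.remove(seed)
        queue = deque([seed])
        group = [seed]
        # Propagate the neighborhood until no new object can be reached.
        while queue:
            i = queue.popleft()
            xi, yi = objects[i][0], objects[i][1]
            reached = [
                j for j in unvisited
                # Assumed neighbor test: the distance between centers must
                # fit inside the smaller of the two discs, so a small glyph
                # next to a large picture is not linked to it.
                if hypot(objects[j][0] - xi, objects[j][1] - yi)
                <= min(radii[i], radii[j])
            ]
            for j in reached:
                unvisited.remove(j)
                group.append(j)
                queue.append(j)
        groups.append(sorted(group))
    return groups


# Three 10x10 glyphs on a line and one 200x200 picture below them.
page = [(0, 0, 10, 10), (20, 0, 10, 10), (40, 0, 10, 10), (40, 150, 200, 200)]
groups = group_by_discs(page)
```

In this sketch the first and third glyphs are too far apart to be direct neighbors and end up in one group only through the middle glyph, which is the propagation step the abstract describes. Because the test uses only distances between centers, rotating the whole page leaves the grouping unchanged, which mirrors the rotational symmetry claimed for the model.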

Paper Details

Date Published: 21 December 2000
PDF: 11 pages
Proc. SPIE 4307, Document Recognition and Retrieval VIII (21 December 2000)
Author Affiliations:
Chew Lim Tan, National Univ. of Singapore (Singapore)
Bo Yuan, National Univ. of Singapore (Singapore)

Published in SPIE Proceedings Vol. 4307:
Document Recognition and Retrieval VIII
Paul B. Kantor; Daniel P. Lopresti; Jiangying Zhou, Editor(s)
