Share Email Print

Proceedings Paper

Text block segmentation using pyramid structure
Author(s): Chew Lim Tan; Zheng Zhang
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Text block segmentation is necessary in document layout analysis. An algorithm and its implementation that segregates text block by block (a block is either a title or a paragraph) from the provided document, e.g. newspaper image, based on pyramid structure is described in this paper. The pyramid structure, which is amenable for parallel processing on output, is a multi-resolution image representation. The pyramid structure also simulates what the human eyes see the document from afar visualizing the block structure of the document, the block segmentation can identify the titles, and distinguish different paragraphs based on the indentation between them. Our implementation will be used in a news articles retrieval project.

Paper Details

Date Published: 21 December 2000
PDF: 10 pages
Proc. SPIE 4307, Document Recognition and Retrieval VIII, (21 December 2000); doi: 10.1117/12.410849
Show Author Affiliations
Chew Lim Tan, National Univ. of Singapore (Singapore)
Zheng Zhang, National Univ. of Singapore (Singapore)

Published in SPIE Proceedings Vol. 4307:
Document Recognition and Retrieval VIII
Paul B. Kantor; Daniel P. Lopresti; Jiangying Zhou, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?