Share Email Print

Proceedings Paper

Separation of text and background regions for high performance document image compression
Author(s): Wei Fan; Jun Sun; Satoshi Naoi
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

We describe a document image segmentation algorithm to classify a scanned document into different regions such as text/line drawings, pictures, and smooth background. The proposed scheme is relatively independent of variations in text font style, size, intensity polarity and of string orientation. It is intended for use in an adaptive system for document image compression. The principal parts of the algorithm are the generation of the foreground and background layers and the application of hierarchical singular value decomposition (SVD) in order to smoothly fill the blank regions of both layers so that the high compression ratio can be achieved. The performance of the algorithm, both in terms of its effectiveness and computational efficiency, was evaluated using several test images and showed superior performance compared to other techniques.

Paper Details

Date Published: 8 February 2015
PDF: 12 pages
Proc. SPIE 9402, Document Recognition and Retrieval XXII, 94020K (8 February 2015); doi: 10.1117/12.2075416
Show Author Affiliations
Wei Fan, Fujitsu Research and Development Center Co., Ltd. (China)
Jun Sun, Fujitsu Research and Development Center Co., Ltd. (China)
Satoshi Naoi, Fujitsu Research and Development Center Co., Ltd. (China)

Published in SPIE Proceedings Vol. 9402:
Document Recognition and Retrieval XXII
Eric K. Ringger; Bart Lamiroy, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?