Proceedings Paper

Segmentation for MRC compression

Paper Abstract

Mixed Raster Content (MRC) is a standard for efficient document compression that can dramatically improve the compression/quality tradeoff compared to traditional lossy image compression algorithms. The key to MRC's performance is the separation of the document into foreground and background layers, with the assignment of pixels to layers represented as a binary mask. Typically, the foreground layer contains text colors, the background layer contains images and graphics, and the binary mask layer represents the fine detail of text fonts. The resulting quality and compression ratio of an MRC document encoder depend heavily on the segmentation algorithm used to compute the binary mask. In this paper, we propose a novel segmentation method based on the MRC standard (ITU-T T.44). The algorithm consists of two components: Cost Optimized Segmentation (COS) and Connected Component Classification (CCC). The COS algorithm is a blockwise segmentation algorithm formulated in a global cost optimization framework, while CCC is based on feature vector classification of connected components. In the experimental results, we show that the new algorithm achieves the same text detection accuracy as state-of-the-art commercial MRC products, but with fewer false detections of non-text features. The result is high-quality MRC-encoded documents with fewer non-text artifacts and a lower bit rate.
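
As an illustration of the three-layer model described in the abstract, the following minimal sketch shows how a decoder might recombine the layers: where the mask is 1 the pixel comes from the foreground layer, otherwise from the background layer. This is not the paper's segmentation algorithm (COS and CCC are not shown), and the function name `mrc_compose`, the type aliases, and the toy data are assumptions made for this example. It also assumes all three layers share the same resolution, which is a simplification.

```python
from typing import List, Tuple

# Hypothetical type aliases for this sketch (not from the paper or ITU-T T.44).
Pixel = Tuple[int, int, int]   # an RGB triple
Image = List[List[Pixel]]      # rows of pixels
Mask = List[List[int]]         # rows of 0/1 values


def mrc_compose(mask: Mask, foreground: Image, background: Image) -> Image:
    """Rebuild a page from three same-sized layers.

    A mask value of 1 selects the foreground pixel (e.g. text color);
    a mask value of 0 selects the background pixel (e.g. image or paper).
    """
    return [
        [fg if m else bg for m, fg, bg in zip(mask_row, fg_row, bg_row)]
        for mask_row, fg_row, bg_row in zip(mask, foreground, background)
    ]


# Toy 2x3 page: black "text" pixels over an off-white background.
BLACK: Pixel = (0, 0, 0)
PAPER: Pixel = (250, 250, 245)

mask = [[1, 0, 1],
        [0, 0, 1]]
foreground = [[BLACK] * 3 for _ in range(2)]
background = [[PAPER] * 3 for _ in range(2)]

page = mrc_compose(mask, foreground, background)
```

In this toy case the sharp text shape lives entirely in the mask, while the foreground and background layers are flat and would compress well on their own. That is why the quality of the segmentation that produces the mask matters so much to the encoder.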

Paper Details

Date Published: 29 January 2007
PDF: 12 pages
Proc. SPIE 6493, Color Imaging XII: Processing, Hardcopy, and Applications, 649304 (29 January 2007); doi: 10.1117/12.711692
Eri Haneda, Purdue Univ. (United States)
Jonghyon Yi, Samsung Electronics (South Korea)
Charles A. Bouman, Purdue Univ. (United States)

Published in SPIE Proceedings Vol. 6493:
Color Imaging XII: Processing, Hardcopy, and Applications
Reiner Eschbach; Gabriel G. Marcu, Editors

© SPIE.