Share Email Print

Proceedings Paper

Better PostScript than PostScript: portable self-extracting PostScript representation of scanned document images
Author(s): Qin Zhang; John M. Danskin
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

We present a pattern matching based compression (PMBC) system which compresses scanned documents into postscript format. The output of a PMBC system is a pattern library, or font, and a series of pattern indices and positions. PMBC represents scanned documents in the same way that word processing programs represent their output pages. We explore various postscript representations of this output file, choosing the one resulting in the smallest output after compression with gzip. The resulting postscript file doesn't require a separate decompression program to view and print, and is at least 50 percent smaller than the postscript files generated by other conventional programs, such as tifftops.

Paper Details

Date Published: 3 April 1997
PDF: 12 pages
Proc. SPIE 3027, Document Recognition IV, (3 April 1997); doi: 10.1117/12.270075
Show Author Affiliations
Qin Zhang, Dartmouth College (United States)
John M. Danskin, Dartmouth College (United States)

Published in SPIE Proceedings Vol. 3027:
Document Recognition IV
Luc M. Vincent; Jonathan J. Hull, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?