Share Email Print

Proceedings Paper

Pixel and semantic capabilities from an image-object based document representation
Author(s): Michael Gormish; Kathrin Berkner; Martin Boliek; Guotong Feng; Edward L. Schwartz
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

This paper reports on novel and traditional pixel and semantic operations using a recently standardized document representation called JPM. The JPM representation uses compressed pixel arrays for all visible elements on a page. Separate data containers called boxes provide the layout and additional semantic information. JPM and related image-based document representation standards were designed to obtain the most rate efficient document compression. The authors, however, use this representation directly for operations other than compression typically performed either on pixel arrays or semantic forms. This paper describes the image representation used in the JPM standard and presents techniques to (1) perform traditional raster-based document analysis on the compressed data, (2) transmit semantically meaningful portions of compressed data between devices, (3) create multiple views from one compressed data stream, and (4) edit high resolution document images with only low resolution proxy images.

Paper Details

Date Published: 29 January 2007
PDF: 11 pages
Proc. SPIE 6500, Document Recognition and Retrieval XIV, 65000I (29 January 2007); doi: 10.1117/12.704502
Show Author Affiliations
Michael Gormish, Ricoh Innovations, Inc. (United States)
Kathrin Berkner, Ricoh Innovations, Inc. (United States)
Martin Boliek, Ricoh Innovations, Inc. (United States)
Guotong Feng, Ricoh Innovations, Inc. (United States)
Edward L. Schwartz, Ricoh Innovations, Inc. (United States)

Published in SPIE Proceedings Vol. 6500:
Document Recognition and Retrieval XIV
Xiaofan Lin; Berrin A. Yanikoglu, Editor(s)

© SPIE. Terms of Use
Back to Top