Share Email Print
cover

Proceedings Paper

Characteristics of digitized images of technical articles
Author(s): Mahesh Viswanathan; George Nagy
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

Document image blocks characterized using projection profiles is proposed. We have collected statistical information on scanned pages of technical articles as a by-product of digitized document analysis. Specifically, in our hierarchical block segmentation and labeling approach (syntactic), 65 training and test pages from two publications were used. Additional information on compression and profiles was also used. Pixel-level information is required as input whether the analyzing tool is an expert system or something else. The issues covered are: (1) Profile characteristics of document objects like text, line drawings, tables, and half-tones, and the variation of these profiles with block size, type size, and direction of scan. (2) Speckle noise: sizes and distribution. (3) For the hierarchical (syntactic) approach, number of tree nodes at each level along with their areas, and comparison with node areas derived from transition-cut trees. (4) CCITT-Group 4 compression statistics on document sub-blocks and whole pages. (5) Size of postscript files and postscript commands used in printing these page files. We believe that these results would allow predicting some characteristics of a printed page digitized at any specified sampling rate.

Paper Details

Date Published: 1 August 1992
PDF: 12 pages
Proc. SPIE 1661, Machine Vision Applications in Character Recognition and Industrial Inspection, (1 August 1992); doi: 10.1117/12.130269
Show Author Affiliations
Mahesh Viswanathan, IBM/Storage Systems Products Div. (United States)
George Nagy, Rensselaer Polytechnic Institute (United States)


Published in SPIE Proceedings Vol. 1661:
Machine Vision Applications in Character Recognition and Industrial Inspection
Donald P. D'Amato; Wolf-Ekkehard Blanz; Byron E. Dom; Sargur N. Srihari, Editor(s)

© SPIE. Terms of Use
Back to Top