Share Email Print

Proceedings Paper

Segmentation of document images
Author(s): Philip J. Bones; Todd C. Griffin; Chris M. Carey-Smith
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

Document scanning is now an accepted part of office procedure, allowing the incorporation of digitized images into new documents and the conversion of scanned print into ASCII by optical character recognition ( OCR). Often document pages contain more than one form of information - textual, graphical and/or pictorial. Segmentation of document images into these three categories is feasible with the aid of image processing. Projections of the thresholded document images in conjunction with autocorrelation are used to check text alignment. Then the edge shifting properties of the rank filter are used to coalesce image regions containing text into solid near-rectangular blocks. Pyramidal reduction is combined with the filtering to ease the computational burden. Horizontal and vertical projections are used to segment whole pages recursively into homogeneous blocks whose properties are then analysed. Applications forseen for the image segmentation include modified facsimile systems, achievement of artifact-free OCR and conversion of document images into files with separate formats for text, graphics and pictures.

Paper Details

Date Published: 1 August 1990
PDF: 11 pages
Proc. SPIE 1258, Image Communications and Workstations, (1 August 1990); doi: 10.1117/12.19950
Show Author Affiliations
Philip J. Bones, Univ. of Canterbury (New Zealand)
Todd C. Griffin, Univ. of Canterbury (New Zealand)
Chris M. Carey-Smith, Univ. of Canterbury (New Zealand)

Published in SPIE Proceedings Vol. 1258:
Image Communications and Workstations
Walter Bender; Mitsunaga Saito, Editor(s)

© SPIE. Terms of Use
Back to Top