Share Email Print

Proceedings Paper

Document image segmentation using a two-stage neural network
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

In this paper, we present a new system to segment and label document images into text, halftone images, and background using feature extraction and unsupervised clustering. Each pixel is assigned a feature pattern consisting of a scaled family of differential geometrical invariant features and texture features extracted from the cooccurence matrix. The invariant feature pattern is then assigned to a specific region using a two-stage neural network system. The first stage is a self-organizing principal components analysis (SOPCA) network that is used to project the feature vector onto its leading principal axes found by using principal components analysis. Using the SOPCA algorithm, we can train the SOPCA network to project our feature vector orthogonally onto the subspace spanned by the eigenvectors belonging to the largest eigenvalues. By doing that we ensure that the vector is represented by a reduced number of effective features. The next step is to cluster the output of the SOPCA network into different regions. This is accomplished using a self-organizing feature-map (SOFM) network. In this paper, we demonstrate the power of the SOPCA-SOFM approach to segment document images into text, halftone, and background.

Paper Details

Date Published: 14 April 2000
PDF: 9 pages
Proc. SPIE 3962, Applications of Artificial Neural Networks in Image Processing V, (14 April 2000); doi: 10.1117/12.382919
Show Author Affiliations
Mohamed Nooman Ahmed, Lexmark International, Inc. (United States)
Brian E. Cooper, Lexmark International, Inc. (United States)
Shaun T. Love, Lexmark International, Inc. (United States)

Published in SPIE Proceedings Vol. 3962:
Applications of Artificial Neural Networks in Image Processing V
Nasser M. Nasrabadi; Aggelos K. Katsaggelos, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?