Share Email Print

Proceedings Paper

Rotation and scale invariant pattern recognition using a multistaged neural network
Author(s): Jay I. Minnix; Eugene S. McVey; Rafael M. Inigo
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

This paper presents a pattern recognition system that self-organizes to recognize objects by shape. The images are processed using a log-polar transformation that maps rotations and magnifications into representative translations. The systems then uses a multistaged hierarchical neural network that exhibits insensitivity to translations in representation space, which corresponds to rotations and scalings in the image space. The network's three layers perform the functionally disjoint tasks of preprocessing (dynamic thresholding), invariance (position normalization), and recognition (identification of the shape). The Preprocessing stage uses a single layer of elements to dynamically threshold the grey level input image into a binary image. The Invariance stage is a multilayered neural network implementation of a modified Walsh-Hadamard transform that generates a representation of the object that is invariant with respect to the object's position, which maps back to an invariance to rotational orientation and/or size. The Recognition stage is a modified version of Fukushima's Neocognitron that identifies the normalized representation by shape. The resulting network can successfully recognize objects that have been rotated, scaled, or a combination of both. The network uses a small number of fairly simple elements, a subset of which self-organize to produce the recognition performance.

Paper Details

Date Published: 1 November 1991
PDF: 11 pages
Proc. SPIE 1606, Visual Communications and Image Processing '91: Image Processing, (1 November 1991); doi: 10.1117/12.50321
Show Author Affiliations
Jay I. Minnix, Stanford Telecommunications, Inc. (United States)
Eugene S. McVey, Univ. of Virginia (United States)
Rafael M. Inigo, Univ. of Virginia (United States)

Published in SPIE Proceedings Vol. 1606:
Visual Communications and Image Processing '91: Image Processing
Kou-Hu Tzou; Toshio Koga, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?