Share Email Print

Proceedings Paper

The spatial vision tree: a generic pattern recognition engine: scientific foundations, design principles, and preliminary tree design
Author(s): Zia-ur Rahman; Daniel J. Jobson; Glenn A. Woodell
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

New foundational ideas are used to define a novel approach to generic visual pattern recognition. These ideas proceed from the starting point of the intrinsic equivalence of noise reduction and pattern recognition when noise reduction is taken to its theoretical limit of explicit matched filtering. This led us to think of the logical extension of sparse coding using basis function transforms for both de-noising and pattern recognition to the full pattern specificity of a lexicon of matched filter pattern templates. A key hypothesis is that such a lexicon can be constructed and is, in fact, a generic visual alphabet of spatial vision. Hence it provides a tractable solution for the design of a generic pattern recognition engine. Here we present the key scientific ideas, the basic design principles which emerge from these ideas, and a preliminary design of the Spatial Vision Tree (SVT). The latter is based upon a cryptographic approach whereby we measure a large aggregate estimate of the frequency of occurrence (FOO) for each pattern. These distributions are employed together with Hamming distance criteria to design a two-tier tree. Then using information theory, these same FOO distributions are used to define a precise method for pattern representation. Finally the experimental performance of the preliminary SVT on computer generated test images and complex natural images is assessed.

Paper Details

Date Published: 15 April 2010
PDF: 11 pages
Proc. SPIE 7701, Visual Information Processing XIX, 77010K (15 April 2010); doi: 10.1117/12.851869
Show Author Affiliations
Zia-ur Rahman, Old Dominion Univ. (United States)
Daniel J. Jobson, NASA Langley Research Ctr. (United States)
Glenn A. Woodell, NASA Langley Research Ctr. (United States)

Published in SPIE Proceedings Vol. 7701:
Visual Information Processing XIX
Zia-ur Rahman; Stephen E. Reichenbach; Mark A. Neifeld, Editor(s)

© SPIE. Terms of Use
Back to Top