Share Email Print
cover

Proceedings Paper

Geometrical and statistical properties of vision models obtained via maximum differentiation
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

We examine properties of perceptual image distortion models, computed as the mean squared error in the response of a 2-stage cascaded image transformation. Each stage in the cascade is composed of a linear transformation, followed by a local nonlinear normalization operation. We consider two such models. For the first, the structure of the linear transformations is chosen according to perceptual criteria: a center-surround filter that extracts local contrast, and a filter designed to select visually relevant contrast according to the Standard Spatial Observer. For the second, the linear transformations are chosen based on statistical criterion, so as to eliminate correlations estimated from responses to a set of natural images. For both models, the parameters that govern the scale of the linear filters and the properties of the nonlinear normalization operation, are chosen to achieve minimal/maximal subjective discriminability of pairs of images that have been optimized to minimize/maximize the model, respectively (we refer to this as MAximum Differentiation, or “MAD”, Optimization). We find that both representations substantially reduce redundancy (mutual information), with a larger reduction occurring in the second (statistically optimized) model. We also find that both models are highly correlated with subjective scores from the TID2008 database, with slightly better performance seen in the first (perceptually chosen) model. Finally, we use a foveated version of the perceptual model to synthesize visual metamers. Specifically, we generate an example of a distorted image that is optimized so as to minimize the perceptual error over receptive fields that scale with eccentricity, demonstrating that the errors are barely visible despite a substantial MSE relative to the original image.

Paper Details

Date Published: 17 March 2015
PDF: 9 pages
Proc. SPIE 9394, Human Vision and Electronic Imaging XX, 93940L (17 March 2015); doi: 10.1117/12.2085653
Show Author Affiliations
Jesús Malo, Univ. de València (Spain)
Eero P. Simoncelli, New York Univ. (United States)


Published in SPIE Proceedings Vol. 9394:
Human Vision and Electronic Imaging XX
Bernice E. Rogowitz; Thrasyvoulos N. Pappas; Huib de Ridder, Editor(s)

© SPIE. Terms of Use
Back to Top