Share Email Print

Proceedings Paper

Attention as a Bayesian inference process
Author(s): Sharat Chikkerur; Thomas Serre; Cheston Tan; Tomaso Poggio
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

David Marr famously defined vision as "knowing what is where by seeing". In the framework described here, attention is the inference process that solves the visual recognition problem of what is where. The theory proposes a computational role for attention and leads to a model that performs well in recognition tasks and that predicts some of the main properties of attention at the level of psychophysics and physiology. We propose an algorithmic implementation a Bayesian network that can be mapped into the basic functional anatomy of attention involving the ventral stream and the dorsal stream. This description integrates bottom-up, feature-based as well as spatial (context based) attentional mechanisms. We show that the Bayesian model predicts well human eye fixations (considered as a proxy for shifts of attention) in natural scenes, and can improve accuracy in object recognition tasks involving cluttered real world images. In both cases, we found that the proposed model can predict human performance better than existing bottom-up and top-down computational models.

Paper Details

Date Published: 17 March 2011
PDF: 10 pages
Proc. SPIE 7865, Human Vision and Electronic Imaging XVI, 786511 (17 March 2011); doi: 10.1117/12.876734
Show Author Affiliations
Sharat Chikkerur, Massachusetts Institute of Technology (United States)
Thomas Serre, Massachusetts Institute of Technology (United States)
Brown Univ. (United States)
Cheston Tan, Massachusetts Institute of Technology (United States)
Tomaso Poggio, Massachusetts Institute of Technology (United States)

Published in SPIE Proceedings Vol. 7865:
Human Vision and Electronic Imaging XVI
Bernice E. Rogowitz; Thrasyvoulos N. Pappas, Editor(s)

© SPIE. Terms of Use
Back to Top