Abstract
We propose an algorithm for the determination of three dimensional shape and perspective based on the response of the human visual system to changes in visual textures. Current computer vision algorithms are computationally intensive and show inherent difficulties in integrating additional cues for the determination of shape, such as shading, contour, or motion. In order to develop a fast and simple mechanism less constrained for integrating other cues, we incorporated aspects of the physiological properties of cortical cells in VI into a network model. We provide psychophysical evidence that the local spatial frequency spectrum is represented by the spatially averaged peak frequency (APF). After normalization, this APF measures texture compression and leads to estimates of 3D shape and depth. Simulations of the model show good agreement with human responses to a range of textured images.<>