Experience is thought to play a critical role in shaping the cortical representations that support object recognition by creating neural responses that are selective for some dimensions of change and invariant to others. Although many previous studies have examined the effects of supervised training on object-selective regions of the brain, much less is known about the degree to which statistical regularities in the retinal input can directly shape the neural substrates involved in object recognition. Unsupervised learning is important because it allows the brain to employ simple self-organizing mechanisms that turn the continuous flux of visual input into the stable objects of our experience. While behavioral and computational work strongly suggests that unsupervised learning plays a key role in object recognition, most related neuroscience work examining the role of input statistics has focused on its effects in early visual areas. Here we propose experiments that combine cutting-edge techniques in fMRI, psychophysics, and computational modeling to examine two hypotheses concerning unsupervised learning in object recognition. First, we propose that neural responses may become tuned to match the range and frequency of shape and object exemplars experienced during unsupervised training. That is, neural responses will increase and become more selective for items seen more frequently during unsupervised training relative to infrequently seen or untrained items. This may provide a mechanism that improves discrimination performance for the stimuli seen most frequently. Second, behavioral and computational evidence suggests the intriguing hypothesis that the brain uses spatio-temporal correlations as a means of binding different images as belonging to the same object, allowing for recognition of the same object across dramatic transformations, such as changes in its appearance due to rotation.
We will determine whether spatio-temporal correlations in the visual input during unsupervised training increase the invariance of both brain responses and perceptual performance relative to similar items trained in an uncorrelated manner and to pre-training responses and performance. Finally, we will examine whether mechanisms of unsupervised learning generalize to supervised learning. In all of our experiments we will examine neural responses and performance both before and after unsupervised training, and use computational modeling to link fMRI data to possible underlying neural mechanisms, such as sharpening of neural tuning and increased firing rates. The proposed work will fill important gaps in knowledge by providing the first account of the neural mechanisms that generate effective representations for object recognition from the statistics of visual experience.

Public Health Relevance

The results of these studies will be important for understanding the role of visual experience in shaping normal visual representations. Because these mechanisms do not require explicit instruction, they are especially important for unraveling the means by which pre-verbal children and animals learn to recognize objects. Understanding these mechanisms will form a much-needed foundation for studying developmental disorders such as congenital prosopagnosia, autism, and Williams Syndrome. Further, if we find significant behavioral improvements due to the statistics of the visual inputs, these training paradigms may be used as an intervention to offset developmental visual disabilities.

Agency: National Institutes of Health (NIH)
Institute: National Eye Institute (NEI)
Type: Research Project (R01)
Project #: 5R01EY019279-04
Application #: 8266462
Study Section: Central Visual Processing Study Section (CVP)
Program Officer: Steinmetz, Michael A
Project Start: 2009-05-01
Project End: 2014-04-30
Budget Start: 2012-05-01
Budget End: 2013-04-30
Support Year: 4
Fiscal Year: 2012
Total Cost: $380,160
Indirect Cost: $142,560

Name: Stanford University
Department: Psychology
Type: Schools of Arts and Sciences
DUNS #: 009214214
City: Stanford
State: CA
Country: United States
Zip Code: 94305

Witthoft, Nathan; Nguyen, Mai Lin; Golarai, Golijeh et al. (2014) Where is human V4? Predicting the location of hV4 and VO1 from cortical folding. Cereb Cortex 24:2401-8
LaRocque, Karen F; Smith, Mary E; Carr, Valerie A et al. (2013) Global similarity and pattern separation in the human medial temporal lobe predict subsequent memory. J Neurosci 33:5466-74
Witthoft, Nathan; Winawer, Jonathan (2013) Learning, memory, and synesthesia. Psychol Sci 24:258-65
Davidenko, Nicolas; Flusberg, Stephen J (2012) Environmental inversion effects in face perception. Cognition 123:442-7
Weiner, Kevin S; Grill-Spector, Kalanit (2011) Not one extrastriate body area: using anatomical landmarks, hMT+, and visual field maps to parcellate limb-selective activations in human lateral occipitotemporal cortex. Neuroimage 56:2183-99
Weiner, Kevin S; Sayres, Rory; Vinberg, Joakim et al. (2010) fMRI-adaptation and category selectivity in human ventral temporal cortex: regional differences across time scales. J Neurophysiol 103:3349-65