The core goal of the MIT project is to understand object recognition in naturally cluttered scenes in the cortical ventral visual stream by physiological experiments in primates guided by, and integrated with, a quantitative computational model. The theory and its computer simulations drive the formulation of intedocked hypotheses for different experiments, their planning and the analysis of their results. The plan is: A) to measure invariance and selectivity of IT cells in tasks of object identification with and without natural clutter for different classes of objects to elucidate the neural mechanisms underlying recognition in cluttered, natural scenes; and B) to study the neural correlates of categorization and the relation between categorization and identification suggested by the model.
Our specific aims are: A.1 to determine with single cells recordings the shape tuning of single neurons in IT cortex and their baseline translation invariance for different stimulus classes and clutter conditions; A.2 to measure recognition performance in clutter and correlate it with IT neuron responses; B.1 to verify whether there is a common neural substrate for different recognition tasks, in particular identification and categorization; B.2 to study generalization of categorization in PFC to novel objects and the neural basis of Categorical Perception. The MIT group will collaborate with CalTech in extending the existing quantitative model of recognition -- based on the physiology data from DiCado's lab and the psychophysical data from Caltech --- to include attention and saliency when clutter becomes too severe and the recognition task becomes too difficult for a simple forward flow of information. MIT will also connect the biophysical work of Northwestern and CalTech to the physiology work in IT (DiCarlo) and PFC (Miller) by simulating the implications of properties of the MAX circuitry --- as characterized by intracellular recordings at Northwestern in cat striate cortex --for the recognition model and specifically for its predictions of the selectivity and invariance properties of IT cells in clutter conditions. The corresponding experiments in visual cortex will be done in Jim DiCarlo's and Earl Miller's labs at MIT.

Agency
National Institute of Health (NIH)
Institute
National Institute of Mental Health (NIMH)
Type
Exploratory Grants (P20)
Project #
1P20MH066239-01A1
Application #
6824628
Study Section
Special Emphasis Panel (ZMH1)
Project Start
2003-09-30
Project End
2007-07-31
Budget Start
Budget End
Support Year
1
Fiscal Year
2003
Total Cost
Indirect Cost
Name
Massachusetts Institute of Technology
Department
Type
DUNS #
City
Cambridge
State
MA
Country
United States
Zip Code
02139
Baldassi, Carlo; Alemi-Neissi, Alireza; Pagan, Marino et al. (2013) Shape similarity, better than semantic membership, accounts for the structure of visual object representations in a population of monkey inferotemporal neurons. PLoS Comput Biol 9:e1003167
Glezer, Laurie S; Jiang, Xiong; Riesenhuber, Maximilian (2009) Evidence for highly selective neuronal tuning to whole words in the ""visual word form area"". Neuron 62:199-204
Riesenhuber, Maximilian; Wolff, Brian S (2009) Task effects, performance levels, features, configurations, and holistic face processing: a reply to Rossion. Acta Psychol (Amst) 132:286-92
Kouh, Minjoon; Poggio, Tomaso (2008) A canonical neural circuit for cortical nonlinear operations. Neural Comput 20:1427-51
Einhauser, Wolfgang; Koch, Christof; Makeig, Scott (2007) The duration of the attentional blink in natural scenes depends on stimulus category. Vision Res 47:597-607
Jiang, Xiong; Bradley, Evan; Rini, Regina A et al. (2007) Categorization training results in shape- and category-selective human neural plasticity. Neuron 53:891-903
Finn, Ian M; Priebe, Nicholas J; Ferster, David (2007) The emergence of contrast-invariant orientation tuning in simple cells of cat visual cortex. Neuron 54:137-52
Finn, Ian M; Ferster, David (2007) Computational diversity in complex cells of cat primary visual cortex. J Neurosci 27:9638-48
Serre, Thomas; Oliva, Aude; Poggio, Tomaso (2007) A feedforward architecture accounts for rapid categorization. Proc Natl Acad Sci U S A 104:6424-9
Cadieu, Charles; Kouh, Minjoon; Pasupathy, Anitha et al. (2007) A model of V4 shape selectivity and invariance. J Neurophysiol 98:1733-50

Showing the most recent 10 out of 12 publications