A fundamental goal of perceptual neuroscience is to understand the neuronal representations that underlie our remarkable ability to perceive, recognize, and remember visual objects. In humans and non-human primates, these representations are produced by processing along the ventral visual stream, and conveyed by patterns of neuronal activity in its highest level -- the monkey inferior temporal cortex (IT). The key computational problem the ventral stream solves is that it produces an IT neuronal representation of visual images that conveys selectivity for object identity and category, with tolerance ("invariance") to changes in object position, size, pose, illumination and clutter. Indeed, although the shape selectivity properties of the ventral stream have received much study, we know very little about the mechanisms that construct that tolerance. The goal of this proposal is a mechanistic understanding of how the ventral visual stream constructs the tolerant ("invariant") visual shape selectivity that underlies our object recognition abilities.
In Aim 1 we ask: does naturally-acquired temporally contiguous experience "instruct" the formation of tolerance in the ventral stream? We have recently discovered that the tolerance of IT neuronal shape selectivity can be strongly and rapidly sculpted by altered temporal contiguity of unsupervised visual object experience. In this aim, we will use a series of closely-related visual experience manipulations to systematically test and characterize the role of this plasticity in position, size, and pose tolerance learning. This will illuminate its role in instructing adult visual object representation, and set the stage for longer-term studies of how these powerful representations are assembled during early development.
In Aim 2 we will take a comparative approach to ask how object information is transformed across two ventral stream areas (V4 vs. IT)? Using the same monkeys, same task, and same visual stimuli, we will use neuronal population methods to ask: How is the tolerance of the IT representation changed from the V4 representation? Is V4 shape selectivity preserved in the IT representation? Does the sparseness of visual representation change from V4 to IT? How does tolerant shape selectivity evolve in real time? Together, these experiments will inform a central question: "How is the tolerant object selectivity in IT built from earlier visual representation?", and the results will provide strong constraints on computational models of the ventral visual stream and guide our understanding of cortical information transformation more generally.
Visual object recognition is fundamental to our well-being and our brain is remarkably good at solving this problem even though the same object can appear very differently to our eyes. The overarching goal of these experiments is a mechanistic understanding of how the visual system constructs the patterns of neuronal acitivity that solve this problem. This will lead to an understanding of the brain processes that allow us to see and evaluate the visual world (e.g. recognize and remember objects).
|Hong, Ha; Yamins, Daniel L K; Majaj, Najib J et al. (2016) Explicit information for category-orthogonal object properties increases along the ventral stream. Nat Neurosci 19:613-22|
|Majaj, Najib J; Hong, Ha; Solomon, Ethan A et al. (2015) Simple Learned Weighted Sums of Inferior Temporal Neuronal Firing Rates Accurately Predict Human Core Object Recognition Performance. J Neurosci 35:13402-18|
|Afraz, Arash; Boyden, Edward S; DiCarlo, James J (2015) Optogenetic and pharmacological suppression of spatial clusters of face neurons reveal their causal role in face gender discrimination. Proc Natl Acad Sci U S A 112:6730-5|
|Cadieu, Charles F; Hong, Ha; Yamins, Daniel L K et al. (2014) Deep neural networks rival the representation of primate IT cortex for core visual object recognition. PLoS Comput Biol 10:e1003963|
|Yamins, Daniel L K; Hong, Ha; Cadieu, Charles F et al. (2014) Performance-optimized hierarchical models predict neural responses in higher visual cortex. Proc Natl Acad Sci U S A 111:8619-24|
|Baldassi, Carlo; Alemi-Neissi, Alireza; Pagan, Marino et al. (2013) Shape similarity, better than semantic membership, accounts for the structure of visual object representations in a population of monkey inferotemporal neurons. PLoS Comput Biol 9:e1003167|
|Issa, Elias B; Papanastassiou, Alex M; DiCarlo, James J (2013) Large-scale, high-resolution neurophysiological maps underlying FMRI of macaque temporal lobe. J Neurosci 33:15207-19|
|DiCarlo, James J; Zoccolan, Davide; Rust, Nicole C (2012) How does the brain solve visual object recognition? Neuron 73:415-34|
|Issa, Elias B; DiCarlo, James J (2012) Precedence of the eye region in neural processing of faces. J Neurosci 32:16666-82|
|Li, Nuo; Dicarlo, James J (2012) Neuronal learning of invariant object representation in the ventral visual stream is not dependent on reward. J Neurosci 32:6611-20|
Showing the most recent 10 out of 22 publications