Visual perception seems trivially easy. Nevertheless, it has long withstood the attempts of both computer scientists and vision scientists to crack its code, most likely because past theories were based predominantly on conscious vision. In order for perceivers to experience a coherent visual world, perceptual input must be organized into separate objects, or "figures." As part of this process, the brain decides where a figure lies with respect to every border between two contiguous regions of space. Consider a vertical border, for instance. The brain decides whether that border is a boundary for a figure lying on the left or the right. When the border is perceived as a boundary of a figure lying on the left, the region on the right seems simply to continue behind the figure at the border; no shape is perceived there. These "figure-ground" decisions are made outside of conscious awareness. Because of this, it is extremely difficult to investigate the mechanisms that produce figure-ground perception. An important, unanswered, question is whether figure-ground perception is accomplished via fast, feed-forward mechanisms or whether iterative mechanisms involving feedback from higher processing levels are involved. In feed-forward models, the input is processed in successive stages until a coherent percept emerges. In iterative models, feed-forward processing is not sufficient; feedback from higher to lower levels is necessary to create the percept.

Mary Peterson and her colleagues at the University of Arizona have designed visual displays that allow them to investigate this question. One display type is a small symmetric, bounded silhouette lying on a larger ground. Portions of familiar shapes are hidden along the groundside of the silhouette's border; these shapes are not perceived consciously, only the shape of the enclosed silhouette is perceived consciously. (The classic "face-vase illusion" is an example of this.) Previous experiments have shown that the shape of the hidden object is suppressed when it is not perceived, supporting the view that figure-ground perception results from inhibitory competition between shapes that might be seen on opposite sides of a border; the winner is perceived as the shaped figure, whereas the loser is suppressed and the portion of space where it might have been seen is perceived as a shapeless ground. These displays are designed to isolate competition at the shape processing stage. Dr. Peterson and colleagues will also conduct a series of experiments which test whether suppression can be observed at lower levels where individual parts are represented and at higher levels where shape descriptions ("semantics") are represented. According to a feed-forward account, suppression of the losing shape would prevent access to its semantics; hence, no effects should be evident at higher levels. Without feedback, no effects should be evident at lower, part-processing, levels either. An iterative view could account for suppression at lower and/or higher levels by assuming that the outcome of the between-shape competition was relayed to higher and lower levels. Thus the proposed experiments will adjudicate between these competing views of how perception occurs. The researchers will also attempt to identify the processes involved in segregating figures from grounds in crowded real-world scenes. The perception of these more complex displays constitutes a challenge to current iterative models of figure-ground perception. The planned research will provide a foundation for neurophysiological experiments and formal computational models of vision and will contribute to our understanding of the temporal and spatial dynamics of shape perception.

Project Report

Most people assume that visual perception requires only a pair of eyes with acuity sufficient to identify the letters in line 6 of the optometrist’s chart. But, the eyes are insufficient for vision; the brain plays a huge role. The brain finds the best way to interpret the light received by the eyes as objects and scenes. What do you see in Figure 1? Most people see an unfamiliar black object on a white background. Insert Figure 1 about here Yet if you look at the white regions close to the left and right sides, you can see portions of familiar objects — white seahorses in profile. Pictures like this allow us to investigate a critical question regarding visual perception: Is visual perception best understood as a sequential process in which light received by the eye is processed in a one-way street in the brain from low to high levels until, at some high level, perception occurs and conceptual knowledge is accessed? Or, does visual perception involve a two-way street with communication between levels occurring in a backward direction as well as a forward direction? According to the one-way street hypothesis, there is no reason for the brain to process the areas on the outside of black silhouettes like the one in Figure 1; an object will not ultimately be perceived there, so it is wasteful to expend processing resources evaluating what might be there. In contrast, the two-way street hypothesis posits that the brain evaluates all objects that might be present and selects the best interpretation for the scene; the selection process involves backward communication from high to low levels as well as forward communication from low to high levels. By testing people with pictures like Figure 1 during this award period, my students and I found that before you see objects, your brain considers alternatives and then decides what you will see. These alternatives compete outside of conscious awareness, and the viewer is aware only of the object that wins the competition. For the image in Figure 1, the brain considers white seahorses as well as the unfamiliar black object. Conceptual knowledge about the white seahorses –that they are natural objects rather than artificial objects, for instance – is also activated earl in the course of perceptual processing. However, the black object tends to win the competition, and that is the object observers perceive. Observers tend to remain unaware of the seahorses; they are even unaware that there might be an alternative interpretation. We found that similar processes occur for whole scenes, and measured the time required to resolve scene-wide competition, even in displays that seem unambiguous because the vast majority of observers perceive the same thing. These results support the two-way street hypothesis. Intellectual Merit. My students and I are the first to find evidence that conceptual knowledge is activated before the brain determines whether to perceive a object on the inside or the outside of silhouettes like those in Figure 1. We are also the first to show that the visual system considers alternative interpretations of scenes that are unambiguous by perceptual report. These findings are difficult to explain on the prevailing one-way processing models. In challenging those models, our results advance the frontiers of knowledge. Broader Impacts. In the course of conducting this research, five graduate students in my laboratory were exposed to cutting-edge research, and were trained in theoretical thinking, in careful experimental design, in data analysis and interpretation, as well as in how to present their research in professional as well as public settings. Together we have disseminated the research broadly, via 19 publications (and another 4 under review), and 64 national and international conference presentations during the award period. Because the University of Arizona is located in the southwest United States, it attracts graduate students from Native American and Mexican American cultures. Two of the graduate students who worked on these projects came from those backgrounds. One has completed his Ph.D. and is employed in business as an experimental cognitive psychologist. The other student plans to complete her Ph.D. this fall. Two other students, one male and one female, who worked on grant-related projects completed their Ph.D.’s this summer. Those students are currently in post-doctoral fellowship positions, and eventually plan to seek employment in a university setting. The fifth student is a second-year female graduate student in the lab. Thus this project has contributed substantially to the training of 5 graduate students in a STEM field, 3 women, and 2 from minority backgrounds. Understanding how visual perception proceeds is of great value to computer vision scientists who seek to create machines that "see" as well as humans do. It is also of interest to neuroscientists who seek to unravel the complexities of neural computation.

Project Start
Project End
Budget Start
2010-06-15
Budget End
2014-05-31
Support Year
Fiscal Year
2009
Total Cost
$444,193
Indirect Cost
Name
University of Arizona
Department
Type
DUNS #
City
Tucson
State
AZ
Country
United States
Zip Code
85721