This multi-university Industry/University Cooperative Research Center for Safety, Security and Rescue Research located at the University of South Florida and the University of Minnesota will bring together industry, academe, and public sector users together to provide integrative robotics and artificial intelligence solutions in robotics for activities conducted by the police, FBI, FEMA, firefighting, transportation safety, and emergency response to mass casuality-related activities. The need for safety, security, and rescue technologies has accelerated in the aftermath of 9/11 and a new research community is forming, as witnessed by the first IEEE Workshop on Safety, Security and Rescue Robotics in February 2003.
The Center will be built upon the knowledge and expertise of multi-disciplinary researchers in computer science, engineering, industrial organization, psychology, public health, and marine sciences at the University of South Florida (the lead institution) and the University of Minnesota.
One of the projects at the Center studied the contents and use of shared mental models for mixed teams of humans, robots, and agents in the context of urban search and rescue. The study was supported by an agent based simulator that was used for modeling complex systems with hundreds of agents in a large city and to study their behaviors under different circumstances. Agents were modeled as autonomous entities that interacted with other agents while moving around the city to search for victims and to stop fires from spreading. Humans interacted with the agents in making decisions and carrying them out. We focused on the use of shared team mental models in those mixed human-robot-agent teams, in situations where information was uncertain, and communication was limited and unreliable. Specific outcomes were: 1. development of shared mental models for mixed teams of humans, robots, and agents; 2. study of the effects of different levels of shared mental models on team performance; 3. measurement of the performance of the shared team mental models in complex urban search and rescue environments with uncertain knowledge and limited communication. Another outcome focused on incremental learning of people appearances for tracking across cameras. Modern surveillance systems have come a long way in the last decade having being transformed into automata providing security warnings about activities recorded in video streams. However, it is unrealistic to believe that these systems can be fully automatic given the unacceptable high rates of false alarms under which they operate. It is also often the case that security warnings generated by the various cameras in a surveillance network are viewed in isolation. Therefore, interpretation of activities resulting from analysis of video sequences characterized by limited temporal and spatial scope is an impossible task not only for computer/camera systems but for humans as well. This work put forth a new paradigm of video interpretation and information linking amongst multiple cameras. It assumed availability of information extracted by low level processing video streams, it continued with supervised learning for data organization and management across time and space and it finally ended with visualization of activity pertinent to human initiated queries. Specifically, we introduced a technique for dynamic clustering of people appearances by taking advantage of availability of data extracted from low level vision processes such as tracking and labeled data provided by the user who escalates security hypotheses. We also used supervised clustering to explore the manifold of people appearances in an incremental (or dynamic) fashion. Another outcome was about dictionary learning for robust background subtraction. Background subtraction is a fundamental task in many computer vision applications, such as robotics and automated surveillance systems. The performance of high-level vision tasks such as object detection and tracking is dependent on effective foreground detection techniques. In this work, we have developed a novel background modeling algorithm that represents the background as a linear combination of dictionary atoms and the foreground as a sparse error, when one uses the respective set of dictionary atoms as basis elements to linearly approximate/ reconstruct a new image. The dictionary atoms represent variations of the background model and they are learned from a batch of randomly sampled training frames. The sparse foreground is estimated during both training and performance phases, and this is formulated as a Lasso problem, while the dictionary update step in the training phase is motivated from the K-SVD algorithm. Our method works well in the presence of foreground in the training frames, and also gives the foreground masks for the training frames as a by-product of the batch training phase. Experimental validation was provided on standard datasets with ground truth information, and the results were very promising.