Machine learning is a powerful tool for artificial intelligence and data mining problems. However, its success critically relies on a good feature representation of the data; therefore, the problem of feature construction poses a fundamental challenge. In recent years, representation learning has emerged as a promising method for learning useful feature representations from data. However, the current state-of-the-art methods are still limited in building intelligent agents that can learn and interact with complex environments and large amounts of sensory input. Specifically, the majority of the existing methods cannot scale well to large-scale data.

The goal of this project is to fill this gap by formulating a new framework that can effectively learn representations from complex environments and scale to large data. Specifically, we propose novel approaches for learning robust representations from large-scale data by (1) controlling the complexity of the feature representations and (2) adaptively modeling relevant patterns in the presence of significant amounts of irrelevant patterns or noise.

Key intellectual contributions of this project will be (1) a novel framework of representation learning that provides robust representations from large amounts of unlabeled data and relatively small amounts of labeled data, and (2) theoretical and algorithmic advances for inference, learning, and related optimization problems in representation learning for large-scale, complex sensory information processing.

This work will serve as a catalyst leading to applications, such as multimedia processing and search, medical image processing, speech recognition, and autonomous navigation. The results will be disseminated through publications and free software.

Project Report

In this project, we developed representation learning algorithms that can learn meaningful features from large amount of data automatically in deep learning framework. In one thrust, we advanced the training methods for deep learning algorithms by reducing the effort in model complexity selection. The "latent" variable plays an important role in deep learning since it controls the model capacity; if we have too many latent variables, the model overfits and does not generalize well to unseen data; if we have too few latent variables, the model underfits and does not represent the data well. We developed novel learning frameworks that finds an optimal model complexity. Specifically, our model adaptively increases or decreases the number of latent variables by adding new latent variables or merging existing similar latent variables depending on the current models’ fit to the data. In addition, our model can discover the appropriate number of semantically meaningful mid-level features (e.g., attributes) in weakly supervised setting from (virtually) infinite amount of latent variables. In another thrust, we developed algorithms for learning robust features to irrelevant patterns, such as patterns in the background or random noise. Learning robust features to the noise is essential in many applications of computer vision (e.g., natural image understanding) and medical imaging (e.g., analysis of CT scans) since the images are complex and highly variable, containing a lot of information that are not relevant to the task of interest. Our methods can effectively remove different types of unseen noise, which lead to robust recognition. Furthermore, when there exists a weak human supervision, our proposed learning algorithm can learn to focus on the relevant region (e.g., learning from the object region instead of background clutters). Besides the visual perception, our models have a great potential for other domains such as speech recognition where the data could be cluttered by the surrounding "wild" environment.

Project Start
Project End
Budget Start
2012-08-01
Budget End
2014-07-31
Support Year
Fiscal Year
2012
Total Cost
$115,000
Indirect Cost
Name
Regents of the University of Michigan - Ann Arbor
Department
Type
DUNS #
City
Ann Arbor
State
MI
Country
United States
Zip Code
48109