The recent explosion of large amounts of high dimensional data in science, engineering, and society demands new technologies to recover sparse signals from high dimensional noisy observations. To address it, this project aims to develop efficient and robust methods for analyzing high-dimensional data, which have wide applications in signal processing, communication, computational biology, machine learning, image/video coding, sensor networks, social science, etc.

Recently, compressed sensing (CS) has attracted a good deal of attention from computer science, engineering, and statistics communities. However, the CS recovery only makes sense when the features are weakly correlated. A single rogue outlier may break down the reconstruction completely. The computational procedures cannot meet the challenge of ultrahigh dimensional problems in terms of statistical accuracy, algorithmic stability and computation expediency. To address these challenges, the investigators develop novel nonconvex regularization techniques to attain prediction accuracy and model parsimony for coherent statistical models that go much beyond Gaussianity. Theoretical analysis of its performance in estimation, prediction, and sparsity recovery is conducted. A class of simple algorithms feasible for solving essentially any nonconvex penalized generalized linear models is developed, together with a randomization technique of nonmarginal feature screening for ultra-high dimensional data. Furthermore, the investigators explicitly study the critical effects of outliers and develop a robust CS for handling high leverage points and gross outliers. A unified framework that applies to small-sample-size-high-dimension problems is provided for simultaneous variable selection and outlier identification under Gaussian/non-Gaussian noise. Finally, this project involves rich motivating examples and widespread applications in various areas including spectral analysis, network topology and dynamics modeling, graphical models, computational biology, machine learning, and image compression, as an essential component.

Project Start
Project End
Budget Start
2011-07-01
Budget End
2015-06-30
Support Year
Fiscal Year
2011
Total Cost
$237,225
Indirect Cost
Name
University of Florida
Department
Type
DUNS #
City
Gainesville
State
FL
Country
United States
Zip Code
32611