Modern high-dimensional complex-structured datasets are usually very noisy and heavy-tailed. The project will address two major inter-related questions arising from the challenges due to these highly noisy datasets and unstable tuning-sensitive nonparametric methods. The first question concerns the noisy nature of high-dimensional complex-structured datasets. It addresses the extent to which we can believe in the results of estimation procedures designed for regression models with very light-tailed errors, while keeping in mind that such an ideal assumption may fail miserably in practice due to numerous outliers. The second question concerns the subtle and usually unstable tuning-sensitive multidimensional nonparametric methods. It aims at understanding how far alternative tuning-free and stable statistical estimation methods with additional shape constraints can be reliable. The graduate student will be involved in developing theory and methods for multi-dimensional shape constrained models, implementing algorithms, performing numerical experiments, and validating the theoretical properties of the statistical methods.

The proposed research questions share a close common tie at the level of their underlying probabilistic structures exhibited by multiplier empirical processes, a special form of the empirical processes. Unfortunately, existing tools necessarily fail to provide sharp understandings due to complex structures within the processes. The PI will develop probabilistic tools and techniques in connection with these multiplier empirical processes. These tools and techniques will play a crucial role in understanding the phase-transitional behavior of the commonly used least squares and other related frequentist and Bayes methods in regression models with heavy tails and shape constraints. The particular problems under investigation include (a) behavior of penalized least squares estimators in nonparametric models with heavy-tailed errors, (b) behavior of Bayes procedures under quasi-Gaussian likelihood with heavy-tailed errors, (c) behavior of least squares estimators in multi-dimensional isotonic and convex shape constrained models and (d) behavior of maximum likelihood estimators in general additive models with shape constraints.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Agency
National Science Foundation (NSF)
Institute
Division of Mathematical Sciences (DMS)
Application #
1916221
Program Officer
Gabor Szekely
Project Start
Project End
Budget Start
2019-07-01
Budget End
2022-06-30
Support Year
Fiscal Year
2019
Total Cost
$109,171
Indirect Cost
Name
Rutgers University
Department
Type
DUNS #
City
Piscataway
State
NJ
Country
United States
Zip Code
08854