The collection and analysis of high dimensional data is now of central importance in a wide range of scientific applications, including genomics, bioinformatics, climate studies, and signal processing. There are many new challenges in the analysis of such high dimensional data and there is much demand for the development of new statistical theory and methodology in this field. Although high dimensional data is often analyzed using complex models in many applications, attention is often focused on testing specific hypotheses that are of low dimension. In such cases new statistical theory and methodology relies on the development of new techniques for estimating non-smooth functionals, the topic of this project.

The focus of this research project is to establish a comprehensive methodology for estimating non-smooth functionals and to show how this methodology can be used in the detection and testing of multiple high-dimensional sparse sequences encountered in genomics. The proposed research will provide scientists with new methodologies to analyze such highly complex data. The statistical procedures developed in this project will be implemented in a high-level programming language and made available on the internet with the related research reports so as to allow comparisons with other approaches.

Agency
National Science Foundation (NSF)
Institute
Division of Mathematical Sciences (DMS)
Type
Standard Grant (Standard)
Application #
1403708
Program Officer
Gabor Szekely
Project Start
Project End
Budget Start
2014-07-01
Budget End
2017-06-30
Support Year
Fiscal Year
2014
Total Cost
$485,840
Indirect Cost
Name
Department
Type
DUNS #
City
State
Country
Zip Code