This project is concerned with statistical models whose parameter spaces have singularities. The investigator studies how singularities impact the behavior of existing statistical methods and develops new techniques for adequate assessment of statistical significance. The focus is on algebraic statistical models, that is, models that have (semi-)algebraic sets as parameter spaces. The class of algebraic models comprises many of the singular models employed in practice and can be studied using tools from computational algebraic geometry. Importantly, the well-behaved local geometry of semi-algebraic sets makes it possible to obtain general results without having to assume difficult to verify regularity conditions. The statistical techniques under study include classical procedures from likelihood inference such as likelihood ratio and Wald tests as well as information criteria.

Modern scientific studies often require analysis of data on several jointly observed variables. Statistical models of dependence relationships among the different variables are often formulated using additional variables that are not observable (or hidden). A common feature of hidden variable models is that their statistical properties are not entirely understood because of a lack of smoothness properties that makes them irregular. This is the primary motivation for this project that develops theory and methods that have a bearing on problems such as determining the number and type of unobserved variables to be included in a statistical model. Such problems arise in particular in applications in the social sciences where key concepts such as intelligence are not directly observable, and in computational biology where hidden variables are employed, for example, when DNA of present-day species is used to validate evolutionary theories that involve extinct species. More broadly, the work is relevant for any study, medical or otherwise, in which the existence of influential unobserved variables cannot be excluded.

Agency
National Science Foundation (NSF)
Institute
Division of Mathematical Sciences (DMS)
Application #
0746265
Program Officer
Gabor J. Szekely
Project Start
Project End
Budget Start
2008-07-01
Budget End
2013-05-31
Support Year
Fiscal Year
2007
Total Cost
$400,000
Indirect Cost
Name
University of Chicago
Department
Type
DUNS #
City
Chicago
State
IL
Country
United States
Zip Code
60637