This.
aim of this proposal is to develop novel statistical methodology to address issues in the analysis of large scale data from biomedical studies, especially the studies of tumors and virus diseases. The problems arising from the analysis of DMAmicorarray, proteiomic and longitudinal data will be carefully investigated. The proposal focuses on developing innovative semiparametric technqiues for removing systematic biases in microarray experiments, selecting significantly expressed patterns of genes and proteins at different time points and under different experimental conditions, and efficiently assessing the covariate effects and predicting individual response trajectory for longitudinal studies. The strength and weakness of each proposed method will be critically scrutinized via theoretical investigations and simulation studies. Related software will be developed. Data sets from ongoing biologial studies on cancer and virus diseases will be analyzed by using the newly developed statistical methods. This study allows biologists to more effectively remove the impact of experimental variations inherited in microarray experiments and permits biologists to reveal more meaningful scientific results with lower false discovery rates. It provides cutting-edge tools for biologists to understand biological processes, molecular functions and cellular activities. It introduces new tools for medical scientists to unveil how the risk factors affect individual disease over time . These will result in improved disease classification, diagnosis, prognosis, and drug design, among other pharmaceutical, theraputic and medical goals.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM072611-03
Application #
7348360
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Remington, Karin A
Project Start
2006-02-01
Project End
2010-01-31
Budget Start
2008-02-01
Budget End
2009-01-31
Support Year
3
Fiscal Year
2008
Total Cost
$187,784
Indirect Cost
Name
Princeton University
Department
Miscellaneous
Type
Schools of Arts and Sciences
DUNS #
002484665
City
Princeton
State
NJ
Country
United States
Zip Code
08544
Fan, Jianqing; Liu, Han; Wang, Weichen (2018) LARGE COVARIANCE ESTIMATION THROUGH ELLIPTICAL FACTOR MODELS. Ann Stat 46:1383-1414
Chen, Zhao; Fan, Jianqing; Li, Runze (2018) Error Variance Estimation in Ultrahigh-Dimensional Additive Models. J Am Stat Assoc 113:315-327
Li, Quefeng; Cheng, Guang; Fan, Jianqing et al. (2018) Embracing the Blessing of Dimensionality in Factor Models. J Am Stat Assoc 113:380-389
Fan, Jianqing; Shao, Qi-Man; Zhou, Wen-Xin (2018) ARE DISCOVERIES SPURIOUS? DISTRIBUTIONS OF MAXIMUM SPURIOUS CORRELATIONS AND THEIR APPLICATIONS. Ann Stat 46:989-1017
Battey, Heather; Fan, Jianqing; Liu, Han et al. (2018) DISTRIBUTED TESTING AND ESTIMATION UNDER SPARSE HIGH DIMENSIONAL MODELS. Ann Stat 46:1352-1382
Zhou, Wen-Xin; Bose, Koushiki; Fan, Jianqing et al. (2018) A NEW PERSPECTIVE ON ROBUST M-ESTIMATION: FINITE SAMPLE THEORY AND APPLICATIONS TO DEPENDENCE-ADJUSTED MULTIPLE TESTING. Ann Stat 46:1904-1931
Fan, Jianqing; Liu, Han; Sun, Qiang et al. (2018) I-LAMM FOR SPARSE LEARNING: SIMULTANEOUS CONTROL OF ALGORITHMIC COMPLEXITY AND STATISTICAL ERROR. Ann Stat 46:814-841
Avella-Medina, Marco; Battey, Heather S; Fan, Jianqing et al. (2018) Robust estimation of high-dimensional covariance and precision matrices. Biometrika 105:271-284
Aït-Sahalia, Yacine; Fan, Jianqing; Laeven, Roger J A et al. (2017) Estimation of the Continuous and Discontinuous Leverage Effects. J Am Stat Assoc 112:1744-1758
Wang, Weichen; Fan, Jianqing (2017) Asymptotics of empirical eigenstructure for high dimensional spiked covariance. Ann Stat 45:1342-1374

Showing the most recent 10 out of 77 publications