Neuropsychiatric disorders pose an immense burden on patients, families, and health care systems, thus underscoring the urgent need to develop disease-modifying treatment. Research on neuropsychiatric disorders (e.g., Alzheimer?s disease, Parkinson?s disease) faces unique challenges, including the fact that these disorders typically have a late onset and slow progression, the diagnostic criteria are based on subjective clinical symptoms, and there is substantial disease and subject heterogeneity. In the proposed work, we aim to tackle these challenges by leveraging complementary contributions from multiple biomarkers, including genome-wide polymorphisms, whole brain neuroimaging, biofluids, and comprehensive neuropsychiatric assessments. We develop sophisticated analytic tools with higher resolution and improved accuracy by accounting for biological mechanisms of disease, synthesizing dynamic system-wide information, and integrating multiple sources of biomarkers. These methods are applied to clinical data collected by the investigative team or available from large international consortia in order to model the earliest pathological changes of neurodegenerative disease, assess treatment responses, and inform the design of early-intervention clinical trials and the discovery of optimal personalized therapies. Specifically, in Aim 1, we develop efficient methods for multi-level semiparametric transformation models to estimate and test the risk of genetic variants on various types of complex phenotypes to inform genetic counseling and improve clinical trial efficiency. Our methods do not rely on full pedigree genotyping and provide family-specific substructure, in addition to population substructure, to better control confounding and reduce false discovery rates in genome-wide association studies.
In Aim 2, we develop large-scale nonlinear dynamic systems through ordinary differential equations with random inflections to understand early pathological changes and identify subjects with preclinical signs. Our method provides multi-domain integration of ensembles of biomarker dynamics.
In Aim 3, we develop dynamic hazards models and incorporate dynamic network structures to estimate biomarker profiles that evolve smoothly with disease progression for earlier disease diagnosis. We account for irregularly measured biomarkers and biological network dependence among biomarkers.
In Aim 4, we develop doubly robust and efficient machine learning methods to identify predictive markers, estimate optimal individualized therapies, and identify subgroups who may receive the greatest benefit from therapy, with minimal risk. In each aim, we will validate the proposed methods through extensive simulation studies and demonstrate their practical value via application to real-world clinical studies. We establish theoretical properties of the proposed methods using modern empirical process theory and statistical learning theory. Together, the state-of-the-art analytic methods proposed here will substantially improve analytic accuracy, and our combined statistical and clinical expertise will ensure that our methods are translated directly back to the clinical and translational research community.

Public Health Relevance

The ultimate goal of neuropsychiatric research is to develop experimental therapeutics to delay disease on- set, slow disease progression, and provide effective treatment at each stage of disease. This proposal aims to develop new statistical approaches to integrate complementary sources of information from genomic measures, brain imaging biomarkers, and early clinical signs to characterize disease mechanism, progression, and treatment responses, and thereby inform the design of clinical trials and the discovery of optimal personalized therapies.

National Institute of Health (NIH)
National Institute of Neurological Disorders and Stroke (NINDS)
Research Project (R01)
Project #
Application #
Study Section
Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer
Miller, Daniel L
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Columbia University (N.Y.)
Biostatistics & Other Math Sci
Schools of Public Health
New York
United States
Zip Code
Wang, Yuanjia; Fu, Haoda; Zeng, Donglin (2018) Learning Optimal Personalized Treatment Rules in Consideration of Benefit and Risk: with an Application to Treating Type 2 Diabetes Patients with Insulin Therapies. J Am Stat Assoc 113:1-13
Liang, Liang; Carroll, Raymond; Ma, Yanyuan (2018) Dimension reduction and estimation in the secondary analysis of case-control studies. Electron J Stat 12:1782-1821
Li, Xiang; Xie, Shanghong; Zeng, Donglin et al. (2018) Efficient ?0 -norm feature selection based on augmented and penalized minimization. Stat Med 37:473-486
Qiu, Xin; Zeng, Donglin; Wang, Yuanjia (2018) Estimation and evaluation of linear individualized treatment rules to guarantee performance. Biometrics 74:517-528
Liu, Jianxuan; Ma, Yanyuan; Wang, Lan (2018) An alternative robust estimator of average treatment effect in causal inference. Biometrics 74:910-923
Liu, Ying; Wang, Yuanjia; Kosorok, Michael R et al. (2018) Augmented outcome-weighted learning for estimating optimal dynamic treatment regimens. Stat Med 37:3776-3788
Lee, Annie J; Wang, Yuanjia; Alcalay, Roy N et al. (2017) Penetrance estimate of LRRK2 p.G2019S mutation in individuals of non-Ashkenazi Jewish ancestry. Mov Disord 32:1432-1438
Liang, Baosheng; Tong, Xingwei; Zeng, Donglin et al. (2017) SEMIPARAMETRIC REGRESSION ANALYSIS OF REPEATED CURRENT STATUS DATA. Stat Sin 27:1079-1100
Liu, Ying; Wang, Yuanjia; Huang, Chaorui et al. (2017) Estimating personalized diagnostic rules depending on individualized characteristics. Stat Med 36:1099-1117
Wang, Qianqian; Ma, Yanyuan; Wang, Yuanjia (2017) Predicting disease Risk by Transformation Models in the Presence of Unspecified Subgroup Membership. Stat Sin 27:1857-1878

Showing the most recent 10 out of 60 publications