Statistical methods for early disease prediction and treatment strategy estimation using biomarker signatures

Wang, Yuanjia

Abstract

Neuropsychiatric disorders pose an immense burden on patients, families, and health care systems, thus underscoring the urgent need to develop disease-modifying treatment. Research on neuropsychiatric disorders (e.g., Alzheimer's disease, Parkinson's disease) faces unique challenges, including the fact that these disorders typically have a late onset and slow progression, the diagnostic criteria are based on subjective clinical symptoms, and there is substantial disease and subject heterogeneity. In the proposed work, we aim to tackle these chal- lenges by leveraging complementary contributions from multiple biomarkers, including genome-wide polymor- phisms, whole brain neuroimaging, bio?uids, and comprehensive neuropsychiatric assessments. We develop sophisticated analytic tools with higher resolution and improved accuracy by accounting for biological mecha- nisms of disease, synthesizing dynamic system-wide information, and integrating multiple sources of biomarkers. These methods are applied to clinical data collected by the investigative team or available from large international consortia in order to model the earliest pathological changes of neurodegenerative disease, assess treatment responses, and inform the design of early-intervention clinical trials and the discovery of optimal personalized therapies. Speci?cally, in Aim 1, we develop ef?cient methods for multi-level semiparametric transformation mod- els to estimate and test the risk of genetic variants on various types of complex phenotypes to inform genetic counseling and improve clinical trial ef?ciency. Our methods do not rely on full pedigree genotyping and provide family-speci?c substructure, in addition to population substructure, to better control confounding and reduce false discovery rates in genome-wide association studies.
In Aim 2, we develop large-scale nonlinear dynamic sys- tems through ordinary differential equations with random in?ections to understand early pathological changes and identify subjects with preclinical signs. Our method provides multi-domain integration of ensembles of biomarker dynamics.
In Aim 3, we develop dynamic hazards models and incorporate dynamic network structures to estimate biomarker pro?les that evolve smoothly with disease progression for earlier disease diagnosis. We account for irregularly measured biomarkers and biological network dependence among biomarkers.
In Aim 4, we develop doubly robust and ef?cient machine learning methods to identify predictive markers, estimate optimal individu- alized therapies, and identify subgroups who may receive the greatest bene?t from therapy, with minimal risk. In each aim, we will validate the proposed methods through extensive simulation studies and demonstrate their practical value via application to real-world clinical studies. We establish theoretical properties of the proposed methods using modern empirical process theory and statistical learning theory. Together, the state-of-the-art ana- lytic methods proposed here will substantially improve analytic accuracy, and our combined statistical and clinical expertise will ensure that our methods are translated directly back to the clinical and translational research com- munity.

Public Health Relevance

The ultimate goal of neuropsychiatric research is to develop experimental therapeutics to delay disease on- set, slow disease progression, and provide effective treatment at each stage of disease. This proposal aims to develop new statistical approaches to integrate complementary sources of information from genomic measures, brain imaging biomarkers, and early clinical signs to characterize disease mechanism, progression, and treatment responses, and thereby inform the design of clinical trials and the discovery of optimal personalized therapies.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of Neurological Disorders and Stroke (NINDS)
Type: Research Project (R01)
Project #: 5R01NS073671-06
Application #: 9502388
Study Section: Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer: Sutherland, Margaret L

Project Start: 2011-07-15
Project End: 2021-04-30
Budget Start: 2018-05-01
Budget End: 2019-04-30
Support Year: 6
Fiscal Year: 2018
Total Cost
Indirect Cost

Institution

Name: Columbia University (N.Y.)
Department: Biostatistics & Other Math Sci
Type: Schools of Public Health
DUNS #: 621889815

City: New York
State: NY
Country: United States
Zip Code: 10032

Related projects


NIH 2020 R01 NS	Statistical methods for early disease prediction and treatment strategy estimation using biomarker signatures Wang, Yuanjia / Columbia University (N.Y.)
NIH 2019 R01 NS	Statistical methods for early disease prediction and treatment strategy estimation using biomarker signatures Wang, Yuanjia / Columbia University (N.Y.)
NIH 2018 R01 NS	Statistical methods for early disease prediction and treatment strategy estimation using biomarker signatures Wang, Yuanjia / Columbia University (N.Y.)
NIH 2017 R01 NS	Statistical methods for early disease prediction and treatment strategy estimation using biomarker signatures Wang, Yuanjia / Columbia University (N.Y.)	$366,940
NIH 2014 R01 NS	Efficient Methods for Genotype-Specific Distributions with Unobserved Genotypes. Wang, Yuanjia / Columbia University (N.Y.)
NIH 2013 R01 NS	Efficient Methods for Genotype-Specific Distributions with Unobserved Genotypes. Wang, Yuanjia / Columbia University (N.Y.)	$257,478
NIH 2012 R01 NS	Efficient Methods for Genotype-Specific Distributions with Unobserved Genotypes. Wang, Yuanjia / Columbia University (N.Y.)	$267,091
NIH 2011 R01 NS	Efficient Methods for Genotype-Specific Distributions with Unobserved Genotypes. Wang, Yuanjia / Columbia University (N.Y.)	$280,540

Publications

Wang, Yuanjia; Fu, Haoda; Zeng, Donglin (2018) Learning Optimal Personalized Treatment Rules in Consideration of Benefit and Risk: with an Application to Treating Type 2 Diabetes Patients with Insulin Therapies. J Am Stat Assoc 113:1-13

Liang, Liang; Carroll, Raymond; Ma, Yanyuan (2018) Dimension reduction and estimation in the secondary analysis of case-control studies. Electron J Stat 12:1782-1821

Li, Xiang; Xie, Shanghong; Zeng, Donglin et al. (2018) Efficient ?0 -norm feature selection based on augmented and penalized minimization. Stat Med 37:473-486

Qiu, Xin; Zeng, Donglin; Wang, Yuanjia (2018) Estimation and evaluation of linear individualized treatment rules to guarantee performance. Biometrics 74:517-528

Liu, Jianxuan; Ma, Yanyuan; Wang, Lan (2018) An alternative robust estimator of average treatment effect in causal inference. Biometrics 74:910-923

Liu, Ying; Wang, Yuanjia; Kosorok, Michael R et al. (2018) Augmented outcome-weighted learning for estimating optimal dynamic treatment regimens. Stat Med 37:3776-3788

Garcia, Tanya P; Ma, Yanyuan; Marder, Karen et al. (2017) ROBUST MIXED EFFECTS MODEL FOR CLUSTERED FAILURE TIME DATA: APPLICATION TO HUNTINGTON'S DISEASE EVENT MEASURES. Ann Appl Stat 11:1085-1116

Chen, Huaihou; Zeng, Donglin; Wang, Yuanjia (2017) Penalized nonlinear mixed effects model to identify biomarkers that predict disease progression. Biometrics 73:1343-1354

Ma, Shujie; Ma, Yanyuan; Wang, Yanqing et al. (2017) A Semiparametric Single-Index Risk Score Across Populations. J Am Stat Assoc 112:1648-1662

Zeng, Donglin; Gao, Fei; Lin, D Y (2017) Maximum likelihood estimation for semiparametric regression models with multivariate interval-censored data. Biometrika 104:505-525

Showing the most recent 10 out of 60 publications

Comments

Be the first to comment on Yuanjia Wang's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: