The long-term objective of this project is to develop novel and powerful statistical methods to identify genes underlying complex traits. With the availability of large numbers of genetic markers in the human genome, it is becoming feasible in genetic association studies to genotype thousands of markers and to saturate candidate regions with many tightly linked markers. The analysis of such data poses challenging statistical issues and both theoretical and empirical studies are needed to develop and evaluate statistical methods that can best extract the most relevant information.
The specific aims of this projects are: (1) Develop statistical methods to appropriately control for population stratification in an association study, both for qualitative traits and for quantitative traits; (2) Develop statistical methods to study associations between multiple tightly linked markers and complex traits of interest, both for samples consisting of unrelated individuals and for samples consisting of pedigrees; (3) Compare statistical efficiencies and the overall cost for various genotyping strategies in association studies; and (4) Develop computer programs that implement the statistical methods developed in this project and distribute them to the scientific community. We will also apply these methods to map complex disease genes through our extensive collaborations. The developments of these novel statistical methods and user-friendly computer programs will provide biomedical researchers with important tools to identify genes underlying complex traits .

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM059507-05
Application #
6608838
Study Section
Genome Study Section (GNM)
Program Officer
Eckstrand, Irene A
Project Start
1999-02-01
Project End
2005-07-31
Budget Start
2003-08-01
Budget End
2004-07-31
Support Year
5
Fiscal Year
2003
Total Cost
$230,284
Indirect Cost
Name
Yale University
Department
Public Health & Prev Medicine
Type
Schools of Medicine
DUNS #
043207562
City
New Haven
State
CT
Country
United States
Zip Code
06520
Sun, Jiehuan; Herazo-Maya, Jose D; Huang, Xiu et al. (2018) Distance-correlation based gene set analysis in longitudinal studies. Stat Appl Genet Mol Biol 17:
Wang, Tao; Zhao, Hongyu (2017) A Dirichlet-tree multinomial regression model for associating dietary nutrients with gut microorganisms. Biometrics 73:792-801
Liu, Yiyi; Zhao, Hongyu (2017) Variable importance-weighted Random Forests. Quant Biol 5:338-351
Sun, Jiehuan; Herazo-Maya, Jose D; Kaminski, Naftali et al. (2017) A Dirichlet process mixture model for clustering longitudinal gene expression data. Stat Med 36:3495-3506
Yan, Xiting; Liang, Anqi; Gomez, Jose et al. (2017) A novel pathway-based distance score enhances assessment of disease heterogeneity in gene expression. BMC Bioinformatics 18:309
Chung, Dongjun; Kim, Hang J; Zhao, Hongyu (2017) graph-GPA: A graphical model for prioritizing GWAS results and investigating pleiotropic architecture. PLoS Comput Biol 13:e1005388
Hou, Lin; Sun, Ning; Mane, Shrikant et al. (2017) Impact of genotyping errors on statistical power of association tests in genomic analyses: A case study. Genet Epidemiol 41:152-162
Lin, Zhixiang; Wang, Tao; Yang, Can et al. (2017) On joint estimation of Gaussian graphical models for spatial and temporal data. Biometrics 73:769-779
Sun, Jiehuan; Warren, Joshua L; Zhao, Hongyu (2017) A Bayesian semiparametric factor analysis model for subtype identification. Stat Appl Genet Mol Biol 16:145-158
Zhu, Ruoqing; Zhao, Ying-Qi; Chen, Guanhua et al. (2017) Greedy outcome weighted tree learning of optimal personalized treatment rules. Biometrics 73:391-400

Showing the most recent 10 out of 190 publications