Recent years have seen great progress in the identification of millions of single nucleotide polymorphisms (SNPs). The abundance of SNPs coupled with the developments of various platforms for high-throughput, high-quality genotyping have made genome-wide association studies within reach for human geneticists. This is further strengthened by the on-going HapMap project that is collecting valuable population genetics information on the allele frequencies and linkage disequilibrium (LD) patterns of millions of SNPs in four different populations, yielding valuable information that can be used to select SNPs in genetic association studies. However, statistical methods lag behind the rapid accumulation of biological information and technology advancement in that existing methods are either not appropriate or not optimized to address scientific questions using these data. Moreover, the new technologies have also led to new questions that are unique and need to be appropriately addressed in a timely fashion to fully realize the potential in these technologies. In this application, we focus on the design and analysis of high-density SNPs in genetic association and linkage studies to identify DMAvariants associated with disease risk. The primary objectives are to investigate study design issues in genome-wide association studies and to develop statistically sound and computationally feasible methods for genetic linkage and association studies. More specifically, four specific aims will be achieved: (1) to investigate strategies in the design of genome-wide association studies;(2) to develop statistical methods for association studies in samples involving individuals having uncertainties on inbreeding patterns and relationships;(3) to develop statistical methods for linkage studies in the presence of LOamong markers;and (4) to implement the developed statistical methods in user-friendly computer programs and make them available to the scientific community.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Project #
Application #
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Krasnewich, Donna M
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Yale University
Public Health & Prev Medicine
Schools of Medicine
New Haven
United States
Zip Code
Sun, Jiehuan; Herazo-Maya, Jose D; Huang, Xiu et al. (2018) Distance-correlation based gene set analysis in longitudinal studies. Stat Appl Genet Mol Biol 17:
Wang, Tao; Zhao, Hongyu (2017) A Dirichlet-tree multinomial regression model for associating dietary nutrients with gut microorganisms. Biometrics 73:792-801
Liu, Yiyi; Zhao, Hongyu (2017) Variable importance-weighted Random Forests. Quant Biol 5:338-351
Sun, Jiehuan; Herazo-Maya, Jose D; Kaminski, Naftali et al. (2017) A Dirichlet process mixture model for clustering longitudinal gene expression data. Stat Med 36:3495-3506
Yan, Xiting; Liang, Anqi; Gomez, Jose et al. (2017) A novel pathway-based distance score enhances assessment of disease heterogeneity in gene expression. BMC Bioinformatics 18:309
Chung, Dongjun; Kim, Hang J; Zhao, Hongyu (2017) graph-GPA: A graphical model for prioritizing GWAS results and investigating pleiotropic architecture. PLoS Comput Biol 13:e1005388
Hou, Lin; Sun, Ning; Mane, Shrikant et al. (2017) Impact of genotyping errors on statistical power of association tests in genomic analyses: A case study. Genet Epidemiol 41:152-162
Lin, Zhixiang; Wang, Tao; Yang, Can et al. (2017) On joint estimation of Gaussian graphical models for spatial and temporal data. Biometrics 73:769-779
Sun, Jiehuan; Warren, Joshua L; Zhao, Hongyu (2017) A Bayesian semiparametric factor analysis model for subtype identification. Stat Appl Genet Mol Biol 16:145-158
Zhu, Ruoqing; Zhao, Ying-Qi; Chen, Guanhua et al. (2017) Greedy outcome weighted tree learning of optimal personalized treatment rules. Biometrics 73:391-400

Showing the most recent 10 out of 190 publications