The Human Genome Project is providing powerful resources for the identification of genes that predispose to human diseases: the complete sequence of the human genome and the sequences of many other species, a catalog of most common and many rare human genetic variants and their dependency relationships, and increasingly detailed sequence annotation. Along with these resources have come increasingly efficient means of genotyping and DNA sequencing. These resources and technologies will be critical as we seek to unravel the complex etiologic basis of common human diseases. In this proposal, I address a set of statistical problems that arise in human disease gene mapping. I describe how my colleagues and I will address these problems through analytic methods, computer simulation, and application to interesting test data, and how we will generalize these solutions through the production, distribution, and support of efficient computer software. ? ? First, we will continue to develop and test statistical designs and analysis methods for association mapping for complex human diseases. Specifically, we will: (A.I) develop two- and multi-stage methods for genetic association studies; (A.2) assess the impact of the preferential mistyping and nontyping of heterozygotes in association analysis; (A.3) develop permutation-based methods to assess the significance of association tests given multiple traits, markers, and/or groups of individuals; (A.4) model the effect of the """"""""winner's curse"""""""" on the estimation of the strength of association in complex disease studies; and (A.5) develop a parametric statistical framework to assess disease-marker association given family data of variable structure and to assess the role of a genetic marker in explaining a linkage signal for disease. ? ? Second, we will continue to address the impact of violating model assumptions on linkage analysis of genes for human diseases. Specifically, we will: (B.I) assess the impact of assuming equal male and female recombination fractions when they are different given various sampling designs; and (B.2) assess the impact of modeling marker-marker linkage disequilibrium in linkage analysis given genotype data on a dense set of SNPs. ? ? Third, we will continue to: (C) develop, test, distribute, and support computer software based on the methods that arise from the other aims of this project, and update, distribute, and support our current software, including SIMLINK, RHMAP, RELPAIR, and SIBMED. Finally, we will continue to be opportunistic in identifying and addressing important statistical problems that are related to the other goals of this project. Under separate funding, we will apply the resulting methods to the analysis of data from studies of type 2 diabetes, bipolar disorder, and other complex diseases. ? ?

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG000376-19
Application #
7114256
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Brooks, Lisa
Project Start
1988-09-28
Project End
2010-06-30
Budget Start
2006-07-01
Budget End
2007-06-30
Support Year
19
Fiscal Year
2006
Total Cost
$327,828
Indirect Cost
Name
University of Michigan Ann Arbor
Department
Biostatistics & Other Math Sci
Type
Schools of Public Health
DUNS #
073133571
City
Ann Arbor
State
MI
Country
United States
Zip Code
48109
Wojcik, Genevieve L; Fuchsberger, Christian; Taliun, Daniel et al. (2018) Imputation-Aware Tag SNP Selection To Improve Power for Large-Scale, Multi-ethnic Association Studies. G3 (Bethesda) 8:3255-3267
Reppell, M; Zöllner, S (2018) An efficient algorithm for generating the internal branches of a Kingman coalescent. Theor Popul Biol 122:57-66
Jiang, Yu; Chen, Sai; McGuire, Daniel et al. (2018) Proper conditional analysis in the presence of missing data: Application to large scale meta-analysis of tobacco use phenotypes. PLoS Genet 14:e1007452
Dutta, Diptavo; Scott, Laura; Boehnke, Michael et al. (2018) Multi-SKAT: General framework to test for rare-variant association with multiple phenotypes. Genet Epidemiol :
Ray, Debashree; Boehnke, Michael (2018) Methods for meta-analysis of multiple traits using GWAS summary statistics. Genet Epidemiol 42:134-145
Scott, Robert A; Scott, Laura J; Mägi, Reedik et al. (2017) An Expanded Genome-Wide Association Study of Type 2 Diabetes in Europeans. Diabetes 66:2888-2902
Chiu, Chi-Yang; Jung, Jeesun; Chen, Wei et al. (2017) Meta-analysis of quantitative pleiotropic traits for next-generation sequencing with multivariate functional linear models. Eur J Hum Genet 25:350-359
Chiu, Chi-Yang; Jung, Jeesun; Wang, Yifan et al. (2017) A comparison study of multivariate fixed models and Gene Association with Multiple Traits (GAMuT) for next-generation sequencing. Genet Epidemiol 41:18-34
Taliun, Daniel; Chothani, Sonia P; Schönherr, Sebastian et al. (2017) LASER server: ancestry tracing with genotypes or sequence reads. Bioinformatics 33:2056-2058
McCarthy, Shane; Das, Sayantan; Kretzschmar, Warren et al. (2016) A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet 48:1279-83

Showing the most recent 10 out of 67 publications