The Human Genome Project and follow-on projects such as the International HapMap, 1000 Genomes, and ENCODE Projects are providing powerful resources for the identification of genes that predispose to human diseases. Along with these resources have come increasingly efficient technologies for genotyping and DNA sequencing. These resources and technologies will be critical as we continue to seek to unravel the complex etiologic basis of common human diseases. In this proposal, I address a set of statistical problems that arise in human disease gene mapping. I describe how my colleagues and I will address these problems through analytic methods, computer simulation, and application to interesting complex disease genetics data, and how we will generalize these solutions through the production, distribution, and support of efficient computer software. Specifically, we will: 1. identify the range of genetic models consistent with available linkage and association information to aid in the efficient design of large-scale resequencing studies; 2. develop efficient multi-stage designs for large resequencing and follow-up association studies with a particular focus on the optimal combination of sequencing, genotyping, and genotype imputation; 3. identify the most probable set of causal variants among those tested in GWAS or resequencing studies; 4. develop methods for efficient association fine mapping of known causal loci and detection of additional causal loci given GWA and/or resequencing data on multiple ancestry groups;and 5. continue to develop, test, distribute, and support computer software based on the methods that arise from the other aims of this project, and update, distribute, and support our current software, including SIMLINK, RHMAP, RELPAIR, SIBMED, LocusZoom, Snipper, and Spotter. In addition, we will continue to be opportunistic in identifying and addressing important statistical problems that are related to the goals of this project. Under separate funding, we will apply the resulting methods to the analysis of data from genetic studies of type 2 diabetes and related quantitative traits.

Public Health Relevance

Studies to localize and identify genetic variants that predispose to human diseases have the potential to inform breakthrough strategies to develop new drugs, to develop genetic tests to stratify risk, and to enable more targeted approaches to prevention and treatment in the population. Efficient statistical and computational methods are critical for the success of such studies.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG000376-25
Application #
8423411
Study Section
Special Emphasis Panel (ZRG1-GGG-M (91))
Program Officer
Brooks, Lisa
Project Start
1988-09-28
Project End
2015-12-31
Budget Start
2013-01-01
Budget End
2013-12-31
Support Year
25
Fiscal Year
2013
Total Cost
$391,756
Indirect Cost
$124,529
Name
University of Michigan Ann Arbor
Department
Biostatistics & Other Math Sci
Type
Schools of Public Health
DUNS #
073133571
City
Ann Arbor
State
MI
Country
United States
Zip Code
48109
Li, Shi; Mukherjee, Bhramar; Taylor, Jeremy M G et al. (2014) The role of environmental heterogeneity in meta-analysis of gene-environment interactions with quantitative traits. Genet Epidemiol 38:416-29
(2014) Genome-wide trans-ancestry meta-analysis provides insight into the genetic architecture of type 2 diabetes susceptibility. Nat Genet 46:234-44
Reppell, Mark; Boehnke, Michael; Zöllner, Sebastian (2014) The impact of accelerating faster than exponential population growth on genetic variation. Genetics 196:819-28
Lee, Seunggeung; Abecasis, Gonçalo R; Boehnke, Michael et al. (2014) Rare-variant association analysis: study designs and statistical tests. Am J Hum Genet 95:5-23
Zawistowski, Matthew; Reppell, Mark; Wegmann, Daniel et al. (2014) Analysis of rare variant population structure in Europeans explains differential stratification of gene-based tests. Eur J Hum Genet 22:1137-44
Wang, Chaolong; Zhan, Xiaowei; Bragg-Gresham, Jennifer et al. (2014) Ancestry estimation and control of population stratification for sequence-based association studies. Nat Genet 46:409-15
Minelli, Cosetta; De Grandi, Alessandro; Weichenberger, Christian X et al. (2013) Importance of different types of prior knowledge in selecting genome-wide findings for follow-up. Genet Epidemiol 37:205-13
Thompson, John R; Gogele, Martin; Weichenberger, Christian X et al. (2013) SNP prioritization using a Bayesian probability of association. Genet Epidemiol 37:214-21
Ma, Clement; Blackwell, Tom; Boehnke, Michael et al. (2013) Recommended joint and meta-analysis strategies for case-control association testing of single low-count variants. Genet Epidemiol 37:539-50
Wu, Michael C; Lee, Seunggeun; Cai, Tianxi et al. (2011) Rare-variant association testing for sequencing data with the sequence kernel association test. Am J Hum Genet 89:82-93

Showing the most recent 10 out of 31 publications