The Human Genome Project and follow-on projects such as the International HapMap, 1000 Genomes, and ENCODE Projects are providing powerful resources for the identification of genes that predispose to human diseases. Along with these resources have come increasingly efficient technologies for genotyping and DNA sequencing. These resources and technologies will be critical as we continue to seek to unravel the complex etiologic basis of common human diseases. In this proposal, I address a set of statistical problems that arise in human disease gene mapping. I describe how my colleagues and I will address these problems through analytic methods, computer simulation, and application to interesting complex disease genetics data, and how we will generalize these solutions through the production, distribution, and support of efficient computer software. Specifically, we will: 1. identify the range of genetic models consistent with available linkage and association information to aid in the efficient design of large-scale resequencing studies; 2. develop efficient multi-stage designs for large resequencing and follow-up association studies with a particular focus on the optimal combination of sequencing, genotyping, and genotype imputation; 3. identify the most probable set of causal variants among those tested in GWAS or resequencing studies; 4. develop methods for efficient association fine mapping of known causal loci and detection of additional causal loci given GWA and/or resequencing data on multiple ancestry groups;and 5. continue to develop, test, distribute, and support computer software based on the methods that arise from the other aims of this project, and update, distribute, and support our current software, including SIMLINK, RHMAP, RELPAIR, SIBMED, LocusZoom, Snipper, and Spotter. In addition, we will continue to be opportunistic in identifying and addressing important statistical problems that are related to the goals of this project. Under separate funding, we will apply the resulting methods to the analysis of data from genetic studies of type 2 diabetes and related quantitative traits.

Public Health Relevance

Studies to localize and identify genetic variants that predispose to human diseases have the potential to inform breakthrough strategies to develop new drugs, to develop genetic tests to stratify risk, and to enable more targeted approaches to prevention and treatment in the population. Efficient statistical and computational methods are critical for the success of such studies.

National Institute of Health (NIH)
National Human Genome Research Institute (NHGRI)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-GGG-M (91))
Program Officer
Brooks, Lisa
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Michigan Ann Arbor
Biostatistics & Other Math Sci
Schools of Public Health
Ann Arbor
United States
Zip Code
Lee, Seunggeun; Fuchsberger, Christian; Kim, Sehee et al. (2016) An efficient resampling method for calibrating single and gene-based rare variant association analysis in case-control studies. Biostatistics 17:1-15
Das, Sayantan; Forer, Lukas; Schönherr, Sebastian et al. (2016) Next-generation genotype imputation service and methods. Nat Genet 48:1284-7
Ma, Clement; Boehnke, Michael; Lee, Seunggeun et al. (2015) Evaluating the Calibration and Power of Three Gene-Based Association Tests of Rare Variants for the X Chromosome. Genet Epidemiol 39:499-508
Moutsianas, Loukas; Agarwala, Vineeta; Fuchsberger, Christian et al. (2015) The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease. PLoS Genet 11:e1005165
Flickinger, Matthew; Jun, Goo; Abecasis, Gonçalo R et al. (2015) Correcting for Sample Contamination in Genotype Calling of DNA Sequence Data. Am J Hum Genet 97:284-90
Li, Shi; Mukherjee, Bhramar; Taylor, Jeremy M G et al. (2014) The role of environmental heterogeneity in meta-analysis of gene-environment interactions with quantitative traits. Genet Epidemiol 38:416-29
Lee, Seunggeung; Abecasis, Gonçalo R; Boehnke, Michael et al. (2014) Rare-variant association analysis: study designs and statistical tests. Am J Hum Genet 95:5-23
Reppell, Mark; Boehnke, Michael; Zöllner, Sebastian (2014) The impact of accelerating faster than exponential population growth on genetic variation. Genetics 196:819-28
Zawistowski, Matthew; Reppell, Mark; Wegmann, Daniel et al. (2014) Analysis of rare variant population structure in Europeans explains differential stratification of gene-based tests. Eur J Hum Genet 22:1137-44
Wang, Chaolong; Zhan, Xiaowei; Bragg-Gresham, Jennifer et al. (2014) Ancestry estimation and control of population stratification for sequence-based association studies. Nat Genet 46:409-15

Showing the most recent 10 out of 50 publications