The Human Genome Project and follow-on projects such as the International HapMap, 1000 Genomes, and ENCODE Projects are providing powerful resources for the identification of genes that predispose to human diseases. Along with these resources have come increasingly efficient technologies for genotyping and DNA sequencing. These resources and technologies will be critical as we continue to seek to unravel the complex etiologic basis of common human diseases. In this proposal, I address a set of statistical problems that arise in human disease gene mapping. I describe how my colleagues and I will address these problems through analytic methods, computer simulation, and application to interesting complex disease genetics data, and how we will generalize these solutions through the production, distribution, and support of efficient computer software. Specifically, we will: (1) identify the range of genetic models consistent with available linkage and association information to aid in the efficient design of large-scale resequencing studies; (2) develop efficient multi-stage designs for large resequencing and follow-up association studies with a particular focus on the optimal combination of sequencing, genotyping, and genotype imputation; (3) identify the most probable set of causal variants among those tested in GWAS or resequencing studies; (4) develop methods for efficient association fine mapping of known causal loci and detection of additional causal loci given GWA and/or resequencing data on multiple ancestry groups; and (5) continue to develop, test, distribute, and support computer software based on the methods that arise from the other aims of this project, and update, distribute, and support our current software, including SIMLINK, RHMAP, RELPAIR, SIBMED, LocusZoom, Snipper, and Spotter. In addition, we will continue to be opportunistic in identifying and addressing important statistical problems that are related to the goals of this project.
Under separate funding, we will apply the resulting methods to the analysis of data from genetic studies of type 2 diabetes and related quantitative traits.
Studies to localize and identify genetic variants that predispose to human diseases have the potential to inform breakthrough strategies to develop new drugs, to develop genetic tests to stratify risk, and to enable more targeted approaches to prevention and treatment in the population. Efficient statistical and computational methods are critical for the success of such studies.