Our primary goals in developing a program for the analysis of sequence data from the 1000 Genomes Project (TGF) are to address fundamental issues in human biology, including how best to identify rare variants affecting complex human phenotypes, what proportion of the heritability for complex traits is attributable to rare vs. common variants, and how to predict which genes are most likely to harbor rare variants associated with complex traits.. To achieve these goals we have assembled a multi-disciplinary team with access to a variety of unique resources.
Our specific aims are: 1) We will develop and apply a variety of approaches for characterizing rare variants that affect complex human phenotypes. While this aim will focus on development of methods to relate rare variants to complex traits, our studies may also allow us to identify novel functional elements. 2) We will use results of comprehensive association studies in cytotoxicity phenotypes with common and rare variants throughout the human genome to determine the proportion of the total phenotypic variance in cytotoxicity phenotypes is attributable to rare vs. common variants. 3) We will attempt to develop predictive models for identifying genes likely to have rare variants affecting complex human phenotypes. Although the primary focus of the proposed research is on cytotoxicity phenotypes because we believe that these phenotypes are an outstanding model for general complex human phenotypes, we will make extensive use of our expression and miRNA data and results from these same samples to enhance our studies. Collectively, the datasets that we utilize in our studies and the tools that we develop to achieve our goals will provide novel and valuable resources to the 1000 Genomes Project and the larger scientific community.

Public Health Relevance

The sequence data available in the 1000 Genomes Project and the large number of cytotoxicity phenotypes we have assayed in the same samples provides us with unique opportunities to develop methods for relating rare variants to complex traits, to determine the proportion of phenotypic variance attributable to rare vs. common variants, and to develop predictive models for genes with rare variant associations.

National Institute of Health (NIH)
National Human Genome Research Institute (NHGRI)
Research Project--Cooperative Agreements (U01)
Project #
Application #
Study Section
Special Emphasis Panel (ZHG1-HGR-M (J1))
Program Officer
Brooks, Lisa
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Chicago
Internal Medicine/Medicine
Schools of Medicine
United States
Zip Code
McCarthy, Shane; Das, Sayantan; Kretzschmar, Warren et al. (2016) A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet 48:1279-83
Fuchsberger, Christian (see original citation for additional authors) (2016) The genetic architecture of type 2 diabetes. Nature 536:41-47
Gamazon, Eric R; Cox, Nancy J; Davis, Lea K (2014) Structural architecture of SNP effects on complex traits. Am J Hum Genet 95:477-89
Gamazon, Eric R; Huang, R Stephanie; Cox, Nancy J (2013) SCAN: a systems biology approach to pharmacogenomic discovery. Methods Mol Biol 1015:213-24
Gamazon, Eric R; Pinto, Navin; Konkashbaev, Anuar et al. (2013) Trans-population analysis of genetic mechanisms of ethnic disparities in neuroblastoma survival. J Natl Cancer Inst 105:302-9
Liu, Qianying; Nicolae, Dan L; Chen, Lin S (2013) Marbled inflation from population structure in gene-based association studies with rare variants. Genet Epidemiol 37:286-92
Chen, Lin S; Hsu, Li; Gamazon, Eric R et al. (2012) An exponential combination procedure for set-based association tests in sequencing studies. Am J Hum Genet 91:977-86
Gamazon, Eric R; Huang, R Stephanie; Dolan, M Eileen et al. (2011) Copy number polymorphisms and anticancer pharmacogenomics. Genome Biol 12:R46