Type 2 diabetes (T2D) shows complex inheritance, indicating a causal role for multiple inherited DNA variants. Genome wide association studies (GWAS) have now mapped over 20 novel loci where common variants are associated with risk of T2D. Despite this progress, identified risk alleles explain relatively little of the overall variation in T2D risk. To fully understand the genetic architecture of T2D we need to move from locus to gene to pinpoint specific causal gene(s) responsible for observed associations. We need to address allelic heteroaeneitv. where T2D genes are likely to have multiple different common and rare mutations. We need to explore ethnic variation, where the specific complement of gene mutations contributing to T2D are likely to vary in frequency and effect size across ethnic groups. We hypothesize that: (1) each region identified by GWAS contains at least one causal T2D gene, influenced by at least one common functional variant;(2) in addition to the index variant identified by GWAS, one or more additional common variants in each locus influence T2D;(3) in addition to common variants, each gene may harbor rare mutations that more strongly influence risk of T2D, and (4) the identities, frequencies and effects of these variants vary across multiple ethnic groups representative of the US population. To address these hypotheses we propose three Specific Aims. (1) Bring together multiethnic samples representative of the US population, drawn from the Jackson Heart Study, Framingham Heart Study, Multi-Ethnic Cohort Study, and Diabetes Prevention Program, that together include -29,000 individuals with T2D phenotypes and DNA;(2) Identify and fine-map common variants at each locus in each ethnic group by leveraging our multi-ethnic design and emerging data from the 1000 Genomes Project;and (3) Identify rare causal mutations at each locus by performing deep sequencing of all coding exons in each ethnic group. The co-investigators have extensive experience in complex disease genetics and genomics, next-generation sequencing, statistical genetics, metabolic physiology and epidemiology, and have a long track-record of effective collaboration and leadership that, combined with a large multiethnic, well-phenotyped sample, we hope can contribute to RFA-DK-09-004.

Public Health Relevance

Genetic studies of type 2 diabetes (T2D) have identified new genomic risk regions. We will look in these regions for genes, define variation within genes and variation in different people by bringing together -29,000 individuals from ethnic groups representing the US population, map genes in each region, and identify mutations by detailed DNA analysis, leading to better prevention and treatment of T2D.

National Institute of Health (NIH)
National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK)
Research Project--Cooperative Agreements (U01)
Project #
Application #
Study Section
Special Emphasis Panel (ZDK1)
Program Officer
Akolkar, Beena
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Broad Institute, Inc.
United States
Zip Code
Gallaugher, Michael; Canty, Angelo J; Paterson, Andrew D (2016) Factors associated with heterogeneity in microarray gene expression in peripheral blood mononuclear cells from large pedigrees. BMC Proc 10:91-95
Green, Alden; Cook, Kaitlyn; Grinde, Kelsey et al. (2016) A general method for combining different family-based rare-variant tests of association to improve power and robustness of a wide range of genetic architectures. BMC Proc 10:165-170
Friedrichs, Stefanie; Malzahn, Dörthe; Pugh, Elizabeth W et al. (2016) Filtering genetic variants and placing informative priors based on putative biological function. BMC Genet 17 Suppl 2:8
Datta, Ananda S; Biswas, Swati (2016) Comparison of haplotype-based statistical tests for disease association with rare and common variants. Brief Bioinform 17:657-71
Katsumata, Yuriko; Fardo, David W (2016) On combining family- and population-based sequencing data. BMC Proc 10:175-179
Held, Elizabeth; Cape, Joshua; Tintle, Nathan (2016) Comparing machine learning and logistic regression methods for predicting hypertension using a combination of gene expression and next-generation sequencing data. BMC Proc 10:141-145
Lent, Samantha; Deng, Xuan; Cupples, L Adrienne et al. (2016) Imputing rare variants in families using a two-stage approach. BMC Proc 10:209-214
Sitlani, Colleen M; Dupuis, Josée; Rice, Kenneth M et al. (2016) Genome-wide gene-environment interactions on quantitative traits using family data. Eur J Hum Genet 24:1022-8
Almeida, Marcio; Blondell, Lucy; Peralta, Juan M et al. (2016) Independent test assessment using the extreme value distribution theory. BMC Proc 10:245-249
Lek, Monkol; Karczewski, Konrad J; Minikel, Eric V et al. (2016) Analysis of protein-coding genetic variation in 60,706 humans. Nature 536:285-91

Showing the most recent 10 out of 54 publications