While we are strongly supportive of NIH initiatives in data sharing, we have long believed that it is insufficient to share only the raw data from genomic studies. The massive amounts of data created in today's genomic studies generate even more massive results that merit more careful scrutiny than is generally practical except through the use of sophisticated databases. Thus, we have devoted substantial resources to developing results databases (see, for example, ://www.scandb.org) that we make publicly available. The SCAN database (SNP and Copy number ANnotation) allows users to query results of our transcriptome studies by SNP, by gene and by region, and can be used to annotate SNPs with information on function, including potential eQTL (expression Quantitative Trait Locus) function. Our preliminary studies with SCAN have revealed that SNPs associated with complex traits are more likely to be eQTLs than minor-allele-frequency matched SNPs drawn from high-density SNP genotyping platforms. These results are robust across a wide range of definitions for trait-associated SNPs and eQTLs (p-values ranging from 10-4 to 10-8), and across a broad range of complex trait phenotypes. We now propose to extend the SCAN database to include results of transcriptome association studies being conducted at the University of Chicago on a broad range of human tissues and to continue to develop software tools to maximize the utility of this database. These efforts are informed by our near-complete immersion in studies relating genotype to phenotype (and in developing methods for relating genotype to phenotype) for many different complex traits. Thus, our specific aims are: 1) to extend the SCAN database to serve results of transcriptome studies in liver, brain adipose tissue, and skeletal muscle in addition to the results of the transcriptome studies from GTEx and our studies in lymphoblastoid cell lines that are currently served;2) to augment the novel software tools we have already developed for use with the SCAN database to use transcriptome association results to facilitate the identification of genetic risk factors for complex traits;and 3) to develop novel approaches for investigating the function of genetic variation with an emphasis on GxG interaction and nQTLs (network QTLs).

Public Health Relevance

We have long been committed to the development of public results databases (see, for example ://www.scandb.org) that permit us to serve the massive results of genomic studies in a way that facilitates further discovery research. Our project will allow users to query results of transcriptome association studies in a variety of human tissues, and to use this information to discover and better characterize genetic risk factors for complex traits.

National Institute of Health (NIH)
National Institute of Mental Health (NIMH)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-GGG-A (52))
Program Officer
Addington, Anjene M
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Chicago
Internal Medicine/Medicine
Schools of Medicine
United States
Zip Code
Gamazon, Eric R; Trendowski, Matthew R; Wen, Yujia et al. (2018) Gene and MicroRNA Perturbations of Cellular Response to Pemetrexed Implicate Biological Networks and Enable Imputation of Response in Lung Adenocarcinoma. Sci Rep 8:733
Zhang, Mingfeng; Lykke-Andersen, Soren; Zhu, Bin et al. (2018) Characterising cis-regulatory variation in the transcriptome of histologically normal and tumour-derived pancreatic tissues. Gut 67:521-533
Agrawal, A; Chou, Y-L; Carey, C E et al. (2018) Genome-wide association study identifies a novel locus for cannabis dependence. Mol Psychiatry 23:1293-1302
Gamazon, Eric R; Segrè, Ayellet V; van de Bunt, Martijn et al. (2018) Using an atlas of gene regulation across 44 human tissues to inform complex disease- and trait-associated variation. Nat Genet 50:956-967
Salisbury-Ruf, Christi T; Bertram, Clinton C; Vergeade, Aurelia et al. (2018) Bid maintains mitochondrial cristae structure and function and protects against cardiac disease in an integrative genomics study. Elife 7:
Collado-Torres, Leonardo; Nellore, Abhinav; Kammers, Kai et al. (2017) Reproducible RNA-seq analysis using recount2. Nat Biotechnol 35:319-321
Benítez-Buelga, Carlos; Baquero, Juan Miguel; Vaclova, Tereza et al. (2017) Genetic variation in the NEIL2 DNA glycosylase gene is associated with oxidative DNA damage in BRCA2 mutation carriers. Oncotarget 8:114626-114636
Tukiainen, Taru; Villani, Alexandra-Chloé; Yen, Angela et al. (2017) Landscape of X chromosome inactivation across human tissues. Nature 550:244-248
Chiang, Colby; Scott, Alexandra J; Davis, Joe R et al. (2017) The impact of structural variation on human gene expression. Nat Genet 49:692-699
Gudmundsson, Julius; Thorleifsson, Gudmar; Sigurdsson, Jon K et al. (2017) A genome-wide association study yields five novel thyroid cancer risk loci. Nat Commun 8:14517

Showing the most recent 10 out of 85 publications