In recent years, studies that associate genetic variation with gene expression (eQTL studies) have become a major tool for identifying regulatory genetic variation. However, the difficulty of securing primary tissue samples means that up to now these eQTL studies have been conducted in a limited range of cell and tissue types. Most notably the largest studies have been conducted in EBV-transformed lymphoblastoid cell lines, and it is unclear to what extent eQTLs identified in these cell lines wil be relevant to human disease mapping. The GTEx Project will provide data to remedy this situation, collecting RNA-seq and genotype data on 30 tissues in hundreds of individuals. However, current analytic tools are limited in their ability to fully exploit the richness of these data. In particular, available methods fall short in their ability to jointly analyze data on all tssues to maximize power, while simultaneously allowing for differences among eQTLs present in each tissue. Here we propose to develop novel statistical methods to help address these issues. We will apply these methods to identify eQTLs in the GTEx project data, integrate the GTEx data with other relevant data such as those available from the ENCODE project, and disseminate the results on the internet in a convenient form. We will also provide researchers with convenient tools to cross-reference results of the GTEx project with results of genome-wide association studies. The overall goal of the project is to build and apply an infrastructure for improved eQTL analyses, helping to maximize the utility and accessibility of GTEx data to the broad community of scientists who would like to use these data.
This project will generate and apply statistical tools for analyzing large-scale studies that aim to understand the impact of genetic variation on transcriptomes, and to better understand the different regulatory mechanisms underlying different tissue types, a fundamental issue in biology. Understanding these mechanisms, identifying the regulatory genetic variants, and correlating them with human disease, has the potential help understand the biology of disease, eventually leading to new treatment strategies.
Zhang, Mingfeng; Lykke-Andersen, Soren; Zhu, Bin et al. (2018) Characterising cis-regulatory variation in the transcriptome of histologically normal and tumour-derived pancreatic tissues. Gut 67:521-533 |
Banovich, Nicholas E; Li, Yang I; Raj, Anil et al. (2018) Impact of regulatory variation across human iPSCs and differentiated cells. Genome Res 28:122-131 |
Agrawal, A; Chou, Y-L; Carey, C E et al. (2018) Genome-wide association study identifies a novel locus for cannabis dependence. Mol Psychiatry 23:1293-1302 |
Collado-Torres, Leonardo; Nellore, Abhinav; Kammers, Kai et al. (2017) Reproducible RNA-seq analysis using recount2. Nat Biotechnol 35:319-321 |
Benítez-Buelga, Carlos; Baquero, Juan Miguel; Vaclova, Tereza et al. (2017) Genetic variation in the NEIL2 DNA glycosylase gene is associated with oxidative DNA damage in BRCA2 mutation carriers. Oncotarget 8:114626-114636 |
Tukiainen, Taru; Villani, Alexandra-Chloé; Yen, Angela et al. (2017) Landscape of X chromosome inactivation across human tissues. Nature 550:244-248 |
Chiang, Colby; Scott, Alexandra J; Davis, Joe R et al. (2017) The impact of structural variation on human gene expression. Nat Genet 49:692-699 |
Gudmundsson, Julius; Thorleifsson, Gudmar; Sigurdsson, Jon K et al. (2017) A genome-wide association study yields five novel thyroid cancer risk loci. Nat Commun 8:14517 |
Dolan, M Eileen; El Charif, Omar; Wheeler, Heather E et al. (2017) Clinical and Genome-Wide Analysis of Cisplatin-Induced Peripheral Neuropathy in Survivors of Adult-Onset Cancer. Clin Cancer Res 23:5757-5768 |
Varma, V R; Varma, S; An, Y et al. (2017) Alpha-2 macroglobulin in Alzheimer's disease: a marker of neuronal injury through the RCAN1 pathway. Mol Psychiatry 22:13-23 |
Showing the most recent 10 out of 42 publications