A handful of mutation processes operate on the human germline to form small insertions and deletions (indels), large copy number variants (CNVs), inversions, translocations and more complex changes in chromosome structure. These diverse mutations are collectively referred to as structural variation (SV). Assessing the functional and pathogenic impact of singleton and rare structural variants in disease is one of the most pressing and understudied problems in human genetics today. Here we describe methodological innovations for integrating structural variation into eQTL studies, and then transforming knowledge learned from GTEx data into a probabilistic pathogenicity assessment tool that can be used by a wide range of researchers. We will pilot new approaches for integrating SVs and single nucleotide variants (SNVs) in a coherent framework. The centerpiece of this integrative effort will be a new model- based pathogenicity assessment method that will integrate (i) knowledge gleaned from GTEx analyses, (ii) recent breakthroughs in classification of Mendelian disease genes, and (iii) the rapidly expanding set of known disease mutations matriculating from array- and sequencing-based studies of severe Mendelian and other pediatric diseases. This method will be the first tool for generic functional assessment of both SVs and SNVs and will interpret variation affecting coding and/or non-coding regions.

Public Health Relevance

It is estimated that approximately 15% of cases of idiopathic intellectual disability and congenital defects are caused by chromosomal rearrangements. This number is certainly an underestimate, as no tools exist to evaluate the functional impact of most chromosomal rearrangements. In this project we will develop novel model-based methods for improved interpretation of this class of mutation, driven in part by the new availability of extensive gene expression data from multiple human tissues.

National Institute of Health (NIH)
National Institute of Mental Health (NIMH)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1)
Program Officer
Addington, Anjene M
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Washington University
Schools of Medicine
Saint Louis
United States
Zip Code
Castel, Stephane E; Mohammadi, Pejman; Chung, Wendy K et al. (2016) Rare variant phasing and haplotypic expression from RNA sequencing with phASER. Nat Commun 7:12817
Long, Quan; Argmann, Carmen; Houten, Sander M et al. (2016) Inter-tissue coexpression network analysis reveals DPP4 as an important gene in heart to blood communication. Genome Med 8:15
Benitez-Buelga, Carlos; Vaclová, Tereza; Ferreira, Sofia et al. (2016) Molecular insights into the OGG1 gene, a cancer risk modifier in BRCA1 and BRCA2 mutations carriers. Oncotarget 7:25815-25
Hartmann, Katherine; Seweryn, Michał; Handleman, Samuel K et al. (2016) Non-linear interactions between candidate genes of myocardial infarction revealed in mRNA expression profiles. BMC Genomics 17:738
Hou, Liping; Bergen, Sarah E; Akula, Nirmala et al. (2016) Genome-wide association study of 40,000 individuals identifies two novel loci associated with bipolar disorder. Hum Mol Genet 25:3383-3394
Stacey, Simon N; Kehr, Birte; Gudmundsson, Julius et al. (2016) Insertion of an SVA-E retrotransposon into the CASP8 gene is associated with protection against prostate cancer. Hum Mol Genet 25:1008-18
Wilfert, Amy B; Chao, Katherine R; Kaushal, Madhurima et al. (2016) Genome-wide significance testing of variation from single case exomes. Nat Genet 48:1455-1461
Brinkmeyer-Langford, Candice L; Guan, Jinting; Ji, Guoli et al. (2016) Aging Shapes the Population-Mean and -Dispersion of Gene Expression in Human Brains. Front Aging Neurosci 8:183
Gordon, Erin D; Palandra, Joe; Wesolowska-Andersen, Agata et al. (2016) IL1RL1 asthma risk variants regulate airway type 2 inflammation. JCI Insight 1:e87871
Pierson, Emma; GTEx Consortium; Koller, Daphne et al. (2015) Sharing and Specificity of Co-expression Networks across 35 Human Tissues. PLoS Comput Biol 11:e1004220

Showing the most recent 10 out of 16 publications