Most human genes contain introns, and the presence of introns often increases the expression of the host gene, a phenomenon known as intron-mediated enhancement (IME). IME has been observed in diverse genes in animals, plants and fungi and often varies in magnitude across introns. However, little is known about how introns impact expression or what intron features modulate IME activity. Recently, we have described a novel phenomenon that we call exon-mediated activation of transcription starts (EMATS), in which the splicing of internal exons impacts the spectrum of promoters used and the expression level of the gene. EMATS acts at a distance of up to a few kb, can alter gene expression by at least severalfold, and appears more active at certain promoters, especially intrinsically weak promoters. The detailed sequence requirements and mode of action of EMATS are not yet known. This proposal seeks to understand the rules that govern IME and EMATS, to improve the prediction of gene expression and to enable methods to modulate gene expression by altering splicing. It is organized around the following aims. SA1. Determine the sequence dependence of intron-mediated enhancement. SA2. Explore the scope and rules for EMATS regulation.
In Aim 1, we will generate a library of many thousands of distinct random sequences inserted into an intron in a dual fluorescent reporter system that is chromosomally integrated into human cells. This design will enable high-throughput measurement of the effects of each intron on nascent RNA, mature RNA and protein levels, and these data will be used to identify motifs that enhance or silence expression in a splicing-dependent manner from an intronic location.
In Aim 2, we will systematically derive and test rules for how EMATS regulation depends on the location and sequence of the internal exon and on properties of the involved promoter. Finally, we will use the information learned about IME and EMATS to improve predictions of gene expression from primary sequence. Together, the research described in these aims will establish rules governing how splicing impacts gene expression in mammalian genomes. Identification of motifs that function as splicing-dependent activators or silencers of expression can be used to improve prediction of expression from genome sequence and may enable detection of intronic variants that alter expression. Understanding how splicing impacts expression may also enable new approaches for gene expression modulation.

Public Health Relevance

This project seeks to understand and establish predictive rules for how splicing of pre-mRNAs impacts the expression of human genes. These rules will enable improved prediction of gene expression from genomic sequence and likely the identification of new classes of non-coding regulatory variants in the human genome that impact gene expression and contribute to disease phenotypes in a manner dependent on splicing. This work will also guide the application of existing technologies for perturbing splicing to enable the modulation of gene expression for therapeutic applications, such as boosting the expression of a tumor suppressor.

Agency
National Institutes of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
2R01HG002439-17A1
Application #
10120999
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Sen, Shurjo Kumar
Project Start
2021-01-06
Project End
2024-12-31
Budget Start
2021-01-06
Budget End
2021-12-31
Support Year
17
Fiscal Year
2021
Total Cost
Indirect Cost
Name
Massachusetts Institute of Technology
Department
Biology
Type
Schools of Arts and Sciences
DUNS #
001425594
City
Cambridge
State
MA
Country
United States
Zip Code
Pai, Athma A; Henriques, Telmo; McCue, Kayla et al. (2017) The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture. Elife 6:
Taliaferro, J Matthew; Lambert, Nicole J; Sudmant, Peter H et al. (2016) RNA Sequence Context Effects Measured In Vitro Predict In Vivo Protein Binding and Regulation. Mol Cell 64:294-306
Taliaferro, J Matthew; Vidaki, Marina; Oliveira, Ruan et al. (2016) Distal Alternative Last Exons Localize mRNAs to Neural Projections. Mol Cell 61:821-33
Merkin, Jason J; Chen, Ping; Alexis, Maria S et al. (2015) Origins and impacts of new mammalian exons. Cell Rep 10:1992-2005
Katz, Yarden; Wang, Eric T; Silterra, Jacob et al. (2015) Quantitative visualization of alternative exon expression from RNA-seq data. Bioinformatics 31:2400-2
Lambert, Nicole; Robertson, Alex; Jangi, Mohini et al. (2014) RNA Bind-n-Seq: quantitative assessment of the sequence and structural binding specificity of RNA binding proteins. Mol Cell 54:887-900
Shalgi, Reut; Hurt, Jessica A; Krykbaeva, Irina et al. (2013) Widespread regulation of translation by elongation pausing in heat shock. Mol Cell 49:439-52
Spies, Noah; Burge, Christopher B; Bartel, David P (2013) 3' UTR-isoform choice has limited influence on the stability and translational efficiency of most mRNAs in mouse fibroblasts. Genome Res 23:2078-90
Han, Hong; Irimia, Manuel; Ross, P Joel et al. (2013) MBNL proteins repress ES-cell-specific alternative splicing and reprogramming. Nature 498:241-5
Hurt, Jessica A; Robertson, Alex D; Burge, Christopher B (2013) Global analyses of UPF1 binding and function reveal expanded scope of nonsense-mediated mRNA decay. Genome Res 23:1636-50

Showing the most recent 10 out of 30 publications