One of the unexpected developments in the last decade emerged from large scale sequencing efforts - the discovery of widespread genome-wide transcription. While the human genome project revealed an unexpectedly small fraction of the genome dedicated to protein-coding genes, the ENCODE and related projects revealed that at least 70% of the genome is actively transcribed. This led to the discovery of a new class of RNAs known as long non-coding RNAs or lncRNAs. Rapidly accumulating evidence strongly suggests that these lncRNAs have important autonomous activities as RNAs. The emerging functions are predominantly in development, differentiation and pluripotency, processes with critical links to human health. Thus, establishing an understanding of the mechanism of action of lncRNAs is a high priority frontier in biology. Towards the long term-goal of producing a molecular understanding of lncRNA function, we are using a set of biochemical and structural approaches to elucidate how lncRNAs organize aspects of the nucleus to regulate and coordinate chromatin expression. In the first Aim of our proposed research program, the molecular basis of the interaction between hnRNP U and the Xist and Firre RNAs will be investigated. These lncRNAs are essential for X-chromosome inactivation (Xist) and adipogenesis (Firre), using mechanisms requiring their direct interaction with hnRNP U that localizes the lncRNA to the correct chromosome or loci. Selective binding of the RNA-binding RGG domain of hnRNP U will be explored using a combination of bottom-up and top-down biochemical approaches and structural approaches to understand the molecular details of protein-RNA recognition, focusing on the critical but poorly understood RGG domain. The results of these studies will illuminate a number of RNA-protein interactions mediated by RGG domains that are central to human RNA metabolism.
The second Aim focuses on the decoy/guide model for lncRNA function by investigating the interactions of two key transcription factors, glucocorticoid receptor (GR) and Sox2, with the lncRNAs that are proposed to modulate the specificity and regulatory activity of these proteins. According to this model, pervasive transcription at promoters and enhancers can either titrate away a transcription factor from its dsDNA-binding site or, conversely, help to recruit and localize a transcription factor to a target. To reveal the sequences and/or structures of lncRNAs that these dsDNA-binding proteins can recognize, an in vitro selection based approach will be used and the results correlated with transcriptomic studies. In parallel, traditional biochemical and structural approaches will be used to understand the molecular details of these protein-RNA interactions with known lncRNA targets. These studies will yield direct insights into how key transcription factors that have classically been regarded as solely dsDNA-binding factors also interact with RNA as a critical part of their gene regulatory mechanism.

Public Health Relevance

Our newfound ability to directly measure the transcriptional activity of the cell has, just like the human genome project, revealed completely unexpected insights into the nature of gene regulation. The most surprising revelation has been the discovery that the majority of chromatin is actively transcribed even though only a small fraction is then translated into protein. Initial insights into the action of these non-coding RNAs suggest that they act by regulating the activation state of chromatin and play key roles in differentiation, development and pluripotency. These myriad roles in fundamental cellular processes suggest a clear connection to many aspects of human health from both diagnostic and therapeutic perspectives, including many connections to cancer and other diseases.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM120347-04
Application #
9734135
Study Section
Special Emphasis Panel (ZRG1)
Program Officer
Carter, Anthony D
Project Start
2016-09-22
Project End
2020-06-30
Budget Start
2019-07-01
Budget End
2020-06-30
Support Year
4
Fiscal Year
2019
Total Cost
Indirect Cost
Name
University of Colorado at Boulder
Department
Chemistry
Type
Schools of Arts and Sciences
DUNS #
007431505
City
Boulder
State
CO
Country
United States
Zip Code
80303
Ozdilek, Bagdeser A; Thompson, Valery F; Ahmed, Nasiha S et al. (2017) Intrinsically disordered RGG/RG domains mediate degenerate specificity in RNA binding. Nucleic Acids Res 45:7984-7996