This is a continuation of a project to analyze, annotate, organize, and integrate RNA 3D structural data with RNA sequence and functional data and deliver it to biomedical researchers. The significance of the proposal lies in the prominent role of RNA, much of it non-protein-coding and structured, in a host of cellular and physiological processes, as revealed by recent high-throughput studies. Thousands of new non-coding RNAs have been discovered, many of which are expressed with temporal and spatial specificity. RNA plays many different roles in the human body, from carrying genetic information and regulating gene expression, to mediating intra-cellular communication, responding to environmental signals, and catalyzing important cellular processes such as protein synthesis. Deciphering the relationships between 3D structure and sequence is key to understanding RNA function. RNA 3D motifs are vital components in the architecture of RNA and in RNA- RNA and RNA-protein interactions. We are developing tools to study the variability of 3D motif structures across different conditions within the same and among homologous RNA molecules, integrating 3D structure and sequence data. These tools will be key to improving our ability to predict structure from sequence.
The specific aims of the project are: 1. Deepen our understanding of RNA 3D structure by extracting and organizing 3D motifs. 2. Improve our understanding of the structural variation of RNA molecules and RNA complexes by examining and comparing 3D structures assigned to the same sequence/structure equivalence class. 3. Accelerate the study of the correspondence between RNA 3D structure and RNA sequence. 4. Provide annotations for RNA-protein interactions. 5. Develop new and extended tools for delivery, search and visualization of the RNA sequence and structure annotations. In the previous grant period, we created the RNA 3D Motif Atlas and a data pipeline to automatically extract, analyze and cluster RNA hairpin and internal loop motifs from new RNA-containing 3D structures on a monthly schedule; in the current project, we will do the same for RNA-RNA interaction motifs and RNA multi-helix junctions. We will use real-space refinement (RsR) statistics to evaluate, at the nucleotide level, how well RNA 3D structures fit the underlying experimental x-ray data. We will create and integrate annotations of RNA-protein interactions from a variety of resources; RNA- protein interactions are a key way that regulatory RNAs carry out their functions in living organisms. Finally, all products of the research will be made publicly available through the Nucleic Acid Database (NDB) website. These include interactive 2D maps and 3D visualizations of large RNAs, services to allow geometric searches for 3D structure fragments, and the many new annotations. These services will benefit a wide variety of biomedical researchers who will find ready access to what is known about the 3D structures of RNA motifs, interactions they make with other RNAs, interactions they make with proteins, and how to identify these through RNA sequence alone.

Public Health Relevance

RNA plays many different roles in the human body, from carrying genetic information and regulating which genes are active, to carrying messages between cells, responding to environmental signals, and catalyzing important cellular processes such as making proteins. Deciphering the relationships between 3D structure and sequence is key to understanding RNA function. We are creating and disseminating methods to analyze and compare RNA 3D structures, to integrate 3D structure data with sequences from genomes, and to deliver powerful data visualizations that may ultimately enable advances in RNA nano- biomedicine, especially the development of advanced RNA-based diagnostics and therapies.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM085328-06
Application #
8935831
Study Section
Macromolecular Structure and Function D Study Section (MSFD)
Program Officer
Preusch, Peter
Project Start
2010-08-10
Project End
2018-08-31
Budget Start
2015-09-01
Budget End
2016-08-31
Support Year
6
Fiscal Year
2015
Total Cost
$315,498
Indirect Cost
$55,058
Name
Bowling Green State University
Department
Chemistry
Type
Schools of Arts and Sciences
DUNS #
617407325
City
Bowling Green
State
OH
Country
United States
Zip Code
43403
Roll, James; Zirbel, Craig L; Sweeney, Blake et al. (2016) JAR3D Webserver: Scoring and aligning RNA loop sequences to known 3D motifs. Nucleic Acids Res 44:W320-7
Parlea, Lorena G; Sweeney, Blake A; Hosseini-Asanjan, Maryam et al. (2016) The RNA 3D Motif Atlas: Computational methods for extraction, organization and evaluation of RNA motifs. Methods 103:99-119
Zirbel, Craig L; Roll, James; Sweeney, Blake A et al. (2015) Identifying novel sequence variants of RNA 3D motifs. Nucleic Acids Res 43:7504-20
Sweeney, Blake A; Roy, Poorna; Leontis, Neocles B (2015) An introduction to recurrent nucleotide interactions in RNA. Wiley Interdiscip Rev RNA 6:17-45
Rahrig, Ryan R; Zirbel, Craig L (2015) DETECTING CONFORMATIONAL DIFFERENCES BETWEEN RNA 3D STRUCTURES. JP J Biostat 12:99-115
Theis, Corinna; Zirbel, Craig L; Zu Siederdissen, Christian Höner et al. (2015) RNA 3D Modules in Genome-Wide Predictions of RNA 2D Structure. PLoS One 10:e0139900
Cannone, Jamie J; Sweeney, Blake A; Petrov, Anton I et al. (2015) R3D-2-MSA: the RNA 3D structure-to-multiple sequence alignment server. Nucleic Acids Res 43:W15-23
Coimbatore Narayanan, Buvaneswari; Westbrook, John; Ghosh, Saheli et al. (2014) The Nucleic Acid Database: new features and capabilities. Nucleic Acids Res 42:D114-22
Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab et al. (2014) Bioinformatics analysis of plant orthologous introns: identification of an intronic tRNA-like sequence. Gene 548:81-90
Rahrig, Ryan R; Petrov, Anton I; Leontis, Neocles B et al. (2013) R3D Align web server for global nucleotide to nucleotide alignments of RNA 3D structures. Nucleic Acids Res 41:W15-21

Showing the most recent 10 out of 19 publications