This is a continuation of a project to analyze, annotate, organize, and integrate RNA 3D structural data with RNA sequence and functional data and deliver it to biomedical researchers. The significance of the proposal lies in the prominent role of RNA, much of it non-protein-coding and structured, in a host of cellular and physiological processes, as revealed by recent high-throughput studies. Thousands of new non-coding RNAs have been discovered, many of which are expressed with temporal and spatial specificity. RNA plays many different roles in the human body, from carrying genetic information and regulating gene expression, to mediating intra-cellular communication, responding to environmental signals, and catalyzing important cellular processes such as protein synthesis. Deciphering the relationships between 3D structure and sequence is key to understanding RNA function. RNA 3D motifs are vital components in the architecture of RNA and in RNA- RNA and RNA-protein interactions. We are developing tools to study the variability of 3D motif structures across different conditions within the same and among homologous RNA molecules, integrating 3D structure and sequence data. These tools will be key to improving our ability to predict structure from sequence.
The specific aims of the project are: 1. Deepen our understanding of RNA 3D structure by extracting and organizing 3D motifs. 2. Improve our understanding of the structural variation of RNA molecules and RNA complexes by examining and comparing 3D structures assigned to the same sequence/structure "equivalence" class. 3. Accelerate the study of the correspondence between RNA 3D structure and RNA sequence. 4. Provide annotations for RNA-protein interactions. 5. Develop new and extended tools for delivery, search and visualization of the RNA sequence and structure annotations. In the previous grant period, we created the RNA 3D Motif Atlas and a data pipeline to automatically extract, analyze and cluster RNA hairpin and internal loop motifs from new RNA-containing 3D structures on a monthly schedule;in the current project, we will do the same for RNA-RNA interaction motifs and RNA multi-helix junctions. We will use real-space refinement (RsR) statistics to evaluate, at the nucleotide level, how well RNA 3D structures fit the underlying experimental x-ray data. We will create and integrate annotations of RNA-protein interactions from a variety of resources;RNA- protein interactions are a key way that regulatory RNAs carry out their functions in living organisms. Finally, all products of the research will be made publicly available through the Nucleic Acid Database (NDB) website. These include interactive 2D maps and 3D visualizations of large RNAs, services to allow geometric searches for 3D structure fragments, and the many new annotations. These services will benefit a wide variety of biomedical researchers who will find ready access to what is known about the 3D structures of RNA motifs, interactions they make with other RNAs, interactions they make with proteins, and how to identify these through RNA sequence alone.

Public Health Relevance

RNA plays many different roles in the human body, from carrying genetic information and regulating which genes are active, to carrying messages between cells, responding to environmental signals, and catalyzing important cellular processes such as making proteins. Deciphering the relationships between 3D structure and sequence is key to understanding RNA function. We are creating and disseminating methods to analyze and compare RNA 3D structures, to integrate 3D structure data with sequences from genomes, and to deliver powerful data visualizations that may ultimately enable advances in RNA nano- biomedicine, especially the development of advanced RNA-based diagnostics and therapies.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
2R01GM085328-05
Application #
8814878
Study Section
Macromolecular Structure and Function D Study Section (MSFD)
Program Officer
Preusch, Peter
Project Start
2010-08-10
Project End
2018-08-31
Budget Start
2014-09-30
Budget End
2015-08-31
Support Year
5
Fiscal Year
2014
Total Cost
$333,083
Indirect Cost
$72,642
Name
Bowling Green State University
Department
Chemistry
Type
Schools of Arts and Sciences
DUNS #
617407325
City
Bowling Green
State
OH
Country
United States
Zip Code
43403
Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab et al. (2014) Bioinformatics analysis of plant orthologous introns: identification of an intronic tRNA-like sequence. Gene 548:81-90
Coimbatore Narayanan, Buvaneswari; Westbrook, John; Ghosh, Saheli et al. (2014) The Nucleic Acid Database: new features and capabilities. Nucleic Acids Res 42:D114-22
Rahrig, Ryan R; Petrov, Anton I; Leontis, Neocles B et al. (2013) R3D Align web server for global nucleotide to nucleotide alignments of RNA 3D structures. Nucleic Acids Res 41:W15-21
Havrila, Marek; Reblova, Kamila; Zirbel, Craig L et al. (2013) Isosteric and nonisosteric base pairs in RNA motifs: molecular dynamics and bioinformatics study of the sarcin-ricin internal loop. J Phys Chem B 117:14302-19
Petrov, Anton I; Zirbel, Craig L; Leontis, Neocles B (2013) Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas. RNA 19:1327-40
Abu Almakarem, Amal S; Petrov, Anton I; Stombaugh, Jesse et al. (2012) Comprehensive survey and geometric classification of base triples in RNA structures. Nucleic Acids Res 40:1407-23
Petrov, Anton I; Zirbel, Craig L; Leontis, Neocles B (2011) WebFR3D--a server for finding, aligning and analyzing recurrent RNA 3D motifs. Nucleic Acids Res 39:W50-5