We propose to make the growing body of experimental three-dimensional (3D) RNA structure data more useful to biomedical researchers by providing improved methods to integrate 3D RNA structure with sequence and other experimental data. New annotation tools and services developed in this project will be integrated into the Nucleic Acid Database (NDB) which will provide a platform for disseminating project results. Among the expected benefits are better methods for 1) predicting 3D structures of functional RNA motifs from sequence, 2) searching for non-coding RNA genes in genomes, and 3) improving alignments of homologous RNA sequences. We focus attention on recurrent, modular RNA 3D motifs, which occur in a wide variety of structured RNA molecules, and which give RNA its distinctive 3D shape. This includes hairpin loops, internal loops, junction loops, and tertiary interaction motifs. We will develop systematic methods to identify, classify, and name recurrent RNA 3D motifs and to define search criteria to reliably find instances of each motif in 3D structures. An annotation procedure will be established so that new motifs are rapidly identified in new structures and vetted in collaboration with other members of the RNA Ontology Consortium. All experimental RNA 3D structures will be annotated with lists of motifs. A Motif Atlas will be created to make information about 3D motif instances in structures available to users. This new Atlas containing the annotation of motifs will be added to the Nucleic Acid Database (NDB), a web resource containing structural and functional annotation of nucleic acid containing macromolecules. An update procedure will be developed such that motif data and Atlas entries will automatically be added to the NDB as new RNA structures become available in the PDB archive. We will extend the query capabilities of the NDB with tools for users to search the NDB for RNA motifs using multiple criteria and to integrate search results with experimental confidence measures. We will maintain statistics on the occurrences of motifs and base pairing interactions, incorporating experimental confidence measures, and make these data available as a resource for refinement and validation tools. Each entry in the Motif Atlas will include a structural alignment of all instances of the motif to reveal sequence variants for each motif, including patterns of insertions and deletions. These data will be combined with statistical covariation data for Watson-Crick and non-Watson-Crick basepairs and statistical data for base-stacking and base- backbone interactions to develop probabilistic models for the sequence variability of each modular RNA 3D motif. These models will be used to deploy a web-based tool for users to find the 3D motif from the Motif Atlas which best matches the sequences of hairpin, internal, or junction loops that they submit.

Public Health Relevance

Recent work shows that most of the human genome is transcribed, most of the produced RNA is non-protein coding, and a large fraction of it is critical for human reproduction, growth, and development. This proposal aims to make the growing body of experimental three-dimensional (3D) RNA structure data more useful to the biomedical research community by providing improved methods to integrate 3D RNA structure with sequence and other experimental data.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM085328-02
Application #
8125047
Study Section
Macromolecular Structure and Function D Study Section (MSFD)
Program Officer
Preusch, Peter C
Project Start
2010-08-10
Project End
2014-07-31
Budget Start
2011-08-01
Budget End
2012-07-31
Support Year
2
Fiscal Year
2011
Total Cost
$304,967
Indirect Cost
Name
Bowling Green State University
Department
Chemistry
Type
Schools of Arts and Sciences
DUNS #
617407325
City
Bowling Green
State
OH
Country
United States
Zip Code
43403
Roll, James; Zirbel, Craig L; Sweeney, Blake et al. (2016) JAR3D Webserver: Scoring and aligning RNA loop sequences to known 3D motifs. Nucleic Acids Res 44:W320-7
Parlea, Lorena G; Sweeney, Blake A; Hosseini-Asanjan, Maryam et al. (2016) The RNA 3D Motif Atlas: Computational methods for extraction, organization and evaluation of RNA motifs. Methods 103:99-119
Zirbel, Craig L; Roll, James; Sweeney, Blake A et al. (2015) Identifying novel sequence variants of RNA 3D motifs. Nucleic Acids Res 43:7504-20
Sweeney, Blake A; Roy, Poorna; Leontis, Neocles B (2015) An introduction to recurrent nucleotide interactions in RNA. Wiley Interdiscip Rev RNA 6:17-45
Rahrig, Ryan R; Zirbel, Craig L (2015) DETECTING CONFORMATIONAL DIFFERENCES BETWEEN RNA 3D STRUCTURES. JP J Biostat 12:99-115
Theis, Corinna; Zirbel, Craig L; Zu Siederdissen, Christian Höner et al. (2015) RNA 3D Modules in Genome-Wide Predictions of RNA 2D Structure. PLoS One 10:e0139900
Cannone, Jamie J; Sweeney, Blake A; Petrov, Anton I et al. (2015) R3D-2-MSA: the RNA 3D structure-to-multiple sequence alignment server. Nucleic Acids Res 43:W15-23
Coimbatore Narayanan, Buvaneswari; Westbrook, John; Ghosh, Saheli et al. (2014) The Nucleic Acid Database: new features and capabilities. Nucleic Acids Res 42:D114-22
Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab et al. (2014) Bioinformatics analysis of plant orthologous introns: identification of an intronic tRNA-like sequence. Gene 548:81-90
Rahrig, Ryan R; Petrov, Anton I; Leontis, Neocles B et al. (2013) R3D Align web server for global nucleotide to nucleotide alignments of RNA 3D structures. Nucleic Acids Res 41:W15-21

Showing the most recent 10 out of 19 publications