RNA molecules and their functions are central to cellular and molecular biology; their functions range from the classic messengers of the central dogma, to ribozymes that carry out enzymatic activities, to acting as regulatory molecules. The existence and important of non-coding RNAs poses a challenge to bioinformatics. How are biologists to identify these molecules in newly sequenced genomes or sequence databases? One key approach is to use bioinformatic methods that can search genomes and databases for specific RNA secondary structures. The research we propose is directed towards developing computational tools to model the structures of RNAs, to perform RNA structure alignment and database searches, and then to experimentally test our predictions. Our approach is based upon conformational graph models and graph theoretic techniques, developing fast, yet accurate, RNA structural homology search tools based upon the notion of graph tree decomposition. These methods can describe both stem-loop and pseudoknot structures. Our preliminary results include successful searches of prokaryotic and eukaryotic genomes for large and complex RNAs by their structure. The specific steps of the proposed work are: (1) development of the conformational graph - tree decomposition method into high throughput tools for RNA structure search that biologists can readily use; (2) development of tools for automated comparative analysis and modeling of non-annotated, unaligned, RNA sequences; (3) application of our method to discovery and annotation of specific RNA gene families, including experimental verification of the predictions; (4) distribution of the search and modeling tools, including a user-friendly interface, together with a conformational graph structure profile database. Our goal is to develop a practical RNA structure modeling and search tool set that biologists can readily use to find RNA structures in genomes or databases, to help them generate testable hypotheses about the numbers, functions and evolution of RNA gene families. RNA molecules are at the center of basic biology as well as public health, and may be a key component of future medical practice. Our proposed research will help biologists find RNA molecules of interest in the mass of genome sequence data being generated. ? ? ? ?

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM072080-02
Application #
7236730
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Preusch, Peter C
Project Start
2006-06-01
Project End
2009-05-31
Budget Start
2007-06-01
Budget End
2008-05-31
Support Year
2
Fiscal Year
2007
Total Cost
$234,885
Indirect Cost
Name
University of Georgia
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
004315578
City
Athens
State
GA
Country
United States
Zip Code
30602
Manzourolajdad, Amirhossein; Wang, Yingfeng; Shaw, Timothy I et al. (2013) Information-theoretic uncertainty of SCFG-modeled folding space of the non-coding RNA. J Theor Biol 318:140-63
Wang, Yingfeng; Manzour, Amir; Shareghi, Pooya et al. (2012) Stable stem enabled Shannon entropies distinguish non-coding RNAs from random backgrounds. BMC Bioinformatics 13 Suppl 5:S1
Zhang, Dong; Xue, Xingran; Malmberg, Russell L et al. (2012) TRFolder-W: a web server for telomerase RNA structure prediction in yeast genomes. Bioinformatics 28:2696-7
Shareghi, Pooya; Wang, Yingfeng; Malmberg, Russell et al. (2012) Simultaneous prediction of RNA secondary structure and helix coaxial stacking. BMC Genomics 13 Suppl 3:S7
Guo, Leilei; Zhang, Dong; Wang, Yingfeng et al. (2011) TRFolder: computational prediction of novel telomerase RNA structures in yeast genomes. Int J Bioinform Res Appl 7:63-81
Shaw, Timothy I; Manzour, Amir; Wang, Yingfeng et al. (2011) Analyzing modular RNA structure reveals low global structural entropy in microRNA sequence. J Bioinform Comput Biol 9:283-98
Srivastava, Anuj; Cai, Liming; Mrazek, Jan et al. (2011) Mutational patterns in RNA secondary structure evolution examined in three RNA families. PLoS One 6:e20484
Rogers, Willie L; Cruse-Sanders, Jennifer M; Determann, Ron et al. (2010) Development and characterization of microsatellite markers in Sarracenia L. (pitcher plant) species. Conserv Genet Resour 2:75-79
Malmberg, Russell L; Shaw, Timothy I; Cai, Liming (2010) RNApasta: a tool for analysis of RNA structural alignments. Int J Bioinform Res Appl 6:571-83
Wang, Zhi-Ru; Guo, Leilei; Chen, Lizhen et al. (2009) Evidence for an additional base-pairing element between the telomeric repeat and the telomerase RNA template in Kluyveromyces lactis and other yeasts. Mol Cell Biol 29:5389-98

Showing the most recent 10 out of 13 publications