The majority of genes in bacterial genomes, even in species for which extensive experimental evidence is available, are of hypothetical or unknown functions.
The aims of this proposal are to investigate this enigmatic class of genes by elucidating the source and functions of """"""""ORFans"""""""", i.e., sequences within a genome that encode proteins having no homology (and often no structural similarity) to proteins in any other genome. Moreover, the uniqueness of ORFan genes prohibits use of any of homology-based methods that have traditionally been employed to establish gene function. Thus, these genes present a major challenge to discovering their roles in bacterial genomes. In many respects, these genes constitute the most intriguing portion of bacterial genomes because they give clue to how new genes originate, and likely contribute to the remarkable diversification and adaptation of bacteria. Although it has been hypothesized that ORFans might represent non-coding regions rather than actual genes, we have recently established that the vast majority that ORFans present in the E. coli genome are under selective constraints and encode functional proteins. By combining experimental and bioinformatic approaches, the present proposal will analyze the origins, functions and structural properties of ORFans, and how they have assumed key roles in cellular function.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM074738-02
Application #
7095968
Study Section
Genetic Variation and Evolution Study Section (GVE)
Program Officer
Eckstrand, Irene A
Project Start
2005-08-01
Project End
2009-07-31
Budget Start
2006-08-01
Budget End
2007-07-31
Support Year
2
Fiscal Year
2006
Total Cost
$248,825
Indirect Cost
Name
University of Arizona
Department
Biochemistry
Type
Schools of Arts and Sciences
DUNS #
806345617
City
Tucson
State
AZ
Country
United States
Zip Code
85721
Raghavan, Rahul; Kacharia, Fenil R; Millar, Jess A et al. (2015) Genome rearrangements can make and break small RNA genes. Genome Biol Evol 7:557-66
Sloan, Daniel B; Bennett, Gordon M; Engel, Philipp et al. (2013) Disentangling associated genomes. Methods Enzymol 531:445-64
Kelkar, Yogeshwar D; Ochman, Howard (2013) Genome reduction promotes increase in protein functional complexity in bacteria. Genetics 193:303-7
Kelkar, Yogeshwar D; Ochman, Howard (2012) Causes and consequences of genome expansion in fungi. Genome Biol Evol 4:13-23
Raghavan, Rahul; Sloan, Daniel B; Ochman, Howard (2012) Antisense transcription is pervasive but rarely conserved in enteric bacteria. MBio 3:
Degnan, Patrick H; Pusey, Anne E; Lonsdorf, Elizabeth V et al. (2012) Factors associated with the diversification of the gut microbial communities within chimpanzees from Gombe National Park. Proc Natl Acad Sci U S A 109:13034-9
Raghavan, Rahul; Kelkar, Yogeshwar D; Ochman, Howard (2012) A selective force favoring increased G+C content in bacterial genes. Proc Natl Acad Sci U S A 109:14504-7
Raghavan, Rahul; Groisman, Eduardo A; Ochman, Howard (2011) Genome-wide detection of novel regulatory RNAs in E. coli. Genome Res 21:1487-97
Raghavan, Rahul; Sage, Alan; Ochman, Howard (2011) Genome-wide identification of transcription start sites yields a novel thermosensing RNA and new cyclic AMP receptor protein-regulated genes in Escherichia coli. J Bacteriol 193:2871-4
Jarvik, Tyler; Smillie, Chris; Groisman, Eduardo A et al. (2010) Short-term signatures of evolutionary change in the Salmonella enterica serovar typhimurium 14028 genome. J Bacteriol 192:560-7

Showing the most recent 10 out of 15 publications