The long-term aim of this work is to develop evolution-based technologies for the study of biological function in open reading frames (ORFs) identified by genome projects.
The specific aims of this proposal are to: 1. Implement structure prediction tools to identify long distance homologs in genomic databases. This is the first step towards assigning function to an open reading frame (ORF) with unknown function. Prediction of secondary structure is a powerful tool for identifying long distance homologies that cannot be detected by simple comparison of sequence data. Because structure prediction requires sequence data as the only input, it is a low cost way of expanding the value of existing genome sequence databases. 2. Implement tools for reconstructing the evolutionary history of nuclear families of proteins. The tools will identify in the evolutionary history of nuclear families of proteins episodes of divergent evolution of a type that indicates conserved function, and other episodes of divergent evolution of a type that characterizes divergence without function. These tools will permit the user to assess the likelihood that the function of a homologous protein is the same as that of an ORF obtained from the genome database. This tool also requires only sequence data as input, making it another low cost way of expanding the value of genome sequence databases. 3. Develop tools for correlating episodes of functional evolution in protein families with geological periods where new physiology emerged in metazoan animals. These tools will allow a user to generate hypotheses relating protein evolution to possible functions of ORFs. This tool draws on existing and developing paleontological databases, and couples sequence data directly to biological function. These tools will be implemented within Darwin, a comprehensive platform for performing theoretical and practical genomic analysis. Further, they will directly interface with the new field of experimental paleobiochemistry, which permits the experimental testing of hypotheses relating function and behavior derived from the tools described above.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG001729-02
Application #
2674281
Study Section
Special Emphasis Panel (ZHG1-HGR-N (O1))
Project Start
1997-09-30
Project End
2000-07-31
Budget Start
1998-08-01
Budget End
1999-07-31
Support Year
2
Fiscal Year
1998
Total Cost
Indirect Cost
Name
University of Florida
Department
Chemistry
Type
Schools of Arts and Sciences
DUNS #
073130411
City
Gainesville
State
FL
Country
United States
Zip Code
32611
Liberles, D A; Schreiber, D R; Govindarajan, S et al. (2001) The adaptive evolution database (TAED). Genome Biol 2:RESEARCH0028
Liberles, D A; Schreiber, D R; Govindarajan, S et al. (2001) The Adaptive Evolution Database (TAED). Genome Biol 2:PREPRINT0003
Peltier, M R; Raley, L C; Liberles, D A et al. (2000) Evolutionary history of the uterine serpins. J Exp Zool 288:165-74
Benner, S A; Chamberlin, S G; Liberles, D A et al. (2000) Functional inferences from reconstructed evolutionary biology involving rectified databases--an evolutionarily grounded approach to functional genomics. Res Microbiol 151:97-106