The goal of this research is to lay the foundations for what we call Genomic Enzymology, the extension of our understanding of enzyme chemistry to include the structural context. To accomplish this, we propose to investigate how nature re-engineers protein structures for new functions by studying enzyme superfamilies whose members have diverged to perform substantially different overall chemical reactions. We focus on enzymes because the relationships between protein structure and function can be more easily identified than in other classes of proteins. This is because the individual steps of the chemical reactions can be mapped explicitly to specific elements of their associated structures. To accomplish this goal, we propose 1) to cluster the known enzyme sequences into their respective superfamilies at divergence distances that are not currently available; 2) to examine the reactions of the enzymes in these superfamilies to distinguish the fundamental steps of their mechanisms that can be associated with the common elements of the superfamily architecture; 3) to use this information to generate a correlated structure/function """"""""fingerprint"""""""" for each superfamily that can be used to infer important overall and sub-group properties of the member enzymes, and 4) to organize the results into an internet-accessible database that can be used and further developed by the scientific community. For enzyme superfamilies that we have previously investigated, this work has led to the correct prediction for unknown reading frames, identification of new functions for previously characterized enzymes and insight into the fundamental aspects of enzyme mechanism for proteins that had been only poorly characterized. As proposed in this work, we expect that a systematic investigation of the universe of enzyme superfamilies will provide a conceptual framework for prediction of enzyme function for the many unknown reading frames that have been generated by the genome projects. Our proposal for identification of sequence elements associated with specific sub-groups within an enzyme superfamily will be useful for refining the assignments of function generated by automated annotation of genomic data. Finally, by providing a better understanding into the functional opportunities and constraints nature has engineered into a particularly superfamily scaffold, we expect that this research will provide information useful for reengineering of proteins in the laboratory and for rational drug design.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
1R01GM060595-01
Application #
6039020
Study Section
Physical Biochemistry Study Section (PB)
Program Officer
Jones, Warren
Project Start
2000-03-01
Project End
2005-02-28
Budget Start
2000-03-01
Budget End
2001-02-28
Support Year
1
Fiscal Year
2000
Total Cost
$242,513
Indirect Cost
Name
University of California San Francisco
Department
Pharmacology
Type
Schools of Pharmacy
DUNS #
073133571
City
San Francisco
State
CA
Country
United States
Zip Code
94143
Holliday, Gemma L; Akiva, Eyal; Meng, Elaine C et al. (2018) Atlas of the Radical SAM Superfamily: Divergent Evolution of Function Using a ""Plug and Play"" Domain. Methods Enzymol 606:1-71
Davidson, Rebecca; Baas, Bert-Jan; Akiva, Eyal et al. (2018) A global view of structure-function relationships in the tautomerase superfamily. J Biol Chem 293:2342-2357
LeVieux, Jake A; Baas, Bert-Jan; Kaoud, Tamer S et al. (2017) Kinetic and structural characterization of a cis-3-Chloroacrylic acid dehalogenase homologue in Pseudomonas sp. UW4: A potential step between subgroups in the tautomerase superfamily. Arch Biochem Biophys 636:50-56
Knutson, Stacy T; Westwood, Brian M; Leuthaeuser, Janelle B et al. (2017) An approach to functionally relevant clustering of the protein universe: Active site profile-based clustering of protein structures and sequences. Protein Sci 26:677-699
Finn, Robert D; Attwood, Teresa K; Babbitt, Patricia C et al. (2017) InterPro in 2017-beyond protein family and domain annotations. Nucleic Acids Res 45:D190-D199
Holliday, Gemma L; Davidson, Rebecca; Akiva, Eyal et al. (2017) Evaluating Functional Annotations of Enzymes Using the Gene Ontology. Methods Mol Biol 1446:111-132
Harper, Angela F; Leuthaeuser, Janelle B; Babbitt, Patricia C et al. (2017) An Atlas of Peroxiredoxins Created Using an Active Site Profile-Based Approach to Functionally Relevant Clustering of Proteins. PLoS Comput Biol 13:e1005284
Holliday, Gemma L; Brown, Shoshana D; Akiva, Eyal et al. (2017) Biocuration in the structure-function linkage database: the anatomy of a superfamily. Database (Oxford) 2017:
Holliday, Gemma L; Brown, Shoshana D; Akiva, Eyal et al. (2017) Biocuration in the structure-function linkage database: the anatomy of a superfamily. Database (Oxford) 2017:
Akiva, Eyal; Copp, Janine N; Tokuriki, Nobuhiko et al. (2017) Evolutionary and molecular foundations of multiple contemporary functions of the nitroreductase superfamily. Proc Natl Acad Sci U S A 114:E9549-E9558

Showing the most recent 10 out of 71 publications