Functional identification of unknown proteins discovered in genome projects remains a major challenge for contemporary biology. This Program Project is focused on developing an integrated strategy for (nontrivial) functional assignment of unknown enzymes by predicting the substrate specificities of members of the functionally diverse enolase, amidohydrolase (AH), and D-ribulose 1,5-bisphosphate carboxylase/oxygenase (RuBisCO) superfamilies that share the ubiquitous (p/a)8-fold. In the past project period, the Program Project brought together expertise in computational enzymology (bioinformatics, homology modeling, and molecular docking), structural enzymology (high resolution x-ray structural analysis), and functional enzymology (protein purification, measurement of function, and determination of mechanism). We demonstrated that accurate computational prediction of substrate specificities of uncharacterized enzymes is possible using either an experimentally determined structure or a homology model, thereby facilitating experimental verification of function. In this competing renewal application, a new focus is on unknown members of the enolase, AH, and RuBisCO superfamilies that participate in novel metabolic pathways as deduced by operon context. This new focus adds to our previous """"""""one enzyme-one function"""""""" approach and is based on the expectation that enzymes that occur in the same metabolic pathway will share conserved elements of substrate specificity, facilitating functional assignment of not only the unknown superfamily """"""""targets"""""""" but, also, the entire metabolic pathway. The impact of this approach is considerable because it will identify new enzymes, new metabolites, new pathways, and, therefore, new biology. The integrated strategy developed in this Project will be applicable to deciphering the ligand specificity of any uncharacterized enzyme. The goals of this Program Project extend the contribution of the Protein Structure Initiative funded by NIGMS that seeks to obtain structures for proteins of unknown function that will allow reliable homology modeling.

Public Health Relevance

The assignment of functions to the complete set of proteins encoded by genomes is a major problem. However, when this problem is solved, their roles in molecular, cellular, and organismal functions will be known and novel targets for specific small molecule intervention can be identified, thereby providing new approaches for therapeutic design. This Program Project is focused on developing and implementing an integrated sequence-structure-computation strategy for predicting the substrate specificities of uncharacterized proteins discovered in genome projects, thereby facilitating their functional assignment.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Program Projects (P01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-BCMB-D (40))
Program Officer
Anderson, Vernon
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Illinois Urbana-Champaign
Schools of Arts and Sciences
United States
Zip Code
Holliday, Gemma L; Brown, Shoshana D; Akiva, Eyal et al. (2017) Biocuration in the structure-function linkage database: the anatomy of a superfamily. Database (Oxford) 2017:
Holliday, Gemma L; Brown, Shoshana D; Akiva, Eyal et al. (2017) Biocuration in the structure-function linkage database: the anatomy of a superfamily. Database (Oxford) 2017:
Webb, Benjamin; Sali, Andrej (2016) Comparative Protein Structure Modeling Using MODELLER. Curr Protoc Bioinformatics 54:5.6.1-5.6.37
Vladimirova, Anna; Patskovsky, Yury; Fedorov, Alexander A et al. (2016) Substrate Distortion and the Catalytic Reaction Mechanism of 5-Carboxyvanillate Decarboxylase. J Am Chem Soc 138:826-36
Fedorov, Alexander A; Martí-Arbona, Ricardo; Nemmara, Venkatesh V et al. (2015) Structure of N-formimino-L-glutamate iminohydrolase from Pseudomonas aeruginosa. Biochemistry 54:890-7
Xiang, Dao Feng; Patskovsky, Yury; Nemmara, Venkatesh V et al. (2015) Function discovery and structural characterization of a methylphosphonate esterase. Biochemistry 54:2919-30
Zhang, Xinshuai; Kumar, Ritesh; Vetting, Matthew W et al. (2015) A unique cis-3-hydroxy-l-proline dehydratase in the enolase superfamily. J Am Chem Soc 137:1388-91
Akiva, Eyal; Brown, Shoshana; Almonacid, Daniel E et al. (2014) The Structure-Function Linkage Database. Nucleic Acids Res 42:D521-30
Korczynska, Magdalena; Xiang, Dao Feng; Zhang, Zhening et al. (2014) Functional annotation and structural characterization of a novel lactonase hydrolyzing D-xylono-1,4-lactone-5-phosphate and L-arabino-1,4-lactone-5-phosphate. Biochemistry 53:4727-38
Brown, Shoshana; Babbitt, Patricia (2014) Using the structure-function linkage database to characterize functional domains in enzymes. Curr Protoc Bioinformatics 48:2.10.1-16

Showing the most recent 10 out of 120 publications