The HAD superfamily is a large enzyme family (~19,000 nonredundant sequences) [1] of phosphotransferases (phosphomutases, ATPases and phosphatases) represented in all three kingdoms of life [2-4], and, within each cell, by a large number of homologs (28 in E. coli;35 in Salmonella typhimurium;31 in Pseudomonas aeruginosa;30 in Mycobacterium tuberculosis;31 in Bacillus cereus;24 in Bacteroides fragilis; 24 in Streptococcus pneumoniae;45 in Saccharomyces cerevisiae;84 in Caenorhabditis elegans;169 in Arabidopsis thaliana;292 in Selaginella moeltendorffii;183 in human). As many as 80-90% of the members are phosphatases [5], the vast majority of which have unknown functions. Approximately 40% of the bacterial metabolome is comprised of phosphorylated metabolites [6]. Phosphate substituents are common because they enhance the water solubility of the metabolite as well as its ability to bind to metabolic enzymes with high affinity and specificity. The removal of phosphate groups from phosphorylated metabolites is performed by phosphatases. The "function" of a particular phosphatase is defined by the phosphorylated metabolite that it targets in the cell, i.e., by its "physiological substrate". Thus, the HAD phosphatases meet the demands of cellular processes and metabolic pathways that involve phosphorylated macromolecules and metabolites. Divergence in HAD phosphatase funcfion is based on the divergence of the substrate-recognition elements. The substrate-recognition elements are separate from the catalytic scaffold, which is located in the core domain (Figure 1A). The four pepfide segments or "motifs" which form the active site position the consen/ed Asp nucleophile, Asp acid/base, the Lys/Arg and Ser/Thr phosphate-binding residues and the Mg^* cofactor Asp/Glu binding residues (Figure IB). These residues, in combinafion with the scaffold main-chain elements, form a steric and electrostafic mold that stabilizes the trigonal bipyramidal transifion states/intermediates produced along the reaction pathway (Figure 1B) [7]. The HAD phosphatase substrate recognition elements are located in either a cap domain (as in HAD classes Cl and C2, also known as Type I and Type 11) tethered to the core domain by a solvated linker, or in short loop/helical segments that extend from the core domain (as in the "capless" HAD class CO also known as Type III) (Figure 1A) [8]. Although HAD phosphatases possess the same catalytic site and proceed through the same second partial reaction, they are able to use the specific structural requirements of the substrate-binding step and the subsequent addition-eliminafion steps of the first partial reacfion to discriminate between the physiological substrate and other phosphorylated species (macromolecules and metabolites). The induced fit model, wherein substrate binding is followed by cap domain or loop closure, applies to most HAD phosphatases. Favorable electrostafic interaction between the substrate leaving group and the cap domain/gafing loops will contribute to the substrate-binding affinity. For efficient turnover, the phosphoryl group must be bound in the correct orientation within the catalyfic site. If the substrate-leaving group is too large or too small, nonproductive binding is likely to occur. Thus, the size, shape and electrostafic surface ofthe acfive site region that extends from the catalytic site to the active site entrance can provide significant insight into the identity of the physiological substrate. This serves as the basis for the use of virtual screening (made possible by the Structure Core and Computation Core) to identify candidates for the physiological substrate herein. Substrate specificities defined by experimental activity screens suggest that the typical HAD phosphatase has loose substrate specificity coupled with modest catalytic efficiency. Thus, acfivity screens alone often cannot idenfify the actual physiological substrate. Rather, they provide candidates that can be further interrogated using the tools provided by the Sequence/Genome Analysis Core and Microbiology Core.

National Institute of Health (NIH)
Specialized Center--Cooperative Agreements (U54)
Project #
Application #
Study Section
Special Emphasis Panel (ZGM1)
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Illinois Urbana-Champaign
United States
Zip Code
Mashiyama, Susan T; Malabanan, M Merced; Akiva, Eyal et al. (2014) Large-scale determination of sequence, structure, and function relationships in cytosolic glutathione transferases across the biosphere. PLoS Biol 12:e1001843
Akiva, Eyal; Brown, Shoshana; Almonacid, Daniel E et al. (2014) The Structure-Function Linkage Database. Nucleic Acids Res 42:D521-30
Zheng, Heping; Hou, Jing; Zimmerman, Matthew D et al. (2014) The future of crystallography in drug discovery. Expert Opin Drug Discov 9:125-37
Wichelecki, Daniel J; Graff, Dylan C; Al-Obaidi, Nawar et al. (2014) Identification of the in vivo function of the high-efficiency D-mannonate dehydratase in Caulobacter crescentus NA1000 from the enolase superfamily. Biochemistry 53:4087-9
Dong, Guang Qiang; Calhoun, Sara; Fan, Hao et al. (2014) Prediction of substrates for glutathione transferases by covalent docking. J Chem Inf Model 54:1687-99
Wichelecki, Daniel J; Vendiola, Jean Alyxa Ferolin; Jones, Amy M et al. (2014) Investigating the physiological roles of low-efficiency D-mannonate and D-gluconate dehydratases in the enolase superfamily: pathways for the catabolism of L-gulonate and L-idonate. Biochemistry 53:5692-9
Bouvier, Jason T; Groninger-Poe, Fiona P; Vetting, Matthew et al. (2014) Galactaro ?-lactone isomerase: lactone isomerization by a member of the amidohydrolase superfamily. Biochemistry 53:614-6
Wichelecki, Daniel J; Froese, D Sean; Kopec, Jolanta et al. (2014) Enzymatic and structural characterization of rTS? provides insights into the function of rTS?. Biochemistry 53:2732-8
Pandya, Chetanya; Dunaway-Mariano, Debra; Xia, Yu et al. (2014) Structure-guided approach for detecting large domain inserts in protein sequences as illustrated using the haloacid dehalogenase superfamily. Proteins 82:1896-906
Kumar, Ritesh; Zhao, Suwen; Vetting, Matthew W et al. (2014) Prediction and biochemical demonstration of a catabolic pathway for the osmoprotectant proline betaine. MBio 5:e00933-13

Showing the most recent 10 out of 49 publications