The long-term goal of this project is to develop a structure-based approach for the prediction of protein molecular function so that the information provided by both genome sequencing and structural genomics can be more fully exploited. To achieve this overall objective, this proposal further develops a very promising and tightly integrated, sequence-to-structure-to-function approach that employs protein structure to predict protein- protein interactions, protein molecular function, and ligand binding sites. It also holds considerable promise for improved ligand screening. In particular, the following Specific Aims are proposed: (1) Monomeric sequence profile-based threading algorithms, which currently fail to find the good template structures in the PDB for the ~25% of single domain proteins with very low sequence identity to solved protein structures, will be extended and improved. (2) A purely structure-based version of threading will be developed, as the best contemporary threading algorithms have a strong evolutionary component that limits their structure recognition ability when the target and template proteins are evolutionarily distant or have analogous structures. In that regard, potentials of mean force suitable for structure-based threading will be derived from a new AMBER-related, physics-based atomic potential that shows significant ability to refine structures closer to native. (3) The multimeric structure prediction algorithm, m-TASSER, will be enhanced by improving the accuracy of interfacial side chain contact predictions and the use of physics-based interfacial potentials for structure refinement. In addition, by exploiting the fact that the library of single domain protein structures is likely complete, all-against-all docking will provide an estimate of the number of possible dimer complexes of single domain proteins. (4) The FINDSITE structure-based protein molecular function prediction algorithm will be extended and improved. Included are enhancements of its ligand screening ability based on the insight that for evolutionarily distant proteins, there are conserved anchor regions in both the protein binding site and in the 2 bound ligands that can be exploited for rapid ligand binding pose prediction and screening. (5) EFICAz , a precise enzyme function inference approach, will be combined with FINDSITE to develop a more powerful ligand screening approach. (6) The entire set of tools developed in Aims 1-5 will be applied to all sequenced 2 proteomes and the resulting sequence-to-structure-to-function, S F, database made available to the academic 2 community. Whole proteome structure predictions will be combined with EFICAz and FINDSITE to identify possible receptors of small regulatory molecules including the targets of anticancer metabolites, and to provide whole proteome screened ligand libraries, libraries of protein-protein interactions, quaternary structures and molecular functional annotations. In all cases, large scale, careful benchmarking will be done. Thus, this project holds the promise of making a significant impact across a wide spectrum of biologically important problems.

Public Health Relevance

The development and whole proteome application of the tightly integrated, protein sequence-to-structure- function approach described in this project will be of utility to a broad spectrum of researchers. By assisting in the early stages of drug discovery, the proposed algorithms could have significant therapeutic utility. Also, most of the estimated 650,000 protein-protein interactions in the human interactome are unknown;by providing predicted protein quaternary structures, insights into how these proteins perform their function will result.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM048835-18
Application #
8029588
Study Section
Macromolecular Structure and Function D Study Section (MSFD)
Program Officer
Wehrle, Janna P
Project Start
1994-05-01
Project End
2013-01-31
Budget Start
2011-02-01
Budget End
2012-01-31
Support Year
18
Fiscal Year
2011
Total Cost
$288,591
Indirect Cost
Name
Georgia Institute of Technology
Department
Biology
Type
Schools of Arts and Sciences
DUNS #
097394084
City
Atlanta
State
GA
Country
United States
Zip Code
30332
Srinivasan, Bharath; Marks, Hanna; Mitra, Sreyoshi et al. (2016) Catalytic and substrate promiscuity: distinct multiple chemistries catalysed by the phosphatase domain of receptor protein tyrosine phosphatase. Biochem J 473:2165-77
Zhou, Hongyi; Skolnick, Jeffrey (2016) A knowledge-based approach for predicting gene-disease associations. Bioinformatics 32:2831-8
Skolnick, Jeffrey; Gao, Mu; Zhou, Hongyi (2016) How special is the biochemical function of native proteins? F1000Res 5:
Roy, Ambrish; Skolnick, Jeffrey (2015) LIGSIFT: an open-source tool for ligand structural alignment and virtual screening. Bioinformatics 31:539-44
Zhou, Hongyi; Gao, Mu; Skolnick, Jeffrey (2015) Comprehensive prediction of drug-protein interactions and side effects for the human proteome. Sci Rep 5:11090
Skolnick, Jeffrey; Gao, Mu; Roy, Ambrish et al. (2015) Implications of the small number of distinct ligand binding pockets in proteins for drug discovery, evolution and biochemical function. Bioorg Med Chem Lett 25:1163-70
Srinivasan, Bharath; Skolnick, Jeffrey (2015) Insights into the slow-onset tight-binding inhibition of Escherichia coli dihydrofolate reductase: detailed mechanistic characterization of pyrrolo [3,2-f] quinazoline-1,3-diamine and its derivatives as novel tight-binding inhibitors. FEBS J 282:1922-38
Srinivasan, Bharath; Tonddast-Navaei, Sam; Skolnick, Jeffrey (2015) Ligand binding studies, preliminary structure-activity relationship and detailed mechanistic characterization of 1-phenyl-6,6-dimethyl-1,3,5-triazine-2,4-diamine derivatives as inhibitors of Escherichia coli dihydrofolate reductase. Eur J Med Chem 103:600-14
Boles, Richard G; Hornung, Holly A; Moody, Alastair E et al. (2015) Hurt, tired and queasy: Specific variants in the ATPase domain of the TRAP1 mitochondrial chaperone are associated with common, chronic ""functional"" symptomatology including pain, fatigue and gastrointestinal dysmotility. Mitochondrion 23:64-70
Gao, Mu; Zhou, Hongyi; Skolnick, Jeffrey (2015) Insights into Disease-Associated Mutations in the Human Proteome through Protein Structural Analysis. Structure 23:1362-9

Showing the most recent 10 out of 114 publications