Interaction Pattern Based Predictor of Protein Structure

Skolnick, Jeffrey

Abstract

The long-term goal of this project is to develop a structure-based approach for the prediction of protein molecular function so that the information provided by both genome sequencing and structural genomics can be more fully exploited. To achieve this overall objective, this proposal further develops a very promising and tightly integrated, sequence-to-structure-to-function approach that employs protein structure to predict protein- protein interactions, protein molecular function, and ligand binding sites. It also holds considerable promise for improved ligand screening. In particular, the following Specific Aims are proposed: (1) Monomeric sequence profile-based threading algorithms, which currently fail to find the good template structures in the PDB for the ~25% of single domain proteins with very low sequence identity to solved protein structures, will be extended and improved. (2) A purely structure-based version of threading will be developed, as the best contemporary threading algorithms have a strong evolutionary component that limits their structure recognition ability when the target and template proteins are evolutionarily distant or have analogous structures. In that regard, potentials of mean force suitable for structure-based threading will be derived from a new AMBER-related, physics-based atomic potential that shows significant ability to refine structures closer to native. (3) The multimeric structure prediction algorithm, m-TASSER, will be enhanced by improving the accuracy of interfacial side chain contact predictions and the use of physics-based interfacial potentials for structure refinement. In addition, by exploiting the fact that the library of single domain protein structures is likely complete, all-against-all docking will provide an estimate of the number of possible dimer complexes of single domain proteins. (4) The FINDSITE structure-based protein molecular function prediction algorithm will be extended and improved. Included are enhancements of its ligand screening ability based on the insight that for evolutionarily distant proteins, there are conserved anchor regions in both the protein binding site and in the 2 bound ligands that can be exploited for rapid ligand binding pose prediction and screening. (5) EFICAz , a precise enzyme function inference approach, will be combined with FINDSITE to develop a more powerful ligand screening approach. (6) The entire set of tools developed in Aims 1-5 will be applied to all sequenced 2 proteomes and the resulting sequence-to-structure-to-function, S F, database made available to the academic 2 community. Whole proteome structure predictions will be combined with EFICAz and FINDSITE to identify possible receptors of small regulatory molecules including the targets of anticancer metabolites, and to provide whole proteome screened ligand libraries, libraries of protein-protein interactions, quaternary structures and molecular functional annotations. In all cases, large scale, careful benchmarking will be done. Thus, this project holds the promise of making a significant impact across a wide spectrum of biologically important problems.

Public Health Relevance

The development and whole proteome application of the tightly integrated, protein sequence-to-structure- function approach described in this project will be of utility to a broad spectrum of researchers. By assisting in the early stages of drug discovery, the proposed algorithms could have significant therapeutic utility. Also, most of the estimated 650,000 protein-protein interactions in the human interactome are unknown;by providing predicted protein quaternary structures, insights into how these proteins perform their function will result.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Research Project (R01)
Project #: 2R01GM048835-16
Application #: 7649932
Study Section: Macromolecular Structure and Function D Study Section (MSFD)
Program Officer: Wehrle, Janna P

Project Start: 1994-05-01
Project End: 2013-01-31
Budget Start: 2009-03-01
Budget End: 2010-01-31
Support Year: 16
Fiscal Year: 2009
Total Cost: $294,450
Indirect Cost

Institution

Name: Georgia Institute of Technology
Department: Biology
Type: Schools of Arts and Sciences
DUNS #: 097394084

City: Atlanta
State: GA
Country: United States
Zip Code: 30332

Related projects

Publications

Srinivasan, Bharath; Marks, Hanna; Mitra, Sreyoshi et al. (2016) Catalytic and substrate promiscuity: distinct multiple chemistries catalysed by the phosphatase domain of receptor protein tyrosine phosphatase. Biochem J 473:2165-77

Zhou, Hongyi; Skolnick, Jeffrey (2016) A knowledge-based approach for predicting gene-disease associations. Bioinformatics 32:2831-8

Skolnick, Jeffrey; Gao, Mu; Zhou, Hongyi (2016) How special is the biochemical function of native proteins? F1000Res 5:

Gao, Mu; Zhou, Hongyi; Skolnick, Jeffrey (2015) Insights into Disease-Associated Mutations in the Human Proteome through Protein Structural Analysis. Structure 23:1362-9

Roy, Ambrish; Srinivasan, Bharath; Skolnick, Jeffrey (2015) PoLi: A Virtual Screening Pipeline Based on Template Pocket and Ligand Similarity. J Chem Inf Model 55:1757-70

Lee, Hui Sun; Jo, Sunhwan; Mukherjee, Srayanta et al. (2015) GS-align for glycan structure alignment and similarity measurement. Bioinformatics 31:2653-9

Dhakshinamoorthy, Suganthagunthalam; Dinh, Nha-Truc; Skolnick, Jeffrey et al. (2015) Metabolomics identifies the intersection of phosphoethanolamine with menaquinone-triggered apoptosis in an in vitro model of leukemia. Mol Biosyst 11:2406-16

Roy, Ambrish; Skolnick, Jeffrey (2015) LIGSIFT: an open-source tool for ligand structural alignment and virtual screening. Bioinformatics 31:539-44

Zhou, Hongyi; Gao, Mu; Skolnick, Jeffrey (2015) Comprehensive prediction of drug-protein interactions and side effects for the human proteome. Sci Rep 5:11090

Skolnick, Jeffrey; Gao, Mu; Roy, Ambrish et al. (2015) Implications of the small number of distinct ligand binding pockets in proteins for drug discovery, evolution and biochemical function. Bioorg Med Chem Lett 25:1163-70

Showing the most recent 10 out of 114 publications

Comments

Be the first to comment on Jeffrey Skolnick's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: