The prediction of the three dimensional structure of a globular protein from its amino acid sequence along with the mechanism by which protein folding occurs are among the most important unsolved problems of contemporary molecular biology. The overall objectives of this proposal are the continued development and refinement of algorithms which not only can predict protein tertiary structure using only sequence information as input but also may provide insights into the folding pathway. To achieve these goals, this proposal focuses on the lattice based aspects of a hierarchical approach to protein folding. High resolution lattice models of proteins, comprised of an alpha-carbon plus reduced off lattice, side chain description, will provide the overall folding pathways and folded conformations. The resulting folded lattice structures are estimated for the alpha-carbons to have a 2-4 angstroms rms deviation from the native state. Turning to the folding pathways, the predicted molten globule states and their free energy landscape will be characterized in detail. The factors responsible for side chain fixation on passage from the molten globule to the native state will be explored, with particular attention focused on the interplay of protein sequence and side chain packing. Specifically this proposal will address the following. (1). A new high coordination lattice model of proteins will be refined, different side chain realizations will be examined and the dynamic Monte Carlo algorithms parallelized. (2). Better empirical free energy functions will be developed. These include better methods for predicting the propensities for secondary structure and generalization of the hydrogen bond scheme to include backbone-side chain hydrogen bonds. To help eliminate misfolded structures, additional very robust knowledge based rules, such as the connections in supersecondary structural elements do not cross, will be included in the interaction scheme. Sequence specific tertiary interactions including a local burial turn, pair interactions and generalized cooperative multibody side chain contact templates will be self consistently derived in the presence of predicted secondary structure propensities. Then, a recently developed neural network which can recognize whether 7 by 7 subfragments of sidechain contact maps are protein like or not will be extended to include sequence specific preferences for subsequences to adopt specific patterns. This information will be obtained from a neural network trained on both homologous and non homologous subsequences that adopt these patterns. Thus, it should be general and not simply applicable to homologous sequence fragments. (3). The folding of representative motifs of globular proteins will be undertaken. Included are the helical proteins such as cytochrome c, whose predicted folding pathway will be compared to experiment, myohemerythrin, myoglobin and complement factor, 1c5a. The mixed motif proteins include ubiquitin, flavodoxin and PRA isomerase, and the beta-proteins include the 16th complement control protein of factor H, 1hcc, alpha-amylase, plastocyanin and retinol binding protein. (4). To validate the methodology, additional blind predictions of proteins whose structures are unknown will be undertaken. Likely candidates include rusticyanin and erythropoietin.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM037408-09
Application #
2178769
Study Section
Molecular and Cellular Biophysics Study Section (BBCA)
Project Start
1986-12-01
Project End
1997-08-31
Budget Start
1995-09-01
Budget End
1996-08-31
Support Year
9
Fiscal Year
1995
Total Cost
Indirect Cost
Name
Scripps Research Institute
Department
Type
DUNS #
City
La Jolla
State
CA
Country
United States
Zip Code
92037
Zhou, Hongyi; Gao, Mu; Skolnick, Jeffrey (2016) ENTPRISE: An Algorithm for Predicting Human Disease-Associated Amino Acid Substitutions from Sequence Entropy and Predicted Protein Structures. PLoS One 11:e0150965
Zhou, Hongyi; Gao, Mu; Skolnick, Jeffrey (2015) Comprehensive prediction of drug-protein interactions and side effects for the human proteome. Sci Rep 5:11090
Skolnick, Jeffrey; Gao, Mu; Roy, Ambrish et al. (2015) Implications of the small number of distinct ligand binding pockets in proteins for drug discovery, evolution and biochemical function. Bioorg Med Chem Lett 25:1163-70
Srinivasan, Bharath; Skolnick, Jeffrey (2015) Insights into the slow-onset tight-binding inhibition of Escherichia coli dihydrofolate reductase: detailed mechanistic characterization of pyrrolo [3,2-f] quinazoline-1,3-diamine and its derivatives as novel tight-binding inhibitors. FEBS J 282:1922-38
Tonddast-Navaei, Sam; Skolnick, Jeffrey (2015) Are protein-protein interfaces special regions on a protein's surface? J Chem Phys 143:243149
Srinivasan, Bharath; Tonddast-Navaei, Sam; Skolnick, Jeffrey (2015) Ligand binding studies, preliminary structure-activity relationship and detailed mechanistic characterization of 1-phenyl-6,6-dimethyl-1,3,5-triazine-2,4-diamine derivatives as inhibitors of Escherichia coli dihydrofolate reductase. Eur J Med Chem 103:600-14
Boles, Richard G; Hornung, Holly A; Moody, Alastair E et al. (2015) Hurt, tired and queasy: Specific variants in the ATPase domain of the TRAP1 mitochondrial chaperone are associated with common, chronic ""functional"" symptomatology including pain, fatigue and gastrointestinal dysmotility. Mitochondrion 23:64-70
Gao, Mu; Zhou, Hongyi; Skolnick, Jeffrey (2015) Insights into Disease-Associated Mutations in the Human Proteome through Protein Structural Analysis. Structure 23:1362-9
Roy, Ambrish; Srinivasan, Bharath; Skolnick, Jeffrey (2015) PoLi: A Virtual Screening Pipeline Based on Template Pocket and Ligand Similarity. J Chem Inf Model 55:1757-70
Roy, Ambrish; Skolnick, Jeffrey (2015) LIGSIFT: an open-source tool for ligand structural alignment and virtual screening. Bioinformatics 31:539-44

Showing the most recent 10 out of 121 publications