The amino acid sequence of a protein uniquely determines its tertiary structure. Deciphering this relationship, the protein folding problem has become increasingly important to molecular biologists. DNA sequencing has become routine, but structural experiments remain very difficult. Computational strategies are needed to help address this problem. This proposal describes a strategy to identify the location of alpha- helices and beta-strands throughout the sequence. A method for using off- lattice simulations of a polypeptide chain to identify secondary structure preferences in the ensemble average is proposed. Once secondary structure is located, computational methods exist for generating plausible tertiary structures. However, these combinatorial strategies give rise to a large number of alternative structures which are difficult to distinguish from the correct fold. Experimental and theoretical methods for clarifying the distinction between correctly folded structures and their misfolded counterparts will be considered. In a new direction, we propose to develop a multiple sequence analysis strategy to relate sequence and structure to function. In particular, we will focus on identifying the binding sites on the G-alpha family of GTPases for the relevant G-protein coupled receptors, G-beta-gamma and downstream effectors. We plan to continue to develop a genetic algorithm for the construction of polypeptide loops subject to a series of constraints. This method will be used to model the loop regions of G- protein coupled receptors involved in the interaction with peptide ligands and the hetero-trimeric G-protein complex. Finally, we propose to develop a new method to compare structures based on the area of the minimal """"""""soap film"""""""" that could join them following the appropriate rotation and translation of one structure relative to another. This provides a natural way to circumvent the gap penalty problem that plagues current structure alignment algorithms.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM039900-11
Application #
2838542
Study Section
Special Emphasis Panel (ZRG3-BMT (01))
Project Start
1988-12-01
Project End
1999-11-30
Budget Start
1998-12-01
Budget End
1999-11-30
Support Year
11
Fiscal Year
1999
Total Cost
Indirect Cost
Name
University of California San Francisco
Department
Pharmacology
Type
Schools of Pharmacy
DUNS #
073133571
City
San Francisco
State
CA
Country
United States
Zip Code
94143
Chandonia, John-Marc (2007) StrBioLib: a Java library for development of custom computational structural biology applications. Bioinformatics 23:2018-20
Li, Z; Chen, X; Davidson, E et al. (1994) Anti-malarial drug development using models of enzyme structure. Chem Biol 1:31-7
Harris, N L; Presnell, S R; Cohen, F E (1994) Four helix bundle diversity in globular proteins. J Mol Biol 236:1356-68
Hearst, D P; Cohen, F E (1994) GRAFTER: a computational aid for the design of novel proteins. Protein Eng 7:1411-21
Jin, L; Cohen, F E; Wells, J A (1994) Structure from function: screening structural models with functional data. Proc Natl Acad Sci U S A 91:113-7
Hunt, N G; Gregoret, L M; Cohen, F E (1994) The origins of protein secondary structure. Effects of packing density and hydrogen bonding studied by a fast conformational search. J Mol Biol 241:214-25
Ring, C S; Cohen, F E (1993) Modeling protein structures: construction and their applications. FASEB J 7:783-90
Sun, E; Cohen, F E (1993) Computer-assisted drug discovery--a review. Gene 137:127-32
Aroeti, B; Kosen, P A; Kuntz, I D et al. (1993) Mutational and secondary structural analysis of the basolateral sorting signal of the polymeric immunoglobulin receptor. J Cell Biol 123:1149-60
Presnell, S R; Cohen, F E (1993) Artificial neural networks for pattern recognition in biochemical sequences. Annu Rev Biophys Biomol Struct 22:283-98

Showing the most recent 10 out of 26 publications