We have developed an automatic protein fingerprinting method for the evaluation of protein structural similarities based on secondary structure element compositions, spatial arrangements, lengths, and topologies. This method rapidly and accurately identifies proteins sharing structural homologies in five test cases: the globins, the mammalian trypsin-like serine proteases, the immunoglobulins, the cupredoxins, and the actin-like ATPase domain-containing proteins. Principal components analysis (PCA) of the similarity distance matrix calculated from an all- by-all comparison of 1031 unique chains in the Protein Data Bank has produced a distribution of structures within a high-dimensional structural space. Fifty percent of the variance observed for this distribution is bounded by 6 axes, 2 of which encode structural variability within 2 large families, the immunoglobulins and the trypsin-like serine proteases. The structure of the universe remains stable upon reduction of the database to 140 representatives and subsequent PCA, although the axes correlated with specific structural families are no longer observed. A clear hierarchy of organization is seen in the arrangement of protein structures in the universe. At the highest level, protein structures populate regions corresponding to the all-alpha, all-beta, and alpha/beta superfamilies. Large protein families are arranged along family-specific axes, forming local densely-populated regions within the space. The lowest level of organization is intrafamilial; homologous structures are ordered by variaions in peripheral secondary structure elements or by conformational shifts in the tertiary structure.

Agency
National Institute of Health (NIH)
Institute
National Center for Research Resources (NCRR)
Type
Biotechnology Resource Grants (P41)
Project #
5P41RR001081-21
Application #
6280234
Study Section
Project Start
1998-07-01
Project End
1999-06-30
Budget Start
1997-10-01
Budget End
1998-09-30
Support Year
21
Fiscal Year
1998
Total Cost
Indirect Cost
Name
University of California San Francisco
Department
Type
DUNS #
073133571
City
San Francisco
State
CA
Country
United States
Zip Code
94143
Kozak, John J; Gray, Harry B; Garza-López, Roberto A (2018) Relaxation of structural constraints during Amicyanin unfolding. J Inorg Biochem 179:135-145
Alamo, Lorenzo; Pinto, Antonio; Sulbarán, Guidenn et al. (2018) Lessons from a tarantula: new insights into myosin interacting-heads motif evolution and its implications on disease. Biophys Rev 10:1465-1477
Viswanath, Shruthi; Chemmama, Ilan E; Cimermancic, Peter et al. (2017) Assessing Exhaustiveness of Stochastic Sampling for Integrative Modeling of Macromolecular Structures. Biophys J 113:2344-2353
Chu, Shidong; Zhou, Guangyan; Gochin, Miriam (2017) Evaluation of ligand-based NMR screening methods to characterize small molecule binding to HIV-1 glycoprotein-41. Org Biomol Chem 15:5210-5219
Portioli, Corinne; Bovi, Michele; Benati, Donatella et al. (2017) Novel functionalization strategies of polymeric nanoparticles as carriers for brain medications. J Biomed Mater Res A 105:847-858
Alamo, Lorenzo; Koubassova, Natalia; Pinto, Antonio et al. (2017) Lessons from a tarantula: new insights into muscle thick filament and myosin interacting-heads motif structure and function. Biophys Rev 9:461-480
Nguyen, Hai Dang; Yadav, Tribhuwan; Giri, Sumanprava et al. (2017) Functions of Replication Protein A as a Sensor of R Loops and a Regulator of RNaseH1. Mol Cell 65:832-847.e4
Sofiyev, Vladimir; Kaur, Hardeep; Snyder, Beth A et al. (2017) Enhanced potency of bivalent small molecule gp41 inhibitors. Bioorg Med Chem 25:408-420
Nekouzadeh, Ali; Rudy, Yoram (2016) Conformational changes of an ion-channel during gating and emerging electrophysiologic properties: Application of a computational approach to cardiac Kv7.1. Prog Biophys Mol Biol 120:18-27
Towse, Clare-Louise; Vymetal, Jiri; Vondrasek, Jiri et al. (2016) Insights into Unfolded Proteins from the Intrinsic ?/? Propensities of the AAXAA Host-Guest Series. Biophys J 110:348-361

Showing the most recent 10 out of 508 publications