Several families of proteins are under investigation for the determination of the relationships between structure and function. Most of these protein families have the property of binding to DNA or RNA, for example, the HMG-1 box proteins, heat shock factors, ets proteins and ribosomal proteins. The HMG-1 box family of proteins is not highly conserved and is represented by over 100 examples in the sequence databases. We have identified a signature for this family of proteins and have classified them into several groups according to their sequence and functional relation. In the past few years, numerous proteins have been identified as containing a stretch of about 75 amino acids which are homologous to an abundant non-histone chromosomal protein HMG-1. These proteins bind DNA and bends it on binding or bind preferentially to bent DNA. Several of these proteins have been implicated in numerous nuclear functions including transcription, replication, and chromatin structure as well as transcription regulation in mitochondria resulting, in some cases, in such phenotypes as sex and mating type determination. We have compiled and are maintaining a database of the HMG-1 box family of proteins and are analyzing the sequences to determine the phylogeny between these functionally widely-diverse proteins. Recently, the 3D structure of several HMG-1 box domains have been determined by multi-dimensional NMR. We have used one of these structure to model the other members of the family by a threading method. We have successfully produced models for the complete family of HMG-1 box domains. The conclusion drawn from these calculations is that these proteins are mostly likely to fold into similar conformations. We are using a threading technique to compare sequences of RNA--binding proteins to the 3D structures of several RNA- binding proteins, which includes many ribosomal proteins and a splicing factor. This method allows the determination of the ability of a sequence to fold into a particular known protein conformation. We are presently comparing ribosomal proteins to the known protein structures and are also modeling other RNA-binding proteins. An iterative motif search algorithms being used to detect new and as yet unidentified motifs of several families of DNA-binding and RNA-binding proteins. We have identified several proteins which contain domains which could be related to the histone fold of nucleosome octamer histones. We are extending this study to include the ability to identify partners in a dimerization interaction. Also, the relationship between the heat shock factor and ets families of proteins has been identified by the same technique.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Intramural Research (Z01)
Project #
1Z01LM000071-02
Application #
2578644
Study Section
Special Emphasis Panel (CBB)
Project Start
Project End
Budget Start
Budget End
Support Year
2
Fiscal Year
1996
Total Cost
Indirect Cost
Name
National Library of Medicine
Department
Type
DUNS #
City
State
Country
United States
Zip Code
Holliday, Gemma L; Bairoch, Amos; Bagos, Pantelis G et al. (2015) Key challenges for the creation and maintenance of specialist protein resources. Proteins 83:1005-13
Marino-Ramirez, Leonardo; Hsu, Benjamin; Baxevanis, Andreas D et al. (2006) The Histone Database: a comprehensive resource for histones and histone fold-containing proteins. Proteins 62:838-42
Marino-Ramirez, Leonardo; Kann, Maricel G; Shoemaker, Benjamin A et al. (2005) Histone structure and nucleosome stability. Expert Rev Proteomics 2:719-29
Sullivan, Steven A; Landsman, David (2004) Mining core histone sequences from public protein databases. Methods Enzymol 375:3-20
Sullivan, Steven A; Landsman, David (2003) Characterization of sequence variability in nucleosome core histone folds. Proteins 52:454-65
Strichman-Almashanu, Liora Z; Bustin, Michael; Landsman, David (2003) Retroposed copies of the HMG genes: a window to genome dynamics. Genome Res 13:800-12
Sullivan, Steven; Sink, Daniel W; Trout, Kenneth L et al. (2002) The Histone Database. Nucleic Acids Res 30:341-2
Fassler, Jan; Landsman, David; Acharya, Asha et al. (2002) B-ZIP proteins encoded by the Drosophila genome: evaluation of potential dimerization partners. Genome Res 12:1190-200
Birger, Y; Ito, Y; West, K L et al. (2001) HMGN4, a newly discovered nucleosome-binding protein encoded by an intronless gene. DNA Cell Biol 20:257-64
Nestler, E J; Landsman, D (2001) Learning about addiction from the genome. Nature 409:834-5

Showing the most recent 10 out of 13 publications