Our goal is to compare microbial genomics in terms of membrane protein structure, work falling into the emerging field of structural genomics. We will focus on the occurrence of membrane proteins composed of transmembrane (TM) helices and the interactions between pairs of these proteins. (I) Our first aim will be to inventory all the TM-helix proteins in the recently sequenced microbial genomes. Initially, we will use membrane- protein prediction methods based on transfer energy scales. Then we will try to improve upon these by building a Hidden Markov Model to identify membrane proteins. This probabilistic approach will allow us to systematically combine, in a Bayesian framework, prior information from biophysical scales with statistical information from the known membrane proteins. (Ii) Our second aim is to look at protein-protein interactions among helical membrane proteins from a database perspective. We will find all the common helix-helix interfaces in the database of known structures and compare these to the TM-helix oligomerization motifs found in genetic screens by the Beckwith and Engelman groups. In particular, we will measure the packing efficiency for all the helix-helix interfaces, trying to determine whether membrane-protein interfaces are packed less tightly than soluble ones. We will also see how often sequence motifs associated with TM-helix oligomerization occur in a number of genomes, estimating the fraction of proteins in a genome that could potentially interact via these motifs. (iii) Our final aim is to integrate into a comprehensive database the information on the occurrence and interaction of membrane proteins from the first two parts with further information, e.g. related to expression. This will allow us to compare genomes in terms of membrane-protein fold usage and look for TM-proteins common to many diverse organisms. It will also allow us to put the patterns of occurrence of TM-proteins into context, by comparing them to those of soluble proteins. We expect our analysis will initially involve approximately 10 genomes with this number increasing to approximately 100 during the funding period.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Program Projects (P01)
Project #
5P01GM054160-05
Application #
6324681
Study Section
Project Start
2000-07-01
Project End
2001-06-30
Budget Start
Budget End
Support Year
5
Fiscal Year
2000
Total Cost
$176,227
Indirect Cost
Name
Yale University
Department
Type
DUNS #
082359691
City
New Haven
State
CT
Country
United States
Zip Code
06520
Alexandrov, Vadim; Lehnert, Ursula; Echols, Nathaniel et al. (2005) Normal modes for predicting protein motions: a comprehensive database assessment and associated Web tool. Protein Sci 14:633-43
Freeman-Cook, Lisa L; Edwards, Anne P B; Dixon, Ann M et al. (2005) Specific locations of hydrophilic amino acids in constructed transmembrane ligands of the platelet-derived growth factor beta receptor. J Mol Biol 345:907-21
Balasubramanian, Suganthi; Xia, Yu; Freinkman, Elizaveta et al. (2005) Sequence variation in G-protein-coupled receptors: analysis of single nucleotide polymorphisms. Nucleic Acids Res 33:1710-21
Lehnert, Ursula; Xia, Yu; Royce, Thomas E et al. (2004) Computational analysis of membrane proteins: genomic occurrence, structure prediction and helix interactions. Q Rev Biophys 37:121-46
Freeman-Cook, Lisa L; Dixon, Ann M; Frank, Jennifer B et al. (2004) Selection and characterization of small random transmembrane proteins that bind and activate the platelet-derived growth factor beta receptor. J Mol Biol 338:907-20
Zhang, Zhaolei; Gerstein, Mark (2003) The human genome has 49 cytochrome c pseudogenes, including a relic of a primordial gene that still functions in mouse. Gene 312:61-72
Schneider, Dirk; Engelman, Donald M (2003) GALLEX, a measurement of heterologous association of transmembrane helices in a biological membrane. J Biol Chem 278:3105-11
Lin, J; Qian, J; Greenbaum, D et al. (2002) GeneCensus: genome comparisons in terms of metabolic pathway activity and protein family sharing. Nucleic Acids Res 30:4574-82
Jansen, R; Gerstein, M (2000) Analysis of the yeast transcriptome with structural and functional categories: characterizing highly expressed proteins. Nucleic Acids Res 28:1481-8
MacKenzie, K R; Prestegard, J H; Engelman, D M (1997) A transmembrane helix dimer: structure and implications. Science 276:131-3