Structural genomics aims to provide a good experimental structure or computational model of every tractable protein in completely sequenced genomes. We propose to computationally select targets and model molecular structures as a component of the structural genomics of Mycoplasma genitalium and Mycoplasma pneumoniae. Underlying this goal is the immense value of protein structure, especially in permitting recognition of distant evolutionary relationships for proteins whose sequence analysis has failed to find any significant homolog. A considerable fraction of the genes in all sequenced genomes have no known function or only phenotypic descriptions, and structure determination provides a direct means of revealing homology that may be used to infer putative molecular function. Structure may also help answer scientific questions arising from genomes, such as the provenance of the orphan genes unique to a particular organism. More generally, knowledge of an increasingly complete repertoire of protein structures will aid structure prediction methods, improve understanding of protein structure, and ultimately lend insight into molecular interactions and pathways. We plan to use computational methods to select families based on proteins in the Mycoplasmas for which structures cannot be predicted and which are likely to be amenable to experimental characterization. Methods to be employed included modern sequence analysis, clustering algorithms, and incorporation of other groups' analyses from public databases. The protein families will be ranked according to several criteria including taxonomic diversity and known functional information. Individual proteins will be selected from these families as targets for structure determination. The solved three-dimensional structures will be examined for similarity to other proteins of known structure. Homologous proteins in sequence databases will be computationally modeled, to provide a resource of protein structure models complementing the experimentally solved protein structures. In addition, we will develop and deploy a laboratory information management system. To keep the community apprized of our work, we will create an active web presence and also regularly submit data regularly public repositories.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Specialized Center (P50)
Project #
3P50GM062412-02S1
Application #
6503008
Study Section
Special Emphasis Panel (ZGM1)
Project Start
2001-09-28
Project End
2002-08-31
Budget Start
Budget End
Support Year
2
Fiscal Year
2001
Total Cost
Indirect Cost
Name
University of California Berkeley
Department
Type
DUNS #
094878337
City
Berkeley
State
CA
Country
United States
Zip Code
94704
Jun, Se-Ran; Sims, Gregory E; Wu, Guohong A et al. (2010) Whole-proteome phylogeny of prokaryotes by feature frequency profiles: An alignment-free method with optimal feature resolution. Proc Natl Acad Sci U S A 107:133-8
Sims, Gregory E; Jun, Se-Ran; Wu, Guohong A et al. (2009) Alignment-free genome comparison with feature frequency profiles (FFP) and optimal resolutions. Proc Natl Acad Sci U S A 106:2677-82
Lee, Jonas; Kim, Sung-Hou (2009) High-throughput T7 LIC vector for introducing C-terminal poly-histidine tags with variable lengths without extra sequences. Protein Expr Purif 63:58-61
Wu, Guohong Albert; Jun, Se-Ran; Sims, Gregory E et al. (2009) Whole-proteome phylogeny of large dsDNA virus families by an alignment-free method. Proc Natl Acad Sci U S A 106:12826-31
Pereira, Jose Henrique; Kim, Sung-Hou (2009) Structure of human Brn-5 transcription factor in complex with CRH gene promoter. J Struct Biol 167:159-65
Sims, Gregory E; Jun, Se-Ran; Wu, Guohong Albert et al. (2009) Whole-genome phylogeny of mammals: evolutionary information in genic and nongenic regions. Proc Natl Acad Sci U S A 106:17077-82
Pereira, Jose Henrique; Ha, Sung Chul; Kim, Sung-Hou (2008) Crystallization and preliminary X-ray analysis of human Brn-5 transcription factor in complex with DNA. Acta Crystallogr Sect F Struct Biol Cryst Commun 64:175-8
Dahl, Christiane; Schulte, Andrea; Stockdreher, Yvonne et al. (2008) Structural and molecular genetic insight into a widespread sulfur oxidation pathway. J Mol Biol 384:1287-300
Shin, Dong Hae (2008) Preliminary structural studies on MPN423 expressed from an orthologous ORFan of Mycoplasma pneumoniae. Protein Pept Lett 15:753-5
Zwart, Peter H; Afonine, Pavel V; Grosse-Kunstleve, Ralf W et al. (2008) Automated structure solution with the PHENIX suite. Methods Mol Biol 426:419-35

Showing the most recent 10 out of 91 publications