The Molecular Modeling Section develops and uses theoretical tools to study and predict the structure and stability of globular protein molecules and to design protein molecules with new or improved properties. It also analyzes the Expressed Sequence Tag (EST) cDNA database to discover new genes for use as targets for immunotoxins in cancer therapy. Following are some of the notable achievements made during this reporting period: (a) We began to develop a new method for recognizing protein structural types from its sequence alone (fold recognition). The method uses the pair-to-pair scoring matrix we developed in the previous year, as well as a new contact potential and a novel sequence-structure alignment procedure. The method is currently under active development. (b) The ab initio protein structure search engine that was partially developed in the previous year has been essentially completed this year by adding efficient local moves. We are now developing the energy function that will drive this engine. (c) A procedure was developed for systematically searching for circularly permuted protein structures. Using this procedure, it was found that nearly half of all known protein structural domains show evidence of having been circularly permuted with respect to some other protein in the database. This finding suggests that many of the protein domains have been produced by exon shuffling and gene duplication/fusion events. (d) We have our own in-house molecular graphics program, which is being heavily used. However this program was developed about 15 years ago and does not take advantage of the modern features of the graphics software and hardware. Instead of modifying the existing program, we recently decided to write a new molecular graphics program from scratch, using an object oriented programming language (C++) and modern software development tools. We have been able to make a good progress so far and plan to continue the development. (e) In cooperation with the Molecular Biology Section, we designed mutants of several different antibody Fv molecules for use as the cancer cell-targeting moiety of immunotoxins. These have lower toxic side effect, better yield, and/or more stability. (f) We devised databases and tools to classify and sort NCBI's Unigene clusters according to organ or cell-type specificity. This procedure now supersedes our own EST clustering procedure and saves a huge amount of computation time for us. Several promising new prostate-specific genes have been found using new lists generated by this procedure. (f) Our own EST clustering procedure has been automated and applied to the Unigene clusters to sub-cluster them. We found many alternatively spliced examples using this procedure, some of which turned out to be organ specific.

Agency
National Institute of Health (NIH)
Institute
Division of Basic Sciences - NCI (NCI)
Type
Intramural Research (Z01)
Project #
1Z01BC008759-09
Application #
6433126
Study Section
(LMB)
Project Start
Project End
Budget Start
Budget End
Support Year
9
Fiscal Year
2000
Total Cost
Indirect Cost
Name
Basic Sciences
Department
Type
DUNS #
City
State
Country
United States
Zip Code
Das, Sudipto; Hahn, Yoonsoo; Walker, Dawn A et al. (2008) Topology of NGEP, a prostate-specific cell:cell junction protein widely expressed in many cancers of different grade level. Cancer Res 68:6306-12
Hahn, Yoonsoo; Jeong, Sangkyun; Lee, Byungkook (2007) Inactivation of MOXD2 and S100A15A by exon deletion during human evolution. Mol Biol Evol 24:2203-12
Das, Sudipto; Hahn, Yoonsoo; Nagata, Satoshi et al. (2007) NGEP, a prostate-specific plasma membrane protein that promotes the association of LNCaP cells. Cancer Res 67:1594-601
Grigoryev, Dmitry N; Ma, Shwu-Fan; Shimoda, Larissa A et al. (2007) Exon-based mapping of microarray probes: recovering differential gene expression signal in underpowered hypoxia experiment. Mol Cell Probes 21:134-9
Hahn, Yoonsoo; Lee, Byungkook (2006) Human-specific nonsense mutations identified by genome sequence comparisons. Hum Genet 119:169-78
Sam, Vichetra; Tai, Chin-Hsien; Garnier, Jean et al. (2006) ROC and confusion analysis of structure comparison methods identify the main causes of divergence from manual protein classification. BMC Bioinformatics 7:206
Hahn, Yoonsoo; Bera, Tapan K; Pastan, Ira H et al. (2006) Duplication and extensive remodeling shaped POTE family genes encoding proteins containing ankyrin repeat and coiled coil domains. Gene 366:238-45
Egland, Kristi A; Liu, Xiu Fen; Squires, Stephen et al. (2006) High expression of a cytokeratin-associated protein in many cancers. Proc Natl Acad Sci U S A 103:5929-34
Bera, Tapan K; Saint Fleur, Ashley; Lee, Yoomi et al. (2006) POTE paralogs are induced and differentially expressed in many cancers. Cancer Res 66:52-6
Hahn, Yoonsoo; Lee, Byungkook (2005) Identification of nine human-specific frameshift mutations by comparative analysis of the human and the chimpanzee genome sequences. Bioinformatics 21 Suppl 1:i186-94

Showing the most recent 10 out of 29 publications