Studies on sequences:POTE is one of the genes we found some time ago through our gene discovery project. The human genome contains 13 closely similar paralogs of this gene dispersed in 8 different chromosomes. Different paralogs are expressed in only a few normal tissues (prostate, ovary, testis and embryonic stem cells) but in numerous cancer cells and tissues. The POTE gene family is primate-specific, but we do not know the function of these genes. We found the following about these genes this year:(1) We identified what is likely to be the single ancestral gene, which has an ortholog in the mouse. Our experimental colleagues in Pastan's laboratory are conducting functional studies of this gene in mouse.(2) Some POTE gene paralogs acquired an actin transposon, which inserted in-frame in an exon of the parental POTE gene. Our experimental colleagues showed that this POTE-actin chimeric gene produces the ecpected fused protein product.In an unrelated study, we compared the genomes of human, chimpanzee and other species to find human-specific frameshift or nonsense mutations. We could confidently identify 18 such genes. For example, one of the genes that contain human-specific non-sense mutation is NPPA, which codes for a precursor protein for the peptide ANP, which plays a central role in the regulation of blood pressure. The non-sense mutation deletes the last two arginine residues from the peptide. Human population is polymorphic with respect to the non-sense mutation. It has been reported that the monkey form allele is significantly associated with the increased risk of stroke recurrence in humans.Studies on protein structure:In collaboration with Dr. Peter Munson's group at the Center for Information Technology (CIT) of NIH and with Drs. Jean Garnier and Jean-Francois Gibrat of the Institut National de la Recherche Agronomique, Jouy-en-Josas, France, we studied the world of protein structures through the well-known, manually procured protein structure classification database SCOP. We found that protein structure space is continuous and that it is often artificial to draw a line in this space to classify the protein structures in mutually exclusive groups. The manually procured SCOP database is based on recognition of particular structural elements, called the 'core', that is common for a group of proteins. However, we found that different groups of protein structures may share different core elements and that one core can transform to another core through a set of successive structures that contain some features of both cores. We are currently studyng automatic, as opposed to manual, clustering procedures.

Agency
National Institute of Health (NIH)
Institute
Division of Basic Sciences - NCI (NCI)
Type
Intramural Research (Z01)
Project #
1Z01BC008759-15
Application #
7338122
Study Section
(LMB)
Project Start
Project End
Budget Start
Budget End
Support Year
15
Fiscal Year
2006
Total Cost
Indirect Cost
Name
Basic Sciences
Department
Type
DUNS #
City
State
Country
United States
Zip Code
Das, Sudipto; Hahn, Yoonsoo; Walker, Dawn A et al. (2008) Topology of NGEP, a prostate-specific cell:cell junction protein widely expressed in many cancers of different grade level. Cancer Res 68:6306-12
Hahn, Yoonsoo; Jeong, Sangkyun; Lee, Byungkook (2007) Inactivation of MOXD2 and S100A15A by exon deletion during human evolution. Mol Biol Evol 24:2203-12
Das, Sudipto; Hahn, Yoonsoo; Nagata, Satoshi et al. (2007) NGEP, a prostate-specific plasma membrane protein that promotes the association of LNCaP cells. Cancer Res 67:1594-601
Grigoryev, Dmitry N; Ma, Shwu-Fan; Shimoda, Larissa A et al. (2007) Exon-based mapping of microarray probes: recovering differential gene expression signal in underpowered hypoxia experiment. Mol Cell Probes 21:134-9
Bera, Tapan K; Saint Fleur, Ashley; Lee, Yoomi et al. (2006) POTE paralogs are induced and differentially expressed in many cancers. Cancer Res 66:52-6
Hahn, Yoonsoo; Lee, Byungkook (2006) Human-specific nonsense mutations identified by genome sequence comparisons. Hum Genet 119:169-78
Sam, Vichetra; Tai, Chin-Hsien; Garnier, Jean et al. (2006) ROC and confusion analysis of structure comparison methods identify the main causes of divergence from manual protein classification. BMC Bioinformatics 7:206
Hahn, Yoonsoo; Bera, Tapan K; Pastan, Ira H et al. (2006) Duplication and extensive remodeling shaped POTE family genes encoding proteins containing ankyrin repeat and coiled coil domains. Gene 366:238-45
Egland, Kristi A; Liu, Xiu Fen; Squires, Stephen et al. (2006) High expression of a cytokeratin-associated protein in many cancers. Proc Natl Acad Sci U S A 103:5929-34
Hahn, Yoonsoo; Lee, Byungkook (2005) Identification of nine human-specific frameshift mutations by comparative analysis of the human and the chimpanzee genome sequences. Bioinformatics 21 Suppl 1:i186-94

Showing the most recent 10 out of 29 publications