Our goal is to understand the process and mechanisms of the evolution of duplicate genes in eukaryotic genomes. Several eukaryotic genomes have been completed and several more are soon to be completed. The huge amounts of genomic sequence data are very useful for studying various aspects of the evolution of duplicate genes. This is the first time that evolution of duplicate genes can be studied at the genomic level. To seize this unprecedented opportunity, we propose to pursue the following studies: 1. To develop methods for detecting duplicate genes in a genome and for classifying proteins into families. 2. To develop methods for detecting block duplications in a genome. A computer software package for the methods in (1) and (2) will be developed. 3. To assess the extents of gene duplication in various eukaryotic genomes. For each complete genome we ask: How many duplicate genes (gene families) exist in the genome? How often block duplication occurs? Then in comparison among genomes, we shall ask, """"""""which genes have not been duplicated in any of the genomes studied, which duplicate genes are common to diverse genomes and which are restricted to only certain evolutionary lineages? And why are they so distributed?"""""""" 4. To study the rate of gene duplication in a genome. 5. To study the mechanisms of evolution of duplicate genes or how duplicate genes evolve following the duplication event. We shall examine various models for the retention and evolution of duplicate genes.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM066104-04
Application #
6929295
Study Section
Genetics Study Section (GEN)
Program Officer
Eckstrand, Irene A
Project Start
2002-08-01
Project End
2006-07-31
Budget Start
2005-08-01
Budget End
2006-07-31
Support Year
4
Fiscal Year
2005
Total Cost
$174,203
Indirect Cost
Name
University of Chicago
Department
Biology
Type
Schools of Medicine
DUNS #
005421136
City
Chicago
State
IL
Country
United States
Zip Code
60637
Oakley, Todd H; Gu, Zhenglong; Abouheif, Ehab et al. (2005) Comparative methods for the analysis of gene-expression evolution: an example using yeast functional genomic data. Mol Biol Evol 22:40-50
Wang, Daryi; Hsieh, Mufen; Li, Wen-Hsiung (2005) A general tendency for conservation of protein length across eukaryotic kingdoms. Mol Biol Evol 22:142-7
Zhang, Liqing; Li, Wen-Hsiung (2004) Mammalian housekeeping genes evolve more slowly than tissue-specific genes. Mol Biol Evol 21:236-9
Shih, Arthur Chun-Chieh; Li, Wen-Hsiung (2003) GS-Aligner: a novel tool for aligning genomic sequences using bit-level operations. Mol Biol Evol 20:1299-309
Qin, Hong; Lu, Henry H S; Wu, Wei B et al. (2003) Evolution of the yeast protein interaction network. Proc Natl Acad Sci U S A 100:12820-4