Large scale duplication, from chromosomal fragments to the entire genome, followed by mutation is considered a major force driving functional diversity in vertebrates. Isolated examples of duplicated regions support this theory, but a genome-wide study of mammalian gene duplication and subsequent functional diversification has not been attempted. A better understanding of the functional similarities between duplicated genes would greatly enhance the power of paralogous relationships in predicting gene function and understanding genetic disease. The candidate's long-term career goal is to establish an independent, research program in academia, using computational genomics to study the role of gene duplication in the evolution, structure and function of mammalian genomes. The specific goals of the current proposal constitute the first step in this program and will allow the candidate to demonstrate the feasibility of her interdisciplinary approach. They are (1) to construct a spatially ordered set of all discernible duplicated genes in the mouse and human genomes and to estimate the time of duplication for each; (2) to develop algorithms to identify the number of large scale duplications that took place and determine the sequence of rearrangements that subsequently fragmented them; (3) to determine, using probabilistic models of rearrangements, to what extent spatial organization of duplicated regions is preserved; and (4) to annotate the duplication data with functional data in preparation for studying the processes of functional differentiation following duplication.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Career Transition Award (K22)
Project #
5K22HG002451-04
Application #
6767771
Study Section
Ethical, Legal, Social Implications Review Committee (GNOM)
Program Officer
Graham, Bettie
Project Start
2001-07-01
Project End
2006-06-30
Budget Start
2004-07-01
Budget End
2006-06-30
Support Year
4
Fiscal Year
2004
Total Cost
$265,790
Indirect Cost
Name
Carnegie-Mellon University
Department
Biology
Type
Schools of Arts and Sciences
DUNS #
052184116
City
Pittsburgh
State
PA
Country
United States
Zip Code
15213
Song, Nan; Joseph, Jacob M; Davis, George B et al. (2008) Sequence similarity network reveals common ancestry of multidomain proteins. PLoS Comput Biol 4:e1000063
Vernot, Benjamin; Stolzer, Maureen; Goldman, Aiton et al. (2008) Reconciliation with non-binary species trees. J Comput Biol 15:981-1006
Raghupathy, Narayanan; Hoberman, Rose; Durand, Dannie (2008) Two plus two does not equal three: statistical tests for multiple genome comparison. J Bioinform Comput Biol 6:1-22
Vernot, B; Stolzer, M; Goldman, A et al. (2007) Reconciliation with non-binary species trees. Comput Syst Bioinformatics Conf 6:441-52
Durand, Dannie; Hoberman, Rose (2006) Diagnosing duplications--can it be done? Trends Genet 22:156-64
Przytycka, Teresa; Davis, George; Song, Nan et al. (2006) Graph theoretical insights into evolution of multidomain proteins. J Comput Biol 13:351-63
Durand, Dannie; Halldorsson, Bjarni V; Vernot, Benjamin (2006) A hybrid micro-macroevolutionary approach to gene tree reconstruction. J Comput Biol 13:320-35
Hoberman, Rose; Sankoff, David; Durand, Dannie (2005) The statistical analysis of spatially clustered genes under the maximum gap criterion. J Comput Biol 12:1083-102
Durand, Dannie; Sankoff, David (2003) Tests for gene clustering. J Comput Biol 10:453-82
Durand, Dannie (2003) Vertebrate evolution: doubling and shuffling with a full deck. Trends Genet 19:2-5