The human genome contains several hundred clusters of related genes, in which duplication operations create new gene copies that serve as the raw ingredients for rapid change as humans evolve to adapt to their environment. For instance, many immunity-related genes lie in such clusters. Thus, understanding all of the genetic differences between humans and other primates, or among individual humans, will require a comparative analysis of gene clusters. One benefit will be a better idea of which primate species make the best biomedical models for the progression of infectious diseases like AIDS or influenza in humans. However, the presence of highly similar segments makes it difficult to determine complete and accurate sequence data for these clusters, which has inhibited development of computational methods for their analysis. We propose to collaborate in the generation of accurate primate sequence data for carefully chosen gene clusters, to develop new computational tools for comparative analysis of such data, and to deliver the results to the biomedical community.

Public Health Relevance

Changes within clusters of closely related genes explain many of the differences among humans and primates, as well as among individual humans. We will produce data and computational methods that will greatly increase our understanding of these differences, which may assist with selection of biomedical models for progression of infectious diseases.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG002238-21
Application #
7882269
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Good, Peter J
Project Start
2000-08-15
Project End
2013-06-30
Budget Start
2010-07-01
Budget End
2013-06-30
Support Year
21
Fiscal Year
2010
Total Cost
$708,479
Indirect Cost
Name
Pennsylvania State University
Department
Biology
Type
Schools of Arts and Sciences
DUNS #
003403953
City
University Park
State
PA
Country
United States
Zip Code
16802
Giardine, Belinda; Borg, Joseph; Viennas, Emmanouil et al. (2014) Updates of the HbVar database of human hemoglobin variants and thalassemia mutations. Nucleic Acids Res 42:D1063-9
Song, Giltae; Riemer, Cathy; Dickins, Benjamin et al. (2012) Revealing mammalian evolutionary relationships by comparative analysis of gene clusters. Genome Biol Evol 4:586-601
Song, Giltae; Hsu, Chih-Hao; Riemer, Cathy et al. (2011) Conversion events in gene clusters. BMC Evol Biol 11:226
Song, Giltae; Hsu, Chih-Hao; Riemer, Cathy et al. (2011) Evaluation of methods for detecting conversion events in gene clusters. BMC Bioinformatics 12 Suppl 1:S45
Wu, Weisheng; Cheng, Yong; Keller, Cheryl A et al. (2011) Dynamics of the epigenetic landscape during erythroid differentiation after GATA1 restoration. Genome Res 21:1659-71
Locke, Devin P; Hillier, LaDeana W; Warren, Wesley C et al. (2011) Comparative and demographic analysis of orang-utan genomes. Nature 469:529-33
Miller, Webb; Wright, Stephen J; Zhang, Yu et al. (2010) Optimization methods for selecting founder individuals for captive breeding or reintroduction of endangered species. Pac Symp Biocomput :43-53
Hsu, Chih-Hao; Zhang, Yu; Hardison, Ross C et al. (2010) An effective method for detecting gene conversion events in whole genomes. J Comput Biol 17:1281-97
Chen, Kuan-Bei; Zhang, Yu (2010) A varying threshold method for ChIP peak-calling using multiple sources of information. Bioinformatics 26:i504-10
Ratan, Aakrosh; Zhang, Yu; Hayes, Vanessa M et al. (2010) Calling SNPs without a reference sequence. BMC Bioinformatics 11:130

Showing the most recent 10 out of 79 publications