FOX family of transcription factor genes have expanded to over 40 members in mammals, and are involved in a variety of key developmental processes and human diseases. However, several key question concerning their evolutionary expansion, functional elaboration, transcriptional regulation, and selection are not completely understood. Gene duplication is a major driver of evolutionary innovation, as it allows an organism to elaborate existing biological function with specialization or diversification, while at the same time potentially avoiding negative fitness effects. Elaboration of new or specialized gene functions can involve at least two distinct pathways: (1) alteration of the spatial and temporal expression pattern of the gene, (2) alteration of the interaction partners, both DNA and protein. Alteration of expression patterns is likely to involve evolutionary divergence of the upstream cis regulatory elements while the alteration of interaction partners may involve sequence changes in functional domains of the protein. Previous comparative and evolutionary studies have focused on conserved aspects of gene sequence, structure and networks. Part of the challenge in studying divergence is that it is difficult to distinguish functionally relevant divergence from neutral drift. We propose to develop computational algorithms and statistical analysis methods that key in on correlated patterns of evolutionary divergence to identify novel regulatory elements and help elucidate the evolution of gene family members and their relation to disease phenotypes.
In specific aim 1 we will develop a novel method - Intra-genomic Phylogenetic Footprinting - to identify cis elements by exploiting correlated patterns of divergence in the regulatory-sequence and the expression of paralogs.
In specific aim 2, we will investigate additional pathways by which gene paralogs diversify in function and characterize the relationships between these pathways.
In specific aim 3, we will develop methods to quantify selective pressure and epistatic interactions among human cis regulatory polymorphisms. The methods will be comprehensively applied to gene families in multiple species. Application to specific gene families will provide insights into their functional expansion. Selected predictions will be experimentally validated. Besides providing a broader evolutionary understanding of gene family expansion, transcriptional regulation, and intraspecific variability in human populations, specific applications of the proposed research will be of direct biomedical relevance.

Public Health Relevance

FOX family of transcription factor genes have expanded to over 40 members in mammals, and are involved in a variety of key developmental processes and human diseases. However, several key question concerning their evolutionary expansion, functional elaboration, transcriptional regulation, and selection are not completely understood. We proposed to develop computational algorithms and statistical analysis methods to address some of these questions concerning the evolution of gene families and transcriptional regulation.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM085226-02
Application #
7637397
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Eckstrand, Irene A
Project Start
2008-06-20
Project End
2012-05-31
Budget Start
2009-06-01
Budget End
2010-05-31
Support Year
2
Fiscal Year
2009
Total Cost
$304,334
Indirect Cost
Name
University of Pennsylvania
Department
Genetics
Type
Schools of Medicine
DUNS #
042250712
City
Philadelphia
State
PA
Country
United States
Zip Code
19104
Plasschaert, Robert N; Vigneau, Sébastien; Tempera, Italo et al. (2014) CTCF binding site sequence differences are associated with unique regulatory and functional trends during embryonic stem cell differentiation. Nucleic Acids Res 42:774-89
Darnell, Diana K; Zhang, Li S; Hannenhalli, Sridhar et al. (2014) Developmental expression of chicken FOXN1 and putative target genes during feather development. Int J Dev Biol 58:57-64
Sahu, Avinash Das; Aniba, Radhouane; Chang, Yen-Pei Christy et al. (2013) Epigenomic model of cardiac enhancers with application to genome wide association studies. Pac Symp Biocomput :92-102
Choi, Sun Shim; Hannenhalli, Sridhar (2013) Three independent determinants of protein evolutionary rate. J Mol Evol 76:98-111
Zeng, Jia; Hannenhalli, Sridhar (2013) Inferring evolution of gene duplicates using probabilistic models and nonparametric belief propagation. BMC Genomics 14 Suppl 1:S15
Mukherjee, Rithun; Evans, Perry; Singh, Larry N et al. (2013) Correlated evolution of positions within mammalian cis elements. PLoS One 8:e55521
Carl Jr, Joseph W; Trgovcich, Joanne; Hannenhalli, Sridhar (2013) Widespread evidence of viral miRNAs targeting host pathways. BMC Bioinformatics 14 Suppl 2:S3
Vecsey, Christopher G; Peixoto, Lucia; Choi, Jennifer H K et al. (2012) Genomic analysis of sleep deprivation reveals translational regulation in the hippocampus. Physiol Genomics 44:981-91
Khaladkar, Mugdha; Hannenhalli, Sridhar (2012) Functional divergence of gene duplicates - a domain-centric view. BMC Evol Biol 12:126
Khaladkar, Mugdha; Smyda, Mark; Hannenhalli, Sridhar (2011) Epigenomic and RNA structural correlates of polyadenylation. RNA Biol 8:529-37

Showing the most recent 10 out of 26 publications