Biochemical networks are meshes of homologous and non-homologous proteins. The """"""""small world"""""""" topology - often scale free and in which a small number of hub nodes display extraordinarily high connectivity - is detected in the network models generated from omics results. Genomic basis of the scale-free topology - how to deduce this topology from genomic sequences - remains an open question. This proposal initiates an attempt to find a footing for this topology in genomic sequences. The focus is functional diversification of paralogous proteins and the formation of parallel pathways in the networks, in which the intrinsically disordered protein (IDP) segments is hypothesized to play preeminent roles.
Our specific aims are as follows. 1: Quantifying parallel pathways in biochemical networks. Our preliminary studies suggest paralogous proteins diverge in their functional specificity to form parallel pathways. The proteome sequences would be clustered into families and each protein assigned to a numerical family ID. Biochemical network models would then be annotated with this numerical format. Subsequently, parallel pathways can be visualized and quantified by analysis combinatorial patterns of these numerical IDs. 2: Roles of disordered regions in the topology of biochemical networks. It is hypothesized that IDPs are crucial for functional diversification of paralogous proteins. This hypothesis will be tested by a combination of genome wide IDP analysis, comparative genomic analysis as well as experimental verification. 3: Scale-free distribution and multi-cellularity. The exponent constant in power-law distribution varies across species. This constant would be determined for specific tissue/cell types in order to explain this variation from single cell species to multi-cellular species. The roles of disordered regions in functional diversification of paralogous proteins in multi-cellular species would also be investigated.

Public Health Relevance

Biochemical networks are meshes of homologous and non-homologous proteins. The small world topology - often scale free and in which a small number of hub nodes display extraordinarily high connectivity - is detected in the network models generated from omics experimental results. This proposal attempts to find the footing of this topology in genomic sequences - how to deduce this topology from genomic sequence analysis. The focus is functional diversification of paralogous proteins and the formation of parallel pathways in the networks, in which the intrinsically disordered protein (IDP) segments is hypothesized to play preeminent roles.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Research Project (R01)
Project #
1R01LM010212-01A1
Application #
7986507
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Ye, Jane
Project Start
2010-09-30
Project End
2013-09-29
Budget Start
2010-09-30
Budget End
2011-09-29
Support Year
1
Fiscal Year
2010
Total Cost
$282,250
Indirect Cost
Name
University of South Florida
Department
Microbiology/Immun/Virology
Type
Schools of Arts and Sciences
DUNS #
069687242
City
Tampa
State
FL
Country
United States
Zip Code
33612
Jiang, Wen; Guo, Zhanyong; Lages, Nuno et al. (2018) A Multi-Parameter Analysis of Cellular Coordination of Major Transcriptome Regulation Mechanisms. Sci Rep 8:5742
Zhang, Fangyuan; Wang, Degeng (2017) The Pattern of microRNA Binding Site Distribution. Genes (Basel) 8:
Guo, Zhanyong; Jiang, Wen; Lages, Nuno et al. (2014) Relationship between gene duplicability and diversifiability in the topology of biochemical networks. BMC Genomics 15:577
Pan, Haihui; Qin, Kunhua; Guo, Zhanyong et al. (2014) Negative elongation factor controls energy homeostasis in cardiomyocytes. Cell Rep 7:79-85
Padawer, Timothy; Leighty, Ralph E; Wang, Degeng (2012) Duplicate gene enrichment and expression pattern diversification in multicellularity. Nucleic Acids Res 40:7597-605