Biochemical networks are meshes of homologous and non-homologous proteins. The """"""""small world"""""""" topology - often scale free and in which a small number of hub nodes display extraordinarily high connectivity - is detected in the network models generated from omics results. Genomic basis of the scale-free topology - how to deduce this topology from genomic sequences - remains an open question. This proposal initiates an attempt to find a footing for this topology in genomic sequences. The focus is functional diversification of paralogous proteins and the formation of parallel pathways in the networks, in which the intrinsically disordered protein (IDP) segments is hypothesized to play preeminent roles.
Our specific aims are as follows. 1: Quantifying parallel pathways in biochemical networks. Our preliminary studies suggest paralogous proteins diverge in their functional specificity to form parallel pathways. The proteome sequences would be clustered into families and each protein assigned to a numerical family ID. Biochemical network models would then be annotated with this numerical format. Subsequently, parallel pathways can be visualized and quantified by analysis combinatorial patterns of these numerical IDs. 2: Roles of disordered regions in the topology of biochemical networks. It is hypothesized that IDPs are crucial for functional diversification of paralogous proteins. This hypothesis will be tested by a combination of genome wide IDP analysis, comparative genomic analysis as well as experimental verification. 3: Scale-free distribution and multi-cellularity. The exponent constant in power-law distribution varies across species. This constant would be determined for specific tissue/cell types in order to explain this variation from single cell species to multi-cellular species. The roles of disordered regions in functional diversification of paralogous proteins in multi-cellular species would also be investigated.

Public Health Relevance

Biochemical networks are meshes of homologous and non-homologous proteins. The small world topology - often scale free and in which a small number of hub nodes display extraordinarily high connectivity - is detected in the network models generated from omics experimental results. This proposal attempts to find the footing of this topology in genomic sequences - how to deduce this topology from genomic sequence analysis. The focus is functional diversification of paralogous proteins and the formation of parallel pathways in the networks, in which the intrinsically disordered protein (IDP) segments is hypothesized to play preeminent roles.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Research Project (R01)
Project #
5R01LM010212-04
Application #
8333434
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Ye, Jane
Project Start
2010-09-30
Project End
2014-09-29
Budget Start
2012-09-30
Budget End
2014-09-29
Support Year
4
Fiscal Year
2012
Total Cost
$276,594
Indirect Cost
$91,581
Name
University of Texas Health Science Center San Antonio
Department
Type
Schools of Medicine
DUNS #
800772162
City
San Antonio
State
TX
Country
United States
Zip Code
78229
Jiang, Wen; Guo, Zhanyong; Lages, Nuno et al. (2018) A Multi-Parameter Analysis of Cellular Coordination of Major Transcriptome Regulation Mechanisms. Sci Rep 8:5742
Zhang, Fangyuan; Wang, Degeng (2017) The Pattern of microRNA Binding Site Distribution. Genes (Basel) 8:
Guo, Zhanyong; Jiang, Wen; Lages, Nuno et al. (2014) Relationship between gene duplicability and diversifiability in the topology of biochemical networks. BMC Genomics 15:577
Pan, Haihui; Qin, Kunhua; Guo, Zhanyong et al. (2014) Negative elongation factor controls energy homeostasis in cardiomyocytes. Cell Rep 7:79-85
Padawer, Timothy; Leighty, Ralph E; Wang, Degeng (2012) Duplicate gene enrichment and expression pattern diversification in multicellularity. Nucleic Acids Res 40:7597-605