Predicting and analyzing protein interaction networks

Singh, Mona

Abstract

Molecular interactions are the underlying basis of all processes that are executed in an organism, and their complete mapping would be a great aid in understanding and interpreting both normal and disease functioning. Transcriptional regulatory interactions are of particular interest as they are critical in the proper spatial and temporal regulation of genes. This proposal aims to develop several novel and complementary computational methods for predicting transcription factor interactions and specificities, and for uncovering their conservation and variation across organisms. Taken together, these methods will vastly expand our knowledge of eukaryotic regulatory networks and their underlying principles. We will devise a combined constrained optimization and statistical approach to predict the DNA-binding specificities of multidomain C2H2 zinc finger proteins;these proteins comprise the largest class of transcription factors in eukaryotic genomes. We will also establish a novel comparative sequence framework for determining binding specificity variation amongst homologous transcription factors, as network divergence underlies much of the observed phenotypic and functional diversity between and within organisms;this framework will be applied to explore the extent to which changes in transcription factors can affect regulatory network variation across organisms. Finally, we will develop a cross-genomic framework for predicting genomic binding sites for transcription factors with known specificities, along with analysis techniques for inferring interactions amongst these transcription factors across organisms. The DNA-binding specificities for an increasing number of transcription factors are being determined, and this large-scale data presents new opportunities to map transcription factor binding sites and to uncover transcription factor- transcription factor interactions across organisms;these interactions are an important component of regulatory networks and their variation plays a key role in network divergence. Successful completion of these aims will result in computational methods that will significantly increase the rate with which transcriptional networks are characterized and will reveal fundamental aspects of their functioning and evolution. All developed software will be made publicly available.

Public Health Relevance

Cellular networks underlie all processes that are executed in an organism, and their complete mapping would aid in understanding both normal and disease functioning. The proposed research will yield software for uncovering and characterizing protein interactions and specificities. These computational tools will help to place proteins, including those important for disease, within the broader context of their cellular pathways, thereby expanding our understanding of diseases and providing an important avenue for uncovering putative drug targets.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Research Project (R01)
Project #: 5R01GM076275-08
Application #: 8634108
Study Section: Biodata Management and Analysis Study Section (BDMA)
Program Officer: Wu, Mary Ann

Project Start: 2006-02-18
Project End: 2016-03-31
Budget Start: 2014-04-01
Budget End: 2015-03-31
Support Year: 8
Fiscal Year: 2014
Total Cost: $307,890
Indirect Cost: $115,390

Institution

Name: Princeton University
Department
Type: Organized Research Units
DUNS #: 002484665

City: Princeton
State: NJ
Country: United States
Zip Code: 08544

Related projects

Publications

Pritykin, Yuri; Brito, Tarcisio; Schupbach, Trudi et al. (2017) Integrative analysis unveils new functions for the Drosophila Cutoff protein in noncoding RNA biogenesis and gene regulation. RNA 23:1097-1109

Ochoa, Alejandro; Singh, Mona (2017) Domain prediction with probabilistic directional context. Bioinformatics 33:2471-2478

Ochoa, Alejandro; Storey, John D; Llinás, Manuel et al. (2015) Beyond the E-Value: Stratified Statistics for Protein Domain Prediction. PLoS Comput Biol 11:e1004509

Persikov, Anton V; Wetzel, Joshua L; Rowland, Elizabeth F et al. (2015) A systematic survey of the Cys2His2 zinc finger DNA-binding landscape. Nucleic Acids Res 43:1965-84

Pritykin, Yuri; Ghersi, Dario; Singh, Mona (2015) Genome-Wide Detection and Analysis of Multifunctional Genes. PLoS Comput Biol 11:e1004467

Nadimpalli, Shilpa; Persikov, Anton V; Singh, Mona (2015) Pervasive variation of transcription factor orthologs contributes to regulatory network evolution. PLoS Genet 11:e1005011

Persikov, Anton V; Rowland, Elizabeth F; Oakes, Benjamin L et al. (2014) Deep sequencing of large library selections allows computational discovery of diverse sets of zinc fingers that bind common targets. Nucleic Acids Res 42:1497-508

Persikov, Anton V; Singh, Mona (2014) De novo prediction of DNA-binding specificities for Cys2His2 zinc finger proteins. Nucleic Acids Res 42:97-108

Ghersi, Dario; Singh, Mona (2014) Interaction-based discovery of functionally important genes in cancers. Nucleic Acids Res 42:e18

Jiang, Peng; Singh, Mona (2014) CCAT: Combinatorial Code Analysis Tool for transcriptional regulation. Nucleic Acids Res 42:2833-47

Showing the most recent 10 out of 32 publications

Comments

Be the first to comment on this grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: