Revealing mechanism-based associations among effects of disparate biological perturbations by integrative analysis of diverse signatures requires statistical methods for identifying commonalities in high dimensional readouts of cellular states. It also requires statistical methods and computational algorithms for inferring and comparing condition dependent regulatory network models of perturbation effects. Generation and interpretation of transcriptional signature of diverse perturbations in diverse cell types is the focus one of the currently funded Large Scale Production of Perturbagen-induced Cellular Signatures (LINCS) centers (U54HG006093). The other currently funded center (U54HG006097) will assess signatures of signaling molecule activities and compound biochemical activity. Jointly, these diverse readouts of cellular states can be used to effectively elicit condition dependent regulatory network (CORN) models goveming the cellular response to perturbations. We propose to develop, test, validate benchmark and implement a general statistical framework for assessing concordance in different types of perturbation signatures. Our preliminary results demonstrate stunning improvements in statistical power over currently available methods in identifying concordant transcriptional signatures. They also demonstrate the utility and strong improvements in statistical power for identifying common biological pathways by integrated analysis of signatures of multiple perturbations and multiple types of cellular readouts as opposed to using isolated analyses of different signatures. Based on these results we also propose integrative statistical methods for mechanistically explaining perturbation signatures in the context of known pathways and for constructing de-novo mechanistic network models. Finally, we propose a strategy for using newly developed methods and LINCS perturbation signatures as a novel resource for interpreting disease-related genomics data. All new methods and algorithms will be deployed within existing on- and off-line computational platforms integrating data, computational tools and the functional knowledge base.
We propose the development of statistical methods and computational tools for inferring mechanistic network models by integrative analysis of diverse perturbation signatures. These methods and infrastructure will open important new avenues for interpreting results from disease-related genomics experiments by comparing them to perturbation signatures and meta-signatures. The resulting infrastructure will remove methodological and infrastructural barriers for meaningful re-use of LINCS perturbation signatures and related network models, enabling scientists throughout the world to use as resource to gain insight into the genomic conditions underlying human disease.
Govindarajah, Vinothini; Leung, Yuet-Kin; Ying, Jun et al. (2016) In utero exposure of rats to high-fat diets perturbs gene expression profiles and cancer susceptibility of prepubertal mammary glands. J Nutr Biochem 29:73-82 |
Chen, Jing; Hu, Zhen; Phatak, Mukta et al. (2013) Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules. PLoS Comput Biol 9:e1003198 |