Deciphering the transcriptional regulatory network (TRN) governing a biological process in mammalian systems is essential to our understanding of basic mechanisms underlying normal physiology as well as disease etiology. It is a daunting task because too many links in the TRN are unknown. The emergence of ChIP-chip and ChIP-seq technologies has enabled the mapping of the genome-wide binding sites of many transcription factors (TFs) known to be key regulators in a biological process. However, these two technologies are limited to the known regulators with ChIP-quality antibodies. We found that TF binding is often associated with a dynamic histone mark signature and can be computationally predicted from the genome-wide histone mark dynamics. Therefore, we hypothesize that with time-course nucleosome-resolution ChIP-seq of a few informative histone marks and RNA-seq data of gene expression, and effective computational modeling, we could infer the TRNs in mammalian biological processes. Specifically, we propose to develop effective computational algorithms to achieve Aim1: first, predict TF binding from nucleosome-resolution histone mark dynamics;second, identify target genes from TF binding, histone marks and gene expression profiles;and third, infers the TRN over a time course. We also propose to apply the above algorithms in two biological systems in Aim 2. One is the mouse myoblast cell line C2C12 differentiation into bone, fat, or muscle, and the other is the human apocrine breast cancer cell line MDA-MB-453 reversible reprogramming to epithelial cells with vitamin D treatment. Through time-course nucleosome-resolution histone mark ChIP-seq and RNA-seq profiling, we will computationally infer and experimentally validate the TRNs in these two systems.

Public Health Relevance

We propose a systematic and unbiased approach to infer the transcriptional regulatory networks (TRNs) governing two biological processes in mammalian systems, which can be cost- effectively extended to other processes. In particular, we expect our approaches to benefit the better understanding of stem cell fate control and cell identity reprogramming, and facilitate treatments for many devastating diseases and injuries. The TRNs obtained from the two biological systems in Aim 2 will unravel new molecular mechanisms of transcriptional and epigenetic regulation. They may help identify new targets for therapeutic intervention for obesity and cancer. The resulting computational algorithms in Aim 1 and time-course high throughput data in Aim 2 will be made publicly available as valuable resources for the biomedical research community.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Project #
Application #
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Hagan, Ann A
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Dana-Farber Cancer Institute
United States
Zip Code
Mei, Shenglin; Meyer, Clifford A; Zheng, Rongbin et al. (2017) Cistrome Cancer: A Web Resource for Integrative Gene Regulation Modeling in Cancer. Cancer Res 77:e19-e22
Liu, Yin; Chen, Sujun; Wang, Su et al. (2017) Transcriptional landscape of the human cell cycle. Proc Natl Acad Sci U S A 114:3473-3478
Gao, Wenhua; Li, Wei; Xiao, Tengfei et al. (2017) Inactivation of the PBRM1 tumor suppressor gene amplifies the HIF-response in VHL-/- clear cell renal carcinoma. Proc Natl Acad Sci U S A 114:1027-1032
Wang, Su; Zang, Chongzhi; Xiao, Tengfei et al. (2016) Modeling cis-regulation with a compendium of genome-wide histone H3K27ac profiles. Genome Res 26:1417-1429
Zang, Chongzhi; Luyten, Annouck; Chen, Justina et al. (2016) NF-E2, FLI1 and RUNX1 collaborate at areas of dynamic chromatin to activate transcription in mature mouse megakaryocytes. Sci Rep 6:30255
Zang, Chongzhi; Wang, Tao; Deng, Ke et al. (2016) High-dimensional genomic data bias correction and data integration using MANCIE. Nat Commun 7:11305
Zhang, Naiqian; Wang, Haiyun; Fang, Yun et al. (2015) Predicting Anticancer Drug Responses Using a Dual-Layer Integrated Cell Line-Drug Network Model. PLoS Comput Biol 11:e1004498
Yamamoto, Shoji; Wu, Zhenhua; Russnes, Hege G et al. (2014) JARID1B is a luminal lineage-driving oncogene in breast cancer. Cancer Cell 25:762-77
He, Housheng Hansen; Meyer, Clifford A; Hu, Sheng'en Shawn et al. (2014) Refined DNase-seq protocol and data analysis reveals intrinsic bias in transcription factor footprint identification. Nat Methods 11:73-78
Luyten, Annouck; Zang, Chongzhi; Liu, X Shirley et al. (2014) Active enhancers are delineated de novo during hematopoiesis, with limited lineage fidelity among specified primary blood cells. Genes Dev 28:1827-39

Showing the most recent 10 out of 21 publications