Deciphering the transcriptional regulatory network (TRN) governing a biological process in mammalian systems is essential to understanding the basic mechanisms underlying normal physiology as well as disease etiology. It is a daunting task because many links in the TRN are unknown. The emergence of ChIP-chip and ChIP-seq technologies has enabled genome-wide mapping of the binding sites of many transcription factors (TFs) known to be key regulators of a biological process. However, these two technologies are limited to known regulators for which ChIP-quality antibodies exist. We found that TF binding is often associated with a dynamic histone mark signature and can be computationally predicted from genome-wide histone mark dynamics. We therefore hypothesize that, with time-course nucleosome-resolution ChIP-seq of a few informative histone marks, RNA-seq data of gene expression, and effective computational modeling, we can infer the TRNs in mammalian biological processes. Specifically, in Aim 1 we propose to develop effective computational algorithms to: first, predict TF binding from nucleosome-resolution histone mark dynamics; second, identify target genes from TF binding, histone marks, and gene expression profiles; and third, infer the TRN over a time course. In Aim 2, we propose to apply these algorithms to two biological systems. One is the differentiation of the mouse myoblast cell line C2C12 into bone, fat, or muscle; the other is the reversible reprogramming of the human apocrine breast cancer cell line MDA-MB-453 to epithelial cells with vitamin D treatment. Through time-course nucleosome-resolution histone mark ChIP-seq and RNA-seq profiling, we will computationally infer and experimentally validate the TRNs in these two systems.

Public Health Relevance

We propose a systematic and unbiased approach to infer the transcriptional regulatory networks (TRNs) governing two biological processes in mammalian systems, which can be cost-effectively extended to other processes. In particular, we expect our approaches to improve the understanding of stem cell fate control and cell identity reprogramming, and to facilitate treatments for many devastating diseases and injuries. The TRNs obtained from the two biological systems in Aim 2 will unravel new molecular mechanisms of transcriptional and epigenetic regulation and may help identify new targets for therapeutic intervention in obesity and cancer. The computational algorithms from Aim 1 and the time-course high-throughput data from Aim 2 will be made publicly available as resources for the biomedical research community.

National Institutes of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Swain, Amy L
Dana-Farber Cancer Institute
United States
Zang, Chongzhi; Luyten, Annouck; Chen, Justina et al. (2016) NF-E2, FLI1 and RUNX1 collaborate at areas of dynamic chromatin to activate transcription in mature mouse megakaryocytes. Sci Rep 6:30255
Zang, Chongzhi; Wang, Tao; Deng, Ke et al. (2016) High-dimensional genomic data bias correction and data integration using MANCIE. Nat Commun 7:11305
Wang, Su; Zang, Chongzhi; Xiao, Tengfei et al. (2016) Modeling cis-regulation with a compendium of genome-wide histone H3K27ac profiles. Genome Res 26:1417-1429
Zhang, Naiqian; Wang, Haiyun; Fang, Yun et al. (2015) Predicting Anticancer Drug Responses Using a Dual-Layer Integrated Cell Line-Drug Network Model. PLoS Comput Biol 11:e1004498
Wang, Hongfang; Zang, Chongzhi; Taing, Len et al. (2014) NOTCH1-RBPJ complexes drive target gene expression through dynamic interactions with superenhancers. Proc Natl Acad Sci U S A 111:705-10
Meyer, Clifford A; Liu, X Shirley (2014) Identifying and mitigating bias in next-generation sequencing methods for chromatin biology. Nat Rev Genet 15:709-21
He, Housheng Hansen; Meyer, Clifford A; Hu, Sheng'en Shawn et al. (2014) Refined DNase-seq protocol and data analysis reveals intrinsic bias in transcription factor footprint identification. Nat Methods 11:73-8
Meng, Fei-Long; Du, Zhou; Federation, Alexander et al. (2014) Convergent transcription at intragenic super-enhancers targets AID-initiated genomic instability. Cell 159:1538-48
Li, Wei; Xu, Han; Xiao, Tengfei et al. (2014) MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens. Genome Biol 15:554
Zheng, Xiaoqi; Zhao, Qian; Wu, Hua-Jun et al. (2014) MethylPurify: tumor purity deconvolution and differential methylation detection from single tumor DNA methylomes. Genome Biol 15:419
