Deciphering the transcriptional regulatory network (TRN) governing a biological process in mammalian systems is essential to understanding the basic mechanisms underlying normal physiology as well as disease etiology. This is a daunting task because many links in the TRN remain unknown. The emergence of ChIP-chip and ChIP-seq technologies has enabled genome-wide mapping of the binding sites of many transcription factors (TFs) known to be key regulators in a biological process. However, these two technologies are limited to known regulators for which ChIP-quality antibodies are available. We found that TF binding is often associated with a dynamic histone mark signature and can be computationally predicted from genome-wide histone mark dynamics. We therefore hypothesize that TRNs in mammalian biological processes can be inferred from time-course nucleosome-resolution ChIP-seq of a few informative histone marks, RNA-seq gene expression data, and effective computational modeling. Specifically, in Aim 1 we propose to develop computational algorithms that, first, predict TF binding from nucleosome-resolution histone mark dynamics; second, identify target genes from TF binding, histone marks, and gene expression profiles; and third, infer the TRN over a time course. In Aim 2, we propose to apply these algorithms to two biological systems: the differentiation of the mouse myoblast cell line C2C12 into bone, fat, or muscle, and the reversible reprogramming of the human apocrine breast cancer cell line MDA-MB-453 to epithelial cells with vitamin D treatment. Through time-course nucleosome-resolution histone mark ChIP-seq and RNA-seq profiling, we will computationally infer and experimentally validate the TRNs in these two systems.

Public Health Relevance

We propose a systematic and unbiased approach to infer the transcriptional regulatory networks (TRNs) governing two biological processes in mammalian systems, an approach that can be cost-effectively extended to other processes. In particular, we expect our approaches to improve the understanding of stem cell fate control and cell identity reprogramming, and to facilitate treatments for many devastating diseases and injuries. The TRNs obtained from the two biological systems in Aim 2 will reveal new molecular mechanisms of transcriptional and epigenetic regulation and may help identify new targets for therapeutic intervention in obesity and cancer. The computational algorithms from Aim 1 and the time-course high-throughput data from Aim 2 will be made publicly available as resources for the biomedical research community.

National Institutes of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Swain, Amy L
Dana-Farber Cancer Institute
United States
Wang, Hongfang; Zang, Chongzhi; Taing, Len et al. (2014) NOTCH1-RBPJ complexes drive target gene expression through dynamic interactions with superenhancers. Proc Natl Acad Sci U S A 111:705-10
Yamamoto, Shoji; Wu, Zhenhua; Russnes, Hege G et al. (2014) JARID1B is a luminal lineage-driving oncogene in breast cancer. Cancer Cell 25:762-77
He, Housheng Hansen; Meyer, Clifford A; Hu, Sheng'en Shawn et al. (2014) Refined DNase-seq protocol and data analysis reveals intrinsic bias in transcription factor footprint identification. Nat Methods 11:73-8
Zheng, Xiaoqi; Zhao, Qian; Wu, Hua-Jun et al. (2014) MethylPurify: tumor purity deconvolution and differential methylation detection from single tumor DNA methylomes. Genome Biol 15:419
Meyer, Clifford A; Liu, X Shirley (2014) Identifying and mitigating bias in next-generation sequencing methods for chromatin biology. Nat Rev Genet 15:709-21
Cai, Changmeng; He, Housheng Hansen; Gao, Shuai et al. (2014) Lysine-specific demethylase 1 has dual functions as a major regulator of androgen receptor transcriptional activity. Cell Rep 9:1618-27
Luyten, Annouck; Zang, Chongzhi; Liu, X Shirley et al. (2014) Active enhancers are delineated de novo during hematopoiesis, with limited lineage fidelity among specified primary blood cells. Genes Dev 28:1827-39
Du, Zhou; Fei, Teng; Verhaak, Roel G W et al. (2013) Integrative genomic analyses reveal clinically relevant long noncoding RNAs in human cancer. Nat Struct Mol Biol 20:908-13
Du, Zhou; Li, Hui; Wei, Qiang et al. (2013) Genome-wide analysis of histone modifications: H3K4me2, H3K4me3, H3K9ac, and H3K27ac in Oryza sativa L. Japonica. Mol Plant 6:1463-72
He, Housheng Hansen; Meyer, Clifford A; Chen, Mei Wei et al. (2012) Differential DNase I hypersensitivity reveals factor-dependent chromatin dynamics. Genome Res 22:1015-25