Defining the regulatory networks altered in the disease can provide not only the insights on the mechanisms underlying disease, but also the possible therapeutic targets. Several factors such as genetic variation and methylation sites can disrupt the interaction between transcription factors (TFs) and cis-regulatory regions (e.g. promoters and enhancers) and thus alter the regulatory networks. However, to identify the altered networks in disease is still challenging. First, to identify the genetic variation and methylation sites that play a role in gene regulation, we will need to map the genetic variation and methylation sites on the regulatory regions that is specific to the pathological tissues. While DNase I hypersensitivity sites (DHSs) and histone mark profiles are powerful to determine the regulatory regions, it is not feasible for every laboratory to be equipped to measure DHS and histone mark on the tissues of interest. Therefore, we need a computational algorithm that is accurate enough to differentiate the regulatory regions between diseased and normal samples. Second, although a large number of differentially methylated sites have been determined for different disease, their functional role remains largely unclear. DNA methylation has been generally considered as a potent epigenetic modification that prohibits TF recruitment, resulting in transcription suppression. Recent studies and our own preliminary results showed that some TFs preferentially bind to methylated DNA, an interaction that in some cases activates gene transcription. Therefore, we need to identify such TFs and incorporate these methylation- dependent TF-DNA interactions in the computational platform. Third, we need a unified computational framework to incorporate various and these diverse types of factors that could alter the regulatory networks. To address these challenges, we will develop a computational framework to incorporate the effects of genetic and epigenetic variations and identify the regulatory networks altered by these effects. In this framework, we will develop a computational approach to predict regulatory regions in tissue of interest by integrating various epigenetic datasets (Aim 1). Our approach is analogous to homology modeling for protein structure prediction, fully utilizing the existing epigenetic datasets from ENCODE project. We will then develop a model to provide quantitative measurement of interaction strength between TFs and DNA with consideration of genetic variation, DNA methylation and TF concentration (Aim 2). This model will incorporate our new discovery that some TFs preferentially bind to methylated DNA motifs. Our computational framework will then be applied to age-related macular degeneration (AMD), which is the leading cause of vision loss in Americans aged 60 and older. The altered regulatory networks in AMD will then be experimentally evaluated (Aim 3). Finally, we will make our software and the regulatory networks in AMD available through an interactive, user-friendly database (Aim 4).

Public Health Relevance

In this proposal, we will develop novel computational approaches to predict the altered regulatory relationships by integrating a variety of datasets. We will then apply the approaches to age-related macular degeneration, which is the leading cause of vision loss in Americans aged 60 and older. The predictions obtained from this project will not only provide the insights into the mechanisms underlying human disease, but also help to identify the possible therapeutic targets.

National Institute of Health (NIH)
National Eye Institute (NEI)
Research Project (R01)
Project #
Application #
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Shen, Grace L
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Johns Hopkins University
Schools of Medicine
United States
Zip Code
Wan, Jun; Su, Yijing; Song, Qifeng et al. (2017) Methylated cis-regulatory elements mediate KLF4-dependent gene transactivation and cell migration. Elife 6:
Liu, Sheng; Zibetti, Cristina; Wan, Jun et al. (2017) Assessing the model transferability for prediction of transcription factor binding sites based on chromatin accessibility. BMC Bioinformatics 18:355
Zhu, Heng; Wang, Guohua; Qian, Jiang (2016) Transcription factors as readers and effectors of DNA methylation. Nat Rev Genet 17:551-65
Gao, Tianshun; He, Bing; Liu, Sheng et al. (2016) EnhancerAtlas: a resource for enhancer annotation and analysis in 105 human cell/tissue types. Bioinformatics 32:3543-3551
Wang, Jie; Xia, Shuli; Arand, Brian et al. (2016) Single-Cell Co-expression Analysis Reveals Distinct Functional Modules, Co-regulation Mechanisms and Clinical Outcomes. PLoS Comput Biol 12:e1004892
Hwang, Woochang; Oliver, Verity F; Merbs, Shannath L et al. (2015) Prediction of promoters and enhancers using multiple DNA methylation-associated features. BMC Genomics 16 Suppl 7:S11
Li, Renfeng; Liao, Gangling; Nirujogi, Raja Sekhar et al. (2015) Phosphoproteomic Profiling Reveals Epstein-Barr Virus Protein Kinase Integration of DNA Damage Response and Mitotic Signaling. PLoS Pathog 11:e1005346
Oliver, Verity F; Jaffe, Andrew E; Song, Jin et al. (2015) Differential DNA methylation identified in the blood and retina of AMD patients. Epigenetics 10:698-707
Wan, Jun; Oliver, Verity F; Wang, Guohua et al. (2015) Characterization of tissue-specific differential DNA methylation suggests distinct modes of positive and negative gene expression regulation. BMC Genomics 16:49
Bedont, Joseph L; LeGates, Tara A; Slat, Emily A et al. (2014) Lhx1 controls terminal differentiation and circadian function of the suprachiasmatic nucleus. Cell Rep 7:609-22