Despite clear ties between many viruses and human diseases, the relationship between disease genetic susceptibility and viral infection remains largely unexplored. For most diseases, genetic factors only explain a small fraction of disease susceptibility, suggesting that synergism with environmental factors such as viral infection must play a key role in their etiology. A common way that viruses interact with their host is through manipulation of host gene expression via regulatory interactions between viral proteins and the host genome. Many disease-associated genetic variants are non-coding, and thus likely to impact gene regulation. Our hypothesis is that specific genetic variants associated with human diseases act by influencing the binding of viral transcription factors (TFs) to host DNA. We propose a computational system that will use available data to identify these variants, and produce testable hypotheses about the viral proteins whose binding they influence:
Aim 1. Construct a regulatory network linking viral proteins to host genomic binding locations. We will create a network of genomic binding locations, and physical interactions between viral and human proteins.
Aim 2. Determine the DNA binding preferences of human-hosted viral transcription factors. We will experimentally determine the DNA binding preferences of viral TFs using Protein Binding Microarrays. We will supplement these data with already established viral TF DNA binding motifs.
Aim 3. Create a system to identify disease-associated genetic variants influencing viral protein binding. We will create a computational system to identify the union of the disease-associated genetic variants with the regulatory network developed in Aim 1, and the motifs determined in Aim 2. This system will systematically search for variants that are likely to alter the binding of viral proteins by modifying TF binding sites. The freely available database and tools developed in this study will facilitate new genomic analyses investigating the impact of disease-associated genetic variants on virus-host interactions. The proposed research is innovative because it will be the first study to combine available relevant data for studying this mechanism in a single predictive framework. Its contribution will be significant because it will enable the investigation of the effect of disease-associated genetic variants on viral protein binding for any disease with a known or suspected viral component. It is now possible to systematically identify such interactions using existing data, thanks to massive increases in genome-wide association study (GWAS) and functional genomics data. This resource will be essential for analysis of both currently available and new GWAS data involving any disease with a known or suspected viral environmental component. Based on our expertise, we will focus on identifying predictions relevant to a specific disease (lupus) and specific viruses (Epstein-Barr virus and Herpes Simplex Virus 1). We expect that the mechanisms identified by this system will help reveal environment-genotype links for many human diseases, opening new avenues of exploration into the mechanisms of disease onset and progression.

Public Health Relevance

The proposed research is relevant to public health because many human diseases, despite established epidemiological ties to viral infection, lack a mechanistic understanding of these associations. This work will facilitate new analyses investigating virus-human interactions, and the influence of disease-associated genetic variants on these interactions. The project is relevant to NIH's mission because it will deepen our understanding of how genetic variants and viruses synergize to influence the progression of many diseases.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Exploratory/Developmental Grants (R21)
Project #
5R21HG008186-03
Application #
9189640
Study Section
Special Emphasis Panel (ZRG1)
Program Officer
Troyer, Jennifer L
Project Start
2014-12-15
Project End
2018-11-30
Budget Start
2016-12-01
Budget End
2018-11-30
Support Year
3
Fiscal Year
2017
Total Cost
Indirect Cost
Name
Cincinnati Children's Hospital Medical Center
Department
Type
DUNS #
071284913
City
Cincinnati
State
OH
Country
United States
Zip Code
45229
Valanejad, Leila; Cast, Ashley; Wright, Mary et al. (2018) PARP1 activation increases expression of modified tumor suppressors and pathways underlying development of aggressive hepatoblastoma. Commun Biol 1:67
Harley, John B; Chen, Xiaoting; Pujato, Mario et al. (2018) Transcription factors operate across disease loci, with EBNA2 implicated in autoimmunity. Nat Genet 50:699-707
Zhang, Xue; Biagini Myers, Jocelyn M; Burleson, J D et al. (2018) Nasal DNA methylation is associated with childhood asthma. Epigenomics 10:629-641
Endale, Mehari; Ahlfeld, Shawn; Bao, Erik et al. (2017) Temporal, spatial, and phenotypical changes of PDGFR? expressing fibroblasts during late lung development. Dev Biol 425:161-175
Tillo, Desiree; Ray, Sreejana; Syed, Khund-Sayeed et al. (2017) The Epstein-Barr Virus B-ZIP Protein Zta Recognizes Specific DNA Sequences Containing 5-Methylcytosine and 5-Hydroxymethylcytosine. Biochemistry 56:6200-6210
Gu, Zirong; Kalambogias, John; Yoshioka, Shin et al. (2017) Control of species-dependent cortico-motoneuronal connections underlying manual dexterity. Science 357:400-404
Miller, Daniel E; Patel, Zubin H; Lu, Xiaoming et al. (2016) Screening for Functional Non-coding Genetic Variants Using Electrophoretic Mobility Shift Assay (EMSA) and DNA-affinity Precipitation Assay (DAPA). J Vis Exp :
Chen, Xiaoting; Ernst, Kevin; Soman, Frances et al. (2015) CressInt: a user-friendly web resource for genome-scale exploration of gene regulation in Arabidopsis thaliana. Curr Plant Biol 3-4:48-55