Gene regulation is the framework on which neuronal cellular diversity is built. The substantial cellular diversity that characterizes the central nervous system of vertebrates, such as humans, must therefore require immense regulatory complexity. Although regulatory control acts at many levels, we will focus on the roles played by cis- regulatory elements (REs) in controlling the timing, location and levels of neuronal transcripts. However, the biological relevance of non-coding sequences cannot be inferred by examination of sequence alone. Perhaps the most commonly used indicator of non-coding REs is evolutionary sequence conservation. Although conservation can uncover functionally constrained sequences, it cannot predict biological function and regulatory function is not always confined to conserved sequences. At its simplest level, regulatory instructions are inscribed in transcription factor binding sites (TFBS) within REs. Yet, while many TFBS have been identified, TFBS combinations predictive of specific regulatory control have not yet emerged for vertebrates. We posit that motif combinations accounting for tissue-specific regulatory control can be identified in REs of genes expressed in those cell types. The long-range goal for this proposal is to begin to identify TFBS combinations that can predict neuronal REs - a first step in developing a neuronal regulatory lexicon. We propose 3 aims to directly approach this important challenge. First, we will evaluate ~500 putative neuronal REs in vivo, prioritizing genes critical in catecholaminergic (CA) neurogenesis and function because of the prominent role of these neurons in neurodegenerative and psychiatric disorders (Aim 1), establishing a repository of regulatory data to support the study neuronal development and dysfunction. Critically such an undertaking would not be cost effective in mice. We have developed a highly efficient reporter transgene system in zebrafish that can accurately evaluate the regulatory control of mammalian sequences, enabling characterization of reporter expression during development at a fraction of the cost. Second, we will directly determine what fraction of regulatory information may be overlooked by conservation, tiling across 4 loci (approximately 150 amplicons) and testing all non-coding sequences in our in vivo assay (Aim 2). Third, we will use these and published data sets to improve upon existing computational tools, predicting/evaluating the biological relevance of sequences at genes not tested in Aims 1 and 2 (Aim 3). This proposal is a crucial first step towards a neuronal regulatory lexicon, independent of conservation, and subsequently for other cell types.

Public Health Relevance

We wish to better understand how the regulatory instructions of critical developmental and disease genes are encoded in DNA sequence. We will focus on genes important for the neurons that are lost in disorders like Parkinson's disease. We also aim to establish new computational paradigms, and generate reagents, that will have wide applicability to understanding the wealth of information arising out of genome sequencing efforts.

National Institute of Health (NIH)
National Institute of Neurological Disorders and Stroke (NINDS)
Research Project (R01)
Project #
Application #
Study Section
Molecular Neurogenetics Study Section (MNG)
Program Officer
Morris, Jill A
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Johns Hopkins University
Schools of Medicine
United States
Zip Code
Edie, Sarah; Zaghloul, Norann A; Leitch, Carmen C et al. (2018) Survey of Human Chromosome 21 Gene Expression Effects on Early Development in Danio rerio. G3 (Bethesda) 8:2215-2223
Hook, Paul W; McClymont, Sarah A; Cannon, Gabrielle H et al. (2018) Single-Cell RNA-Seq of Mouse Dopaminergic Neurons Informs Candidate Gene Selection for Sporadic Parkinson Disease. Am J Hum Genet 102:427-446
Wang, Dong; Wu, Fan; Yuan, Haoyong et al. (2017) Sox10+ Cells Contribute to Vascular Development in Multiple Organs-Brief Report. Arterioscler Thromb Vasc Biol 37:1727-1731
Eckart, Nicole; Song, Qifeng; Yang, Rebecca et al. (2016) Functional Characterization of Schizophrenia-Associated Variation in CACNA1C. PLoS One 11:e0157086
Turner, Tychele N; Hormozdiari, Fereydoun; Duyzend, Michael H et al. (2016) Genome Sequencing of Autism-Affected Families Reveals Disruption of Putative Noncoding Regulatory DNA. Am J Hum Genet 98:58-74
Fufa, Temesgen D; Harris, Melissa L; Watkins-Chow, Dawn E et al. (2015) Genomic analysis reveals distinct mechanisms and functional classes of SOX10-regulated genes in melanocytes. Hum Mol Genet 24:5433-50
Jiang, Qian; Arnold, Stacey; Heanue, Tiffany et al. (2015) Functional loss of semaphorin 3C and/or semaphorin 3D and their epistatic interaction with ret are critical to Hirschsprung disease liability. Am J Hum Genet 96:581-96
Fischer, Audrey; Wolman, Marc; Granato, Michael et al. (2015) Carbamate nerve agent prophylatics exhibit distinct toxicological effects in the zebrafish embryo model. Neurotoxicol Teratol 50:1-10
Lee, Dongwon; Gorkin, David U; Baker, Maggie et al. (2015) A method to predict the impact of regulatory variants from DNA sequence. Nat Genet 47:955-61
Ghandi, Mahmoud; Mohammad-Noori, Morteza; Beer, Michael A (2014) Robust k-mer frequency estimation using gapped k-mers. J Math Biol 69:469-500

Showing the most recent 10 out of 28 publications