Our ability to identify the majority of exons in the human genome has been dramatically facilitated by the availability of extensive experimental data (EST, cDNA, and protein sequences), which provide training sets for the development of effective algorithms for the de novo prediction of such elements. In stark contrast, the vocabulary of gene regulatory regions in the human genome remains poorly defined, in large part due to the lack of parallel experimental training sets for these sequences. Recent advances in our ability to predict, based on deep evolutionary conservation, which non-coding sequences have a higher likelihood of acting as transcriptional enhancers have provided some leverage for addressing this problem. In preliminary studies, we have examined 150 extremely conserved non-coding sequences in a transgenic mouse reporter assay and demonstrated that 58 of these sequences have distinct tissue-specific enhancer activity. With this background, we propose here to couple our expertise in comparative genomics and high-throughput mouse transgenesis to define the enhancer activity of 1,500 deeply conserved non-coding elements located throughout the human genome. We will make the results of our in vivo studies publicly available through an online database with extensive search capabilities, allowing users to bin sequences that produce similar expression patterns and to identify shared sequence features. These datasets will provide an essential resource for a broad group of investigators in computational, developmental, and clinical biology focused on deciphering the rules that govern human gene expression. Accordingly, this grant aims to classify the gene regulatory properties of non-coding DNA in the human genome through: (1) the characterization of 1,500 extremely conserved human DNA fragments for spatial enhancer activity in transgenic mice and (2) the development of a publicly available in vivo enhancer database to display these results.
In addition, to give the bioinformatics community a means of testing ab initio enhancer predictions based on their analyses of the data generated in Aim 1, we further propose to (3) test 15-20 enhancers per year, predicted by outside investigators, in our transgenic mouse system.

Lay Person Summary: The human genome sequence now serves as a routine starting point for a large base of investigators and has helped define the majority of genes in our genome. However, our understanding of the sequences that regulate these genes is meager, even though alterations in them are presumed to contribute to human disease. Here, we propose to leverage human-fish genome comparisons to identify deeply conserved non-gene sequences and to test their ability to act as gene regulatory sequences in transgenic mice. Such a community resource is expected to fill much of the gap in gene regulatory annotation of the human genome and to help determine how mutations in these sequences cause human disease.

Agency: National Institutes of Health (NIH)
Institute: National Human Genome Research Institute (NHGRI)
Type: Research Project (R01)
Project #: 3R01HG003988-04S1
Application #: 7941543
Study Section: Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer: Feingold, Elise A
Project Start: 2009-09-30
Project End: 2011-08-31
Budget Start: 2009-09-30
Budget End: 2011-08-31
Support Year: 4
Fiscal Year: 2009
Total Cost: $352,224
Indirect Cost:

Name: Lawrence Berkeley National Laboratory
Department: Genetics
Type: Organized Research Units
DUNS #: 078576738
City: Berkeley
State: CA
Country: United States
Zip Code: 94720

McClymont, Sarah A; Hook, Paul W; Soto, Alexandra I et al. (2018) Parkinson-Associated SNCA Enhancer Variants Revealed by Open Chromatin in Mouse Dopamine Neurons. Am J Hum Genet 103:874-892
Osterwalder, Marco; Barozzi, Iros; Tissières, Virginie et al. (2018) Enhancer redundancy provides phenotypic robustness in mammalian development. Nature 554:239-243
Dickel, Diane E; Ypsilanti, Athena R; Pla, Ramón et al. (2018) Ultraconserved Enhancers Are Required for Normal Development. Cell 172:491-499.e15
Stender, Stefan; Smagris, Eriks; Lauridsen, Bo K et al. (2018) Relationship between genetic variation at PPP1R3B and levels of liver glycogen and triglyceride. Hepatology 67:2182-2195
Anderson, Courtney M; Hu, Jianxin; Thomas, Reuben et al. (2017) Cooperative activation of cardiac transcription through myocardin bridging of paired MEF2 sites. Development 144:1235-1241
Gompers, Andrea L; Su-Feher, Linda; Ellegood, Jacob et al. (2017) Germline Chd8 haploinsufficiency alters brain development in mouse. Nat Neurosci 20:1062-1073
Turner, Tychele N; Coe, Bradley P; Dickel, Diane E et al. (2017) Genomic Patterns of De Novo Mutation in Simplex Autism. Cell 171:710-722.e12
Laurent, Frédéric; Girdziusaite, Ausra; Gamart, Julie et al. (2017) HAND2 Target Gene Regulatory Networks Control Atrioventricular Canal and Cardiac Valve Development. Cell Rep 19:1602-1613
Monti, Remo; Barozzi, Iros; Osterwalder, Marco et al. (2017) Limb-Enhancer Genie: An accessible resource of accurate enhancer predictions in the developing limb. PLoS Comput Biol 13:e1005720
Will, Anja J; Cova, Giulia; Osterwalder, Marco et al. (2017) Composition and dosage of a multipartite enhancer cluster control developmental expression of Ihh (Indian hedgehog). Nat Genet 49:1539-1545

Showing the most recent 10 out of 72 publications