Our project seeks to identify the regulatory elements recognized by essentially all of transcription factors (TFs) in the fruit fly Drosophila melanogaster and the nematode Caenorhabditis elegans. Transcription factors (TFs) play key roles in diverse aspects of development and physiology. A catalog of sites where transcription factors bind (regulatory sequences) is perhaps only second in importance to a catalog of genes in understanding how a genome specifies an organism. First as part of modENCODE and over the past grant cycle as the independent modern program, we will have generated about 800 ChIP-seq profiles (440 worm, 380 fly) for more than 600 transcription factors (280 worm, 340 fly). Building on this progress in this proposal we seek: 1) to complete an initial catalog of binding sites for all transcription factors in both D. melanogaster and C. elegans; 2) to validate these sites through measuring the impact of transcription factor loss on the expression of genes; 3) to integrate binding sites and gene expression profiles both with one another and with other available information to develop models of gene expression and gene regulatory networks. All of the strain resources and data will be made publicly available on a timely basis throughout the project. These catalogs will represent the first comprehensive description of the TF binding sites in any metazoan and will provide a context for understanding the catalog of TF binding sites that will emerge from ENCODE.

Public Health Relevance

Insights from the study of the model organisms Drosophila and C. elegans provide the basis for broad understanding of fundamental processes of animal biology. Because many of their genes have clear relatives in humans, these studies have also led directly to improved understanding of human diseases and in some cases to therapies. Similarly, creating a comprehensive understanding of transcription factor binding sites and building regulatory networks in these key model organisms will create the foundation for understanding human regulatory networks both in health and disease.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Biotechnology Resource Cooperative Agreements (U41)
Project #
5U41HG007355-07
Application #
9944655
Study Section
Special Emphasis Panel (ZHG1)
Program Officer
Morris, Stephanie A
Project Start
2013-09-20
Project End
2022-03-31
Budget Start
2020-04-01
Budget End
2021-03-31
Support Year
7
Fiscal Year
2020
Total Cost
Indirect Cost
Name
University of Washington
Department
Genetics
Type
Schools of Medicine
DUNS #
605799469
City
Seattle
State
WA
Country
United States
Zip Code
98195
Kudron, Michelle M; Victorsen, Alec; Gevirtzman, Louis et al. (2018) The ModERN Resource: Genome-Wide Binding Profiles for Hundreds of Drosophila and Caenorhabditis elegans Transcription Factors. Genetics 208:937-949
Cao, Junyue; Packer, Jonathan S; Ramani, Vijay et al. (2017) Comprehensive single-cell transcriptional profiling of a multicellular organism. Science 357:661-667
Sin, Olga; de Jong, Tristan; Mata-Cabana, Alejandro et al. (2017) Identification of an RNA Polymerase III Regulator Linked to Disease-Associated Protein Aggregation. Mol Cell 65:1096-1108.e6
Weicksel, Steven E; Mahadav, Assaf; Moyle, Mark et al. (2016) A novel small molecule that disrupts a key event during the oocyte-to-embryo transition in C. elegans. Development 143:3540-3548
Thompson, Owen A; Snoek, L Basten; Nijveen, Harm et al. (2015) Remarkably Divergent Regions Punctuate the Genome Assembly of the Caenorhabditis elegans Hawaiian Strain CB4856. Genetics 200:975-89
Cheng, Chao; Andrews, Erik; Yan, Koon-Kiu et al. (2015) An approach for determining and measuring network hierarchy applied to comparing the phosphorylome and the regulome. Genome Biol 16:63
Wang, Daifeng; Yan, Koon-Kiu; Sisu, Cristina et al. (2015) Loregic: a method to characterize the cooperative logic of regulatory factors. PLoS Comput Biol 11:e1004132
Kasper, Dionna M; Wang, Guilin; Gardner, Kathryn E et al. (2014) The C. elegans SNAPc component SNPC-4 coats piRNA domains and is globally required for piRNA abundance. Dev Cell 31:145-58
Gerstein, Mark B; Rozowsky, Joel; Yan, Koon-Kiu et al. (2014) Comparative analysis of the transcriptome across distant species. Nature 512:445-8