Dissecting the transcriptional regulatory networks is essential for understanding development and the molecular basis of many diseases. Great progress has been made and the emerging view is that the presence of individual regulatory elements is rarely sufficient to explain spatial-temporal specific gene expression and regulatory elements usually are organized into functional units - modules. Modules control gene expression in a particular context independent of its position and orientation. Experimental identification of modules is often a laborious and expensive process. Computational approaches can be fast and inexpensive, however, the development of computational methods to identify modules is still in its infancy. The goal of the proposed research is to use C. elegans as a model system, to develop and validate computational strategies to identify regulatory modules in the genomic sequences. First, the regulatory region of a set of genes that are preferentially expressed in the muscle tissue of C. elegans, together with that of the orthologous genes in related species will be used to identify muscle-specific regulatory motifs. Several different computational approaches and various existing computational tools will be employed. Next, statistic analysis will be used to exploit the enrichment of certain combinations of motifs in muscle specific genes in comparison with the genome at large and to analyze the interactions among motifs. This will provide insight into the rules that govern the organization of cis-regulatory elements to form biologically active modules. The information will be used to develop computational tools to identify modules that control muscle - specific transcription. To validate computational predictions in vivo, GFP reporter gene constructs will be used to determine whether a gene is expressed in muscle tissue and to test putative regulatory modules. The results of the validation experiments will be used to refine and improve our algorithms. Although I will focus my efforts on muscle specific gene expression, I believe the approaches and tools I develop will be of general use for many other context-specific module identification. The computational tools I develop will be made freely available to the scientific community.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Postdoctoral Individual National Research Service Award (F32)
Project #
1F32GM073444-01
Application #
6886620
Study Section
Special Emphasis Panel (ZRG1-F08 (20))
Program Officer
Haynes, Susan R
Project Start
2005-07-01
Project End
2008-06-30
Budget Start
2005-07-01
Budget End
2006-06-30
Support Year
1
Fiscal Year
2005
Total Cost
$43,976
Indirect Cost
Name
Washington University
Department
Genetics
Type
Schools of Medicine
DUNS #
068552207
City
Saint Louis
State
MO
Country
United States
Zip Code
63130
Zhao, Guoyan; Ihuegbu, Nnamdi; Lee, Mo et al. (2012) Conserved Motifs and Prediction of Regulatory Modules in Caenorhabditis elegans. G3 (Bethesda) 2:469-81
Missiuro, Patrycja Vasilyev; Liu, Kesheng; Zou, Lihua et al. (2009) Information flow analysis of interactome networks. PLoS Comput Biol 5:e1000350
Zhao, Guoyan; Schriefer, Lawrence A; Stormo, Gary D (2007) Identification of muscle-specific regulatory modules in Caenorhabditis elegans. Genome Res 17:348-57