Interactions between transcription factors (TFs) and their DNA binding sites are an integral part of regulatory networks within cells. These interactions control critical steps in development and responses to environmental stresses, and their dysfunction can contribute to the progression of various diseases. However, the genomic binding sites and regulatory functions of most of the approximately 2000 described and predicted human TFs are unknown. Prediction of regulatory sites in the genomes of higher eukaryotes is difficult because of the size and complexity of their genomes. Furthermore, even general themes regarding the locations of DNA regulatory elements are still unknown.

My lab uses computational and experimental approaches for studying transcriptional regulatory networks, both in model organisms and in the human genome. Experimentally, we are performing high-density oligonucleotide array-based readout of chromatin immunoprecipitation experiments on human and mouse TFs, to identify their in vivo binding sites in a high-throughput manner at much higher resolution than has been permitted using microarrays spotted with PCR products. We are also using improved in vitro protein binding microarray (PBM) technology to characterize the sequence specificity of transcription factors. Computationally, we are predicting candidate cis regulatory elements by integrating mRNA expression data with data on genomic noncoding regions conserved between the mouse and human genomes. We are also developing a rigorous statistical framework for the analysis of binding site clustering, and using it to develop improved algorithms for the prediction of candidate cis regulatory elements.

These studies will permit better understanding of the locations and organization of regulatory DNA elements in higher eukaryotic genomes, and will aid in understanding regulatory complexity arising from combinatorial interactions of TFs. Furthermore, the combination of these data with mRNA expression analysis, protein interaction databases and prior genetic and biochemical data in the literature will allow the development of more detailed models of transcriptional regulatory networks in higher eukaryotes. In addition, we will make our data publicly available, so that other researchers may focus their efforts on those genomic regions most likely to contain cis regulatory elements.

Agency
National Institutes of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG002966-02
Application #
6801836
Study Section
Genome Study Section (GNM)
Program Officer
Feingold, Elise A
Project Start
2003-09-19
Project End
2008-06-30
Budget Start
2004-07-01
Budget End
2005-06-30
Support Year
2
Fiscal Year
2004
Total Cost
$389,250
Indirect Cost
Name
Brigham and Women's Hospital
Department
Type
DUNS #
030811269
City
Boston
State
MA
Country
United States
Zip Code
02115
Hirsch, Heather A; Iliopoulos, Dimitrios; Joshi, Amita et al. (2010) A transcriptional signature and common gene networks link cancer with lipid metabolism and diverse human diseases. Cancer Cell 17:348-61
Grove, Christian A; De Masi, Federico; Barrasa, M Inmaculada et al. (2009) A multiparameter network reveals extensive divergence between C. elegans bHLH transcription factors. Cell 138:314-27
McCord, Rachel Patton; Bulyk, Martha L (2008) Functional trends in structural classes of the DNA binding domains of regulatory transcription factors. Pac Symp Biocomput :441-52
Warner, Jason B; Philippakis, Anthony A; Jaeger, Savina A et al. (2008) Systematic identification of mammalian regulatory motifs' target genes and functions. Nat Methods 5:347-53
Bulyk, Martha L (2007) Protein binding microarrays for the characterization of DNA-protein interactions. Adv Biochem Eng Biotechnol 104:65-85
Michelson, Alan M; Bulyk, Martha L (2006) Biological code breaking in the 21st century. Mol Syst Biol 2:2006.0018
Philippakis, Anthony A; Busser, Brian W; Gisselbrecht, Stephen S et al. (2006) Expression-guided in silico evaluation of candidate cis regulatory codes for Drosophila muscle founder cells. PLoS Comput Biol 2:e53
Huber, Bertrand R; Bulyk, Martha L (2006) Meta-analysis discovery of tissue-specific DNA sequence motifs from mammalian gene expression data. BMC Bioinformatics 7:229
Bulyk, Martha L (2006) Analysis of sequence specificities of DNA-binding proteins with protein binding microarrays. Methods Enzymol 410:279-99
Berger, Michael F; Bulyk, Martha L (2006) Protein binding microarrays (PBMs) for rapid, high-throughput characterization of the sequence specificities of DNA binding proteins. Methods Mol Biol 338:245-60

Showing the most recent 10 out of 15 publications