Control of transcription initiation is the most common mechanism by which genes are regulated. An essential part of this control is the sequence-specific binding of proteins (transcription factors) to DNA sequence elements. However, there is not a simple correspondence in eukaryotes between the existence of a transcription factor binding site and the regulation of a nearby gene by that factor. The availability of genomic sequences and the advent of DNA microarray technologies allow us to ask fundamentally new questions about gene regulation. Where in the past we have been able to show that a sequence motif is necessary for regulation of a gene, or even sufficient in some context, we can now ask how predictive of regulation the motif is, evaluated over every gene in the genome. The improvement in correlation between sequence elements and expression can be evaluated as more information is added to the analysis: improved binding site descriptions, considerations of spacing and orientation, cooperative and competing interactions, and so on. Ultimately, we would like to be able to explain, on the basis of genomic sequence, why one set of genes is regulated in response to a signal and the rest of the genes in the genome are not. As a first step in this direction, we will determine the limits to how well simple binding site considerations can rationalize gene regulation and will determine the role of protein-protein competition in gene regulation by related transcription factors. These studies are being conducted in yeast because the delineation of genes in the genome is better for yeast than for higher eukaryotes and because DNA microarray data can readily be obtained for essentially every gene. In addition to computational analyses on a variety of systems, experimental data will be obtained for selected systems with properties ideally suited for addressing issues of binding specificity and competition.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM065179-02
Application #
6735708
Study Section
Genome Study Section (GNM)
Program Officer
Tompkins, Laurie
Project Start
2003-05-01
Project End
2006-04-30
Budget Start
2004-05-01
Budget End
2005-04-30
Support Year
2
Fiscal Year
2004
Total Cost
$331,088
Indirect Cost
Name
Johns Hopkins University
Department
Internal Medicine/Medicine
Type
Schools of Medicine
DUNS #
001910777
City
Baltimore
State
MD
Country
United States
Zip Code
21218
Liu, Xiao; Lee, Cheol-Koo; Granek, Joshua A et al. (2006) Whole-genome comparison of Leu3 binding in vitro and in vivo reveals the importance of nucleosome occupancy in target site selection. Genome Res 16:1517-28
Tang, Lin; Liu, Xiao; Clarke, Neil D (2006) Inferring direct regulatory targets from expression and genome location analyses: a comparison of transcription factor deletion and overexpression. BMC Genomics 7:215
Liu, Xiao; Noll, David M; Lieb, Jason D et al. (2005) DIP-chip: rapid and accurate determination of DNA-binding specificity. Genome Res 15:421-7
Carroll, Kristina L; Pradhan, Dennis A; Granek, Josh A et al. (2004) Identification of cis elements directing termination of yeast nonpolyadenylated snoRNA transcripts. Mol Cell Biol 24:6241-52