We have developed a method to clone synthetic DNAs with regions of random sequence. We are using this technique to create millions of ribosome binding sites in an isogenic background. The cloning vehicle contains the lacZ gene positioned to allow quantitation of initiation from each binding site and the f1 origin which allows rapid DNA sequencing. From a large set of sequences for which we know the relative efficiencies of initiation we can build linear models of the E. coli ribosome binding site. We will construct plasmids with regions that differ on the 5' and 3' sides from our original plasmid in order to study the more complex effects of mRNA structure and context on translational initiation. We propose to extend this technique into a tool for the study of any binding site. We are also studying sequence specific recognition of DNA by proteins. We have adapted the Shannon measure of information to quantitate the information content of several recognition systems. This has led to a prediction that the T7 late promoters have overlapping repressor sites, a prediction that we are now testing. We have shown that the information content of sites for recognizer proteins is related to the sequence specific binding energies, and have devised a method to determine the quantitative sequence preferences of proteins through in vitro binding to random DNA.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM028755-08
Application #
3276033
Study Section
Microbial Physiology and Genetics Subcommittee 2 (MBC)
Project Start
1980-12-01
Project End
1988-11-30
Budget Start
1987-12-01
Budget End
1988-11-30
Support Year
8
Fiscal Year
1988
Total Cost
Indirect Cost
Name
University of Colorado at Boulder
Department
Type
Schools of Arts and Sciences
DUNS #
City
Boulder
State
CO
Country
United States
Zip Code
80309
Zhou, Yiyong; Cras-Meneur, Corentin; Ohsugi, Mitsuru et al. (2007) A global approach to identify differentially expressed genes in cDNA (two-color) microarray experiments. Bioinformatics 23:2073-9
Liu, Jiajian; Stormo, Gary D (2005) Combining SELEX with quantitative assays to rapidly obtain accurate models of protein-DNA interactions. Nucleic Acids Res 33:e141
Liu, Jiajian; Stormo, Gary D (2005) Quantitative analysis of EGR proteins binding to DNA: assessing additivity in both the binding site and the protein. BMC Bioinformatics 6:176
Man, Tsz-Kwong; Yang, Joshua SungWoo; Stormo, Gary D (2004) Quantitative modeling of DNA-protein interactions: effects of amino acid substitutions on binding specificity of the Mnt repressor. Nucleic Acids Res 32:4026-32
Li, Jin Billy; Gerdes, Jantje M; Haycraft, Courtney J et al. (2004) Comparative genomics identifies a flagellar and basal body proteome that includes the BBS5 human disease gene. Cell 117:541-52
GuhaThakurta, Debraj; Schriefer, Lawrence A; Waterston, Robert H et al. (2004) Novel transcription regulatory elements in Caenorhabditis elegans muscle genes. Genome Res 14:2457-68
Liu, Jiajian; Tan, Kai; Stormo, Gary D (2003) Computational identification of the Spo0A-phosphate regulon that is essential for the cellular differentiation and development in Gram-positive spore-forming bacteria. Nucleic Acids Res 31:6891-903
Benos, Panayiotis V; Bulyk, Martha L; Stormo, Gary D (2002) Additivity in protein-DNA interactions: how good an approximation is it? Nucleic Acids Res 30:4442-51
Silbaq, Fauzi S; Ruttenberg, Steven E; Stormo, Gary D (2002) Specificity of Mnt 'master residue' obtained from in vivo and in vitro selections. Nucleic Acids Res 30:5539-48
Lin, Yiing; Han, Mei; Shimada, Brian et al. (2002) Influence of the period-dependent circadian clock on diurnal, circadian, and aperiodic gene expression in Drosophila melanogaster. Proc Natl Acad Sci U S A 99:9562-7

Showing the most recent 10 out of 36 publications