DNA and protein sequences digitally store information about biological function in a complex code that is not yet fully understood. The fundamental unit of this code is the sequence motif, which is defined as a small, recurring DNA or protein sequence pattern. A DNA motif might be involved, for example, in turning on or off the transcription of a gene in response to environmental cues. A protein motif might encode the properties of the binding site that allows the protein to carry out its function. The MEME Suite of motif-based sequence analysis software builds statistical models of DNA and protein motifs, allowing biologists to discover novel motifs, to search for new instances of known motifs, and to compare motifs to one another. This proposal continues to develop and maintain the MEME Suite, which is in regular use by biologists around the world.
The aims of this work are five-fold: (1) to increase the accessibility, usability and interoperability of the MEME Suite, (2) to expand the MEME Suite to handle epigenetic data regarding histone modifications, methylation, nucleosome positioning and DNaseI hypersensitive sites, (3) to integrate a variety of existing motif-based software tools into the MEME Suite, (4) to augment the algorithms used by the MEME Suite with proven enhancements, and (5) to continue to improve our user support services.

Public Health Relevance

This project will improve existing, widely used software that enables biologists to understand how DNA and protein sequences encode information about biological function. Identifying and accurately characterizing functional sequence motifs allows scientists to understand how genes are turned on and off and how proteins carry out their functions in the cell. Such knowledge is fundamental to any model of the basic molecular mechanisms of the cell and, in particular, to molecular-scale models of disease processes.

National Institutes of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Study Section: Special Emphasis Panel (ZRG1-BST-Q (01))
Program Officer: Ravichandran, Veerasamy
University of Washington
Schools of Medicine
United States
Ilsley, Melissa D; Gillinder, Kevin R; Magor, Graham W et al. (2017) Krüppel-like factors compete for promoters and enhancers to fine-tune transcription. Nucleic Acids Res 45:6572-6588
Overman, Jeroen; Fontaine, Frank; Moustaqil, Mehdi et al. (2017) Pharmacological targeting of the transcription factor SOX18 delays breast cancer in mice. Elife 6:
Grant, Charles E; Johnson, James; Bailey, Timothy L et al. (2016) MCAST: scanning for cis-regulatory motif clusters. Bioinformatics 32:1217-9
O'Connor, Timothy; Bodén, Mikael; Bailey, Timothy L (2016) CisMapper: predicting regulatory interactions from transcription factor ChIP-seq data. Nucleic Acids Res :
Gillinder, Kevin R; Ilsley, Melissa D; Nébor, Danitza et al. (2016) Promiscuous DNA-binding of a mutant zinc finger protein corrupts the transcriptome and diminishes cell viability. Nucleic Acids Res :
Bailey, Timothy L; Johnson, James; Grant, Charles E et al. (2015) The MEME Suite. Nucleic Acids Res 43:W39-49
Lim, Jonathan W C; Donahoo, Amber-Lee S; Bunt, Jens et al. (2015) EMX1 regulates NRP1-mediated wiring of the mouse anterior cingulate cortex. Development 142:3746-57
Ma, Wenxiu; Noble, William S; Bailey, Timothy L (2014) Motif-based analysis of large nucleotide data sets using MEME-ChIP. Nat Protoc 9:1428-50
Lesluyes, Tom; Johnson, James; Machanick, Philip et al. (2014) Differential motif enrichment analysis of paired ChIP-seq experiments. BMC Genomics 15:752
Tanaka, Emi; Bailey, Timothy L; Keich, Uri (2014) Improving MEME via a two-tiered significance analysis. Bioinformatics 30:1965-73
