Sequence-specific transcription factors (TFs) regulate gene expression through their interactions with DNA sequences in the genome. The goals of this project are to continue developing approaches and data sets for understanding the DNA binding specificities of TFs and to identify the effects of coding polymorphisms within TFs'DNA binding domains on their DNA binding preferences. Identification of TFs'DNA binding specificities is important in understanding transcriptional regulatory networks, in particular in the prediction of cis regulator modules, inference of cis regulatory codes, and interpretation of in vivo TF binding data and gene expression data. Identification of the DNA binding effects of such polymorphic TF variants will be essential in studies aimed at understanding the gene regulatory effects resulting from natural genetic variation. This project will focus on human TFs, with an emphasis on those with known mutations or polymorphisms identified in exome or whole-genome sequencing projects. Our results will provide data that will likely be of importance to other systems, and more generally, our data, approaches, technologies, and database will be useful not only for human TFs but also for model organism studies. Specifically, we will: (1) develop a computational pipeline to predict the effects of coding mutations or polymorphisms within TF DNA binding domains (DBDs) for TFs of major structural classes;(2) determine the DNA binding specificities of mutant TFs designed to test our computational pipeline;(3) experimentally determine the effects of known mutations or coding polymorphisms within human TF DBDs;(4) further develop and maintain the UniPROBE database of universal protein binding microarray (PBM) data on TFs'DNA binding specificities.

Public Health Relevance

The interactions between transcription factors (TFs) and their DNA binding sites are an integral part of gene regulatory networks within cells;however, it is not well understood how mutations or polymorphisms within TFs affect their DNA binding activities. In this project, we will develop a computational pipeline to identify with greater accuracy potentially damaging mutations or polymorphisms within TFs, and we will test such predictions experimentally. The resulting data are anticipated to improve the ability to understand the potential effects of such TF mutations or polymorphisms on gene regulation.

National Institute of Health (NIH)
National Human Genome Research Institute (NHGRI)
Research Project (R01)
Project #
Application #
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Smith, Michael
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Brigham and Women's Hospital
United States
Zip Code
Barrera, Luis A; Vedenko, Anastasia; Kurland, Jesse V et al. (2016) Survey of variation in human transcription factors reveals prevalent DNA binding changes. Science 351:1450-4
Nelms, Bradlee D; Waldron, Levi; Barrera, Luis A et al. (2016) CellMapper: rapid and accurate inference of gene expression in difficult-to-isolate cell types. Genome Biol 17:201
Hume, Maxwell A; Barrera, Luis A; Gisselbrecht, Stephen S et al. (2015) UniPROBE, update 2015: new tools and content for the online database of protein-binding microarray data on protein-DNA interactions. Nucleic Acids Res 43:D117-22
Menke, Chelsea; Cionni, Megan; Siggers, Trevor et al. (2015) Grhl2 is required in nonneural tissues for neural progenitor survival and forebrain development. Genesis :
Nishi, Yuichi; Zhang, Xiaoxiao; Jeong, Jieun et al. (2015) A direct fate exclusion mechanism by Sonic hedgehog-regulated transcriptional repressors. Development 142:3286-93
Siggers, Trevor; Reddy, Jessica; Barron, Brian et al. (2014) Diversification of transcription factor paralogs via noncanonical modularity in C2H2 zinc finger DNA binding. Mol Cell 55:640-8
Zhao, Bo; Barrera, Luis A; Ersing, Ina et al. (2014) The NF-κB genomic landscape in lymphoblastoid B cells. Cell Rep 8:1595-606
Christodoulou, Danos C; Wakimoto, Hiroko; Onoue, Kenji et al. (2014) 5'RNA-Seq identifies Fhl1 as a genetic modifier in cardiomyopathy. J Clin Invest 124:1364-70
Cheatle Jarvela, Alys M; Brubaker, Lisa; Vedenko, Anastasia et al. (2014) Modular evolution of DNA-binding preference of a Tbrain transcription factor provides a mechanism for modifying gene regulatory networks. Mol Biol Evol 31:2672-88
Gordan, Raluca; Shen, Ning; Dror, Iris et al. (2013) Genomic regions flanking E-box binding sites influence DNA binding specificity of bHLH transcription factors through DNA shape. Cell Rep 3:1093-104

Showing the most recent 10 out of 44 publications