Transcription factors (TF) have a crucial role in controlling gene expression, and exploring their molecular targets, binding partners and mode of regulation is essential to understand any plant biological process. Arabidopsis offers some unique advantages for the development of large-scale genomic approaches for the study of TFs function such as the ease and low cost to generate large transgenic collections, or the propensity of most gene-regulatory regions to be circumscribed to a short region upstream the transcription start site. The potential of these types of strategies is greatly exemplified in a recent report from our laboratory where, by using a fraction of the full TF collection, a novel clock component was identified. For these reasons, we started gathering all available Arabidopsis TFs from the different ORFeome resources (Salk, Pekin-Yale, REGIA, TIGR and RIKEN) to generate a complete TF collection. However, our findings from resequencing the different clones revealed a large overlap between the collections and several hundred mislabeled or missing ones, resulting in a final coverage close to 75% of all the Arabidopsis transcription factors and regulators. Here, we propose to generate an homogenous gold standard GATEWAYTM compatible collection containing every Arabidopsis TF. The corresponding coding sequences will be cloned in the same vector using the available ORFeomes as the template resource when possible. The remaining 25% missing ones will be generated by following different complementary amplification protocols and subsequently cloned in the same vector backbone. In addition, we propose to create and distribute to the community, nine application-ready genomic collections containing each TF in fusion with different tags for a multitude of applications. These nine collections will allow (1) overexpression screens in plants of wild type as well as EAR or VP64 translational fusions, (2) protein-protein interaction screens, (3) protein-DNA interaction screens, (4) subcellular localization and, (5) bacterial recombinant protein expression. In addition, in collaboration with Dr. Joe Ecker at the Salk Institute, we propose to test and compare the efficiency of different protein epitope-tags suitable to perform ChIP-seq experiment. A collection of TF tagged with the selected epitope will be generated. Finally, we propose to devise a simple protocol to perform yeast one-hybrid screens with full TF collections at a reasonable cost. We strongly believe these resources have the potential to greatly enhance research in Arabidopsis and other crop species. As the knowledge gap is being filled, the study of transcriptional networks in Arabidopsis will ultimately help us understand the biochemical complexity of multicellular organisms and positively impact the biomedical research community.

Public Health Relevance

This project will be the development, construction and distribution of a large scale genomic library that will serve as molecular tools available to the Arabidopsis community. Studies of the Arabidopsis cellular pathways and transcriptional networks using these tools range from improvement of plant productivity in food, biomass production and environmental sustainability, to the impact on human health relating to malnutrition, and systems biology which could eventually impact the study of multigenic diseases such a Alzheimer's, Parkinson and cancer.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
High Impact Research and Research Infrastructure Programs (RC2)
Project #
Application #
Study Section
Special Emphasis Panel (ZGM1-GDB-7 (CR))
Program Officer
Tompkins, Laurie
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California San Diego
Schools of Arts and Sciences
La Jolla
United States
Zip Code
Lee, Chin-Mei; Feke, Ann; Li, Man-Wah et al. (2018) Decoys Untangle Complicated Redundancy and Reveal Targets of Circadian Clock F-Box Proteins. Plant Physiol 177:1170-1186
Kubota, Akane; Ito, Shogo; Shim, Jae Sung et al. (2017) TCP4-dependent induction of CONSTANS transcription requires GIGANTEA in photoperiodic flowering in Arabidopsis. PLoS Genet 13:e1006856
Breton, Ghislain; Kay, Steve A; Pruneda-Paz, José L (2016) Identification of Arabidopsis Transcriptional Regulators by Yeast One-Hybrid Screens Using a Transcription Factor ORFeome. Methods Mol Biol 1398:107-18
Taylor-Teeples, M; Lin, L; de Lucas, M et al. (2015) An Arabidopsis gene regulatory network for secondary cell wall synthesis. Nature 517:571-5
Nagel, Dawn H; Doherty, Colleen J; Pruneda-Paz, Jose L et al. (2015) Genome-wide identification of CCA1 targets uncovers an expanded clock network in Arabidopsis. Proc Natl Acad Sci U S A 112:E4802-10
Zheng, Xiao-Yu; Zhou, Mian; Yoo, Heejin et al. (2015) Spatial and temporal regulation of biosynthesis of the plant immune signal salicylic acid. Proc Natl Acad Sci U S A 112:9166-73
Zhou, Yun; Liu, Xing; Engstrom, Eric M et al. (2015) Control of plant stem cell function by conserved interacting transcriptional regulators. Nature 517:377-80
Kolmos, Elsebeth; Chow, Brenda Y; Pruneda-Paz, Jose L et al. (2014) HsfB2b-mediated repression of PRR7 directs abiotic stress responses of the circadian clock. Proc Natl Acad Sci U S A 111:16172-7
Guan, Peizhu; Wang, Rongchen; Nacry, Philippe et al. (2014) Nitrate foraging by Arabidopsis roots is mediated by the transcription factor TCP20 through the systemic signaling pathway. Proc Natl Acad Sci U S A 111:15267-72
Chow, Brenda Y; Sanchez, Sabrina E; Breton, Ghislain et al. (2014) Transcriptional regulation of LUX by CBF1 mediates cold input to the circadian clock in Arabidopsis. Curr Biol 24:1518-24

Showing the most recent 10 out of 16 publications