The broad goal of this proposal is to develop a general and highly-sensitive method that maps the position, amount, and orientation of transcriptionally-engaged RNA polymerases across the entire genomes of mammalian cells and Drosophila. Preliminary results from this Global nuclear Run-On and massively parallel Sequencing method (GRO-seq) demonstrate the potential of this approach to provide a global picture of the transcription status of genome. Additionally, these early results uncovered novel antisense short transcription units upstream and adjacent to many known promoters and suggest modes of regulation that heretofore have not been considered. This proposal seeks to rigorously test and further develop the GRO-seq method to provide unprecedented, near-nucleotide resolution of RNA polymerase positions over the genome.
Aim 1 is to optimize and critically evaluate the first generation GRO-seq method, which is already producing results. Simple variations of this procedure should be capable of identifying the type of polymerase, that is, RNA polymerases I, II, or III.
Aim 2 is to develop second generation, ultra-high resolution versions of GRO-seq that map with several-nucleotide resolution the 3'ends of nacent RNA, which are associated with active site of all transcriptionally- engaged polymerases. An additional adaptation of GRO-seq will identify transcription start sites of transcriptionally- engaged RNA polymerase II (Pol II) with high sensitivity. The precise position and status of Pol II relative to transcription start sites will identify sequences where transcription elongation is confronted with a rate limiting step. Indeed, recent studies are documenting that Pol II pausing immediately downstream of promoters is a rate-limiting step in the regulation of many metazoan genes. Furthermore, the correlation of Pol II position to genome-wide transcription factor maps will identify candidate factors involved in this regulation.
Aim 3 will develop GRO-seq methods so they can be generally applied to key model systems including representative classes of human cell lines, mouse embryonic stem cells, and Drosophila embryos. This last aim will also test the ability to globally detect rapid changes in transcription in response to regulatory signals. Finally, although the focus of the proposal is to develop robust, generally-applicable assays of transcriptionally-engaged RNA polymerases, the data generated in testing these cell and organismal models will also provide critical transcription information to existing the human ENCODE and model organism modENCODE projects as well as to the mouse stem cell efforts of the International Regulome Consortium. Because GRO-seq can define engaged RNA polymerases in cells under normal conditions as well as during infection and disease, it provides a valuable tool for basic research of numerous diseases and can produce critical signatures in diagnostic studies. Project Narrative This is a proposal to develop a novel method that maps the position, amount, and orientation of transcriptionally-engaged RNA polymerases across the genome at unprecedented, near-nucleotide resolution. GRO-seq will provide a general tool that will be applicable to efforts of ENCODE and modENCODE projects to define in molecular terms how the DNA code in higher eukaryotes is read and regulated to provide the patterns of gene expression required to develop and maintain a multicellular organism. GRO-seq can define both the level and status of all engaged RNA polymerases in cells under normal conditions, and during infection and disease providing a powerful tool for basic and diagnostic studies.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG004845-03
Application #
8014950
Study Section
Molecular Genetics B Study Section (MGB)
Program Officer
Feingold, Elise A
Project Start
2009-01-24
Project End
2012-12-31
Budget Start
2011-01-01
Budget End
2012-12-31
Support Year
3
Fiscal Year
2011
Total Cost
$343,617
Indirect Cost
Name
Cornell University
Department
Biochemistry
Type
Schools of Earth Sciences/Natur
DUNS #
872612445
City
Ithaca
State
NY
Country
United States
Zip Code
14850
Kruesi, William S; Core, Leighton J; Waters, Colin T et al. (2013) Condensin controls recruitment of RNA polymerase II to achieve nematode X-chromosome dosage compensation. Elife 2:e00808
Kilpinen, Helena; Waszak, Sebastian M; Gschwind, Andreas R et al. (2013) Coordinated effects of sequence variation on DNA binding, chromatin structure, and transcription. Science 342:744-7
Kwak, Hojoong; Fuda, Nicholas J; Core, Leighton J et al. (2013) Precise maps of RNA polymerase reveal how promoters direct initiation and pausing. Science 339:950-3
Core, Leighton J; Waterfall, Joshua J; Gilchrist, Daniel A et al. (2012) Defining the status of RNA polymerase at promoters. Cell Rep 2:1025-35
Min, Irene M; Waterfall, Joshua J; Core, Leighton J et al. (2011) Regulating RNA polymerase pausing and transcription elongation in embryonic stem cells. Genes Dev 25:742-54
Larschan, Erica; Bishop, Eric P; Kharchenko, Peter V et al. (2011) X chromosome dosage compensation via enhanced transcriptional elongation in Drosophila. Nature 471:115-8
Hah, Nasun; Danko, Charles G; Core, Leighton et al. (2011) A rapid, extensive, and transient transcriptional response to estrogen signaling in breast cancer cells. Cell 145:622-34
Chopra, Vivek S; Hendrix, David A; Core, Leighton J et al. (2011) The polycomb group mutant esc leads to augmented levels of paused Pol II in the Drosophila embryo. Mol Cell 42:837-44