Since the human genome was first sequenced a decade ago, researchers have made great strides in identifying the genomic locations of many kinds of functional elements, including the sequences that control gene regulation. Nevertheless, the primary focus to date has been to catalog individual regulatory elements, without regard for their dynamic behavior or interactions. In this proposal, we outline an innovative approach for both identifying sequences critical for gene regulation and characterizing their dynamic interactions. Our proposal involves combining a powerful method for directly measuring the expression of genes, called PRO-seq, with an adaptation of DNase-seq, a method for identifying positions in the genome at which gene-regulating transcription factors are bound. We propose to apply these methods in a time course after stimulation of an inducible system to obtain dynamic, genome-wide information about both binding and expression, focusing in particular on stress responses induced by the small molecule celastrol in the immortalized K562 leukemia cell line. Because neither PRO-seq nor DNase-seq depends on antibodies to particular transcription factors, or on the technique of chromatin immunoprecipitation, we describe this approach as factor-general and ChIP-free. Our proposal has three main aims: (1) to identify and characterize transcription units using PRO-seq; (2) to identify and characterize the binding sites for many transcription factors using DNase-seq; and (3) to integrate these dynamic patterns of transcription and binding to reveal networks of interaction between regulatory sequences and transcription units. Each of these aims involves the development of new statistical models and computational methods. Our newly generated data, our predictions, and our software will all be made publicly available.

Public Health Relevance

(unchanged from original) We propose to make use of powerful experimental technologies and computational methods to shed new light on the mechanisms of gene regulation in human cells. Gene regulation is a critical link between genotype and phenotype, and is implicated in many human diseases.

Agency
National Institutes of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
1R01HG007070-01A1
Application #
8578768
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Pazin, Michael J
Project Start
2013-09-01
Project End
2016-05-31
Budget Start
2013-09-01
Budget End
2014-05-31
Support Year
1
Fiscal Year
2013
Total Cost
$342,489
Indirect Cost
$117,489
Name
Cornell University
Department
Biostatistics & Other Math Sci
Type
Schools of Earth Sciences/Natur
DUNS #
872612445
City
Ithaca
State
NY
Country
United States
Zip Code
14850
Danko, Charles G; Choate, Lauren A; Marks, Brooke A et al. (2018) Dynamic evolution of regulatory element ensembles in primate CD4+ T cells. Nat Ecol Evol 2:537-548
Dukler, Noah; Booth, Gregory T; Huang, Yi-Fei et al. (2017) Nascent RNA sequencing reveals a dynamic global transcriptional response at genes and enhancers to the natural medicinal compound celastrol. Genome Res 27:1816-1829
Dukler, Noah; Gulko, Brad; Huang, Yi-Fei et al. (2016) Is a super-enhancer greater than the sum of its parts? Nat Genet 49:2-3
Danko, Charles G; Hyland, Stephanie L; Core, Leighton J et al. (2015) Identification of active transcriptional regulatory elements from GRO-seq data. Nat Methods 12:433-8
Core, Leighton J; Martins, André L; Danko, Charles G et al. (2014) Analysis of nascent RNA identifies a unified architecture of initiation regions at mammalian promoters and enhancers. Nat Genet 46:1311-20
Siepel, Adam; Arbiza, Leonardo (2014) Cis-regulatory elements and human evolution. Curr Opin Genet Dev 29:81-9