Though the genome's sequence is essentially ?xed, its state may be constantly changing inside a cell. Two aspects of this changing state are the speci?c arrangement of myriad protein complexes along the genome in the form of chromatin, and the current rate of transcript production for each gene. A fundamental research objective is to understand the relationship between these two, and in particular, how transcript production rates (TxPRs) are in?uenced by genome-wide chromatin state. The goal of this proposal is to develop models that are capable of predicting a cell's genome-wide transcription state from knowledge of its genome-wide chromatin state. To build such models requires simultaneously pro?ling a cell's genome-wide chromatin state and transcription state at different times or under different conditions: Observing how the two change together, particularly in the context of directed perturbation, provides the statistical leverage needed to build predictive models that can provide causal insight. In this proposal, models will be developed and validated by monitoring chromatin and transcription in budding yeast as they progress through the cell cycle, a temporal series of highly regulated events controlling cell proliferation, aberrations of which can lead to cancer. Owing to the complexity of this challenge, yeast is used as a starting point because of its compact genome and genetic tractability, but we anticipate our methods will also be applicable in more complex organisms, including human. One major obstacle is that state-of-the-art chromatin immunoprecipitation (ChIP) methods for assaying chromatin state require a separate experiment not only for each time point and experimental condition, but also for each of the 100s-1000s of types of proteins binding along the genome. To overcome this hurdle, this proposal describes a novel method for ef?ciently learning quantitative genome-wide chromatin occupancy pro?les (GCOPs) using nuclease-digested chromatin at single-base resolution. The proposed method enables the comprehensive determination of quantitative chromatin occupancy of transcription factors, nucleosomes, and other DNA-binding factors across the entire genome without requiring a separate experiment for each. Producing GCOPs in conjunction with high- resolution measurements of TxPRs will allow the development of sophisticated, mechanistically interpretable models that predict transcript production rates as a function of chromatin state. The proposed research will result in (i) ef?cient new methods for producing quantitative GCOPs that will be applicable in any organism with a sequenced genome; (ii) GCOPs from budding yeast as they progress through the cell cycle, revealing for the ?rst time in any organism how genome-wide chromatin occupancy changes over the course of the cell cycle; (iii) characterization of how genome-wide chromatin changes are linked to changes in TxPRs, not only in wild-type yeast, but also under a wide range of genetic and genomic perturbations; and (iv) models learned from all these data that can predict TxPRs on the basis of chromatin occupancy, providing mechanistic insight into how the cell-cycle-regulated transcription program is in?uenced by its changing chromatin state.

Public Health Relevance

We will comprehensively and quantitatively pro?le genome-wide chromatin occupancy and transcription during the cell cycle, and will use the data to develop predictive models that provide mechanistic insight into how a cell's transcription state is in?uenced by its chromatin state. Successfully achieving this will have enormous scienti?c and practical impact, enhancing our understanding of fundamental processes like cell growth and division, signal response, and differentiation, as well as clinically relevant consequences of thei dysregulation, whether in cancer, diabetes, or developmental disorders. In addition, we expect that the innovative experimental and computational modeling approaches we develop and validate in S. cerevisiae can be extended to higher eukaryotes, including human.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM118551-02
Application #
9251861
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Ravichandran, Veerasamy
Project Start
2016-04-01
Project End
2020-03-31
Budget Start
2017-04-01
Budget End
2018-03-31
Support Year
2
Fiscal Year
2017
Total Cost
$356,957
Indirect Cost
$132,456
Name
Duke University
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
044387793
City
Durham
State
NC
Country
United States
Zip Code
27705
McDowell, Ian C; Barrera, Alejandro; D'Ippolito, Anthony M et al. (2018) Glucocorticoid receptor recruits to enhancers and drives activation by motif-directed binding. Genome Res 28:1272-1284
Mayhew, Michael B; Iversen, Edwin S; Hartemink, Alexander J (2017) Characterization of dependencies between growth and division in budding yeast. J R Soc Interface 14:
Welch, Joshua D; Hartemink, Alexander J; Prins, Jan F (2017) MATCHER: manifold alignment reveals correspondence between single cell transcriptome and epigenome dynamics. Genome Biol 18:138
Welch, Joshua D; Hartemink, Alexander J; Prins, Jan F (2016) SLICER: inferring branched, nonlinear cellular trajectories from single cell RNA-seq data. Genome Biol 17:106