PROJECT 2: UW-CNOF DATA ANALYSIS AND MODELING

Making effective use of the large and diverse nucleome data sets generated by the UW-CNOF and other members of the 4D Nucleome Consortium requires sophisticated computational methods deployed through robust, user-friendly software. Here, we propose to create and validate such methods and to disseminate the resulting software tools to the wider scientific community.

Interpreting genomic and epigenomic data requires methods that scale to very large data sets and that handle heterogeneous data types, each with its own idiosyncratic patterns of statistical dependence and noise. In addition, 4D nucleome data of the type to be generated by the UW-CNOF give rise to new challenges. First, Hi-C data are defined over pairs of genomic loci, rendering time series analyses based on, e.g., Markov chains inapplicable. Instead, the data are best understood under a projection into three-dimensional coordinates, with a hierarchical model that captures multiple levels of chromatin conformation. Second, the 4D nucleome includes two distinct notions of time: the relatively fast, cyclic time of the cell cycle, coupled with the slower, branching time of differentiation. Third, as we move from bulk Hi-C data to single-cell Hi-C data, potentially coupled with concurrent data measuring RNA expression and chromatin accessibility in the same cells, we must explicitly account for cell-to-cell variability while still retaining computational tractability and statistical power.

The project will produce two complementary software toolkits that directly address these challenges. The first toolkit (Aims 1 and 2) uses a hierarchical probabilistic mixture modeling approach to model 3D and 4D nucleome architecture, taking into account diploidy and cell-to-cell variability. In particular, we employ a cylindrical "pseudotime" projection that jointly models cell cycle and differentiation time scales.
The second toolkit (Aim 3) provides a general framework for relating Hi-C data or corresponding 3D or 4D models to more traditional genomic and epigenomic data sets, with particular emphasis on relating 4D nucleome data to gene regulation and replication timing. The proposed project builds upon the two investigators' expertise in 3D modeling of Hi-C data (Noble) and single-cell analyses (Trapnell). The software tools will be developed in close collaboration with other investigators in the UW-CNOF, helping to validate the novel assays developed in Project 1 and in turn being validated by the experiments described in Project 3 and applied to disease-relevant systems in Project 4. The software tools themselves will be made available under an open-source license and will be disseminated (Aim 4) via published articles and protocols, as well as through hands-on training activities.

Agency
National Institutes of Health (NIH)
Institute
National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK)
Type
Specialized Center--Cooperative Agreements (U54)
Project #
5U54DK107979-05
Application #
9782930
Study Section
Special Emphasis Panel (ZRG1)
Project Start
2019-08-01
Project End
2020-07-31
Budget Start
2019-08-01
Budget End
2020-07-31
Support Year
5
Fiscal Year
2019
Total Cost
Indirect Cost
Name
University of Washington
Department
Type
DUNS #
605799469
City
Seattle
State
WA
Country
United States
Zip Code
98195
Ma, Wenxiu; Bonora, Giancarlo; Berletch, Joel B et al. (2018) X-Chromosome Inactivation and Escape from X Inactivation in Mouse. Methods Mol Biol 1861:205-219
Ma, Wenxiu; Ay, Ferhat; Lee, Choli et al. (2018) Using DNase Hi-C techniques to map global and local three-dimensional genome architecture at high resolution. Methods 142:59-73
Wray-Dutra, Michelle N; Chawla, Raghav; Thomas, Kerri R et al. (2018) Activated CARD11 accelerates germinal center kinetics, promoting mTORC1 and terminal differentiation. J Exp Med 215:2445-2461
Bonora, G; Deng, X; Fang, H et al. (2018) Orientation-dependent Dxz4 contacts shape the 3D structure of the inactive X chromosome. Nat Commun 9:1445
Bonora, Giancarlo; Disteche, Christine M (2017) Structural aspects of the inactive X chromosome. Philos Trans R Soc Lond B Biol Sci 372:
Cao, Junyue; Packer, Jonathan S; Ramani, Vijay et al. (2017) Comprehensive single-cell transcriptional profiling of a multicellular organism. Science 357:661-667
Qiu, Xiaojie; Hill, Andrew; Packer, Jonathan et al. (2017) Single-cell mRNA quantification and differential analysis with Census. Nat Methods 14:309-315
Yardımcı, Galip Gürkan; Noble, William Stafford (2017) Software tools for visualizing Hi-C data. Genome Biol 18:26
Qiu, Xiaojie; Mao, Qi; Tang, Ying et al. (2017) Reversed graph embedding resolves complex single-cell trajectories. Nat Methods 14:979-982
Kim, Seungsoo; Liachko, Ivan; Brickner, Donna G et al. (2017) The dynamic three-dimensional organization of the diploid yeast genome. Elife 6:

Showing the most recent 10 out of 15 publications