It is widely recognized that the three dimensional (3D) architecture of eukaryotic chromatin plays critical roles in nuclear and cellular function. Indeed, such relationships constitute one of the key threads of the just published (09/06/12) suite of Nature papers from the ENCODE consortium. However, until a few years ago, observing / inferring, 3D structure at even modest resolutions was problematic, in part because genomes are highly condensed. Recently devised molecular techniques are changing this situation. In particular, the development of novel genome-scale assays, such as chromatin conformation capture (CCC), has enabled elicitation of chromatin contacts. These techniques have already provided insight into chromatin organization at unprecedented resolutions, and permitted exploration of the downstream influence of such organization on a variety of biological. Notably, gene regulation and cancer-driving gene fusions are believed to be strongly influenced by 3D organization. Accordingly, obtaining high resolution 3D reconstructions of genome architecture is a compelling biological quest. However, most analysis of CCC data has focused on the one dimensional (1D) contact level, with appreciably less effort directed toward evaluating accuracy and reproducibility of 3D reconstructions, and deploying such structures to analyze consequent biological processes. The overarching hypothesis to be addressed in this proposal is that chromatin contact data can be reliably used to determine 3D structures of genomes and to assess downstream relationships with biological function. To test this hypothesis we will develop new, and refine existing, reconstruction algorithms, and will undertake a systematic evaluation of their performance and operating characteristics. The only genome-scale algorithms developed employ constrained optimization which, on account of high-dimensionality, is computationally burdensome and can be trapped in local optima. Accordingly, we will investigate alternate algorithms. Our preliminary work indicates that reproducibility under plausible perturbations to data and constraint inputs is qualitatively poor. We will devise metrics to quantify 3D agreement. While reproducibility under perturbations can be assessed using computational and statistical tools, appraising accuracy requires using targeted confirmatory assays, as will be deployed by our wet-lab collaborators. Further, we will apply our compendium of 3D reconstructions to explore a series of biologic questions relating to proximities of functional elements.
3D chromatin positioning plays a prominent role in chromosomal aberrations, with several known cancer-related gene fusions and 20% of human cancer morbidity being driven by proximity. Given that gene fusions occur in all malignancies, these likely significant underestimates result in part from lack of 3D reconstructions. Improvements in understanding the extent of, and mechanisms inducing, such aberrations will flow from expanded, reliable 3D reconstructions, and have a profound scientific impact.
Segal, Mark R; Bengtsson, Henrik L (2018) Improved accuracy assessment for 3D genome reconstructions. BMC Bioinformatics 19:196 |
Stolz, Robert; Yoshida, Masaaki; Brasher, Reuben et al. (2017) Pathways of DNA unlinking: A story of stepwise simplification. Sci Rep 7:12420 |
Lee, Cheng-Sheng; Wang, Ruoxi W; Chang, Hsiao-Han et al. (2016) Chromosome position determines the success of double-strand break repair. Proc Natl Acad Sci U S A 113:E146-54 |
Capurso, Daniel; Bengtsson, Henrik; Segal, Mark R (2016) Discovering hotspots in functional genomic data superposed on 3D chromatin configuration reconstructions. Nucleic Acids Res 44:2028-35 |
Diao, Yuanan; Rodriguez, Victor; Klingbeil, Michele et al. (2015) Orientation of DNA Minicircles Balances Density and Topological Complexity in Kinetoplast DNA. PLoS One 10:e0130998 |
Segal, Mark R; Bengtsson, Henrik L (2015) Reconstruction of 3D genome architecture via a two-stage algorithm. BMC Bioinformatics 16:373 |
Arsuaga, Javier; Jayasinghe, Reyka G; Scharein, Robert G et al. (2015) Current theoretical models fail to predict the topological complexity of the human genome. Front Mol Biosci 2:48 |
Segal, Mark R; Xiong, Hao; Capurso, Daniel et al. (2014) Reproducibility of 3D chromatin configuration reconstructions. Biostatistics 15:442-56 |
Capurso, Daniel; Segal, Mark R (2014) Distance-based assessment of the localization of functional annotations in 3D genome reconstructions. BMC Genomics 15:992 |