Multiple lines of evidence point to a distinctly non-random organization of functional elements in the genome. It appears that the mouse genome is organized into domains-contiguous, possibly overlapping, segments of chromosomes containing functionally related groups of elements (including, but not limited to genes and regulatory elements) and that non-contiguous domains and isolated elements interact in networks. This project will develop the bioinformatics tools required for using multiple, diverse data sets in analyzing these relationships.
Aim 2 a. we will develop methods and software to identify and characterize domains, moving from analyses based on dynamic programming to more sophisticated methods based on Hidden Markov Models. We will include methods for combining multiple data sources to form composite domain structures.
Aim 2 b. we will generate the networks implied by disparate data, using a graph representation in which nodes are chromosome positions (or intervals) and edges imply an interaction (direct or indirect) between the nodes. As with domains, we will generate both data-specific and composite networks for analysis. We will develop methods and software to compare alternative networks, focusing specifically on methods to identify statistically significant common subnets.
Aim 2 c. we will provide our computational tools to the other projects in this proposal for the analysis of data generated by this grant (e.g., gene expression data from Project by Churchill, and recombination data from Project by Petkov, as well as overall integration in Project by Paigen), as well as data obtained from external sources (e.g., GO annotations or the KEGG database). We will work also with the Computational Core to define efficient and comprehensive databases and web interfaces for the data and analysis we produce.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Specialized Center (P50)
Project #
Application #
Study Section
Special Emphasis Panel (ZGM1)
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Jackson Laboratory
Bar Harbor
United States
Zip Code
Wang, Jeremy R; Holt, James; McMillan, Leonard et al. (2018) FMLRC: Hybrid long read error correction using an FM-index. BMC Bioinformatics 19:50
Ju, Chelsea J-T; Zhao, Zhuangtian; Wang, Wei (2017) Efficient Approach to Correct Read Alignment for Pseudogene Abundance Estimates. IEEE/ACM Trans Comput Biol Bioinform 14:522-533
Simecek, Petr; Forejt, Jiri; Williams, Robert W et al. (2017) High-Resolution Maps of Mouse Reference Populations. G3 (Bethesda) 7:3427-3434
Tyler, Anna L; Ji, Bo; Gatti, Daniel M et al. (2017) Epistatic Networks Jointly Influence Phenotypes Related to Metabolic Disease and Gene Expression in Diversity Outbred Mice. Genetics 206:621-639
Morgan, Andrew P; Gatti, Daniel M; Najarian, Maya L et al. (2017) Structural Variation Shapes the Landscape of Recombination in Mouse. Genetics 206:603-619
Parvanov, Emil D; Tian, Hui; Billings, Timothy et al. (2017) PRDM9 interactions with other proteins provide a link between recombination hotspots and the chromosomal axis in meiosis. Mol Biol Cell 28:488-499
Morgan, Andrew P; Didion, John P; Doran, Anthony G et al. (2016) Whole Genome Sequence of Two Wild-Derived Mus musculus domesticus Inbred Strains, LEWES/EiJ and ZALENDE/EiJ, with Different Diploid Numbers. G3 (Bethesda) 6:4211-4216
Chesler, Elissa J; Gatti, Daniel M; Morgan, Andrew P et al. (2016) Diversity Outbred Mice at 21: Maintaining Allelic Variation in the Face of Selection. G3 (Bethesda) 6:3893-3902
Gu, Tongjun; Gatti, Daniel M; Srivastava, Anuj et al. (2016) Genetic Architectures of Quantitative Variation in RNA Editing Pathways. Genetics 202:787-98
Korstanje, Ron; Deutsch, Konstantin; Bolanos-Palmieri, Patricia et al. (2016) Loss of Kynurenine 3-Mono-oxygenase Causes Proteinuria. J Am Soc Nephrol 27:3271-3277

Showing the most recent 10 out of 128 publications