Genetic crosses in model organisms play an essential role in understanding the heritable architecture of medically relevant phenotypes. Traditionally, such crosses have tended to be on a small scale with either limited power to detect genetic effects or limited resolution to localize causal variants. Recently, however, the emergence of larger-scale interdisciplinary research, cheaper genotyping and parallel advances in human genetics, have spurred the development of more sophisticated and powerful experimental designs. Genetic Resource Populations (GRPs) use economies of scale to provide cost-effective and replicable platforms for genetic studies. This project concerns the largest, most ambitious GRP in mouse genetics to date, the Collaborative Cross (CC), and a series of crosses and designs related to or derived from it: the Diversity Outbred (DO) cross, the CC Recombinant Inbred Cross (CC-RIX) and the diallel. Experiments on each separate cross provide distinct information about the heritable architecture of a target complex disease. In combination, this Genetic Reference Super-Population (GRSP) potentially provides an unparalleled basis for cross-study replication and integration in mouse genetics. This project aims to develop statistical methods that advance the current state of complex trait analysis of these populations separately, and, by exploiting the unique structure that connects them, proposes to develop a statistical framework that allows for their joint use.
Aim 1 develops a Bayesian probabilistic framework for haplotype-based analysis of quantitative trait loci (QTL).
Aim 1 a develops a statistical software module for flexible haplotype-based analysis, which can be ex- tended by the researcher to model a rich variety of designs and disease types.
Aim 1 b will adapt machine learning techniques to provide posterior inference of the allelic series of a QTL.
Aim 1 c will incorporate Bayesian modeling of polygenic effects.
Aim 2 and 3 concern joint analysis, building on the foundation set by Aim 1.
Aim 2 develops methods to optimize experimental design of follow-up studies in one population given results from another.
Aim 2 a uses the diallel to inform design of CC/CC-RIX/DO experiments.
Aim 2 b uses partial data on CC/CC-RIX/DO to guide collection of additional data.
Aim 3 explores models for jointly analyzing multiple populations in the GRSP, using complementary datasets to stabilize analysis at single QTL (Aim 3a) and across multiple QTL (Aim 3b).
These aims address specific and persistent challenges in the cost-effective design and efficient analysis of multiparent genetic data, in particular the CC, DO, CC-RIX and diallel. The project will generate tools useful for a wide range of model organism crosses and can be applied to the genetic study of any complex disease.

Public Health Relevance

The proposed research will lead to improvements in the analysis and design of genetic studies on animal models of human disease. Because the project focuses on statistical methodology applied to experimental mouse populations, the scientific output of the project is expected to be applicable to basic research focusing on any medical condition that can be studied in the mouse.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Project #
Application #
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Krasnewich, Donna M
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of North Carolina Chapel Hill
Schools of Medicine
Chapel Hill
United States
Zip Code
Sabourin, Jeremy A; Valdar, William; Nobel, Andrew B (2015) A permutation approach for selecting the penalty parameter in penalized model selection. Biometrics 71:1185-94
Sabourin, Jeremy; Nobel, Andrew B; Valdar, William (2015) Fine-mapping additive and dominant SNP effects using group-LASSO and fractional resample model averaging. Genet Epidemiol 39:77-88
Phillippi, J; Xie, Y; Miller, D R et al. (2014) Using the emerging Collaborative Cross to probe the immune system. Genes Immun 15:38-46
Zhang, Zhaojun; Wang, Wei; Valdar, William (2014) Bayesian modeling of haplotype effects in multiparent populations. Genetics 198:139-56
Crowley, James J; Kim, Yunjung; Lenarcic, Alan B et al. (2014) Genetics of adverse reactions to haloperidol in a mouse diallel: a drug-placebo experiment and Bayesian causal analysis. Genetics 196:321-47
Calaway, John D; Lenarcic, Alan B; Didion, John P et al. (2013) Genetic architecture of skewed X inactivation in the laboratory mouse. PLoS Genet 9:e1003853