Experimental model organisms such as mice, rats, and Drosophila provide valuable insights into human diseases. This is because we can manipulate genetic and environmental factors in experimental populations. Biological constraints limit our ability to manipulate genetic systems, and infer causation (relative to industrial experimental settings). Our proposal addresses unique challenges due to these constraints in experimental genome populations used to study human diseases. We will develop statistical design and analysis methods that will reduce experimental cost, make efficient use of existing resources, and better infer causation when we have incomplete control over the assignment of genetic factors to individual organisms. We focus on three types of collections of genetic perturbations: (1) experimental crosses (derived from crossing two or more strains such as backcross, intercross, and recombinant inbred line populations), (2) introgression line populations (such as consomic strains, and genome-tagged mice), and (3) non- randomized genome populations (such as inbred strain collections).
Our specific aims are: (1) To develop efficient and robust genotyping strategies in experimental crosses (2) To develop statistical design and analysis tools for introgression line populations. (3) To develop statistical methods for detecting causal elements in genetic association studies in experimental organisms. Relevance: Scientists discover genetic elements contributing to human disease susceptibility by studying experimentally-constructed animal populations (such as mice). We will develop statistical methods to construct and use these populations efficiently. Our methods will reduce experimental cost, and improve detection of causal genetic elements contributing to disease susceptibility.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-GGG-A (53))
Program Officer
Krasnewich, Donna M
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California San Francisco
Public Health & Prev Medicine
Schools of Medicine
San Francisco
United States
Zip Code
Segal, Mark R; Xiong, Hao; Bengtsson, Henrik et al. (2012) Querying genomic databases: refining the connectivity map. Stat Appl Genet Mol Biol 11:
King, Elizabeth G; Merkes, Chris M; McNeil, Casey L et al. (2012) Genetic dissection of a model complex trait using the Drosophila Synthetic Population Resource. Genome Res 22:1558-66
Xiong, Hao; Capurso, Daniel; Sen, Saunak et al. (2011) Sequence-based classification using discriminatory motif feature selection. PLoS One 6:e27382
Nichols, Robert J; Sen, Saunak; Choo, Yoe Jin et al. (2011) Phenotypic landscape of a bacterial cell. Cell 144:143-56
Xiong, Hao; Goulding, Evan H; Carlson, Elaine J et al. (2011) A flexible estimating equations approach for mapping function-valued traits. Genetics 189:305-16
Shah, Sohela; Sanford, Ukina R; Vargas, Julie C et al. (2010) Strain background modifies phenotypes in the ATP8B1-deficient mouse. PLoS One 5:e8984
Manichaikul, Ani; Moon, Jee Young; Sen, Saunak et al. (2009) A model selection approach for the identification of quantitative trait loci in experimental crosses, allowing epistasis. Genetics 181:1077-86
Sen, Saunak; Johannes, Frank; Broman, Karl W (2009) Selective genotyping and phenotyping strategies in a complex trait context. Genetics 181:1613-26