Genomic-based studies of disease now involve highly diverse types of data collected on large groups of patients. A major challenge facing scientists is how best to combine the data, extract important features, and comprehensively characterize the ways in which they affect an individual's disease course and/or likelihood of response to treatment. This project aims to develop statistical methods to address important problems that arise in genomic-based studies of disease. In particular, we propose methods to improve the power and accuracy of results obtained from genome-wide studies of gene expression. We also propose statistical methods that integrate data across multiple platforms and scales. These integrative methods enable powerful inference related to identifying and quantifying groups of features that change across biological conditions (e.g., healthy vs. disease), and they also allow for the identification of important collections of features that affect a patient's disease course and/or treatment response. Successful completion of the project will help to ensure that maximal utility is gained from the powerful genomic-based technologies that are now routinely used to gain insight into the genomic mechanisms underlying disease manifestation, progression, and maintenance.

Public Health Relevance

The development of statistically sound approaches to resolve the genomic basis of complex traits is vital to individualizing medicine and improving public health. Ideally, high-throughput genetic, genomic, and phenotypic measurements on diseased individuals would lead quickly to the identification of the salient features underlying their disease, along with a specification of how these features affect disease course. Many challenges in biostatistics must be overcome before this ideal is achieved. This proposal addresses some of those critical challenges.

National Institutes of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Study Section
Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer
Marcus, Stephen
University of Wisconsin Madison
Biostatistics & Other Math Sci
Schools of Medicine
United States
Korthauer, Keegan D; Chu, Li-Fang; Newton, Michael A et al. (2016) A statistical approach for identifying differential distributions in single-cell RNA-seq experiments. Genome Biol 17:222
Chu, Li-Fang; Leng, Ning; Zhang, Jue et al. (2016) Single-cell RNA-seq reveals novel regulators of human embryonic stem cell differentiation to definitive endoderm. Genome Biol 17:173
Tian, Jianan; Keller, Mark P; Broman, Aimee Teo et al. (2016) The Dissection of Expression Quantitative Trait Locus Hotspots. Genetics 202:1563-74
Bacher, Rhonda; Kendziorski, Christina (2016) Design and computational analysis of single-cell RNA-sequencing experiments. Genome Biol 17:63
Leng, Ning; Li, Yuan; McIntosh, Brian E et al. (2015) EBSeq-HMM: a Bayesian approach for identifying gene-expression changes in ordered RNA-seq experiments. Bioinformatics 31:2614-22
Tian, Jianan; Keller, Mark P; Oler, Angie T et al. (2015) Identification of the Bile Acid Transporter Slco1a6 as a Candidate Gene That Broadly Affects Gene Expression in Mouse Pancreatic Islets. Genetics 201:1253-62
Tran, Khoa A; Jackson, Steven A; Olufs, Zachariah P G et al. (2015) Collaborative rewiring of the pluripotency network by chromatin and signalling modulating pathways. Nat Commun 6:6188
Leng, Ning; Chu, Li-Fang; Barry, Chris et al. (2015) Oscope identifies oscillatory genes in unsynchronized single-cell RNA-seq experiments. Nat Methods 12:947-50
Korthauer, Keegan D; Kendziorski, Christina (2015) MADGiC: a model-based approach for identifying driver genes in cancer. Bioinformatics 31:1526-35
St John, Hillary C; Bishop, Kathleen A; Meyer, Mark B et al. (2014) The osteoblast to osteocyte transition: epigenetic changes and response to the vitamin D3 hormone. Mol Endocrinol 28:1150-65

Showing the most recent 10 out of 18 publications