We develop novel statistical techniques for nonparametric Bayes analysis of high-dimensional covariate data, directly motivated by the largest population-based study ever conducted on the causes of birth defects. The methods we develop will enable borrowing of information and shrinkage across high-dimensional environmental, biomedical, pharmacological, and sociodemographic risk factors (and interactions among them) and across a multitude of birth defects, many of which are too rare to be studied in isolation. Using a hierarchical structure directly motivated by embryonic development, the borrowing of information can be informed by our knowledge of mechanistic development of the embryo. These novel methods may significantly impact the study of rare congenital malformations. The methods to be developed have broad application in public health and medicine, where exposures or characteristics of interest may be great in number and interactions are important, such as the examination high-dimensional gene by environment and gene-gene interactions.

Public Health Relevance

This project addresses a critical need of finding clues to the etiology and pathogenesis of congenital mal- formations, using data from the largest population-based study ever conducted on the causes of birth defects. While birth defects are the leading cause of infant mortality, the leading cause of death among children aged 1-4, and the fifth-ranked cause of premature mortality in the United States, many individual defects are too rare to be studied comprehensively, even in studies that are very large. Our new statistical methods for sparse shrinkage incorporate current knowledge of embryonic development and allow some borrowing of information across differ- ent birth defects while keeping each defect as a separate entity of interest in the statistical model. These novel methods will allow investigators to investigate the simultaneous influence of multiple exposures and combinations of exposures on multiple outcomes.

National Institute of Health (NIH)
Research Project (R01)
Project #
Application #
Study Section
Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer
Dilworth, Caroline H
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of North Carolina Chapel Hill
Biostatistics & Other Math Sci
Schools of Public Health
Chapel Hill
United States
Zip Code
Long, D Leann; Preisser, John S; Herring, Amy H et al. (2014) A marginalized zero-inflated Poisson regression model with overall exposure effects. Stat Med 33:5151-65
Wheeler, Matthew W; Dunson, David B; Pandalai, Sudha P et al. (2014) Mechanistic Hierarchical Gaussian Processes. J Am Stat Assoc 109:894-904
Tabler, Jacqueline M; Barrell, William B; Szabo-Rogers, Heather L et al. (2013) Fuz mutant mice reveal shared mechanisms between ciliopathies and FGF-related syndromes. Dev Cell 25:623-35