Investigating the microbial basis of early childhood caries via metagenomics and metatranscriptomics analyses Abstract The increasing availability and scale of omics data have revolutionized our ability to understand complex biological processes underlying health and disease. Such biologically-informed insights are aligned with the notion of precision medicine and have the potential to improve diagnoses, prevention and treatment. In the oral health domain, multiple omics data layers (e.g., genomics, metagenomics, transcriptomics, metabolomics), intended to capture aspects of otherwise unobservable biology, are increasingly being collected in oral health studies. However, methods for powerful and informative integration of information gained from these multiple data layers remains elusive. The focus of this proposal, early childhood caries (ECC), is the most common chronic childhood disease. ECC is defined as dental decay among children under the age of 6? it persists as a clinical and dental public health problem, and confers substantial and multi-level human and economic impacts. The advent of precision oral health care, based upon a new, microbially-informed understanding of ECC, is expected to shed light onto mechanistic aspects of the disease processes and reveal new ways to prevent it. To this end, we will analyze existing clinical (i.e., ECC case status) and matched metagenomics (whole genome sequencing shotgun; WGS) and metatranscriptomics (RNA-seq.) data from supragingival plaque samples of 170 preschool-age children, mainly ages 3 and 4, enrolled in a community-based oral health study in NC. The goal of the proposed study is to identify ECC-associated bacteria, bacterial genes and pathways via metagenomics and metatranscriptomics analyses, conducted separately and jointly. Aside from the unique characteristics (e.g., matched WGS and RNA-seq. data from the same biofilm sample in each participant), quality and size of the dataset, the proposal's novelty is amplified by the testing, development and dissemination of appropriate statistical methods and optimized analytical pipelines. Seven models will be evaluated via rigorous simulations, accounting for the handling of over-dispersion, zero-inflation, more than 2 phenotype groups and batch effects, and will be optimized prior to the real study data application. Upon completion, we anticipate that the study will provide novel insights into the microbial basis of ECC. The integrative data analysis framework will offer opportunities to accommodate additional metabolomics data as they become available, to further increase the potential for mechanistic insights.

Public Health Relevance

Microbiome imbalance, being related with many dental diseases, can be studied using high-throughput DNA and RNA sequencing methods. ZOE study for early childhood caries (ECC), as the largest cohort with matched DNA and mRNA sequencing data from 170 children age 2-5, is to understand the association among microbial species, genes and pathways and ECC. We propose to separately and jointly analyze the matched sparse compositional microbiome data to identify significant microbial composition and function enriched in the ECC group.

Agency
National Institute of Health (NIH)
Institute
National Institute of Dental & Craniofacial Research (NIDCR)
Type
Small Research Grants (R03)
Project #
1R03DE028983-01
Application #
9809425
Study Section
Special Emphasis Panel (ZDE1)
Program Officer
Khatipov, Emir A
Project Start
2019-08-01
Project End
2021-07-31
Budget Start
2019-08-01
Budget End
2020-07-31
Support Year
1
Fiscal Year
2019
Total Cost
Indirect Cost
Name
University of North Carolina Chapel Hill
Department
Dentistry
Type
Schools of Dentistry/Oral Hygn
DUNS #
608195277
City
Chapel Hill
State
NC
Country
United States
Zip Code
27599