Our ability to study the microbiomes is enabled by the same technologies that allow us to quantify the host physiological state at greater depth and precision, including advanced high-throughput sequencing for genomics and transcriptomics; mass spectrometry for metabolomics, proteomics, and lipidomics; and flow-cytometry for characterization of circulating cell populations. The integration of host and microbiome panomic data is the roadmap for future biomedical discoveries. One example of such studies is the Integrative Human Microbiome Project (iHMP), which is currently generating panomic data on microbes and their host environment in three different diseases (diabetes, irritable bowel disease, pre-term delivery). Lack of appropriate analytics for these data and a steep curve for their validation and adoption is a major concern for the community. The main challenge of inference in panomic-scale microbiome datasets is overcoming the ?curses of dimensionality?. Local causal learning has proven useful for making discoveries with high-dimensional data, while distance-based learning is a promising paradigm for multivariate data analysis. We are proposing to combine these to develop the next generation of panomic data analytics and make these tools available directly to the biomedical investigators.
The aims of this project are: (1) Develop analytics for distance-based omnibus panomic integration; (2) Develop methodology for top-down distance-based sub-system interdependence learning. The overarching goal is to develop user-facing applications utilizing the methodologies in Aims 1 and 2 and apply those in several existing studies generating panomic data. The analytics, applications, and educational resources (case studies and tutorials) resulting from this project will enable the biomedical community to study panomic-scale datasets in a coherent and comprehensive way. The methods and tools resulting from this project will support new biomedical discoveries. ! !

Public Health Relevance

Current technology allows for comprehensive characterization of the host physiological state using high- throughput ?omic? techniques. The same techniques are used to study the microbiomes that have an impact on the human health in a broad range of clinical conditions including cancers, diabetes, irritable bowel disease, and other disease. We propose to develop novel informatics analysis methods to characterize the interactions of the host measurements with microbial data and their relationship to human diseases. !

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Research Project (R01)
Project #
5R01LM012517-03
Application #
9903452
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Ye, Jane
Project Start
2018-06-01
Project End
2022-04-30
Budget Start
2020-05-01
Budget End
2021-04-30
Support Year
3
Fiscal Year
2020
Total Cost
Indirect Cost
Name
Medical University of South Carolina
Department
Public Health & Prev Medicine
Type
Schools of Medicine
DUNS #
183710748
City
Charleston
State
SC
Country
United States
Zip Code
29407