This application requests support for the development of new software tools for analyzing metabolomics data. The proposed tools will analyze metabolomics data in the context of metabolic pathways obtained from the BioCyc database collection. A key motivation underlying our work is that it is advantageous to interpret metabolomics data in the context of existing metabolic knowledge, and that pathway databases that accurately map the metabolism of thousands of organisms are a logical source of pathway information. Known metabolic networks provide a valuable framework for interpreting metabolomics data because patterns in metabolomics data often result from patterns among the biochemical reactions that interconnect metabolites. However, the large size and high connectivity of organism metabolic networks, and the large number of path- ways within most organisms, present signi?cant cognitive barriers to this notion of using existing metabolic knowledge to analyze metabolomics data. A major motivation of this work is to lower these cognitive barri- ers by identifying subsets of the metabolic network for the user to focus their attention on. We will evaluate the following two proposed software tools by applying them to re-analyze existing metabolomics datasets previously generated by our group. Our ?rst aim is to develop software to compute minimal pathway covering sets for a set of metabolites, that is, a minimal number of metabolic pathways that contain the metabolites within a metabolomics dataset submitted by a user.
Our second aim i s to develop software to compute reaction spanning trees for metabolite sets to eluci- date mechanistic relationships among metabolites. As a companion to the preceding pathway-centric algorithm, this algorithm will consider the full metabolism of the organism as a graph, and will compute a minimal tree that spans the metabolites in a metabolomics dataset. Both the minimal pathway-covering sets and the reaction spanning trees will be displayed using existing pathway display algorithms in our Pathway Tools software. Fur- thermore, our new software will be integrated into Pathway Tools, which is a highly used software package in bioinformatics.

Public Health Relevance

This project proposes to create new software tools for the analysis of metabolomics datasets. Although scientists can now measure changes in the activities of hundreds of small molecules in living systems, it can be very chal- lenging to elucidate the mechanisms responsible for those changes. Our proposed software will aid scientists in explaining patterns in metabolomics datasets by applying existing knowledge of metabolic pathways.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Small Research Grants (R03)
Project #
1R03CA211814-01
Application #
9222825
Study Section
Special Emphasis Panel (ZRG1-BST-U (50)R)
Program Officer
Spalholz, Barbara A
Project Start
2016-09-14
Project End
2017-08-31
Budget Start
2016-09-14
Budget End
2017-08-31
Support Year
1
Fiscal Year
2016
Total Cost
$192,184
Indirect Cost
$92,184
Name
Sri International
Department
Type
DUNS #
009232752
City
Menlo Park
State
CA
Country
United States
Zip Code
94025