The completion of the genome sequence for an organism such as a disease-causing bacterium marks the start of a phase of accelerated research that can now lead to the production of a quantitative metabolic flux model for the organism. Such metabolic flux models have diverse applications. They enable basic scientific understanding of biochemical pathways. They allow computational predictions of which genes and reactions within the metabolic network are essential, thus facilitating the design of drugs against disease-causing bacteria. They support the engineering of new metabolic pathways, such as for bioenergy research. The huge quantity of data and knowledge defined by the genome sequence, and by the large number of past and future experimental findings for the organism, requires using a database as a central repository for information about the genome, the biochemical network, and the regulatory processes of that organism. The Pathway Tools software is a robust and comprehensive system for constructing organism-specific databases, and for pathway analysis of genomes. Pathway Tools enables communities of scientists to create, query, visualize, analyze MODs, and to publish the MODs on the web. Pathway Tools supports construction of MODs that combine a large number of bioinformatics datatypes, including genome maps, genes, operons, RNAs, proteins, chemical compounds, biochemical reactions, metabolic pathways, and regulatory interactions. Pathway Tools is a mature and production grade software environment that has been used by 264 groups outside SRI (see Appendix) to analyze 1,969 genomes. We propose major improvements to the new Pathway Tools module for producing metabolic flux models for individual organisms and for microbial communities. This tool allows much faster creation of these complex models than does comparable tools. We propose to perform a major redesign of the Pathway Tools web interfaces. We propose to enhance the Pathway Tools web metabolic network diagram and regulatory network diagram, which provide interactive, web-based cellular network maps. We propose to create a change-reporting system for notifying scientists of updates to Pathway Tools databases. We propose to create a tool for inferring multimeric protein complexes in sequenced genomes, and to extend Pathway Tools with several sequence-related operations. We propose to provide support services for the large and growing user community for Pathway Tools, to maintain quality documentation for the software, and to create two thoroughly tested releases of the software per year.

Public Health Relevance

The proposed project will generate software tools that increase the ability of scientists to use genome sequence data to advance human health. This software enables creating central database repositories and websites con- training genome data and data on cellular networks for disease-causing bacteria, and for experimental model organisms such as the laboratory mouse. The software allows researchers to access the most up-to-date information about those organisms using intuitive querying tools, and provides scientific visualization capabilities that aid scientists in more quickly understanding large, complex collections of data, and applying that data to advance health.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM075742-10
Application #
8501523
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Sledjeski, Darren D
Project Start
2005-04-01
Project End
2016-05-31
Budget Start
2013-06-01
Budget End
2014-05-31
Support Year
10
Fiscal Year
2013
Total Cost
$1,014,111
Indirect Cost
$463,265
Name
Sri International
Department
Type
DUNS #
009232752
City
Menlo Park
State
CA
Country
United States
Zip Code
94025
Caspi, Ron; Billington, Richard; Fulcher, Carol A et al. (2018) The MetaCyc database of metabolic pathways and enzymes. Nucleic Acids Res 46:D633-D639
Karp, Peter D; Weaver, Daniel; Latendresse, Mario (2018) How accurate is automated gap filling of metabolic models? BMC Syst Biol 12:73
Latendresse, Mario; Karp, Peter D (2018) Evaluation of reaction gap-filling accuracy by randomization. BMC Bioinformatics 19:53
Karp, Peter D; Billington, Richard; Caspi, Ron et al. (2017) The BioCyc collection of microbial genomes and metabolic pathways. Brief Bioinform :
Paley, Suzanne; Karp, Peter D (2017) Update notifications for the BioCyc collection of databases. Database (Oxford) 2017:
Paley, Suzanne; Parker, Karen; Spaulding, Aaron et al. (2017) The Omics Dashboard for interactive exploration of gene-expression data. Nucleic Acids Res 45:12113-12124
Paley, Suzanne; Krummenacker, Markus; Karp, Peter D (2016) Representation and inference of cellular architecture for metabolic reconstruction and modeling. Bioinformatics 32:1074-9
Edison, Arthur S; Hall, Robert D; Junot, Christophe et al. (2016) The Time Is Right to Focus on Model Organism Metabolomes. Metabolites 6:
Caspi, Ron; Billington, Richard; Ferrer, Luciana et al. (2016) The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res 44:D471-80
Karp, Peter D; Latendresse, Mario; Paley, Suzanne M et al. (2016) Pathway Tools version 19.0 update: software for pathway/genome informatics and systems biology. Brief Bioinform 17:877-90

Showing the most recent 10 out of 34 publications