One of the major challenges facing biomedical researchers is how to make effective use of the growing tsunami of genome-wide data that is becoming available as DNA sequencing costs plunge. One solution to this challenge is the development of robust databases that organize, integrate, curate and validate this data, while providing tools to search, visualize and explore it. dictyBase is the model organism database that uses the genome sequence of Dictyostelium discoideum to organize biological knowledge resulting from studies using Dictyostelium and related species. Investigators using Dictyostelium in bench research comprise a vibrant community of over 1500 researchers. Dictyostelium's position on the tree of life offers a unique evolutionary perspective that is of great value to evolutionary and computational biologists wishing to employ comparative genomics approaches. Dictyostelium has contributed to improving biological understanding of a variety of fundamental processes including cell migration, phagocytosis, cell-cell and intracellular signaling, cellular differentiation, self-nonself recognition and multicellular morphogenesis. Dictyostelium research has important medical relevance, having contributed, for example, to our understanding of mitochondrial-related diseases, host-pathogen interactions (especially for intracellular pathogens), the mode of action of mood-stabilizing drugs, and the pathways targeted by bisphosphonates. dictyBase has become the trusted resource for investigators seeking Dictyostelium genome information, annotations, and functional data, having been accessed over 6 million times by over 280,000 independent IP addresses. This application seeks continued funding for dictyBase to allow completion of the annotation of all gene models and integration of important new data types.
dictyBase seeks to provide innovative tools, strategies and high-quality annotations that enable bench researchers to benefit effectively from the rapidly increasing collection of available data.
Specific aims for the next funding period are to (1) annotate the D. discoideum genome and curate experimental results from the literature; (2) integrate and display novel large data sets, including genome sequences for strains and related species, RNAseq-based expression data, proteomics data and protein-protein interactions; and (3) maximize the utility of dictyBase to the biomedical research community. Successful completion of these aims will provide a critical resource for biomedical research and will maximize the investment of the NIH in research using Dictyostelium.

Public Health Relevance

dictyBase enables efficient biomedical research by organizing, integrating, and validating genome-wide data such as genome sequences, proteomics data, RNAseq transcriptomics data and functional data captured by automated and manual literature curation, and by making this data available in a readily searchable format. The work proposed in this application is focused on increasing the variety and extent of datasets that dictyBase integrates, improving the interfaces and searchability of the data, and making that data widely available to the biomedical research community.

Agency
National Institutes of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
2R01GM064426-09A1
Application #
8113573
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Maas, Stefan
Project Start
2002-08-01
Project End
2015-04-30
Budget Start
2011-05-01
Budget End
2012-04-30
Support Year
9
Fiscal Year
2011
Total Cost
$506,196
Indirect Cost
Name
Northwestern University at Chicago
Department
Genetics
Type
Schools of Medicine
DUNS #
005436803
City
Chicago
State
IL
Country
United States
Zip Code
60611
Basu, Siddhartha; Fey, Petra; Jimenez-Morales, David et al. (2015) dictyBase 2015: Expanding data and annotations in a new software environment. Genesis 53:523-534
Basu, Siddhartha; Fey, Petra; Pandit, Yogesh et al. (2013) DictyBase 2013: integrating multiple Dictyostelid species. Nucleic Acids Res 41:D676-83
Fey, Petra; Dodson, Robert J; Basu, Siddhartha et al. (2013) One stop shop for everything Dictyostelium: dictyBase and the Dicty Stock Center in 2012. Methods Mol Biol 983:59-92
Van Auken, Kimberly; Fey, Petra; Berardini, Tanya Z et al. (2012) Text mining in the biocuration workflow: applications for literature curation at WormBase, dictyBase and TAIR. Database (Oxford) 2012:bas040
Gaudet, Pascale; Bairoch, Amos; Field, Dawn et al. (2011) Towards BioDBcore: a community-defined information specification for biological databases. Database (Oxford) 2011:baq027
Gaudet, Pascale; Fey, Petra; Basu, Siddhartha et al. (2011) dictyBase update 2011: web 2.0 functionality and the initial steps towards a genome portal for the Amoebozoa. Nucleic Acids Res 39:D620-4
Sucgang, Richard; Kuo, Alan; Tian, Xiangjun et al. (2011) Comparative genomics of the social amoebae Dictyostelium discoideum and Dictyostelium purpureum. Genome Biol 12:R20
Gaudet, Pascale; Bairoch, Amos; Field, Dawn et al. (2011) Towards BioDBcore: a community-defined information specification for biological databases. Nucleic Acids Res 39:D7-10
Yu, Bing; Fey, Petra; Kestin-Pilcher, Karen E et al. (2011) Spliceosomal genes in the D. discoideum genome: a comparison with those in H. sapiens, D. melanogaster, A. thaliana and S. cerevisiae. Protein Cell 2:395-409
Reference Genome Group of the Gene Ontology Consortium (2009) The Gene Ontology's Reference Genome Project: a unified framework for functional annotation across species. PLoS Comput Biol 5:e1000431
