The Candida Genome Database (CGD) is considered THE resource for comprehensive information about the human fungal pathogen Candida albicans and related Candida species, and is widely used by the Candida research community, who relies upon CGD in their everyday work. C. albicans is the third or fourth most common nosocomial bloodstream isolate; mortality rates are high (35% or greater) and treatment is costly. It is thus vital that there is a comprehensive and up-to-date resource for researchers investigating the biology and pathogenesis of C. albicans, as such a resource accelerates their research. The central challenge for any community database, in this era of high throughput technologies becoming widely available, is to turn the flood of data into knowledge, which the community can access, use and build upon. We firmly believe that a research community is best served by collection of all relevant data in a single location, followed by manual, expert curation of those data, coupled with tools to allow users to search and navigate the data in an intuitive fashion. Most of the data available in CGD are not available from any other site, and no other site performs curation of the C. albicans literature. We re-use software wherever possible, writing our own only when necessary. This philosophy has served us well, in that we have built CGD into an indispensible resource with modest staff, and we will continue to apply this model going forward. In this renewal for CGD we propose to build on our previous successes. We will expand the large-scale data types stored at CGD, curate the datasets, and provide new tools to analyze and visualize these data. We will also use these high-throughput data to improve the sequences and primary annotations for Candida genomes - reference genomes provide the fundamental platform upon which a community's research builds, and it is vital that they be correct and correctly annotated. We will perform real-time curation of the experimental literature, capturing gene names, mutant phenotypes, Gene Ontology Terms, etc., from papers as they are published, so that at a glance a bench biologist can find the salient, up-to-date information about any gene to which their research leads them. Finally, we will provide support to the Candida scientific community, ensuring that we are continuing to serve their needs as the indispensible resource that we have become. Together, successful completion of these aims will support and accelerate research into fungal pathogenesis, and thus have a positive impact on human health.

Public Health Relevance

C. albicans has become the third or fourth most common nosocomial bloodstream isolate; mortality rates are high (35% or greater) and treatment is costly. While antifungal compounds exist, these drugs are often of limited use because of their toxicity and side effects, and due to the emergence of antifungal resistance in the clinical setting. It is vital that C. albicans research continue as rapidly as possible - successful advance of this project, to provide a curated and comprehensive Candida albicans database, will continue to accelerate Candida research, and in doing so will aid in the fight against these fungal infections, significantly positively impacting human health.

Agency
National Institute of Health (NIH)
Institute
National Institute of Dental & Craniofacial Research (NIDCR)
Type
Research Project (R01)
Project #
5R01DE015873-13
Application #
9315796
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Lunsford, Dwayne
Project Start
2004-04-01
Project End
2020-07-31
Budget Start
2017-08-01
Budget End
2018-07-31
Support Year
13
Fiscal Year
2017
Total Cost
Indirect Cost
Name
Stanford University
Department
Genetics
Type
Schools of Medicine
DUNS #
009214214
City
Stanford
State
CA
Country
United States
Zip Code
94304
Ropars, Jeanne; Maufrais, Corinne; Diogo, Dorothée et al. (2018) Gene flow contributes to diversification of the major fungal pathogen Candida albicans. Nat Commun 9:2253
Skrzypek, Marek S; Binkley, Jonathan; Sherlock, Gavin (2018) Using the Candida Genome Database. Methods Mol Biol 1757:31-47
Skrzypek, Marek S; Binkley, Jonathan; Binkley, Gail et al. (2017) The Candida Genome Database (CGD): incorporation of Assembly 22, systematic identifiers and visualization of high throughput sequencing data. Nucleic Acids Res 45:D592-D596
Skrzypek, Marek S; Binkley, Jonathan; Sherlock, Gavin (2016) How to Use the Candida Genome Database. Methods Mol Biol 1356:3-15
Feri, Adeline; Loll-Krippleber, Raphaël; Commere, Pierre-Henri et al. (2016) Analysis of Repair Mechanisms following an Induced Double-Strand Break Uncovers Recessive Deleterious Alleles in the Candida albicans Diploid Genome. MBio 7:
Keseler, Ingrid M; Skrzypek, Marek; Weerasinghe, Deepika et al. (2014) Curation accuracy of model organism databases. Database (Oxford) 2014:
Binkley, Jonathan; Arnaud, Martha B; Inglis, Diane O et al. (2014) The Candida Genome Database: the new homology information page highlights protein similarity and phylogeny. Nucleic Acids Res 42:D711-6
Muzzey, Dale; Sherlock, Gavin; Weissman, Jonathan S (2014) Extensive and coordinated control of allele-specific expression by both transcription and translation in Candida albicans. Genome Res 24:963-73
Muzzey, Dale; Schwartz, Katja; Weissman, Jonathan S et al. (2013) Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure. Genome Biol 14:R97
Inglis, Diane O; Sherlock, Gavin (2013) Ras signaling gets fine-tuned: regulation of multiple pathogenic traits of Candida albicans. Eukaryot Cell 12:1316-25

Showing the most recent 10 out of 17 publications