A crucial component to the recent major advances in genomic research has been the uniting of advances in biology with those in computers, informatics and networking. As sequencing throughput has increased, the technological burden has shifted increasingly to analysis and informatics. This project was established to ensure that necessary computational tools and resources are available to the NIH community. An integrated system for the storage, management, analysis and viewing of microArray data has been developed to support the NCI Advanced Technology Center microArray facility. The mAdb (microArray database) system provides a secure data management system for gathering, storing, and managing experimental information and expression array data. A variety of web accessible tools have been implemented to support the multiple analytical approaches needed to decipher array data in a more meaningful way. Important to the mAdb system design is compatibility with any platform (Unix, Windows or Macintosh) capable of running an Internet browser. We have taken an evolutionary developmental approach to designing and implementing the mAdb system, which provides for continuous evaluation, improvement, flexibility and quick turnaround. In addition, tools for mining UniGene for tissue-specific gene sets and that allow comparison of various microArray gene sets have been made available to the community. A natural extension of mAdb has been the inclusion of additional data resources. This includes supporting information from various data sources (e.g. Gene Ontology, GenBank, LocusLink, UniGene, KEGG Pathways, Biocarta Pathways and GeneCards) to enable drilling down into the rapidly expanding biological knowledgebase. In order to have effective use of the informational resource developed to support microArray analysis, ongoing user training and support is provided through CIT facilities for this collaborative effort. While ongoing development of new and improved analysis tools continues, the mAdb system is in routine service, supporting over 1500 NIH researchers and collaborators and containing over 82,000 microArray experiments. A critical design element for the mAdb system was to accommodate scalability to allow expansion to support other ICDs. The design allows us to support separate web servers serving different user communities from a single code base. The mAdb system has been set up on separate web servers to support users of the NIAID microArray core facility and the Lymphoma Leukemia Molecular Profiling Project (LLMPP)/Strategic Partnering to Evaluate Cancer Signature (SPECS) consortium. The LLMPP/SPECS project is using microArrays and other high throughput whole genome technologies to define the molecular profiles of all types of human lymphoid malignancies. One primary goal of this project is to redefine the classification of human lymphoid malignancies in molecular terms. A second major goal is to define molecular correlates of clinical parameters that can be used in prognosis and in the selection of appropriate therapy for these patients. As members of the international LLMPP/SPECS consortium, we provide the informatics development and support critical to the success of this project. A database and tools have been implemented to facilitate integrating and analyzing clinical parameters with genomic/genetic data from high throughput technologies. Data for over 2,300 clinical cases has been uploaded into the system. In a leadership role as part of the National Database for Autism Research project, we are responsible for overseeing the implementation for the Genetics/Genomics component.
Showing the most recent 10 out of 22 publications