A crucial component to the recent major advances in genomic research has been the uniting of advances in biology with those in computers, informatics and networking. As sequencing throughput has increased, the technological burden has shifted increasingly to analysis and informatics. This project was established to ensure that necessary computational tools and resources are available to the NIH community. An integrated system for the storage, management, analysis and viewing of microArray data has been developed to support the NCI Advanced Technology Center microArray facility. The mAdb (microArray database) system provides a secure data management system for gathering, storing, and managing experimental information and expression array data. A variety of web accessible tools have been implemented to support the multiple analytical approaches needed to decipher array data in a more meaningful way. Important to the mAdb system design is compatibility with any platform (Unix, Windows or Macintosh) capable of running an Internet browser. We have taken an evolutionary developmental approach to designing and implementing the mAdb system, which provides for continuous evaluation, improvement, flexibility and quick turnaround. In addition, tools for mining UniGene for tissue-specific gene sets and that allow comparison of various microArray gene sets have been made available to the community. A natural extension of mAdb has been the inclusion of additional data resources. This includes supporting information from various data sources (e.g. Gene Ontology, GenBank, LocusLink, UniGene, KEGG Pathways, Biocarta Pathways and GeneCards) to enable drilling down into the rapidly expanding biological knowledgebase. In order to have effective use of the informational resource developed to support microArray analysis, ongoing user training and support is provided through CIT facilities for this collaborative effort. While ongoing development of new and improved analysis tools continues, the mAdb system is in routine service, supporting over 1500 NIH researchers and collaborators and containing over 82,000 microArray experiments. A critical design element for the mAdb system was to accommodate scalability to allow expansion to support other ICDs. The design allows us to support separate web servers serving different user communities from a single code base. The mAdb system has been set up on separate web servers to support users of the NIAID microArray core facility and the Lymphoma Leukemia Molecular Profiling Project (LLMPP)/Strategic Partnering to Evaluate Cancer Signature (SPECS) consortium. The LLMPP/SPECS project is using microArrays and other high throughput whole genome technologies to define the molecular profiles of all types of human lymphoid malignancies. One primary goal of this project is to redefine the classification of human lymphoid malignancies in molecular terms. A second major goal is to define molecular correlates of clinical parameters that can be used in prognosis and in the selection of appropriate therapy for these patients. As members of the international LLMPP/SPECS consortium, we provide the informatics development and support critical to the success of this project. A database and tools have been implemented to facilitate integrating and analyzing clinical parameters with genomic/genetic data from high throughput technologies. Data for over 2,300 clinical cases has been uploaded into the system. In a leadership role as part of the National Database for Autism Research project, we are responsible for overseeing the implementation for the Genetics/Genomics component.

Agency
National Institute of Health (NIH)
Institute
Center for Information Technology (CIT)
Type
Intramural Research (Z01)
Project #
1Z01CT000260-13
Application #
7733760
Study Section
Project Start
Project End
Budget Start
Budget End
Support Year
13
Fiscal Year
2008
Total Cost
$646,228
Indirect Cost
Name
Center for Information Technology
Department
Type
DUNS #
City
State
Country
United States
Zip Code
Shaffer, Arthur L; Emre, N C Tolga; Lamy, Laurence et al. (2008) IRF4 addiction in multiple myeloma. Nature 454:226-31
Lenz, Georg; Wright, George W; Emre, N C Tolga et al. (2008) Molecular subtypes of diffuse large B-cell lymphoma arise by distinct genetic pathways. Proc Natl Acad Sci U S A 105:13520-5
Annunziata, Christina M; Davis, R Eric; Demchenko, Yulia et al. (2007) Frequent engagement of the classical and alternative NF-kappaB pathways by diverse genetic abnormalities in multiple myeloma. Cancer Cell 12:115-30
Ngo, Vu N; Davis, R Eric; Lamy, Laurence et al. (2006) A loss-of-function RNA interference screen for molecular targets in cancer. Nature 441:106-10
Shaffer, Arthur L; Wright, George; Yang, Liming et al. (2006) A library of gene expression signatures to illuminate normal and pathological lymphoid biology. Immunol Rev 210:67-85
Riss, Joseph; Khanna, Chand; Koo, Seongjoon et al. (2006) Cancers as wounds that do not heal: differences and similarities between renal regeneration/repair and renal cell carcinoma. Cancer Res 66:7216-24
Dave, Sandeep S; Fu, Kai; Wright, George W et al. (2006) Molecular diagnosis of Burkitt's lymphoma. N Engl J Med 354:2431-42
Kovanen, Panu E; Young, Lynn; Al-Shami, Amin et al. (2005) Global analysis of IL-2 target genes: identification of chromosomal clusters of expressed genes. Int Immunol 17:1009-21
Bea, Silvia; Zettl, Andreas; Wright, George et al. (2005) Diffuse large B-cell lymphoma subgroups have distinct genetic profiles that influence tumor biology and improve gene-expression-based survival prediction. Blood 106:3183-90
Dave, Sandeep S; Wright, George; Tan, Bruce et al. (2004) Prediction of survival in follicular lymphoma based on molecular features of tumor-infiltrating immune cells. N Engl J Med 351:2159-69

Showing the most recent 10 out of 22 publications