A crucial component of recent major advances in genomic research has been the union of advances in biology with those in computing, informatics, and networking. As technologies have advanced to allow high-throughput, genomics-scale data collection, the technological burden has shifted to analysis and informatics. This project was established to ensure that the necessary computational tools and resources are available to the NIH intramural community.

OIR's long-term collaboration with Dr. Louis Staudt (Distinguished Investigator, NCI Center for Cancer Research and Director, NCI Center for Cancer Genomics) has yielded significant findings and discoveries that have led to improvements in the treatment of lymphoma. With comprehensive computational expertise, resources, and support from OIR, Dr. Staudt's lab has been able to perform sophisticated analyses on large-scale, high-dimensional data, which have in turn been instrumental in achieving a number of highly significant findings. In 2019, OIR made significant contributions to the identification of genetic subtypes of Diffuse Large B-cell Lymphoma (DLBCL) based on patterns of occurrence of mutations and other genetic aberrations. OIR contributed to the development, validation, and Web deployment of the novel LymphGen system, a probabilistic classification tool that predicts genetic subtypes of DLBCL.

OIR provides comprehensive computational support to Dr. Staudt's laboratory. This support entails maintaining databases of genomic data, providing computational servers with custom software for running a variety of analyses, and developing and maintaining public and local-access Web sites. These supported resources include the following:

- LLMPP/SPECS: The Lymphoma/Leukemia Molecular Profiling Project/Strategic Partnering to Evaluate Cancer Signature (SPECS) program is a multi-institution grant for translational cancer research funded by the National Cancer Institute. This Web site is designed for entering and managing clinical data for cases associated with samples included in the SPECS study. The LLMPP/SPECS project is using microarrays and other high-throughput whole-genome technologies to define the molecular profiles of all types of human lymphoid malignancies. One primary goal of this project is to redefine the classification of human lymphoid malignancies in molecular terms. A second major goal is to define molecular correlates of clinical parameters that can be used in prognosis and in the selection of appropriate therapy for these patients. As members of the international LLMPP/SPECS consortium, we provide the informatics development and support critical to the success of this project. A database and tools have been implemented to facilitate integrating and analyzing clinical parameters with genomic/genetic data from high-throughput technologies. The consortium involves 12 participating centers in 7 countries. Data for 3,000 clinical cases have been uploaded into the system. In 2019, all personally identifiable information (PII) was removed from the SPECS database.
- LYMPHCX: A Web site that allows researchers to predict DLBCL subtypes based on samples processed with a NanoString protocol. Determining these subtypes can be critical in deciding on appropriate therapy, since some subtypes are more aggressive than others.
- LymphoDB: An interactive Web site and database for researchers to search and compare over 1.5 million lymphoma mutations that have been reported in 57 prominent publications. All mutations have been validated and stored along with relevant annotations and metrics to enable comparative quantitative analyses.
- Signature database: A Web site companion to Shaffer AL et al. A library of gene expression signatures to illuminate normal and pathological lymphoid biology, Immunol Rev. 2006 Apr;210:67-85.
- Staudt lab analytical test bed: A Web site that supports quick turnaround when testing analytical methods and lets lab members readily explore their own data with new algorithms.
- Database support: OIR maintains information on more than 10 million mutations across over 3,000 clinical samples. Information on digital expression is also stored.

The mAdb (microArray database, https://madb.nci.nih.gov) system provides a secure data management system for gathering, storing, and managing experimental information and expression array data. A variety of Web-accessible tools have been implemented to support the multiple analytical approaches needed to interpret array data. Important to the mAdb system design is compatibility with any platform (Unix, Windows, or Macintosh) capable of running an Internet browser. In 2019, mAdb was decommissioned.

In collaboration with Dr. Timothy Myers of NIAID, OIR also provides comprehensive computational support to the Genomic Technologies Section (GTS) of NIAID. Because GTS provides state-of-the-art bioinformatics support to the entire NIAID intramural research program, we effectively support all users of the GTS facility. In addition to maintaining GTS computational servers and databases, OIR maintains a number of commercial software packages for GTS, including CLC-Genomics and SAS Visual Analyzer.

Agency: National Institutes of Health (NIH)
Institute: Center for Information Technology (CIT)
Type: Scientific Computing Intramural Research (ZIH)
Project #: 1ZIHCT000260-24
Application #: 10016052
Support Year: 24
Fiscal Year: 2019
Name: Center for Information Technology
Schmitz, Roland; Wright, George W; Huang, Da Wei et al. (2018) Genetics and Pathogenesis of Diffuse Large B-Cell Lymphoma. N Engl J Med 378:1396-1407
Liang, Ma; Raley, Castle; Zheng, Xin et al. (2016) Distinguishing highly similar gene isoforms with a clustering-based bioinformatics analysis of PacBio single-molecule long reads. BioData Min 9:13