The principal objective of this project, headed by Dr. Marc C. Nicklaus, Head, Computer-Aided Drug Design MiniCore Facility, is to make the information in the Open NCI Database available for aiding in drug development, both in-house and publicly. Both the data from NCI's Developmental Therapeutics Program (DTP) and additional information with which we have augmented the DTP datasets are used. Advanced processing is applied to the data, and powerful searching and display capabilities are being implemented. The NCI chemical structural database is a collection about half a million structures, accumulated in computer-readable form during the past 45 years in the course of NCI's screening of compounds for anti-cancer (and recently also anti-AIDS) activity. For typically 60% of these molecules, samples are available for, e.g., testing in assays. This sample collection is a valuable resource used in many of the LMCh's computer-aided drug development projects. Approximately half of the database is covered by confidentiality agreements with the samples' suppliers, whereas the other half (the """"""""Open NCI Database"""""""") is openly accessible, with the computer structures being made available by DTP as public domain data. We have subjected the Open NCI Database to various analyses that help to better understand its characteristics and put it in perspective of other large databases used in computer-aided drug design and chemical information sciences. Various clustering methods have been applied to it to elucidate its diversity, and the results have been compared with those for other databases. Internal duplication rates as well as mutual overlaps have been calculated for the entire set of databases including the Open NCI Database. The Open NCI Database has been converted into various formats, suitable for further processing including 3D pharmacophore searching. We have also implemented a powerful public search tool for the Open NCI Database with a web interface based on the chemical information toolkit CACTVS. Using just a web browser, the user is able to search about 250,000 structures for more than 600 criteria. We have greatly augmented the original DTP files with numerous additional data fields, be it calculated, predicted or hyperlinked information. These data have also been made available in directly downloadable format. Links to several additional services for further processing have been implemented. An online 3D pharmacophore capability has been built, a capability that is currently unique on the web, as far as we are aware of. Searchable predictions of more than 550 different biological activities, calculated by the program PASS for most of the quarter-million compounds, have been included in the web service. Current and future collaborations are intended to provide additional data and possibly structure sets. The URL is http://cactus.nci.nih.gov.

Agency
National Institute of Health (NIH)
Institute
Division of Basic Sciences - NCI (NCI)
Type
Intramural Research (Z01)
Project #
1Z01BC010389-02
Application #
6559253
Study Section
(LMC)
Project Start
Project End
Budget Start
Budget End
Support Year
2
Fiscal Year
2001
Total Cost
Indirect Cost
Name
Basic Sciences
Department
Type
DUNS #
City
State
Country
United States
Zip Code