ChemBench: the Integrated Web Portal to Accelerate Cheminformatics and Chemical Genomics Research. Project Summary/Abstract As a result of growing proliferation of high-throughput technologies in modern chemical and biological research, the experimental scientists generating large volumes of data are no longer equipped with adequate tools and approaches to manage, let alone analyze, their own data. Modern chemical and biological research requires sophisticated public-domain tools for computational and statistical analysis of large experimental datasets to create data models that can aid and prioritize further experiments. Cheminformatics has emerged in the last decade as a burgeoning research discipline combining computational, statistical, and informational methodologies with some of the key concepts in chemistry and biology. Modern cheminformatics is defined broadly as a chemocentric scientific discipline encompassing the creation, retrieval, management, visualization, modeling, discovery, and dissemination of chemical knowledge. Further progress of cheminformatics research is hampered by the lack of publicly available computational tools and software to analyze chemical genomics data. This project endeavors to fill this gap through an advanced cheminformatics web portal called Chembench. The current system has been built in our group in the last four years with limited initial support from the NIH RoadMap planning grant. This project will advance Chembench to become an easily extensible and maintainable system addressing the needs of both computational and experimental chemical genomics scientists by providing a freely available model building tool for cheminformaticians and freely available compound activity predicting tools for biologists and medicinal chemists. The computational workflow to incorporate these key procedures and afford their transparent use must be based on rigorous software design approaches and procedures that constitute the focus of Specific Aim 1 of this project. Building upon this rigorous computational infrastructure, the functional key elements of the workflow include modules for (i) chemical data preparation including curation, descriptor calculation, and chemical space visualization (Aim 2); (ii) QSAR model development and statistical validation (Aim 3); (iii) prediction of computational hits in external compound libraries (Aim 4); and (iv) analysis and exploration of computational hits (Aim 5). The portal is designed as an aggregate of modules that can be used as part of an automated integrated workflow or individually, depending on the research interests and specific expertise of users. Similar to the role that bioinformatics has played in transforming modern biomedical research; cheminformatics is poised to revolutionize all areas of research in chemistry and chemical biology. Chembench provides the experimental and computational probe/drug discovery researchers with knowledge discovery tools and infrastructure that enables them to explore and exploit chemical genomics databases, build rigorous data models, and reduce the experimental effort required to identify novel biologically active compounds. The publicly available Chembench portal will help translate large scale chemical genomics research into the discovery of new medicines to improve public health.

Public Health Relevance

This project intends to develop a publicly available cheminformatics ChemBench portal to provide the experimental and computational probe/drug discovery researchers with knowledge discovery tools and infrastructure to explore and exploit chemical genomics databases, build rigorous data models, and reduce the amount of experimental effort required to identify novel biologically active compounds such as chemical probes and drug candidates.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM096967-04
Application #
8923299
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Brazhnik, Paul
Project Start
2012-09-01
Project End
2016-08-31
Budget Start
2015-09-01
Budget End
2016-08-31
Support Year
4
Fiscal Year
2015
Total Cost
$288,800
Indirect Cost
$98,800
Name
University of North Carolina Chapel Hill
Department
Pharmacology
Type
Schools of Pharmacy
DUNS #
608195277
City
Chapel Hill
State
NC
Country
United States
Zip Code
27599
Capuzzi, Stephen J; Kim, Ian Sang-June; Lam, Wai In et al. (2017) Chembench: A Publicly Accessible, Integrated Cheminformatics Portal. J Chem Inf Model 57:105-108
Alves, Vinicius M; Capuzzi, Stephen J; Muratov, Eugene et al. (2016) QSAR models of human data can enrich or replace LLNA testing for human skin sensitization. Green Chem 18:6501-6515
Zakharov, Alexey V; Varlamova, Ekaterina V; Lagunin, Alexey A et al. (2016) QSAR Modeling and Prediction of Drug-Drug Interactions. Mol Pharm 13:545-56
Fourches, Denis; Muratov, Eugene; Tropsha, Alexander (2016) Trust, but Verify II: A Practical Guide to Chemogenomics Data Curation. J Chem Inf Model 56:1243-52
Bulusu, Krishna C; Guha, Rajarshi; Mason, Daniel J et al. (2016) Modelling of compound combination effects and applications to efficacy and toxicity: state-of-the-art, challenges and perspectives. Drug Discov Today 21:225-38
Melo-Filho, Cleber C; Dantas, Rafael F; Braga, Rodolpho C et al. (2016) QSAR-Driven Discovery of Novel Chemical Scaffolds Active against Schistosoma mansoni. J Chem Inf Model 56:1357-72
Neves, Bruno J; Dantas, Rafael F; Senger, Mario R et al. (2016) Discovery of New Anti-Schistosomal Hits by Integration of QSAR-Based Virtual Screening and High Content Screening. J Med Chem 59:7075-88
Alves, Vinicius; Muratov, Eugene; Capuzzi, Stephen et al. (2016) Alarms about structural alerts. Green Chem 18:4348-4360
Neves, Bruno Junior; Muratov, Eugene; Machado, Renato Beilner et al. (2016) Modern approaches to accelerate discovery of new antischistosomal drugs. Expert Opin Drug Discov 11:557-67
Dekina, Svetlana; Romanovska, Irina; Ovsepyan, Ani et al. (2016) Gelatin/carboxymethyl cellulose mucoadhesive films with lysozyme: Development and characterization. Carbohydr Polym 147:208-215

Showing the most recent 10 out of 28 publications