The purpose of this Exploratory Center for Cheminformatics Research (ECCR) P20 planning grant is to develop a mechanism for bringing together and stimulating collaborative pilot projects among a constantly-evolving nucleus of experts in Cheminformatics-related fields ranging from methods of encoding and capturing molecular information, to machine learning and data mining techniques, to predictive model development, validation, interpretation and utilization. In addition to these research efforts, the Center will bring together a set of domain specialists and application scientists who will serve as both data generators and end users of the knowledge provided by the molecular property models and modeling methods developed during the course of the grant. This group will also test the new Cheminformatics software that will constitute a tangible, deliverable product from this work. Ten application project modules that exemplify possible interactions between various groups and areas of expertise within the Center are presented as part of this proposal. The unifying vision behind the proposed Center is that much of what is done in each of the subdisciplines represented here can be expressed in a Cheminformatics context: The many diverse project areas can be grouped into one or more overlapping categories: """"""""Data Generators"""""""" (those who use either theoretical or experimental methods for creating or extracting knowledge), """"""""Machine Learning and Datamining"""""""" groups (who perform model validation, feature selection, pattern recognition, generation of potentials of mean force and knowledge-based potential work), as well as """"""""Property-Prediction"""""""" groups (who perform chemically-aware model building, molecular property descriptor generation, Quantitative Structure-Property Relationship modeling, validation, and interpretation), and """"""""Application"""""""" groups who utilize the information made available using the new tools and methods that are developed as part of the Center. It is our strong belief that these areas of expertise can be brought together within this Planning Grant proposal to generate something larger than the sum of the parts. The Exploratory Center will seed new interdisciplinary projects and train graduate students in these areas. Relevance: Advances in the generation, mining and analysis of chemical information is crucial to the development of new drug therapies, and to modern methods of bioinformatics and molecular medicine.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Exploratory Grants (P20)
Project #
5P20HG003899-02
Application #
7125575
Study Section
Special Emphasis Panel (ZHG1-HGR-N (O))
Program Officer
Ajay, Ajay
Project Start
2005-09-23
Project End
2009-07-31
Budget Start
2006-08-01
Budget End
2009-07-31
Support Year
2
Fiscal Year
2006
Total Cost
$377,226
Indirect Cost
Name
Rensselaer Polytechnic Institute
Department
Type
Organized Research Units
DUNS #
002430742
City
Troy
State
NY
Country
United States
Zip Code
12180
Zaretzki, Jed; Bergeron, Charles; Huang, Tao-wei et al. (2013) RS-WebPredictor: a server for predicting CYP-mediated sites of metabolism on drug-like molecules. Bioinformatics 29:497-8
Zaretzki, Jed; Rydberg, Patrik; Bergeron, Charles et al. (2012) RS-Predictor models augmented with SMARTCyp reactivities: robust metabolic regioselectivity predictions for nine CYP isozymes. J Chem Inf Model 52:1637-59
McLellan, Margaret R; Ryan, M Dominic; Breneman, Curt M (2011) Rank order entropy: why one metric is not enough. J Chem Inf Model 51:2302-19
Zaretzki, Jed; Bergeron, Charles; Rydberg, Patrik et al. (2011) RS-predictor: a new tool for predicting sites of cytochrome P450-mediated metabolism applied to CYP 3A4. J Chem Inf Model 51:1667-89
Sgourakis, Nikolaos G; Merced-Serrano, Myrna; Boutsidis, Christos et al. (2011) Atomic-level characterization of the ensemble of the A?(1-42) monomer in water using unbiased molecular dynamics simulations and spectral algorithms. J Mol Biol 405:570-83
Das, Sourav; Krein, Michael P; Breneman, Curt M (2010) PESDserv: a server for high-throughput comparison of protein binding site surfaces. Bioinformatics 26:1913-4
Das, Sourav; Krein, Michael P; Breneman, Curt M (2010) Binding affinity prediction with property-encoded shape distribution signatures. J Chem Inf Model 50:298-308
Sgourakis, Nikolaos G; Garcia, Angel E (2010) The membrane complex between transducin and dark-state rhodopsin exhibits large-amplitude interface dynamics on the sub-microsecond timescale: insights from all-atom MD simulations. J Mol Biol 398:161-73
Das, Sourav; Kokardekar, Arshad; Breneman, Curt M (2009) Rapid comparison of protein binding site surfaces with property encoded shape distributions. J Chem Inf Model 49:2863-72
Sgourakis, Nikolaos G; Yan, Yilin; McCallum, Scott A et al. (2007) The Alzheimer's peptides Abeta40 and 42 adopt distinct conformations in water: a combined MD / NMR study. J Mol Biol 368:1448-57