Computational techniques that build models to correctly assign chemical compounds to various classes of interests have extensive applications in pharmaceutical research and are used extensively at various phases during the drug development process. These techniques are used to solve a number of classification problems such as predicting whether or not a chemical compound has the desired biological activity, is toxic or non-toxic, and filtering out drug-like compounds from large compound libraries. The overall goal of this proposal is to develop substructure-based classification algorithms for chemical compound datasets. The key elements of these algorithms are that they (i) utilize highly efficient substructure discovery algorithms to mine the chemical compounds and discover all substructures that can be critical for the classification task, (ii) use multiple criteria to generate a set of substructure-based features that simultaneously simplify the compounds' representation while retaining and exposing the features that are responsible for the specific classification problem, and (iii) build predictive models by employing kernel-based methods that take into account the relationships between these substructures at different levels of granularity and complexity, as well as information provided by traditional descriptors.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Research Project (R01)
Project #
1R01LM008713-01A1
Application #
6965348
Study Section
Special Emphasis Panel (ZRG1-BDMA (01))
Program Officer
Sim, Hua-Chuan
Project Start
2005-09-30
Project End
2009-09-29
Budget Start
2005-09-30
Budget End
2006-09-29
Support Year
1
Fiscal Year
2005
Total Cost
$283,196
Indirect Cost
Name
University of Minnesota Twin Cities
Department
Biostatistics & Other Math Sci
Type
Schools of Engineering
DUNS #
555917996
City
Minneapolis
State
MN
Country
United States
Zip Code
55455
Wale, Nikil; Karypis, George (2009) Target fishing for chemical compounds using target-ligand activity data and ranking based methods. J Chem Inf Model 49:2190-201
Podolyan, Yevgeniy; Karypis, George (2009) Common pharmacophore identification using frequent clique detection algorithm. J Chem Inf Model 49:13-21
DeRonne, Kevin W; Karypis, George (2009) Improved estimation of structure predictor quality. BMC Struct Biol 9:41
Wale, Nikil; Watson, Ian A; Karypis, George (2008) Indirect similarity based methods for effective scaffold-hopping in chemical compounds. J Chem Inf Model 48:730-41