? The long-term objective of this project is to develop computational algorithms and software to gain theoretical and empirical insights in the use of chemical diversity for determining quantitative structure-activity relationships (QSARs). In addition to addressing scientific and technical goals with respect to QSAR modeling, planning-period tasks will include specific activities to bring together the researchers and to facilitate inter-disciplinary communication. ? ? Specific Aim 1 is to develop and enhance collaborations between three broad disciplines: statistics, computer science, and chemistry. This will be accomplished primarily through several intense workshops per year and regularly scheduled status meetings. ? ? Specific Aim 2 is to initiate a benchmarking study to compare structural descriptors, modeling strategies, and methods of model assessment. Through a web server, results will be posted from analyzing several datasets using many QSAR modeling techniques, a variety of molecular descriptors, and a number of assessment criteria. ? ? Specific Aim 3 is to design and beta-test web-accessibility of modeling software. PowerMV, a cheminformatics software tool created at the National Institute of Statistical Sciences, will be upgraded and made web-accessible. ? ? And Specific Aim 4 is to develop a broad view of cheminformatics tools based on the singular value decomposition and other similar decompositions where computations take advantage of the high degree of sparseness often exhibited by HTS data sets. ? ? The significance of these specific aims, in support of the long-term objective, is to reduce resource requirements for, and thus streamline, the process of drug discovery. ? ?

Public Health Relevance

? By reducing resource requirements for Lead Identification and Lead Optimization, which are two of the five phases of drug discovery, this project will help to streamline the entire discovery process. As a result, the nation's health and well-being will be improved because of the increased returns on research expenditures. These returns include decreasing the time for finding effective cures and/or treatments for diseases already being studied as well as increasing the number of diseases being studied. ? ?

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Exploratory Grants (P20)
Project #
1P20HG003900-01
Application #
7032110
Study Section
Special Emphasis Panel (ZHG1-HGR-N (O))
Program Officer
Ozenberger, Bradley
Project Start
2005-09-23
Project End
2007-07-31
Budget Start
2005-09-23
Budget End
2006-07-31
Support Year
1
Fiscal Year
2005
Total Cost
$385,126
Indirect Cost
Name
North Carolina State University Raleigh
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
042092122
City
Raleigh
State
NC
Country
United States
Zip Code
27695
Zhang, Ke; Hughes-Oliver, Jacqueline M; Young, S Stanley (2013) Analysis of High-Dimensional Structure-Activity Screening Datasets Using the Optimal Bit String Tree. Technometrics 55:161-173
Hughes-Oliver, Jacqueline M; Brooks, Atina D; Welch, William J et al. (2011) ChemModLab: a web-based cheminformatics modeling laboratory. In Silico Biol 11:61-81
Lin, Matthew M; Chu, Moody T (2010) On the Nonnegative Rank of Euclidean Distance Matrices. Linear Algebra Appl 433:681-689
Lin, Matthew M; Dong, Bo; Chu, Moody T (2010) SEMI-DEFINITE PROGRAMMING TECHNIQUES FOR STRUCTURED QUADRATIC INVERSE EIGENVALUE PROBLEMS. Numer Algorithms 53:419-437
Zhang, Qianyi; Hughes-Oliver, Jacqueline M; Ng, Raymond T (2009) A model-based ensembling approach for developing QSARs. J Chem Inf Model 49:1857-65
Dong, Bo; Lin, Matthew M; Chu, Moody T (2009) Parameter Reconstruction of Vibration Systems from Partial Eigeninformation. J Sound Vib 327:391-401
Liu, Jiajun; Hughes-Oliver, Jacqueline M; Menius Jr, J Alan (2007) Domain-enhanced analysis of microarray data using GO annotations. Bioinformatics 23:1225-34