There is a significant academic and commercial need for new tools that provide high dimensional data visualizations, coupled to analytical data mining techniques. We believe that visualization is the interface to analysis and provides guidance in the discovery process. As a major aim, we will investigate and evaluate new visualization tools, some of which are proprietary, capable of displaying an arbitrary number of dimensions, some of which are proprietary, capable of displaying an arbitrary number of dimensions of data simultaneously. To do this, we will use the large public NCI DIS compound dataset that has been tested against a battery of 60 cancer cell lines. In addition to tool evaluation using this dataset, a lesser aim will be knowledge discovery in the dataset. We propose calculation of the Molconn-Z chemical descriptors and the combined data mining of these descriptors. and associated cell line data. This activity is aimed at the discovery of new compound cancer activity patterns that may be useful in a clinical setting. In a follow on Phase II research study, we will integrate the selected visualization and analytic tools into a robust integrated data mining package for commercial use.

Proposed Commercial Applications

The Specific Aims of this Phase I proposal will allow us to evaluate the commercial potential of high dimensional visualization and analysis tools using the publicly available NCI DIS dataset, as well as data mine this dataset for potential new discoveries.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Small Business Innovation Research Grants (SBIR) - Phase I (R43)
Project #
1R43CA094429-01
Application #
6443134
Study Section
Special Emphasis Panel (ZRG1-SSS-9 (12))
Program Officer
Choudhry, Jawahar
Project Start
2002-05-01
Project End
2002-10-31
Budget Start
2002-05-01
Budget End
2002-10-31
Support Year
1
Fiscal Year
2002
Total Cost
$99,225
Indirect Cost
Name
Anvil Informatics, Inc.
Department
Type
DUNS #
City
Lowell
State
MA
Country
United States
Zip Code
01854
McCarthy, John F; Marx, Kenneth A; Hoffman, Patrick E et al. (2004) Applications of machine learning and high-dimensional visualization in cancer detection, diagnosis, and management. Ann N Y Acad Sci 1020:239-62