Skip to main content
An official website of the United States government
Grant Details

Grant Number: 1R43CA094429-01 Interpret this number
Primary Investigator: Mcmanus, Michael
Organization: Anvil Informatics, Inc.
Project Title: Very High Dimensional Visual Mining of the NCI Dataset
Fiscal Year: 2002


Abstract

There is a significant academic and commercial need for new tools that provide high dimensional data visualizations, coupled to analytical data mining techniques. We believe that visualization is the interface to analysis and provides guidance in the discovery process. As a major aim, we will investigate and evaluate new visualization tools, some of which are proprietary, capable of displaying an arbitrary number of dimensions, some of which are proprietary, capable of displaying an arbitrary number of dimensions of data simultaneously. To do this, we will use the large public NCI DIS compound dataset that has been tested against a battery of 60 cancer cell lines. In addition to tool evaluation using this dataset, a lesser aim will be knowledge discovery in the dataset. We propose calculation of the Molconn-Z chemical descriptors and the combined data mining of these descriptors. and associated cell line data. This activity is aimed at the discovery of new compound cancer activity patterns that may be useful in a clinical setting. In a follow on Phase II research study, we will integrate the selected visualization and analytic tools into a robust integrated data mining package for commercial use. PROPOSED COMMERCIAL APPLICATIONS: The Specific Aims of this Phase I proposal will allow us to evaluate the commercial potential of high dimensional visualization and analysis tools using the publicly available NCI DIS dataset, as well as data mine this dataset for potential new discoveries.



Publications

Applications of machine learning and high-dimensional visualization in cancer detection, diagnosis, and management.
Authors: McCarthy J.F. , Marx K.A. , Hoffman P.E. , Gee A.G. , O'Neil P. , Ujwal M.L. , Hotchkiss J. .
Source: Annals of the New York Academy of Sciences, 2004 May; 1020, p. 239-62.
PMID: 15208196
Related Citations

Data mining the NCI cancer cell line compound GI(50) values: identifying quinone subtypes effective against melanoma and leukemia cell classes.
Authors: Marx K.A. , O'Neil P. , Hoffman P. , Ujwal M.L. .
Source: Journal of chemical information and computer sciences, 2003 Sep-Oct; 43(5), p. 1652-67.
PMID: 14502500
Related Citations




Back to Top