Recent advances in high-throughput measurements of critical parameters related to cancer genesis and development have led to a wealth of cancer-related information available in public and private databases. To realize the promise of dramatic advancement in integrative cancer research enabled by this rapidly expanding information, novel informatics tools that allow researchers to efficiently integrate this available data are needed. The main objective of this proposal is the development of a computerized system capable of integrating data and information from disparate sources in order to enable enhanced models of cancer processes. The proposed system aims to use and expand the capabilities of the Cancer Biomedical Informatics Grid (caBIG) to allow users and applications to perform queries on cancer-related data sources at a conceptual level, utilizing the rich semantic information contained in caBIG in novel ways. The system contains mechanisms to expose these semantics using the Web Ontology Language (OWL), and to build and execute RDF-based querying using SPARQL, automatically identifying data services to query within the grid. An intuitive user interface providing multiple visualization and concept searching abilities is used to build queries and view results, while a programmatic interface is also provided for computer-to-computer interaction through Semantic Web standards. In Phase I of our proposal, the main algorithms and mechanisms of the proposed system, including the exposure of ontology views and the conversion of queries from SPARQL into caBIG's common query language will be developed, and proof-of-concept prototypes will be tested to prove the feasibility of our design.The Cancer Biology Data Integration System is an information integration solution that enables users to query cancer-related data using conceptual abstractions in a declarative manner more closely resembling the way in which research questions are stated. It models the rich semantic information contained in the Cancer Biomedical Informatics Grid (caBIG) as an ontology view, and uses Semantic Web standards to create and execute queries into caBIG-compatible data sources. ? ? ?

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Small Business Innovation Research Grants (SBIR) - Phase I (R43)
Project #
1R43CA132293-01
Application #
7404838
Study Section
Special Emphasis Panel (ZRG1-BST-D (10))
Program Officer
Couch, Jennifer A
Project Start
2007-09-21
Project End
2008-08-31
Budget Start
2007-09-21
Budget End
2008-08-31
Support Year
1
Fiscal Year
2007
Total Cost
$234,689
Indirect Cost
Name
Infotech Soft, Inc.
Department
Type
DUNS #
035354070
City
Miami
State
FL
Country
United States
Zip Code
33131
Shironoshita, E Patrick; Jean-Mary, Yves R; Bradley, Ray M et al. (2009) semQA: SPARQL with Idempotent Disjunction. IEEE Trans Knowl Data Eng 21:401-414
Shironoshita, E Patrick; Jean-Mary, Yves R; Bradley, Ray M et al. (2008) semCDI: a query formulation for semantic data integration in caBIG. J Am Med Inform Assoc 15:559-68