The appearance of new scientific research methods has greatly increased the volume of molecular data in all the basic medical sciences. While many data resources are available on-line and generally very useful to researchers, they are difficult to use for a non-expert user. Since every database uses its own user interface and vocabulary, querying these databases and combining results can be very time consuming. In addition, current search engines return too many inaccurate results. There is thus an urgent need for development of better access to the molecular databases through user-friendly interface and high-precision retrieval. The ultimate goal of this project is to develop technologies that allow user to access biological information stored in heterogeneous data resources by entering queries in explicit sentences or questions. Natural language search system provides a unified and transparent interface by translating questions into appropriate database retrievals. It also promises higher precision than conventional keyword-based search engines. The specific objective of this Phase I research is to develop algorithms for extracting higher level semantic structures composed of concepts, and relationships between concepts, from both questions and potential answers. A domain ontology, such as the UMLS and the GO ontology, is incorporated to provide a conceptual framework between linguistic primitives/structures and domain-specific concepts/relations. Answers are retrieved through dynamic, query-driven entity-relational filtering, transformation and matching. In Phase I, the feasibility and efficacy of the proposed approach will be demonstrated and tested on question interpretation and answer retrieval from the MEDLINE bibliographical database.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Small Business Innovation Research Grants (SBIR) - Phase I (R43)
Project #
1R43LM008464-01
Application #
6831997
Study Section
Special Emphasis Panel (ZRG1-BDMA (01))
Program Officer
Sim, Hua-Chuan
Project Start
2005-02-01
Project End
2006-01-31
Budget Start
2005-02-01
Budget End
2006-01-31
Support Year
1
Fiscal Year
2005
Total Cost
$99,354
Indirect Cost
Name
Insightful Corporation
Department
Type
DUNS #
150683779
City
Seattle
State
WA
Country
United States
Zip Code
98109