? ? The Unified Medical Language System (UMLS) is an invaluable resource for the biomedical community. ? One of the intended uses of the UMLS Metathesaurus is to support the translation of terms from a source terminology into terms in a target terminology. It is evident from the research literature on the UMLS that users generally need to perform more broader types of """"""""translations"""""""" that involve finding terms with closest meaning to source term (mapping), finding terms that are related to source term and can serve as proxy for various functions (e.g. information retrieval, knowledge discovery) or finding target terms that satisfy some structural or semantic constraint (e.g. information theoretic distance). The methods for finding such """"""""translations"""""""" or connections between terms in Meta (other than the case of one-to-one synonymy) are not at all clear. Previous attempts to exploit such connections have depended on either manual selection of relevant connections, or problem-specific algorithms that use expert knowledge about the relative suitability of various inter-concept relationships. We believe that machine learning techniques offer automated, generalizable approaches that are appropriate for use with the UMLS, given the large set of potential connections and the need for a problem-independent approach. We hypothesize that learning strategies that exploit the relational features, scale free properties and probabilistic dependencies of connections in the UMLS will identify meaningful inter-term relationships and that a combined approach will perform better across different problem domains when compared to any of the approaches in isolation. We will evaluate the proposed learning algorithms with training connections from a variety of problem domains in biomedicine. We will disseminate the successful algorithms via the UMLS Knowledge Source API toolkit for mining and visualizing the connections. We believe that the UMLS provides a unique fertile ground to develop novel semantic relational mining methods and advance our understanding of mining large biomedical concept graphs. ? ? ? ?

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Exploratory/Developmental Grants (R21)
Project #
1R21LM009638-01
Application #
7299922
Study Section
Special Emphasis Panel (ZLM1-ZH-H (M3))
Program Officer
Sim, Hua-Chuan
Project Start
2007-09-30
Project End
2009-09-29
Budget Start
2007-09-30
Budget End
2008-09-29
Support Year
1
Fiscal Year
2007
Total Cost
$181,125
Indirect Cost
Name
Columbia University (N.Y.)
Department
Internal Medicine/Medicine
Type
Schools of Medicine
DUNS #
621889815
City
New York
State
NY
Country
United States
Zip Code
10032
Freedberg, Daniel E; Salmasian, Hojjat; Friedman, Carol et al. (2013) Proton pump inhibitors and risk for recurrent Clostridium difficile infection among inpatients. Am J Gastroenterol 108:1794-801
Patel, Chintan O; Cimino, James J (2010) A network-theoretic approach for decompositional translation across Open Biological Ontologies. J Biomed Inform 43:608-12
Patel, Chintan O; Cimino, James J (2009) Using semantic and structural properties of the Unified Medical Language System to discover potential terminological relationships. J Am Med Inform Assoc 16:346-53