The UMLS is an invaluable resource to the biomedical community. However, the Metathesaurus's (META's) size and complexity can hinder its usefulness. We propose a number of divide and conquer techniques that facilitate the tasks of auditing and extending the META. The divide and conquer approach breaks down a large collection of items into more manageable units. In this research, the outcome will be new partitions of the META. Each partition will be employed in auditing the META. The UMLS's Semantic Network (SN) does not partition the META into disjoint sets. We first create a Refined Semantic Network (RSN) that does partition the META into disjoint sets of concepts (extents). Derived from the SN, the RSN incorporates intersection types along with pure semantic types and improves the UMLS in a number of ways, e.g., it contains types with semantically uniform and (usually) smaller extents. In addition to the RSN, three additional, novel partitioning techniques will be applied to extents: Partition along the dimension of relationship similarity; create singly rooted subgroups; and subdivide tangled, multi-rooted hierarchies. Each partition will (normally) expose new errors. We propose a large scale audit of the 2006AA UMLS concentrating on small groups obtained by the partition. The kinds of errors expected to be uncovered include: concept ambiguity and synonymy, wrong and missing IS-A and semantic relationships, inconsistent and incorrect classifications and omissions. We will perform an enhanced audit of the genomic component of the UMLS. For this we will create a Refined Genomic SN (RGSN). We will build an auditing software tool that incorporates the RSN and the three additional levels of partitioning. A formal evaluation, with human subjects, of the tool and a two-level expert evaluation of the auditing results are planned. Finally we focus on the integration of new terminologies into the UMLS, based on the RSN. In summary we will: 1) Create the RSN that partitions META into disjoint units; 2) Design three new partitioning techniques for extents of semantic types; 3) Audit the concepts of small groups; 4) Design a Refined Genomic SN (RGSN) for auditing genomic concepts; 5) Build an auditing tool; 6) Design and evaluate an integration technique for terminologies. ? ? ?
He, Zhe; Perl, Yehoshua; Elhanan, Gai et al. (2017) Auditing the Assignments of Top-Level Semantic Types in the UMLS Semantic Network to UMLS Concepts. Proceedings (IEEE Int Conf Bioinformatics Biomed) 2017:1262-1269 |
Geller, James; He, Zhe; Perl, Yehoshua et al. (2013) Rule-based support system for multiple UMLS semantic type assignments. J Biomed Inform 46:97-110 |
Chen, Yan; Gu, Huanying; Perl, Yehoshua et al. (2012) Overcoming an obstacle in expanding a UMLS semantic type extent. J Biomed Inform 45:61-70 |
Gu, Huanying Helen; Elhanan, Gai; Perl, Yehoshua et al. (2012) A study of terminology auditors' performance for UMLS semantic type assignments. J Biomed Inform 45:1042-8 |
Halper, Michael; Morrey, C Paul; Chen, Yan et al. (2011) Auditing hierarchical cycles to locate other inconsistencies in the UMLS. AMIA Annu Symp Proc 2011:529-36 |
Morrey, C Paul; Chen, Ling; Halper, Michael et al. (2011) Resolution of redundant semantic type assignments for organic chemicals in the UMLS. Artif Intell Med 52:141-51 |
Elhanan, Gai; Huang, Kuo-Chuan; Perl, Yehoshua (2010) Source authenticity in the UMLS--a case study of the Minimal Standard Terminology. J Biomed Inform 43:988-97 |
Morrey, C Paul; Geller, James; Halper, Michael et al. (2009) The Neighborhood Auditing Tool: a hybrid interface for auditing the UMLS. J Biomed Inform 42:468-89 |
Chen, Yan; Gu, Huanying Helen; Perl, Yehoshua et al. (2009) Structural group-based auditing of missing hierarchical relationships in UMLS. J Biomed Inform 42:452-67 |
Chen, Yan; Gu, Huanying Helen; Perl, Yehoshua et al. (2009) Structural group auditing of a UMLS semantic type's extent. J Biomed Inform 42:41-52 |
Showing the most recent 10 out of 19 publications