The construction of ontologies that define the entities in an application area and the relationships among them has become essential for modern work in biomedicine. Ontologies help both humans and computers to manage burgeoning numbers of data. The need to annotate, retrieve, and integrate high-throughput data sets, to process natural language, and to build systems for decision support has set many communities of biomedical investigators to work building large ontologies. We developed and evaluated the Collaborative Prot?g? system in the first phase of our research project. This software system has become an indispensable open-source resource for an international community of scientists who develop ontologies in a cooperative, distributed manner. In this competing renewal proposal, we describe novel data-driven methods and tools that promise to make collaborative ontology design both more streamlined and more principled. Our goal is to create a more empirical basis for ontology engineering, and to develop methods whereby the ontology-engineering enterprise both can profit from data regarding the underlying processes and those processes in turn can generate increasing amounts of data to inform future ontology-engineering activities. Our research plan entails three specific aims. First, we will enable ontology developers to apply ontology-design patterns (ODPs) to their ontologies, and we will measure the way in which these patterns alter the ontology-engineering process. Second, we will analyze the vast amounts of log data that we collect from users of Collaborative Prot?g? to understand the patterns of ontology development. We will use these patterns to recommend to developers areas of ontologies that may need their attention, facilitating the process of reaching consensus and making collaborative ontology engineering more efficient. Finally, we will use the extensive data collected by our group and others to understand how scientists reuse terms from various ontologies and we will use these emerging patterns to facilitate term reuse. Each of these analyses not only will increase our understanding of collaboration in scientific modeling, but also will lead to new technology within our Collaborative Prot?g? suite that will improve the ontology-development process and make collaboration among biomedical scientists more efficient.
Collaborative Prot g is a software system that helps a burgeoning user community to cooperate in developing ontologies that enhance biomedical research and improve patient care. Collaborative Prot g supports scientists; clinician researchers; and workers in informatics to build ontologies to solve problems in data annotation; data integration; information retrieval; natural-language processing; electronic patient record systems; and decision support. The proposed research will develop data-driven methods to identify patterns in design; development; and use of ontologies; and will apply these methods to help us to build new technology that both facilitates the ontology-development process and makes ontology design more principled.
|Mortensen, Jonathan M; Minty, Evan P; Januszyk, Michael et al. (2015) Using the wisdom of the crowds to find critical errors in biomedical ontologies: a study of SNOMED CT. J Am Med Inform Assoc 22:640-8|
|Walk, Simon; Singer, Philipp; Strohmaier, Markus et al. (2014) Discovering beaten paths in collaborative ontology-engineering projects using Markov chains. J Biomed Inform 51:254-71|
|Strohmaier, Markus; Walk, Simon; Poschko, Jan et al. (2013) How Ontologies are Made: Studying the Hidden Social Dynamics Behind Collaborative Ontology Engineering Projects. Web Semant 20:|
|Tudorache, Tania; Nyulas, Csongor; Noy, Natalya F et al. (2013) WebProtege: A Collaborative Ontology Editor and Knowledge Acquisition Tool for the Web. Semant Web 4:89-99|