The construction of ontologies that define the entities in an application area and the relationships among them has become essential for modern work in biomedicine. Ontologies help both humans and computers to manage burgeoning numbers of data. The need to annotate, retrieve, and integrate high-throughput data sets, to process natural language, and to build systems for decision support has set many communities of biomedical investigators to work building large ontologies. We developed and evaluated the Collaborative Prot?g? system in the first phase of our research project. This software system has become an indispensable open-source resource for an international community of scientists who develop ontologies in a cooperative, distributed manner. In this competing renewal proposal, we describe novel data-driven methods and tools that promise to make collaborative ontology design both more streamlined and more principled. Our goal is to create a more empirical basis for ontology engineering, and to develop methods whereby the ontology-engineering enterprise both can profit from data regarding the underlying processes and those processes in turn can generate increasing amounts of data to inform future ontology-engineering activities. Our research plan entails three specific aims. First, we will enable ontology developers to apply ontology-design patterns (ODPs) to their ontologies, and we will measure the way in which these patterns alter the ontology-engineering process. Second, we will analyze the vast amounts of log data that we collect from users of Collaborative Prot?g? to understand the patterns of ontology development. We will use these patterns to recommend to developers areas of ontologies that may need their attention, facilitating the process of reaching consensus and making collaborative ontology engineering more efficient. Finally, we will use the extensive data collected by our group and others to understand how scientists reuse terms from various ontologies and we will use these emerging patterns to facilitate term reuse. Each of these analyses not only will increase our understanding of collaboration in scientific modeling, but also will lead to new technology within our Collaborative Prot?g? suite that will improve the ontology-development process and make collaboration among biomedical scientists more efficient.

Public Health Relevance

Collaborative Prot g is a software system that helps a burgeoning user community to cooperate in developing ontologies that enhance biomedical research and improve patient care. Collaborative Prot g supports scientists; clinician researchers; and workers in informatics to build ontologies to solve problems in data annotation; data integration; information retrieval; natural-language processing; electronic patient record systems; and decision support. The proposed research will develop data-driven methods to identify patterns in design; development; and use of ontologies; and will apply these methods to help us to build new technology that both facilitates the ontology-development process and makes ontology design more principled.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM086587-06
Application #
8628132
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Brazhnik, Paul
Project Start
2009-03-01
Project End
2017-01-31
Budget Start
2014-02-01
Budget End
2015-01-31
Support Year
6
Fiscal Year
2014
Total Cost
$525,880
Indirect Cost
$189,620
Name
Stanford University
Department
Social Sciences
Type
Schools of Medicine
DUNS #
009214214
City
Stanford
State
CA
Country
United States
Zip Code
94305
Mortensen, Jonathan M; Minty, Evan P; Januszyk, Michael et al. (2015) Using the wisdom of the crowds to find critical errors in biomedical ontologies: a study of SNOMED CT. J Am Med Inform Assoc 22:640-8
Walk, Simon; Singer, Philipp; Strohmaier, Markus et al. (2014) Discovering beaten paths in collaborative ontology-engineering projects using Markov chains. J Biomed Inform 51:254-71
Strohmaier, Markus; Walk, Simon; Poschko, Jan et al. (2013) How Ontologies are Made: Studying the Hidden Social Dynamics Behind Collaborative Ontology Engineering Projects. Web Semant 20:
Tudorache, Tania; Nyulas, Csongor; Noy, Natalya F et al. (2013) WebProtege: A Collaborative Ontology Editor and Knowledge Acquisition Tool for the Web. Semant Web 4:89-99