We hypothesize that there are significant synergies between the applications of biomedical ontologies and of biomedical language processing (BLP) which can be used to improve the quality and scope of both activities. A growing body of work suggests such synergies might exist, but there has yet to be a systematic exploration of their potential. We propose to carry out a focused effort to explore both the potential for, and obstacles to, the mutual application of biomedical ontologies and biomedical language processing. To provide immediate biological relevance to our work, we propose to focus on the topics of autoimmune and pulmonary disease. We group our proposed explorations into three specific aims: (1) Create novel tools and approaches for the application and maintenance of biomedical ontologies, based on an assessment of the processes and tools used for the ontological annotation of textual corpora in the biomedical language processing community. Particularly, we will focus on the creation of new methods for effective search through large ontologies, compositional approaches to annotation, effective capture of the evidence underlying annotations, and the use of automated suggestions for manual confirmation. (2) Evaluate the utility of BLP tools and techniques when applied to terms and definitions of biomedical ontologies, both to enrich and interconnect orthogonal ontologies, and to provide quality assurance and quality control mechanisms. Particularly, we propose to develop and evaluate methods for connecting terms within and across ontologies, for assessing completeness of an ontology against the literature, and for implementing automatically executable measures of ontology quality. (3) Compare the differences between annotations produced by manual procedures and those produced by automated BLP methods for completeness and correctness. Based on the resulting data, produce guidelines for the optimal interplay between manual and automatic procedures for producing broad, accurate and useful knowledge-bases. Because ontologies are the central organizing tool of the model organism databases, improvements in their quality and in the ease and efficiency of their use will have a major effect on the model organism databases, speed the translational process generally, and create a potentially large public health impact. ? ? ?

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
1R01GM083649-01A1
Application #
7364235
Study Section
Special Emphasis Panel (ZRG1-BST-E (50))
Program Officer
Lyster, Peter
Project Start
2007-09-30
Project End
2011-08-31
Budget Start
2007-09-30
Budget End
2008-08-31
Support Year
1
Fiscal Year
2007
Total Cost
$631,600
Indirect Cost
Name
University of Colorado Denver
Department
Pharmacology
Type
Schools of Medicine
DUNS #
041096314
City
Aurora
State
CO
Country
United States
Zip Code
80045
Cohen, K Bretonnel; Hunter, Lawrence E (2013) Chapter 16: text mining for translational bioinformatics. PLoS Comput Biol 9:e1003044
Verspoor, Karin; Cohen, Kevin Bretonnel; Lanfranchi, Arrick et al. (2012) A corpus of full-text journal articles is a robust evaluation tool for revealing differences in performance of biomedical natural language processing tools. BMC Bioinformatics 13:207
Jani, Alkesh; Orlicky, David J; Karimpour-Fard, Anis et al. (2012) Kidney proteome changes provide evidence for a dynamic metabolism and regional redistribution of plasma proteins during torpor-arousal cycles of hibernation. Physiol Genomics 44:717-27
Hahn, Udo; Cohen, K Bretonnel; Garten, Yael et al. (2012) Mining the pharmacogenomics literature--a survey of the state of the art. Brief Bioinform 13:460-94
Grabek, Katharine R; Karimpour-Fard, Anis; Epperson, L Elaine et al. (2011) Multistate proteomics analysis reveals novel strategies used by a hibernator to precondition the heart and conserve ATP for winter heterothermy. Physiol Genomics 43:1263-75
Kano, Yoshinobu; Bjorne, Jari; Ginter, Filip et al. (2011) U-Compare bio-event meta-service: compatible BioNLP event extraction services. BMC Bioinformatics 12:481
Cohen, K Bretonnel; Verspoor, Karin; Johnson, Helen L et al. (2011) HIGH-PRECISION BIOLOGICAL EVENT EXTRACTION: EFFECTS OF SYSTEM AND OF DATA. Comput Intell 27:681-701
Lu, Zhiyong; Kao, Hung-Yu; Wei, Chih-Hsuan et al. (2011) The gene normalization task in BioCreative III. BMC Bioinformatics 12 Suppl 8:S2
Roeder, Christophe; Jonquet, Clement; Shah, Nigam H et al. (2010) A UIMA wrapper for the NCBO annotator. Bioinformatics 26:1800-1
Cohen, K Bretonnel; Johnson, Helen L; Verspoor, Karin et al. (2010) The structural and content aspects of abstracts versus bodies of full text journal articles are different. BMC Bioinformatics 11:492

Showing the most recent 10 out of 16 publications