Real-time Disambiguation of Abbreviations in Clinical Notes

Xu, Hua

Abstract

A key prerequisite for high-quality healthcare delivery is effective communication within and across healthcare settings. However, communication can be hampered by the pervasive use of abbreviations in clinical notes. Clinicians use abbreviations to save time during documentation. While abbreviations may seem unambiguous to their authors, they often cause confusion to other readers, including healthcare providers, patients, and natural language processing (NLP) systems attempting to extract clinical terms from text. While the understanding that abbreviations can cause errors is widespread, few have deployed pragmatic solutions for this important problem. The proposed project will develop, evaluate, and share a systematic approach to Clinical Abbreviation Recognition and Disambiguation (CARD), and in doing so substantially aims to benefit existing NLP systems and to improve computer-based documentation systems by reducing ambiguities in electronic records in real-time. The study includes the following five Specific Aims: 1) Develop automated methods to detect abbreviations and their senses from clinical text corpora and build a comprehensive knowledge base of clinical abbreviations;2) Develop and evaluate three automated word sense disambiguation (WSD) classifiers, and establish methods to combine those classifiers to maximize both their performance and coverage;3) Develop the CARD system, and demonstrate its effectiveness by integrating it with two established NLP systems (MedLEE and KnowledgeMap);4) Integrate CARD with an institutional clinical documentation system (Vanderbilt's StarNotes) and evaluate its ability to expand abbreviations in real-time as clinicians generate records;5) Distribute the CARD knowledge base and software for non-commercial uses.

Public Health Relevance

Abbreviations are widely used throughout all types of clinical documents and they cause confusion to both healthcare providers and patients and limit effective communications within and across care settings. This proposed study will develop informatics methods to automatically detect abbreviations and their possible meanings from large clinical text and to disambiguate abbreviations that have multiple meanings. We will also integrate those methods with clinical documentation systems so that abbreviations will be expanded in real-time when physicians entering clinical notes, thus to improve the quality of health records.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Library of Medicine (NLM)
Type: Research Project (R01)
Project #: 5R01LM010681-03
Application #: 8305149
Study Section: Biomedical Library and Informatics Review Committee (BLR)
Program Officer: Sim, Hua-Chuan

Project Start: 2010-05-31
Project End: 2012-10-14
Budget Start: 2012-05-31
Budget End: 2012-10-14
Support Year: 3
Fiscal Year: 2012
Total Cost: $129,035
Indirect Cost: $46,320

Institution

Name: Vanderbilt University Medical Center
Department: Internal Medicine/Medicine
Type: Schools of Medicine
DUNS #: 004413456

City: Nashville
State: TN
Country: United States
Zip Code: 37212

Related projects


NIH 2017 R01 LM	Interactive machine learning methods for clinical natural language processing Xu, Hua / University of Texas Health Science Center Houston
NIH 2016 R01 LM	Interactive machine learning methods for clinical natural language processing Xu, Hua / University of Texas Health Science Center Houston
NIH 2015 R01 LM	Interactive machine learning methods for clinical natural language processing Xu, Hua / University of Texas Health Science Center Houston
NIH 2014 R01 LM	Interactive machine learning methods for clinical natural language processing Xu, Hua / University of Texas Health Science Center Houston	$558,372
NIH 2012 R01 LM	Real-time Disambiguation of Abbreviations in Clinical Notes Xu, Hua / Vanderbilt University Medical Center	$129,035
NIH 2011 R01 LM	Real-time Disambiguation of Abbreviations in Clinical Notes Xu, Hua / Vanderbilt University Medical Center	$374,000
NIH 2010 R01 LM	Real-time Disambiguation of Abbreviations in Clinical Notes Xu, Hua / Vanderbilt University Medical Center	$387,500

Publications

Lee, Hee-Jin; Zhang, Yaoyun; Jiang, Min et al. (2018) Identifying direct temporal relations between time and events from clinical notes. BMC Med Inform Decis Mak 18:49

Brusco, Lauren L; Wathoo, Chetna; Mills Shaw, Kenna R et al. (2018) Physician interpretation of genomic test results and treatment selection. Cancer 124:966-972

Zhang, Yaoyun; Zhang, Olivia; Wu, Yonghui et al. (2017) Psychiatric symptom recognition without labeled data using distributional representations of phrases and on-line knowledge. J Biomed Inform 75S:S129-S137

Wu, Yonghui; Jiang, Min; Xu, Jun et al. (2017) Clinical Named Entity Recognition Using Deep Learning Models. AMIA Annu Symp Proc 2017:1812-1819

Ji, Zongcheng; Zhang, Yaoyun; Xu, Jun et al. (2017) Comparing Cancer Information Needs for Consumers in the US and China. Stud Health Technol Inform 245:126-130

Lee, Hee-Jin; Zhang, Yaoyun; Roberts, Kirk et al. (2017) Leveraging existing corpora for de-identification of psychiatric notes using domain adaptation. AMIA Annu Symp Proc 2017:1070-1079

Lee, Hee-Jin; Wu, Yonghui; Zhang, Yaoyun et al. (2017) A hybrid approach to automatic de-identification of psychiatric notes. J Biomed Inform 75S:S19-S27

Wang, Yue; Zheng, Kai; Xu, Hua et al. (2016) Clinical Word Sense Disambiguation with Interactive Search and Classification. AMIA Annu Symp Proc 2016:2062-2071

Zhang, Yaoyun; Xu, Jun; Chen, Hui et al. (2016) Chemical named entity recognition in patents by domain knowledge and unsupervised feature learning. Database (Oxford) 2016:

Duan, Rui; Cao, Ming; Wu, Yonghui et al. (2016) An Empirical Study for Impacts of Measurement Errors on EHR based Association Studies. AMIA Annu Symp Proc 2016:1764-1773

Showing the most recent 10 out of 30 publications

Comments

Be the first to comment on this grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: