Asthma is the most common chronic condition in children and one of the five most burdensome disease in the United States. Despite this, epidemiologic investigations into childhood asthma are limited by variations in asthma diagnosis across sites and inefficient utilization of electronic medical records (EMRs) to facilitate large- scale studies. Algorithms based on structured data (e.g., ICD-9 codes) have shown strong specificity, but lack the sensitivity required for population-based studies for asthma. Manual EMR reviews allow application of well- recognized criteria-based definitions such as the Asthma Predictive Index (API) or the Predetermined Asthma Criteria (PAC), but are labor-intensive and expensive, and therefore not feasible for population-level studies. Because of the lack of consistent, reproducible, and efficient asthma ascertainment methods, the use of inconsistent a. asthma criteria, b. ascertainment processes, and c. sampling frames results in inconsistent asthma cohorts and study results for clinical trials or other studies. This inconsistency causes confusion, delayed translation of important study findings into clinical practice, and may obscure the true heterogeneity of asthma. Our long-term goal is to advance research and clinical care for asthma, by developing a robust software tool to streamline the process of automatic medical record ascertainment of asthma based on the asthma criteria (PAC and API). We propose to augment traditional structured data criteria with natural language processing (NLP) techniques to account for unstructured text. Thus, the main goal of this proposal is to develop NLP-API, an NLP algorithm for automating API, and apply the NLP algorithms for both PAC and API to identify a cohort of children with asthma. In addition, we will use the tools to characterize children with asthma thereby demonstrating its usefulness in epidemiological investigations and also possibly in asthma management. We hypothesize that asthma criteria-based NLP algorithms applied to the EMR will allow us to identify and characterize asthma status accurately, consistently, and efficiently.
In Aim 1, we will develop NLP-API, an NLP algorithm for API.
In Aim 2, we will apply both NLP-API (developed under Aim 1) and NLP-PAC (our recently developed PAC-based NLP algorithm) to two evaluation cohorts.
In Aim 3, we will characterize the subgroups of children with asthma identified under Aim 2 by assessing the association of NLP-ascertained asthma status with lung function and biomarkers for asthma. The expected outcomes of the proposed study are: (i) enhanced research capabilities for asthma by enabling more consistent, reproducible, and efficient large-scale asthma ascertainment, sampling frames, and timing estimations; (ii) a basis for improving timely asthma diagnosis and care through clinical decision support systems; and (iii) advancement of the use of NLP techniques for clinical studies. Successful completion of this project will provide an accurate, consistent, and efficient tool for addressing the significant burden of asthma in children and a framework for extension to other chronic diseases and adults.

Public Health Relevance

Asthma is the most common chronic condition in children and one of the five most burdensome diseases in the United States. However, conducting research on and developing quality improvement initiatives around asthma management at the population level is difficult, primarily due to the inconsistent identification of who has asthma and who doesn't. Identification of asthma status from review of medical records is very time consuming and expensive. Thus, researchers and clinicians often use more easily available but potentially inconsistent ways to determine asthma status, such as review of diagnostic billing codes. This results in varied cohorts of people defined as having asthma that are associated with inconsistent study outcomes in asthma such as risk of asthma or asthma control. We will use natural language processing, a computer based search of electronic medical records (clinical notes), to efficiently determine who has asthma, what kind of asthma, and the onset of recognizable asthma symptoms that identify asthma efficiently, accurately, and consistently. An algorithm that robustly ascertains the status of asthma, and by extension possibly other chronic diseases, will accelerate scientific discoveries on asthma, enhance translation of research findings into clinical practice, and ultimately improve patient care.

Agency
National Institute of Health (NIH)
Institute
National Heart, Lung, and Blood Institute (NHLBI)
Type
Research Project (R01)
Project #
5R01HL126667-02
Application #
9032521
Study Section
Infectious Diseases, Reproductive Health, Asthma and Pulmonary Conditions Study Section (IRAP)
Program Officer
Freemer, Michelle M
Project Start
2015-04-01
Project End
2018-03-31
Budget Start
2016-04-01
Budget End
2017-03-31
Support Year
2
Fiscal Year
2016
Total Cost
Indirect Cost
Name
Mayo Clinic, Rochester
Department
Type
DUNS #
006471700
City
Rochester
State
MN
Country
United States
Zip Code
55905
Kaur, Harsheen; Sohn, Sunghwan; Wi, Chung-Il et al. (2018) Automated chart review utilizing natural language processing algorithm for asthma predictive index. BMC Pulm Med 18:34
Wi, C-I; Krusemark, E A; Voge, G et al. (2018) Usefulness of asthma predictive index in ascertaining asthma status of children using medical records: An explorative study. Allergy 73:1276-1283
Yawn, Barbara P; Wollan, Peter C; Rank, Matthew A et al. (2018) Use of Asthma APGAR Tools in Primary Care Practices: A Cluster-Randomized Controlled Trial. Ann Fam Med 16:100-110
Sheen, Youn Ho; Rolfes, Mary C; Wi, Chung-Il et al. (2018) Association of Asthma with Rheumatoid Arthritis: A Population-Based Case-Control Study. J Allergy Clin Immunol Pract 6:219-226
Wi, Chung-Il; Sohn, Sunghwan; Ali, Mir et al. (2018) Natural Language Processing for Asthma Ascertainment in Different Practice Settings. J Allergy Clin Immunol Pract 6:126-131
Sohn, Sunghwan; Wi, Chung-Il; Juhn, Young J et al. (2017) Analysis of Clinical Variations in Asthma Care Documented in Electronic Health Records Between Staff and Resident Physicians. Stud Health Technol Inform 245:1170-1174
Voge, Gretchen A; Carey, William A; Ryu, Euijung et al. (2017) What accounts for the association between late preterm births and risk of asthma? Allergy Asthma Proc 38:152-156
Wi, Chung-Il; Sohn, Sunghwan; Rolfes, Mary C et al. (2017) Application of a Natural Language Processing Algorithm to Asthma Ascertainment. An Automated Chart Review. Am J Respir Crit Care Med 196:430-437
Ryu, Euijung; Wi, Chung-Il; Crow, Sheri S et al. (2016) Assessing health disparities in children using a modified housing-related socioeconomic status measure: a cross-sectional study. BMJ Open 6:e011564
Mehrabi, Saeed; Krishnan, Anand; Sohn, Sunghwan et al. (2015) DEEPEN: A negation detection system for clinical text incorporating dependency relation into NegEx. J Biomed Inform 54:213-9