Individuals with substance use disorders are disproportionately experiencing homelessness, poverty, and chronic medical conditions (diabetes and hypertension), which are emerging risk factors for contracting SARS-CoV-2 (official name for the virus that causes COVID-19). Different types of substance use have been associated with development of respiratory infections and progression to severe respiratory failure, also known as Acute Respiratory Distress Syndrome (ARDS). However, complex syndromes like ARDS and behavioral conditions like substance misuse are difficult to identify from the electronic health record. Clinical notes and radiology reports provide a rich source of information that may be used to identify cases of substance misuse and ARDS. This information is routinely recorded during hospital care, and automated, data-driven solutions with natural language processing (NLP) can extract semantics and important risk factors from the unstructured data of clinical notes. The computational methods of NLP derive meaning from clinical notes, from which machine learning can predict risk factors for patients leaving AMA or progressing to respiratory failure. Our team developed tools with >80% sensitivity/specificity to identify individual types of substance misuse using NLP with machine learning (ML). Our single-center models delineated risk factors embedded in the notes (e.g., mental health conditions, socioeconomic indicators). Further, we have developed and externally validated a machine learning tool to identify cases of ARDS with high accuracy for early treatment.
We aim to expand this work by pooling data across health systems and build a generalizable and comprehensive classifier that captures multiple types of substance misuse for use in risk stratification and prognostication during the COVID pandemic. We hypothesize that a single-model NLP substance misuse classifier will provide a standardized, interoperable, and accurate approach for universal analysis of hospitalized patients, and that such information can be used to identify those at risk for disrupted care and those at risk for respiratory failure.
We aim to train and test our substance misuse classifiers at Rush in a retrospective dataset of over 60,000 hospitalizations that have been manually screened with the universal screen, AUDIT, and DAST. This Administrative Supplement will allow us to examine the correlations between substances of misuse and risk for COVID-19 as well as development of Acute Respiratory Distress Syndrome (ARDS) in the context of these phenomena.

Public Health Relevance

We anticipate that the research proposed will provide novel and critically important tools in artificial intelligence for the detection of substance misuse and COVID-19 from the electronic health record (EHR). Development and validation of a digital classifier would enable a standardized approach to perform screening on all patient encounters on a daily basis in health systems. We will rigorously develop and test the classifier retrospectively on an existing dataset of 60,000 patients. This will serve as the first step towards a comprehensive universal screener that leverages available data in the EHR.

Agency
National Institute of Health (NIH)
Institute
National Institute on Drug Abuse (NIDA)
Type
Clinical Research Cooperative Agreements - Single Project (UG1)
Project #
3UG1DA049467-02S1
Application #
10173503
Study Section
Program Officer
Dobbins, Ronald
Project Start
2019-06-01
Project End
2024-02-29
Budget Start
2020-03-01
Budget End
2021-02-28
Support Year
2
Fiscal Year
2020
Total Cost
Indirect Cost
Name
Rush University Medical Center
Department
Psychiatry
Type
Schools of Medicine
DUNS #
068610245
City
Chicago
State
IL
Country
United States
Zip Code
60612