Ischemic stroke is an important cause of morbidity, mortality, and cost within the VHA. An accurate assessment of who has had a stroke is a vital part of many ongoing VHA research and quality improvement efforts. However, current approaches to ascertaining this outcome have important limitations. Chart review, while accurate, is extremely resource intensive, thus limiting the size of projects relying upon it. Approaches using administrative data (i.e. ICD-9 codes) are less resource intensive, but less accurate. We propose a novel approach to ascertaining ischemic strokes from VHA automated data using natural language processing and machine learning. Natural language processing and machine learning use artificial intelligence principles to 'teach' a computer how to extract clinical data from free-text notes with acceptable sensitivity and specificity. OBJECTIVES Aim 1: Develop and evaluate an automated approach to the extraction of incident ischemic stroke from the free text of VHA medical records.
Aim 2 : Deploy this empirically evaluated approach upon an existing database of approximately 150,000 patients who have received warfarin (an oral anticoagulant) from the VHA, and are thus at elevated risk for ischemic stroke. METHODS Using chart review as our gold standard, we will develop algorithms to extract the ischemic stroke outcome from VHA medical records. We will begin by annotating clinical records, highlighting information that can inform the algorithm regarding which patients did or did not have a stroke. We will then divide the charts into a training set and a validation set. The training set will be used to inform the design of an algorithm for automated detection of ischemic stroke. The developed algorithm will be evaluated in terms of recall, precision, and harmonic mean (F-measure) using the validation set. These performance characteristics are analogous to sensitivity, positive predictive value, and overall accuracy. We will then deploy this validated algorithm upon an existing database of patients known to be at elevated risk of stroke. ANTICIPATED IMPACT The proposed project will greatly improve ascertainment of ischemic stroke from the VHA electronic health record, combining the accuracy usually associated with chart review with the low cost and large datasets usually associated with ICD-9 based approaches. Algorithms developed through this project will be made available to any VHA entity wishing to use them for research or quality improvement. This has the potential to facilitate VHA research and quality improvement efforts relating to the prevention and management of ischemic stroke.

Public Health Relevance

Ischemic stroke (stroke) occurs when a blockage causes blood to stop flowing to part of the brain, causing that part of the brain to die. Thousands of VHA patients experience stroke each year, leading to death, disability, and high costs of care. Two major goals for the VHA are to reduce the number of strokes, and when strokes do occur, to treat them as effectively as possible. Fulfilling these goals is predicated upon the ability accurately to identify when a stroke has occurred. However, currently available methods for detecting strokes from VHA data are either limited in accuracy (examination of automated data only) or extremely resource- intensive (manual chart review). We propose to use natural language processing and machine learning to teach a computer to review patient charts to determine the occurrence of stroke. This approach has now been used with success to abstract several similar outcomes from patient charts. We anticipate that this approach will achieve similar accuracy to manual chart review, while being much less resource intensive.

Agency
National Institute of Health (NIH)
Institute
Veterans Affairs (VA)
Type
Non-HHS Research Projects (I01)
Project #
1I01HX000265-01
Application #
7868733
Study Section
Blank (HSR6)
Project Start
2010-10-01
Project End
2011-03-31
Budget Start
2010-10-01
Budget End
2011-03-31
Support Year
1
Fiscal Year
2011
Total Cost
Indirect Cost
Name
Ern Memorial Veterans Hospital
Department
Type
DUNS #
080042336
City
Bedford
State
MA
Country
United States
Zip Code
01730