The long-term objective of this proposal is to advance patient safety and reduce the cost of medical care by discovering novel adverse drug events (ADEs) through use of automated methods. We will utilize natural language processing (NLP) and data mining methodologies on vast quantities of clinical data in electronic health records (EHRs) to detect novel ADE signals. ADEs are major problems world-wide and cause hospitalizations, deaths, and incur a huge cost to health care. Therefore, continued post-marketing surveillance encompassing large and varied patient populations is crucial for patient safety. EHRs contain a comprehensive amount of clinical information, which if harnessed properly, would be invaluable for pharmacovigilance. We have already demonstrated that we can accurately encode information in clinical reports using the NLP system MedLEE, and that we can accurately detect associations among clinical events using statistical methods that we developed. Therefore, this is an excellent opportunity to continue our research accomplishments and to advance the state of the art in pharmacovigilance. More specifically, MedLEE will be used to map comprehensive clinical information in the EHR to codified data, and then statistical methods will be used to generate an extensive knowledge base of disease-symptom, disease-drug, drug-drug, and drug-symptom associations, which will be used to discover new ADEs. Additionally, we will develop methods to determine the correct sequence of drug, disease, and symptom events, which is critical for detecting ADEs. We will also develop methods to map fine-grained concepts into higher level concepts, which is important for optimizing the statistical methods. The performance of our discovery methods will be evaluated by testing the methods using drugs currently in use with known ADEs, and also by using historical rollback. We will first focus on discovery of short-term events using inpatient records, and then longer-term events using outpatient office visits. This proposal is well positioned to overcome problems associated with existing automated methods based on spontaneous reporting databases and administrative databases. We are confident the methods will be effective because a strong infrastructure is in place for us to build upon. Most importantly, the methodology developed in this proposal presents an excellent chance to dramatically improve patient safety and reduce costs.

Public Health Relevance

This proposal aims to improve patient safety and reduce health care costs by developing effective methods for the discovery of new adverse drug events. The use of natural language processing on vast quantities of EHR records will result in the harnessing of comprehensive clinical information for this purpose, overcoming some of the limitations of current methods that rely on spontaneous reporting and administrative databases.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Research Project (R01)
Project #
5R01LM010016-02
Application #
7779983
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Sim, Hua-Chuan
Project Start
2009-07-01
Project End
2013-06-30
Budget Start
2010-07-01
Budget End
2011-06-30
Support Year
2
Fiscal Year
2010
Total Cost
$343,397
Indirect Cost
Name
Columbia University (N.Y.)
Department
Internal Medicine/Medicine
Type
Schools of Medicine
DUNS #
621889815
City
New York
State
NY
Country
United States
Zip Code
10032
Vilar, Santiago; Friedman, Carol; Hripcsak, George (2018) Detection of drug-drug interactions through data mining studies using clinical sources, scientific literature and social media. Brief Bioinform 19:863-877
Backenroth, Daniel; Chase, Herbert S; Wei, Ying et al. (2017) Monitoring prescribing patterns using regression and electronic health records. BMC Med Inform Decis Mak 17:175
Chase, Herbert S; Mitrani, Lindsey R; Lu, Gabriel G et al. (2017) Early recognition of multiple sclerosis using natural language processing of the electronic health record. BMC Med Inform Decis Mak 17:24
Yadav, Kabir; Sarioglu, Efsun; Choi, Hyeong Ah et al. (2016) Automated Outcome Classification of Computed Tomography Imaging Reports for Pediatric Traumatic Brain Injury. Acad Emerg Med 23:171-8
Finkelstein, Joseph; Friedman, Carol; Hripcsak, George et al. (2016) Pharmacogenetic polymorphism as an independent risk factor for frequent hospitalizations in older adults with polypharmacy: a pilot study. Pharmgenomics Pers Med 9:107-116
Backenroth, Daniel; Chase, Herbert; Friedman, Carol et al. (2016) Using Rich Data on Comorbidities in Case-Control Study Design with Electronic Health Record Data Improves Control of Confounding in the Detection of Adverse Drug Reactions. PLoS One 11:e0164304
Salmasian, Hojjat; Tran, Tran H; Chase, Herbert S et al. (2015) Medication-indication knowledge bases: a systematic review and critical appraisal. J Am Med Inform Assoc 22:1261-70
Adams, Hayden; Friedman, Carol; Finkelstein, Joseph (2015) Automated Determination of Publications Related to Adverse Drug Reactions in PubMed. AMIA Jt Summits Transl Sci Proc 2015:31-5
Li, Ying; Ryan, Patrick B; Wei, Ying et al. (2015) A Method to Combine Signals from Spontaneous Reporting Systems and Observational Healthcare Data to Detect Adverse Drug Reactions. Drug Saf 38:895-908
Li, Ying; Salmasian, Hojjat; Vilar, Santiago et al. (2014) A method for controlling complex confounding effects in the detection of adverse drug reactions using electronic health records. J Am Med Inform Assoc 21:308-14

Showing the most recent 10 out of 47 publications