The long-term objective of this proposal is to advance patient safety and reduce the cost of medical care by discovering novel adverse drug events (ADEs) through use of automated methods. We will utilize natural language processing (NLP) and data mining methodologies on vast quantities of clinical data in electronic health records (EHRs) to detect novel ADE signals. ADEs are major problems world-wide and cause hospitalizations, deaths, and incur a huge cost to health care. Therefore, continued post-marketing surveillance encompassing large and varied patient populations is crucial for patient safety. EHRs contain a comprehensive amount of clinical information, which if harnessed properly, would be invaluable for pharmacovigilance. We have already demonstrated that we can accurately encode information in clinical reports using the NLP system MedLEE, and that we can accurately detect associations among clinical events using statistical methods that we developed. Therefore, this is an excellent opportunity to continue our research accomplishments and to advance the state of the art in pharmacovigilance. More specifically, MedLEE will be used to map comprehensive clinical information in the EHR to codified data, and then statistical methods will be used to generate an extensive knowledge base of disease-symptom, disease-drug, drug-drug, and drug-symptom associations, which will be used to discover new ADEs. Additionally, we will develop methods to determine the correct sequence of drug, disease, and symptom events, which is critical for detecting ADEs. We will also develop methods to map fine-grained concepts into higher level concepts, which is important for optimizing the statistical methods. The performance of our discovery methods will be evaluated by testing the methods using drugs currently in use with known ADEs, and also by using historical rollback. We will first focus on discovery of short-term events using inpatient records, and then longer-term events using outpatient office visits. This proposal is well positioned to overcome problems associated with existing automated methods based on spontaneous reporting databases and administrative databases. We are confident the methods will be effective because a strong infrastructure is in place for us to build upon. Most importantly, the methodology developed in this proposal presents an excellent chance to dramatically improve patient safety and reduce costs.

Public Health Relevance

This proposal aims to improve patient safety and reduce health care costs by developing effective methods for the discovery of new adverse drug events. The use of natural language processing on vast quantities of EHR records will result in the harnessing of comprehensive clinical information for this purpose, overcoming some of the limitations of current methods that rely on spontaneous reporting and administrative databases.

National Institute of Health (NIH)
National Library of Medicine (NLM)
Research Project (R01)
Project #
Application #
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Sim, Hua-Chuan
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Columbia University (N.Y.)
Internal Medicine/Medicine
Schools of Medicine
New York
United States
Zip Code
Backenroth, Daniel; Chase, Herbert S; Wei, Ying et al. (2017) Monitoring prescribing patterns using regression and electronic health records. BMC Med Inform Decis Mak 17:175
Yadav, Kabir; Sarioglu, Efsun; Choi, Hyeong Ah et al. (2016) Automated Outcome Classification of Computed Tomography Imaging Reports for Pediatric Traumatic Brain Injury. Acad Emerg Med 23:171-8
Finkelstein, Joseph; Friedman, Carol; Hripcsak, George et al. (2016) Pharmacogenetic polymorphism as an independent risk factor for frequent hospitalizations in older adults with polypharmacy: a pilot study. Pharmgenomics Pers Med 9:107-116
Backenroth, Daniel; Chase, Herbert; Friedman, Carol et al. (2016) Using Rich Data on Comorbidities in Case-Control Study Design with Electronic Health Record Data Improves Control of Confounding in the Detection of Adverse Drug Reactions. PLoS One 11:e0164304
Salmasian, Hojjat; Tran, Tran H; Chase, Herbert S et al. (2015) Medication-indication knowledge bases: a systematic review and critical appraisal. J Am Med Inform Assoc 22:1261-70
Adams, Hayden; Friedman, Carol; Finkelstein, Joseph (2015) Automated Determination of Publications Related to Adverse Drug Reactions in PubMed. AMIA Jt Summits Transl Sci Proc 2015:31-5
Li, Ying; Ryan, Patrick B; Wei, Ying et al. (2015) A Method to Combine Signals from Spontaneous Reporting Systems and Observational Healthcare Data to Detect Adverse Drug Reactions. Drug Saf 38:895-908
Vilar, S; Ryan, P B; Madigan, D et al. (2014) Similarity-based modeling applied to signal detection in pharmacovigilance. CPT Pharmacometrics Syst Pharmacol 3:e137
Salmasian, Hojjat; Tran, Tran H; Friedman, Carol (2014) Developing a formal representation for medication appropriateness criteria. AMIA Annu Symp Proc 2014:1911-9
Vilar, Santiago; Uriarte, Eugenio; Santana, Lourdes et al. (2014) State of the art and development of a drug-drug interaction large scale predictor based on 3D pharmacophoric similarity. Curr Drug Metab 15:490-501

Showing the most recent 10 out of 45 publications