Adverse drug reactions (ADRs) are a major burden for patients and healthcare, causing preventable hospitalizations and deaths, and incurring a huge cost. The long-term objective of this proposal is to advance patient safety and reduce costs by discovering novel serious ADRs through use of automated methods that combine information from large and varied patient populations as well as from the literature. There have been considerable advances in pharmacovigilance, but more work is needed. For example, Vioxx, a commonly used drug, was recently found to cause at least 88,000 occurrences of myocardial infarction, highlighting the insufficiency of current methods. To date, methods have mainly depended on the use of single sources of data, primarily from the Federal Food and Drug Administration Adverse Event Reporting System (FAERS) and from electronic health records (EHRS). Although important, each of the sources has different limitations and advantages, and therefore, combining the data across them should lead to more effective drug safety surveillance by increasing the statistical power, and also by allowing each data source to complement the other sources. We already have developed methods associated with each of the single sources, and therefore, this is an excellent opportunity to build upon our research accomplishments to advance the state of the art in pharmacovigilance. More specifically, we will a) acquire and combine comprehensive clinical data from the electronic health records (EHRs) of two different health care sites serving diverse populations by utilizing natural language processing (NLP) to obtain vast quantities of fine-grained data, and then by developing data mining methodologies on the clinical data to detect novel ADR signals, b) analyze differences in therapy-related risk factors between the two EHR populations, such as racial and ethnic differences, c) detect ADR signals in the FAERS database using an established methodology, d) develop improved methods to acquire ADR signals based on information in the literature, and e) develop methods that utilize the results from the above sources to maximize effectiveness. We will focus on eight serious ADRs, and collect a high-quality reference standard for those ADRs so that we will be able to evaluate and compare performance of the different detection methods individually as well as the methods that combine the sources. This proposal is well positioned to overcome problems associated with existing automated methods, which are primarily based on use of individual sources of data. We are confident the methods will be effective because a strong infrastructure is in place for us to build upon. Most importantly, the methodology developed in this proposal presents an excellent chance to leverage heterogeneous data sources to dramatically improve patient safety and reduce costs.

Public Health Relevance

Adverse drug reactions (ADRs) are a major burden for patients and health care, causing preventable hospitalizations and deaths, and incurring huge costs, and, therefore, continuous post-marketing surveillance is crucial for patient safety. This proposal aims to improve patient safety and reduce health care costs by developing effective methods to discover new adverse drug reactions through the combination of information in the FDA's Adverse Event Reporting System, the literature, and comprehensive clinical data from electronic health records of two different sites with diverse populations, thereby overcoming limitations that rely mainly on use of one data source.

National Institute of Health (NIH)
National Library of Medicine (NLM)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZLM1-ZH-C (01))
Program Officer
Sim, Hua-Chuan
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Columbia University (N.Y.)
Internal Medicine/Medicine
Schools of Medicine
New York
United States
Zip Code
Vilar, Santiago; Friedman, Carol; Hripcsak, George (2018) Detection of drug-drug interactions through data mining studies using clinical sources, scientific literature and social media. Brief Bioinform 19:863-877
Backenroth, Daniel; Chase, Herbert S; Wei, Ying et al. (2017) Monitoring prescribing patterns using regression and electronic health records. BMC Med Inform Decis Mak 17:175
Chase, Herbert S; Mitrani, Lindsey R; Lu, Gabriel G et al. (2017) Early recognition of multiple sclerosis using natural language processing of the electronic health record. BMC Med Inform Decis Mak 17:24
Yadav, Kabir; Sarioglu, Efsun; Choi, Hyeong Ah et al. (2016) Automated Outcome Classification of Computed Tomography Imaging Reports for Pediatric Traumatic Brain Injury. Acad Emerg Med 23:171-8
Finkelstein, Joseph; Friedman, Carol; Hripcsak, George et al. (2016) Pharmacogenetic polymorphism as an independent risk factor for frequent hospitalizations in older adults with polypharmacy: a pilot study. Pharmgenomics Pers Med 9:107-116
Backenroth, Daniel; Chase, Herbert; Friedman, Carol et al. (2016) Using Rich Data on Comorbidities in Case-Control Study Design with Electronic Health Record Data Improves Control of Confounding in the Detection of Adverse Drug Reactions. PLoS One 11:e0164304
Salmasian, Hojjat; Tran, Tran H; Chase, Herbert S et al. (2015) Medication-indication knowledge bases: a systematic review and critical appraisal. J Am Med Inform Assoc 22:1261-70
Adams, Hayden; Friedman, Carol; Finkelstein, Joseph (2015) Automated Determination of Publications Related to Adverse Drug Reactions in PubMed. AMIA Jt Summits Transl Sci Proc 2015:31-5
Li, Ying; Ryan, Patrick B; Wei, Ying et al. (2015) A Method to Combine Signals from Spontaneous Reporting Systems and Observational Healthcare Data to Detect Adverse Drug Reactions. Drug Saf 38:895-908
Freedberg, Daniel E; Salmasian, Hojjat; Friedman, Carol et al. (2014) Response to Daniell. Am J Gastroenterol 109:922-3

Showing the most recent 10 out of 47 publications