The need to monitor unintended effects of approved drugs has been highlighted by several recent high-profile events in which fatal side effects of drugs were detected after their release to market. Notoriously, the COX-2 inhibitor Rofecoxib (Vioxx) was withdrawn from the market on account of evidence suggesting that treatment with the drug increased the rate of coronary artery disease, and new evidence has recently emerged suggesting that the commonly used antibiotic Azithromycin (Zithromax) may cause fatal arrhythmias. In an effort to mitigate the morbidity and mortality resulting from such undetected side effects, regulatory bodies such as the Food and Drug Administration (FDA) have instituted spontaneous reporting systems to systematize post-marketing surveillance. However, there is evidence that under-reporting of adverse drug events (ADEs) is widespread. Automated monitoring of events documented in the Electronic Health Record (EHR) as free text or structured data has been proposed as a path toward earlier identification of meaningfully correlated drug-event pairs. As these pairs must ultimately be reviewed by domain experts to assess their implications, there is a pressing need to develop methods to selectively identify plausible drug-event pairs within the large pool of correlations to be found in clinical data. In the proposed research, we will develop and evaluate models of biological plausibility, based on knowledge extracted from the biomedical literature and using methods of hyperdimensional computing for efficient search and inference across multiple concepts and relations simultaneously. These methods will be used to selectively identify plausible drug-event pairs found in structured clinical data and those extracted from unstructured data using natural language extraction.
The developed methods will be evaluated formatively, for their ability to rediscover known side effects from the biomedical literature, and summatively, for their ability to improve on the precision with which effects are attributed to a set of known drugs by statistical methods alone. In addition, we will evaluate their ability to predict recent FDA warnings using historical data and knowledge. If successful, the proposed research will provide the means to automatically identify plausible drug-event pairs for regulatory purposes, mitigating consequent morbidity and mortality. Furthermore, the methods will provide a generalizable approach that can be used to apply knowledge derived from the biomedical literature to interpret clinical data.

Public Health Relevance

The need to monitor unintended effects of medications has been highlighted by several high-profile events in which fatal side effects of approved drugs were detected after their release to market. In the proposed research, we will develop and evaluate methods to automatically identify biologically plausible adverse drug events found within clinical patient records, using knowledge extracted from the biomedical literature. If successful, these methods will provide the means for earlier detection of harmful drug effects, limiting consequent morbidity and mortality.

National Institutes of Health (NIH)
National Library of Medicine (NLM)
Research Project (R01)
Study Section: Biomedical Library and Informatics Review Committee (BLR)
Program Officer: Sim, Hua-Chuan
University of Texas Health Science Center Houston
Schools of Allied Health Profes
United States
Mower, Justin; Subramanian, Devika; Cohen, Trevor (2018) Learning predictive models of drug side-effect relationships from distributed representations of literature-derived semantic predications. J Am Med Inform Assoc 25:1339-1350
Cai, Ruichu; Liu, Mei; Hu, Yong et al. (2017) Identification of adverse drug-drug interactions through causal association rule discovery from spontaneous adverse event reports. Artif Intell Med 76:7-15
Yu, Zhiguo; Wallace, Byron C; Johnson, Todd et al. (2017) Retrofitting Concept Vector Representations of Medical Concepts to Improve Estimates of Semantic Similarity and Relatedness. Stud Health Technol Inform 245:657-661
Amith, Muhammad; Cunningham, Rachel; Savas, Lara S et al. (2017) Using Pathfinder networks to discover alignment between expert and consumer conceptual knowledge from online vaccine content. J Biomed Inform 74:33-45
Cohen, Trevor; Widdows, Dominic (2017) Embedding of semantic predications. J Biomed Inform 68:150-166
Mower, Justin; Subramanian, Devika; Shang, Ning et al. (2016) Classification-by-Analogy: Using Vector Representations of Implicit Relationships to Identify Plausibly Causal Drug/Side-effect Relationships. AMIA Annu Symp Proc 2016:1940-1949
Malec, Scott A; Wei, Peng; Xu, Hua et al. (2016) Literature-Based Discovery of Confounding in Observational Clinical Data. AMIA Annu Symp Proc 2016:1920-1929
Widdows, Dominic; Cohen, Trevor (2015) Reasoning with Vectors: A Continuous Model for Fast Robust Inference. Log J IGPL 23:141-173
Shang, Ning; Xu, Hua; Rindflesch, Thomas C et al. (2014) Identifying plausible adverse drug reactions using knowledge extracted from the literature. J Biomed Inform 52:293-310
Cohen, T; Widdows, D; Stephan, C et al. (2014) Predicting high-throughput screening results with scalable literature-based discovery methods. CPT Pharmacometrics Syst Pharmacol 3:e140