Rapid growth in the clinical implementation of large electronic medical records (EMRs) has led to an unprecedented expansion in the availability of dense longitudinal datasets for observational research. More recently, major efforts have linked EMR databases with archived biological material to accelerate research in personalized medicine. Studies using EMR-linked DNA biobanks have identified common and rare genetic variants that contribute to risk of disease. An appealing vision, which has not been extensively explored, is to use EMR-linked biobanks for pharmacogenomic studies, which identify associations between genetic variation and drug efficacy and toxicity. The longitudinal nature of the data contained within EMRs makes them ideal for quantifying drug outcome (both efficacy and toxicity). Efforts are already underway to link these EMRs across institutions and to standardize the definition of phenotypes for large-scale studies of treatment outcome, specifically within the context of routine clinical care. Despite their success, EMR-based pharmacogenomic studies are often hampered by their data-intensive nature: it is time-consuming and costly to extract and integrate data from multiple heterogeneous EMR databases for large-scale pharmacogenomic studies. Informatics for Integrating Biology and the Bedside (i2b2) is a National Center for Biomedical Computing based at Partners Healthcare System. i2b2 has developed a scalable informatics framework to enable clinical researchers to repurpose existing EMR data for clinical and genomic discovery.
In this study, we will collaborate with i2b2 to extend its informatics framework to the pharmacogenomics domain through the following specific aims: 1) develop new methods to extract and model drug exposure and outcome information from EMRs and integrate them with the i2b2 NLP components; 2) build ontology tools to normalize and integrate pharmacogenomic data across different sites; 3) conduct known and novel pharmacogenomic studies to evaluate and refine the tools developed in Aims 1 and 2; and 4) disseminate the developed informatics tools among pharmacogenomic researchers.

Public Health Relevance

Longitudinal electronic medical records (EMRs) linked with DNA biobanks have become valuable resources for genomic and pharmacogenomic research, allowing identification of associations between genetic variation and drug efficacy and toxicity. Informatics for Integrating Biology and the Bedside (i2b2), a National Center for Biomedical Computing based at Partners Healthcare System, has developed a scalable informatics framework to enable clinical researchers to use existing EMR data for genomic knowledge discovery about diseases. In this study, we will collaborate with i2b2 to extend its informatics framework to the pharmacogenomics domain by developing new natural language processing components, ontology components, and user-friendly interfaces, and then applying these tools to real-world pharmacogenomic studies.

Agency: National Institutes of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Research Project (R01)
Project #: 5R01GM103859-02
Application #: 8929257
Study Section: Biodata Management and Analysis Study Section (BDMA)
Program Officer: Long, Rochelle M
Project Start: 2014-09-18
Project End: 2016-05-31
Budget Start: 2015-06-01
Budget End: 2016-05-31
Support Year: 2
Fiscal Year: 2015
Total Cost:
Indirect Cost:
Name: Vanderbilt University Medical Center
Department: Internal Medicine/Medicine
Type: Schools of Medicine
DUNS #: 004413456
City: Nashville
State: TN
Country: United States
Zip Code: 37240
Morash, Maggie; Mitchell, Hannah; Yu, Anthony et al. (2018) CATCH-KB: Establishing a Pharmacogenomics Variant Repository for Chemotherapy-Induced Cardiotoxicity. AMIA Jt Summits Transl Sci Proc 2017:168-177
Wei, Wei-Qi; Li, Xiaohui; Feng, Qiping et al. (2018) LPA Variants Are Associated With Residual Cardiovascular Risk in Patients Receiving Statins. Circulation 138:1839-1849
Lee, Hee-Jin; Zhang, Yaoyun; Jiang, Min et al. (2018) Identifying direct temporal relations between time and events from clinical notes. BMC Med Inform Decis Mak 18:49
Wu, Yonghui; Jiang, Min; Xu, Jun et al. (2017) Clinical Named Entity Recognition Using Deep Learning Models. AMIA Annu Symp Proc 2017:1812-1819
Miller, Timothy; Dligach, Dmitriy; Bethard, Steven et al. (2017) Towards generalizable entity-centric clinical coreference resolution. J Biomed Inform 69:251-258
Cai, Ruichu; Liu, Mei; Hu, Yong et al. (2017) Identification of adverse drug-drug interactions through causal association rule discovery from spontaneous adverse event reports. Artif Intell Med 76:7-15
Lee, Kathleen; Weiskopf, Nicole; Pathak, Jyotishman (2017) A Framework for Data Quality Assessment in Clinical Research Datasets. AMIA Annu Symp Proc 2017:1080-1089
Wiley, Laura K; Vanhouten, Jacob P; Samuels, David C et al. (2017) Strategies for Equitable Pharmacogenomic-Guided Warfarin Dosing Among European and African American Individuals in a Clinical Population. Pac Symp Biocomput 22:545-556
Ji, Zongcheng; Zhang, Yaoyun; Xu, Jun et al. (2017) Comparing Cancer Information Needs for Consumers in the US and China. Stud Health Technol Inform 245:126-130
Savova, Guergana K; Tseytlin, Eugene; Finan, Sean et al. (2017) DeepPhe: A Natural Language Processing System for Extracting Cancer Phenotypes from Clinical Records. Cancer Res 77:e115-e118

Showing the most recent 10 out of 39 publications