Rapid growth in the clinical implementation of large electronic medical records (EMRs) has led to an unprecedented expansion in the availability of dense longitudinal datasets for observational research. More recently, huge efforts have linked EMR databases with archived biological material, to accelerate research in personalized medicine. EMR- linked DNA biobanks have identified common and rare genetic variants that contribute to risk of disease. An appealing vision, which has not been extensively explored, is to use EMRs-linked biobanks for pharmacogenomic studies, which identify associations between genetic variation and drug efficacy and toxicity. The longitudinal nature of the data contained within EMRs make them ideal for quantifying drug outcome (both efficacy and toxicity). Efforts are already underway to link these EMRs across institutions, and standardize the definition of phenotypes for large-scale studies of treatment outcome, specifically within the context of routine clinical care. Despite its success, EMR-based pharmacogenomic studies are often hampered by its data-intensive nature -- it is time- consuming and costly to extract and integrate data from multiple heterogeneous EMR databases, for large-scale pharmacogenomic studies. The Informatics for Integrating Biology and the Bedside (i2b2) is a National Center for Biomedical Computing based at Partners Healthcare System. I2b2 has developed a scalable informatics framework to enable clinical researchers to repurpose existing EMR data for clinical and genomic discovery. In this study, we will collaborate with i2b2 to extend its informatics framework to the pharmacogenomics domain, by proposing the following specific aims: 1) Develop new methods to extract and model drug exposure and outcome information from EMR and integrate them with the i2b2 NLP components; 2) Build ontology tools to normalize and integrate pharmacogenomic data across different sites; 3) Conduct known and novel pharmacogenomic studies to evaluate and refine tools developed in Aim 1 and 2; and 4) Disseminate the developed informatics tools among pharmacogenomic researchers.
Longitudinal electronic medical records (EMRs) linked with DNA biobanks have become valuable resources for genomic and pharmacogenomics research, allowing identification of associations between genetic variations and drug efficacy and toxicity. The Informatics for Integrating Biology and the Bedside (i2b2), a National Center for Biomedical Computing based at Partners Healthcare System, has developed a scalable informatics framework to enable clinical researchers to use existing EMR data for genomic knowledge discovery of diseases. In this study, we will collaborate with i2b2 to extend its informatics framework to the pharmacogenomics domain, by developing new natural language processing, ontology components, and user-friendly interfaces, and then apply these tools to real-world pharmacogenomic studies.
|Lee, Hee-Jin; Zhang, Yaoyun; Jiang, Min et al. (2018) Identifying direct temporal relations between time and events from clinical notes. BMC Med Inform Decis Mak 18:49|
|Morash, Maggie; Mitchell, Hannah; Yu, Anthony et al. (2018) CATCH-KB: Establishing a Pharmacogenomics Variant Repository for Chemotherapy-Induced Cardiotoxicity. AMIA Jt Summits Transl Sci Proc 2017:168-177|
|Wei, Wei-Qi; Li, Xiaohui; Feng, Qiping et al. (2018) LPA Variants Are Associated With Residual Cardiovascular Risk in Patients Receiving Statins. Circulation 138:1839-1849|
|Wu, Yonghui; Jiang, Min; Xu, Jun et al. (2017) Clinical Named Entity Recognition Using Deep Learning Models. AMIA Annu Symp Proc 2017:1812-1819|
|Miller, Timothy; Dligach, Dmitriy; Bethard, Steven et al. (2017) Towards generalizable entity-centric clinical coreference resolution. J Biomed Inform 69:251-258|
|Cai, Ruichu; Liu, Mei; Hu, Yong et al. (2017) Identification of adverse drug-drug interactions through causal association rule discovery from spontaneous adverse event reports. Artif Intell Med 76:7-15|
|Lee, Kathleen; Weiskopf, Nicole; Pathak, Jyotishman (2017) A Framework for Data Quality Assessment in Clinical Research Datasets. AMIA Annu Symp Proc 2017:1080-1089|
|Wiley, Laura K; Vanhouten, Jacob P; Samuels, David C et al. (2017) STRATEGIES FOR EQUITABLE PHARMACOGENOMIC-GUIDED WARFARIN DOSING AMONG EUROPEAN AND AFRICAN AMERICAN INDIVIDUALS IN A CLINICAL POPULATION. Pac Symp Biocomput 22:545-556|
|Ji, Zongcheng; Zhang, Yaoyun; Xu, Jun et al. (2017) Comparing Cancer Information Needs for Consumers in the US and China. Stud Health Technol Inform 245:126-130|
|Savova, Guergana K; Tseytlin, Eugene; Finan, Sean et al. (2017) DeepPhe: A Natural Language Processing System for Extracting Cancer Phenotypes from Clinical Records. Cancer Res 77:e115-e118|
Showing the most recent 10 out of 39 publications