The long-term objective of this research is to increase the clinical trial enrollment of US patients via a semi- automated, Natural Language Processing (NLP) based, interactive and patient-centered informatics application. The study design is prospective observational study. Scope is limited to cancer patients. There are three specific aims for this project.
The first aim i s to identify concepts that overlap between the electronic medical record's (EMR) clinical notes and the free text of clinical trial announcements. The PI will use the concepts to develop mapping frames that connect concepts in the text of trial announcements to those found in clinical notes in the medical record. When he has the mapping frames he will build the NLP module for the application. In the software development work he will utilize as many publicly available software components as possible. He will experiment with UIMA, GATE, MetaMap, Stanford Parser, NegEx algorithm and others. The PI will develop the tool around the National Library of Medicine's Unified Medical Language System knowledgebase. He will use Java for programming.
The second aim i s to create an algorithm that automatically generates questions to request information directly from the patient if the information is not available or accessible in the records.
The third aim i s to evaluate the in-vitro, laboratory performance of the application. For performance evaluation purposes the PI will recruit cancer care specialists to generate the gold standard lists of eligible clinical trials for study patients. He will publicly release the developed code at the end of the grant period. This K99/R00 project will serve the foundation for future R01 grant applications. The PI is fully committed to become faculty in the Clinical Research Informatics domain with a specialization in biomedical NLP. The support of the K99/R00 grant will enable him to acquire substantial formal training in Computational Linguistics while contributing to the body of knowledge of the Clinical Research Informatics field. The five-year grant support will ensure success in his endeavor. The proposed work is highly significant because the dismal clinical trial accrual rates (2-4 % nationally) hampers timely development of new drugs. In addition, studies show that physicians have statistically significant bias against elderly and minority patients to invite participation in clinical trials. The proposed project is synergistic with physician-centered efforts but the goal is to provide individualized, EMR based clinical trial recommendations directly to the patients. The results of this research will empower the patients and elevate their role in the decision making process.

Public Health Relevance

The long-term objective of this research is to increase the clinical trial enrollment of US patients via a semi- automated, Natural Language Processing (NLP) based, interactive and patient-centered informatics application. The proposed work is highly significant because the dismal clinical trial accrual rates (2-4 % nationally) hampers timely development of new drugs. In addition, studies show that physicians have statistically significant bias against elderly and minority patients to invite participation in clinical trials. The proposed project is synergistic with physician-centered efforts but the goal is to provide individualized, electronic medical record based clinical trial recommendations directly to the patients. The results of this research will empower patients and elevate their role in the decision making process.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Career Transition Award (K99)
Project #
1K99LM010227-01
Application #
7770648
Study Section
Special Emphasis Panel (ZLM1-ZH-C (01))
Program Officer
Sim, Hua-Chuan
Project Start
2009-09-30
Project End
2010-09-29
Budget Start
2009-09-30
Budget End
2010-09-29
Support Year
1
Fiscal Year
2009
Total Cost
$84,306
Indirect Cost
Name
University of Washington
Department
Miscellaneous
Type
Schools of Medicine
DUNS #
605799469
City
Seattle
State
WA
Country
United States
Zip Code
98195
Zhai, Haijun; Lingren, Todd; Deleger, Louise et al. (2013) Web 2.0-based crowdsourcing for high-quality gold standard development in clinical natural language processing. J Med Internet Res 15:e73
Namjou, Bahram; Keddache, Mehdi; Marsolo, Keith et al. (2013) EMR-linked GWAS study: investigation of variation landscape of loci for body mass index in children. Front Genet 4:268
Li, Qi; Zhai, Haijun; Deleger, Louise et al. (2013) A sequence labeling approach to link medications and their attributes in clinical notes and clinical trial announcements for information extraction. J Am Med Inform Assoc 20:915-21
Deleger, Louise; Molnar, Katalin; Savova, Guergana et al. (2013) Large-scale evaluation of automated clinical note de-identification and its impact on information extraction. J Am Med Inform Assoc 20:84-94
Deleger, Louise; Li, Qi; Lingren, Todd et al. (2012) Building gold standard corpora for medical natural language processing tasks. AMIA Annu Symp Proc 2012:144-53
Wu, Cuijun; Xia, Fei; Deleger, Louise et al. (2011) Statistical machine translation for biomedical text: are we there yet? AMIA Annu Symp Proc 2011:1290-9
Halgrim, Scott Russell; Xia, Fei; Solti, Imre et al. (2011) A cascade of classifiers for extracting medication information from discharge summaries. J Biomed Semantics 2 Suppl 3:S2
Uzuner, Ozlem; Solti, Imre; Xia, Fei et al. (2010) Community annotation experiment for ground truth generation for the i2b2 medication challenge. J Am Med Inform Assoc 17:519-23
Uzuner, Ozlem; Solti, Imre; Cadag, Eithon (2010) Extracting medication information from clinical text. J Am Med Inform Assoc 17:514-8
Yetisgen-Yildiz, Meliha; Solti, Imre; Xia, Fei (2010) Using Amazon's Mechanical Turk for Annotating Medical Named Entities. AMIA Annu Symp Proc 2010:1316

Showing the most recent 10 out of 13 publications