Temporal relation discovery for clinical text

Savova, Guergana; Palmer, Martha

Abstract

The overarching long-term vision of our research is to create novel technologies for processing clinical free text. Such technologies will enable sophisticated and efficient indexing, retrieval and data mining over the ever increasing amounts of electronic clinical data. Processing free text poses a number of challenges to which the fields of Artificial intelligence, natural language processing and computer science in general have made advances. Methods for processing free text are informed by linguistic theory combined with the power of statistical inferencing. A key component to the next step, natural language understanding, is discovering events and their relations on a timeline. Temporal relations are of prime importance in biomedicine as they are intrinsically linked to diseases, signs and symptoms, and treatments. Understanding the timeline of clinically relevant events is key to the next generation of translational research where the importance of generalizing over large amounts of data holds the promise of deciphering biomedical puzzles. The goal of our current proposal is to discover temporal relations from clinical free text through achieving four specific aims:
Specific Aim 1 : Develop (1) a temporal relation annotation schema and guidelines for clinical free text based on TimeML, which will require extensions to Treebank, PropBank and VerbNet annotation guidelines to the clinical domain, (2) an annotated corpus following the temporal relations schema with additions to Treebank, PropBank and VerbNet, (3) a descriptive study comparing temporal relations in the clinical and general domains.
Specific Aim 2 : Extend and evaluate existing methods and/or develop new algorithms for temporal relation discovery in the clinical domain. Component-level evaluation Specific Aim 3: Integrate best method and/or a variety of methods for temporal relation discovery into the open source Mayo Clinic IE pipeline and release as open source annotators in the pipeline. Functional testing. Dissemination activities.
Specific Aim 4 : System-level evaluation. Test the functionality of the enhanced Mayo Clinic IE pipeline on translational research use cases, e.g. the progression of colon cancer as documented in clinical notes and pathology reports, the progression of brain tumor as documented in radiology reports. The methods we will use for the temporal relation discovery are based on machine learning, e.g., Support Vector Machine technology. Such methods require the annotation of a reference standard from which the computations are derived. The best methods will be released as part of the Mayo Clinic Information Extraction System for the larger community to use and contribute to. We will test the methods against biomedical queries.

Public Health Relevance

(max 2-3 sentences) Temporal relations are of prime importance in biomedicine as they are intrinsically linked to diseases, signs and symptoms, and treatments. Understanding the timeline of clinically relevant events is key to the next generation of translational research where the importance of generalizing over large amounts of data holds the promise of deciphering biomedical puzzles. The goal of our current proposal is to automatically discover temporal relations from clinical free text and create a timeline.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Library of Medicine (NLM)
Type: Research Project (R01)
Project #: 5R01LM010090-04
Application #: 8535820
Study Section: Biomedical Library and Informatics Review Committee (BLR)
Program Officer: Sim, Hua-Chuan

Project Start: 2010-09-30
Project End: 2014-09-29
Budget Start: 2013-09-30
Budget End: 2014-09-29
Support Year: 4
Fiscal Year: 2013
Total Cost: $663,984
Indirect Cost: $124,234

Institution

Name: Children's Hospital Boston
Department
Type
DUNS #: 076593722

City: Boston
State: MA
Country: United States
Zip Code: 02115

Related projects


NIH 2020 R01 LM	Temporal relation discovery for clinical text Savova, Guergana K.; Palmer, Martha Stone / Boston Children's Hospital
NIH 2019 R01 LM	Temporal relation discovery for clinical text Savova, Guergana K.; Palmer, Martha Stone / Boston Children's Hospital
NIH 2017 R01 LM	Temporal relation discovery for clinical text Savova, Guergana K.; Palmer, Martha Stone / Boston Children's Hospital
NIH 2016 R01 LM	Temporal relation discovery for clinical text Savova, Guergana K.; Palmer, Martha Stone / Children's Hospital Boston
NIH 2015 R01 LM	Temporal relation discovery for clinical text Savova, Guergana K.; Palmer, Martha Stone / Children's Hospital Boston
NIH 2013 R01 LM	Temporal relation discovery for clinical text Savova, Guergana K.; Palmer, Martha Stone / Children's Hospital Boston	$663,984
NIH 2012 R01 LM	Temporal relation discovery for clinical text Savova, Guergana K.; Palmer, Martha Stone / Children's Hospital Boston	$745,770
NIH 2011 R01 LM	Temporal relation discovery for clinical text Savova, Guergana K.; Palmer, Martha Stone / Children's Hospital Boston	$695,125
NIH 2010 R01 LM	Temporal relation discovery for clinical text Savova, Guergana K.; Palmer, Martha Stone / Children's Hospital Boston	$775,112

Publications

Névéol, Aurélie; Dalianis, Hercules; Velupillai, Sumithra et al. (2018) Clinical Natural Language Processing in languages other than English: opportunities and challenges. J Biomed Semantics 9:12

Gonzalez-Hernandez, G; Sarker, A; O'Connor, K et al. (2017) Capturing the Patient's Perspective: a Review of Advances in Natural Language Processing of Health-Related Text. Yearb Med Inform 26:214-227

Miller, Timothy; Dligach, Dmitriy; Bethard, Steven et al. (2017) Towards generalizable entity-centric clinical coreference resolution. J Biomed Inform 69:251-258

Savova, Guergana K; Tseytlin, Eugene; Finan, Sean et al. (2017) DeepPhe: A Natural Language Processing System for Extracting Cancer Phenotypes from Clinical Records. Cancer Res 77:e115-e118

Lin, Chen; Dligach, Dmitriy; Miller, Timothy A et al. (2016) Multilayered temporal modeling for the clinical domain. J Am Med Inform Assoc 23:387-95

Lin, Chen; Karlson, Elizabeth W; Dligach, Dmitriy et al. (2015) Automatic identification of methotrexate-induced liver toxicity in patients with rheumatoid arthritis from the electronic medical record. J Am Med Inform Assoc 22:e151-61

Pradhan, Sameer; Elhadad, Noémie; South, Brett R et al. (2015) Evaluating the state of the art in disorder recognition and normalization of the clinical narrative. J Am Med Inform Assoc 22:143-54

Luo, Xiaoqiang; Pradhan, Sameer; Recasens, Marta et al. (2014) An Extension of BLANC to System Mentions. Proc Conf Assoc Comput Linguist Meet 2014:24-29

Styler 4th, William F; Bethard, Steven; Finan, Sean et al. (2014) Temporal Annotation in the Clinical Domain. Trans Assoc Comput Linguist 2:143-154

Pfiffner, Pascal B; Oh, JiWon; Miller, Timothy A et al. (2014) ClinicalTrials.gov as a data source for semi-automated point-of-care trial eligibility screening. PLoS One 9:e111055

Showing the most recent 10 out of 18 publications

Comments

Be the first to comment on Guergana Savova's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: