Robust Inference from Observational Data with Distributed Representations of Conceptual Relations

Cohen, Trevor

Abstract

This proposal concerns the development and evaluation of computational methods through which linguistic manifestations of cognitive changes in Alzheimer?s Disease (AD) dementia can be identified in transcribed speech. Such methods are of value even prior to the availability of disease modifying treatments, as through earlier detection they provide the means to reduce the emotional and financial burden on patients, caregivers, and the healthcare system. Lack of a clear diagnosis in the face of cognitive manifestations of dementia can produce uncertainty, and negatively impact planning of future care. Misattributed AD symptoms can lead to social isolation. In addition, it is estimated that early and accurate diagnosis can help save an estimated $7.9 trillion in medical and care costs. With ~30-40% of healthy adults subjectively reporting forgetfulness on a regular basis, there is an urgent need to develop sensitive and specific, easy-to-use, safe, and cost-effective tools for monitoring AD-specific cognitive markers in individuals concerned about their cognitive function. Language reflects cognitive status, but manual analysis of language data is prohibitively time-consuming. In the proposed research we will develop and evaluate computational methods to identify linguistic biomarkers of AD, leveraging perplexity estimates derived from neural language models trained on transcripts of the speech of healthy controls only. This approach deviates from the supervised learning paradigm that characterizes most computational linguistics approaches to identifying AD, obviating the danger of overfitting to the characteristics of participants with dementia represented in the small datasets available for training. Nonetheless, our preliminary research has demonstrated that classification performance on the basis of such perplexity estimates rivals that documented with supervised machine learning models trained on hundreds of manually engineered features. The proposed research will result in a validated set of methods for detection of AD using transcribed speech, methods with the potential for broad dissemination on account of recent advances in automated speech recognition.

Public Health Relevance

The need to monitor unintended effects of medications has been highlighted by several high-profile events in which fatal side effects of approved drugs were detected after their release to market. In the proposed research, we will develop and evaluate methods to identify biologically plausible adverse drug events using both observational data and knowledge extracted from the biomedical literature. If successful, these methods will provide the means for earlier detection of harmful drug effects, limiting consequent morbidity and mortality.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Library of Medicine (NLM)
Type: Research Project (R01)
Project #: 3R01LM011563-06S1
Application #: 9881862
Study Section: Special Emphasis Panel (ZLM1)
Program Officer: Sim, Hua-Chuan

Project Start: 2013-09-01
Project End: 2021-05-31
Budget Start: 2019-07-03
Budget End: 2020-05-31
Support Year: 6
Fiscal Year: 2019
Total Cost
Indirect Cost

Institution

Name: University of Washington
Department: Other Health Professions
Type: Schools of Medicine
DUNS #: 605799469

City: Seattle
State: WA
Country: United States
Zip Code: 98195

Related projects


NIH 2020 R01 LM	Robust Inference from Observational Data with Distributed Representations of Conceptual Relations Cohen, Trevor / University of Washington
NIH 2019 R01 LM	Robust Inference from Observational Data with Distributed Representations of Conceptual Relations Cohen, Trevor / University of Washington
NIH 2019 R01 LM	Robust Inference from Observational Data with Distributed Representations of Conceptual Relations Cohen, Trevor / University of Washington
NIH 2018 R01 LM	Robust Inference from Observational Data with Distributed Representations of Conceptual Relations Cohen, Trevor / University of Washington
NIH 2015 R01 LM	Using Biomedical Knowledge to Identify Plausible Signals for Pharmacovigilance Cohen, Trevor / University of Texas Health Science Center Houston
NIH 2014 R01 LM	Using Biomedical Knowledge to Identify Plausible Signals for Pharmacovigilance Cohen, Trevor / University of Texas Health Science Center Houston
NIH 2014 R01 LM	Using Biomedical Knowledge to Identify Plausible Signals for Pharmacovigilance Cohen, Trevor / University of Texas Health Science Center Houston	$160,000
NIH 2013 R01 LM	Using Biomedical Knowledge to Identify Plausible Signals for Pharmacovigilance Cohen, Trevor / University of Texas Health Science Center Houston	$309,937

Publications

Mower, Justin; Subramanian, Devika; Cohen, Trevor (2018) Learning predictive models of drug side-effect relationships from distributed representations of literature-derived semantic predications. J Am Med Inform Assoc 25:1339-1350

Cai, Ruichu; Liu, Mei; Hu, Yong et al. (2017) Identification of adverse drug-drug interactions through causal association rule discovery from spontaneous adverse event reports. Artif Intell Med 76:7-15

Yu, Zhiguo; Wallace, Byron C; Johnson, Todd et al. (2017) Retrofitting Concept Vector Representations of Medical Concepts to Improve Estimates of Semantic Similarity and Relatedness. Stud Health Technol Inform 245:657-661

Amith, Muhammad; Cunningham, Rachel; Savas, Lara S et al. (2017) Using Pathfinder networks to discover alignment between expert and consumer conceptual knowledge from online vaccine content. J Biomed Inform 74:33-45

Cohen, Trevor; Widdows, Dominic (2017) Embedding of semantic predications. J Biomed Inform 68:150-166

Mower, Justin; Subramanian, Devika; Shang, Ning et al. (2016) Classification-by-Analogy: Using Vector Representations of Implicit Relationships to Identify Plausibly Causal Drug/Side-effect Relationships. AMIA Annu Symp Proc 2016:1940-1949

Malec, Scott A; Wei, Peng; Xu, Hua et al. (2016) Literature-Based Discovery of Confounding in Observational Clinical Data. AMIA Annu Symp Proc 2016:1920-1929

Widdows, Dominic; Cohen, Trevor (2015) Reasoning with Vectors: A Continuous Model for Fast Robust Inference. Log J IGPL 23:141-173

Shang, Ning; Xu, Hua; Rindflesch, Thomas C et al. (2014) Identifying plausible adverse drug reactions using knowledge extracted from the literature. J Biomed Inform 52:293-310

Cohen, T; Widdows, D; Stephan, C et al. (2014) Predicting high-throughput screening results with scalable literature-based discovery methods. CPT Pharmacometrics Syst Pharmacol 3:e140

Comments

Be the first to comment on Trevor Cohen's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: