Efficient electronic phenotyping using APHRODITE in the Million Veteran Program

Lee, Jennifer; Assimes, Themistocles

Abstract

The Million Veteran Program (MVP) is currently the largest biobank study in the world. The resource provides an unprecedented opportunity to identify the genetic causes of a variety of human diseases that disproportionately affect our veterans, including diseases that affect the neurological, cardiovascular, pulmonary, gastrointestinal, endocrine, and musculoskeletal systems. Fast-paced technological progress over the last 10 years now allows us to reliably and densely profile individuals across their entire genome. Such data has already been generated and linked to a wide spectrum of human diseases and physiologic traits. However, many more links remain to be made, which will provide the scientific community with additional important clues on the root causes of many life-threatening diseases as well as valuable insights on how to develop new drugs to treat or prevent these same diseases. The current challenge in making these additional discoveries is no longer the generation of high-quality genetic data in large numbers but rather the organization and querying of the very large and complex electronic health records (EHR) being leveraged by these large biobank studies. Until now, much effort and time have been expended to painstakingly develop and validate rules-based definitions to identify individuals with a specific disease, syndrome, or state across a variety of EHR platforms. However, the recent mapping of the VA corporate data warehouse to the Observational Medical Outcomes Partnership common data model (OMOP-CDM) provides us with unprecedented opportunities to apply new "electronic phenotyping" tools that can identify individuals with a specific disease, syndrome, or state in a much more efficient manner than rules-based methods.
The goal of this proposal is to comprehensively test the ability of one of these new tools, named APHRODITE (Automated PHenotype Routine for Observational Definition, Identification, Training and Evaluation), to identify established genetic links among MVP participants. APHRODITE was developed at Stanford by one of our co-investigators and uses state-of-the-art machine learning algorithms to identify individuals with a condition in a fraction of the time it takes to identify them through rules-based definitions. The algorithm has shown great promise within the Stanford clinical data warehouse but requires validation in other EHR cohorts.
In aim 1, we will compare the accuracy of an APHRODITE classifier to that of a rules-based classifier for at least 5 diseases using gold-standard sets in the VA.
In aim 2, we will test whether APHRODITE classifiers from aim 1 can be applied to MVP participants to replicate established genetic associations. If automated methods in APHRODITE perform as well as or better than rules-based methods for multiple diseases, automated methods may be leveraged for phenotypes where rules-based methods may not exist, maximizing the efficiency of genetic discovery in MVP and facilitating rapid replication of findings within MVP in other EHRs mapped to the OMOP-CDM.

Public Health Relevance

Inherited differences in our DNA play an important role in the development of nearly all human diseases. Linking these differences to diseases has recently been greatly facilitated by large studies of humans with electronic health records and genetic profiling. In this proposal, we will test the ability of a new computer algorithm named APHRODITE to efficiently identify individuals with a disease within the Million Veteran Program and to link them to inherited changes in the DNA that are known to predispose to the same disease.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: Veterans Affairs (VA)
Type: Non-HHS Research Projects (I01)
Project #: 5I01HX002487-02
Application #: 9955052
Study Section: Special Initiatives - MVP Projects (SPLM)

Project Start: 2019-08-01
Project End: 2021-07-31
Budget Start: 2020-08-01
Budget End: 2021-07-31
Support Year: 2
Fiscal Year: 2021

Institution

Name: Veterans Admin Palo Alto Health Care Sys
DUNS #: 046017455

City: Palo Alto
State: CA
Country: United States
Zip Code: 94304

Related projects


NIH 2021 I01 VA	Efficient electronic phenotyping using APHRODITE in the Million Veteran Program Lee, Jennifer Shuwen; Assimes, Themistocles Leonard / Veterans Admin Palo Alto Health Care Sys
NIH 2019 I01 VA	Efficient electronic phenotyping using APHRODITE in the Million Veteran Program Lee, Jennifer Shuwen / Veterans Admin Palo Alto Health Care Sys
