Significant effort has been dedicated to the study of complex diseases through genome-wide association studies (GWASs). However, GWASs have provided few medically actionable results due to several limitations. One limitation of the GWAS approach is the difficulty in identifying functional variants when most assayed SNPs have no known function and/or are considered tags for an ungenotyped or uncharacterized functional variant. A second limitation is the lack of heritability or appreciation for the environmental contribution to the disease under study. Alternative and complementary approaches to the GWAS technique are necessary. A novel strategy to address these limitations includes the use of electronic medical records (EMRs) to conduct a phenome-wide association study (PheWAS) when plausible genetic targets are identified. Whereas GWAS asks What genetic variants are associated with a disease? PheWAS asks What diseases are associated with a genetic variant? The hypothesis being tested in this project is that loss-of-function variants - a class o variation with the highest probability of being clinically relevant - may cause disease phenotypes described in EMRs. To test this hypothesis, we propose the following specific aims: (1) measure associations between thousands of disease phenotypes coded in the EMR for 10,000 Marshfield Clinic patients and loss-of-function SNPs, (2) replicate findings in an independent cohort, and (3) investigate the biological relevance of these associations through functional genomic experiments using patient biospecimens, cell lines, and animal model systems. The results from this study will combine association-based testing with biological experimentation. Therefore, this study is not only innovative in its approach, but also in its capacity to assess the genetic component of many diseases simultaneously, including understudied diseases. In addition, the PheWAS approach has the capacity to characterize multiple diseases that share a common genetic etiology. This may be important for drug repurposing, in that drugs used to treat one disease may also be therapeutic for a different disease, if both share a common genetic link.

Public Health Relevance

Alternative approaches are needed to efficiently identify clinically actionable genetic variants for use in personalized medicine. One such approach includes linking genetic data with in-depth patient data from electronic medical records that describe many diseases simultaneously, coupled with experiments that are designed to understand the biology of the genes and diseases. This strategy may be useful not only in predictive medicine, but may also provide new therapies by expanding treatment options for existing drugs impacting biological pathways shared by multiple diseases.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM114128-03
Application #
9333383
Study Section
Genetics of Health and Disease Study Section (GHD)
Program Officer
Krasnewich, Donna M
Project Start
2015-09-15
Project End
2020-08-31
Budget Start
2017-09-01
Budget End
2018-08-31
Support Year
3
Fiscal Year
2017
Total Cost
Indirect Cost
Name
Marshfield Clinic Research Foundation
Department
Type
DUNS #
074776030
City
Marshfield
State
WI
Country
United States
Zip Code
54449
Bastarache, Lisa; Hughey, Jacob J; Hebbring, Scott et al. (2018) Phenotype risk scores identify patients with unrecognized Mendelian disease patterns. Science 359:1233-1239
Blue, Elizabeth; Louie, Tin L; Chong, Jessica X et al. (2018) Variation in Cilia Protein Genes and Progression of Lung Disease in Cystic Fibrosis. Ann Am Thorac Soc 15:440-448
Huang, Xiayuan; Elston, Robert C; Rosa, Guilherme J et al. (2018) Applying family analyses to electronic health records to facilitate genetic research. Bioinformatics 34:635-642
Carter, Tonia C; Hebbring, Scott J; Liu, Jixia et al. (2018) Pilot screening study of targeted genetic polymorphisms for association with seasonal influenza hospital admission. J Med Virol 90:436-446
Karnes, Jason H; Bastarache, Lisa; Shaffer, Christian M et al. (2017) Phenome-wide scanning identifies multiple diseases and disease severity phenotypes associated with HLA variants. Sci Transl Med 9:
Kim, TaeWon; Havighurst, Thomas; Kim, KyungMann et al. (2017) RNA-Binding Protein IGF2BP1 in Cutaneous Squamous Cell Carcinoma. J Invest Dermatol 137:772-775
Liu, Jixia; Zhao, Ran; Ye, Zhan et al. (2017) Relationship of SULT1A1 copy number variation with estrogen metabolism and human health. J Steroid Biochem Mol Biol 174:169-175
Mosley, Jonathan D; Witte, John S; Larkin, Emma K et al. (2016) Identifying genetically driven clinical phenotypes using linear mixed models. Nat Commun 7:11433
Verma, Anurag; Verma, Shefali S; Pendergrass, Sarah A et al. (2016) eMERGE Phenome-Wide Association Study (PheWAS) identifies clinical associations and pleiotropy for stop-gain variants. BMC Med Genomics 9 Suppl 1:32
Simonti, Corinne N; Vernot, Benjamin; Bastarache, Lisa et al. (2016) The phenotypic legacy of admixture between modern humans and Neandertals. Science 351:737-41

Showing the most recent 10 out of 12 publications