Understanding the genetic etiology of diseases, both common and rare, is paramount to the application of genetics in """"""""personalized medicine."""""""" All diseases are the result of a combination of environmental and/or genetic factors. The amount of variation in a disease/trait that is due to a genetic contribution is called """"""""heritability."""""""" Conducing genetic/genomic studies is difficult without evidence of a strong genetic component by heritability measurements. Heritability can be measured in twins or other family structures, but is challenged by the resources required to collect the limited available families with appropriate phenotypic data. Moreover, even when these families are identified, heritability measurements are typically restricted to a single disease. The proposed research addresses these challenges by applying well-developed statistical methods in novel ways to simultaneously calculate heritability for many diseases using data available in the electronic medical record (EMR). The proposed project will test the hypothesis that thousands of clinical phenotypes, defined by patient medical records in families, can be used to measure heritability to direct further genetic/genomic studies. We call this novel bioinformatic method Phenome-wide Scan of Heritability (PheSH). The PheSH concept resulted from my work on Phenome-Wide Association Studies (PheWAS) conducted during my NLM-supported mentored training. Both PheWAS and PheSH are phenotype- independent approaches that allow for the genetic study of many clinical diseases or traits simultaneously. In the independent phase of my career, I plan to continue developing phenotype-independent techniques, including PheSH, to study the genetic etiology of human disease.

Public Health Relevance

All diseases are the result of a combination of genetic factors, also known as heritable factors, and/or environmental factors. This study is designed to measure the heritability of thousands of clinically significant diseases using multiple family structures and electronic medical records. The goal of this project is to identify genetic mutations that explain the strong heritability measurements so that genetics can be applied in personalized medicine.

National Institute of Health (NIH)
National Library of Medicine (NLM)
Career Transition Award (K22)
Project #
Application #
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Ye, Jane
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Marshfield Clinic Research Foundation
United States
Zip Code
Liu, Jixia; Ye, Zhan; Mayer, John G et al. (2016) Phenome-wide association study maps new diseases to the human major histocompatibility complex region. J Med Genet 53:681-9
Fritsche, Lars G; Igl, Wilmar; Bailey, Jessica N Cooke et al. (2016) A large genome-wide association study of age-related macular degeneration highlights contributions of rare and common variants. Nat Genet 48:134-43
Simonti, Corinne N; Vernot, Benjamin; Bastarache, Lisa et al. (2016) The phenotypic legacy of admixture between modern humans and Neandertals. Science 351:737-41
Mosley, Jonathan D; Witte, John S; Larkin, Emma K et al. (2016) Identifying genetically driven clinical phenotypes using linear mixed models. Nat Commun 7:11433
Ye, Zhan; Mayer, John; Ivacic, Lynn et al. (2015) Phenome-wide association studies (PheWASs) for functional variants. Eur J Hum Genet 23:523-9
Rastegar-Mojarad, Majid; Ye, Zhan; Kolesar, Jill M et al. (2015) Opportunities for drug repositioning from phenome-wide association studies. Nat Biotechnol 33:342-5
Hebbring, Scott J; Rastegar-Mojarad, Majid; Ye, Zhan et al. (2015) Application of clinical text data for phenome-wide association studies (PheWASs). Bioinformatics 31:1981-7
Mayer, John; Kitchner, Terrie; Ye, Zhan et al. (2014) Use of an electronic medical record to create the marshfield clinic twin/multiple birth cohort. Genet Epidemiol 38:692-8