With the increased use of genetic testing, whether through broadening of testing criteria or the well documented ordering of tests by physicians because of patient demand, and furthermore the commoditization of measurements of thousands of variants on a single individual, the number of healthy individuals undergoing genetic testing is accelerating. The growth in the number of false positives is projected to rise dramatically because many of the current annotations for known mutations have not been developed for the asymptomatic, well population. With the false positive results, patients will be alarmed unnecessarily, and also clinicians will order unnecessary and occasionally risky tests. Furthermore, insurance companies will find the growth in unnecessary secondary testing triggered by this tsunami of false positive tests (""""""""the incidentalome"""""""") as an unanticipated financial risk and thereby endanger the very real benefits that the sound use of genetic testing and long-anticipated genomically-enable """"""""personalized medicine"""""""" can bring. We propose to demonstrate that this risk of the incidentalome is substantial by automatically mining the biomedical literature, by scanning public genomic data sets, and computationally predicting the effects of mutations to identify a set of candidate mutations that are predicted to not contribute to disease in the general population. This despite their annotation in authoritative genetic databases and texts as """"""""highly penetrant"""""""" in causing disease congenitally or in childhood. Two thousand adult patients known not to have these diseases will be then tested for each of these candidate mutations with the hypothesis that we will demonstrate the presence of some of these mutations in these health individuals. Evidence supporting this hypothesis will both serve as an important caution in the use of genetic testing and will also demonstrate the value of population studies to find the broader clinical significance of genetic mutations.

Public Health Relevance

The first rule of medicine is to do no harm. The current state of annotation of genetic tests is inadequate to support clinical application to large segments of our patient population. In the context of increased use of genetic testing, we are poised to unintentionally inflict anxiety, unnecessary and costly and risky additional testing. This proposal will provide a demonstration of the reality of this risk and point the way to developing evidence-based annotations that can be safely used for the interpretation of genetic tests in the general population.

National Institute of Health (NIH)
National Library of Medicine (NLM)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZLM1-AP-E (M3))
Program Officer
Ye, Jane
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Harvard University
Schools of Medicine
United States
Zip Code
Kohane, Isaac S; Valtchinov, Vladimir I (2012) Quantifying the white blood cell transcriptome as an accessible window to the multiorgan transcriptome. Bioinformatics 28:538-45
Palmer, Nathan P; Schmid, Patrick R; Berger, Bonnie et al. (2012) A gene expression profile of stem cell pluripotentiality and differentiation is conserved across diverse solid and hematopoietic cancers. Genome Biol 13:R71
Kohane, Isaac S; Hsing, Michael; Kong, Sek Won (2012) Taxonomizing, sizing, and overcoming the incidentalome. Genet Med 14:399-404
Krallinger, Martin; Vazquez, Miguel; Leitner, Florian et al. (2011) The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text. BMC Bioinformatics 12 Suppl 8:S3
Alexandrov, Boian S; Valtchinov, Vlad I; Alexandrov, Ludmil B et al. (2011) DNA dynamics is likely to be a factor in the genomic nucleotide repeats expansions related to diseases. PLoS One 6:e19800
Agarwal, Shashank; Yu, Hong; Kohane, Issac (2011) BioNOT: a searchable database of biomedical negated sentences. BMC Bioinformatics 12:420
Lu, Zhiyong; Kao, Hung-Yu; Wei, Chih-Hsuan et al. (2011) The gene normalization task in BioCreative III. BMC Bioinformatics 12 Suppl 8:S2
Cao, Yonggang; Li, Zuofeng; Liu, Feifan et al. (2010) An IR-aided machine learning framework for the BioCreative II.5 Challenge. IEEE/ACM Trans Comput Biol Bioinform 7:454-61
Agarwal, Shashank; Yu, Hong (2010) Biomedical negation scope detection with conditional random fields. J Am Med Inform Assoc 17:696-701