Clinical classification and prediction are key components of clinical care. Even modest improvements in classification and predictive performance may have significant healthcare consequences in terms of improved patient outcomes and reduced healthcare costs. This proposal describes a new, efficient Bayesian machine-learning method for performing clinical predictions. The method is designed to use both clinical and genome-wide data in predicting patient outcomes. The method's performance will be evaluated using existing clinical and genome-wide data from the Framingham Heart Study that is being made available to qualified researchers through the SHARe resource of the National Center for Biotechnology Information. The outcomes to be predicted in individuals include the onset of major cardiovascular events and death from all causes. The study will evaluate how well these predictions can be made with a new Bayesian method when using traditional clinical data alone, genome-wide data alone, and both types of data together. The method's performance will also be compared that of existing models and methods. The main hypothesis to be tested is that the proposed machine-learning method will be an advancement over existing methods in that it will be computationally feasible to apply it using a combination of traditional clinical data and genome-wide data, and it will yield better predictive performance than do existing predictive models and methods. If shown to be so, this new method is anticipated to provide substantial benefit in future electronic patient-care systems, including applications to computer-based decision support that have available both traditional clinical data and genome-wide data.

Public Health Relevance

Predicting whether an individual will acquire a disease is an important clinical task. This project will develop and evaluate a new computer-based method to predict diseases in individuals based on the use of both traditional clinical data and data about an individual's genetic make up. Advances obtained in predictive performance may have significant healthcare consequences in terms of improved patient care and outcomes.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Research Project (R01)
Project #
5R01LM010020-02
Application #
7860710
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Sim, Hua-Chuan
Project Start
2009-09-01
Project End
2012-08-31
Budget Start
2010-09-01
Budget End
2012-08-31
Support Year
2
Fiscal Year
2010
Total Cost
$582,590
Indirect Cost
Name
University of Pittsburgh
Department
Miscellaneous
Type
Schools of Medicine
DUNS #
004514360
City
Pittsburgh
State
PA
Country
United States
Zip Code
15213
López Pineda, Arturo; Ye, Ye; Visweswaran, Shyam et al. (2015) Comparison of machine learning classifiers for influenza detection from emergency department free-text reports. J Biomed Inform 58:60-69
Balasubramanian, Jeya B; Visweswaran, Shyam; Cooper, Gregory F et al. (2014) Selective model averaging with bayesian rule learning for predictive biomedicine. AMIA Jt Summits Transl Sci Proc 2014:17-22
Avali, Viji R; Cooper, Gregory F; Gopalakrishnan, Vanathi (2014) Application of Bayesian logistic regression to mining biomedical data. AMIA Annu Symp Proc 2014:266-73
Montefusco, David J; Chen, Lujia; Matmati, Nabil et al. (2013) Distinct signaling roles of ceramide species in yeast revealed through systematic perturbation and systems biology analyses. Sci Signal 6:rs14
Hennings-Yeomans, Pablo H; Cooper, Gregory F (2012) Improving the prediction of clinical outcomes from genomic data using multiresolution analysis. IEEE/ACM Trans Comput Biol Bioinform 9:1442-50
Jiang, Xia; Barmada, M Michael; Cooper, Gregory F et al. (2011) A bayesian method for evaluating and discovering disease loci associations. PLoS One 6:e22075
Lustgarten, Jonathan L; Visweswaran, Shyam; Gopalakrishnan, Vanathi et al. (2011) Application of an efficient Bayesian discretization method to biomedical data. BMC Bioinformatics 12:309
Wei, Wei; Visweswaran, Shyam; Cooper, Gregory F (2011) The application of naive Bayes model averaging to predict Alzheimer's disease from genome-wide data. J Am Med Inform Assoc 18:370-5
Jiang, Xia; Neapolitan, Richard E; Barmada, M Michael et al. (2010) A fast algorithm for learning epistatic genomic relationships. AMIA Annu Symp Proc 2010:341-5