The completion of the human genome project brought the promise of genomic medicine-the use of genomic information for prevention, diagnosis and treatment of diseases. Yet, despite great progress in genotyping technologies, our ability to predict genetic predisposition to complex human traits and diseases remains very limited. Part of the explanation of our paradoxical lack of ability to predict complex human traits may reside in the limitations posed by the statistical methods commonly used in genome wide association studies. We believe that alternative methods, largely adapted from the field of animal breeding (WGP, whole-genome prediction), can enhance our ability to predict complex human traits and diseases, thus paving the way towards more intensive use of genomic information in personalized medicine. However, the populations to which WGP has been successfully applied differ greatly from human populations in aspects such as selection history, distribution of allele frequency, extent linkage disequilibrium (LD) and inbreeding. And preliminary evidence indicates that these factors can impact the predictive performance of WGP. Therefore, a comprehensive evaluation of WGP with human data is needed, and new methods may need to be developed to cope with the challenges posed by the prediction of complex human traits. We propose a framework to study the factors affecting the ability of WGP to account for and to predict variance at un-observed QTL. Using this framework, and a combination of simulation and real data analysis, we will produce the first comprehensive evaluation of existing WGP with human data and will quantify the effects of key features of the data, of the trait of interest, and of the regression method on the prediction accuracy of existing WGP procedures. We will use this information to develop new methods designed to confront the limitations of existing ones.

National Institute of Health (NIH)
Research Project (R01)
Project #
Application #
Study Section
Genetic Variation and Evolution Study Section (GVE)
Program Officer
Krasnewich, Donna M
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Alabama Birmingham
Biostatistics & Other Math Sci
Schools of Public Health
United States
Zip Code
de los Campos, G; Sorensen, D (2014) On the genomic analysis of data from structured populations. J Anim Breed Genet 131:163-4
PĂ©rez, Paulino; de los Campos, Gustavo (2014) Genome-wide regression and prediction with the BGLR statistical package. Genetics 198:483-95
Jarquin, Diego; Crossa, Jose; Lacaze, Xavier et al. (2014) A reaction norm model for genomic selection using high-dimensional genomic and environmental data. Theor Appl Genet 127:595-607
Fernando, Rohan L; Dekkers, Jack Cm; Garrick, Dorian J (2014) A class of Bayesian methods to combine large numbers of genotyped and non-genotyped animals for whole-genome analyses. Genet Sel Evol 46:50
Klimentidis, Yann C; Wineinger, Nathan E; Vazquez, Ana I et al. (2014) Multiple metabolic genetic risk scores and type 2 diabetes risk in three racial/ethnic groups. J Clin Endocrinol Metab 99:E1814-8
de los Campos, Gustavo; Sorensen, Daniel A (2013) A commentary on Pitfalls of predicting complex traits from SNPs. Nat Rev Genet 14:894
Crossa, Jose; Beyene, Yoseph; Kassa, Semagn et al. (2013) Genomic prediction in maize breeding populations with genotyping-by-sequencing. G3 (Bethesda) 3:1903-26
Daetwyler, Hans D; Calus, Mario P L; Pong-Wong, Ricardo et al. (2013) Genomic prediction in animals and plants: simulation of data, validation, reporting, and benchmarking. Genetics 193:347-65
Habier, David; Fernando, Rohan L; Garrick, Dorian J (2013) Genomic BLUP decoded: a look into the black box of genomic prediction. Genetics 194:597-607
de Los Campos, Gustavo; Perez, Paulino; Vazquez, Ana I et al. (2013) Genome-enabled prediction using the BLR (Bayesian Linear Regression) R-package. Methods Mol Biol 1019:299-320

Showing the most recent 10 out of 12 publications