The number of known human polymorphisms is growing rapidly and the cost, both in time and money, for ascertaining genotypes has decreased substantially. The result can potentially drive forward the search for genes affecting human phenotypes with positive consequences for understanding, avoiding and treating disease. It will also impact the understanding of drug response and the avoidance of allergic reactions to drugs. While the rate of data gathering has advanced so significantly, the statistical methods to deal with such large and complex data sets have not kept pace. The most pressing problem is that of how to test for association between phenotypes and a vast array of potentially associated loci. Most current methods require multiple testing and suffer the associated loss of statistical power. This problem is further complicated as the loci, and hence the tests, are not independent which leads to over correction. Pairwise measures of linkage disequilibrium, haplotype blocks and hot spots inadequately describe the patterns of associations between the allelic states at proximal loci. More sophisticated coalescent and population genetic models have problems of tractability or of fully incorporating the information from haplotype samples. Graphical modeling provides a statistical framework for characterizing precisely this sort of complex stochastic data. This empirical approach can provide concise, accurate and tractable representations of the joint distribution of alleles at proximal loci. This is directly relevant to such problems as detecting association with phenotypic variables and selecting informative subsets of loci. The great potential of this approach is that categorical phenotypes can be included in the same analysis and association with polymorphisms assessed jointly with the inter locus associations. This proposal is to extend graphical modeling methods already developed by the investigators for haploid data to diploid data, larger genomic regions, admixed populations and family data.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Exploratory/Developmental Grants (R21)
Project #
5R21GM070710-02
Application #
7017782
Study Section
Genome Study Section (GNM)
Program Officer
Anderson, Richard A
Project Start
2005-03-01
Project End
2008-02-28
Budget Start
2006-03-01
Budget End
2008-02-28
Support Year
2
Fiscal Year
2006
Total Cost
$182,484
Indirect Cost
Name
University of Utah
Department
Biostatistics & Other Math Sci
Type
Schools of Medicine
DUNS #
009095365
City
Salt Lake City
State
UT
Country
United States
Zip Code
84112
Abo, Ryan; Wong, Jathine; Thomas, Alun et al. (2010) Haplotype association analyses in resources of mixed structure using Monte Carlo testing. BMC Bioinformatics 11:592
Thomas, Alun (2009) Estimation of graphical models whose conditional independence graphs are interval graphs and its application to modeling linkage disequilibrium. Comput Stat Data Anal 53:1818-1828
Thomas, Alun; Green, Peter J (2009) Enumerating the junction trees of a decomposable graph. J Comput Graph Stat 18:930-940
Thomas, Alun; Green, Peter J (2009) Enumerating the decomposable neighbours of a decomposable graph under a simple perturbation scheme. Comput Stat Data Anal 53:1232-1238
Rausch, Tobias; Thomas, Alun; Camp, Nicola J et al. (2008) A parallel genetic algorithm to discover patterns in genetic markers that indicate predisposition to multifactorial disease. Comput Biol Med 38:826-36
Thomas, A; Camp, N J; Farnham, J M et al. (2008) Shared genomic segment analysis. Mapping disease predisposition genes in extended pedigrees using SNP genotype assays. Ann Hum Genet 72:279-87
Thomas, Alun (2007) Towards linkage analysis with markers in linkage disequilibrium by graphical modelling. Hum Hered 64:16-26
Thomas, Alun; Camp, Nicola J (2006) Maximum likelihood estimates of allele frequencies and error rates from samples of related individuals by gene counting. Bioinformatics 22:771-2
Thomas, Alun (2005) Characterizing allelic associations from unphased diploid data by graphical modeling. Genet Epidemiol 29:23-35
Thomas, Alun (2005) GMCheck: Bayesian error checking for pedigree genotypes and phenotypes. Bioinformatics 21:3187-8

Showing the most recent 10 out of 11 publications