One of the most pressing problems in identifying and localizing genes influencing diseases is the need to model linkage disequilibrium between single nucleotide polymorphisms in dense genotyping assays. Currently available assays determine genotypes at over 500,000 loci per sample, and this data is being used in multiple study designs. Statistical models and methods are needed to account appropriately for linkage disequilibrium, as well as observational error and population admixture. Naive approaches to the problem that rely on single locus analyses are swamped by the need to correct for multiple and correlated tests. Other approaches, such as those that thin out the loci used to reduce linkage disequilibrium or assume that alleles occur in blocks based on location, while sensible and reasonably efficient, do not exploit all of the potential statistical power and resolution made possible by this kind of data. Graphical models are a class of statistical models that can be applied to joint distributions of multivariate observations. In preliminary work by the principal investigator under a current R21 grant, these have been shown to give both accurate and tractable representations of the patterns of allelic association that occur between proximal genetic loci in a variety of problems. Results have been consistent with other sophisticated modeling methods, such as ancestral recombination graphs. In contrast models in which strong assumptions are made based on physical location of loci, such as low order Markov models, have been shown to be inappropriate for this problem. The purpose of this proposal is to further develop graphical modeling methods for linkage disequilibrium in association studies, identity by descent mapping, and linkage analysis. In particular we focus on model restrictions that will give an order of magnitude improvement in computational efficiency;a new formulation for the linkage analysis problem that should improve the mixing properties of Markov chain Monte Carlo methods;and a novel and general method for approximating complex graphical models with simpler ones. In addition, we pursue an approach to identity by descent mapping that incorporates linkage disequilibrium and is scalable to the whole genome level. For this aim of the project we intend to apply the methods developed to dense genotype assays obtained for distantly related breast cancer cases in extended Utah pedigrees.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM081417-03
Application #
7627382
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Krasnewich, Donna M
Project Start
2007-07-02
Project End
2011-08-31
Budget Start
2009-09-01
Budget End
2010-08-31
Support Year
3
Fiscal Year
2009
Total Cost
$282,192
Indirect Cost
Name
University of Utah
Department
Miscellaneous
Type
Schools of Medicine
DUNS #
009095365
City
Salt Lake City
State
UT
Country
United States
Zip Code
84112
Cai, Zheng; Thomas, Alun; Teerlink, Craig et al. (2012) Pairwise shared genomic segment analysis in three Utah high-risk breast cancer pedigrees. BMC Genomics 13:676
Knight, Stacey; Abo, Ryan P; Abel, Haley J et al. (2012) Shared genomic segment analysis: the power to find rare disease variants. Ann Hum Genet 76:500-9
Thomas, Alun; Abel, Haley J; Di, Yanming et al. (2011) Effect of linkage disequilibrium on the identification of functional variants. Genet Epidemiol 35 Suppl 1:S115-9
Hasstedt, Sandra J; Thomas, Alun (2011) Detecting pleiotropy and epistasis using variance components linkage analysis in jPAP. Hum Hered 72:258-63
Cai, Zheng; Camp, Nicola J; Cannon-Albright, Lisa et al. (2011) Identification of regions of positive selection using Shared Genomic Segment analysis. Eur J Hum Genet 19:667-71
Abel, Haley J; Thomas, Alun (2011) Accuracy and computational efficiency of a graphical modeling approach to linkage disequilibrium estimation. Stat Appl Genet Mol Biol 10:Article 5
Thomas, Alun (2010) Assessment of SNP streak statistics using gene drop simulation with linkage disequilibrium. Genet Epidemiol 34:119-24
Teerlink, Craig C; Thomas, Alun (2010) An application of the latent p value method to assess linkage in asthma pedigrees. Hum Hered 70:1-8
Thomas, Alun (2010) The conditional independences between variables derived from two independent identically distributed Markov random fields when pairwise order is ignored. Math Med Biol 27:283-8
Abo, Ryan; Wong, Jathine; Thomas, Alun et al. (2010) Haplotype association analyses in resources of mixed structure using Monte Carlo testing. BMC Bioinformatics 11:592

Showing the most recent 10 out of 15 publications