? ? Recently, association studies are being favored over traditional linkage analyses of extended families to determine the genetics of complex diseases. The use of bi-allelic single nucleotide polymorphisms (SNPs) as markers for association/linkage studies has become more common due to their availability and high frequency throughout the genome. With the advent of such new technologies as the Perlgen chip, it is possible to genotype about 1.6 million SNPs on the whole genome on each individual. However, this creates a new set of problems and challenges in analyzing data with missing values. Missing values are inevitable in genotypic as well as phenotypic data, and clearly incomplete data could give rise to undependable and/or biased results in gene mapping. The missing data problems in familial data with phenotypes and the large number of available genotypes have not been adequately addressed. Most statistical analysis software uses list-wise or pair-wise deletion that can potentially create a problem by deleting most of the data for the analysis thus resulting in serious bias. Currently, there is no software available for Multiple Imputation for familial or case-control genetic data. The overall goal of this proposal is to provide a bridge between, on the one hand, these well-established techniques for estimation in the presence of missing data and, on the other hand, the most commonly used statistical analysis methods of genetics data. In particular, the Specific Aims of this proposal are: ? ? Specific Aim 1: To model and simulate multivariate phenotypic and genotypic data for family and case-control design based on real data. ? ? Specific Aim 2: To compare missing data procedures for robustness and reliability using simulation and then to develop Multiple Imputation software. ? ? ?

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Exploratory/Developmental Grants (R21)
Project #
5R21LM008791-02
Application #
7363679
Study Section
Special Emphasis Panel (ZLM1-ZH-R (O1))
Program Officer
Ye, Jane
Project Start
2007-03-01
Project End
2010-02-28
Budget Start
2008-03-01
Budget End
2010-02-28
Support Year
2
Fiscal Year
2008
Total Cost
$138,453
Indirect Cost
Name
University of Alabama Birmingham
Department
Biostatistics & Other Math Sci
Type
Schools of Public Health
DUNS #
063690705
City
Birmingham
State
AL
Country
United States
Zip Code
35294
Tiwari, Hemant K; Birkner, Thomas; Moondan, Ankur et al. (2011) Accurate and flexible power calculations on the spot: Applications to genomic research. Stat Interface 4:353-358
Tiwari, Hemant K; Patki, Amit; Allison, David B (2010) Within-Cluster Resampling for Analysis of Family Data: Ready for Prime-Time? Stat Interface 3:169-176
Padilla, Miguel A; Divers, Jasmin; Vaughan, Laura K et al. (2009) Multiple imputation to correct for measurement error in admixture estimates in genetic structured association testing. Hum Hered 68:65-72
Tiwari, Hemant K; Barnholtz-Sloan, Jill; Wineinger, Nathan et al. (2008) Review and evaluation of methods correcting for population stratification with a focus on underlying statistical principles. Hum Hered 66:67-86
Wineinger, Nathan E; Kennedy, Richard E; Erickson, Stephen W et al. (2008) Statistical issues in the analysis of DNA Copy Number Variations. Int J Comput Biol Drug Des 1:368-95
Wojczynski, Mary K; Tiwari, Hemant K (2008) Definition of phenotype. Adv Genet 60:75-105