The main theme of this research is haplotype, multilocus, general genetic association methods, and statistical issues that arise in large scale data analysis, such as in genome-wide association scans (GWAS). Some of our research is focusing on developing of methods to combine genetic association signals across different samples of the same disease, or signals across multiple, etiologically similar diseases. These methods will help to identify genetic loci involved in several diseases with shared pathogenesis. For example, the genetic variant can be involved in susceptibility to several autoimmune diseases. Association signals can be correlated. One example that leads to correlated signals is shared controls design for GWAS, where the fact of reusing a control group while testing for genetic association with different diseases may create strong correlation between association signals. The methods we have developed are general (Zaykin, Kozbur 2010), and they are being applied to diverse problems in collaborations with NIH and extramural scientists (Costigan et al., 2010, Reimann et al., 2010;with Dr. Raja Jothi, ongoing) Ongoing research include development of statistical approaches to address multiplicity issues in whole genome scans. This research includes investigation of novel approaches aimed to improve ranks of true positives in whole genome scans (in collaboration with Dr. Jack Taylor). We have been developing methods that allow evaluation of chances that a true association will rank among best results in a genome scan. A standard calculation in the design of GWAS is a sample size determination needed to achieve adequate power at the genome-wide level of significance. We are taking an alternative approach: to calculate the probability that a true positive will rank among a specific number of best results, when they are sorted by an association statistic. The rank-based approach allows one to find the number of most significant results to follow up on, as determined by the desired probability of capturing a true association. The rank-based approach is appealing, since it provides guidance for the number of SNPs needed in a replication study. Unlike the power-based approach, it does not require specification of a particular significance level. The problem here is that evaluation of ranking probabilities can be very difficult, because it requires non-standard numerical methods and simulations that model realistic patterns of linkage disequilibrium. Linkage disequilibrium may be specific to a particular scan, thus one would have to perform a customized analysis that involves access to the individual genotype data for a given genome scan. At GWAS densities such analysis can take many weeks to run. We have been concerned with development of practical methods for evaluation of ranking probabilities. We have been developing a method that is completely general in that the same simple approach applies regardless of the extent and structure of linkage disequilibrium. Other statistical genetics research included investigation of methods for estimation of relative risk for family data and imprecisely scored genotypes (in collaboration with Drs. Weinberg, Shi, Umbach, London and Hancock).

Project Start
Project End
Budget Start
Budget End
Support Year
6
Fiscal Year
2010
Total Cost
$352,415
Indirect Cost
City
State
Country
Zip Code
Martin, Loren J; Smith, Shad B; Khoutorsky, Arkady et al. (2017) Epiregulin and EGFR interactions are involved in pain processing. J Clin Invest 127:3353-3366
Vsevolozhskaya, Olga; Ruiz, Gabriel; Zaykin, Dmitri (2017) Bayesian prediction intervals for assessing P-value variability in prospective replication studies. Transl Psychiatry 7:1271
Vsevolozhskaya, Olga A; Kuo, Chia-Ling; Ruiz, Gabriel et al. (2017) The more you test, the more you find: The smallest P-values become increasingly enriched with real findings as more tests are conducted. Genet Epidemiol 41:726-743
Dong, Jing; Wyss, Annah; Yang, Jingyun et al. (2017) Genome-Wide Association Analysis of the Sense of Smell in U.S. Older Adults: Identification of Novel Risk Loci in African-Americans and European-Americans. Mol Neurobiol 54:8021-8032
Shi, Min; O'Brien, Katie M; Sandler, Dale P et al. (2017) Previous GWAS hits in relation to young-onset breast cancer. Breast Cancer Res Treat 161:333-344
O'Brien, Katie M; Shi, Min; Sandler, Dale P et al. (2016) A family-based, genome-wide association study of young-onset breast cancer: inherited variants and maternally mediated effects. Eur J Hum Genet 24:1316-23
Vsevolozhskaya, Olga A; Zaykin, Dmitri V; Barondess, David A et al. (2016) Uncovering Local Trends in Genetic Effects of Multiple Phenotypes via Functional Linear Models. Genet Epidemiol 40:210-221
Vsevolozhskaya, Olga A; Greenwood, Mark C; Powell, Scott L et al. (2015) Resampling-based multiple comparison procedure with application to point-wise testing with functional data. Environ Ecol Stat 22:45-59
Meloto, Carolina B; Segall, Samantha K; Smith, Shad et al. (2015) COMT gene locus: new functional variants. Pain 156:2072-83
Weinberg, Clarice R; Zaykin, Dmitri (2015) Response. J Natl Cancer Inst 107:

Showing the most recent 10 out of 29 publications