The broad, long-term objective of this project is to develop new methodology for analyzing pedigree data with complex models. With the current explosion of genetic data, scientists have become more ambitious in their effort to locate disease susceptibility genes. Instead of being satisfied with mapping simple dominant or recessive d , more attention is directed towards complex traits such as diabetes or Alzheimer disease. At the same time, data on multiple markers may be used in a single analysis, either to increase the power in detecting linkage or to localize the gene after linkage has been found. These changes created two separate, but related, problems in data analysis. The first problem is computations. Because of the complex models fitted to the disease phenotype and/or the use of multiple markers, simply computing the lod score can be extremely time consuming and sometimes infeasible using existing techniques. The second problem concerns statistical inference. The current criterion for declaring linkage is to have a lod score of 3.0 or above. While this criterion is quite appropriate for a simple trait, it is clear that adjustments axe needed when the genetic models fitted have many parameters and when multiple diagnostics schemes are considered. Getting the lod score alone should not be considered as the end of an analysis. Other inference tools, such as P-values and posterior odds, are needed. The proposed research is directed towards the two problems mentioned. Computationally, efficient algorithms are developed by combining the strengths of several techniques. These include the traditional peeling algorithm, importance sampling, and Gibbs sampling. Algorithms are not just developed for computing lod scores, but also for obtaining P-values and posterior odds. In addition, probabilities of key recombination events can also be obtained as a bonus, which can sometimes greatly assist the task of understanding the data. The new methodology will be tested using both simulated and real data sets. The later include data sets on malignant melanoma, Alzheimer disease, a rare form of diabetes, and osteoporosis.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
1R01GM046800-01A1
Application #
3306266
Study Section
Special Emphasis Panel (SSS (R7))
Project Start
1992-08-01
Project End
1994-07-31
Budget Start
1992-08-01
Budget End
1993-07-31
Support Year
1
Fiscal Year
1992
Total Cost
Indirect Cost
Name
University of Chicago
Department
Type
Schools of Arts and Sciences
DUNS #
225410919
City
Chicago
State
IL
Country
United States
Zip Code
60637
Cox, N J; Frigge, M; Nicolae, D L et al. (1999) Loci on chromosomes 2 (NIDDM1) and 15 interact to increase susceptibility to diabetes in Mexican Americans. Nat Genet 21:213-5
Boehnke, M; Cox, N J (1997) Accurate inference of relationships in sib-pair linkage studies. Am J Hum Genet 61:423-9
Wright, F A; Kong, A (1997) Linkage mapping in experimental crosses: the robustness of single-gene models. Genetics 146:417-25
Kong, A; Cox, N J (1997) Allele-sharing models: LOD scores and accurate linkage tests. Am J Hum Genet 61:1179-88
Pluzhnikov, A; Donnelly, P (1996) Optimal sequencing strategies for surveying molecular genetic diversity. Genetics 144:1247-62
Kong, A; Wright, F (1994) Asymptotic theory for gene mapping. Proc Natl Acad Sci U S A 91:9705-9
Irwin, M; Cox, N; Kong, A (1994) Sequential imputation for multilocus linkage analysis. Proc Natl Acad Sci U S A 91:11684-8
Kong, A; Cox, N; Frigge, M et al. (1993) Sequential imputation and multipoint linkage analysis. Genet Epidemiol 10:483-8
Kong, A; Frigge, M; Irwin, M et al. (1992) Importance sampling. I. Computing multimodel p values in linkage analysis. Am J Hum Genet 51:1413-29