Variation is a fundamental property of data obtained in modern cancer research, be they genomic changes, gene expression abnormalities, or phenotypic data from gene mapping. Statistical methods arise from a logic that decomposes variation into that which is sporadic and that which may have some biological significance. The purpose of this research proposal is to develop statistical methods tailored to current and emerging data structures in cancer biology. If successful, this research will improve inference about cancer biology by enabling more efficient and robust extraction of information from the complex data that will be upon us. Four specific problems will be tackled. New microarray-based technologies have enabled DNA sequence copy number variations to be measured at very high resolution in cancer tumor cells, thus enhancing the characterization of suppressor genes and oncogenes. Sources of variation complicate inference.
The first aim i s to develop statistical methods for analyzing copy-number variation by extending existing models of allelic-imbalance data. New mathematical formulations and inference methods are proposed for this purpose. Microarray technology is also creating a wealth of data on gene expression in cancer cells.
In Aim 2, hierarchical modeling methods are proposed to characterize the normal variation of these profiles, to enable comparison at various levels, such as among genes, or among microarrays, and to enable data reduction via nonparametric mixture modeling.
The third aim concerns interval mapping methods which have for some time enabled the localization of genes in controlled animal experiments. Methods which are nonparametric in the phenotype distribution are highly robust, but available methods can lose too much information by working with sums of ranks. Sensitive nonparametric interval mapping methodology is proposed to enhance efficiency. Finally, phenotype-driven mutagenesis experiments based on quantitative phenotypes require statistical methods to efficiently screen mutagenized animals and to trace mutant genotypes through progeny testing and mapping. Parametric and nonparametric methods are proposed for this purpose. Developments on these four specific aims are linked by common biological features, by structural similarities in the statistical models, and in the computational issues raised by data analysis.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
5R01CA064364-08
Application #
6512966
Study Section
Special Emphasis Panel (ZRG1-SNEM-5 (01))
Program Officer
Erickson, Burdette (BUD) W
Project Start
1995-07-01
Project End
2004-02-29
Budget Start
2002-07-01
Budget End
2004-02-29
Support Year
8
Fiscal Year
2002
Total Cost
$97,200
Indirect Cost
Name
University of Wisconsin Madison
Department
Biostatistics & Other Math Sci
Type
Schools of Medicine
DUNS #
161202122
City
Madison
State
WI
Country
United States
Zip Code
53715
Sarkar, Deepayan; Goldstein, Steve; Schwartz, David C et al. (2012) Statistical significance of optical map alignments. J Comput Biol 19:478-92
Stanhope, Stephen A; Sengupta, Srikumar; den Boon, Johan et al. (2009) Statistical use of argonaute expression and RISC assembly in microRNA target identification. PLoS Comput Biol 5:e1000516
Durkee, Benjamin Y; Shinki, Kazuhiko; Newton, Michael A et al. (2009) Longitudinal assessment of colonic tumor fate in mice by computed tomography and optical colonoscopy. Acad Radiol 16:1475-82
Newton, Michael A; Clipson, Linda; Thliveris, Andrew T et al. (2006) A statistical test of the hypothesis that polyclonal intestinal tumors arise by random collision of initiated clones. Biometrics 62:721-7
Thliveris, Andrew T; Halberg, Richard B; Clipson, Linda et al. (2005) Polyclonality of familial murine adenomas: analyses of mouse chimeras with low tumor multiplicity suggest short-range interactions. Proc Natl Acad Sci U S A 102:6960-5
Gould, Karen A; Tochacek, Martin; Schaffer, Beverly S et al. (2004) Genetic determination of susceptibility to estrogen-induced mammary cancer in the ACI rat: mapping of Emca1 and Emca2 to chromosomes 5 and 18. Genetics 168:2113-25
Kendziorski, C M; Newton, M A; Lan, H et al. (2003) On parametric empirical Bayes methods for comparing multiple groups using replicated gene expression profiles. Stat Med 22:3899-914