The primary long term goal of this project is the development or more efficient statistical designs and more efficient and flexible statistical methods of analysis for both analytic and descriptive studies in cancer epidemiology. A secondary goal is the rapid dissemination of research results in a form suitable for assimilation and use by epidemiologists who may lack advanced technical training. Epidemiologic studies play a major role in the identification of carcinogenic agents and in the quantification of the dose-time-response relationships upon which regulation and prevention strategies are based. Epidemiology as a science depends critically upon statistical concepts of design and methods of statistical analysis which it is the goal of this project to improve.
The specific aims are to develop efficient and computationally feasible statistical methods for 3 problems: (i) estimation of random effects and variance components in generalized linear mixed models; (ii) analysis of data from two-stage case-control studies and other stratified epidemiologic designs; and (iii) analysis of epidemiologic data with missing covariable or exposure information. Random effects of mixed models for epidemiologic data, especially those that come in the form of counts, proportions or ordinal responses, are of increasing importance for: (i) incorporation of historical control information in case-control studies; (ii) accounting for inter-institutional variation in multi- center studies; (iii) recovery of """"""""interstratum information"""""""" in finely stratified analyses; (iv) smoothing of cancer incidence rates for construction of disease maps; (v) exploratory regression analyses with smoothing based on auto-regressive models; (vi) combination of relative risk estimates from independent studies (meta-analyses); and (vii) clinical epidemiological prediction of individual responses to therapeutic or preventive interventions. Two-stage case-control studies and other similar stratified designs are of great value in limiting the collection of costly covariable data to the subsets of cancer patients and controls who are most informative regarding the association between cancer and specific exposures. Such designs and appropriate efficient methods of analysis have as a goal the collection of precise scientific data at minimal possible cost. Finally, missing covariable and exposure information is a pervasive problem in cancer epidemiology. Standard approaches based on """"""""complete case"""""""" analyses may be biased and inefficient. Recent work on multiple imputation techniques, first proposed and used for sample surveys, promises to improve the analysis of epidemiologic data provided that it can be adapted to account for generally smaller sample sizes. The methods used to achieve these goals will include mathematical and statistical analysis, computer simulation and application of newly developed methods to important datasets collected by cancer epidemiologists.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
2R01CA040644-08
Application #
3180946
Study Section
Special Emphasis Panel (SSS (R2))
Project Start
1985-09-01
Project End
1997-11-30
Budget Start
1993-01-15
Budget End
1993-11-30
Support Year
8
Fiscal Year
1993
Total Cost
Indirect Cost
Name
University of Washington
Department
Type
Schools of Public Health
DUNS #
135646524
City
Seattle
State
WA
Country
United States
Zip Code
98195
Breslow, Norman E; Hu, Jie; Wellner, Jon A (2015) Z-estimation and stratified samples: application to survival models. Lifetime Data Anal 21:493-516
Breslow, Norman E; Amorim, Gustavo; Pettinger, Mary B et al. (2013) Using the Whole Cohort in the Analysis of Case-Control Data: Application to the Women's Health Initiative. Stat Biosci 5:
Breslow, Norman E; Lumley, Thomas; Ballantyne, Christie M et al. (2009) Using the whole cohort in the analysis of case-cohort data. Am J Epidemiol 169:1398-405
Breslow, Norman E; Lumley, Thomas; Ballantyne, Christie M et al. (2009) Improved Horvitz-Thompson Estimation of Model Parameters from Two-phase Stratified Samples: Applications in Epidemiology. Stat Biosci 1:32
Nelson, Kerrie P; Leroux, Brian G (2006) Statistical models for autocorrelated count data. Stat Med 25:1413-30
Breslow, Norman E (2003) Are statistical contributions to medicine undervalued? Biometrics 59:1-8
Platt, R W (2000) Saddlepoint approximations for small sample logistic regression problems. Stat Med 19:323-34
Leroux, B G (2000) Modelling spatial disease rates using maximum likelihood. Stat Med 19:2321-32
Platt, R W; Leroux, B G; Breslow, N (1999) Generalized linear mixed models for meta-analysis. Stat Med 18:643-54
McKnight, B; Tierney, C; McGorray, S P et al. (1998) Likelihood-based inference for the genetic relative risk based on affected-sibling-pair marker data. Biometrics 54:426-43

Showing the most recent 10 out of 32 publications