As researchers investigate the relationship between cancer and exposure to environmental chemicals such as trace elements, pesticides, and dioxins, they often find concentrations that are lower than limits deemed reliable enough to report as numerical values. The detection limit (DL) may be a fixed number in some studies, but it can also vary widely from sample to sample in other studies. For the latter, the DL may be correlated with the exposure level, as observed in a colon cancer study in Kentucky. The data subject to DLs present challenges for data analysis and interpretation. In this proposal we focus on two important statistical problems encountered in the analysis of data from environmental epidemiologic studies: (a) estimation of the chemical distribution in a specific group; and (b) comparison of distributions among groups. For these two problems, ad hoc, parametric, and nonparametric methods have been proposed. Ad hoc methods are ill-advised unless there are relatively few measurements below DLs; and parametric methods can lead to markedly biased results when the parametric model is misspecified. Nonparametric methods have received increasing attention in recent years because of their robustness. However, current nonparametric methods simply borrow the commonly used methods for right-censored survival data, and do not take into account the following two unique characteristics of environmental exposure data with DLs: (a) it is not meaningful to define the hazard function for an exposure measurement; and (b) DL values are observable for all subjects including those whose actual exposure levels are detected. In addition, current nonparametric methods do not allow for sampling weights, which are typically present in survey data such as the National Health and Nutrition Examination Survey (NHANES). Due to these issues, current nonparametric methods may lead to the following four problems for the analysis of environmental exposure data with DLs: (a) lack of meaningful interpretation; (b) inefficient results; (c) inability to deal with the situation that the exposure level and DL are correlated; and (d) inability to handle survey data with sampling weights. To address the aforementioned problems, we will develop unified and efficient nonparametric estimation and testing methods that can (a) deal with possible correlation between the exposure level and DL; (b) incorporate sampling weights. We will utilize state-of-the-art methods for censored survival data and tailor them to environmental exposure data with DLs. The proposed methods will be applied to data from a recently conducted colon cancer case-control study in Kentucky, an ongoing lung cancer case-control study in Kentucky, and the NHANES. 1

Public Health Relevance

This application proposes to develop a novel statistical method to improve the analysis of environmental data subject to detection limits. The success of this project will result in an innovative statistical approach to more accurately assess the potential risk posed by environmental carcinogens such as arsenic, chromium, and nickel on the development and progression of colon and lung cancers. The proposed method, if shown to be more optimal than existing methods, will benefit any environmental exposure study (regardless of health outcome) where the measurements are subject to detection limits.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Small Research Grants (R03)
Project #
5R03CA179661-02
Application #
9061638
Study Section
Special Emphasis Panel (ZCA1)
Program Officer
Liu, Benmei
Project Start
2015-05-01
Project End
2017-04-30
Budget Start
2016-05-01
Budget End
2017-04-30
Support Year
2
Fiscal Year
2016
Total Cost
Indirect Cost
Name
University of Kentucky
Department
Biostatistics & Other Math Sci
Type
Schools of Public Health
DUNS #
939017877
City
Lexington
State
KY
Country
United States
Zip Code
40506
Yang, Yuchen; Shelton, Brent J; Tucker, Thomas T et al. (2017) Estimation of exposure distribution adjusting for association between exposure level and detection limit. Stat Med 36:2935-2946