We propose to develop software and cluster statistics appropriate when the exact space-time locations of health events are not known. Health professionals are investigating an increasing number of possible disease clusters, and statistical tests play an important role in cluster description and analysis. Existing cluster statistics assume precise data, when in reality data on health events are often imprecise (e.g. place-of-residence is known only to the census district or zip code) and uncertain (e.g. 'I first became ill sometime in 1985'). Most cluster statistics can be written as the cross product of two matrices, where one matrix reflects nearest-neighbor, distance or adjacency relationships and the second matrix is health related (e.g. case-control identities). This research will explore a general approach to clustering which incorporates uncertainty regarding space-time locations into the nearest-neighbor, distance or adjacency relationships. Because the approach is general, the proposed methods can be used with almost all existing cluster tests. In phase 1 we will determine feasibility by implementing this general approach for Cuzick & Edwards' (nearest-neighbor-based), Mantel's (distance-based) and Knox's (adjacency-based) tests. The delivery of the prototype software and manual at the end of phase 1 will be the criterion for demonstrating project feasibility. In phase 2 we will extend the approach to 10 other cluster tests and evaluate the fuzzy clustering algorithms using statistical power comparisons based on 3 realistic disease simulations.
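The cross-product form described above can be illustrated with a minimal sketch. It is not the project's software: the function names, the thresholds, and the treatment of uncertainty (each event has a list of equally likely candidate locations, and adjacency is averaged over random draws from those lists) are assumptions made here for illustration only. The example computes a Knox-style count of event pairs that are close in both space and time.

```python
import math
import random

def cross_product(a, b):
    """Gamma = sum over pairs i < j of a[i][j] * b[i][j]."""
    n = len(a)
    return sum(a[i][j] * b[i][j] for i in range(n) for j in range(i + 1, n))

def adjacency(values, dist, threshold):
    """Matrix with 1 where two distinct events lie within the threshold, else 0."""
    n = len(values)
    return [[1 if i != j and dist(values[i], values[j]) <= threshold else 0
             for j in range(n)] for i in range(n)]

def expected_adjacency(candidates, dist, threshold, draws=200, seed=0):
    """Average adjacency over random draws of one location per event.

    candidates[i] lists the places event i might have occurred; treating
    them as equally likely is an assumption of this sketch.
    """
    rng = random.Random(seed)
    n = len(candidates)
    counts = [[0] * n for _ in range(n)]
    for _ in range(draws):
        sample = [rng.choice(c) for c in candidates]
        adj = adjacency(sample, dist, threshold)
        for i in range(n):
            for j in range(n):
                counts[i][j] += adj[i][j]
    return [[counts[i][j] / draws for j in range(n)] for i in range(n)]

def time_gap(s, t):
    """Absolute difference between two event times."""
    return abs(s - t)

# Hypothetical data: four events with exact locations and onset times.
space = [(0, 0), (0, 1), (5, 5), (5, 6)]
times = [1, 2, 10, 11]

space_adj = adjacency(space, math.dist, 1.5)
time_adj = adjacency(times, time_gap, 2)
knox = cross_product(space_adj, time_adj)  # pairs close in space and time

# Same events, but the second one is known only to one of two places.
uncertain = [[(0, 0)], [(0, 1), (5, 5)], [(5, 5)], [(5, 6)]]
fuzzy_space_adj = expected_adjacency(uncertain, math.dist, 1.5)
fuzzy_knox = cross_product(fuzzy_space_adj, time_adj)

print(knox, fuzzy_knox)
```

With exact locations the statistic is an integer pair count; with uncertain locations the spatial matrix holds values between 0 and 1, so the statistic becomes a weighted count. Significance would still need a reference distribution, for example by permuting the rows and columns of one matrix, which this sketch omits.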

Proposed Commercial Applications

The resulting software will be a powerful tool for the statistical description and detection of realistic clusters of health events characterized by uncertain space-time locations.

Agency
National Institutes of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Small Business Innovation Research Grants (SBIR) - Phase I (R43)
Project #
1R43CA065366-01A1
Application #
2108319
Study Section
Special Emphasis Panel (ZRG7-SSS-1 (15))
Project Start
1995-04-01
Project End
1996-01-31
Budget Start
1995-04-01
Budget End
1996-01-31
Support Year
1
Fiscal Year
1995
Total Cost
Indirect Cost
Name
Biomedware
Department
Type
DUNS #
City
Ann Arbor
State
MI
Country
United States
Zip Code
48103
Jacquez, G M (1996) A k nearest neighbour test for space-time interaction. Stat Med 15:1935-49
Jacquez, G M; Grimson, R; Waller, L A et al. (1996) The analysis of disease clusters, Part II: Introduction to techniques. Infect Control Hosp Epidemiol 17:385-97
Jacquez, G M (1995) The map comparison problem: tests for the overlap of geographic boundaries. Stat Med 14:2343-61