Title
Real time spatial cluster detection using interpoint distances among precise patient locations.
Abstract
BACKGROUND: Public health departments in the United States are beginning to gain timely access to health data, often as soon as one day after a visit to a health care facility. Consequently, new approaches to outbreak surveillance are being developed. When cases cluster geographically, an analysis of their spatial distribution can facilitate outbreak detection. Our method focuses on detecting perturbations in the distribution of pair-wise distances among all patients in a geographical region. Barring outbreaks, this distribution can be quite stable over time. We sought to exemplify the method by measuring its cluster detection performance, and to determine factors affecting sensitivity to spatial clustering among patients presenting to hospital emergency departments with respiratory syndromes. METHODS: The approach was to (1) define a baseline spatial distribution of home addresses for a population of patients visiting an emergency department with respiratory syndromes using historical data; (2) develop a controlled feature set simulation by inserting simulated outbreak data with varied parameters into authentic background noise, thereby creating semisynthetic data; (3) compare the observed with the expected spatial distribution; (4) establish the relative value of different alarm strategies so as to maximize sensitivity for the detection of clustering; and (5) measure factors which have an impact on sensitivity. RESULTS: Overall sensitivity to detect spatial clustering was 62%. This contrasts with an overall alarm rate of less than 5% for the same number of extra visits when the extra visits were not characterized by geographic clustering. Clusters that produced the least number of alarms were those that were small in size (10 extra visits in a week, where visits per week ranged from 120 to 472), diffusely distributed over an area with a 3 km radius, and located close to the hospital (5 km) in a region most densely populated with patients to this hospital. Near perfect alarm rates were found for clusters that varied on the opposite extremes of these parameters (40 extra visits, within a 250 meter radius, 50 km from the hospital). CONCLUSION: Measuring perturbations in the interpoint distance distribution is a sensitive method for detecting spatial clustering. When cases are clustered geographically, there is clearly power to detect clustering when the spatial distribution is represented by the M statistic, even when clusters are small in size. By varying independent parameters of simulated outbreaks, we have demonstrated empirically the limits of detection of different types of outbreaks.
Year
DOI
Venue
2005
10.1186/1472-6947-5-19
BMC Med. Inf. & Decision Making
Keywords
Field
DocType
real time,demography,ambulatory care,health care,cluster analysis,public health,geography,limit of detection
Health care,Public health,Outbreak,Medical emergency,Health informatics,Cluster analysis,Ambulatory care,Medicine
Journal
Volume
Issue
ISSN
5
1
1472-6947
Citations 
PageRank 
References 
5
1.01
2
Authors
4
Name
Order
Citations
PageRank
Karen L. Olson193.23
Marco Bonetti2103.46
Marcello Pagano35715.02
Kenneth D. Mandl427567.17