# Review of software for space-time disease surveillance

- Colin Robertson
^{1}Email author and - Trisalyn A Nelson
^{1}

**9**:16

https://doi.org/10.1186/1476-072X-9-16

© Robertson and Nelson; licensee BioMed Central Ltd. 2010

**Received: **22 December 2009

**Accepted: **12 March 2010

**Published: **12 March 2010

## Abstract

Disease surveillance makes use of information technology at almost every stage of the process, from data collection and collation, through to analysis and dissemination. Automated data collection systems enable near-real time analysis of incoming data. This context places a heavy burden on software used for space-time surveillance. In this paper, we review software programs capable of space-time disease surveillance analysis, and outline some of their salient features, shortcomings, and usability. Programs with space-time methods were selected for inclusion, limiting our review to ClusterSeer, SaTScan, GeoSurveillance and the Surveillance package for R. We structure the review around stages of analysis: preprocessing, analysis, technical issues, and output. Simulated data were used to review each of the software packages. SaTScan was found to be the best equipped package for use in an automated surveillance system. ClusterSeer is more suited to data exploration, and learning about the different methods of statistical surveillance.

## Keywords

## Introduction

Disease surveillance is an ongoing process of information gathering, organizing, analyzing, interpreting, and communicating. It is the principal means by which public health information is generated and disseminated, informing policy, research, and response measures. For outbreaks of infectious disease, timely information on the spread of cases in space and time can facilitate action by public health officials [e.g., [1]]. For chronic and endemic diseases, monitoring space-time trends in disease occurrence can highlight changing patterns in risk and help identify new risk factors [e.g., [2]]. Analysis of spatial-temporal patterns in public health data is an increasingly common task for public health analysts as more surveillance data become available. Surveillance datasets are often massive in size and complexity, and the availability and quality of software capable of analyzing space-time disease surveillance data on an ongoing basis is integral to practical surveillance [3–5]. Geographic information systems (GIS) used for disease mapping can visualize the spatial variation in disease risk. However, statistical methods are often required to detect changes in the underlying disease process. GIS are also poorly equipped to handle temporal data [6].

In Fall of 2008, a workshop on training priorities in the use of GIS in health research conducted in Victoria, British Columbia, polled 78 researchers, graduate students, faculty, and others working in health and GIS regarding barriers to the use of space-time disease surveillance [7]. Training and software availability were cited as the primary barriers to the uptake of space-time disease surveillance. Currently, statistical methods for space-time disease surveillance are not included in most conventional GIS or statistical software. These methods are available in specialist cluster analysis software such as ClusterSeer http://www.terraseer.com, or as extensions to general statistical analysis software packages (e.g., R, S-Plus). Our goal is to provide researchers and public health analysts with a review and demonstration of software packages for space-time disease surveillance. We aim to facilitate expanded use of these methods by providing a means to quickly determine the software options and to identify the ways in which programs differ. We limit our scope to methods that use both space and time, rather than purely temporal or spatial analysis.

This paper is organized as follows. First, we briefly review basic classes of methods for space-time disease surveillance in the background section. Readers familiar with these methods may wish to skip ahead. Second, in the methods section we outline how we selected software to review, the review methodology and datasets used to demonstrate software features. Third, we present the results of our review. Finally, we conclude with some guidelines for the use of these software packages for public health researchers and analysts.

## Background

Statistical approaches to disease surveillance have been the subject of a number of texts and review papers [8–10]. A key factor in the selection of methods of analysis is the objective of surveillance, such as outbreak detection, trend monitoring, case detection, or situational awareness. Additional contextual factors are also important to consider such as scale and scope of the system, disease characteristics, and technical considerations [11]. Methods can be broadly categorized as either statistical tests or model-based approaches. Statistical tests are the dominant class of approaches used for outbreak detection. The aim of most methods is to test a subset of data, defined by spatial and temporal constraints (i.e., a window or kernel), against an expected rate of disease occurrence over the study area as a whole. Methods differ with respect to how the window that defines each subset is constructed, how statistical significance is determined, and how the baseline expectation varies over space and time.

The most widely used testing methods are cumulative sum (cusum) methods and scan statistics. Briefly, cusum approaches keep a running sum of deviations from the expected value, and once the cumulative deviation reaches some threshold, an alarm is triggered. For space-time applications, individual cumulative sums for each area under surveillance are monitored and can be adjusted for spatial relationships [12]. Depending on the statistic being monitored in the cusum, different surveillance objectives can be addressed. For example, a measure of spatial pattern monitored in a cusum framework can be sensitive to slight changes in spatial pattern which may signal a shift in dynamics of an endemic disease [e.g., [13]]. Scan statistics are used mostly in outbreak detection contexts. Here, circular search windows of varying radii scan a map of disease and test if the number of cases within the search area is unexpectedly high. In the space-time scan statistic [14, 15], the search area is extended to a cylinder where the height of the cylinder is defined by time periods of varying lengths. The mostly likely cluster is assessed using monte carlo simulations.

Modeling approaches are used mostly for adjusting the expected number of cases (i.e., denominator) of disease. Disease incidence varies spatially with population and known risk factors. Disease mapping models aim to estimate the true *relative risk* across the study area by incorporating the spatial variation in these risk factors. The standardized mortality ratio (SMR) is the crudest measure of risk, computed as the observed cases divided by the expected in each area. The SMR is often of limited use in surveillance because it can fluctuate widely for rare diseases or in rural areas where populations are small. Further, abrupt (i.e., unrealistic) changes at the boundaries of areal units are sometimes observed. Models allow both covariate effects to be estimated, and for sparsely populated areas to have their expected values adjusted towards the mean (i.e., borrow strength). When used in surveillance applications, models confer these same advantages. Disease surveillance models have been either space-time Bayesian models [e.g., [16]] or generalized linear mixed models [e.g., [17]]. Modeling approaches are complementary to other methods as tests are still required to determine how well the most recently observed data fit with the model [18]. Adjustments can also be such that models can be refit over time to adjust to long-term changes in disease occurrence or surveillance effort/efficacy (e.g., improved diagnostic tests), and parameters can be included to model spatial relationships and seasonal and day of the week effects, common features of some types of disease surveillance data.

In addition to testing and modeling methods, new computation-based tools are also being developed for surveillance. These approaches tend to be in either experimental and/or theoretical stages or algorithms designed for specific surveillance systems. Some hybrid approaches include networks [19], simulation-based methods [20], and space-time hidden markov models [21]. While many of these new approaches appear promising, most are not yet available in software.

## Methods

### Inclusion Criteria

List of software packages for review of space-time disease surveillance software

Software Package | Source | Reference | Description |
---|---|---|---|

SaTScan 8.0 | Kulldorff and Information Management Services 2009 [38] | Cluster detection software with several spatial, temporal and space-time scan statistics. | |

ClusterSeer 2.3 | Jacquez et al. 2002 [39] | Cluster analysis software includes many methods for spatial, temporal, and space-time analysis. | |

GeoSurveillance 1.1 | Yamada et al. 2009 [40] | Implementation of cumulative sum surveillance statistics. | |

Surveillance package 1.1-2 | http://cran.r-project.org/web/packages/surveillance/index.html | Höhle 2007 [41] | Package for statistical surveillance includes test-based and model-based methods. |

### Reviewing Framework

Criteria and review approach for review of space-time disease surveillance software

Criteria | Review |
---|---|

Data preprocessing | Number of steps involved to process a point event (cases) shapefile and a polygon census shapefile (population) |

Methods | Description of methods offered by each program |

Technical issues | Speed of computation, system stability, automation, operating requirements |

Analysis output | Output options (graphs, maps, reporting) |

User facility | Qualitative assessment rated on scale of 1 - 5 on each of: |

• Ease of learning | |

• Use | |

• Set up | |

• Documentation/Help |

The second step is conducting the analysis and we briefly describe methods and analysis options for each software package. We highlight technical issues and potential problems or requirements such as stability, speed of computation, and required operating systems. The final step is outputting results and we overview output options available in each package. In addition, we qualitatively assess user facility based on our experience operating the software with test datasets. It should be noted that we do not discuss parameterization of different methods. This is a major issue in practical surveillance, suited to a review and comparison of surveillance methods themselves.

### Datasets

Data were simulated to model a syndromic surveillance system monitoring calls to a health hotline in the Greater Vancouver Area. For simplicity, we refer to each simulated call as a case. Cases were simulated over one year from January 1^{st} to December 31^{st}. Cases were aggregated to census dissemination areas (DA) and were spatially allocated proportional to the population in each census DA. The total population in all DAs was 578,642, and total cases were 4303, giving an annual incidence of 743.64 cases per 100,000. This level of incidence is similar to what might be expected for the total volume of calls made to a telephone health hotline in a major Canadian city [25].

^{th}and lasted until June 5

^{th}, with 148 cases occurring over 10 sq km, covering 33 geographically adjacent census DAs (light grey cluster, Figure 1). Outbreak cases were allocated proportional to census DA population. In outbreak two, 6 spatial clusters constituting a total 501 cases occurred over an area of 16 km

^{2}, covering a total of 104 census DAs (dark grey cluster, Figure 1). The number of cases in clusters ranged from 51 to 140, and cases occurred over the full year. Data were stored in Environmental Systems Research Institute (ESRI) shapefile format, a standard spatial data format which can represent data as points, polygons, or lines.

### Review of Programs

#### Data preprocessing

Data preprocessing steps for each software package to perform a space-time analysis starting with daily data as point events in an ESRI point shapefile and a polygon shapefile of census dissemination area boundaries

Software | Type of Analysis | Required Data Structure | Data Preprocessing Steps |
---|---|---|---|

SaTScan | Space-time cluster scan with Poisson model | • Case file with number of cases, date, and DA id • Population file with population, date, and DA id • Coordinates file with DA id, centroid X and Y coordinates | • Associate DA identifier with each point event • Calculate DA centroid coordinates |

ClusterSeer | Space-time cluster scan with Poisson model | • One table with population • One table with counts of cases for each location and date during study period | • Associate DA identifier with each point event • Calculate week numbers • Aggregate cases by week for each DA (zero counts included) |

GeoSurveillance | Univariate cusum on individual DAs | • DA shapefile with counts of number of cases for each time period named and ordered sequentially in the table | • Calculate week numbers • Split point events into unique shapefiles for each week • Count number of events in each DA by week (zero counts included) • Calculate weekly counts as new fields |

R-Surveillance | Univariate cusum on individual DAs | • Matrix of counts of cases with spatial locations as columns and time periods as rows | • Calculate week numbers • Split point events into unique shapefiles for each week • Count number of events in each DA by week (zero counts included) • Calculate weekly counts as new fields • Read table into R as matrix and transpose |

ClusterSeer requires unique records for every space-time unit under surveillance. Running a daily space-time scan statistic for our simulated data would require a dataset with four columns (location, date, cases, population) and 478,515 records (365 days × 1311 census DAs). Additionally, all areas need a record for every time period. Generating the necessary table required use of specialized data restructuring functions in R statistical software (reshape package). Data were aggregated to counts of cases by week. (52 weeks × 1311 census DAs) giving a table with 68,172 records. For weeks where DAs had no cases, zero counts had to be inserted.

Preparing data for analysis in GeoSurveillance required aggregation temporally and spatially. Counts of cases were required to be attributes of the polygon shapefile (or text file), and fields were required to be named in sequential order. This process was automated by custom programming in ArcGIS which performed spatial joins and added new fields to the attribute table. This was an extensive process to get the data in the proper format for analysis, and similar to ClusterSeer, GeoSurveillance does not allow flexibility in the level of temporal aggregation. ClusterSeer and GeoSurveillance can both read in polygon shapefiles and automatically calculate centroid coordinates.

For analysis with the Surveillance package in R, data were required to be in a matrix with temporal observations as rows and spatial units as columns, giving a 365 × 1311 matrix for daily analysis and 52 × 1311 for weekly analysis. All of the programs except SaTScan had inflexible data input requirements, specifically for temporal aggregation of cases. None of the software programs could input the two shapefiles (points and polygons) without any data preprocessing. This was surprising as previous experience and a review of SaTScan [26] suggested cumbersome input format as a major limitation of SaTScan.

#### Methods

The programs reviewed here are of two types: specialized implementation of a specific class of surveillance algorithms (SaTScan, GeoSurveillance) and full suite surveillance/space-time analysis packages that implement multiple methods (ClusterSeer, R-surveillance). SaTScan offers a number of scan statistics such as spatial [27], temporal [28], and space-time versions [14, 15], as well as retrospective and prospective (clusters must be current) modes. Different data types can be accommodated by the many probability models including Poisson, Bernoulli, space-time permutation, multinomial, ordinal, exponential, and normal. The circular search area used in the classical scan statistic can also be altered to search using an ellipse, or along user-defined connections of spatial units. GeoSurveillance implements the cusum approach to surveillance [e.g., [13]]. The retrospective mode does global spatial analysis only (i.e., reports one cusum test statistic for the map), while the prospective mode does univariate parallel surveillance with the cusum statistic. The multivariate cusum is not yet implemented in GeoSurveillance.

ClusterSeer had the widest range of space-time methods implemented. Those particularly suited to disease surveillance included space-time scanning [14], a cusum approach similar to that in GeoSurveillance [13], and tests for space-time interaction [29–31]. This makes ClusterSeer a useful tool for exploring disease surveillance data. Once data is formatted for use in ClusterSeer, a variety of methods can be used to examine the data. The R-Surveillance package contains a number of algorithms such as the Farrington et al. (1996) method [32], Poisson cusum [33], and the two-component negative binomial model in Held et al. [34]. The algorithms in the surveillance package are mostly model-based and non-spatial, though some space-time surveillance applications can be treated as a multivariate time series problem.

#### Technical Issues

Technical issues encountered in running the software programs varied considerably. SaTScan was capable of running the space-time scan statistic in retrospective mode on daily case data. ClusterSeer was not run on daily data. Initially, memory requirements were a serious limitation of undertaking analysis in ClusterSeer with both test datasets; however an updated version (2.3.22.0) was obtained to complete the analysis on weekly data. The analysis took longer to run than on SaTScan with daily data, though results were very similar. GeoSurveillance ran the univariate cusum in parallel on each of the 1311 census DAs. The analysis ran well on weekly data, however the linked display between the maximum cusum and the map was very slow. The cusum methods were also used for our analysis in R-Surveillance. The time taken to run the analysis on the weekly data was similar to that of GeoSurveillance and results were also similar.

R-Surveillance is the only package that runs on windows, mac and linux operating systems. Currently, SaTScan has versions for windows and linux, and a mac version is in development. Both ClusterSeer and GeoSurveillance run only on the windows operating system. All analyses were run on a Pentium 4 PC with 3.00 GHz processor and 2 GB of RAM running the Windows XP operating system. SaTScan completed analysis in the shortest time compared to all other programs.

#### Data Output

Output options in SaTScan are limited to text file and database file output. Database files can be linked back to the input shapefile in a GIS for further examination of clusters, however no data exploration functionality is available in SaTScan itself. In GeoSurveillance results of an analysis can be written to text file which can be easily manipulated in other software. GeoSurveillance provides a basic map interface linked to a list of cusum scores. A cusum chart is also displayed showing the temporal pattern of cusum scores for the study area as a whole and individual units.

ClusterSeer has advanced data output facility such as mapping and graphing which can be exported as images. Results can also be exported with the data to new files for further examination inside statistical or GIS software. The Surveillance package has access to extensive visualization and exporting functions available in the R environment. The objects specific to the Surveillance package also have default methods for creating plots. This of course requires familiarity with the R programming language.

#### User Facility: Ease of Learning, Ease of Use, Help & Documentation

Comparative review of software packages for space-time disease surveillance: User Facility

Software | Learning | Use | Set Up | Help/Documentation | Comments |
---|---|---|---|---|---|

SaTScan | 4 | 5 | 5 | 4 | Requires knowledge of scan statistics. Basic analysis is straightforward though many advanced options available. Well referenced methodology in the user guide. |

ClusterSeer | 5 | 5 | 3 | 5 | Excellent documentation and learning resources for the many different methods. Data format requirements can be cumbersome. |

GeoSurveillance | 3 | 3 | 3 | 3 | Data structure is peculiar, though the basic user interface is straightforward. Documentation not integrated within the menu itself. |

R - Surveillance | 1 | 3 | 5 | 2 | Command driven system requires knowledge of R language. Examples are easy to replicate. Very easy to install within R. Documentation is not extensive. |

## Conclusions

With the advent of electronic medical records, syndromic data sources, and low-cost location sensors, disease data are increasingly encoded with both spatial and temporal information. These new data sources represent an opportunity for greater understanding of disease distributions, risk factors, and changes to population health over time and space. While analysis of surveillance data represents an expanding opportunity for public health practice and research, these new datasets, methods, and software also bring challenges. There are inherent problems in using traditional statistics for hypothesis testing, or applying simple GIS visualization, to these data sources. As is evidenced by the growing literature on statistical surveillance of disease data [9], methods need to be specifically suited to these data. In addition to statistical methods however, computer software is now essential for the analysis of surveillance data.

The four software programs reviewed in this paper provide functionality for different kinds of analysis and serve different purposes. Based on our review, SaTScan is the most developed and robust software package for implemention in an automated cluster detection system. However, SaTScan only implements scan statistic methods, so those wishing to explore modeling-based approaches may want to use the Surveillance package. Additionally, examining the results in detail requires other software for graphing and mapping. Reasons for taking a modeling approach include making refined estimates of expected rates based on modeled covariate effects, adjusting for spatial heterogeneity in disease rate, and smoothing relative risks. The Surveillance package implements models, but currently has very limited capability for true space-time surveillance. The large number of temporal methods make it a useful environment for exploring surveillance data, in addition to the advantages afforded by being able to integrate with other R packages. As a command-based system, it also is easy to automate and integrate with data processing scripts. The learning curve for R is quite steep, and those requiring a GUI-based system to explore surveillance data would be better served by ClusterSeer. The extensive documentation and many purely spatial and temporal methods, in addition to space-time methods, makes it a convenient tool for initial data exploration. There is also a range of output options in ClusterSeer. ClusterSeer may be more suitable for exploratory studies than as part of an ongoing, automated cluster detection system because there is limited capacity for automated surveillance. ClusterSeer project files can be set to run automatically, though because they are binary files they cannot be automatically configured to increment parameters (e.g., study period). Finally, though methods (and software) have been classified as testing or model-based approaches, it is important to note that these approaches are complimentary rather than opposing [18]. For example, one approach is to develop a model of the expected risk of disease using the Surveilllance package, and use the estimated smoothed rates as the expected values in a SaTScan analysis.

All of the programs reviewed in this paper were applications installed on a local computer. While this is the architecture of most computer software applications, new developments in computing are taking advantage of the internet to perform ongoing, high-powered computing tasks [35]. Online delivery of analytic services (such as cluster analysis) allows software to be centralized on one server, and accessible from anywhere with an internet connection. In the context of disease surveillance, this could facilitate standardization of analysis among different regional health authorities, increase transparency of analysis, and offer significant improvements in costs and performance. Initial steps towards web-based surveillance analysis are underway, with a web-based version of ClusterSeer https://www.clusterseer.com currently in development, RWeb [36], a web-based interface to a server instance of R, as well as a newer project called rapache [37], which integrates R into the popular Apache web server. These developments hold considerable promise for the development of future surveillance systems.

The threat of emerging diseases and the growing burden of chronic diseases requires integrated approaches to surveillance. Analysis of disease trends in space-time provides context which can be linked to possible risk factors in a research environment, flag unusual events in an automated surveillance system, and provide epidemiologists with current information during an outbreak. Well-studied and understood methods are required to ensure appropriate use and transparent and reproducible results. The literature on statistical surveillance is extensive and provides this basis, yet software implementations are far from standardized. As space-time surveillance statistical methods mature further, software is also surely to improve. The open-source environments, such as R, may be the optimal venue for future development of surveillance software as they afford easy integration with many statistical and mapping packages, and being open-source, the underlying code can be viewed and modified easily. However data structure remains a major issue when handling space-time data, especially when data has to be moved between different software packages. Standardized space-time data classes in R or another open-source environment may be a fruitful area of development.

## Declarations

### Acknowledgements

This project was supported in part by the Teasdale-Corti Global Health Research Partnership Program, National Sciences and Engineering Research Council of Canada, and GeoConnections Canada.

## Authors’ Affiliations

## References

- Ekpo UF, Mafiana CF, Adeofun CO, Solarin AR, Idowu AB: Geographical information system and predictive risk maps of urinary schistosomiasis in Ogun State, Nigeria. BMC Infect Dis. 2008, 874-Google Scholar
- Kim A, Martinez A, Klausner J, Goldenson J, Kent C, Liska S, McFarland W: Use of sentinel surveillance and geographic information systems to monitor trends in HIV prevalence, incidence, and related risk behavior among women undergoing syphilis screening in a jail setting. J Urban Health. 2008, 86 (1): 79-92. 10.1007/s11524-008-9307-6.PubMedPubMed CentralView ArticleGoogle Scholar
- Aylin P, Maheswaran R, Wakefield J, Cockings S, Jarup L, Arnold R, Wheeler G, Elliott P: A national facility for small area disease mapping and rapid initial assessment of apparent disease clusters around a point source: the UK Small Area Health Statistics Unit. J Public Health. 1999, 21 (3): 289-298. 10.1093/pubmed/21.3.289.View ArticleGoogle Scholar
- Richards TB, Croner CM, Rushton G, Brown CK, Fowler L: Geographic information systems and public health: mapping the future. Public Health Rep. 1999, 114 (4): 359-360. 10.1093/phr/114.4.359.PubMedPubMed CentralView ArticleGoogle Scholar
- Rushton G: Public health, GIS, and spatial analytic tools. Annu Rev Public Health. 2003, 2443-56.Google Scholar
- Langran G: Time in Geographic Information Systems. 1992, New York: Taylor & FrancisGoogle Scholar
- Population Data BC: Setting a health geomatics education and training agenda. Victoria. 2008Google Scholar
- Sonesson C, Bock D: A review and discussion of prospective statistical surveillance in public health. J R Stat Soc Ser A. 2003, 166 (1): 5-21. 10.1111/1467-985X.00256.View ArticleGoogle Scholar
- Lawson AB, Kleinman K: Spatial and Syndromic Surveillance for Public Health. 2005, West Sussex: John WileyView ArticleGoogle Scholar
- Wagner MM, Moore AW, Aryel RM: Handbook of Biosurveillance. 2006, London: ElsevierGoogle Scholar
- Robertson C, Nelson TA, MacNab YC, Lawson AB: Review of methods for space-time disease surveillance. Spat Spattemporal Epidemiol.Google Scholar
- Rogerson PA, Yamada I: Monitoring change in spatial patterns of disease: comparing univariate and multivariate cumulative sum approaches. Stat Med. 2004, 23 (14): 2195-2214. 10.1002/sim.1806.PubMedView ArticleGoogle Scholar
- Rogerson P: Surveillance systems for monitoring the development of spatial patterns. Stat Med. 1997, 16 (18): 2081-2093. 10.1002/(SICI)1097-0258(19970930)16:18<2081::AID-SIM638>3.0.CO;2-W.PubMedView ArticleGoogle Scholar
- Kulldorff M: Prospective time periodic geographical disease surveillance using a scan statistic. J R Stat Soc Ser A. 2001, 164 (1): 61-72. 10.1111/1467-985X.00186.View ArticleGoogle Scholar
- Kulldorff M, Heffernan R, Hartman J, Assuncao R, Mostashari F: A space-time permutation scan statistic for disease outbreak detection. PLoS Med. 2005, 2 (3): e59-10.1371/journal.pmed.0020059.PubMedPubMed CentralView ArticleGoogle Scholar
- Vidal Rodeiro CL, Lawson AB: Monitoring changes in spatio-temporal maps of disease. Biom J. 2006, 48 (3): 463-480. 10.1002/bimj.200510176.PubMedView ArticleGoogle Scholar
- Kleinman K, Lazarus R, Platt R: A generalized linear mixed models approach for detecting incident clusters of disease in small areas, with an application to biological terrorism. Am J Epidemiol. 2004, 159 (3): 217-224. 10.1093/aje/kwh029.PubMedView ArticleGoogle Scholar
- Kleinman KP, Abrams AM, Kulldorff M, Platt R: A model-adjusted space-time scan statistic with an application to syndromic surveillance. Epidemiol Infect. 2005, 133 (03): 409-419. 10.1017/S0950268804003528.PubMedPubMed CentralView ArticleGoogle Scholar
- Reis BY, Kohane IS, Mandl KD: An epidemiological network model for disease outbreak detection. PLoS Med. 2007, 4 (6): e210-10.1371/journal.pmed.0040210.PubMedPubMed CentralView ArticleGoogle Scholar
- Kim Y, O'Kelly M: A bootstrap based space--time surveillance model with an application to crime occurrences. J Geograph Systems. 2008, 11 (2): 155-173.Google Scholar
- Watkins R, Eagleson S, Veenendaal B, Wright G, Plant A: Disease surveillance using a hidden Markov model. BMC Med Inform Decis Mak. 2009, 9 (1): 39-10.1186/1472-6947-9-39.PubMedPubMed CentralView ArticleGoogle Scholar
- Tsui FC, Espino JU, Dato VM, Gesteland PH, Hutman J, Wagner MM: Technical description of RODS: a real-time public health surveillance system. J Am Med Inform Assoc. 2003, 10 (5): 399-408. 10.1197/jamia.M1345.PubMedPubMed CentralView ArticleGoogle Scholar
- Reis BY, Kirby C, Hadden LE, Olson K, McMurry AJ, Daniel JB, Mandl KD: AEGIS: A Robust and scalable real-time public health surveillance system. J Am Med Inform Assoc. 2007, 14 (5): 581-588. 10.1197/jamia.M2342.PubMedPubMed CentralView ArticleGoogle Scholar
- Gotway C, Young L: Combining incompatible spatial data. J Am Stat Assoc. 2002, 97: 632-648. 10.1198/016214502760047140.View ArticleGoogle Scholar
- Perry A: Forecasting hospital emergency department visits for respiratory illness using ontario's telehealth system: an application of real-time syndromic surveillance to forecasting health services demand. MSc Thesis. 2009, Queen's University, Department of Community Health and EpidemiologyGoogle Scholar
- Block R: Scanning for clusters in space and time. Soc Sci Comput Rev. 2007, 25: 272-278. 10.1177/0894439307298562.View ArticleGoogle Scholar
- Kulldorff M, Nagarwalla N: Spatial disease clusters: detection and inference. Stat Med. 1995, 14: 799-810. 10.1002/sim.4780140809.PubMedView ArticleGoogle Scholar
- Nagarwalla N: A scan statistic with a variable window. Stat Med. 1996, 15 (79): 845-850. 10.1002/(SICI)1097-0258(19960415)15:7/9<845::AID-SIM254>3.3.CO;2-O.PubMedView ArticleGoogle Scholar
- Knox E: The detection of space-time interactions. Appl Stat. 1964, 13: 25-29. 10.2307/2985220.View ArticleGoogle Scholar
- Mantel N: The detection of disease clustering and a generalized regression approach. Cancer Res. 1967, 27: 209-220.PubMedGoogle Scholar
- Jacquez G: A k nearest neighbour test for space-time interaction. Stat Med. 1996, 15: 1935-1949. 10.1002/(SICI)1097-0258(19960930)15:18<1935::AID-SIM406>3.0.CO;2-I.PubMedView ArticleGoogle Scholar
- Farrington CP, Andrews N, Beale AD, Catchpole MA: A statistical algorithm for the early detection of outbreaks of infectious disease. J R Stat Soc Ser A. 1996, 159 (3): 547-563. 10.2307/2983331.View ArticleGoogle Scholar
- Rossi G, Lampugnani L, Marchi M: An approximate CUSUM procedure for surveillance of health events. Stat Med. 1999, 18 (16): 2111-2122. 10.1002/(SICI)1097-0258(19990830)18:16<2111::AID-SIM171>3.0.CO;2-Q.PubMedView ArticleGoogle Scholar
- Held L, Hofmann M, Hohle M, Schmid V: A two-component model for counts of infectious diseases. Biostatistics. 2006, 7 (3): 422-437. 10.1093/biostatistics/kxj016.PubMedView ArticleGoogle Scholar
- Armbrust M, Fox A, Griffith R, Joseph AD, Katz RH, Konwinski A, Lee G, Patterson DA, Rabkin A, Stoica I: Above the clouds: A berkeley view of cloud computing. 2009, EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2009-28Google Scholar
- RWeb.http://www.math.montana.edu/Rweb
- Horner J: rapache: Web application development with R and Apache. 2009,http://biostat.mc.vanderbilt.edu/rapache/Google Scholar
- Kulldorff M, Information Management Services Inc: SaTScan™ v8.0: Software for the spatial and space-time scan statistics. 2009,http://www.satscan.orgGoogle Scholar
- Jacquez GM, Greiling DA, Durbeck H, Estberg L, Do E, Long A, Rommel B: ClusterSeer User Guide 2: Software for identifying disease clusters. 2002, Ann Arbor, MI: TerraSeer PressGoogle Scholar
- Yamada I, Rogerson P, Lee G: GeoSurveillance: a GIS-based system for the detection and monitoring of spatial clusters. J Geograph Systems. 2009, 11 (2): 155-173. 10.1007/s10109-009-0080-1.View ArticleGoogle Scholar
- Höhle M: Surveillance: An R package for the monitoring of infectious diseases. Comput Stat. 2007, 22 (4): 571-582. 10.1007/s00180-007-0074-8.View ArticleGoogle Scholar

## Copyright

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

## Comments

View archived comments (1)