# A comparison of methods for calculating population exposure estimates of daily weather for health research

- Ivan Hanigan
^{1}Email author,### Affiliated with

- Gillian Hall
^{2}and### Affiliated with

- Keith BG Dear
^{2}### Affiliated with

**5**:38

**DOI: **10.1186/1476-072X-5-38

© Hanigan et al. 2006

**Received: **04 July 2006

**Accepted: **13 September 2006

**Published: **13 September 2006

## Abstract

### Background

To explain the possible effects of exposure to weather conditions on population health outcomes, weather data need to be calculated at a level in space and time that is appropriate for the health data. There are various ways of estimating exposure values from raw data collected at weather stations but the rationale for using one technique rather than another; the significance of the difference in the values obtained; and the effect these have on a research question are factors often not explicitly considered. In this study we compare different techniques for allocating weather data observations to small geographical areas and different options for weighting averages of these observations when calculating estimates of daily precipitation and temperature for Australian Postal Areas. Options that weight observations based on distance from population centroids and population size are more computationally intensive but give estimates that conceptually are more closely related to the experience of the population.

### Results

Options based on values derived from sites internal to postal areas, or from nearest neighbour sites – that is, using proximity polygons around weather stations intersected with postal areas – tended to include fewer stations' observations in their estimates, and missing values were common. Options based on observations from stations within 50 kilometres radius of centroids and weighting of data by distance from centroids gave more complete estimates. Using the geographic centroid of the postal area gave estimates that differed slightly from the population weighted centroids and the population weighted average of sub-unit estimates.

### Conclusion

To calculate daily weather exposure values for analysis of health outcome data for small areas, the use of data from weather stations internal to the area only, or from neighbouring weather stations (allocated by the use of proximity polygons), is too limited. The most appropriate method conceptually is the use of weather data from sites within 50 kilometres radius of the area weighted to population centres, but a simpler acceptable option is to weight to the geographic centroid.

## Background

A study of the possible effect of temperature and precipitation on gastroenteritis inspired an assessment of different methods for small area population exposure estimation techniques. The health data were obtained from a survey with respondents from Australia conducted between September 2001 and August 2002 [1]. In area-level analysis such as this the health outcome data and population exposure variables would ideally be available at the finest resolution of aggregation in space and time. The daily health outcome data were available for individuals and the postcode of their residence was recorded. The level of aggregation of weather observations for this analysis was also at the postcode level. The focus of this study was to therefore find the best method for representing the exposure of populations to daily weather in small geographic areas.

Within spatial units there are factors that may complicate the computation of exposure values. For instance there can potentially be large variation of temperature influenced by ground elevation. The design of the monitoring network can also have an impact. The density of weather recording stations is an important factor in computation of exposures, with more reliable estimates expected from areas that have many sites. The way the sites are spread throughout the area can also affect exposure estimates, such as whether they are evenly distributed or clustered together. An additional factor to consider when dealing with human health outcomes is the distribution of the population within the area, as our primary interest is in human exposure to environmental conditions.

The Australian Bureau of Statistics (ABS) does not publish census populations for postcodes, but instead for approximations termed Postal Areas (POA) [2]. Although there are inconsistencies when matching postcodes to POA [3], these were used in order to utilise information on the population distribution in the computation of exposure.

Population weighted exposure data are conceptually appealing as they more closely estimate the weather being experienced by the majority of the population. A complication is that some postcodes consist of multiple non-contiguous parts, or are large single-part postcodes with multiple population clusters. Therefore calculating an estimate for each sub-population separately gives better information from a population exposure perspective, although this is more computationally intensive.

Non-computationally intensive methods of calculating weather exposure estimates used by others have included taking the mean of all stations' observations within a geographic region, a method used by the Australian Bureau of Meteorology for precipitation since 1910 [4]. A similar method is to calculate the mean from the nearest neighbouring stations. This method has been used for a variety of purposes including rainfall [5, 6]. A more sophisticated method is inverse distance weighted averages. This approach has been used in many area, point and gridding contexts [7, 8].

An inverse distance weighted average is:

Q_{j} = ∑W_{ij}Z_{i}/∑W_{ij}

where Q_{j} is the estimate of a day's weather for the jth spatial unit, Z_{i} is the data value measured at the ith station, and the W_{ij} are weights calculated as the reciprocal of the distance, or squared distance, between the jth spatial unit centroid and each of the stations in the neighbourhood. Stations outside the neighbourhood are given zero weight.

The inverse of the squared distance is most commonly used as the weight, however the inverse of the distance is also often used [9]. Using the inverse of the squared distance gives higher weight to closer observations. Note that the part of the spatial unit used to calculate distances from is a very important decision to be made. Some of the options available are: geographic centroid; population weighted centroid; the area boundary; sub-unit centroids; and sub-unit boundaries [10].

Other studies have compared the results from different methods for spatial interpolation of weather, focused on comparing the cells of gridded surfaces [11–14] or imputed data for stations with gaps [15]. The area estimates derived from different methods have been compared less often [16], and rarely in a population health context.

A recent study of health effects of air pollutants and weather in an Australian city estimated exposure at the aggregate level using the average of internal stations without assessing estimates weighted by distance or population [17]. This was noted as a possible limitation of the study design even though the authors considered that any measurement error would be "non-differential and produce conservative relative risks".

There has been much work in the air pollution research community investigating different methods of combining exposure data. Air pollution research that has addressed these issues includes those that compare modelled pollution (using dispersion models or geostatistical surface computation) and compared areal averages with those gained from simple averages of monitors [18–20], and others that use the distance from addresses or area centroids to monitors [21, 22].

However in weather exposure studies often the rationale for using one technique rather than another is not explicitly considered and the differences in the values obtained by the different methods are generally unknown. Comparison between the results of the different estimates is required to ascertain the differential in particular contexts.

There has been a proliferation of approaches to the problems of spatial estimation of daily weather. Some of the methods are splining [23]; kriging and co-kriging [15]; gridded inverse distance weighting algorithms [4, 11, 24]; multiplicatively weighted proximity polygons [25]; artificial neural networks [26]; additive spatial regression models [27]; physically based numerical models of the three-dimensional atmospheric processes [28]; indirect methods such as radar [29]; and remote sensors mounted on satellites [30]. Some of these methods would enable the inclusion of relevant covariates such as elevation, wind speed and wind direction.

However there is no consensus about which is the best to use, some methods are computationally intensive and some commercially available options are expensive [31–33]. In addition, even if one of these were identified as a gold-standard to be used for creating gridded surfaces at each time point, it remains unclear whether it is worthwhile to undertake the extra computational burden needed to estimate population weighted exposure values. These could be based on fine resolution population distribution within spatial units, or on less computationally intensive approaches. It is not known which methods yield adequate weather estimates for health research. This paper addresses this important problem.

### Five methods for population exposure estimation

#### Option 1: average of internal or nearest neighbouring stations (using intersecting proximity polygons)

The first option used to estimate daily temperature and precipitation for POA was to calculate the average of internal stations, or the nearest neighbours if no internal stations exist. The first step in Option 1 was to identify stations in the POA boundaries. If there were no stations then the nearest neighbours were used. The "nearest neighbours" were found by the overlay and intersection of proximity polygons (also known as Thiessen or Voronoi polygons) with the POA boundary. In this approach each monitoring station is the focal point used to calculate the boundaries of a proximity polygon, whose area is defined so that all other points in it are nearest to the focal point than to any other focal point. The corresponding POA code is joined to each of the daily observations in a many-to-many relationship. Then the averages of each daily observation from the stations are calculated for each POA on each day.

#### Option 2: average of nearest neighbouring stations (using interesting proximity polygons)

The second option was to calculate the average of "nearest neighbours" regardless of their location inside or outside each POA boundary. Proximity polygons were used to allocate nearest neighbours as described in Option 1.

#### Option 3: geographic centroid inverse distance weighted average (using stations ≤ 50 km distant from centroid)

In the third option the distance between the geographic centroid and each station was used to calculate an inverse distance weighted average. The geographic centroid (also known as the mean centre) is the geographic centre of the boundary. The inverse distances from this centroid are used to weight the average of the station observations.

An arbitrary maximum distance of 50 km from the centroids of each spatial unit was used because it is likely that stations further away will not be similar to the area of interest [7]. The distance-weighting factor was also compared as the inverse of the distance (Option 3a) and the inverse of the squared distance (Option 3b).

#### Option 4: population weighted centroid inverse distance weighted average (using stations ≤ 50 km distant from centroid)

Option 4 used the distance between the POA population weighted centroid and the stations for an inverse distance weighted average. The population weighted centroid is calculated by subdividing the POA into its population census constituent sub-units (collector's districts) and calculating the centroids of these. The population-weighted centroid is found by weighting the average of the latitude and longitude coordinates of the sub-unit centroids by the populations of those sub-units. The choice of weights was also compared as the inverse of the distance (Option 4a) and the inverse of the squared distance (Option 4b).

#### Option 5: population weighted average of census collector's district distance weighted averages (using stations ≤ 50 km distant from centroid)

In the fifth option we calculated inverse distance weighted averages for each sub-unit geographic centroids (collector's districts) and then averaged these within POA using sub-unit populations as a weight. In this option each centroid had a weather estimate calculated for each day. Then the sizes of the populations are used to weight the contribution of these into each POA on each day. Option 5 differs from Option 4 in that it estimates the weather exposure for each sub-unit first and then gives a weighted summary of these for the POA. The choice of weights was also compared as the inverse of the distance (Option 5a) and the inverse of the squared distance (Option 5b).

### Examples

In Option 2 the only differences are that now POA Y is given the average of the internal station 3 AND the nearest neighbouring stations 1 and 4. POA W now includes the neighbouring station 5 in the average of internal stations 1 and 4.

In Options 3–5 the process is only described for POA Y to avoid excessive detail. In figure 2, Option 3 is shown. The distances from the stations to the geographic centroid of POA Y (shown by the star) are calculated. The distances between the centroid and stations within the search radius are shown by the lines. The inverse distance weighted average will include stations 1, 3 and 4. The station 3 is so close to the centroid that the inverse distance weighted average will be dominated by this observation. This is especially the case using the weight calculated by the reciprocal of the squared distance.

Figure 3 shows that Option 4 uses a centroid weighted by the population of the sub-unit collector's districts (CD) to calculate the distances from the stations. For POA Y the centroid is pulled to the southeast because of the dominance of population in that direction. Distances are calculated from this centroid to the stations within the search radius which now includes the stations 3, 4 and 5.

In figure 4, Option 5 is shown. Here the distance from each sub-unit centroid to each station within the search radius is used to calculate a daily estimate. These are then weighted by the population and aggregated to give a POA level estimate.

We considered Option 5b the most conceptually appealing because it incorporates fine resolution population distribution patterns and is more sensitive to observations close to these sub-populations than the other options.

## Data

### Meteorological data

We obtained average daily temperature (the average of daily maximum and minimum temperature) in degrees Celsius and the daily precipitation in the 24 hours before 9 am in millimetres from the National Climate Centre of the Bureau of Meteorology Research Centre [34].

### Postcodes and postal areas

Inconsistencies between Australian postcode areas (for which health data are available) and Australian Bureau of Statistics POA boundaries (for which population data are available) are sometimes considerable [3, 35]. Despite this we used the POA boundaries to enable the incorporation of fine resolution population data from the Australian Census [36]. The population data are based on the smaller CDs, which are then combined, by the Australian Bureau of Statistics, into the larger POA units in such a way as to align them as closely as possible to postcodes.

When a CD crosses more than one postcode, the decision rule for allocating it to a POA is the area that contains the majority of the population [37]. This is done in a subjective way using indicators such as how much of the area of the CD lies in each region and the distribution of land-use parcels [38].

A further complication is that some postcodes, and therefore some POA, comprise two or more separate land areas. In 2001 there were 72 such multipart POA in NSW and the ACT. The maximum distance between the geographic centroids of any two parts of the same POA was 350 km (POA 2831) and the mean of the split POAs centroid to centroid distance was 33 km. The maximum number of parts in any one POA was 16 (POA 2324). This is a common problem in coastal areas with many small islands allocated a single code, however these cases normally have small distances between parts. There are some inland POA with fewer numbers of parts but greater distances between these, due to the way Australia Post operates its delivery system.

## Results

### Summary of estimates from all five options

The time taken to calculate exposure estimates using Options 1 and 2 (2–3 hours per weather parameter on a desktop PC) was appreciably less than that required for Options 3 and 4 (around 8 hours per parameter). Option 5 required much more processing than the other options because each CD needed an inverse distance weighted weather estimate on each day (approximately 9,500 CD). This method was completed using a Structured Query Language server. Even using this more powerful computer, the time taken was approximately 8 hours per parameter.

The monitoring network is sparse in the west of the state and Options 1 and 2 suffer from a paucity of neighbourhood proximity polygon information. Of the 620 NSW and ACT POA, there were 375 that had internal precipitation stations and 130 with internal temperature stations.

Percentage of POAs with complete, and a majority, of weather estimates by option

Precipitation | Average temperature | |||
---|---|---|---|---|

Option | 100% days with estimates | >=90% days with estimates | 100% days with estimates | >=90% days with estimates |

1 | 60% | 95% | 51% | 94% |

2 | 80% | 98% | 61% | 96% |

3 | 100% | 100% | 85% | 95% |

4 | 100% | 100% | 86% | 96% |

5 | 100% | 100% | 93% | 98% |

The problem stems from the fact that proximity polygon size is inversely related to the density of monitoring stations. In sparsely monitored regions the large size of polygons increases the probability that a POA will be allocated to only one monitoring station, causing gaps in the series on days when no weather information is available for that station. The example of POA Z in figure 1 shows that the estimate is determined by one station in Options 1 and 2. In contrast the distance-weighting scheme uses all stations within a given distance (50 km for this study) and thus incorporates more information.

The inverse distance weighting methods overcame this problem because it is more likely that there will be another station observing which could be used, and the information from these will be incorporated even if the nearest neighbour is not observing on a particular day. However this may cause some problems on days when there are only distant stations observing and these are given full weight because there are no close observations.

### Difference between the options

The difference between Option 5b and the daily estimates of each of the options was calculated.

Summary of daily differences between each option with Option 5b for temperature and precipitation

Mean | Standard Deviation | Median | Minimum | Maximum | 25–75 percentile range | 5–95 percentile range | |
---|---|---|---|---|---|---|---|

| |||||||

Difference(1-5b) | -0.01 | 0.66 | 0.00 | -8.83 | 6.77 | 0.44 | 1.96 |

Difference(2-5b) | 0.01 | 0.74 | 0.01 | -8.83 | 6.43 | 0.53 | 2.32 |

Difference(3a-5b) | -0.06 | 0.46 | -0.02 | -6.61 | 6.14 | 0.31 | 1.22 |

Difference(3b-5b) | -0.02 | 0.37 | 0.00 | -6.61 | 6.14 | 0.05 | 0.78 |

Difference(4a-5b) | -0.04 | 0.30 | -0.02 | -4.24 | 2.72 | 0.25 | 0.95 |

Difference(4b-5b) | 0.00 | 0.16 | 0.00 | -4.24 | 2.85 | 0.01 | 0.31 |

Difference(5a-5b) | -0.04 | 0.25 | -0.01 | -2.24 | 2.64 | 0.20 | 0.83 |

| |||||||

Difference(1-5b) | 0.08 | 2.57 | 0.01 | -64.54 | 115.40 | 0.80 | 5.63 |

Difference(2-5b) | 0.02 | 2.38 | 0.01 | -102.90 | 115.40 | 0.66 | 5.63 |

Difference(3a-5b) | -0.02 | 1.78 | 0.01 | -121.27 | 45.55 | 0.15 | 2.97 |

Difference(3b-5b) | -0.01 | 1.41 | 0.00 | -128.70 | 103.17 | 0.02 | 1.22 |

Difference(4a-5b) | -0.02 | 1.36 | 0.01 | -49.14 | 50.09 | 0.12 | 2.41 |

Difference(4b-5b) | 0.00 | 0.65 | 0.00 | -33.65 | 102.90 | 0.01 | 0.54 |

Difference(5a-5b) | -0.01 | 1.33 | 0.01 | -49.31 | 21.41 | 0.12 | 2.31 |

In Option 1 the mean of the temperature differences is negative, implying that this option estimates lower temperatures on average than Option 5b. On the other hand the mean of Option 2 differences is positive implying that this option estimates higher temperatures on average. In the precipitation estimates for options 1 and 2 the mean is positive, implying higher rainfall estimates than Option 5b. The range and standard deviation for the differences for these options is large implying that the results are broadly inconsistent with the Option 5b estimates.

For the inverse distance weighted options (3, 4 and 5a) in both precipitation and temperature the mean difference shows that there is a tendency for Option 5b to have higher values with Option 4b the closest to 5b and 3a the most different. However, for precipitation the median differences are all positive or zero, suggesting that the mean is affected by some extreme values where rainfall estimates by Option 5b are considerably higher than the comparative option.

For temperature both the median and the mean are negative or zero for all options apart from Option 2, which implies that Option 5b consistently estimates higher temperatures.

The differences between Option 5b rainfall and the inverse distance weighted options on the bottom row (3b and 4b) show that the difference generally does not vary when dealing with rainfall of greater magnitude (with a very few extreme exceptions such as a few precipitation differences greater than 100 mm). This implies that increasing the local weighting by squaring the distance changes estimates from Options 3 and 4.

### Regional differences

### Temporal differences

To see if there was variation in the daily differences from Option 5b during the year, the differences were also grouped by month. There were greater precipitation differences for both Options 3b and 4b in February 2002, a month with high rainfall in some parts of NSW, increasing the likelihood of greater differences.

The differences for daily temperatures grouped by month showed that the daily differences for Option 3b in the winter months June, July and August were more strongly negative than those in Option 5b.

## Discussion

The primary focus of this work is a comparison of options for calculation of weather exposure measures for health analysis of small area populations. The exposures were average temperature and rainfall, although other measures could be similarly compared including humidity, ultraviolet radiation, air pollution and other environmental exposures. The criteria used to assess the different options were: conceptually sound; computer time required; low variation across methods; and completeness of values at the daily POA level. The population weighted average of inverse distance weighted averages (Option 5) fulfilled these criteria best. The other geographic and population weighting methods performed similarly, and were quite close to the Option 5 estimates in most regions. The use of data from weather stations internal to the area or using neighbour allocation methods based on proximity polygons performed poorly. This was because the density of the monitoring stations is very low resulting in dependence on only a few observations to calculate values.

A possible limitation of Option 5 is that the population used to describe the fine resolution distributions were based on the August 2001 census enumeration counts. This single estimate may not be an accurate representation of the population at other times. If the data are available then the distribution of population at specific times could be taken into account in the calculations. As this study calculates weather estimates shortly after the census this issue will not greatly affect the application presented here. As the census is based on residence, the population distribution does not take account of frequent movement of people such as daily travel to other areas for work, which may differ by population. In the absence of such data describing movement of people, the census currently represents the best available data on population distribution.

The nearest neighbour method (using proximity polygons) allocates less monitoring stations to each POA and thus limits access to regional information and may give unrepresentative estimates. This also causes these methods to be susceptible to large gaps in the series. The problem of missing data in Options 1 and 2 could be resolved in a number of ways by imputation. However problems associated with having less monitoring station observations per POA cannot be easily dealt with in this approach.

The inverse distance weighting approaches incorporate information from many more stations. For this reason they are less susceptible to the gaps found in Options 1 and 2. However when no stations are close then far off stations are given full weight. We set the limit at 50 km. As some POA estimates were derived from stations almost this far from their centroids these values may be untrustworthy. A tighter search radius would reduce this, but would increase the number of missing values, while a larger radius would incorporate more values but potentially more unreliable data. Sensitivity analyses could be done to study the effect of the different cut-off levels.

Option 5b was based on localised population weighting. This gives higher estimates at greater rainfall magnitude than any of the other methods using non-squared distance weighting. In some coastal areas of Australia there is highly localised intense rainfall, which is the probable cause of this effect.

The inverse-squared-distance-weighting options (3b and 4b) decreased the influence of stations at a greater distance and gave more similar results to Option 5b.

## Conclusion

Daily temperature and rainfall estimates calculated by using data from internal sites or nearest neighbours (proximity polygon) methods give poor representations of local area weather patterns for health studies based on daily data. The weighting approaches using weather stations less than 50 kilometres from area centroids were considerably better in this regard and the majority of daily differences across the options were small. The extent of the differences depended to some extent on the climatology of the location of the spatial unit and the time of the year. For studies of human health in the Australian context the distance to a regional geographic centroid is not as precise as a population weighted centroid, as large areas of uninhabited land (and the weather of these areas) may not provide relevant information about weather exposures. The population weighted average of sub-unit inverse distance weighted estimates is the most conceptually appealing method applied here. However, it is more computationally intensive than simpler population weighted centroid estimates and there is little difference in daily average temperature and rainfall estimates.

## Methods

### Hardware and software

Options 1 to 4 were calculated on a desktop PC. Option 5 was performed on a Structured Query Language (SQL) server. GIS operations used ArcGIS 9.1 [40]. Microsoft Access was used to join the concordance table of POA-to-monitoring station proximity polygons to the daily observations, and averaged these whilst grouping by POA code and date in Options 1 and 2. Options 3 and 4 used "joinby" and "collapse" commands in STATA 8 [41] to join the distance weights with the daily observations, and Option 5 used the SQL server.

### Meteorological data

Individual station files of daily meteorological data for 1990–2005 were parsed for integration in MS Access databases using visual basic code written by Melissa Goodwin at the National Centre for Epidemiology and Population Health.

### Postcode/postal area populations and concordance

The CD populations from the 2001 census were obtained from the ABS [36]. These data were enumeration counts rather than area of usual residence which cost more.

Some postcodes don't exist as POA and for these the locality names were found using the online postcode finder from the electronic telephone directory [42]. These locality names were georeferenced using the online Geoscience Australia Place Name Finder [43] or the ABS 'Urban Centres and Localities' spatial boundaries (also CD aggregates from the ABS). These locations were then overlaid and intersected with the POA boundaries and given this code instead of their real postcode.

Multipart POA were assessed by first using the ArcGIS multipart to single-part tool (features toolbox) and then counting the number of parts per feature (using the frequency tool).

### Internal stations

Internal stations were found using the intersect tool in the ArcGIS Spatial Analyst extension. This information was joined to the meteorological data using Microsoft Access.

### Nearest neighbour

Nearest neighbour concordances were calculated by first creating proximity polygons of the appropriate stations (using the coverage tools), then overlaying and intersecting these with POA (using Spatial Analyst tools in ArcGIS).

### Distance

Centroids were calculated using the Visual Basic for Applications script from the ArcGIS help menu. Then distances were calculated using the coverage toolbox "point-distance" tool. The projection was set to Albers South Asia Conic (metres) projection. This is necessary to avoid the distortion of length inherent with other cartographic projections [44].

## Declarations

### Acknowledgements

IH was employed by the National Centre for Epidemiology and Population Health at the time this work was conducted. The Authors would like to thank Melissa Goodwin and Aaron Petty for assistance with programming, Agus Salim and Rosalie Woodruff for editorial advice, Graham de Hoedt, Neville Nicholls, Cathy Toby and Mike Manton from the Bureau of Meteorology for access to the meteorological data and general advice.

## Authors’ Affiliations

## References

- Hall G, and the OzFoodNet Working Group:
**Results from the national gastroenteritis survey 2001–2002.***National Centre for Epidemiology and Population Health Working Papers*Canberra , Australian National University 2004, [No. 50]. - Australian Bureau of Statistics:
**Statistical geography volume 2: census geographic areas (Cat. No. 2905.0) .**Canberra, Australia 2001. - Jones SD, Eagleson S, Escobar FJ, Hunter GL:
**Lost in the mail: the inherent errors of mapping Australia Post postcodes to ABS derived postal areas.***Australian Geographical Studies*2003,**41**(2):171–179.View Article - Jones D, Beard G:
**Verification of Australian monthly district rainfall totals using high resolution gridded analyses.***Australian Meteorological Magazine*1998,**47**(1):41–54. - Thiessen AH:
**Precipitation averages for large areas.***Monthly Weather Review*1911,**39:**1082–1084. - Aurenhammer F:
**Voronoi diagrams - a survey of a fundamental geometric data structure.***Computing Surveys*1991,**23**(3):345–405.View Article - Mills GA, Weymouth G, Jones D, Ebert EE, Manton M, Lorkin J, Kelly J:
**A national objective daily rainfall analysis system.**Melbourne, Australia , Bureau of Meteorology Research Centre 1997. - Cressie N:
**Geostatistical methods for mapping environmental exposures.***Spatial Epidemiology: Methods and Applications**(Edited by: Elliott P, Wakefield JC, Best NG, Briggs DJ).*Oxford , Oxford University Press 2000. - Moore K:
**Resel filtering to aid visualisation within an exploratory data analysis system.***Journal of Geographical Systems*2000,**2**(4):375–398.View Article - Bailey TC, Gatrell AC:
**Interactive Spatial Data Analysis.**Essex , Longman Scientific and Technical 1995. - Stillman ST, Wilson JP, Daly C, Hutchinson MF, Thornton P:
**Comparison of ANUSPLIN, MTCLIM-3D, and PRISM precipitation estimates.***Proceedings of the Third International Conference/Workshop on Integrating GIS and Environmental Modeling, January 21–25, 1996*Santa Fe 1996. - Shine JA, Krause PF:
**Exploration and estimation of North American climatological data.***Proceedings of "Computing Science and Statistics: Modeling the Earth's Systems: Physical to Infrastructural": April 5–8, 2000*New Orleans 2000. - Jolly W, Graham J, Michaelis A, Nemani R, Running S:
**A flexible, integrated system for generating meteorological surfaces derived from point sources across multiple geographic scales.***Environmental Modelling & Software*2005,**20:**873–882.View Article - Naoum S, Tsanis IK:
**Ranking spatial interpolation techniques using a GIS-based DSS.***Global Nest: The International Journal*2004,**6**(1):1–20. - Jeffrey SJ, Carter JO, Moodie KB, Beswick AR:
**Using spatial interpolation to construct a comprehensive archive of Australian climate data.***Environmental Modelling & Software*2001,**16**(4):309–330.View Article - Pardoiguzquiza E:
**Comparison of geostatistical methods for estimating the areal average climatological rainfall mean using data on precipitation and topography.***International Journal of Climatology*1998,**18**(9):1031–1047.View Article - Jalaludin B, Morgan G, Lincoln D, Sheppeard V, Simpson R, Corbett S:
**Associations between ambient air pollution and daily emergency department attendances for cardiovascular disease in the elderly (65+ years), Sydney, Australia.***Journal of Exposure Science & Environmental Epidemiology*2006,**16**(3):225–237.View Article - Bell ML:
**The use of ambient air quality modeling to estimate individual and population exposure for human health research: a case study of ozone in the Northern Georgia Region of the United States.***Environ Int*2006,**32**(5):586–593.View ArticlePubMed - Jerrett M, Arain A, Kanaroglou P, Beckerman B, Potoglou D, Sahsuvaroglu T, Morrison J, Giovis C:
**A review and evaluation of intraurban air pollution exposure models.***J Expo Anal Environ Epidemiol*2005,**15**(2):185–204.View ArticlePubMed - Jerrett M, Burnett RT, Ma R, Pope CA, Krewski D, Newbold KB, Thurston G, Shi Y, Finkelstein N, Calle EE, Thun MJ:
**Spatial analysis of air pollution and mortality in Los Angeles.***Epidemiology*2005,**16**(6):727–736.View ArticlePubMed - Dominici F, Peng RD, Bell ML, Pham L, McDermott A, Zeger SL, Samet JM:
**Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases.***JAMA*2006,**295**(10):1127–1134.View ArticlePubMed - Zandbergen PA, Chakraborty J:
**Improving environmental exposure analysis using cumulative distribution functions and individual geocoding.***Int J Health Geogr*2006,**5:**23.View ArticlePubMed - Hutchinson MF:
**Interpolation of mean rainfall using thin plate smoothing splines.***International Journal of Geographical Information Systems*1995,**9:**385–403.View Article - Thornton P, Running S, White M:
**Generating surfaces of daily meteorological variables over large regions of complex terrain.***Journal of Hydrology*1997,**190:**214–251.View Article - Mu L:
**Polygon characterization with the multiplicatively weighted voronoi diagram.***Professional Geographer*2004,**56**(2):223–239. - Rigol JP, Jarvis CH, Stuart N:
**Artificial neural networks as a tool for spatial interpolation.***International Journal of Geographical Information Science*2001,**15**(4):323–343.View Article - Zoppou C, Roberts S, Hegland M:
**Spatial and temporal rainfall approximation using additive models.***Australian & New Zealand Industrial and Applied Mathematics Journal*2000,**42**(E):C1599-C1611. - Hurley PJ, Physick WL, Luhar AK:
**TAPM: a practical approach to prognostic meteorological and air pollution modelling.***Environmental Modelling and Software*2005,**20**(6):737–752.View Article - Curtis DC:
**Storm sizes and shapes in the arid southwest.***Proceedings of the Arizona Floodplain Management Association Fall 2001 Meeting, Nov 8–9, 2001*Parker 2001. - Neteler M:
**Time series processing of MODIS satellite data for landscape epidemiological applications.***International Journal of Geoinformatics*2005,**1**(1):133–138. - Houlder D, McMahon J, Hutchinson MF:
**ANUSPLIN and ANUCLIM.**[http://cres.anu.edu.au/outputs/orderform-aust-print.php] - Queensland Government Department of Natural Resources and Mines:
**SILO Australian Daily Historical Climate Surfaces.**[http://www.nrm.qld.gov.au/products/cat_services.php?category=534&description=Digital+climate+data] - University of Montana Numerical Terradynamic Simulation Group:
**DAYMET Daily Surface Weather Data and Climatological Summaries.**[http://www.daymet.org/] - National Climate Centre of the Bureau of Meteorology Research Centre:
**Daily weather data for Bureau of Meteorology stations.**150 Lonsdale Street, Melbourne 3000, AUSTRALIA 2005. - Jenner A, Blanchfield F:
**Population estimates for non-standard geographical areas - practices, processes, pitfalls and problems.***Proceedings of the Joint AURISA and Institution of Surveyors Conference: 25 – 30 November 2002*Adelaide 2002. - Australian Bureau of Statistics:
**CDATA2001, census of population and housing data by age and sex for census collector's districts.**Canberra 2001. - Australian Bureau of Statistics:
**Postcodes and census data - factsheet.**Canberra 2001. - Blanchfield F. Director of Australian Bureau of Statistics Geography:
**Written communication regarding criteria for allocating CD-derived postal areas.**Canberra 2004. - Bureau of Meteorology:
**Bureau of Meteorology Rainfall Districts.**[http://www.bom.gov.au/climate/how/newproducts/images/raindist.pdf] - Environmental Systems Research Institute:
**ArcGIS 9.1.**Redlands 1999. [http://www.esri.com] - Stata Corporation:
**STATA statistical software versions 8.**College Station 2001. [http://www.stata.com] - Australian Whitepages:
**Australian Whitepages check a postcode.**[http://www.whitepages.com.au/wp/search/tools.jhtml] - Australian Government Geoscience Australia:
**Geoscience Australia place name search.**[http://www.ga.gov.au/map/names/] - Snyder JP:
**Map projections - a working manual.***US Geological Survey Professional Paper 1395*Washington , United States Government Printing Office 1987.

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.