- Open Access
Mapping outdoor habitat and abnormally small newborns to develop an ambient health hazard index
International Journal of Health Geographicsvolume 16, Article number: 43 (2017)
The geography of where pregnant mothers live is important for understanding outdoor environmental habitat that may result in adverse birth outcomes. We investigated whether more babies were born small for gestational age or low birth weight at term to mothers living in environments with a higher accumulation of outdoor hazards.
Live singleton births from the Alberta Perinatal Health Program, 2006–2012, were classified according to birth outcome, and used in a double kernel density estimation to determine ratios of each outcome per total births. Individual and overlay indices of spatial models of 136 air emissions and 18 land variables were correlated with the small for gestational age and low birth weight at term, for the entire province and sub-provincially.
There were 24 air substances and land sources correlated with both small for gestational age and low birth weight at term density ratios. On the provincial scale, there were 13 air substances and 2 land factors; sub-provincial analysis found 8 additional air substances and 1 land source.
This study used a combination of multiple outdoor variables over a large geographic area in an objective model, which may be repeated over time or in other study areas. The air substance-weighted index best identified where mothers having abnormally small newborns lived within areas of potential outdoor hazards. However, individual air substances and the weighted index provide complementary information.
A truly ecologically-based study of health integrates habitat, population, and behavior—encompassing a more complete geography as framed by Meade’s triangle of human ecology [1, 2]. Three vertices conceptualize what is known about an important pediatric topic: maternal exposure to outdoor pollution and neonatal outcomes (Fig. 1). Here we focus on the lesser studied habitat vertex, specifically the outdoor environment, since less attention is traditionally given to incorporating ecological factors in understanding disease . The location aspect of habitat—where pregnant women live, where industry and services are situated, where demographic groups congregate—is important because where one lives and where one starts out in life, even during fetal development, ultimately influences lifelong health [3,4,5,6].
Toxicant exposures and environmental influences on mothers during crucial stages of pregnancy may result in newborns that are too small or born too early. Adverse birth outcomes (ABO) are important markers of infant survival, development and future health. Our research focuses on being born too small, clinically defined as Small for Gestational Age (SGA) when newborns are below the 10th percentile weight based on sex and weeks of pregnancy, or Low Birth Weight at Term (LBWT) when newborns are less than 2500 g weight at term, 37 or more weeks gestation .
The province of Alberta, Canada, had a population of 3,645,257 at the 2011 Census . That was a 10.8% increase from 2006 while the national average increase was 5.9%. For a land area of 640,082 km2, the population density was 5.7 persons/km2, where 83% of the population lived in or near urban centers. Alberta’s economic activities were focused on agriculture, natural resources, and nonrenewable energy—having a higher number of industrial facilities reporting to the National Pollutant Release Inventory (NPRI) than any other province or territory . The NPRI is a valuable data source on industrial-based pollutants . Alberta also has higher ABOs: SGA was 8.8% (Canada was 8.4%); and low birth weight for all gestational ages was 6.7% (Canada was 6.0%) . Alberta rates also increased during 2000–2014: SGA from 10 to 11.5%; and low birth weight for all gestational ages from 6.1 to 6.7% .
ABO complications include death, physical and cognitive disabilities, and chronic health problems later in life, costing emotional stress and the majority of the health care expenses among all newborns . Disorders related to short gestation and low birth weight are consistently ranked as the 2nd leading cause of infant death (congenital deformation is the leading cause) —and have increased in Canada since the year 2000 [11, 12].
Abnormally small newborns are the result of growth restriction, which may be due to environmental pollutants thought to cause inflammation in mothers, direct toxic effects on the placenta and the fetus, interruption of oxygen-hemoglobin, or DNA damage represented by the formation of DNA adducts [15, 16].
The environment includes social, built, and natural features. Individual risks are also very important to ABOs, but are neither readily available nor easily mapped. These include personal, behavioral, social, and indoor exposures, such as: adequate prenatal care; food type and contaminants; rest, stress, and pre-existing health conditions; occupation and socioeconomic status; smoking and other substance use; drinking water contaminants. Our focus is on the outdoor environmental habitat because it is a common source of shared exposures susceptible to regulation (biology and behavior are not). These include air, water, human-constructed, and natural outdoor hazards, such as: industrial emissions; traffic pollution; agricultural chemical inputs of pesticides, herbicides, and fertilizers; electromagnetic radiation; proximity to oil and gas extraction activities; wildfire smoke.
Environmental health research has found many environmental factors to be associated with various health outcomes [17,18,19,20,21,22,23,24,25]. However, these are typically explored singly: one exposure or category of exposure at a time. A unified environmental measure may be constructed across multiple variables to encompass the complex nature of the overall environment.
Environmental indices have history: Inhaber had proposed an integrated national index for Canada in the 1970s . Rather than relying on individual pollutants to reflect the state-of-the-environment, Inhaber mathematically combined such indicators for the purpose of resource allocation, ranking of locations, enforcement of standards, trend analysis, public information, or scientific research . Under that premise, Messer et al.  developed a California county-level environmental quality index using principal components analysis (PCA) to calculate 5 environmental domains (air, water, land, built, and sociodemographic), which were then combined into a single index using PCA on the first components, and stratified by rural–urban continuum codes. Similarly, Messer et al.’s CalEnviroScreen 2.0 [29, 30] superimposed 19 individual indicators that related to pollution exposures, environmental conditions, and population characteristics, weighted and summed each set of indicators, and then multiplied together pollution and population (i.e. Threat × Vulnerability = Risk). We have not found similar environmental health indices available for Canada, or the province of Alberta, and especially none focused primarily on maternal exposures associated with ABO.
Using a Geographical Information System (GIS), we developed a simplified and reproducible index for Alberta by estimating and aggregating pollutants from communal outdoor factors. GIS supports the inclusion of diverse data and enables modelling of hazard-exposure-dose–response processes in space [31, 32]. To capture the relevant pollutant estimates, spatially and temporally appropriate GIS data files were overlaid to develop a vulnerability map of combined disparate (in theme and measurement units) environmental factors, similar to Messer et al. [28, 29]. The index will aid our examination of maternal ambient health hazards and abnormally small newborns by providing a relative ranking of locations across the province that are not limited by administrative boundaries.
Our research is part of the Data Mining and Neonatal Outcomes (DoMiNO) project that is exploring the collocation of adverse birth outcomes and environmental variables in Canada . For our geographical perspective on the project we hypothesized that SGA or LBWT babies are more likely to be born to mothers living in environments with a higher number of outdoor hazards (especially pollutants) than in relatively healthier habitats with fewer exposure hazards. Our objective was to examine how the separate and combined exposures to the outdoor built-up, natural, and social environments of pregnant mothers coincided with patterns in adverse birth outcomes (ABO). We also expected that the large Alberta province would have regional variations in the outdoor environment and investigated this effect on the associations.
We used Esri’s ArcGIS Desktop 10.5 software to perform all spatial database processing, management, distribution analyses, hazard estimations, and index calculations . Proximity was extremely important in our spatial analysis; therefore, we customized an Alberta-focused map projection, based on the following parameters: name Azimuthal Equidistant; central meridian − 113.5; latitude of origin 53.5; linear unit meter (1.0); and geographic coordinate system (GCS) datum North American 1983 (NAD 1983). We projected all GIS data to this distance-preserving spatial reference.
For raster files we used a 250 m by 250 m cell size to reasonably represent both urban and rural areas in the very large study area, and to match the coarsest dataset: MODIS Terra satellite .
Because Alberta is landlocked, we included data features within 50 km surrounding the provincial boundary where available: by doing so, any potential pollutant source close to the outer edge of the province was included.
We produced sub-provincial maps of the percent ratios for each ABO to facilitate comparisons more meaningful to health care and environmental management. We assigned administrative attributes to postal code locations. This allowed grouping by health region  or airshed zone  because both are health-related administrative boundaries that help identify where there may be different outdoor factors of importance.
Health regions are designated by the provincial Ministry of Health to identify geographic areas where hospital boards or regional health authorities administer and deliver public health care, and are subject to change . At the start of our study period, there were 9 health regions for Alberta (Table 2): Chinook Regional Health Authority (4821); Palliser Health Region (4822); Calgary Health Region (4823); David Thompson Regional Health Authority (4824); East Central Health (4825); Capital Health (4826); Aspen Regional Health Authority (4827); Peace Country Health (4828); and Northern Lights Health Region (4829).
Airshed zones are endorsed by the multi-stakeholder Clean Air Strategic Alliance (CASA) to identify geographic areas where the air quality is similar in emission sources, volumes, impacts, dispersion and administrative characteristics . Because Alberta has several unique topographical, meteorological, or ecological conditions for resolving air quality, there are 9 airsheds currently recognized (Table 2): Alberta Capital Airshed Alliance (ACAA); Calgary Region Airshed Zone (CRAZ); Fort Air Partnership (FAP); Lakeland Industry and Community Association (LICA); Palliser Airshed Society (PAS); Parkland Airshed Management Zone (PAMZ); Peace Airshed Zone Association (PASZA); West Central Airshed Society (WCAS); and Wood Buffalo Environmental Association (WBEA). It is important to note that the entire province is not monitored by airshed zones, with the southwest corner, east-central, and majority of the north having no airshed (NA).
The Alberta Perinatal Health Program (APHP) provided anonymized data for the province of Alberta, from 2006 to 2012 . We obtained ethics approval from the Research Ethics Board at the University of Alberta and the APHP.
We selected for live single births between 22 and 42 weeks gestation, and geocoded them to the centroid of the 6-character postal code of the mothers’ residences at the time of the birth registration. DMTI Spatial’s Platinum Postal Code Suite  provided the longitude and latitude coordinates for the years 2001–2013, which we uniquely selected to guarantee static locations through the entire study period. 95% of the original data had valid coordinates for use in spatial analyses. Using the previous definitions, we classified the birth records as binary variables identifying SGA or LBWT. Details are available in Serrano et al. .
To eliminate the confines of arbitrary administrative boundaries, we followed the double kernel density (DKD) method [43,44,45,46,47,48] to calculate distributions of SGA and LBWT, normalized by all births. DKD involves kernel density estimation—a non-parametric method that spreads point values across a surface by calculating the magnitude-per-unit area from points (representing the counts of birth events), fitted to a smoothly tapered function that spreads the values within a specified distance (25 km for this study) around each point . Points within the radius that are further from the center are weighted lower than those closer, and helps indicate “hot spots”. Dividing each ABO by the kernel density of total births yielded ratios of the birth outcome that also masked locations of the residences, helping protect privacy.
Personal maternal monitoring data were not available for this retrospective study. We used landscape features as spatial proxies of exposure hazards, as done in previously published research . In total, we chose 18 outdoor sources, identified in published studies [17,18,19,20,21,22,23] or added for novel exploration (10 built; 5 social; 3 natural) plus 136 industrial air substance emissions. Table 1 lists the environmental variables and indicates specific characteristics and processing details.
We applied kernel density to spread industrial emissions from the NPRI database as tonnes per area within a 10 km radius (based on distances determined from the project’s data mining algorithm ). We used the count of other point features—industrial facilities, gas stations, waste/landfills, oil/gas well pads, food stores, and health care/hospitals—in kernel density to calculate the number per area within a 3-km radius. We also applied kernel density to roads and electrical power lines to calculate length per area within a 3-km radius. A main advantage of using kernel density is it accounts for distance decay (features have less influence further away). When linear features are the input it also helps to approximate the number of intersections—important when analyzing pollution sources from roads because vehicles idle at intersections.
For areal features, we used focal statistics, also known as moving-window or neighborhood analyses, on binary surfaces of feedlots, mine sites, cultivated lands, aboriginal lands, water/blue space, and wildfires. The mean statistic on binary values of 1, indicating presence of the feature, and 0, indicating absence, yielded proportions. For vegetation/naturalness, the mean statistic returned the mean Normalized Difference Vegetation Index (NDVI), where higher values identify more chlorophyll-producing healthy green vegetation captured by the satellite imagery pixels. Except for the 50-km wildfire radius, all others had a 3-km radius. We accepted the original values for the coarser resolution nighttime lights and area-based, neighborhood-level socioeconomic index.
Spearman’s rank correlation
We joined values from the DKD distributions and each independent variable surface extracted to unique postal codes where births occurred. Our data were non-normally distributed due to many zero values in both the dependent and independent variables. We used Python 2.7 software  with the pandas 0.16 site package  to calculate Spearman’s rank correlations among ABO and each environmental variable. To test the association of the combined environmental factors, we calculated a second set of Spearman’s rank correlations using DKD values to test the indices. Correlation was calculated for the entire province and aggregated by sub-provincial unit.
Overlay analysis is a simple and reproducible method to combine several inputs into a single output . It is most common for optimal site selection and suitability modeling, especially for mapping habitat. The class values represent rankings from higher to lower suitability or risk. In our study, we applied it to essentially map “reverse suitability” to identify maternal ambient health hazards.
Because the values of continuous surfaces varied in measurement units, we standardized them into a similar ratio scale by reclassifying the environmental variables into five standard classes using quintiles. The ordering of the reclassification corresponded with the direction of the correlation: most were straightforward but if the variable was negatively correlated then the reclassification was applied in a backwards fashion; e.g. vegetation, water, and socioeconomic status classes were ranked 5–1 because lower original values were considered to be more hazardous. We calculated the sub-indices as weighted sum overlays with equal weightings on air substances and land-based sources separately, which were then overlaid together. We were interested in preserving the combined effects of the industrial air substances; therefore, in addition to an equal weighted sum of both, we also approximated a conservative two-thirds (0.7) weighting to the air substances summed with a one-third (0.3) weighting of the land-based sources. In the two different indices—Overlay Equal and Overlay 0.7/0.3—the class rankings were accumulations that represent where the study area had more environmental hazards.
Overall, the reverse suitability indices were calculated by modeling each individual pollutant surface using distance-centered analyses (i.e. kernel densities and focal statistics), reclassified into quintiles of class rankings, and overlaid as weighted sums. The detailed GIS methods for the map-based calculations of all the independent variables and subsequent indices are specified in Table 1 (i.e. features, methods, and radii) and shown graphically in Fig. 2.
Spatial distribution of adverse birth outcomes
Table 2 shows raw counts of births, SGA, and LBWT, based on valid postal codes. For 2006–2012, the entire province of Alberta had 333,247 births with a valid spatial location (95% of total registered), allocated to 53,399 postal codes. 29,679 geocoded births were classified as SGA (8.9%) and 5485 were classified as LBWT at term (1.6%).
Figure 3 depicts the percentages of ABO for each sub-provincial unit relative to Alberta (marked by *). For health regions, SGA ranged from 7.0 to 10.3% and LBWT ranged from 1.2 to 1.9%. Health region 4823 had the highest number of births (n = 121,965), highest SGA (n = 12,543, 10.3%), and highest LBWT (n = 2339, 1.9%); 4825 had the lowest number of births (n = 9737), SGA (n = 698, 7.2%), and LBWT (n = 140, 1.4%); but 4827 had the lowest SGA (n = 1252, 7.0%) and LBWT (n = 222, 1.2%). For airshed zones, SGA ranged from 6.5 to 10.3% and LBWT ranged from 1.0 to 1.9%. Airshed zone CRAZ had the highest number of births (n = 120,392), SGA (n = 12,409, 10.3%), and LBWT (n = 2310, 1.9%); FAP had the lowest number of births (n = 3.547) and LICA had the lowest SGA (n = 293, 6.5%) and lowest LBWT (n = 43, 1.0%).
The distributions of births per area in Fig. 4 show higher concentrations of more than 3 births per km2 in the sub-provincial units containing the major cities of Edmonton and Calgary, with medium densities in the adjacent units and in the airshed zones containing Grande Prairie (west-central) and Cold Lake (east-central).
The patterns differ by sub-provincial unit for ABOs mapped as numbers per births (Fig. 5). SGA is highest in the units containing Edmonton and along the west–east Banff–Calgary-Brooks corridor. Health regions have medium SGA adjacent to the high SGA. Airsheds also show medium SGA in the west and north-east. LBWT is highest in the north–south Edmonton-Red Deer-Calgary corridor. Medium LBWT is adjacent to the higher units, except for the northern health regions containing Grande Prairie-Peace River and Fort McMurray-Fox Lake. The lower LBWT in the central health region 4827 separates the province; LBWT in the airshed containing Cold Lake is the lowest in the province.
Figure 6 maps the results of the DKD method for each ABO. Both ABOs cover the same areas of the province and the darker colors indicate higher values for SGA (purple) and LBWT (green). The result of DKD is a continuous value, but the maps classified with tertiles visually enhance the slightly different distributions for SGA and LBWT: urban (Edmonton and Calgary) areas shared highest values for both ABO; central areas had more LBWT; and southeast areas had more SGA.
The Spearman’s rank correlation values were sorted in descending order for each of the independent variables (Table 3). Provincially, variables having correlations greater than 0.40 (low value accepted since data were not adjusted for epidemiological factors because they were not available for mapping) with SGA were: i-Butyl alcohol (rho = 0.56); Asbestos; Nighttime Light; Toluenediisocyanate; Toluene-2,4-diisocyanate; Toluene-2,6-diisocyanate; Chromium Aluminum; Hydrogen sulphide; Road; 2-Ethoxyethanol; *Nickel; Quinoline; Aniline; Cyclohexane; Acetaldehyde; and *Phosphorus (rho = 0.42). Variables with correlations greater than 0.40 with LBWT were: i-Butyl alcohol (rho = 0.54); Asbestos; Toluenediisocyanate; Toluene-2,4-diisocyanate; Toluene-2,6-diisocyanate; Aluminum; Chromium; Nighttime Light; Hydrogen sulphide; 2-Ethoxyethanol; Quinoline; Aniline; Road; Cyclohexane; Acetaldehyde; *Isopropyl alcohol; and *Ethylene oxide (rho = 0.41). Both ABOs were strongly associated with 15 air substances (the asterisk * marks those that differed: Nickel and Phosphorous for SGA; Ethylene oxide and Isopropyl alcohol for LBWT) and 2 land sources (both Nighttime Light and Road). Both ABOs had negative correlations (< − 0.40) with Vegetation (SGA rho = − 0.56; LBWT rho = − 0.48), Oil/Gas Wellpad (SGA rho = − 0.53; LBWT rho = − 0.49), and Cultivated Land (SGA rho = − 0.47; LBWT rho = − 0.41).
The dilution effect of spreading the hazards across the large study area highlighted regional importance. Using the criteria of four or more health regions having a rho greater than 0.40 indicated the importance of Nitrogen oxides, Sulphur dioxide, Particulate Matter less than or equal to 2.5 μ (PM2.5), and Acetaldehyde with SGA. The same criteria identified Xylene, Mine Site, Manganese, and Lead for LBWT. Four or more airshed zones having a rho greater than 0.40 highlighted Sulphur dioxide and Acetaldehyde with SGA, and Xylene, Particulate Matter less than or equal to 10 microns (PM10), and PM2.5 for LBWT.
The number of unique environmental variables having rho values greater than 0.40 province-wide or within four or more sub-provincial units totaled 30 (24 air substances and 6 land-based).
Spatial distribution of the indices
Figure 7 maps the results from the weighted overlay sum of the five class rankings for 136 emitted air substances, 18 land sources, the overlay equal weighting of both, and the Overlay 0.7/0.3 weighting of air substances and land. The distribution of the higher rankings spatially coincides with Alberta’s populated places, except for higher values along the foothills, the Fort McMurray oil sands area in the north, and some scattered areas in the northeast. Quantile class breaks were used to visualize the contrast of higher to lower areas.
Associations with the hazards and indices
The actual index values were used for the correlations with ABO DKD (Table 4). The correlations of the overlay indices with ABOs were very low for the entire province. The Air Substances were highest for both SGA (rho = 0.21) and LBWT (rho = 0.16). Land Factor correlations were slightly negative for SGA (rho = − 0.26) and LBWT (rho = − 0.23). Both overlay indices were lower than the Air Substances for SGA: Overlay Equal had a rho = 0.18 and Overlay 0.7/0.3 had a rho = 0.15. Overlay Equal was lower for LBWT (rho = 0.13) but Overlay 0.7/0.3 was higher (rho = 0.20).
Figure 8 displays index correlations with ABOs, by health region and airshed zone. In the graph symbols, longer bars mean greater association and bar direction designates positive (up) or negative (down). The air substances and land-based sources were included to demonstrate how much of an effect each had on the indices. The following indices had correlations greater than 0.40 with an ABO:
Air Substances with SGA in four health regions—4829 (rho = 0.85), 4828 (rho = 0.67), 4826 (rho = 0.55), 4823 (rho = 0.42); and with LBWT in three health regions—4828 (rho = 0.73), 4826 (rho = 0.59), and 4823 (rho = 0.56).
Air Substances with SGA in four airshed zones—WBEA (rho = 0.89), PASZA (rho = 0.57), ACAA (rho = 0.55) and CRAZ (rho = 0.42); and with LBWT in four airshed zones—LICA (rho = 0.85), PASZA (rho = 0.66), ACAA (rho = 0.60), and CRAZ (rho = 0.56).
Land sources were weakly associated with both SGA and LBWT in most health regions and airshed zones.
Overlay Equal index with SGA in four health regions—4828 (rho = 0.58), 4826 (rho = 0.54), 4823 (rho = 0.42), and 4827 (rho = 0.42); and with LBWT in four health regions—4828 (rho = 0.0.63), 4826 (rho = 0.0.59), 4821 (rho = 0.57), and 4823 (rho = 0.57).
Overlay Equal index with SGA in three airshed zones—ACAA (rho = 0.55), PASZA (rho = 0.45), and CRAZ (rho = 0.42); and with LBWT in three airshed zones—ACAA (rho = 0.60), CRAZ (rho = 0.57), and PASZA (rho = 0.51).
Overlay 0.7/0.3 index with SGA in four health regions—4829 (rho = 0.75), 4828 (rho = 0.62), 4826 (rho = 0.55), and 4823 (rho = 0.42); and with LBWT in four health regions—4828 (rho = 0.68), 4826 (rho = 0.59), 4823 (rho = 0.57), and 4821 (rho = 0.51).
Overlay 0.7/0.3 index with SGA in four airshed zones—WBEA (rho = 0.78), ACAA (rho = 0.55), PASZA (rho = 0.50), and CRAZ (rho = 0.42); and with LBWT in four airshed zones—LICA (rho = 0.60), ACAA (rho = 0.60), PASZA (rho = 0.59), and CRAZ (rho = 0.57).
The health regions having the least association with SGA were 4821, 4822, 4824, and 4825; with LBWT these were 4822, 4824, 4827, and 4829. The airshed zones having the least association with SGA were FAP, PAMZ, PAS, and WCAS; with LBWT these were FAP, PAMZ, PAS, and WBEA.
SGA and LBWT were negatively correlated with all indices in health region 4822 and two airshed zones (PAS, FAP). The negative association also occurred in health region 4825 and airshed zone WCAS, but a higher positive correlation occurred with the Land Sources.
Using the criteria of correlations higher than 0.40, the Overlay 0.7/0.3 index had the highest overall count of sub-provincial units—both ABOs represented by at least 4 health regions and 4 airshed zones.
Of 136 NPRI substances reported in Alberta, 24 air-emitted substances had moderate correlations with one or both ABO DKD ratios. Of these, 2-Ethoxyethanol and Lead are recognized developmental toxicants [53, 54]. Acetaldehyde, Aluminum, Ethylene oxide, Isopropyl alcohol, Nickel, Nitrogen oxides, PM10, PM2.5, Sulphur dioxide, Xylene, Chromium, Hydrogen sulphide, Manganese, Phosphorus, and Quinoline are suspected developmental toxicants, with more than half of the air substances associated with decreased fetal/offspring weight in animal studies [53, 54]. The following air substances are neither recognized or suspected as no studies were reported: Aniline, Asbestos, Cyclohexane, i-Butyl alcohol, Toluene-2,4-diisocyanate, Toluene-2,6-diisocyanate, and Toluenediisocyanate (note: the latter three have been combined in later versions of the NPRI database ).
Of the 18 land sources mapped, 6 had moderate correlations with one or both ABOs. Provincially, Cultivated Land was negatively associated with SGA and LBW (likely because residences were not inside agricultural fields), but some regions were positive, similar to the Almberg et al.  study on proximity to pesticide-treated agricultural fields. Proximity to Mine Sites were associated for 2–3 health regions or airsheds; a related study found positive association for a single mine site indicating this is likely a more localized factor . Nighttime Lights have not been explored with ABOs; however, breast cancer, which has other similar exposures, has a positive association [44, 57]. The smaller area airsheds showed high correlations of ABOs with Oil/Gas Wellpads, but was negative for the entire province and by health regions; mixed associations were also reported by Mckenzie et al.  and Casey et al. . The moderate to higher correlations of Roads match much published research on the effect of maternal proximity to roads [60, 61]. Green or natural Vegetation was negatively correlated at the provincial level, but very mixed within health regions and airsheds; the sub-provincial dissimilarity with other studies [62, 63] was likely affected by the radii, resolution of the satellite sources, and the widely varying ecoregions in the province.
Ambient hazard indices
Both indices identified where there was an accumulation of hazards and therefore directly addressed the hypothesis that there were more small newborns where there were more outdoor hazards during the mothers’ pregnancies. Since we were interested in preserving combined effects that the industrial air substances contributed to the outdoor environment, we weighted the sum of those more highly than the sum of all the land-based sources. Province-wide, the Overlay Equal index better identified SGA and the Overlay 0.7/0.3 better identified LBWT.
Differences in index associations were likely due to the spatial distributions (i.e. DKD) of the ABOs. Both SGA and LBWT showed that hot spots did not occur strictly within the large urban centers. Calgary and Edmonton exhibited higher ratio classes, but not for their entire core. The peripheral edges of the Calgary-Red Deer corridor, the communities along the Banff-Calgary-Brooks corridor, the Fort McMurray surroundings, and the northern Fox Creek area were high for both SGA and LBWT. Jasper and south-east Alberta had higher SGA, while the communities west and east of Edmonton had higher LBWT. The distributions of the type of ABO spatially varied across the province—differences that may have been due in part to population and behavior, but also visually collocated with the higher amounts of outdoor hazard mapping.
Separately, the air substances and land sources varied in association with the ABO distributions. On the provincial scale, there were 13 hazards spatially related to both the SGA and LBWT ratios. Assessing the relationships sub-provincially found many more factors involved, including those already supported in the scientific literature, including: nitrogen oxides, particulate matter (PM2.5 and PM10), and sulphur dioxide.
Despite the disparate boundaries, spatially corresponding health regions (HR) and airshed zones (AZ) had comparable patterns in spatial relationships to the hazard indices. HR 4822/AZ PAS had highly negative correlations with all indices, suggesting that factors other than the outdoor environment may be more important in these regions. HR 4829/AZ WBEA exhibited opposite correlations with indices: SGA was positive and LBWT was negative. HR 4826/AZ ACAA and FAP and HR 4823/AS CRAZ for SGA and LBWT were positively correlated with the indices—these are the more populated regions. 4828/PASZA also had positive index correlations with SGA and LBWT. HR 4824/AZ PAMZ for SGA was positive with the indices, but for LBWT had no association. The reverse was found in HR 4821 (no corresponding AZ), where SGA was negative and LBWT was positive. HR 4825 had no relationship with the ABOs, and AZ LICA had no association with SGA and a positive one with LBWT. HR 4827 and AZ WCAS are too large and diverse to compare. Inconsistent relationships for each ABO with the indices may be due to: (1) the variable geography within the administrative boundaries; (2) differences in etiology of the ABOs; and/or (2) the actual distribution of each ABO exhibiting slightly different patterns: SGA and LBWT appear to be more of a heartland issue.
The combination of the outdoor hazards into a single index were very weakly associated with SGA and LBWT provincially. This was not surprising given Alberta includes forestry, agriculture, and energy extraction activities, thus yielding diverse “pockets” of different pollutants. Analyzing smaller geographic areas, based on health regions or airsheds, helped recognize possible differences in the outdoor environmental factors.
The large area of some units may capture populations that are more similar in size to the smaller units, but the environmental variability may have diluted the effects of hazards. The sub-provincial units that had negative correlations will need further analysis to determine the regionally important hazards. Relationships found here show that province-wide (i.e. large region) approaches to outdoor hazards may be inappropriate or inefficient. Where health regions and airshed zones are more similar, policy and monitoring may be more agreeable.
Existing ambient hazard indices are not available for comparison. Environmental Quality Indices (EQI), such as those developed by Messer et al.  and Stieb et al.  depict the state-of-the-environment from actual measured conditions . The Air Quality Health Index (AQHI) by Stieb et al. does a very good job at aggregating the monitored criteria air contaminants for risk communication. Messer’s EQI was associated with pre-term birth , but still has the limitation of fixed administrative units. And because a main goal was a continuous index, we were unable to incorporate an effective rural classification without the introduction of administrative boundaries, as done by Messer et al. Our more ecologically-encompassing index incorporated industrial air pollutants and land-based sources, similar to the holistic model developed for a single urban area by Tarocco et al. .
We analyzed the entire registered birth population for the study period that had valid locations. The 6-character postal codes provided good accuracy for urban neighborhoods, especially within the context of the 250-m cell size, but the rural residences were not as exact. DMTI Spatial had applied algorithms to weight the postal code local delivery area centroid toward the more populated communities , but that did not guarantee an actual residence contained within the cell. The problem of rural resolution was exhibited by oil–gas wellpads and agricultural land that may be closer to actual residences, but postal codes were not accurate enough due to too large of delivery areas for the centroids.
Although there is concern that the mother did not live at that postal code for the entire pregnancy, previous research determined low mobility during pregnancy and any relatively short distances moved did not substantially change the exposure assignment .
The spatial data for the independent variables were restricted to publicly available sources that may not have had the most temporally appropriate capture date of the mapped features. We also did not have access to reliable province-wide data for other possible environmental factors, such as water quality, noise, or non-industrial pollution sources. And as suitable as the NPRI data were, the values were annually reported estimates and not actual measurements . Despite these shortcomings, the available data provided an as inclusive as possible foundation for the index.
Many of the GIS methods involved the selection of radius distances. The size of the radius used in calculating the DKD affected how “hot” an area appeared, and may have exaggerated the extent for large distances; the 25-km radius may have been too large for rural communities with diverse topographies. When estimating air-emitted pollutants, wind would have varied by season and throughout the years; therefore, the use of circular shapes in calculating the tonnes per area may not have accurately reflected wind-dispersion for some areas. The conservative 10 km radius for spreading the air substances may have remedied this for upwind locations, but potentially underrepresented it for downwind locations. For the index, not all variables may be equally important, but the use of expert judgment would have introduced subjectivity that was not reproducible. Therefore, the equal treatment of the air substances and land sources in the overlay analyses was used.
The correlation threshold value of 0.40 may have overrepresented the inclusion of some of the independent variables. The choice of this statistical threshold was based on inspection of the data to ensure that a wide variety of hazards would be represented and not erroneously overlooked due to the modifiable areal unit problem introduced by the boundaries of the sub-provincial units .
It is important to stress that our research was not able to find causal relationships, but identified where outdoor environmental hazards collocate with residences of mothers who gave birth to abnormally small newborns.
The calculations of the outdoor environmental variables were continuous and covered the entire study area. Therefore, the DKD calculation of the SGA and LBWT ratios was appropriately consistent because it also was not confined to arbitrary geographical boundaries. Aggregation early in the analysis would have produced an inflexible distribution of the ABOs. The introduction of health regions and airsheds afterward allowed for scenario investigations relevant to health care administration, policy implications, and airshed monitoring.
The primary outdoor pollutants associated with abnormally small newborns agreed with published research, but additional unstudied air substances were discovered. For many regions, the reduction of data into a single index was achievable.
The development and application of the ambient health hazard indices for any study area, any time period, and where relevant data are available is simplified by the reverse suitability approach in a standard GIS. The distance-centered methods and weighted sum overlay, commonly used in wildlife habitat studies, are also relevant to human habitat related to various environmental health outcomes.
This is to date the first study on abnormally small newborns that used a combination of multiple outdoor variables over a large geographic area. Our results showed that SGA and LBWT varied sub-provincially with outdoor environmental factors, suggesting that provincial government should be aware of multiple sources of place-dependent exposures. Summing up class rankings of hazards provided a simple model for correlating with the sub-provincial distributions of ABO. There were regions/airsheds that were higher than the national and provincial rates. The temporal nuances had been masked by combining all years: spatial patterns in the hazards and birth outcomes likely varied through time; therefore, future research should consider the timing of exposures. Research should also combine the vertices of habitat, population, and behavior to investigate the complex interactions of the outdoor hazards found here by including maternal characteristics revealed in traditional epidemiological studies. We found that the industrial air substances were important—and the Overlay 0.7/0.3 weighted index had the most associations in the sub-provincial units. Therefore, both the individual air substance associations and the convenient single-measure index provide complementary information to move us toward a better understanding of the links between the outdoor environment and birthweight. Mapping the outdoor environmental hazards for mothers giving birth to abnormally small newborns provides insight for preventative or remedial recommendations where they may be needed to help determine healthier futures.
adverse birth outcome
Alberta perinatal health program
data mining and neonatal outcomes
double kernel density
geographical information systems
low birth weight at term
national pollutant release inventory
small for gestational age
Meade MS, Emch M. Medical geography. 3rd ed. New York: Guilford Press; 2010.
Meade MS. Medical geography as human ecology: the dimension of population movement. Geogr Rev. 1977;64:379–93.
Barker DJ. The fetal and infant origins of adult disease. BMJ Br Med J. 1990;301:1111.
Barker DJP. Maternal nutrition, fetal nutrition, and disease in later life. Nutrition. 1997;13:807–13.
Barker DJP. The developmental origins of chronic adult disease. Acta Paediatr Suppl. 2004;93:26–33.
Barker DJP, Osmond C. Infant mortality, childhood nutrition, and ischaemic heart disease in England and wales. Lancet. 1986;327:1077–81.
Kramer MS. The epidemiology of adverse pregnancy outcomes: an overview. J Nutr. 2003;133:1592S–6S.
Statistics Canada. Population, urban and rural, by province and territory, Alberta [digital data]. Statistics Canada, Government of Canada; 2011. Available from: http://www.statcan.gc.ca/tables-tableaux/sum-som/l01/cst01/demo62j-eng.htm.
Environment Canada. National pollutant release inventory, 2005–2012 [digital data]. Natl Pollut Release Invent. 2014. Available from: https://www.ec.gc.ca/inrp-npri/.
Wine O, Hackett C, Campbell S, Cabrera-Rivera O, Buka I, Zaiane O, et al. Using pollutant release and transfer register data in human health research: a scoping review. Environ Rev. 2014;22:51–65.
Statistics Canada. Table 102-4318—Birth-related indicators (low and high birth weight, small and large for gestational age, pre-term births), by sex, three-year average, Canada, provinces, territories, census metropolitan areas and metropolitan influence zones, occasional. Can Socio-Econ Inf Manage Syst. 2014. Available from: http://www5.statcan.gc.ca/cansim/a05?lang=eng&id=01024318.
Alberta Health. Alberta interactive health data application—reproductive health—singleton small for gestational age percent by geography and low birth weight percent by geography, 2000–2014 [digital data]. Interact Heal Data Appl. 2016. Available from: http://www.ahw.gov.ab.ca/IHDA_Retrieval/ihdaData.do.
Canadian Institute for Health Information (CIHI). Too early, too small: a profile of small babies across Canada. 2009. Available from: https://secure.cihi.ca/free_products/too_early_too_small_en.pdf.
Statistics Canada. Table 102-0562—Leading causes of death, infants, by sex, Canada, annual, 2006–2012 [digital data]. Can Socio-Economic Inf Manage Syst 2012. Available from: http://www5.statcan.gc.ca/cansim/pick-choisir?lang=eng&searchTypeByValue=1&id=1020562.
Shah PS, Balkhair T. Air pollution and birth outcomes: a systematic review. Environ Int. 2011;37:498–516.
Vadillo-Ortega F, Osornio-Vargas A, Buxton MA, Sánchez BN, Rojas-Bracho L, Viveros-Alcaráz M, et al. Air pollution, inflammation and preterm birth: a potential mechanistic link. Med Hypotheses. 2014;82:219–24.
Maisonet M, Correa A, Misra D, Jaakkola JJK. A review of the literature on the effects of ambient air pollution on fetal growth. Environ Res. 2004;95:106–15.
Weselak M, Arbuckle TE, Foster W. Pesticide exposures and developmental outcomes: the epidemiological evidence. J Toxicol Environ Heal Part B, Crit Rev. 2007;10:41–80.
Triche EW, Hossain N. Environmental factors implicated in the causation of adverse pregnancy outcome. Semin Perinatol. 2007;31:240–2.
Koranteng S, Osornio-Vargas AR, Buka I. Ambient air pollution and children’s health: a systematic review of Canadian epidemiological studies. Paediatr Child Health. 2007;12:225–33.
Windham G, Fenster L. Environmental contaminants and pregnancy outcomes. Fertil Steril. 2008;89:e111–6.
Dadvand P, Parker J, Bell ML, Bonzini M, Brauer M, Darrow LA, et al. Maternal exposure to particulate air pollution and term birth weight: a multi-country evaluation of effect and heterogeneity. Environ Health Perspect. 2013;121:367–73.
Stieb DM, Chen L, Eshoul M, Judek S. Ambient air pollution, birth weight and preterm birth: a systematic review and meta-analysis. Environ Res. 2012;117:100–11.
Meng G, Thompson ME, Hall GB. Pathways of neighbourhood-level socio-economic determinants of adverse birth outcomes. Int J Health Geogr. Int J Health Geogr. 2013;12:1–16.
Meng G, Hall GB, Thompson ME, Seliske P. Spatial and environmental impacts on adverse birth outcomes in Ontario. Can Geogr. 2013;57:154–72.
Inhaber H. Environmental quality: outline for a national index for Canada. Science. 1974;186:798–805.
Ott WR. Environmental indices: theory and practice. Ann Arbor, MI: Ann Arbor Science Publishers, Inc.; 1978.
Messer LC, Jagai JS, Rappazzo KM, Lobdell DT. Construction of an environmental quality index for public health research. Environ Heal. 2014;13:1–22.
Office of Environmental Health Hazard Assessment. Draft California communities environmental health screening tool (CalEnviroScreen). 2014. Available from: http://oehha.ca.gov/ej/pdf/CES20PublicReview04212014.pdf.
Messer LC, Vinikoor LC, Laraia BA, Kaufman JS, Eyster J, Holzman C, et al. Socioeconomic domains and associations with preterm birth. Soc Sci Med. 2008;67:1247–57.
Cromley EK, McLafferty SL. GIS and public health. 2nd ed. New York: Guilford Press; 2012.
Nuckols JR, Ward MH, Jarup L. Using geographic information systems for exposure assessment in environmental epidemiology studies. Environ Health Perspect. 2004;112:1007–15.
Osornio-Vargas AR, Zaiaine O, Wine O. Domino project: data mining and newborn outcomes exploring environmental variables, abstract number 2187. Int Soc Environ Epidemiol. Seattle, WA: National Institute of Environmental Health Sciences; 2015. pp. 3–607. Available from: http://ehp.niehs.nih.gov/isee/p3-607/.
Esri. ArcGIS Desktop, Release 10.5 [software]. Esri Inc; 2016. Available from: www.esri.com.
NASA EOSDIS Land Processes DAAC, Didan K. MOD13Q1: MODIS/Terra Vegetation Indices 16-Day L3 Global 250 m Grid SIN V006 [digital data]. MOD13Q1 Version 6. 2014. Available from: https://lpdaac.usgs.gov/dataset_discovery/modis/modis_products_table/mod13q1_v006.
Statistics Canada. Health region boundary files, ArcInfo, Alberta, 2007, Catalog no. 82-402-X [digital data]. 82-402-X. 2008. Available from: http://www.statcan.gc.ca/pub/82-402-x/82-402-x2009001-eng.htm.
Alberta Environment and Sustainable Resource Devlopment. Alberta Airsheds [digital data]. GeoDiscover Alberta. 2010. Available from: https://genesis.srd.alberta.ca/genesis_tokenauth/rest/services/Air-Layers/Latest/MapServer/generatekml.
Statistics Canada. Health regions: boundaries and correspondence with census geography 2007 (updates). 2009. Available from: http://www.statcan.gc.ca/pub/82-402-x/82-402-x2009001-eng.pdf.
Clean Air Strategic Alliance. Airshed Zones Guidelines. Edmonton, Alberta; 2004. Available from: http://www.casahome.org/.
Alberta Health Services. Alberta Perinatal Health Program, 2006–2012 [digital data]. Alberta Perinat Heal Progr 2014. Available from: http://aphp.dapasoft.com.
DMTI Spatial. CanMap content suite—platinum postal code and enhanced points of interest, 2001–2013 [digital data]. Markham, Ontario: DMTI Spatia Incl; 2014. Available from: http://www.dmtispatial.com/canmap/.
Serrano-Lomelin J, Nielsen C, Aziz K, Kumar M, Chandra S, Aelicks N, et al. Co-occurrence of maternal risk factors and neighborhood socio-economic status profiles associated with adverse birth outcomes in Alberta, Canada.
Davarashvili S, Zusman M, Keinan-Boker L, Rybnikova N, Kaufman Z, Silverman BG, et al. Application of the double kernel density approach to the analysis of cancer incidence in a major metropolitan area. Environ Res. 2016;150:269–81.
Kloog I, Haim A, Portnov BA. Using kernel density function as an urban analysis tool: investigating the association between nightlight exposure and the incidence of breast cancer in Haifa, Israel. Comput Environ Urban Syst. 2009;33:55–63.
Müller AH, Stadtmüller U, Tabnak F, Moller H, Stadtmuller U, Tabnak F. Spatial smoothing of geographically aggregated data, with application to the construction of incidence maps. J Am Stat Assoc. 2016;92:61–71.
Portnov BA, Dubnov J, Barchana M. Studying the association between air pollution and lung cancer incidence in a large metropolitan area using a kernel density function. Socioecon Plann Sci. 2009;43:141–50.
Zusman M, Broitman D, Portnov BA. Application of the double kernel density approach to the multivariate analysis of attributeless event point datasets. Lett Spat Resour Sci. 2015;3:1–20.
Zusman M, Dubnov J, Barchana M, Portnov BA. Residential proximity to petroleum storage tanks and associated cancer risks: Double Kernel Density approach vs. zonal estimates. Sci Total Environ. 2012;441:265–76.
Silverman BW. Density estimation for statistics and data analysis. New York: Chapman and Hall; 1997.
Python Software Foundation. Python Language Reference, Version 2.7. 2016. Available from: www.python.org.
PyData Development Team. pandas 0.16 [software]. 2015. Available from: http://pandas.pydata.org/pandas-docs/version/0.16.0/.
Mitchell A. The Esri guide to GIS analysis, volume 3: modeling suitability, movement, and interaction. Redlands: Esri Press; 2012.
GoodGuide. Chemical profiles. Scorec Pollut Inf Site. 2011. Available from: http://scorecard.goodguide.com/chemical-profiles/.
Air Toxics Assessment Group. Health effects notebook for hazardous air pollutants. Environ Prot Agency. 2017. Available from: https://www.epa.gov/haps/health-effects-notebook-hazardous-air-pollutants.
Almberg KS, Turyk M, Jones RM, Anderson R, Graber J, Banda E, et al. A study of adverse birth outcomes and agricultural land use practices in Missouri. Environ Res. 2014;134:420–6.
Ahern M, Mullett M, MacKay K, Hamilton C. Residence in coal-mining areas and low-birth-weight outcomes. Matern Child Health J. 2011;15:974–9.
Curtis A, Leitner M. Geographic information systems and public health: eliminating perinatal disparity. Hershey: IRM Press; 2006.
Mckenzie LM, Guo R, Witter RZ, Savitz DA, Newman LS, Adgate JL. Birth outcomes and maternal residential proximity to natural gas development in rural Colorado. Environ Health Perspect. 2014;122(4):412–17.
Casey JA, Savitz DA, Rasmussen SG, Ogburn EL, Pollak J, Mercer DG, et al. Unconventional natural gas development and birth outcomes in Pennsylvania, USA. Epidemiology. 2016;27:163–72.
Miranda ML, Edwards SE, Chang HH, Auten RL. Proximity to roadways and pregnancy outcomes. J Expo Sci Environ Epidemiol. 2012;23:32–8.
Wilhelm M, Ritz B. Residential proximity to traffic and adverse birth outcomes in Los Angeles County, California, 1994–1996. Environ Health Perspect. 2003;111:207–16.
Donovan GH, Michael YL, Butry DT, Sullivan AD, Chase JM. Urban trees and the risk of poor birth outcomes. Health Place. 2011;17:390–3.
Hystad P, Davies HW, Frank L, Van Loon J, Gehring U, Tamburic L, et al. Residential greenness and birth outcomes: evaluating the influence of spatially correlated built-environment factors. Environ Health Perspect. 2014;122:1095–102.
Stieb DM, Burnett RT, Smith-Doiron MH, Brion O, Shin HH, Economou V. A new multipollutant, no-threshold air quality health index based on short-term associations observed in daily time-series analyses. J Air Waste Manage Assoc. 2008;58:435–50.
Rappazzo KM, Messer LC, Jagai JS, Gray CL, Grabich SC, Lobdell DT. The associations between environmental quality and preterm birth in the United States, 2000–2005: a cross-sectional analysis. Environ Health. 2015;14:50.
Tarocco S, Amoruso I, Caravello G. Holistic model-based monitoring of the human health status in an urban environment system: pilot study in Verona city, Italy. J Prev Med Hyg. 2011;52:73–82.
Chen L, Bell EM, Caton AR, Druschel CM, Lin S. Residential mobility during pregnancy and the potential for ambient air pollution exposure misclassification. Environ Res. 2010;110:162–8.
Amrhein CG. Searching for the elusive aggregation effect: evidence from statistical simulations. Environ Plan A. 1995;27:105–19.
Statistics Canada. Census and intercensal road network. Statistics Canada Catalogue no. 92-500-X [digital data]. 92-500-X. 2012. Available from: http://www12.statcan.gc.ca/census-recensement/2011/geo/RNF-FRR/index-eng.cfm.
AltaLIS. 20 K Alberta Base Features [digital data]. AltaLIS, Agent Alberta Data Partnerships Ltd. 2012. Available from: http://www.altalis.com/products/base/20k_base_features.html.
Alberta Biodiversity Monitoring Institute. ABMI Human Footprint Inventory for 2012 conditions Version 3 [digital data]. Alberta Biodivers Monit Inst. 2012. Available from: http://www.abmi.ca/home/data/gis-data/human-footprint-download.html?scroll=true.
National Oceanic and Atmospheric Administration. DMSP-OLS nighttime lights time series, 2005–2012, Version 4 [digital data]. Def Meteorol Satel Progr Oper Linescan Syst (OLS), Natl Ocean Atmos Adm Natl Centers Environ Inf 2012. Available from: https://ngdc.noaa.gov/eog/dmsp/downloadV4composites.html.
Natural Resources Canada. Free Data—GeoGratis: Atlas of Canada National Frameworks Data, Aboriginal Lands, and CanVec [digital data]. Nat Resour Canada (NRCan), Gov. Canada. 2013. Available from: https://www.nrcan.gc.ca/earth-sciences/geography/topographic-information/free-data-geogratis/download-directory-documentation/17215.
Chan E, Serrano J, Chen L, Stieb DM, Jerrett M, Osornio-Vargas A. Development of a Canadian socioeconomic status index for the study of health outcomes related to environmental pollution. BMC Public Health. 2015;15:714.
Alberta Agriculture and Forestry. Historical wildfire information—spatial wildfire data, 2005–2012 [digital data]. Alberta Agric For (AgFor), Gov Alberta, Agric For Prov For Fire Cent. 2016. Available from: http://wildfire.alberta.ca/wildfire-maps/historical-wildfire-information/default.aspx.
CN conceived the study, developed the methodology, designed and carried out the analyses, and drafted the manuscript. CA provided expertise in developing the methodology, and reviewed and edited the manuscript. AOV oversaw the data collection, provided expertise in developing the methodology, and reviewed and edited the manuscript. All authors read and approved the final manuscript.
This research is part of the Data Mining and Neonatal Outcomes (DoMiNO) interdisciplinary collaborative project using spatial data mining to explore the colocation of adverse birth outcomes and environmental variables; team members: Aelicks N, Arbour L, Aziz K, Blagden P, Buka I, Chan E, Chandra S, Demers P, Erickson A, Jabbar S, Hystad P, Kumar M, Li J, Nicol A, Nielsen C, Phipps E, Serrano-Lomelin J, Setton E, Shah P, Stieb D, Villeneuve P, Wine O, Yuan Y, Zaiane O, and Osornio-Vargas A. The authors wish to thank summer students, Rusk B and Silverman E, for their assistance in acquiring some of the environmental data, as well as anonymous reviewers and Griffith D for their suggestions to improve the manuscript.
The authors declare they have no competing interests.
The environmental data that support the findings of this study are available from public data sources listed in Table 1. The health data are available by application to the Alberta Perinatal Health Program (APHP).
We obtained ethics approval from the Research Ethics Board at the University of Alberta and the Alberta Perinatal Health Program (APHP).
Funding is courtesy of the Canadian Institutes of Health Research (CIHR) and The Natural Sciences and Engineering Research Council of Canada (NSERC) Collaborative Health Research Program (Application Number 290275).
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.