Skip to main content

Remote and field level quantification of vegetation covariates for malaria mapping in three rice agro-village complexes in Central Kenya



We examined algorithms for malaria mapping using the impact of reflectance calibration uncertainties on the accuracies of three vegetation indices (VI)'s derived from QuickBird data in three rice agro-village complexes Mwea, Kenya. We also generated inferential statistics from field sampled vegetation covariates for identifying riceland Anopheles arabiensis during the crop season. All aquatic habitats in the study sites were stratified based on levels of rice stages; flooded, land preparation, post-transplanting, tillering, flowering/maturation and post-harvest/fallow. A set of uncertainty propagation equations were designed to model the propagation of calibration uncertainties using the red channel (band 3: 0.63 to 0.69 μm) and the near infra-red (NIR) channel (band 4: 0.76 to 0.90 μm) to generate the Normalized Difference Vegetation Index (NDVI) and the Soil Adjusted Vegetation Index (SAVI). The Atmospheric Resistant Vegetation Index (ARVI) was also evaluated incorporating the QuickBird blue band (Band 1: 0.45 to 0.52 μm) to normalize atmospheric effects. In order to determine local clustering of riceland habitats Gi*(d) statistics were generated from the ground-based and remotely-sensed ecological databases. Additionally, all riceland habitats were visually examined using the spectral reflectance of vegetation land cover for identification of highly productive riceland Anopheles oviposition sites.


The resultant VI uncertainties did not vary from surface reflectance or atmospheric conditions. Logistic regression analyses of all field sampled covariates revealed emergent vegetation was negatively associated with mosquito larvae at the three study sites. In addition, floating vegetation (-ve) was significantly associated with immature mosquitoes in Rurumi and Kiuria (-ve); while, turbidity was also important in Kiuria. All spatial models exhibit positive autocorrelation; similar numbers of log-counts tend to cluster in geographic space. The spectral reflectance from riceland habitats, examined using the remote and field stratification, revealed post-transplanting and tillering rice stages were most frequently associated with high larval abundance and distribution.


NDVI, SAVI and ARVI generated from QuickBird data and field sampled vegetation covariates modeled cannot identify highly productive riceland An. arabiensis aquatic habitats. However, combining spectral reflectance of riceland habitats from QuickBird and field sampled data can develop and implement an Integrated Vector Management (IVM) program based on larval productivity.


Prediction of vegetation index (VI) associated with vector larval habitats in malaria endemic areas can be remarkably accurate [13]. A VI is a dimensionless, radiation-based measurement computed from spectral combinations of remotely sensed data [4]. It is used to infer vegetation properties by isolating the attributes of vegetation from other materials (e.g., soil or water). The appeal of a VI is its simplicity and its relationship either empirically or theoretically to biophysical variables [5]. VI's have been proven to be well correlated with various vegetation parameters such as green biomass [6], chlorophyll concentration [7], leaf area index (LAI) [8], foliar loss and damage [9], photosynthetic activity [10], and carbon fluxes [11]. Also, they have been found to be useful for different image analyses like crop classification [12], and crop phenology [13].

A widely used VI for identifying mosquito aquatic habitats in malaria epidemiology is the Normalized Difference Vegetation Index (NDVI) [14]. Current emphasis in satellite data for identification of malaria mosquito habitats and VI's involves operational 'external' noise removal through improved calibration using atmospheric correction and soil adjustment factors [15]. In order to evaluate the efficiency of proxy variables for determining land use land cover (LULC) for making inferences of riceland mosquito larval abundance, we evaluated NDVI and two NDVI variants, the Soil Adjusted Vegetation Index (SAVI) and the Atmospheric Resistant Vegetation Index (ARVI) using QuickBird 0.61 m spatial resolution data within the village complexes of Kangichiri, Kiuria and Rurumi in the Mwea Rice Scheme in Kenya. The basis of the SAVI in minimizing soil noise inherent in the NDVI has been corroborated in past research [16]. Sensitivity studies by Myneni and Asrar [17] found that the ARVI reduces atmospheric effects and mimics ground-based NDVI data.

Modeling mosquito larvae and aquatic habitat characteristics requires accounting for correlational effects arising from varying geographical data. Using regression models, two annual peaks in the numbers of An. arabiensis corresponding with irrigation of rice paddies were observed [18]. Land cover sites identified from high resolution optical data provided a basis for spatial prediction of Anopheles larval habitats and the risk of malaria transmission in humans [19]. Moreover, spatial autocorrelation can be used to identify anomalies (hotspots) based on clusters of high and low larval density riceland aquatic habitats. [20].

Vegetation land cover data from remotely and field sampled information may develop and implement an Integrated Vector Management (IVM) program based on larval productivity in a riceland agro-ecosystem. Therefore, the objectives of this study were to: (a) establish the sensitivities and dynamic ranges of NDVI, ARVI and SAVI, (b) examine spectral reflectance of vegetated land cover surrounding riceland habitats using QuickBird data, and (c) determine the environmental variables associated with distribution and abundance of An. arabiensis larvae in Mwea rice fields, Kenya.


Study area

The sampling strategy used for the collection of larval site data was developed for an earlier research project and has been described in detail elsewhere [19]. Briefly, base maps including major roads and hydrography for the three study sites were generated using differentially corrected global positioning systems (DGPS) data in ArcInfo 9.1® (Earth Systems Research Institute Redlands, CA, USA) (Figure 1). Each riceland An. arabiensis larval habitat with its associated land cover attributes from the three study sites were entered into a Vector Control Management System® (VCMS) (Advanced Computer Resources Corp (ACR), 100 Perimeter Road Nashua, NH, USA) database). A digitized custom grid, tracing each riceland habitat was generated in Arc Info 9.1® [21]. A unique identifier was placed in each grid cell (i.e. polygon). The digitized grid extended out to a 1 km distance from the external boundary of the three study sites providing a 1 km radial area. Grid cells were stratified based on LULC transition throughout the crop season and defined as: 1) flooding; 2) land preparation; 3) post-transplanting; 4) tillering; 5) flowering; and 6) maturation and 7) fallow/post-harvest. Probability proportional to size sampling, based on the proportion of grid cells within each stratum was used to randomly select 50 grid cells in each study site for entomological sampling.

Figure 1
figure 1

Map of the three study sites: Kangichiri, Kiura and Rurumi in the Central Kenyan Rice Scheme.

QuickBird images, encompassing visible and near-infra-red (NIR) data was acquired for each study site July 2005. The QuickBird data were classified using the Iterative Self-Organizing Data Analysis Technique (ISODATA) unsupervised routine in ERDAS Imagine V8.7® (Atlanta, GA, USA). This approach to classification has been used widely in the identification of land covers and mosquito habitats associated with intermediate hosts and disease vector [2426]. The geographic projection used for all of the spatial datasets is the Universal Transverse Mercator (UTM) Zone 37S datum WGS-84 projection.

Measurement of proxy variables

In order to identify LULC change by rice growth stage for identifying An. arabiensis aquatic habitats abundance and distribution in the three study sites using proxy variables, a false-color composite was pre-classified based on NDVI, SAVI and ARVI from the QuickBird data. The Image Analysis extension of ArcView 3.3® was used to perform the VI calculations of the ERDAS Imagine V8.7®.

The NDVI is generated by converting raw data into an entirely new image using algorithms to calculate the color value of each pixel [27]. This type of product is especially useful in multi-spectral riceland remote sensing since transformations can be created that highlight relationships and differences in spectral intensity across multiple bands of the electromagnetic spectrum. NDVI is calculated as a ratio between measured reflectivity in the red and NIR portions. These two spectral bands are chosen because they are most affected by the absorption of chlorophyll in leafy green vegetation and by the density of green vegetation on the surface [28]. The pigment in rice plant leaves, chlorophyll, strongly absorbs visible light (from 0.40 to 0.70 μm) for use in photosynthesis while the cell structure of the rice leaves, strongly reflects NIR light (from 0.70 to 1.10 μm) [29]. Also, in red and NIR bands, the contrast between vegetation and soil is at a maximum.

NDVI was calculated using radiance, surface reflectance (r), or apparent reflectance (measured at the top of the atmosphere) values in the QuickBird red (R), (0.63 to 0.69 μm) and NIR (0.76 to 0.90 μm) spectral bands. The difference in reflectance was divided by the sum of the two reflectance. As a simple transformation of two spectral bands, NDVI are computed directly without any bias or assumptions regarding plant physiognomy, land cover class, soil type, or climatic conditions [30]. NDVI was calculated as:

To account for changing soil brightness, SAVI was also calculated utilizing an adjustment factor L that effectively shifts the origin of vegetation isolines in NIR/VIS reflectance space. Because the NDVI does not account for variations in soil brightness [31], a darkening of the soil following a rainfall or periodic flooding will cause a change in NDVI that will be interpreted as a change in vegetation [32]. SAVI was calculated using radiance, surface reflectance (r), using reflectance values in the QuickBird red (R), and NIR spectral bands. The L factor is determined by the relative percentage of vegetation and is dependent on whether the soil is light or dark; it is used as an exponent assigned to the red band value in the denominator and as a multiplier (L+1) of the first term [33]. The SAVI was calculated as:

In this analyses, SAVI was calculated where L was a rice height adjustment factor that accounted for differential red and NIR extinction though the crop season. L = 0.5 was used for all riceland habitats.

An atmospherically resistant vegetation index (ARVI) was developed for remote sensing of vegetation from the QuickBird data. The index took advantage of the presence of the blue channel (0.45 to 0.52 μm) in the QuickBird sensor, in addition to the red and the NIR channels that composed the riceland NDVI. The resistance of the ARVI to atmospheric effects (in comparison to the NDVI) is accomplished by a self-correction process for the atmospheric effect on the red channel, using the difference in the radiance between the blue and the red channels to correct the radiance in the red channel [34]. Aerosols, absorbing gases such as water vapor, and undetected clouds affect upwelling radiances measured by satellite instruments [35]. ARVI was calculated using radiance, surface reflectance (r), using reflectance values in the QuickBird blue channel (0.05 to 0.06 μm), red (R), and NIR spectral bands. The ARVI was defined as:

in which the subscript RB denotes the red (R) and blue bands (B) and γ is the gamma value which was defined as:

A single value of γ = 1.0 was used to substantially reduce the sensitivity of atmospheric effects.

Spatial modeler tools from ERDAS Imagine 9.1® was used to perform the VI calculations. The VI calculations resulted in a grid storing floating-point values. The validation was performed by identifying and recording X, Y coordinates from the Imagine format data images, recording the VI values at specific locations, and then pointing to the corresponding locations in the Arc/Info GRID format file and comparing values. This process was useful for calculation validation as well as for verifying floating-point values. Values for NDVI were successfully aggregated and overlaid onto georeferenced field-based data for the riceland study sites. The VI's were used to select all paddy and canal habitats with heavy, moderate and low vegetated values. A database was generated for each study site with the mean, minimum, maximum, and standard deviations for VI data aggregated to the riceland habitat level. VI's have the benefit that pixels are not forced into inappropriate land cover classes, but instead provide a proportional measure of vegetation [5]. The VI datasets for the riceland study sites were then merged with the entomological datasets.

Leaf Area Index (LAI)

A sensitivity analysis was conducted on the NDVI and the NDVI variants by analyzing the atmospheric and soil-perturbed responses as a continuous function of rice plant LAI. Plant samples were taken from each randomly selected grid cell and spectral measurements were assessed to determine rice plant LAI during the growing season. Estimations of LAI production were conducted by correlation analysis with spectral reflectance ratio and measured values. We selected the best fitting waveband ratio among calculated reflectance and NDVI, SAVI and ARVI. Percent relative error and vegetation equivalent 'noise' (VEN) were calculated for soil and atmospheric influences, separately and combined using LAI. Soil and atmospheric error were of similar magnitudes, but varied with VI's. Both new variants outperformed the NDVI. The atmospherically resistant version minimized atmospheric noise, but enhanced soil noise, while the soil adjusted variant minimized soil noise, but remained sensitive to the atmosphere. The SAVI and ARVI had a relative error of 10 percent and VEN of +/- 0.24 LAI and +/- 0.23. The NDVI had a relative error of 20 percent and VEN of +/- 0.86 LAI.

All paddy and canal habitats were visually examined using the QuickBird visible and NIR data. The satellite datasets were used to calculate the measure of spatial complexity and configuration of each paddy and canal habitats. Variations in vegetation land cover along the riceland habitats at each study site were disaggregated into smaller subsets using the digitized grid-based algorithm. Paddy and canal vegetation were classified as present, if vegetation land cover pixels were identified along the riceland habitat, and absent, if there were no vegetation land cover pixels. The remote stratification based on spectral analyses of vegetation land cover was used to 'screen' all habitats in the study sites to provide an initial indication of the location of highly productive riceland habitats.

Data analyses

Field data parameters were entered in Microsoft Excel files and analyzed using and SAS 9.1.3® (SAS inc. Carey, NC, USA). Before analyses the data was tested for collinearity using design matrix from a logistic regression model and run through SAS PROCREG/VIF (Variant Inflation Factor) procedure which indicated the absence of problematic correlation among the predictors. ANOVA test was used to compare the differences in mosquito larval abundance among the rice stages. Differences in mosquito larval abundance between vegetated and non-vegetated canals were compared using Student's t-test. Forward multiple logistic regression analysis was used to obtain the best predictor variables explaining the abundance of mosquito immatures. The relative abundance of mosquitoes was expressed as the number of mosquito larvae per 20 dips because the number of larvae sampled was low. Statistical analyses was done using log-transformed (log 10 n+1) larval counts to normalize the data. Results were considered significant at P < 0.05.

Spatial autocorrelation was used to identify the locations of clusters of sites with high and low An. arabiensis aquatic habitat density. The G i (d) statistic was developed for tests of hypotheses about the spatial concentration of the sum of x values associated with the j points within d of the ith point, which was defined by:

Where xi is the observed value at location i, x ¯ i = 1 / ( n 1 ) j , j i n x j , { w ij } MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG4baEgaqeamaaBaaaleaacqqGPbqAaeqaaOGaeyypa0JaeGymaeJaei4la8IaeiikaGIaeeOBa4MaeyOeI0IaeGymaeJaeiykaKYaaabCaeaacqWG4baEdaWgaaWcbaGaemOAaOgabeaakiabcYcaSiabcUha7jabbEha3naaBaaaleaacqqGPbqAcqqGQbGAaeqaaOGaeiyFa0haleaacqWGQbGAcqGGSaalcqWGQbGAcqGHGjsUcqWGPbqAaeaacqqGUbGBa0GaeyyeIuoaaaa@4D46@ is a symmetric binary special weight matrix, w i = j , j i n w i j ( d ) MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG3bWDdaWgaaWcbaGaemyAaKgabeaakiabg2da9maaqahabaGaem4DaC3aaSbaaSqaaiabdMgaPjabdQgaQbqabaGccqGGOaakcqWGKbazcqGGPaqkaSqaaiabdQgaQjabcYcaSiabdQgaQjabgcMi5kabdMgaPbqaaiabb6gaUbqdcqGHris5aaaa@4283@ , and S i 2 = 1 / ( n 1 ) j , j i n ( x j x ¯ i ) 2 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGtbWucqWGPbqAdaahaaWcbeqaaiabikdaYaaakiabg2da9iabigdaXiabc+caViabcIcaOiabb6gaUjabgkHiTiabigdaXiabcMcaPmaaqahabaGaeiikaGIaemiEaG3aaSbaaSqaaiabdQgaQbqabaGccqGHsislcuWG4baEgaqeamaaBaaaleaacqWGPbqAaeqaaOGaeiykaKYaaWbaaSqabeaacqaIYaGmaaaabaGaemOAaOMaeiilaWIaemOAaOMaeyiyIKRaemyAaKgabaGaeeOBa4ganiabggHiLdaaaa@4C6F@ . It was shown [36] that E(gi) = 0, Var(gi) = 1, and the permutations distribution of G i (d) under null hypothesis of no spatial association among xi approaches normality. In this research a significant and positive indicates that the location i is surrounded by relatively high larval density riceland habitats whereas a significant and negative G i (d) indicates that the location i is surrounded by relatively low larval density habitats.

All spatial statistics were calculated in R program. R is an integrated suite of software facilities for spatial data manipulation, calculation and graphical display. The spdep (spatial dependence) package in R was used to generate spatial weights matrix from habitat point patterns by distance between habitats and tessellations, for summarizing riceland An. arabiensis aquatic habitats in the three study sites and for permitting their use in a collection of tests for Getis/Ord G.


The abundance of 1st instars larvae/dip collected in Rurumi and Kangichiri study sites respectively was significantly lower than in the Kiuria study site (F = 5.16, df 2, 179, P < 0.01). The abundance of 2nd instars riceland larvae differed significantly among study sites with the Rurumi study site being significantly lower than in the Kangichiri and Kiuria study sites, respectively (F = 3.79, df 2, 179, P < 0.05). The abundance of 3rd and 4th instars larvae and pupae did not differ significantly among study sites (F = 1.64, 0.97 and 1.04, df 2, 179,P > 0.05). Table 1 shows the abundance of riceland Anopheles arabiensis larvae/20 dips collected in the paddy and canal habitats at the 3 study sites. In the Kangichiri study site, the difference in the abundance of pupae and 1st, 2nd and 3rd instars larvae collected in paddy and canal habitats was not significant (P > 0.05), while that of 4th instars larvae was significantly higher in the paddy habitats than in the canal habitats. (t = 5.19, df 179, P < 0.05). In the Kiuria study site, significantly higher abundance of 3rd instars larvae were collected in the canal habitats (t = 4.68, df 179, P < 0.05) while the other immature stages did not differ significantly between canal and paddy habitats. In the Rurumi study site, paddy habitats had significantly higher abundance of 1st and 2nd instars larvae compared with the canal habitats (t = 5.60 and 3.94, df 179, P < 0.05) but the other immature stages did not vary significantly between paddy and canal habitats.

Table 1 The mean number of Anopheles arabiensis larvae collected (mean ± SE) per 20 dips in paddies and canals identified using field sampled and QuickBird 0.61 m visible and near infra-red (NIR) data

The relative abundance of immature stages of An. arabiensis at the three study sites were significantly higher during post-transplanting and tillering stages of the rice growth than in the other rice stages (Table 2, F = 5.21, df 179, P < 0.05). In the canal habitats, the presence of vegetation had significant impact on relative abundance of An. arabiensis larvae (Tables 3 and 4). At the Kangichiri study site, the relative abundance of 1st and 2nd instars larvae was significantly higher in non-vegetated than vegetated canals while the differences in the other aquatic stages was not significant. At the Kiuria study site, the abundance of all the 4 larval instars of An. arabiensis was significantly higher in non-vegetated canals whereas at the Rurumi study site, the same trend was observed for the 2nd and 3rd instars larvae.

Table 2 The mean number of Anopheles arabiensis larvae collected (mean ± SE) per 20 dips in paddies containing different stages of rice growth using field sampled and QuickBird 0.61 m visible and near infra-red (NIR) data
Table 3 The mean number of Anopheles arabiensis larvae collected (mean ± SE) per 20 dips in vegetated and non-vegetated canals identified using field and QuickBird 0.61 m visible and near infra-red (NIR) data
Table 4 Statistical values comparing the differences in the mean number of Anopheles arabiensis larvae collected (mean ± SE) per 20 dips between the vegetated and non-vegetated canals using field and QuickBird visible and near infra red (NIR) data

Of the 13 predictors that were entered into the model, three were found to be significant predictors of larval abundance. Emergent vegetation was negatively associated with mosquito larvae at the three study sites. In addition floating vegetation (-ve) was significantly associated with immature mosquitoes in Rurumi and Kiuria, while turbidity was also important in Kiuria (Table 5).

Table 5 Logistic regression results on the significance level of vegetation covariates in Kiuria, Kangichiri, and Rurumi study sites for Anopheles mosquitoes

Values for NDVI, SAVI, and ARVI calculated from the QuickBird satellite information were successfully overlaid onto the georeferenced field-based data of the three study sites. The VI's were used to select all paddy and canal habitats with low, intermediate and heavy vegetated values. A database was generated for each study site with the mean, minimum, maximum, and standard deviations for NDVI, SAVI, and ARVI aggregated to the riceland level. The VI datasets for the three study sites were then merged with the entomological datasets. The NDVI was not sensitive to the presence of vegetation and were not affected differently by ecological changes at the three study sites. The change in the soil background caused by the transition in LULC throughout the crop season did not alter the red and NIR rice plant reflectance and calculated SAVI. Visually the data suggest that there was no higher soil influences in the SAVI as compared with the NDVI for all rice stages. It was of interest to determine how the blue band inclusion into the VI would identify the riceland LULC's for making inferences of anopheline abundance. The resistance of the ARVI to atmospheric effects, in comparison to NDVI was accomplished by a self-correction process for the atmospheric effect on the QuickBird red channel using the difference between the imager's blue and red channels to correct the radiance in the red channel. However, the results suggest that the ARVI was not able to normalize atmospheric conditions in the study sites. The percent atmospheric and noise in the ARVI was at the rice height of 0–1 for all LULC's. Overall the NDVI was not associated to rice height much higher than the SAVI for identifying LULC's in all three study sites (Figure 2). NDVI and SAVI exhibit decreasing percent error due to increasing rice height. At rice height beyond 50 cm all the NDVI and SAVI are the same.

Figure 2
figure 2

NDVI, SAVI and ARVI variables plotted against rice height. *A represent land preparation stage and B represent post harvest stage

The QuickBird images of riceland An. arabiensis aquatic habitats and the riceland stratifications are represented (Figures 3). QuickBird visible and NIR bands were able to spatially distinguish levels of all canal habitats and their surrounding vegetation in the study site (Figure 3a). In the satellite image canal vegetation generates shades of green in the rice fields. Spectral regions of paddies were separated by color differences (e.g. flooded, dark blue tone, Figure 3b).

Figure 3
figure 3

QuickBird images of riceland and An. arabiensis aquatic habitats in Kangichiri village in the Mwea Rice Scheme. 3a. Canal habitats and the surrounding vegetation. 3b. Flooded paddies

To identify clusters of riceland habitats with high abundance of An. arabiensis we applied the G i (d) statistic and found a significant cluster in Kangichiri (Z score > 3.70, P < 0.05). At the northern extreme of the 1 km buffer, clustering was highest at a distance of 400 m. When the analysis was conducted in Rurumi, two clusters were noted (Z score > 3.70, P < 0.05) but only up to a maximum distance of 400 m, one for a northern cluster and 150 m for a southern cluster. In Kiuria clustering was much more localized, peaking at 100 m and remaining significant only up to 150 m (Figure 4). Gi*(d) statistics classified the riceland habitats locations by type of association (Gi*(d) cluster maps). When the autocorrelated data was plotted against distance the magnitude of covariation of adjacent riceland habitat effects tended to decrease.

Figure 4
figure 4

Clustering of riceland habitats with high abundance of An. arabiensis in Kiuria study site in the Mwea Rice Scheme, Kenya.


In the current study, significantly higher An. arabiensis larval counts were observed in non-vegetated canals and post transplanting stage of rice development. This is in agreement with previous findings that An. arabiensis prefers open sunlit habitats devoid of vegetation [37, 38]. The occurrence of higher An. arabiensis larval counts during the post-transplanting stage of rice cycle has been attributed to the presence of numerous open sun lit pools created by rice workers during rice seedlings transplanting [39]. Later in the rice growing cycle, rice plants increase in height and tiller numbers and floating vegetation becomes established making the rice fields unsuitable for this species. Emergent and floating vegetation reduce the amount of sunlight reaching the water surface resulting in lower temperatures and consequently a decrease in microbial growth upon which mosquito larvae depend on [40]. Emergent and floating vegetation also may obstruct this species from ovipositing [41]. The negative association between emergent and floating vegetation confirms these findings. Previous studies have reported conflicting results on the effect of turbidity on abundance of An. arabiensis. Muturi and colleagues [42] and Ye-Ebiyo and colleagues [43] reported a positive association between turbidity and An. arabiensis. Bates [44], Robert and colleagues [45], and Shililu and colleagues [46] found An. arabiensis breeding in rather clear water. Water, which is turbid from particles not edible for riceland Anopheles sp. larvae, could disfavor the production of larvae, while water turbid from food particles represents a very suitable habitat.

In these analyses NDVI, SAVI, ARVI equations could not identify LULC change for making inferences of riceland An. arabiensis larval abundance and distribution. Many studies have found NDVI's to be unstable varying with soil, sun-view geometry, atmospheric conditions and the presence of dead material as well as changes within soil moisture [4749]. Factors that reduce reflectivity of soils in the visible region include soil moisture or self-shadow [50]. NDVI equation has a simple open loop structure (no feedback) which renders it susceptible to large sources of error [51]. The SAVI may also exhibit asymptotic (saturated) signals over riceland areas decreasing atmospheric visibility with changing LULC during the crop season. Baret and Guyot [52] express inconsistencies in SAVI especially in soils in which the slope is exactly unity and the intercept is zero. Bausch [53] tested a step-wise variable L function in the SAVI but found no significant reduction in noise reduction.

In the ARVI analyses, the aerosol and water vapor variation during the LULC shifts in the rice season may have considerably reduced the self-correcting coefficient of the red and blue channels in the QuickBird data and this susceptibility to atmospheric noise may have caused error in the derivation of atmospheric-corrected reflectance. The resistance of the ARVI to the atmospheric variations depends on the accuracy of the determination of the atmosphere self-correction coefficient [54]. The atmospherically resistant versions may have minimized atmospheric noise in the study sites but enhanced soil noise, while the soil adjusted variants minimized soil noise but remained sensitive to the atmosphere. Simulations using radioactive transfer computations on arithmetic and natural surface spectra for various atmospheric conditions show that ARVI has a similar dynamic range to the NDVI but is on the average four times less sensitive to atmospheric effects than the NDVI [54].

Aquatic habitats positive for riceland Anopheles larvae in the study sites contained positive autocorrelation in the ecological datasets which may be due to multiple factors that cause habitats to spatially cluster and partially govern riceland mosquito population dynamics. Jacob and colleagues [55] report detecting quite low levels of positive spatial autocorrelation of Anopheles mosquitoes in East African urban areas of Kisumu and Malindi, Kenya. Positive autocorrelation is often driven by causes that may be exogenous (e.g., auto correlated environment, disturbance) and/or endogenous (e.g., conspecific attraction, dispersal limitation, demography) [56]. The use of impregnated bed nets, and adult, larvae and breeding site mosquito control programs tend to have socio-economic/demographic dimensions with spatial expressions [57]. All of these factors can impact contagion diffusion, inducing positive spatial autocorrelation in riceland An. arabiensis aquatic habitats.

An unsupervised algorithm using QuickBird visible and NIR data in ArcInfo 9.1® did not provide informative VI data for riceland Anopheles aquatic habitat suitability in the three study sites. However, one of the most important considerations of satellite data is the increased error in geo-referencing on a pixel-by-pixel basis. ArcInfo 9.1® operations involving adding and rationing map values which requires application of the operation to each pixel; in turn, the problem of error propagation such as location errors through the use of these operations may be relevant to ArcInfo 9.1®. The presence of location error interacting with the spatial structure in the source maps, the presence of spatial correlation in the errors of the attribute measurement process, or indeed their simultaneous presence are capable of generating spatially complex maps of propagated error [58].

For temporal and dynamic VI analyses sources of error include sub-pixel clouds and sun-target sensor angular (bidirectional) considerations [59]. Thin clouds, such as the ubiquitous cirrus, or small clouds with typical linear dimensions smaller than the diameter of the area actually sampled by optical sensors, can significantly contaminate NDVI measurements [60]. Variations in viewing and solar geometry effect NDVI and NDVI variant data acquisition [61]. The radiometric corrections applied to the QuickBird data product included relative radiometric response between detectors, non-responsive detector fill, and a conversion for absolute radiometry. The QuickBird sensor corrections accounted for internal detector geometry, optical distortion, scan distortion, any line-rate variations, and registration of the multispectral bands. However, seasonal variation in vegetation and water level can alter land/water and vegetation interface depiction, which can lead to misregistration of land cover at those sites. For example, the homogeneity of the land cover can affect a particular pixel if an area of high reflectivity, such as flooded fields, is next to an area of low reflectivity, such as fallow fields, creating an average value that may be confused with another LULC change during the rice season [19]. Because of the disturbances and the problems outlined, perfect linearity was not obtained by any of the VI's examined.

In conclusion, NDVI, SAVI and ARVI were not able to determine ecological conditions in a riceland area or assess information relevant to planning malaria control. NDVI is highly susceptible to error over varying atmospheric and canopy background conditions in a riceland agro-village complex. For employing NDVI variants in a in a riceland ecosystem it may be necessary to further explore the L factor in the SAVI equation and the gamma term in the ARVI equation to find ways to optimize normalization of soil and atmospheric influences. The cluster analyses in all models showed a weak tendency for positive autocorrelation. However, a remote stratification using field sampled ecological covariates and QuickBird visible and NIR data distinguished highly productive riceland An.arabiensis aquatic habitats and LULC data solely in terms of detected spectral reflectance. QuickBird data can display spatial data associated with mapped features for identification and characterization of larval anopheline mosquito habitats [62]. Mapping the spatial pattern of seasonal vegetation land cover and habitat productivity using QuickBird visible and NIR and field sampled data of a rice village-complex can dictate where and when microbial larvicides are applied. Treatments or habitat perturbations should be based on surveillance of larvae in the most productive areas of the agro-ecosystem and adjacent village [63]. Additional remote and field coverage for malaria epidemiological investigations in Kangichiri, Kiuria and Rurumi study sites should include various riceland operational time frames such as nursery preparation, channel repairing, weeding, and field drainage.


  1. Rejmankova E, Roberts DR, Pawley A, Manguin S, Polanco J: Predictions of adult Anopheles albimanus densities in villages based on distances to remotely sensed larval habitats. Am J Trop Med Hyg. 1995, 53: 482-488.

    PubMed  CAS  Google Scholar 

  2. Roberts D, Paris J, Manguin S, Harbach R, Woodruff R, Rejmankova E, Polanco J, Wullschleger B, Legters L: Predictions of malaria vector distribution in Belize based on multispectral satellite data. Am J Trop Med Hyg. 1996, 54: 304-308.

    PubMed  CAS  Google Scholar 

  3. Hay SI, Omunbo JA, Craig MH, Snow RW: Earth observation, geographic information systems and Plasmodium falciparum malaria in sub-Saharan Africa. Advances in Parasitology, Remote sensing and Geographical Information Systems in Epidemiology. Edited by: Hay SI, Randolph SE, Rogers GJ. 2000, London: Academic Press, 173-215.

    Chapter  Google Scholar 

  4. Huete AR: Soil Radioactive Influences in Satellite Monitoring of Vegetation. presented at the Sixth Annual Aspen Global Change Institute Summer Science Session in Global Vegetation Patterns and Their Relationship to Human Activity: Aspen, CO. 101-118. 23–28 February; 1995

  5. Bannari A, Morin D, Bonn F, Huete A: A review of vegetation indices. Remote Sens Rev. 1995, 13: 95-120.

    Article  Google Scholar 

  6. Tucker CJ, Fung IY, Keeling CD, Gammon RH: Relationship between atmospheric CO2 variations and a satellite-derived vegetation index. Nature.

  7. Buschmann C, Nagel E: In vivo spectroscopy and internal optics of leaves as basis for remote sensing of vegetation. Int J Remote Sens. 1993, 14: 711-722.

    Article  Google Scholar 

  8. Asrar G, Fuchs M, Kanemasu ET, Hatfield JL: Estimating absorbed photosynthetic radiation and leaf area index from spectral reflectance in wheat. Agron J. 1984, 76: 300-306.

    Article  Google Scholar 

  9. Volgemann JE: Comparison between two vegetation indices for measuring different types of forest damage in the north-eastern United States. Int J Remote Sens. 1990, 11: 2281-2297.

    Article  Google Scholar 

  10. Sellers PJ: Canopy reflectance, photosynthesis and transpiration. Int J Remote Sense. 1985, 6: 1335-1372.

    Article  Google Scholar 

  11. Tucker CJ, Fung IY, Keeling CD, Gammon RH: Relationship between atmospheric CO2 variations and a satellite-derived vegetation index. Nature. 1986, 319: 195-199.

    Article  Google Scholar 

  12. Ehrlich D, Lambin EF: Broad scale land-cover classification and interannual climatic variability. Int J Remote Sens. 1996, 17: 845-862.

    Article  Google Scholar 

  13. Justice Co, Townshend JRG, Holben BN, Tucker CJ: Phenology of global vegetation using meteorological satellite data. Int J Remote Sens. 1985, 6: 1271-1318.

    Article  Google Scholar 

  14. Hay SI, Snow RW, Rogers DJ: Predicting malaria seasons in Kenya using multitemporal meteorological satellite sensor data. Trans Roy Soc Trop Med Hyg. 1998, 92: 12-20.

    Article  PubMed  CAS  Google Scholar 

  15. Asner GP, Wessman CA, Bateson CA, Privette JL: Impact of tissue, canopy, and landscape factors on the hyperspectral reflectance variability of arid ecosystems. Remote Sens Environ. 2000, 74: 69-84.

    Article  Google Scholar 

  16. Huete AR, Hua G, Qi J, Chehbouni A, van Leeuwen WJD: Normalization of multidirectional red and NIR reflectances with the SAVI. Remote Sens Environ. 1992, 41: 143-154.

    Article  Google Scholar 

  17. Myneni RB, Asrar G: Atmospheric effects and spectral vegetation indices. Remote Sens Environ. 1994, 47: 390-402.

    Article  Google Scholar 

  18. Muturi EJ, Shililu J, Jacob B, Gu W, Githure J, Novak R: Mosquito species diversity and abundance in relation to land use in a riceland agroecosystem in Mwea, Kenya. J Vector Ecol. 2006, 31 (1): 129-137.

    Article  PubMed  Google Scholar 

  19. Jacob BG, Muturi E, Halbig P, Mwangangi J, Wanjogu RK, Mpanga E, Funes J, Shililu J, Githure J, Regens JL, Novak R: Environmental abundance of Anopheles (Diptera: Culicidae) larval habitats on land cover change sites in Karima village, Mwea rice scheme, Kenya. Am J Trop Med Hyg. 2007, 76: 73-80.

    PubMed  Google Scholar 

  20. Kleinschmidt I, Bagayoko M, Clarke GPY, Craig M, Le Sueur D Le: A spatial statistical approach to malaria mapping. Int Epid Assoc. 2000, 29: 355-361.

    Article  CAS  Google Scholar 

  21. Jacob BG, Muturi EJ, Funes JE, Shililu JI, Githure JI, Kakoma II, Novak RJ: A grid-based infrastructure for ecological forecasting of rice land Anopheles arabiensis aquatic larval habitats. Malaria J. 2006, 5: 91-

    Article  Google Scholar 

  22. []

  23. Wood B, Washino R, Beck L, Hibbard K, Pitcairn M, Roberts D, Rejmankova E, Paris J, Hacker C, Salute J, Sebasta P, Legters L: Distinguishing high and low anopheline-producing rice fields using remote sensing and GIS technologies. Prev Vet Med. 1991, 11: 277-288.

    Article  Google Scholar 

  24. Beck LR, Rodriguez MH, Dister SW, Rodriguez AD, Rejmankova E, Ulloa A, Meza RA, Roberts DR, Paris JF, Spanner MA: Remote sensing as a landscape epidemiologic tool to identify villages at high risk malaria transmission. Am J Trop Med Hyg. 1994, 51 (3): 271-280.

    PubMed  CAS  Google Scholar 

  25. Pope KO, Rejmankova E, Savage H, Arredonde-Jimenez JI, Rodriguez MH, Roberts DR: Remote sensing of tropical wetlands for malaria control in Chiapas, Mexico. Ecol Appl. 1994, 4 (1): 81-90.

    Article  PubMed  CAS  Google Scholar 

  26. Thomson MC, Connor SJ, D'Alessandro U, Rowlingson B, Diggle P, Cresswell M, Greenwood B: Predicting malaria infection in Gambian children from satellite data and bed net use surveys: the importance of spatial correlation in the interpretation of results. Am J Trop Med Hyg. 1999, 61: 2-8.

    PubMed  CAS  Google Scholar 

  27. []

  28. []

  29. Wang Q, Adiku S, Tenhunen J, Granier A: On the relationship of NDVI with leaf area index in a deciduous forest site. Remote Sens Environ. 2005, 94: 244-255.

    Article  Google Scholar 

  30. Jackson RD, Huete AR: Interpreting vegetation indices. Prevent Vet Med. 1991, 11: 185-200.

    Article  Google Scholar 

  31. Huete A, Justice C, Liu H: Development of vegetation and soil indexes for Modis-Eos. Remote Sens Environ. 1994, 49: 224-234.

    Article  Google Scholar 

  32. Asner GP, Hicke JA, Lobell DB: Per-pixel analysis of forest structure: Vegetation Indices, Spectral Mixture analysis and Canopy Reflectance Modeling. Remote Sens Forest Environ. 2003, 1-43.

    Google Scholar 

  33. Huete AR, Hua G, Qi J, Chehbouni A, van Leeuwen WJD: Normalization of multidirectional red and NIR reflectances with the SAVI. Remote Sens Environ. 1992, 41: 143-154.

    Article  Google Scholar 

  34. Kaufman YJ, Tanre D: Atmospherical resistant vegetation index (ARVI) for EOSMODIS, IEEE. Trans Geoscience RemSens. 1992, 30: 261-270.

    Article  Google Scholar 

  35. Hay SI, Snow RW, Rogers DJ: From predicting mosquito habitat to malaria seasons using remotely sensed data: practice, problems and perspectives. Parasiltol Tod. 1998, 14: 306-313.

    Article  CAS  Google Scholar 

  36. Ord K, Getis A: Local spatial autocorrelation statistics: distribution issues and an application. Geogr Anal. 1995, 24: 286-306.

    Google Scholar 

  37. Gimnig JE, Ombok M, Kamau L, Hawley WA: Characteristics of larval Anopheline (Diptera: Culicidae) habitats in Western Kenya. J Med Entomol. 2001, 38: 282-288.

    Article  PubMed  CAS  Google Scholar 

  38. Shililu J, Ghebremeskel T, Seulu F, Mengistu S, Fekadu H, Zerom M, Ghebregziabiher A, Sintasath D, Bretas G, Mbogo C, Githure J, Brantly E, Novak R, Beier JC: Larval habitat diversity and ecology of anopheline larvae in Eritrea. J Med Entomol. 2003, 40: 921-929.

    Article  PubMed  Google Scholar 

  39. Chandler JA, Highton RB, Hill MN: Mosquitoes of the Kano plain, Kenya 1. Results of indoor collections in irrigated and nonirrigated areas using human bait and light traps. J Med Entomol. 1975, 12: 504-510.

    Article  PubMed  CAS  Google Scholar 

  40. Mutero C, Blank H, Konradsen F, van der Hoek W: Water management for controlling the breeding of Anopheles mosquitoes in rice irrigation schemes in Kenya. Acta Trop. 2000, 76: 253-263.

    Article  PubMed  CAS  Google Scholar 

  41. Rao TR: The anophelines of India. Malaria Research Centre (ICMR), New Delhi, India. 1984

    Google Scholar 

  42. Muturi EJ, Shililu JI, Jacob BG, Githure JI, Novak R: Larval habitat dynamics and diversity of Culex mosquitoes in rice agro-ecosystem in Mwea, Kenya. Am J Trop Med Hyg. 2007, 76 (1): 95-102.

    PubMed  Google Scholar 

  43. Ye-Ebiyo Y, Pollack RJ, Kiszewski A, Spielman A: Enhancement of development of larval Anopheles arabiensis by proximity to flowering maize (Zea mays) in turbid water and when crowded. Am J Trop Med Hyg. 2003, 68: 748-752.

    PubMed  Google Scholar 

  44. Bates M: The natural history of mosquitoes. New York: MacMillan, 379-

  45. Robert V, Awono-Ambene HP, Thioulousse J: Ecology of larval mosquito, with special reference to Anopheles arabiensis (Diptera: Culicidae) in market-garden wells in the urban area of Dakar, Senegal. J Med Entomol. 1998, 35: 948-955.

    Article  PubMed  CAS  Google Scholar 

  46. Shililu J, Ghebremeskel T, Seulu F, Mengistu S, Fekadu H, Zerom M, Ghebregziabiher A, Sintasath D, Bretas G, Mbogo C, Githure J, Brantly E, Novak R, Beier JC: Larval habitat diversity and ecology of anopheline larvae in Eritrea. J Med Entomol. 2003, 40: 921-929.

    Article  PubMed  Google Scholar 

  47. Sellers PJ: Canopy reflectance, photosynthesis and transpiration. Int J Remote Sense. 1985, 6: 1335-1372.

    Article  Google Scholar 

  48. Jackson RD, Pinter PJ: Spectral response of architecturally different wheat canopies. Remote Sens Environ. 1986, 20 (1): 43-56.

    Article  Google Scholar 

  49. Myneni RB, Ganapol BD, Asrar G: Remote sensing of vegetation canopy photosynthetic and stomatal conductance efficiencies. Remote Sense Environ. 1992, 42: 217-238.

    Article  Google Scholar 

  50. Karnieli A, Kaufman YJ, Remer L, Wald A: AFRI-aerosol free vegetation index. Rem Sens Environ. 2001, 77: 10-21.

    Article  Google Scholar 

  51. Liu HQ, Huete A: A feedback based modification of the NDVI to minimize canopy background atmospheric noise. Geoscience and Remote Sensing, IEEE Trans Geoscience Remote Sens. 33 (2): 457-465.

  52. Baret F, Guyot G: Potentials and limitations of vegetation indices for LAI and APAR assessment. Remote Sens Environ. 1991, 35: 161-173.

    Article  Google Scholar 

  53. Bausch W: Soil background effects on reflectance-based crop coefficients for corn. Remote Sens Environ. 1993, 46: 1-10.

    Article  Google Scholar 

  54. Kaufman YJ, Tanre D: Atmospherical resistant vegetation index (ARVI) for EOSMODIS, IEEE. Trans Geoscience Remote Sense. 1992, 30: 261-270.

    Article  Google Scholar 

  55. Jacob BG, Arheart KL, Griffith DA, Mbogo CM, Githeko AK, Regens JL, Githure JI, Novak R, Beier JC: Evaluation of environmental data for identification of Anopheles (Diptera: Culicidae) aquatic larval habitats in Kisumu and Malindi, Kenya. J Med Entomol. 2005, 42 (5): 751-755.

    Article  PubMed  PubMed Central  Google Scholar 

  56. Sokal RR, Oden NL: 1. Methodology. Biological. J Linnean Soc. 1978, 10: 199-228.

    Article  Google Scholar 

  57. Griffith DA: A comparison of six analytical disease mapping techniques as applied to West Nile Virus in the coterminous United States. Int J Health Geogr. 2005, 4: 18-

    Article  PubMed  PubMed Central  Google Scholar 

  58. Arbia G, Griffith D, Haining R: Error propagation modeling in raster GIS: overlay operations. Int J Geogr Inf Science. 1998, 12: 145-167.

    Article  Google Scholar 

  59. Los SO, Collatz GJ, Sellers PJ, Malmstrom CM, Pollack NH, DeFries RS, Bounoua L, Parris MT, Tucker CJ, Dazlich DA: A global 9-yr biophysical land surface dataser from NOAA AVHRR data. J of Hydrometeorol. 2000, 1: 183-199.

    Article  Google Scholar 

  60. Pinty B, Verstraete MM: GEMI: a non-linear index to monitor global vegetation from satellites. Plant Ecol. 1992, 101: 15-20.

    Article  Google Scholar 

  61. Slater PN, Jackson RD: Atmospheric effects on radiation reflected from soil and vegetation as measured by orbital sensors using various scanning directions. Appl Optics. 1982, 21: 3923-3921.

    Article  CAS  Google Scholar 

  62. Sithiprasasna R, Lee WJ, Ugsang DM, Linthicum KJ: Identification and characterization of larval and adult anopheline mosquito habitats in the Republic of Korea: potential use of remotely sensed data to estimate mosquito distributions. Int J Health Geogr. 2005, 4: 17-

    Article  PubMed  PubMed Central  Google Scholar 

  63. Gu W, Novak R: Habitat-based modeling of impacts of mosquito larval interventions on entomological inoculation rates, incidence, and prevalence of malaria. Am J Trop Med Hyg. 2005, 73: 546-552.

    PubMed  Google Scholar 

Download references


We would like to thank the data collection efforts of the ICIPE Mwea Rice Mosquito Team: provided by James Wauna, Peter Barasa, Nelson M. Muchiri, Glady Kamari, Charles C. Kiura, William M. Waweru, Christine W. Maina, Peter M. Mutiga, Irene Kamau, Paul K. Mwangi, Nicholus G. Kamari, Martin Njigoya, and Naftaly Gichuki at the Mwea Divison in Kenya for conducting the study. We would also like to thank Dr. Clifford Mutero and Samuel Munyi, for providing data for the various base maps of the three study sites. This research was funded by the National Institute of Health Grant U01A154889 (Novak Robert) University of Illinois, Urbana-Champaign.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Benjamin G Jacob.

Additional information

Competing interests

The author(s) declare that they have no competing interests.

Authors' contributions

BGJ conceived and designed the study. EJM, EXC and SM performed the analyses. JS and JG supervised the field data collection. JM performed the VI operations and JF did the spatial analysis. RJ is the Principal Investigator. All authors interpreted the results and wrote the paper.

Authors’ original submitted files for images

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Jacob, B.G., Muturi, E.J., Mwangangi, J.M. et al. Remote and field level quantification of vegetation covariates for malaria mapping in three rice agro-village complexes in Central Kenya. Int J Health Geogr 6, 21 (2007).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: