- Open Access
A Bayesian approach to study the space time variation of leprosy in an endemic area of Tamil Nadu, South India
International Journal of Health Geographics, volume 7, Article number: 40 (2008)
In leprosy endemic areas, patients are usually spatially clustered rather than randomly distributed. Classical statistical techniques fail to address spatial clustering in the regression model. The Bayesian method allows spatial dependence to be incorporated in the model, but it has been little explored in the field of leprosy. The Bayesian approach may improve our understanding of the variation in leprosy prevalence over space and time.
Data from an endemic area of leprosy, covering 148 panchayats from two taluks in South India for four time points between January 1991 and March 2003 was used. Four Bayesian models, namely, space-cohort and space-period models with and without interactions were compared using the Deviance Information Criterion. Cohort effect, period effect over four time points and spatial effect (smoothed) were obtained using WinBUGS. The spatial or panchayat effect thus estimated was compared with the raw standardized morbidity (leprosy prevalence) rate (SMR) using a choropleth map. The possible factors that might have influenced the variations of prevalence of leprosy were explored.
The Bayesian models with the interaction term fitted best. Leprosy prevalence was higher than average in the older cohorts. The last two cohorts, 1987–1996 and 1992–2001, showed a notable decline in leprosy prevalence. The period effect over the four time points varied from a high of 3.2% to a low of 1.8%. The spatial effect varied between 0.59 and 2. Twenty-six panchayats showed significantly higher prevalence of leprosy than the average when the Bayesian method was used, compared with 40 panchayats with the raw SMR.
The reduction in prevalence of leprosy was 92% for persons born after 1996, which could be attributed to various intervention and treatment programmes such as the vaccine trial and MDT. The estimated period effects showed a gradual decline in the risk of leprosy, which could be due to better nutrition, hygiene and increased awareness about the disease. Comparison of the maps of relative risk from Bayesian smoothing and from the raw SMR showed the variation in the geographical distribution of leprosy prevalence in the study area. The panchayat or spatial effects from the Bayesian models showed clustering of leprosy cases towards the northeastern end of the study area, which was overcrowded and whose population was of poor economic status.
Leprosy is a chronic infectious disease caused by the bacterium, Mycobacterium leprae, which can affect all ages and both sexes. Over the last two decades, prevalence of leprosy has come down substantially at the global level. Introduction and expansion of Multi drug therapy (MDT) in leprosy control programmes have dramatically lowered the prevalence level in almost all the endemic countries. For instance, in India leprosy prevalence has come down from 51 per 10,000 in 1981 to around 2.4 per 10,000 in March 2004  and further below 1 per 10,000 by December 2005 .
In disease epidemiology, many infectious disease events do not occur randomly in a geographical context but occur in clusters. Leprosy shows such uneven distribution between different geographic areas of a country, e.g. in China and in Indonesia, where leprosy endemicity was high and the cases were extensively clustered rather than equally distributed. Hence, data analyses and interpretation should not ignore spatial dependence. In India, we have observed that the distribution of leprosy is uneven even within the smallest community groups such as villages, right down to the family level.
Geographical or spatial analysis comes into play because of the existence of spatial dependence in data. The Bayesian method lends itself to representing the spatial dependence during the estimation of model parameters. Application of Bayesian analysis in the field of leprosy is very limited. A large controlled, double blind, randomized, prophylactic leprosy vaccine trial was conducted in South India to assess the prophylactic efficacies of four different candidate vaccines. Within the trial area, we observed that leprosy was not randomly distributed and showed significant spatial dependence, confirmed by various measures of spatial autocorrelation such as Moran's I, Geary's C and Kulldorff's SATSCAN statistics. Hence, our main objective was to examine the variation in the prevalence of leprosy using the four Bayesian models described by Arbyn et al., which were earlier proposed by Lagazio et al., and to explore possible factors that might have influenced these variations in the study area.
Materials and Methods
Leprosy prevalence data from an endemic area covering 148 panchayats (rural administrative units), comprising 264 contiguous villages in Chingleput district, Tamil Nadu, South India, were studied. This area was specifically identified for a leprosy vaccine trial because of the high endemicity of leprosy. The entire population of about 300,000 people was screened for leprosy by house to house examinations, and cases were identified at four surveys between January 1991 and March 2003. Data on a few socio-economic factors, such as population density and economic status, were also collected.
Definition of leprosy
A case of leprosy was defined as a person, having one or more of the following manifestations and who needed antileprosy treatment: hypopigmented or reddish skin lesion(s) with definite loss of sensation; damage to the peripheral nerves, as demonstrated by loss of sensation and or weakness of the muscles in parts supplied by these nerves; skin smear positive for acid fast bacilli.
The entire population in this area was screened for leprosy before vaccination. First, paramedical workers, trained in leprosy detection, screened the population. A proportion (5%) of this population was randomly allotted to "blinded" senior persons – either medical officers or senior paramedical officers for quality control. All cases and suspects detected by the junior paramedical workers were examined for diagnosis by two senior persons and by a third independent examiner in case of disagreement. Skin smear examination for detecting acid fast bacilli was done for all suspects and definite cases by the senior workers. A team of independent clinicians visited the field at frequent intervals to monitor the procedures for diagnosis of leprosy .
The data collected were also validated in several ways against the earlier surveys. The quality of the data was certified as comparable to world standards by an independent assessment committee consisting of national and international experts.
Leprosy cases and population for each panchayat were cross-classified into 20 age groups (1–4, 5–9, ..., 90–94, 95–99; there were no cases of leprosy under one year of age) and four survey time periods (1991–93, 1993–95, 1997–98 and 1999–2003). Since the surveys, which ran from January 1991 to March 2003, were of varying length, the mid-points of the four surveys (1992, 1994, 1997 and 2001) were taken as the time periods. Cohorts were computed on the basis of survey time period and age. Twenty overlapping (rolling) birth cohorts, analogous to moving averages, were defined in this model; taking the mid-point of each, the cohorts were labelled 1902, 1907, ..., 1997. The variation in prevalence of leprosy over space and time was modelled from January 1991 to March 2003 over 148 panchayats, after controlling for age. The term cohort effect refers to a population born during a particular period and identified by period of birth, so that its characteristics can be ascertained as it enters successive time and age strata.
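The rolling cohort scheme can be illustrated with a minimal Python sketch. The starting year 1897 is not stated in the text; it is an assumption back-calculated from the cohort ranges quoted in the Results (for example C = 7 is 1927–1936 and C = 20 is 1992–2001), and the function names are hypothetical.

```python
# Rolling 10-year birth cohorts whose start years are 5 years apart.
# ASSUMPTION: the first cohort starts in 1897, inferred from the ranges
# quoted in the paper (C = 7 -> 1927-1936, C = 20 -> 1992-2001).
FIRST_COHORT_START = 1897
STEP = 5    # years between successive cohort start years
WIDTH = 10  # each rolling cohort spans 10 calendar years


def cohort_interval(c):
    """Return (first, last) birth year of cohort c, for c in 1..20."""
    if not 1 <= c <= 20:
        raise ValueError("cohort index must be between 1 and 20")
    start = FIRST_COHORT_START + STEP * (c - 1)
    return start, start + WIDTH - 1


def cohort_label(c):
    """Mid-point label of cohort c (1902, 1907, ..., 1997)."""
    start, _ = cohort_interval(c)
    return start + STEP


labels = [cohort_label(c) for c in range(1, 21)]
```

Under this assumption, successive cohorts share five birth years, which is what makes them overlap like a moving average.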
Four Bayesian models, namely
(i) Space-Cohort (SC) with interactions,
(ii) Space-Cohort (SC) without interactions,
(iii) Space-Period (SP) with interactions and
(iv) Space-Period (SP) without interactions were fitted.
Models with and without interaction terms were compared using the Deviance Information Criterion (DIC), a generalization of Akaike's Information Criterion to the Bayesian framework: the lower the DIC value, the better the model. Posterior distributions of the parameters of interest (combining the priors and the available data) were obtained using Gibbs sampling in WinBUGS. The data for the SC model (observed and expected cases) were obtained by aggregating the data over the four time periods. The data for the SP model (observed and expected cases) were obtained by aggregating the data over the age groups. The average of all the cohorts was used as the reference for the SC model, and the average of all periods for the SP model.
The spatial effects were estimated using each of the models. These smoothed effects were compared with the raw (unsmoothed) standardized morbidity rate (SMR).
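As a rough illustration of the raw SMR, the sketch below divides observed cases by expected cases obtained through indirect age standardization. The text does not spell out how the expected counts were computed, so the use of reference age-specific rates is an assumption, and the function names and numbers are hypothetical.

```python
def expected_cases(population_by_age, reference_rate_by_age):
    """Expected cases in one panchayat under indirect standardization.

    ASSUMPTION: expected counts come from applying reference age-specific
    prevalence rates to the panchayat's population in each age group.
    """
    if len(population_by_age) != len(reference_rate_by_age):
        raise ValueError("one reference rate is needed per age group")
    return sum(p * r for p, r in zip(population_by_age, reference_rate_by_age))


def raw_smr(observed, expected):
    """Raw (unsmoothed) standardized morbidity ratio, observed / expected."""
    if expected <= 0:
        raise ValueError("expected count must be positive")
    return observed / expected


# Hypothetical panchayat with two age groups (illustrative numbers only).
expected = expected_cases([1000, 2000], [0.01, 0.02])
smr = raw_smr(75, expected)
```

With small expected counts the raw ratio is unstable, which is the motivation for comparing it with the smoothed Bayesian estimate.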
Because we dealt with subgroups such as panchayats, we used prevalence instead of incidence to obtain a sufficiently large number of cases to draw meaningful conclusions. Moreover, had we used incidence rather than prevalence, only three surveys would have been available, excluding the baseline survey. Generally the second survey incidence (a mixed bag of old, missed prevalence cases and new cases) is not considered for vaccine efficacy or for trend analysis. We would then have been left with only the third and fourth surveys, and it would not have been possible to examine the trend over years with the Bayesian models. Hence, as a pragmatic measure, we considered prevalence cases for this study.
In the study area the majority of leprosy cases belonged to the paucibacillary variety, and multibacillary cases constituted a smaller proportion (17.4%, 3.3%, 5.4% and 1.4% in the four surveys respectively). Fixed duration MDT for six months was practised throughout the study period. For the analysis in this paper we restricted attention to the paucibacillary variety. Hence, prevalence trends also indicate trends in leprosy incidence, and the two go hand-in-hand. Any beneficial change that occurs during the survey period decreases incidence as well as prevalence. Since we had limited data collected specifically by panchayat on socio-economic factors, and there could be other important factors apart from the two mentioned above, we did not include any of these factors directly in the model. Though the period of study was very short, major changes took place in the region, both in the leprosy control programme and in socio-economic status. According to the World Development Indicators 2005, rural poverty in India declined from 53% to 27% between 1977–78 and 1999–2000, and the Indian social structure was transformed between 1991 and 2001 through an increased literacy rate, urbanization, industrialization, new economic liberalization and so on.
The formulae used are described in Appendix-1
There were 6,601, 4,731, 3,342 and 2,098 cases of leprosy at the four time periods, respectively. The DIC values for the space-cohort model with and without interaction were compared. Since the models with the interaction term performed better, with smaller DIC values (Table 1 and Table 2), further analyses were carried out with the interaction term included in the SC and SP models. The scalar parameters estimated by Markov Chain Monte Carlo (MCMC) simulation and their 95% credible intervals for the SC and SP models with interactions are shown in Table 3 and Table 4 respectively.
Cohort effect using the SC model with interaction
The median cohort effects declined from 1.86 to 0.08 over the successive cohorts. The cohort effect from the SC model (Table 5), measured in terms of relative risk, was significantly higher than the average in the older cohorts (C ≤ 11, i.e. persons born before 1957). The cohort effect decreased steadily up to 1922–1931 (C ≤ 6), increased substantially during 1927–1936 and 1932–1941 (C = 7 and C = 8), then declined significantly again to a risk 36% higher than the average at C = 11, and finally reached a risk 92% lower than the average (C = 20) for persons born after 1996. There were four major jumps (differences between a cohort and the preceding one) in the risk pattern of the cohorts: 1927–1936 (higher risk of 45%), 1942–1951 (reduced risk of 31%), 1987–1996 (reduced risk of 33%) and 1992–2001 (reduced risk of 40%).
Period effect using the SP model with interaction
The period effect over the four time points from the SP model ranged from a significantly higher risk than the average (relative risk 1.032, i.e. 3.2% higher) to a risk 1.8% lower than the average (Table 6).
Space effect using SC and SP model with interaction
The smoothed Bayesian spatial effects from the SC and SP models with interaction terms were similar, as was also observed in the Belgian study. The spatial effects of the different panchayats are listed in Table 7 and can be visualized in the choropleth map (Figure 1). The spatial effects varied between 0.59 and 2. The Bayesian models identified 26 panchayats that had a significantly higher risk of leprosy. A higher risk of leprosy (50% or more) was found in 10 panchayats, most of which lay close to each other towards the North-Eastern end of the study area. The lowest risk (relative risk 0.59) was observed in two panchayats. The raw SMR identified 43 panchayats at a statistically significant higher risk (Figure 2), compared with 26 panchayats identified by the Bayesian models.
We examined variation in the prevalence of leprosy using Bayesian methods over 20 rolling cohorts and four time periods, from a meticulously collected dataset of a vaccine trial conducted in South India. The cohort effects showed neither a steadily decreasing nor a steadily increasing pattern but an intricate one, whereas the period effects showed leprosy prevalence decreasing continuously over the four time points.
The steady decreasing period effect reflects that the control programme had reached all the target population whereas the intricate cohort effects showed that the effects captured were not similar in all the cohorts.
The higher difference in the risk in successive cohorts could be attributable to persons coming forward for seeking treatment, gradual awareness in the community that leprosy is curable, slowly combating the social stigma, early screening programmes, better case detection methods, availability of therapies and elimination programmes. This is particularly true in the vaccine trial setting where health systems operations and leprosy programme have been implemented in total .
The turning point in the risk pattern (from high risk to low risk) occurred in the 1952–1961 cohort (C = 12). This change could be due to the impact of the Dapsone based National Leprosy Control Programme introduced in 1955. The gradual reduction of risk in the later cohorts (C = 13, C = 14) may be due to the continuing effect of the programmes and possibly to the improvement in socio-economic conditions during the transition that has been taking place in India. The further reduction in risk (C = 19) could be due to more effective and intensified treatment programmes such as Multi-Drug Therapy in 1991, which has changed the face of leprosy, and to the introduction of effective prophylactic vaccines through the leprosy vaccine trial.
Our data indicate 92% reduction in the leprosy prevalence for persons born after 1996 (C = 20). The case detection and treatment activity in the community has brought down the new infection rate in the younger age group as a secondary effect of MDT in addition to the primary effect of prophylactic vaccines. The older age group might have been infected in the past i.e. before the introduction of Dapsone and MDT as well as before the introduction of the vaccination programme. They might already be harbouring the infection and the break down could result in fresh disease. This is similar to the endogenous reactivation observed in tuberculosis .
We observed slow decline in the estimated period effects of leprosy in the study area. It agrees with the fact that the prevalence of leprosy has come down over years globally as well as in India . Leprosy being a chronic disease, temporal changes within endemic regions are slow . This phenomenon is observed in the study area. Moreover, the decline in the risk of leprosy over the time periods in the study area could be due to better understanding on nutrition, hygiene, and increased public understanding of the disease (socio-economic factors) which limit the spread of leprosy or increased resistance to leprosy.
Comparing the spatial effect from the Bayesian model with the raw SMR shows the variation in the geographical distribution of leprosy prevalence. The Bayesian smoothing approach accounts for the variability in the population at risk and for the clustering effect. The Bayesian spatial effects showed a strong pattern of clustering towards the North-Eastern region of the study area. Though the effects obtained through the models showed a reduction in the risk of leprosy, a few pockets of high prevalence still exist. The situation was similar in Tuscany, where the lung cancer epidemic, when analyzed using birth cohorts, showed a decline while the spatial pattern remained evident and strong along the north-west/south-east gradient.
In the state of Ceará, North-East Brazil, the spatial pattern of leprosy was heterogeneous, and municipalities with very high prevalence were clustered along the North-South axis. Surprisingly, the regions with the highest incidence were the most urbanized and economically developed. According to the authors, the spatial clustering of disease rates might be related to a heterogeneous distribution of factors such as crowding, social inequality and environmental characteristics, which by themselves determine the transmission of Mycobacterium leprae. It could also be due to a more efficient health system in these regions, which detects new cases of leprosy more efficiently. We explored this explanation and observed that the pockets identified by the Bayesian model had a population density of about 1,427 per sq km, which is two times higher than that of the district and three times higher than that of the state. Hence environmental factors such as urbanization and overcrowding due to inadequate housing could have led to more frequent close contact with the source of infection and favoured the spread of leprosy. We also observed that nearly 37% of the people in this pocket belong to the economically poorer strata. Generally, people from economically poorer strata are more prone to infectious diseases like leprosy, as they live in close proximity to one another, resulting in a higher risk of contracting the disease. Dharmendra emphasizes that one cannot control, eliminate or eradicate leprosy without improving the socio-economic status or changing the insanitary habits of the common people. Hence these few pockets or strata need greater care to bring the leprosy prevalence down much further and to make the study area free from leprosy.
The step by step algorithms and WinBUGS code can be downloaded from the annexes of Arbyn et al.
Let $O_{ijt}$ denote the observed count of leprosy cases in panchayat $i$ ($i = 1, 2, \ldots, 148$) in the $j$th cohort ($j = 1, 2, \ldots, 20$) at time point $t$ ($t = 1, 2, 3, 4$);
$O_{ij} = \sum_{t} O_{ijt}$ be the observed number of leprosy cases in the $i$th panchayat and $j$th cohort;
$X_{ij}$ be the expected number of leprosy cases in the $i$th panchayat and $j$th cohort.
$O_{ij} \sim \mathrm{Poisson}(\alpha_{ij})$,
with $\alpha_{ij} = rr_{ij} \cdot X_{ij}$ and $\log(rr_{ij}) = \xi_{ij}$, where $i = 1, 2, \ldots, 148$ panchayats,
$j = 1, 2, \ldots, 20$ cohorts,
$\xi_{ij}$ is a linear predictor,
$rr_{ij}$ is the relative risk of the $i$th panchayat and the $j$th cohort.
$\exp(\beta_j^{coh})$ is the estimated cohort effect and is similar to the standardized cohort morbidity ratios described by Beral.
The linear predictor $\xi_{ij}$ can be specified in two ways.
Model without interactions: $\xi_{ij} = a + \beta_i^{str} + \beta_i^{unstr} + \beta_j^{coh}$,
Model with interactions: $\xi_{ij} = a + \beta_i^{str} + \beta_i^{unstr} + \beta_j^{coh} + SS_{ij}^{ac}$,
where $SS_{ij}^{ac} = S_{ij}^{ac} - S_{1j}^{ac} - S_{i1}^{ac} + S_{11}^{ac}$.
The first term $S_{ij}^{ac}$ represents the interaction of panchayat and cohort, and $SS_{ij}^{ac}$ is its centred form, used to improve the convergence of the Markov Chain Monte Carlo (MCMC) simulation, as discussed in detail by Arbyn et al.
Here $\beta_i^{str}$ represents structured spatial variability;
$\beta_i^{unstr}$ represents unstructured spatial variability;
$\beta_j^{coh}$ represents the effect of the $j$th cohort.
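The centring of the interaction term and the step from linear predictor to Poisson mean can be sketched in Python as follows. This is an illustrative sketch, not the WinBUGS code used in the study; it assumes the usual log link between the relative risk and the linear predictor, and all function names are hypothetical.

```python
import math


def centre_interaction(S):
    """Centre an interaction matrix S (rows = panchayats, columns = cohorts).

    Implements SS[i][j] = S[i][j] - S[first][j] - S[i][first] + S[first][first],
    where index 0 plays the role of subscript 1 in the formula.
    """
    n_rows, n_cols = len(S), len(S[0])
    return [
        [S[i][j] - S[0][j] - S[i][0] + S[0][0] for j in range(n_cols)]
        for i in range(n_rows)
    ]


def relative_risk(a, b_str, b_unstr, b_coh, ss=0.0):
    """Relative risk from the linear predictor (ASSUMPTION: log link)."""
    xi = a + b_str + b_unstr + b_coh + ss
    return math.exp(xi)


def poisson_mean(rr, expected):
    """Poisson mean alpha = relative risk times expected count."""
    return rr * expected


# Toy 2 x 3 interaction matrix (illustrative values only).
SS = centre_interaction([[1.0, 2.0, 3.0], [4.0, 6.0, 9.0]])
```

After centring, the first row and first column of the interaction matrix are exactly zero, so the interaction is measured relative to the first panchayat and the first cohort.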
Prior distribution for the model
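$\beta^{str} = (\beta_1^{str}, \ldots, \beta_{148}^{str})^T$,
$\beta^{unstr} = (\beta_1^{unstr}, \ldots, \beta_{148}^{unstr})^T$,
$\beta^{coh} = (\beta_1^{coh}, \ldots, \beta_{20}^{coh})^T$,
$S^{ac} = (S_{1,1}^{ac}, \ldots, S_{148,20}^{ac})^T$.
The priors are multivariate normal:
$\beta^{str} \sim N(0, (\mu_{str}\,\sigma_{str})^{-1})$,
$\beta^{unstr} \sim N(0, (\mu_{unstr}\,I_{148})^{-1})$,
$\beta^{coh} \sim N(0, (\mu_{coh}\,\sigma_{coh})^{-1})$,
$S^{ac} \sim N(0, (\mu_{ac}\,\sigma_{ac})^{-1})$.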
The structured spatial term $\beta^{str}$ and the cohort effect $\beta^{coh}$ are assigned the Gaussian Conditional Autoregressive (CAR) prior distribution. The specification of the matrices $\sigma_{str}$, $\sigma_{coh}$ and $\sigma_{ac}$ closely follows that of Lagazio. The CAR priors are implemented using the WinBUGS function car.normal().
The above function constrains the random effects to sum to zero, so that the following constraints are satisfied in the model: $\sum_i \beta_i^{str} = 0$ and $\sum_j \beta_j^{coh} = 0$.
The prior for the interaction vector Sac is a Markov random field.
The intercept term 'a' was given a flat prior through WinBUGS function dflat().
The precision terms μ str and μ coh and μ unstr were given Gamma priors.
The space-period model is similar to the above, with periods (t = 1, ..., 4) used instead of cohorts. The algorithmic steps for the four models with and without interactions using WinBUGS, similar to those of the space-cohort model, are discussed in detail elsewhere.
The Deviance Information Criterion (DIC) is defined as
$DIC = \bar{D} + p_D$, where
$\bar{D}$: the posterior expectation of the deviance, which summarizes the fit of the model,
$D(\bar{\theta})$: the deviance evaluated at the posterior expectations of the parameters,
$p_D = \bar{D} - D(\bar{\theta})$: the effective number of parameters.
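For readers who want to see the arithmetic, the following Python sketch computes the DIC from a set of sampled deviances using the standard definition (mean deviance plus effective number of parameters). The function name and the input values are hypothetical and are not taken from Table 1 or Table 2.

```python
def dic_summary(deviance_samples, deviance_at_posterior_mean):
    """Return D_bar, p_D and DIC from MCMC output.

    deviance_samples: deviance evaluated at each retained MCMC draw.
    deviance_at_posterior_mean: deviance evaluated at the posterior
    means of the parameters.
    """
    if not deviance_samples:
        raise ValueError("at least one deviance sample is required")
    d_bar = sum(deviance_samples) / len(deviance_samples)  # model fit
    p_d = d_bar - deviance_at_posterior_mean               # complexity
    return {"D_bar": d_bar, "p_D": p_d, "DIC": d_bar + p_d}


# Illustrative values only.
result = dic_summary([100.0, 104.0, 102.0], 98.0)
```

When two candidate models are summarized this way, the one with the smaller DIC is preferred.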
In the model the spatial effect (autocorrelation) depends on
whether any two panchayats share a common boundary and
the number of shared neighbours (panchayats).
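A minimal Python sketch of how such neighbourhood information is typically prepared for the car.normal() prior is given below: a flat list of neighbour ids (adj), unit weights, and the number of neighbours of each area (num). The three-area map and the function names are hypothetical, and unit weights are an assumption. Under an intrinsic CAR prior with unit weights, the conditional mean of an area's effect is the average of its neighbours' effects, and its conditional precision grows with the number of neighbours.

```python
def car_normal_inputs(neighbours):
    """Build adj, weights and num vectors from a neighbour dictionary.

    neighbours maps each area id (1..N) to the ids of the areas that
    share a boundary with it. Unit weights are an ASSUMPTION.
    """
    areas = sorted(neighbours)
    for i in areas:
        for j in neighbours[i]:
            if i not in neighbours.get(j, ()):
                raise ValueError(f"adjacency not symmetric for pair ({i}, {j})")
    adj, num = [], []
    for i in areas:
        nbrs = sorted(neighbours[i])
        adj.extend(nbrs)       # neighbour ids, area by area
        num.append(len(nbrs))  # number of neighbours of area i
    weights = [1] * len(adj)
    return adj, weights, num


def neighbour_mean(values, neighbours, i):
    """Average effect of the neighbours of area i (CAR conditional mean)."""
    nbrs = neighbours[i]
    return sum(values[j] for j in nbrs) / len(nbrs)


# Hypothetical map: three areas in a row, 1 - 2 - 3.
toy_map = {1: [2], 2: [1, 3], 3: [2]}
adj, weights, num = car_normal_inputs(toy_map)
```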
For fitting the models with and without interactions, two chains with 1:10 thinning were used to obtain a sample of 10,000 values.
For the model without interactions, a burn-in of 50,000 iterations followed by an additional 50,000 iterations was used. For the models with interactions, a burn-in of 100,000 iterations followed by an additional 50,000 iterations was used.
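The sample size quoted above follows from simple arithmetic, sketched here in Python. The sketch assumes that the additional 50,000 iterations were run in each of the two chains; the function name is hypothetical.

```python
def retained_samples(chains, post_burn_in_iterations, thin):
    """Number of posterior draws kept after burn-in and thinning.

    ASSUMPTION: post_burn_in_iterations is the count per chain.
    """
    if thin < 1:
        raise ValueError("thinning interval must be at least 1")
    return chains * (post_burn_in_iterations // thin)


# Two chains, 50,000 post-burn-in iterations each, keeping every 10th draw.
n_kept = retained_samples(chains=2, post_burn_in_iterations=50_000, thin=10)
```

The longer burn-in for the models with interactions changes only the number of discarded iterations, not the number of retained values.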
Sasakawa Yohei: WHO Goodwill Ambassador's Newsletter. For the elimination of leprosy no.12;. 2005
Special Correspondent, India achieves leprosy eradication target. The Hindu News paper, 31 January, 2006 (page 15 col 1)
Chen XS, Li WZ, Jiang C, Ye GY: Leprosy in China: epidemiological trends between 1949 and 1998. Bull World Health Organ. 2001, 79 (4): 306-12.
Bakker MI, Hatta M, Kwenang A, Klatser PR, Oskam L: Epidemiology of leprosy on five isolated islands in the Flores Sea, Indonesia. Trop Med Int Health. 2002, 7 (9): 780-7. 10.1046/j.1365-3156.2002.00931.x.
Anselin L, Griffith DA: Do spatial effects really matter in regression analysis?. Papers of the Regional Science Association. 1988, 65: 11-34.
SundarRao PSS: Current epidemiology of leprosy in India. Lepr Rev. 2006, 77: 292-294.
Gupte MD: Leprosy Epidemiology. Text Book and Atlas of Dermatology (II). 2001, chapter 65: 1543-52. second
Bailey TC, Carvalho MS, Lapa TM, Souza WV, Brewer MJ: Modeling of under-detection of cases in disease surveillance. Annals of Epidemiology. 15: 335-343. 10.1016/j.annepidem.2004.09.013.
Gupte MD, Vallishayee RS, Anantharaman DS, Nagaraju B, Sreevatsa , Balasubramanyam S, de Britto RL, Elango N, Uthayakumaran N, Mahalingam VN, Lourdusamy G, Ramalingam A, Kannan S, Arokiasamy J: Comparative Leprosy Vaccine Trial in South India. Indian Journal of Leprosy. 1998, 70 (4): 369-88.
Identification of clusters using measures of spatial autocorrelation in an endemic area of leprosy, Tamil Nadu, South India. Unpublished manuscript, National Institute of Epidemiology, Chennai 600 077, India
Arbyn M, Capet F, Komarek A, Lesaffre E: Space-time variation of cervical cancer mortality in Belgium using an Hierarchical Bayesian model – (Belgium, 1969–1994). accessed on 25th December 2006, http://www.iph.fgov.be/EPIDEMIO/epien/cervixen/space_time.pdf
Lagazio C, Dreassi E, Biggeri A: A hierarchical Bayesian model for space-time variation of disease risk. Statistical Modelling. 2001, 1: 17-29. 10.1191/147108201128069.
Last JM: A Dictionary of Epidemiology. 2001, Oxford University Press, Fourth edition
Spiegelhalter DJ, Best NG, Carlin BP, Van der Linde A: Bayesian measures of model complexity and fit. J Royal Stat Soc B. 2002, 64: 583-616. 10.1111/1467-9868.00353.
Spiegelhalter D, Thomas A, Best N: WinBUGS: Bayesian Inference Using Gibbs Sampling. 2000, MRC Biostatistics Unit, Institute of Public Health, Cambridge & Department of Epidemiology and Public Health, Imperial College School of Medicine, London, accessed on 5th December 2003, http://www.mrc-bsu.cam.ac.uk/bugs
India Country Health System profile-Trends in Socio-economic Development. accessed on 5th April 2008,http://www.searo.who.int/EN/Section313/Section1519_10850.htm
Meima A, Gupte MD, van Oortmarssen GJ, Habbema JDF: Trends in leprosy case detection rates. Int J Lepr and Other Mycobact Dis. 1997, 65: 305-19.
Sutherland I: Recent studies in the epidemiology of tuberculosis, based on the risk of being infected with tubercle bacilli. Advances in tuberculosis research. Edited by: Urbancyik G, Birkhauser H, Fox W. 1976, 1-63.
Gupte MD, Pannikar V, Manickam P: Leprosy case detection trends in India. Health Administrator. 18 (2): 28-36. http://medind.nic.in/haa/t06/i2/haat06i2p28.pdf
Montenegro ACD, Werneck LG, Kerr-Pontes LRS, Barreto ML, Feldmeier H: Spatial Analysis of the Distribution of Leprosy in the State of Ceará, Northeast Brazil. Mem Inst Oswaldo Cruz, Rio de Janeiro. 2004, 99 (7): 683-6. accessed on 25th December 2006, https://tspace.library.utoronto.ca/bitstream/1807/3855/1/oc04139.pdf
Census of India 2001. accessed on 25th December 2006,http://www.censusindia.gov.in/
Leprosy. Malaysian Medical Association, Press releases. accessed on 25th December 2006, http://www.mma.org.my/current_topic/leprosy.htm
Dharmendra : Control and Eradication of Leprosy – a strategy. The Health Administrator. 1988, http://medind.nic.in/haa/t06/i2/haat06i2p1.pdf
Beral V: Cancer of the cervix: a sexually transmitted infection?. Lancet. 1974, 25: 1037-40. 10.1016/S0140-6736(74)90432-2.
Geweke J: Evaluating the Accuracy of Sampling-Based Approaches to the Calculation of Posterior Moments,. Proceedings of the Fourth Valencia International Meeting on Bayesian Statistics. Edited by: Berger JO, Bernardo JM, Dawid AP, Smith AFM. 1992, Oxford: Oxford University Press, 169-94.
Gelman A, Rubin DB: Inference from iterative simulation using multiple sequences. Statistical Science. 1992, 7: 457-511. 10.1214/ss/1177011136.
We thank Dr. P Manickam@ and Dr. V Selvaraj@ for their critical comments and reading the manuscript. We thank Mr. A Elangovan@ for digitizing the map.
@National Institute of Epidemiology.
The authors declare that they have no competing interests.
VJ conceived the ideas, performed the statistical analysis and drafted the manuscript. MDG conceived the study, coordinated and participated in the trial, gave critical and intellectual comments for the improvement of the manuscript. MB contributed to data analysis and gave critical comments for the improvement of the manuscript. All the authors read and approved the final manuscript.
Mohan D Gupte and M Bhagavandas contributed equally to this work.