The spatial distribution of esophageal and gastric cancer in Caspian region of Iran: An ecological analysis of diet and socioeconomic influences
 Mohammadreza Mohebbi^{1},
 Rory Wolfe^{1},
 Damien Jolley^{1},
 Andrew B Forbes^{1},
 Mahmood Mahmoodi^{2} and
 Robert C Burton^{1}
DOI: 10.1186/1476-072X-10-13
© Mohebbi et al; licensee BioMed Central Ltd. 2011
Received: 1 November 2010
Accepted: 15 February 2011
Published: 15 February 2011
Abstract
Recent studies have suggested a systematic geographic pattern of esophageal cancer (EC) and gastric cancer (GC) incidence in the Caspian region of Iran. The aims of this study were to investigate the association between these cancers and the region's dietary and socioeconomic risk factors, and to map EC and GC after adjustment for the risk factors and the removal of random and geographic variations from area-specific age-standardised incidence ratios (SIRs).
We obtained cancer data from the Babol cancer registry from 2001 to 2005, socioeconomic indices from the Statistical Centre of Iran, and dietary patterns from the control group in a case-control study conducted in the study region. Regression models were fitted to identify significant covariates, and clusters of elevated rates were identified.
We found evidence of systematic clustering for EC and GC in men and women and both sexes combined. EC and GC SIRs were lower in urban areas, and were also lower in areas of high income. EC SIRs were lower in areas with higher proportions of people having unrestricted food choice and higher in areas with higher proportions of people with restricted food choice.
EC and GC were associated with aggregated risk factors, including income, urbanisation, and dietary patterns. These variables represent the influence of improved lifestyle which has coincided with a decrease in upper gastrointestinal cancer frequency over recent decades but which has not necessarily been uniform throughout the region.
Introduction
Iran has high rates of both esophageal cancer (EC) and gastric cancer (GC) [1, 2]. There is evidence of sharp gradients in incidence rates over relatively short geographical distances in the Caspian region of Iran [3]. While EC incidence has decreased to less than half the rate reported three decades ago [4], a recent study highlighted the existence of a strong systematic geographical pattern in EC and GC incidence in the southern region of the Caspian Sea, but did not consider area-related risk factors for analytical purposes [5]. In this study we investigate the association between the geographic pattern of EC and GC incidence and the dietary and socioeconomic patterns in this region.
A greater incidence of both EC and GC has been shown to occur in populations with low socioeconomic status (SES) [7]. This may be accounted for by the relationship between socioeconomic indicators and environmental exposures, occupational exposure and individual habits [8].
Observational studies have found that fruit and vegetable consumption generally protects against EC and GC, with stronger support for this association coming from case-control studies than from cohort studies, whereas salt, processed meats and foods, and sweets have usually been linked with increased risk of these diseases [9–12]. Analysing dietary patterns can elicit the role of overall diet in EC and GC etiology, an association which has been demonstrated in previous studies [13–15].
This article reports the application of a five-part methodology as follows: (1) calculate and map sex-stratified age-standardised incidence ratios (SIRs) for EC and GC; (2) use appropriate statistical measures to evaluate geographic autocorrelation; (3) identify major socioeconomic and dietary patterns in the study region; (4) evaluate the association of SES and dietary patterns with EC and GC using multilevel modelling; and (5) compare maps of model-adjusted smoothed estimates with the maps in part (1) that are not adjusted for geographic correlation or SES and dietary patterns.
Methods
The study was ecological in design, and used census-derived area data, map data, and individual person data as described below.
Study Population
The estimated mid-year populations of Mazandaran and Golestan provinces between 2001 and 2005, stratified by sex, age in five-year intervals, and place of residence, were obtained from the Statistical Centre of Iran [6]. These estimates were projections for 2001 to 2005, based on 1995 census data using the 2000 geographic boundaries [16, 17]. Geographic coordinates that approximately reflected the geographical centroid of each agglomeration were also obtained [6].
Data sources
The cases of interest were all EC and GC patients registered between 2001 and 2005 among the study population. Data on incident cases of cancer were obtained from the Babol Cancer Registry; issues related to methods, quality and completeness of data collection for this cancer registry are described elsewhere [5, 18]. In summary, the major sources of cancer data in the Babol cancer registry were reports from pathology laboratories, hospitals, and radiology clinics. Coding of cancer diagnosis samples was based on the International Classification of Diseases for Oncology (ICD-O) [19] and was done under direct supervision of pathology specialists. Microscopic verification was available for 47.7% of esophageal and 49.6% of gastric cancer cases. The reference address for all cases was the address at diagnosis. About 3% of cases lacked residential information at the agglomeration level. In order to use the cases with unknown residential information, the geographic referral pattern for each hospital or diagnosis centre was used to assign residences on a proportional as-likely basis. Concordance of residential place information within one year of diagnosis was examined for patients with multiple records during 1998-2000. Agreement on place of residence between the first diagnosis record and the next was 94% for gastric and 92% for esophageal cancer [20].
Explanatory variables were classified into two groups: socioeconomic characteristics of the 152 agglomerations and dietary patterns of the 26 wards. For each agglomeration the following socioeconomic variables were obtained from the 1995 statistical yearbooks of Mazandaran and Golestan [16, 17] or the income and expenses surveys in urban and rural areas in 1995 [21, 22]: population density (inhabitants per square kilometre), relative level of activity (a synthetic indicator devised by the Statistical Centre of Iran that is calculated from the number of households, number of telephone lines, number of bank offices, number of commercial licences, electricity consumption, and annual construction budget), annual income per family, annual expenditure on food per family, annual expenditure on fruit and vegetables per family, percentage of occupation in the industrial sector, percentage of occupation in the services sector, percentage of occupation in the agricultural sector, percentage of occupation in the construction sector, percentage of male unemployment, and percentage of illiteracy. In addition to rural villages, some agglomerations contain one or more cities; a proportional as-likely method was used to calculate the socioeconomic characteristics of these agglomerations.
Table 1: Dietary pattern loadings from factor analysis (restricted and unrestricted food choice) of dietary consumption. Rotated component matrix*

| Item | Unrestricted food choice | Restricted food choice |
|---|---|---|
| Fresh and frozen fish | .848 | |
| Total fruit | .748 | .120 |
| Sweets | .261 | .215 |
| Poultry | .444 | |
| Red meat, liver | .230 | .180 |
| Salted/preserved food | | .631 |
| Potatoes: baked, boiled | | .561 |
| Canned fish | | .516 |
| Regular fibre | .112 | .254 |
| Eggs | | .279 |
| White bread, rice, pasta | .241 | .653 |
| Total vegetables | .427 | |
| Soft drinks | | |
| French fries | .183 | |
| Dairy | | .212 |
| Nuts | | .179 |
| Pickles | | .113 |
Factor analysis of socioeconomic and dietary variables
A factor analysis was performed to summarise socioeconomic information into a few uncorrelated factors. Factor analysis was also used for diet variables. Principal components extraction followed by Varimax rotation with Kaiser normalisation was used to facilitate interpretation of the factors. The Anderson-Rubin method was used to create factor scores from the factor solution. The factors extracted with this method are uncorrelated, with a mean of zero and a variance of one [25]. We attached labels to the factors by considering the interpretation of items with sizable pattern coefficients. All factor scores were divided into sextiles for illustration purposes. Factor scores extracted from dietary patterns were divided into tertiles for all controls, and the percentage of controls in each ward with factor scores in the highest (3^{rd}) tertile was used in the regression model. For socioeconomic components, the factor scores of each agglomeration were used in the regression model as a continuous covariate.
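The ward-level dietary covariate described above (the percentage of controls in each ward whose factor score falls in the highest tertile) can be sketched in a few lines. This is a minimal illustration, not the authors' SPSS workflow: the scores, the ward labels and the helper name `pct_top_tertile` are hypothetical, and the tertile cut point uses Python's default quantile method, which may differ slightly from the rule used in the study.

```python
from collections import defaultdict
from statistics import quantiles


def pct_top_tertile(scores, wards):
    """Percentage of controls per ward whose factor score is in the top tertile.

    Tertile cut points are computed over all controls pooled together.
    """
    upper_cut = quantiles(scores, n=3)[1]  # boundary between 2nd and 3rd tertile
    total = defaultdict(int)
    top = defaultdict(int)
    for score, ward in zip(scores, wards):
        total[ward] += 1
        if score > upper_cut:
            top[ward] += 1
    return {ward: 100.0 * top[ward] / total[ward] for ward in total}


# Hypothetical factor scores for nine controls living in two wards
scores = [1, 2, 8, 9, 3, 4, 5, 6, 7]
wards = ["A", "A", "A", "A", "B", "B", "B", "B", "B"]
pct = pct_top_tertile(scores, wards)
```

In this toy example the three highest scores (7, 8 and 9) form the top tertile, so ward A contributes two of its four controls and ward B one of its five.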
Standardised incidence ratio (SIR) calculation
Adjustment of incidence rates for differences in the age and sex structure of agglomerations was accomplished by sex-stratified age-standardisation (in 5-year intervals of age). The SIR for a given agglomeration was obtained as the ratio of the observed to the expected number of cases in that agglomeration. We used the indirect method of standardisation for internal comparisons [26]. Since the population of the region was stable between 2001 and 2005, the 2003 population size was used for computing the incidence rates in age and sex categories of the overall region and the subsequent expected number of cases in each agglomeration. In order to compare the incidence rates in the Mazandaran and Golestan region with other parts of the world, directly standardised incidence rates were also calculated, using the 1970 Segi world population for historical comparisons [27] and the 2000 WHO world population for contemporary comparisons [28].
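The indirect standardisation described above can be illustrated with a small sketch. All numbers, the age strata and the function names are hypothetical; the example only shows the arithmetic (region-wide stratum-specific rates applied to an agglomeration's population to obtain expected cases), not the authors' spreadsheet.

```python
def expected_cases(area_pop, ref_rates):
    """Expected cases in one area: sum over strata of population x reference rate."""
    return sum(area_pop[stratum] * ref_rates[stratum] for stratum in area_pop)


def sir(observed, area_pop, ref_rates):
    """Standardised incidence ratio: observed divided by expected cases."""
    return observed / expected_cases(area_pop, ref_rates)


# Hypothetical region-wide cases and population by age stratum (one sex)
region_cases = {"40-44": 10, "45-49": 30}
region_pop = {"40-44": 100_000, "45-49": 60_000}
ref_rates = {s: region_cases[s] / region_pop[s] for s in region_pop}

# Hypothetical agglomeration: population by stratum and observed cases
area_pop = {"40-44": 5_000, "45-49": 2_000}
area_observed = 3

area_expected = expected_cases(area_pop, ref_rates)
area_sir = sir(area_observed, area_pop, ref_rates)
```

Here the agglomeration would be expected to have about 1.5 cases under region-wide rates, so three observed cases give an SIR of about 2.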
Exploratory spatial data analysis
Two methods were used to measure spatial aggregation of the agglomeration SIRs: Moran's I [29] and the semivariogram [30].
Moran's I is a correlation-type index based on continuous data values, but its interpretation differs from that of conventional correlation coefficients, which take values in the range (-1, 1). The numeric scale of Moran's I is related to its expected value, E(I), under a random spatial pattern. Values less than E(I) are typically associated with a uniform/dispersed pattern and values greater than E(I) typically indicate a clustered pattern. We adjusted Moran's I for agglomeration counts by comparing the observed count in an agglomeration with its expected value under the constant risk hypothesis [31].
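As a rough illustration of how Moran's I is compared with its expected value, the sketch below computes the classical (unadjusted) statistic for a binary neighbourhood matrix. It does not implement the constant-risk adjustment for counts used in the study, and the four-area example, its values and the function names are hypothetical.

```python
def morans_i(values, weights):
    """Classical Moran's I for area values and a spatial weight matrix."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(sum(row) for row in weights)  # sum of all weights
    cross = sum(
        weights[i][j] * dev[i] * dev[j] for i in range(n) for j in range(n)
    )
    denom = sum(d * d for d in dev)
    return (n / s0) * cross / denom


def expected_i(n):
    """Expected value of Moran's I under a random spatial pattern."""
    return -1.0 / (n - 1)


# Hypothetical example: four areas in a row, each adjacent to its neighbours
values = [1.0, 2.0, 3.0, 4.0]
weights = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
i_obs = morans_i(values, weights)
i_exp = expected_i(len(values))
clustered = i_obs > i_exp
```

Because neighbouring areas in this toy example have similar values, the observed statistic exceeds E(I), which is the pattern interpreted as clustering in the text.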
A graph of a semivariogram plotted against separation distance gives information about the geographical variability of the SIRs. If SIRs close together are more alike than those farther apart, the semivariogram increases as the separation distance (in kilometres) increases, reflecting decreasing spatial autocorrelation. The height of the jump of the semivariogram at the discontinuity at the origin is called the nugget. Often, the semivariogram will level off to a nearly constant value (called the sill) at a large separation distance (called the range). Beyond this distance, observations are spatially uncorrelated. To obtain a succinct statistical description of the spatial correlation in the data we fitted three different parametric models (exponential, Gaussian, and spherical) to the empirical semivariogram, each of which can be described in terms of nugget, partial sill and range parameters [32]. The model we considered most appropriate was the one that minimised the residual sum of squares between the theoretical model and the empirical semivariogram.
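The model-selection step can be sketched as follows. The three functions use one common parametrisation of the exponential, Gaussian and spherical semivariograms (other texts scale the range differently), the empirical values are hypothetical, and for brevity the parameters are held fixed rather than optimised separately for each model as a real fit would do.

```python
import math


def exponential(h, nugget, psill, rng):
    """Exponential semivariogram; zero at h = 0 by definition."""
    return 0.0 if h == 0 else nugget + psill * (1.0 - math.exp(-h / rng))


def gaussian(h, nugget, psill, rng):
    """Gaussian semivariogram."""
    return 0.0 if h == 0 else nugget + psill * (1.0 - math.exp(-((h / rng) ** 2)))


def spherical(h, nugget, psill, rng):
    """Spherical semivariogram; reaches the sill exactly at the range."""
    if h == 0:
        return 0.0
    if h >= rng:
        return nugget + psill
    r = h / rng
    return nugget + psill * (1.5 * r - 0.5 * r ** 3)


def rss(model, params, distances, empirical):
    """Residual sum of squares between a model and the empirical semivariogram."""
    return sum((model(h, *params) - g) ** 2 for h, g in zip(distances, empirical))


# Hypothetical empirical semivariogram (generated from a spherical model)
params = (0.1, 0.5, 40.0)  # nugget, partial sill, range in km
distances = [10.0, 20.0, 30.0, 40.0, 50.0]
empirical = [spherical(h, *params) for h in distances]

scores = {m.__name__: rss(m, params, distances, empirical)
          for m in (exponential, gaussian, spherical)}
best = min(scores, key=scores.get)
```

The model with the smallest residual sum of squares is retained, mirroring the selection rule stated above.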
Ecologic regression model incorporating spatial correlation
We fitted a multilevel Poisson regression model to the agglomeration counts. Writing μ_{ij} for the mean number of cases in the j^{th} agglomeration of the i^{th} ward, the model was

$$\log(\mu_{ij}) = \log(E_{ij}) + \beta_{0} + X^{SES}_{ij}\beta_{SES} + X^{diet}_{ij}\beta_{diet} + u_{ij},$$

where the offset term log(E_{ij}) was the (log of the) expected number of cases for the j^{th} agglomeration in the i^{th} ward (assumed fixed); X^{SES} and X^{diet} were that agglomeration's rows from the design matrices for the socioeconomic and dietary factors, respectively; β_{0} was the intercept; and β_{SES} and β_{diet} were vectors of coefficients describing associations with the socioeconomic and dietary factors, respectively [33]. Since SIR = μ_{ij}/E_{ij}, this is a model for agglomeration-level SIRs, with exp(β) interpretable as relative risk parameters within each agglomeration. Exploratory spatial data analysis showed evidence of both distance-based and neighbourhood-based geographical autocorrelation. To complete the model specification, we specified distance-based and neighbourhood-based correlation structures for the spatial random effects u_{ij}. We assumed that the vector of random effects followed the multivariate normal distribution MVN(0, Σ_{u}(θ)), with the elements of Σ_{u}(θ) defined as either conditional autoregressive (CAR) [34] or spatial point referenced (SPR) structures [35].
For the CAR-type model, we employed the intrinsic conditional autoregressive structure in which Σ_{u}(θ) = ρW, with W being a spatial proximity matrix containing binary connectivity elements.
For the SPR-type model, we assumed

$$\Sigma_{u}(\theta) = \sigma^{2} H(\Phi) + \tau^{2} I,$$

where H(.) is a correlation matrix depending on a parameter Φ and I is the identity matrix. Exponential, spherical and Gaussian semivariogram models were used to describe the elements of Σ_{u}(θ) as a function of nugget (τ^{2}), partial sill (σ^{2}), and range (Φ) parameters, with the parametric form determined by empirical semivariogram analysis.
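To make the distance-based structure concrete, the sketch below builds a covariance matrix from centroid coordinates using an exponential correlation function, with the partial sill scaling the correlation and the nugget added on the diagonal. This parametrisation is a common convention and an assumption here, not necessarily the exact form fitted by the authors; the coordinates and parameter values are hypothetical.

```python
import math


def exp_cov_matrix(coords, nugget, psill, rng):
    """Covariance matrix: psill * exp(-d / rng), plus the nugget on the diagonal."""
    n = len(coords)
    cov = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            d = math.dist(coords[i], coords[j])  # Euclidean distance
            cov[i][j] = psill * math.exp(-d / rng)
            if i == j:
                cov[i][j] += nugget
    return cov


# Hypothetical centroids (km) of three agglomerations along a line
coords = [(0.0, 0.0), (10.0, 0.0), (30.0, 0.0)]
cov = exp_cov_matrix(coords, nugget=0.1, psill=0.5, rng=20.0)
```

Agglomerations that are closer together receive a larger covariance, and each diagonal element equals the nugget plus the partial sill.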
Model comparison
The -2 log-likelihood and the two most commonly used penalised model selection criteria, the Bayesian information criterion (BIC) and Akaike's information criterion (AIC), were used for model comparison.
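Both criteria add a complexity penalty to the -2 log-likelihood, and smaller values indicate a better trade-off between fit and complexity. The sketch below uses the standard definitions; the -2 log-likelihood, the number of parameters `k` and the choice of `n` as the number of agglomerations are illustrative assumptions, and statistical packages may count parameters or observations differently.

```python
import math


def aic(minus2_loglik, k):
    """Akaike's information criterion: -2 log-likelihood + 2k."""
    return minus2_loglik + 2 * k


def bic(minus2_loglik, k, n):
    """Bayesian information criterion: -2 log-likelihood + k * ln(n)."""
    return minus2_loglik + k * math.log(n)


# Hypothetical fitted model: 3 estimated parameters, 152 units
m2ll = 400.0
model_aic = aic(m2ll, k=3)
model_bic = bic(m2ll, k=3, n=152)
```

Because ln(152) is larger than 2, BIC penalises each additional parameter more heavily than AIC in a setting of this size.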
Cartographic display
Software
SIR calculation was performed in Microsoft Excel, exploratory spatial analyses were performed using SAS's VARIOGRAM Procedure [36], factor analyses were conducted in SPSS 17 and the SAS Glimmix procedure was used to carry out MGLM regression [37, 38].
Results
Factor analysis
Dietary factors: Table 1 shows factor loadings of the 17 food group items on the two factors with eigenvalues greater than 0.1. The first dietary pattern, accounting for 13% of the variability, was characterized by high intake of foods generally thought to be preventive including vegetables, fruit, fish, and regular fibre, and was thus labelled "unrestricted food choice diet," whereas the second dietary pattern, accounting for 8% of the variability and labelled "restricted food choice diet," was characterized by high consumption of processed/salted meat, sweets, potatoes, soft drinks and low consumption of fish, fruit and vegetables.
Socioeconomic loadings from factor analysis (income, urbanisation and literacy)*. Rotated component matrix

| Item | Income | Urbanisation | Literacy |
|---|---|---|---|
| Annual income per family | .846 | | |
| Annual expenditure on food per family | .654 | .165 | |
| Annual expenditure on fruit and vegetables per family | .455 | .151 | |
| Population density | | .285 | |
| Relative level of activity | .318 | .221 | .533 |
| % of male unemployment | .321 | .679 | |
| % of employment in agriculture | .213 | .808 | |
| % of employment in industry | .199 | .341 | |
| % of employment in construction | .208 | | .470 |
| % of employment in services | .189 | .824 | .198 |
| Female illiteracy | | | .642 |
| Male illiteracy | | | .669 |
Exploratory analysis
Incidence rate, directly standardised incidence rates (per 100,000 person-years, using the 1970 and 2000 world populations) and Moran's I autocorrelation for esophageal and gastric cancers in Mazandaran and Golestan provinces of Iran

| Cancer type | Sex | No. of cases | Incidence rate | Standardised rate (1970 world population) | Standardised rate (2000 world population) | Moran's I* |
|---|---|---|---|---|---|---|
| Esophageal | Male | 891 | 8.10 | 12.16 | 14.61 | 0.28 |
| Esophageal | Female | 810 | 7.23 | 11.27 | 12.73 | 0.30 |
| Esophageal | Both sexes | 1693 | 7.67 | 11.72 | 13.71 | 0.22 |
| Gastric | Male | 1838 | 15.62 | 23.04 | 26.78 | 0.22 |
| Gastric | Female | 827 | 6.46 | 9.92 | 11.25 | 0.12 |
| Gastric | Both sexes | 2665 | 11.04 | 16.50 | 19.02 | 0.26 |
Ecologic regression
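Comparison of model goodness of fit using non-spatial Poisson regression and spatial Poisson models with conditional autoregressive (CAR) and spatial point referenced (SPR) autocorrelation structures

| Cancer | Model | Sex | -2 log-likelihood | AIC | BIC |
|---|---|---|---|---|---|
| Esophageal | Poisson regression with uncorrelated random effect | Female | 511.3 | 517.2 | 521.8 |
| Esophageal | Poisson regression with uncorrelated random effect | Male | 498.2 | 501.3 | 510.9 |
| Esophageal | Poisson regression with uncorrelated random effect | Both sexes | 453.2 | 455.5 | 458.4 |
| Esophageal | Spatial Poisson regression with Gaussian SPR correlation function | Female | 481.3 | 485.2 | 491.6 |
| Esophageal | Spatial Poisson regression with Gaussian SPR correlation function | Male | 453.7 | 455.0 | 460.3 |
| Esophageal | Spatial Poisson regression with Gaussian SPR correlation function | Both sexes | 446.5 | 448.5 | 451.5 |
| Esophageal | Spatial Poisson regression with CAR correlation function | Female | 470.9 | 476.9 | 485.9 |
| Esophageal | Spatial Poisson regression with CAR correlation function | Male | 411.5 | 417.5 | 426.5 |
| Esophageal | Spatial Poisson regression with CAR correlation function | Both sexes | 332.0 | 338.0 | 347.0 |
| Gastric | Poisson regression with uncorrelated random effect | Female | 560.5 | 566.5 | 575.5 |
| Gastric | Poisson regression with uncorrelated random effect | Male | 485.2 | 488.0 | 491.2 |
| Gastric | Poisson regression with uncorrelated random effect | Both sexes | 463.4 | 468.3 | 471.8 |
| Gastric | Spatial Poisson regression with Gaussian SPR correlation function | Female | 483.2 | 488.0 | 491.1 |
| Gastric | Spatial Poisson regression with Gaussian SPR correlation function | Male | 469.1 | 471.0 | 580.5 |
| Gastric | Spatial Poisson regression with Gaussian SPR correlation function | Both sexes | 447.8 | 453.8 | 462.8 |
| Gastric | Spatial Poisson regression with CAR correlation function | Female | 511.3 | 519.6 | 523.4 |
| Gastric | Spatial Poisson regression with CAR correlation function | Male | 467.0 | 473.0 | 482.1 |
| Gastric | Spatial Poisson regression with CAR correlation function | Both sexes | 384.0 | 386.0 | 389.0 |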
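Parameter estimation for SES and dietary patterns (RR: risk ratio; CI: confidence interval)

| Sex | Factor | EC RR | EC 95% CI lower | EC 95% CI upper | EC P-value | GC RR | GC 95% CI lower | GC 95% CI upper | GC P-value |
|---|---|---|---|---|---|---|---|---|---|
| Female | Unrestricted food choice* | 0.91 | 0.84 | 0.99 | 0.04 | 0.89 | 0.67 | 1.18 | 0.42 |
| Female | Restricted food choice* | 1.27 | 1.13 | 1.44 | <0.001 | 1.08 | 0.87 | 1.34 | 0.49 |
| Female | Income | 0.78 | 0.68 | 0.90 | <0.001 | 0.66 | 0.56 | 0.77 | <0.001 |
| Female | Urbanisation | 0.71 | 0.62 | 0.81 | <0.001 | 0.86 | 0.76 | 0.97 | 0.02 |
| Female | Literacy | 0.94 | 0.84 | 1.06 | 0.25 | 0.98 | 0.86 | 1.11 | 0.83 |
| Male | Unrestricted food choice* | 0.75 | 0.68 | 0.82 | <0.001 | 0.91 | 0.64 | 1.31 | 0.31 |
| Male | Restricted food choice* | 1.15 | 1.05 | 1.26 | 0.004 | 1.09 | 0.91 | 1.31 | 0.33 |
| Male | Income | 0.85 | 0.78 | 0.93 | 0.009 | 0.75 | 0.65 | 0.87 | <0.01 |
| Male | Urbanisation | 0.76 | 0.69 | 0.83 | <0.001 | 0.90 | 0.75 | 0.91 | 0.02 |
| Male | Literacy | 0.94 | 0.79 | 1.13 | 0.84 | 0.95 | 0.78 | 1.16 | 0.30 |
| Both sexes | Unrestricted food choice* | 0.81 | 0.75 | 0.88 | <0.001 | 0.92 | 0.84 | 1.00 | 0.05 |
| Both sexes | Restricted food choice* | 1.36 | 1.24 | 1.49 | <0.001 | 1.08 | 0.97 | 1.20 | 0.09 |
| Both sexes | Income | 0.86 | 0.80 | 0.93 | <0.001 | 0.92 | 0.83 | 1.03 | 0.06 |
| Both sexes | Urbanisation | 0.83 | 0.77 | 0.89 | <0.001 | 0.73 | 0.68 | 0.84 | <0.001 |
| Both sexes | Literacy | 0.96 | 0.89 | 1.04 | 0.32 | 0.88 | 0.76 | 1.01 | 0.08 |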
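Because the regression model is log-linear, each risk ratio is the exponential of a fitted coefficient. The sketch below shows that conversion together with a Wald-type 95% confidence interval; the use of a Wald interval, the coefficient and its standard error are assumptions for illustration and are not taken from the study output.

```python
import math


def rr_with_ci(beta, se, z=1.96):
    """Risk ratio exp(beta) with a Wald-type interval exp(beta -/+ z * se)."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)


# Hypothetical coefficient for a one-unit increase in a factor score
rr, lower, upper = rr_with_ci(beta=-0.25, se=0.07)
protective = upper < 1.0  # whole interval below 1 suggests lower risk
```

A risk ratio below 1 with an interval that excludes 1 corresponds to the inverse associations reported for factors such as income and urbanisation.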
Discussion
In this ecologic study we observed statistically significant associations between agglomeration-specific EC and GC SIRs and SES and dietary patterns. We hypothesised that the strong geographical EC and GC risk patterns highlighted in previous studies [3, 5] could be explained by the existence of important geographical differences in the prevalence of two well-established and modifiable risk factors (SES and dietary pattern).
Two dietary patterns were identified, "restricted food choice" and "unrestricted food choice", which together explained approximately 21% of the variance in responses to the FFQ. The unrestricted food choice pattern was positively correlated with total fruit, total vegetables, seafood, poultry and regular fibre, and negatively correlated with sweets. This dietary pattern was associated with a lower risk of EC in men, in women and in both sexes combined. The restricted food choice pattern was negatively correlated with total fruit and regular fibre, positively correlated with salted and preserved foods, and had very small factor loadings on total vegetables, seafood and poultry. This dietary pattern was associated with a higher risk of EC in men, in women and in both sexes combined. Low intake of fruit and vegetables has been consistently associated with higher risk of EC, with a meta-analysis suggesting that protective effects were more pronounced for fruit than for vegetables [9]. Families in the regions of high EC incidence in our study reported very limited intake of fruit and vegetables relative to families in the low-incidence areas, consistent with a case-control study in the region which showed that a higher intake of raw vegetables reduced the risk of esophageal cancer by 40-50% [39].
The restricted food choice pattern was linked with increased GC risk in both sexes combined. We also found that a high intake of salted/preserved meat, canned fish and pickles was associated with increased GC risk in both sexes combined.
A link between certain demographic and economic features of regions and the risk for EC and GC has been shown in several studies [7, 8]. The socioeconomic variables used in our study enabled three such indices to be studied: income, urbanisation and literacy. We found higher incidences of EC and GC in men and/or women were related to lower annual income, lower annual expenditure on food, lower annual expenditure on fruit and vegetables, higher percentage of unemployment, and higher percentage of employment in agriculture and construction sectors. Both cancer sites analysed in this study had higher SIR in the rural setting. This association may be related to lower SES, higher unemployment and high levels of farming in rural agglomerations.
In our study, expenditure on food in general and expenditure on fruit and vegetables had large positive factor loadings on the income and urbanisation indices. In addition, income and urbanisation indices were positively correlated with unrestricted food pattern and negatively correlated with restricted food pattern. This correlation was stronger in the eastern region, especially in the Turkmen plain. Therefore, lower SES was linked to a diet deficient in fruit and vegetables in rural agglomerations, which is an important risk factor for EC and GC. An increased risk of gastric cancer associated with agricultural occupations has been consistently reported, and exposure to pesticides, organic and inorganic dusts, fertilizers, and nitrates has been suggested as the major contributing risk factors [40–42]. There is no Pesticide Register in Iran to compile information on the use of these products. As a result, specific ecological indicators cannot be used to measure the populations' exposure to pesticides. Consequently, the percentage in agricultural occupations, where pesticide exposure could be assumed to be higher, and the urbanisation score were used as indirect indicators of the use of pesticides in agglomerations. We found a significant negative association between EC and GC risk and urbanisation score.
Some details of our study methods require discussion. First, the exact timing of SES and diet-related exposures and cancer occurrence is important for our study. The lag time between risk factor exposure and EC and GC development was ascertained in 3 large prospective cohort studies involving more than half a million men and women [43–45]. In these prospective cohort studies a lag time of 6 to 12 years was long enough for the development of EC and GC in healthy participants and, more importantly, to find a significant association between SES and dietary exposures and EC and GC occurrence. Our study had an average lag time of 10 years, with a range of 6-12 years, between exposure measurements (1993-1996) and outcomes (2001-2005), which is consistent with these findings.
Second, could human migration in the study region have caused enough selection bias to influence the result? It is known that external migrants to the study region have a lower incidence of EC than, and a similar incidence of GC to, the national rate [46]. Between the 1995 and 2005 censuses 556,455 people (on average 1.4% per annum of the study population) migrated to the study region. Most immigrants (83%) were healthy labour force participants and their younger relatives, explaining the lower cancer rates of migrants. However, external migration from other provinces, occurring mainly to the major cities of the study region, accounted for only 29% of total migration, with internal migration accounting for the remainder. It seems unlikely that these modest migration figures would strongly influence the observed associations.
Third, controls from a local case-control study were used to identify dietary patterns. The number of controls per ward ranged from 26 in the sparsely populated ward of Bandar Gaz to >250 for wards with major cities like Babol [24]. To detect any selection bias due to differing coverage across wards or between urban and rural areas, we compared the age, place of residence (urban/rural), sex and ward distributions of the controls with those of incident EC and GC cases for the 2003 to 2006 period. There was no significant difference in these demographic characteristics between controls from the case-control study and cases on the registry. About one third of the controls were selected from among neighbours of the cases in the case-control study. This mechanism of control selection possibly produced a non-random representation of dietary habits in wards, which may dilute the association between EC and GC and dietary patterns.
Fourth, in this study SES and dietary pattern scores were used as markers of the heterogeneous distribution of lifestyle and dietary factors influencing EC and GC risk. Selection of these variables was limited by the availability of information at agglomeration or ward level, so they only partially reflect the distribution of related risk factors. However, their inclusion served to smooth SIR, taking into account both the spatial relation among agglomerations and the variability associated with these indices.
Fifth, justification of sample size is necessary. For factor analysis, five subjects per item, with a minimum of 100 subjects regardless of the number of items, is recommended as a sufficient sample size [47]. There were 17 food items and 2322 subjects in the dietary pattern analysis, and 12 socioeconomic items and 152 units (agglomerations) in the SES factor analysis, so both analyses met the minimum sample size criteria. To the best of our knowledge no study has focused on sample size and robustness issues in multilevel Poisson regression in a comprehensive manner. However, results from a simulation study suggest that for generalised linear mixed models with low-prevalence events, a minimum of 100 groups and 30 to 50 individuals per group are necessary [48]. Our study contained 152 groups (agglomerations), with a mean of 11 cases for EC and 16 for GC. While the number of groups was large enough for accurate estimation of regression parameters, the small sample size within agglomerations suggests possible bias in the second-level standard errors.
Ecologic studies are perhaps best considered to be hypothesis generating, although small-area analysis tends to reduce ecological fallacy, since the populations defined by agglomeration boundaries are more homogeneous. While this might well be true of villages and towns of average size, in large cities this may not be so. However, the results reported here correspond to an overall mean, and socioeconomic and dietary pattern differences inside cities have been disregarded. It would be interesting to extend our work by assessing whether such differences exist in major cities, such as Sari, Ghaemshahr and Gorgan.
Conclusion
Multilevel spatial modelling revealed associations between EC and GC incidence and SES and dietary indices. High EC and GC incidence and low SES scores often coincided in rural areas. Higher prevalence of restricted food choice was associated with higher EC incidence in the eastern agglomerations, especially in the Turkmen plain. Our study revealed that there were systematic geographical variations in EC and GC SIRs across the Caspian region, and particularly an elevated risk in contiguous high-risk eastern areas. Further studies targeted to specific regions could help to identify the risk factors that may contribute to the geographical patterns in EC and GC SIRs identified here.
Abbreviations used
AIC: Akaike's information criterion
BIC: Bayesian information criterion
CAR: conditional autoregressive
EC: esophageal cancer
FFQ: food frequency questionnaire
GC: gastric cancer
MGLM: multilevel generalised linear model
RR: risk ratio
SES: socioeconomic status
SIR: standardised incidence ratio
SPR: spatial point referenced
Declarations
Acknowledgements
We would like to thank the survey team and colleagues of the Babol Cancer Registry.
Authors’ Affiliations
References
 Mahboubi E, Kmet J, Cook P, Day N, Ghadirian P, Salmasizadeh S: Oesophageal cancer studies in the Caspian Littoral of Iran:the caspian cancer registry. British Journal of Cancer. 1973, 28: 197214. 10.1038/bjc.1973.138.PubMed CentralView ArticlePubMed
 Saidi F, Sepehr A, Fahimi S, Farahvash MJ, Salehian P, Esmailzadeh A, Keshoofy M, Pirmoazen N, Yazdanbod M, Roshan MK: Oesophageal cancer among the Turkomans of northeast Iran. British Journal of Cancer. 2000, 83: 12491254. 10.1054/bjoc.2000.1414.PubMed CentralView ArticlePubMed
 Joint Iran and IARC Study Group: Esophageal cancer studies in the Caspian littoral of Iran: Results of population studies: A prodrome. Journal of National Cancer Institute. 1977, 54: 11271138.
 Semnani S, Sadjadi A, Fahimi S, Nouraie M, Naeimi M, Kabir J, Fakheri H, Saadatnia H, Ghavamnasiri MRRM: Declining incidence of esophageal cancer in the Turkmen Plain, eastern part of the Caspian Littoral of Iran: a retrospective cancer surveillance. Cancer Detection and Prevention. 2006, 30: 1419. 10.1016/j.cdp.2005.11.002.View ArticlePubMed
 Mohebbi M, Mahmoodi M, Wolfe R, Nourijelyani K, Mohammad K, Zeraati H, Fotouhi A: Geographical spread of gastrointestinal tract cancer incidence in the Caspian Sea region of Iran: spatial analysis of cancer registry data. BMC Cancer. 2008, 8: 13710.1186/147124078137.PubMed CentralView ArticlePubMed
 Iran statistical yearbook. 2000, Tehran: Statistical Center of Iran
 Kogevinas M, Pearce N, Susser M, Boffetta P, (Eds.): Social inequalities and cancer. 1997, Lyon: IARC
 Mackenbach J, Bakker M, Kunst A, Diderichsen F: Socioeconomic inequalities in health in Europe: An overview. Reducing inequalities in health: a European perspective. Edited by: Mackenback J, Bakker M. 2002, London and. New York: Routledge, 324.
 Riboli E, Norat T: Epidemiologic evidence of the protective effect of fruit and vegetables on cancer risk. American Journal of Clinical Nutrition. 2003, 78: 559S569S.PubMed
 Lunet N, LacerdaVieira A, Barros H: Fruit and vegetables consumption and gastric cancer: a systematic review and metaanalysis of cohort studies. Nutrition & Cancer. 2005, 53: 110. 10.1207/s15327914nc5301_1.View Article
 Lunet N, Valbuena C, Vieira AL, Lopes C, Lopes C, David L, Carneiro F, Barros H: Fruit and vegetable consumption and gastric cancer by location and histological type: casecontrol and metaanalysis. European Journal of Cancer Prevention. 2007, 16: 312327. 10.1097/01.cej.0000236255.95769.22.View ArticlePubMed
 Terry P, Terry JB, Wolk A: Fruit and vegetable consumption in the prevention of cancer: an update. Journal of Internal Medicine. 2001, 250: 280290. 10.1046/j.13652796.2001.00886.x.View ArticlePubMed
 Chen H, Ward MH, Graubard BI, Heineman EF, Markin RM, Potischman NA, Russell RM, Weisenburger DD, Tucker KL: Dietary patterns and adenocarcinoma of the esophagus and distal stomach.[see comment]. American Journal of Clinical Nutrition. 2002, 75: 137144.PubMed
 Campbell PT, Sloan M, Kreiger N: Dietary patterns and risk of incident gastric adenocarcinoma. American Journal of Epidemiology. 2008, 167: 295304. 10.1093/aje/kwm294.View ArticlePubMed
 Bahmanyar S, Ye W: Dietary patterns and risk of squamouscell carcinoma and adenocarcinoma of the esophagus and adenocarcinoma of the gastric cardia: a populationbased casecontrol study in Sweden. Nutrition & Cancer. 2006, 54: 171178. 10.1207/s15327914nc5402_3.View Article
 Reconstruction and estimation of Golestan province population according to 2000 geographic boundaries. 2003, Tehran: Statistical Center of Iran
 Reconstruction and estimation of Mazandaran province population according to 2000 geographic boundaries. 2003, Tehran: Statistical Center of Iran
 Mohebbi M, Nourijelyani K, Mahmoudi M, Mohammad K, Zeraati H, Fotouhi A, Moghadaszadeh B: Time of Occurrence and Age Distribution of Digestive Tract Cancers in Northern Iran. Iranian Journal of Public Health. 2008, 37: 8-19.
 Fritz A, Percy C, Jack A, Shanmugaratnam K, Sobin L, Parkin D: International classification of diseases for oncology. 2000, Geneva: World Health Organization, 3rd edn.
 Annual Report of Babol Health Research Station (2000). 2000, Tehran: Tehran Medical University
 Income and expenses survey in rural families in 1995. 1996, Tehran: Statistical Center of Iran
 Income and expenses survey in urban families in 1995. 1996, Tehran: Statistical Center of Iran
 Alaeddini F, Holakuei K, Mahmoudi M, Siyasi F, Nadim A: Esophageal cancer and type of food and beverage consumption. Archives of Iranian Medicine. 2001, 4: 197-200.
 Annual Report of Babol Health Research Station (1999). 1999, Tehran: Tehran Medical University
 Anderson T, (Ed.): An introduction to multivariate statistical analysis. 1984, New York: John Wiley & Sons
 Esteve J, Benhamou E, Raymond L: Descriptive Epidemiology. 1994, Lyon: IARC Scientific Publication
 Segi M: Cancer mortality for selected sites in 24 countries (1950-1957). 1960, Sendai: Department of Public Health, Tohoku University of Medicine
 Ahmad O, Boschi-Pinto C, Lopez A, Murray C, Lozano R, Inoue M: Age standardization of rates: a new WHO standard. GPE Discussion Paper Series: no. 31. 2000, World Health Organization
 Moran P: Notes on continuous stochastic phenomena. Biometrika. 1950, 37: 17-23.
 Cressie N: Statistics for spatial data. 1993, New York: Wiley and Sons
 Walter SD: The analysis of regional patterns in health data: I. Distributional considerations. American Journal of Epidemiology. 1992, 136: 730-741.
 Cressie N: Statistics for Spatial Data, rev. edn. 1993, New York: Wiley
 Langford IH, Leyland AH, Rasbash J, Goldstein H: Multilevel modelling of the geographical distributions of diseases. Journal of the Royal Statistical Society Series C, Applied Statistics. 1999, 48: 253-268. 10.1111/1467-9876.00153.
 Clayton D, Kaldor J: Empirical Bayes estimates of age-standardized relative risks for use in disease mapping. Biometrics. 1987, 43: 671-681. 10.2307/2532003.
 Zimmerman DL, Harville DA: A Random Field Approach to the Analysis of Field-Plot Experiments and Other Spatial Experiments. Biometrics. 1991, 47: 223-239. 10.2307/2532508.
 SAS/STAT 9.2 User's Guide, Chapter 95: The VARIOGRAM Procedure. 2008, SAS Publishing
 Littell R, Milliken G, Stroup W, Wolfinger R: SAS system for mixed models; Chapter 11: Spatial Variability. 2006, Cary, NC: SAS Institute, Inc, 2nd edn.
 Rasmussen S: Modelling of discrete spatial variation in epidemiology with SAS using GLIMMIX. Computer Methods and Programs in Biomedicine. 2004, 76: 83-89. 10.1016/j.cmpb.2004.03.003.
 Cook-Mozaffari P, Azordegan F, Day N, Ressicaud A, Sabai C, Aramesh B: Oesophageal cancer studies in the Caspian littoral of Iran: results of a case-control study. British Journal of Cancer. 1979, 39: 293-309. 10.1038/bjc.1979.54.
 Ji J, Hemminki K: Socioeconomic and occupational risk factors for gastric cancer: a cohort study in Sweden. European Journal of Cancer Prevention. 2006, 15: 391-397. 10.1097/00008469-200610000-00003.
 Lee WJ, Son M, Chun BC, Park ES, Lee HK, Coble J, Dosemeci M: Cancer mortality and farming in South Korea: an ecologic study. Cancer Causes & Control. 2008, 19: 505-513. 10.1007/s10552-008-9112-2.
 Ocana-Riola R, Sanchez-Cantalejo C, Rosell J, Sanchez-Cantalejo E, Daponte A: Socioeconomic level, farming activities and risk of cancer in small areas of Southern Spain. European Journal of Epidemiology. 2004, 19: 643-650. 10.1023/B:EJEP.0000036808.26094.43.
 Bingham S, Riboli E: Diet and cancer - the European prospective investigation into cancer and nutrition. Nature Reviews Cancer. 2004, 4: 206-215. 10.1038/nrc1298.
 Larsson S, Bergkvist L, Wolk A: Fruit and vegetable consumption and incidence of gastric cancer: A prospective study. Cancer Epidemiology, Biomarkers & Prevention. 2006, 15: 1998-2001. 10.1158/1055-9965.EPI-06-0402.
 Tsugane S, Sasazuki S, Kobayashi M, Sasaki S, JPHC Study Group: Salt and salted food intake and subsequent risk of gastric cancer among middle-aged Japanese men and women. British Journal of Cancer. 2004, 90: 128-134. 10.1038/sj.bjc.6601511.
 Sadjadi A, Nouraie M, Mohagheghi MA, Mousavi-Jarrahi A, Malekzadeh R, Parkin DM: Cancer occurrence in Iran in 2002, an International perspective. Asian Pac J Cancer Prev. 2005, 6: 359-363.
 Gorsuch RL: Factor analysis. 1983, Hillsdale, NJ: Lawrence Erlbaum, 2nd edn.
 Moineddin R, Matheson F, Glazier R: A simulation study of sample size for multilevel logistic regression models. BMC Medical Research Methodology. 2007, 7: 34. 10.1186/1471-2288-7-34.
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.