Research | Open | Published:
Heterogeneity in mammography use across the nation: separating evidence of disparities from the disproportionate effects of geography
International Journal of Health Geographicsvolume 7, Article number: 32 (2008)
Mammography is essential for early detection of breast cancer and both reduced morbidity and increased survival among breast cancer victims. Utilization is lower than national guidelines, and evidence of a recent decline in mammography use has sparked concern. We demonstrate that regression models estimated over pooled samples of heterogeneous states may provide misleading information regarding predictors of health care utilization and that comprehensive cancer control efforts should focus on understanding these differences and underlying causal factors. Our study population includes all women over age 64 with breast cancer in the Surveillance Epidemiology and End Results (SEER) cancer registries, linked to a nationally representative 5% reference sample of Medicare-eligible women located in 11 states that span all census regions and are heterogeneous in racial and ethnic mix. Combining women with and without cancer in the sample allows assessment of previous cancer diagnosis on propensity to use mammography. Our conceptual model recognizes the interplay between individual, social, cultural, and physical environments along the pathways to health care utilization, while delineating local and more distant levels of influence among contextual variables. In regression modeling, we assess individual-level effects, direct effects of contextual factors, and interaction effects between individual and contextual factors.
Pooling all women across states leads to quite different conclusions than state-specific models. Commuter intensity, community acculturation, and community elderly impoverishment have significant direct impacts on mammography use which vary across states. Minorities living in isolated enclaves with others of the same race/ethnicity may be either advantaged or disadvantaged, depending upon the place studied.
Careful analysis of place-specific context is essential for understanding differences across communities stemming from different causal factors. Optimal policy interventions to change behavior (improve screening rates) will be as heterogeneous as local community characteristics, so no "one size fits all" policy can improve population health. Probability modeling with correction for clustering of individuals within multilevel contexts can reveal important differences from place to place and identify key factors to inform targeting of specific communities for further study.
Mammography is essential for early detection of breast cancer and both reduced morbidity and increased survival among breast cancer victims . A lot of attention has been paid to low mammography use rates, and evidence of a decline in national rates over 2000–2005 has sparked renewed concern . Evidence of persistent disparities in mammography use among women of different races and ethnicities is abundant . A promising trend in public health research toward more multidisciplinary and multi-institutional research has predicated a new interest in examining community-level determinants of health disparities. Also, there is increasing recognition that health disparities may vary widely from place to place, such that the effects of place may be difficult to disentangle from social or cultural determinants of health [4–9].
Chandra and Skinner  argue that several factors are at work to confound the problem: there is considerable variation in health care utilization and outcomes across regions, minorities may use different providers than whites, and racial disparities may be higher in some areas. Together, these conditions create strong statistical interactions between geography and racial or ethnic identity, a fact that may lead researchers to falsely diagnose geographic variations as the determinant of racial disparities. Virnig et al.  show that disparities in several health care quality measures are wider across geographic regions than they are within the regions. Similarly, Coughlin et al.  contrast Southern counties with other counties in the United States and find that racial disparities in cancer screening are wider across counties than they are within them.
Relative homogeneity in socioeconomic conditions and health outcomes among people within regions contrasted with disparities in these things across regions suggests that spatial heterogeneity in such things as beliefs, practices, and resources may be a causal factor driving the observed disparities across regions. Probst et al.  argue that, because minorities tend to be concentrated more heavily in certain rural regions of the country, contextual factors that have impacted resource availability in those regions may produce worse health outcomes for all residents in those places. In this regard, Slifkin et al.  compare the health status of urban and rural minorities and find that several health status measures exhibit wide disparity between urban and rural minorities, specifically cancer screening and management, cardiovascular disease, and diabetes.
A better understanding of the socio-ecological factors impacting health outcomes is crucial, because a better balance of both the medical and nonmedical determinants of health is required to achieve "optimal" health outcomes. This theme is central to Smedley's recent commentary  regarding the necessity to focus on social and economic systems if we are to truly understand (and eradicate) health disparities. This literature supports the notion that place-specific resources, not minority status per se, may be driving some of the observed national health disparities statistics.
Clearly, disparities in health outcomes across races and ethnicities is a complex phenomenon with many determinants. Mervyn Susser  argues that traditional risk factor epidemiology has focused on a single level of analysis (the person or population) while ignoring social structures and dynamics that link individuals. Susser and Susser  advocate "eco-epidemiology" as a useful new paradigm for modern epidemiological research. This paradigm views the individual as existing within a set of nested constructs, where each level is part of a broader system and interacts with those above and below it. The eco-epidemiological paradigm has been embraced in public health research through multilevel modeling [13–25].
Multilevel modeling approaches have evolved over time from basic approaches using fixed effects to model higher-level data structures (in the absence of higher-order contextual data), to intermediate approaches that account for the redundancy in information from repeated higher-level contextual measures. Failure to account for the redundancy in information (i.e., the repeated county-level or PCSA-level variables for every woman in the county or PCSA) biases down the standard errors of the higher-order (county, PCSA) effect estimates. More complex approaches model the random effects of missing variables and covariances between individual-level effect parameters and higher-order (ecological) data structures . There are two basic types of ecological effects: (1) a direct effect of a community-level variable on an individual-level health outcome; and (2) effect modification or interaction, whereby a community characteristic modifies the effect of an individual characteristic on an individual-level health outcome. In this paper, we investigate both types of effects using the intermediate approach to multilevel modeling, which includes many contextal measures at higher orders directly in the model and accounts for redundancies in these measures across individuals in areas by adjusting the standard errors using generalized estimating equations (GEE) clustering correction methods.
Public health researchers refer to area aggregates based on individual-level attributes reflecting characteristics of clients in an area as "collective" or "compositional" variables, to distinguish them from other ecological variables classified more broadly as "contextual" effects reflecting the nature of the physical or social environment [4, 23, 25, 26]. Most public health research has included only collective effects in multilevel models, rather than contextual effects reflecting the broader political, cultural, social, or institutional expressions that affect access to and allocation of resources and opportunity [4, 26, 27]. For example, Litaker and Tomolo  and Litaker et al.  use an intermediate multilevel model like ours to model direct contextual effects and interactions between a woman's income and average income in her community on mammography use in Ohio. The studies by Litaker et al. are the only multilevel modeling studies we know of that attempt to estimate interaction effects for mammography use, and they find no significant effects. The limited geography or spatial homogeneity of ecological factors in Ohio may have impacted the significance of Litaker et al.'s findings.
The main objective of this paper is to use a multilevel modeling approach with a binary probability model of mammography use to examine various factors affecting mammography use in a large sample of women across 11 heterogeneous regions of the United States (Table 1). Our multilevel model is based on a carefully developed theoretical model (described below) of the ecological environment for mammography use. We posit that spatial heterogeneity in a variety of contextual factors, operating at different levels of influence, can help explain observed disparities within and across regions. We include a broad range of factors that include traditional access and health system supply variables, population/demand variables, and a rich set of socio-ecological variables describing other aspects of the community–besides health system and medical aspects–that are important correlates of observed behavior.
Our conceptual model is a hybrid developed from several models from the behavioral health, socio-ecological, and health geography fields, as fully described elsewhere . This spatial-interaction model conceptualizes the interplay between individual, social, and physical environments while delineating individual, local, and more distant levels of influence among compositional and contextual variables. The idea that the community factors influence human behavior is not new, but the explicit consideration of "what is the relevant zone of influence?" for ecological variables has only recently begun to appear in the literature [18, 20, 27, 30, 31].
The conceptual model (Figure 1) positions the individual as making utilization choices (the final outcome) in a market context that has differentiated levels of influence for different classes of contextual variables and guides the selection of variables to be included in the analysis. The model also suggests the appropriate level of aggregation for compositional and contextual factors through its classification of the levels of influence. Some of the classification is derived from a synthesis of the body of literature cited herein; however, not all aspects we model have been considered in previous studies. The multilevel data provided by the authors as public use files should foster additional research in this fruitful area.
Individual characteristics are differentiated into the enabling, predisposing, and need constructs from the traditional Aday-Andersen behavioral health model . The large box surrounding the entire figure represents the Fundamental, or macro-level, factors such as regulations, public policy, or media campaigns, represented at the state level in the hierarchy. These impinge upon the Intermediate community-level factors, such as characteristics of health care systems, land use patterns and development, crime, and housing conditions. Intermediate factors in turn impinge upon the Interpersonal neighborhood-level factors, such as social support, cultural cohesion, driver courtesy, and social capital. Interpersonal factors in turn impinge upon the Individual, who decides whether or not to use mammography.
We define the Intermediate, community-level factors at the county level, as these are the political units defined to manage the finances associated with community services. The Interpersonal, neighborhood-level factors should be defined at a smaller resolution than the community factors, but there are no guiding principles for defining these areal units from the literature. We developed data at various scales and thereby had the option to use either ZCTA or PCSA areal units to measure the Interpersonal factors. ZCTAs are U.S. Census ZIP code tabulation areas used to approximate the delivery area for a U.S. Postal Service five-digit ZIP code or collections of ZIP codes in urban areas, or three-digit ZIP codes in rural areas. Census 2000 long-form and population data are tabulated by the Census for these areal units. PCSAs are Primary Care Service Areas defined by Dartmouth College researchers for the Health Resources and Services Administration (HRSA), based on Medicare fee-for-service (FFS) patients' flows from home address to primary care physician offices, using ZIP code address of person . In previous work, we found that using either ZCTAs or PCSAs as the areal units for the Interpersonal factors performed comparably . We chose here to measure the Interpersonal factors at the PCSA unit, which has a natural preventive care market interpretation and is composed of one or more ZCTAs.
The variables used in this work and their sources are described in Table 2, which is divided into three sections corresponding to the conceptual model: Individual characteristics (categorized into enabling, predisposing, and need categories), Interpersonal (local neighborhood) factors, Intermediate (larger community) factors, and Fundamental (state level) factors. Table 3 gives sample statistics for women by state, and Table 4 gives sample statistics for the PCSA and county areas by state. We provide simple means, standard deviations, and the number of observations for each level of data (person, PCSA, county). If desired, the reader can convert the standard deviations to standard errors by dividing by the square root of sample size.
Hypothesized associations between factors and mammography use
An important enabling characteristic is type of health coverage. All women in the sample are well-insured with traditional FFS Medicare Parts A and B insurance, which allow free choice of provider and mammography facility. Some women have additional coverage through a variety of state Medicaid programs for low-income or disabled elderly, which makes them dually eligible for Medicare and Medicaid insurance (which covers the Part B premium). We do not have information on individual income, but we assume that the dually eligible are lower-income than others in their state. Although there may be additional resources associated with dual eligibility, we hypothesize that disability and dual eligibility status are disabling characteristics, because physical limitations and poverty present additional burdens to care-seeking behavior. Shorter distance to the closest mammography facility is seen as an enabling characteristic. Another characteristic is recent address change–we hypothesize that moving is disruptive and a disabling characteristic. Predisposing factors included in the model are Medicare health maintenance organization (HMO) coverage in the 2 years prior to the study period. A recent mammogram might have been obtained under the HMO prior to joining FFS, which could lower the probability of utilization in 2002–2003. Also included are age and race or ethnicity. We include cancer diagnosis and utilization of flu shots as indicators of need. Individuals with a previous cancer diagnosis are more likely to experience another cancer, and mammography is used in the course of treatment as a diagnostic. Those utilizing flu shots are considered to have stronger health-seeking behavior.
These include local neighborhood characteristics that impact one's perception of risk or information about health care through interactions with neighbors that shape opinions and beliefs. Communities may also provide support–both physical and psychological–for health-seeking behaviors. While residential segregation is often viewed as a harmful Fundamental factor–because it can influence the distribution of wealth, opportunity, and political influence toward the majority in the state–in the local neighborhood, residential segregation may impact social integration and support. We use Massey and Denton's  isolation index as our residential segregation measure, defined separately for each race or ethnicity relative to whites. These indices by race or ethnicity reflect the propensity for the minorities to come into contact with whites in residential neighborhoods. The index for a specific minority group ranges from 0 to 1, where higher values for the index reflects greater segregation among the minority from the white population. We hypothesize that the index may have positive impacts for some groups and negative impacts for others, because residential segregation effects have exhibited varied findings by race and ethnicity in the literature [35–38].
Two compositional variables are included to reflect social or cultural cohesion: the proportion of community members who have recently immigrated into the United States and the proportion of elderly community members with little or no English language ability. Both of these variables might reduce cohesion and the probability of mammography use. Several stressor variables are included for each woman's local community: commuter intensity, elderly women in poverty, and elderly women living alone. Inter-driver courtesy, which in our real-life experience decreases in communities with high commuter intensity, might affect the difficulty experienced by elderly who drive or for their caregivers who drive them. Areas with greater commuter intensity have been found to exhibit lower access to preventive care services among the elderly ; thus, we hypothesize that commuter intensity will reduce mammography use. We hypothesize that areas with higher proportions of elderly women living in poverty or living alone will exhibit lower mammography use rates due to lower social and material support and that women living in such areas will exhibit lower probability of utilization.
Intermediate community factors
At the wider community level are social context and physical environment factors that are affected by Fundamental resources that shape the infrastructure supporting community life . Among these are characteristics of the health care system, such as physician shortage, facility density and proximity, and managed care climate. Managed care penetration in an area can change the way medicine is practiced, with spillover effects on FFS Medicare patients , so we hypothesize that women living in areas with greater managed care penetration may exhibit different probabilities of mammography use. Use would be higher if area attitudes among seniors regarding prevention were enhanced.
Availability of primary care physicians, medical oncologists, and nurses might increase the probability of mammography use. Women living in areas with primary care physician shortages might lower the probability of use. Primary care physician shortage is indicated using HRSA's measure at the county level. An alternative measure of physician availability is the ratio of International Medical Graduates (IMGs)–physicians of foreign origin who train in the United States–to native U.S.-born physicians. One study found that IMGs have disproportionately located in U.S. counties of greatest need, compared with native medical graduates, which reflects successful efforts through the J-1 visa waiver program .
We hypothesize that women living in counties with higher violent crime rates will be less likely to use mammography. There is considerable research examining the link between crime/disorder and fear  and evidence that fear may be limiting women's movement around their environments , especially for older women .
We include a land use mix index in the model to differentiate between sprawling suburbia, rural places, and mixed inner-city environments. The measure is an entropy index defined over the proportion of land in several different uses, at the 30-meter square level of resolution. The landmix measure is lower when there is more homogeneous use of land, so rural places and sprawling suburban housing developments have low values, and more urban areas with mixtures of homes and businesses have higher values. Because the more mixed environments are more urban in these data, we hypothesize that there will be a negative association between our landmix measure and probability of use if urban congestion impedes travel or reduces the desire to travel for care or if less congested rural settings were associated with improved probability of use.
Data and study sample
Our study population includes all women over age 64 with a breast cancer diagnosis in the Surveillance Epidemiology and End Results (SEER) cancer registries and a convenience sample of women over age 64 from the 5% Medicare file (see Table 1). The 5% Medicare file is linked by NCI to the SEER registry data and to all available Medicare claims . All women over age 64 from the linked files who have Medicare claims and a valid address during the period 2002–2003 are included in our study population of 224,585 women.
The NCI linked SEER-Medicare database follows subjects longitudinally over the course of their remaining lives. NCI links the SEER registry data with a 5% sample of Medicare-eligible people residing in the SEER registry states. The 5% Medicare sample is nationally representative, drawn randomly from the 100% enrollment file containing all Medicare beneficiaries. People in the 5% file are drawn annually based on having specific digits in their Health Insurance Claim number (a permutation of the social security number), providing a nationally representative longitudinal sample that is useful as a reference sample for analysis of medical treatment paths, costs, utilization, and outcomes over time and comparisons between women with and without cancer . This longitudinal feature allows us to use 2 years of claims data for each individual in our sample.
Some people in the 5% sample are also in the SEER registries because they have been diagnosed with cancer. In our work, we use women from the 5% sample (who may or may not have cancer) in combination with women from the SEER registries (who have been diagnosed with cancer). This combination of women with and without cancer diagnosis allows assessment of previous cancer diagnosis on propensity to use mammography. We include all women over age 64 with a breast cancer diagnosis rather than limiting the registry cohort to those also included in the 5% file. This sample design results in greater numbers of women with cancer diagnoses in the small areas that we study, allowing for a more robust inference.
NCI links to the registry and 5% Medicare subjects all available Medicare claims, providing information about the timing and type of mammography used. A recent study validates the use of Medicare claims data to assess mammography utilization . We study women in the eight states (CA, IA, KY, UT, NM, LA, CT, NJ) and three portions of states (GA, WA, MI) covered by the SEER registries. Table 1 provides the counts of sample women in each state, by age, from SEER and 5% Medicare samples. The time period studied is the 2-year interval 2002–2003, and the outcome we study is any mammography use during this period.
Mammography utilization behavior is inferred from the Medicare claims files linked by NCI to the SEER registry and 5% Medicare reference subjects. However, only persons with traditional Medicare FFS coverage for both Part A (mandatory, covers hospitalizations) and Part B (elective, covers outpatient services) will have medical claims available for study. Other forms of health insurance, such as Medicare private insurance plans (Medicare HMOs, others) will produce no claims because these plans are not required to file claims with Medicare. While the vast majority of persons aged 65 and older have traditional Medicare FFS coverage, the proportion is dwindling and varies considerably across geography with the prevalence of Medicare private insurance plans . Thus, the SEER-Medicare subsample we study is not nationally representative but conditional on a person's having traditional Medicare FFS coverage for both Parts A and B. Only a few women (less than 1%) were dropped from the analysis because their addresses could not be mapped to one of the SEER states due to bad or missing ZIP codes.
Lower-income or poor women are included in our study when they have coverage for Part B services, often achieved through dual eligibility for Medicare and Medicaid . In our sample, the proportion of dually eligible varies from about 23% to 25% in CA and LA to less than 7% in UT, while about 17% on average over all areas. Thus, all women in our study sample had traditional, FFS Medicare, which allows choice of any provider and pays for annual mammograms.
We obtained the ZIP code of address for each woman in the sample and first checked to see that the address from the enrollment database file was consistent with the address on the claims file. Enrollment file addresses are not updated continuously and women may migrate to other locations for services. For more accurate address location at time of service, when enrollment and claims addresses differed, we used the claims address as the valid address ZIP code.
We then calculated the longitude and latitude of the ZIP code boundary's centroid to determine which other areas (PCSA, county) to associate with each woman's address. In this geocoding process, we assumed that the ZIP code was associated with the census ZCTA or county that contained its centroid. This assumption was necessary because ZIP codes are not always neatly contained completely within ZCTA or county areas. Once all ZIP codes were associated with ZCTAs, finding the associated PCSAs was straightforward, because ZCTAs nest completely inside PCSAs. The contextual variables used for modeling were derived from census data defined at the ZCTA level or obtained from other PCSA- and county-level data sources (see Table 2).
Multilevel data and empirical modeling
The multilevel data used in the empirical modeling were defined at the following levels: person, PCSA, and county. The fourth level, state, is omitted because there are not enough states to account for this level explicitly in the estimation. The multilevel data structure necessitates some form of multilevel modeling, and several alternatives are available. When there are not good measures of the contextual factors operating at different levels to include directly in the empirical model, researchers can model some of the unexplained place-specific variability using a random effects model specification . However, when the place-specific heterogeneity is modeled directly through a rich set of covariates, the random effects variances often shrink to zero, and a random effects model specification is no longer necessary . In this latter situation, the focus is on robustly estimating the effects of the higher-level covariates, which are repeated (redundant) over the lower units of analysis (i.e., women in the same county all are assigned the same HMO penetration variable). This redundancy can reduce the standard errors on the estimated coefficients of the higher-level variables, making them seem more statistically significant than they in fact are [49, 21].
Because we have a very rich set of higher-order covariates, we use an intermediate modeling approach and focus on efficient estimation to produce reliable standard errors for the contextual covariates. We use the GEE robust empirical approach to correct the standard errors for biases stemming from redundancies in the contextual variables within areal units [50–52]. Horton and Lipsitz  note that in generalized linear models (GLMs) when the outcome variable is approximately normally distributed, standard likelihood approaches are useful for analysis of clustered data. To extend the GLM approach to models with discrete outcomes, such as our binary probit regression, Liang and Zeger  formulated the GEE approach, which is not likelihood-based and does not require parameterization assumptions for the second-order variance terms, which they refer to as a "working" matrix. The GEE approach is attractive because it provides a nonparametric, empirical approach that is robust to inappropriate assumptions about the variance-covariance matrix. The empirical approach is preferred when the number of clusters is large, which is a feature of our data (see Table 4) .
It is important to note that GEE estimators are used to characterize the average response for observations sharing the same set of covariates within an area (the unit of clustering) . In our analyses, these GEE estimators provide efficient estimates of the association between a community contextual covariate and average women's response to it, in terms of the propensity to use mammography, in the community. Thus, we interpret the findings in the context of "women living in communities like X," rather than as specific community effects on individual women's behaviors.
We estimate binary probit regression models with factors representing the various levels in our contextual model: individual, neighborhood, and community. We examine direct effects of the neighborhood and community variables and interaction between individual-specific and contextual variables which we hypothesize will modify the direct effect estimates. The four interactions we examine are as follows:
1. individual's own race or ethnicity and same racial or ethnic segregation in her neighborhood (i.e., living in a segregated neighborhood among others of one's same ethnicity);
2. individual's age category (see Table 1) and managed care penetration in her community;
3. individual's age category (see Table 1) and commuter intensity in her neighborhood; and
4. individual's disability status and commuter intensity in her neighborhood, where disability status is determined by whether a women had personal disability as the original reason for Medicare entitlement.
Because of extreme multicollinearity between age or commuter intensity in interactions numbered 2 through 4 above, we are not able to include all interaction effects of interest simultaneously in one model. We estimate three separate models, all including the first set of interactions (own race and segregation) and one other set. Model 1 contains the disability by commuter intensity interaction, Model 2 includes the age group by managed care interaction, and Model 3 includes the age group by commuter intensity interaction. The three interaction models are presented side-by-side in the results (Table 5), by state. Age is kept as a continuous variable in Model 1 to estimate the linear effect of another year in age on probability of use. In Models 2 and 3, age is entered as a categorical variable to assess nonlinearities in the interaction term effects. The effect estimates that reach statistical significance at the 5% level or better are included in the body of Table 5. Effect estimates in Table 5 are interpreted as the effect of a one unit increase in the covariate on probability of mammography use in 2002–2003.
We propose several hypotheses. If living in a segregated community among one's own race or ethnicity increases social support, the interaction effect on mammography use will be positive. If disability is associated with enhanced transportation alternatives, then disabled women may be less affected by commuter conditions than other women. If FFS-insured Medicare beneficiaries closer to retirement age (age 65–72) are less affected (i.e., more independent in dictating their own health care activities) by market conditions (managed care practice spillovers) than older beneficiaries, then the effect of managed care spillovers will be greater for older beneficiaries (age 73–80, 81+) than younger beneficiaries (age 65–72). If advancing age makes one less affected by commuter intensity, the effect of commuter intensity will be less significant for older than younger elderly groups. This may happen if women of younger age (65–72) are still driving themselves, compared with the older age groups (73–80, 81+).
To assess these hypothesized relationships, we estimate each state as a separate region, a strategy that allows the maximum amount of heterogeneity possible, as the regression slopes can vary from state to state. We contrast the findings from the state-specific models with a pooled model containing state-specific dummy variables, which forces each effect estimate to be the same across states. The comparison is used to suggest how misleading a pooled approach (which ignores spatial heterogeneity) can be when examining disparities in mammography use. However, because the binary probit GEE models are estimated using maximum likelihood methods, there is no way to conduct a statistical test of the observed differences in the estimated effect parameters across the state-specific models (for those models with identical specifications, which include all states except MI, GA, KY). The pooled models cannot be used to assess meaningful differences across states either, because the state-specific dummies only capture differences in the average probability of mammography use across the states, forcing all of the effect estimates to be the same across states. Thus, we can provide descriptive comparisons only for differences noted across states.
The sample size in states varies. Also, MI, GA, and WA samples only contain the substate portions of those states covered by the SEER registries (see Table 1). The GA sample spans urban and rural areas in GA and is larger than the UT or NM samples, which cover those entire states. The WA sample also covers urban and rural areas, but the MI sample is very urban, covering the tri-county Detroit metropolitan area. All other states have both urban and rural areas and complete statewide coverage. Results from Michigan are not strictly comparable to other states because the three Detroit metropolitan area counties exhibit too little variation for all of the county-level variables to be included in the model. Managed care penetration and provider supply variables were correlated more than 95% using simple Pearsonian correlations; only two county-level variables–number of mammography facilities and oncologists per capita–were sufficiently uncorrelated to allow inclusion. Also, so few Native American women were present in GA, KY, and MI that they were pooled into the "other race" category and the interaction between Native American race and area segregation could not be included for these states. Similarly, the Hispanic ethnicity and its interaction with segregation were not possible in KY. Thus, the model specifying interaction between race or ethnicity and isolation of that same race or ethnicity is not uniform across states due to data limitations.
Several variables included in the estimation and in the sample statistics presented in Tables 3 and 4 were dropped from the reported results (see Table 5). Their estimated impacts were not significant in one or more states, so for brevity we dropped proportion of elderly women living alone, violent crime rate, land use mix, the ratio of international medical graduates to U.S.-trained physicians (these were only significant in CA, see ); and proportion of the population moving into the area 1995–2000 (significant in UT, effect estimate 0.21 with p-value 0.01), and the binary indicator of primary care physician shortage (not statistically significant anywhere). We also dropped a significant variable, number of nurses per thousand elderly, which had a significant but tiny positive effect in the pooled, CA, and NJ models (the effect was 0.00 for a 1-nurse increase per 1,000 elderly). One statistically significant individual-level control variable was also dropped for brevity: the number of months enrolled in a Medicare HMO in the 2 years prior to the study. The estimated effect was tiny, negative (-0.00), and statistically significant in CA, CT, NJ, and WA reflecting the fact that women in an HMO plan prior to the period under study may have received recent mammography that very slightly lowered their probability of use in the period studied. Full results are available from the authors upon request.
Table 5 contains the multilevel regression results from the pooled and state-specific models. Only the effects that are statistically significant at the 5% level or better are presented in the table. The pooled model contains state-specific intercepts that suggest lower average probabilities of mammography use in all states relative to CA, the reference state, after adjusting for other covariates. These state-specific intercepts from the pooled models (in the first three columns of Table 5) are presented in Table 6.
Individual-level effects are fairly consistent across interaction models and states. Disability is associated with significantly lower probabilities of mammography use, ranging from about 3% to 7% lower probability across states. Dual eligibility effects are smaller, amounting to at most 1% lower probability in all states. Use of mammography declines about 1% to 2% with each additional year of age (Model 1) and declines more with higher age categories (Models 2 and 3). Having recently moved to a new ZIP code is associated with lower probability of use in six states, reaching a substantial 10% to 11% lower probability for IA and MI. Distance to closest provider effects are quite small and only present in four states: CA, KY, LA, and UT. Flu-shot behavior is associated with about 12% to 22% higher probability of mammography and is significant across all states, suggesting that women who receive flu shots from their doctors are also more likely to utilize mammography. Having a cancer diagnosis is associated with a much higher probability of use in every state. With so much agreement across states, the pooled model results are quite consistent with the state-level findings in terms of size and sign of effect for these individual-level variables. Individual's race or ethnicity effects, where statistically significant, are negative, suggesting that women of other races and ethnicities are less likely to use mammography than white women.
Turning to the neighborhood variables, the effect estimates associated with the segregation indices vary in numerical sign across states. On average, women living in more segregated communities appear to have lower probability of use in some states and higher probability in others. The pooled effect estimates reflect this variability across states, often not achieving statistical significance. Women living in more segregated African American communities appear to have lower probabilities of mammography use in CA, IA, and WA but may have higher probability in KY (Model 1 only). Women living in more segregated Hispanic communities appear to have lower probabilities of use in CA, IA, and NM, but higher probabilities in NJ. Women living in more segregated Asian or American Indian/Alaska Native communities appear to have lower probability of use in NM only.
Residential segregation in the immediate neighborhood is a measure of cohesion; however, to understand social support, we examine whether women of a particular race or ethnicity living in same-race segregated communities are more or less likely to use mammography. If living among others of the same race or ethnicity encourages healthier behaviors, then there would be a positive effect from this interaction on mammography use. For African American women, the effect is positive in NJ, MI, and WA (Model 1 only) and also positive in the pooled model. For Hispanic women, the effect is positive in CT, NJ, and NM (Model 1 only). For Native Americans, the effect is positive in CA and IA but negative in NM and in the pooled estimate. We note that when the sign of the effect varies across states, the pooled model will inevitably contradict some state findings and agree with others, when it achieves significance.
Several PCSA-level variables had significant effects. Commuter intensity–the proportion of the workforce in a local neighborhood who commuted more than 60 minutes each way to work–is a large negative effect on the probability of use in CA, CT, IA, LA, and MI. The effect is positive in NJ, suggesting that elderly women living in commuter communities there are actually more likely to use mammography. Interaction between age and commuter-intensity (Model 3) suggests that older women living in commuter intense areas are less likely to use mammography than the younger elderly group (age 65–73), in the pooled model and in MI. By contrast, in LA, older women living in more commuter-intense areas are more likely to use mammography than younger elderly women. In the interaction between commuter intensity and disability status (Model 1), evidence suggests that disabled women living in commuter-rich communities are more likely to use mammography in CA, GA, MI, and NM but less likely in LA; the pooled results reflect the findings for LA only.
Living in communities where proportionately more elderly have little or no English language ability is associated with lower utilization in five states (CA, CT, IA, NJ, and UT), consistent with the pooled results. Living in communities where proportionately more elderly are impoverished is associated with lower utilization in CA only, where the effect is larger than the language ability variable.
Health services provider variables such as mammography facilities and oncologists per thousand older persons have significant positive effects in the pooled sample and in some states but negative effects in others. Oncologist supply is associated with significantly higher use in the pooled model, IA, KY, LA (Model 3 only), and WA. However, having a greater number of facilities available per capita is associated with lower utilization in IA but higher utilization in MI and CA; effects are quite large in MI.
Area HMO penetration is a significant negative predictor in KY (Model 2 only) but is positive in NM and in the pooled average. Positive HMO effects are consistent with change in area behavior toward greater utilization of preventive care services where there is greater HMO penetration. Interaction effects with HMO penetration and age suggests that older women are more likely (than those in the age 65–72 cohort) to use mammography in HMO-rich markets in CA, KY, and NM.
It is well known that the race and ethnicity coding in the Centers for Medicare & Medicaid Services (CMS) Enrollment Data Base (EDB) is not perfect, with a greater degree of error for persons who are not white or African American [53, 54]. However, large administrative databases are thought to be more reliable sources of population race or ethnicity data than small household or other surveys, which typically underrepresent minorities . In efforts to improve and expand the coding, CMS conducted a postcard survey of over 2 million beneficiaries with Hispanic surnames or countries of origin in 1997 and beneficiaries with "other" or "missing" race or ethnicity codes. The survey resulted in updated coding for about 858,500 beneficiaries, improving the race or ethnicity coding of the EDB, as the number of persons with a race code of "other" or "unknown" decreased from 978,000 in 1993 to 473,000 in 1997 [53, 54]. An analysis comparing the distribution of race or ethnicity for Medicare beneficiaries aged 65 or over as coded in the updated EDB to that of U.S. Census estimates for the same age group found very similar distributions, concluding that studies of disparities in utilization rates using the EDB as the source of racial or ethnic coding would likely provide unbiased estimates of these utilization rates . However, because the race and ethnicity coding is less reliable for counts of individuals who are not white or African American, our findings for these groups should be interpreted with caution and validated in future research.
Differences in findings across states may derive in part from the different sample sizes and the spatial sufficiency of samples within states, and we cannot test for this in these data. Also, in the single time interval examined here, this cross-sectional analysis is limited to suggesting evidence of associations, not causal relationships. Another limitation is the ability to generalize the policy findings to other areas or states. Hence, our results should be interpreted as descriptive findings, which may offer some insights into areas of potentially fruitful further study.
Summary of findings
In this paper, we examine various factors affecting mammography use and disparities in use from a large random sample of women aged 64 or older with traditional Medicare coverage across 11 heterogeneous regions of the United States. We estimate a binary probability model of mammography use, with multilevel modeling to account for redundancies in higher-level contextual variables assigned to women within geographic units. Our multilevel model is based on a carefully developed theoretical model of the ecological environment for mammography use. The theory posits that a variety of contextual factors, operating at different levels of influence, can help explain observed mammography use. Factors include traditional access and health system supply variables, population/demand variables, and a rich set of socio-ecological variables describing other aspects of the community–besides health system and medical aspects–that are important correlates of observed behavior.
We find considerable variation among states in the effect estimates of contextual variables with some policy implications. Commuter intensity seems to deter mammography use in some states and increase it in another. Given this variability in findings across the states, a transportation policy aimed at improving access to mammography for elderly women in commuter communities should perhaps be tailored to serve regions where commuter crowding is apparently an impediment. Elderly living in communities with worse English language ability among the elderly are less likely to use mammography in five states. This finding suggests that health communication policy to increase health literacy may be needed in some communities where the elderly are isolated due to poor language abilities. Women living in impoverished elderly communities are less likely to use mammography, although all have both Parts A and B of traditional Medicare coverage, allowing free choice of provider and annual mammograms free of charge. This finding suggests that impoverished environs are important determinants of mammography utilization even for those residents with the means to pay for mammography services. Prevalence of HMOs is positively associated with utilization in some states and negative in others. The impact of HMO spillovers on preventive care behaviors is not consistent across states and seems greater for older women. Because the tendency is to utilize mammography less with increasing age, the HMO spillovers seem to counter this trend in a few states.
The main objective of this paper is to explore the hypothesis that regression models estimated over pooled samples of heterogeneous states may provide misleading information regarding predictors of health care utilization. In particular, race effects may be largely identified by differences across states with different racial and ethnic compositions. This confounding of race and place is an important issue receiving some recent attention in the literature [5–8]. We find that Georgia, Louisiana, and Detroit metropolitan Michigan have by far the greatest proportions of African Americans in these data (21%, 21%, and 17%, respectively) but no statistically significant difference between the propensity for blacks and whites to use mammography within these states. Other states with smaller proportions of blacks–namely CA and NJ (with 5% and 9%, respectively)–show significant differences between black and white utilization rates. However, looking across states with the 11-state average produced by the pooled model, results suggest that there are disparities in use between blacks and whites, which clearly contradicts what we find for the three states with the largest black populations. With this much heterogeneity across states, it is difficult to conclude anything about an average effect.
We also estimate a set of interaction effects between individual-level attributes and contextual factors in the woman's environment that have not been well studied. We are particularly interested in aspects of social support that might be captured by the interaction of a person's race/ethnicity with living in a more segregated place of the same race or ethnicity. The pooled model suggests a supportive effect for black women living in segregated black communities, but this is only found in a single state, NJ. The pooled model finds no supportive effect for Hispanic women living in segregated Hispanic communities, but CT, NJ, and NM all show significant interactions. The pooled model finds a negative effect on screening probability for Native Americans living in segregated Native American communities, but this is consistent with one state (NM) and inconsistent with two others (CA and IA).
Among the several contextual factors studied, commuter intensity, poor elderly English language ability, and elderly poverty have the greatest apparent impacts on mammography use; however, the impacts varied across the states studied. Pooling across states yields consistent results only when there is agreement across states in the sign of effect from the contextual covariate. In many cases, the pooled results can be misleading because states are quite heterogeneous. A recent review article uses meta-analysis to combine the results from various studies of disparities in mammography use conducted in specific regions of the United States with those that were nationwide in scope . This meta-analytical approach imposes a great degree of statistical abstraction from the reality that places are quite different from one another, yielding average effect estimates across the pooled studies. This is similar to what happens when we pool states together, forcing the effect estimates to be the same everywhere. Pooling results in effect estimates that are an abstraction from reality, masking the fact that places are quite different. Comprehensive cancer control efforts should recognize that people and places exhibit a complex joint spatial distribution of characteristics. Efforts to reduce disparities must model the diversity in order to highlight the differences.
Berry D, Cronin K, Plevritis S, Fryback D, Clarke L, Zelen M, Mandelblatt J, Yakovlev A, Habbema J, Feuer E: Effect of screening and adjuvant therapy on mortality from breast cancer. N Engl J Med. 2005, 353: 1784-1792. 10.1056/NEJMoa050518.
Centers for Disease Control and Prevention: Use of mammograms among women aged ≥ 40 years–United States, 2000–2005. MMWR. 2007, 56: 49-51.
Peek M, Han J: Disparities in screening mammography: current status, interventions, and implications. J Gen Intern Med. 2004, 19: 184-194.
Probst J, Moore C, Glover S, Samuels M: Person and place: the compounding effects of race/ethnicity and rurality on health. Am J Public Health. 2004, 94: 1695-1703.
Virnig B, Scholle S, Chou A, Shih S: Efforts to reduce racial disparities in Medicare managed care must consider the disproportionate effects of geography. Am J Manag Care. 2007, 13: 51-56.
Chandra A, Skinner J: Geography and racial health disparities. NBER Working Paper. 2003, No.W9513
Coughlin S, Thompson T, Seeff L, Richards T, Stallings F: Breast, cervical, and colorectal carcinoma screening in a demographically defined region of the southern US. Cancer. 2002, 95: 2211-2222. 10.1002/cncr.10933.
Slifkin R, Goldsmith L, Ricketts T: Race and place: urban-rural differences in health for racial and ethnic minorities. North Carolina Rural Health Research and Policy Analysis Program Working Paper Series. 2000, No. 66.
Mobley L, Kuo M, Andrews L: How sensitive are multilevel regression findings to defined area of context? A case study of mammography use in California. Med Care Res Rev. 2008, 65: 315-337. 10.1177/1077558707312501.
Smedley B: Expanding the frame of understanding health disparities: from focus on health systems to social and economic systems. Health Educ Behav. 2006, 33: 538-541. 10.1177/1090198106288340.
Susser M: Does risk factor epidemiology put epidemiology at risk? Peering into the future. J Epidemiol Community Health. 1998, 52 (10): 608-611.
Susser M, Susser E: Choosing a future for epidemiology: from black box to Chinese boxes and eco-epidemiology. Am J Public Health. 1996, 86: 674-677.
Bronfenbrenner U: The Ecology of Human Development. Experiments by Nature and Design. 1979, Cambridge, Massachusetts: Harvard University Press
Sallis J, Bauman A, Pratt M: Environmental and policy interventions to promote physical activity. Am J Prev Med. 1998, 15: 379-97. 10.1016/S0749-3797(98)00076-2.
Sallis J, Owen N: Physical Activity and Behavioral Medicine. 1999, Thousand Oaks, CA: Sage
Sallis J, Owen N, Frank L: Behavioral epidemiology: a systematic framework to classify phases of research on health promotion and disease prevention. Ann Behav Med. 2000, 22: 294-8. 10.1007/BF02895665.
Baranowski T, Anderson C, Carmack C: Mediating variable framework in physical activity interventions: How are we doing? How might we do better?. Am J Prev Med. 1998, 15: 266-297. 10.1016/S0749-3797(98)00080-4.
Smedley B, Syme S, Eds: Promoting Health: Strategies from Social and Behavioral Research. 2000, Institute of Medicine, Washington DC: National Academies Press
Schmid T, Pratt M, Witmer L: A framework for physical activity policy research. J Phys Act Health. 2006, 3 (Suppl 1): S20-S29.
Schulz A, Kannan S, Dvonch J, Israel B, Allen A, James S, House J, Lepkowski J: Social and physical environments and disparities in risk for cardiovascular disease: the healthy environments partnership conceptual model. Environ Health Perspect. 2005, 113: 1817-1825.
Blakely T, Woodward A: Ecological effects in multi-level studies. J Epidemiol Community Health. 2000, 54: 367-374. 10.1136/jech.54.5.367.
Duncan C, Jones K, Moon G: Health-related behavior in context: a multilevel modeling approach. Soc Sci Med. 1996, 42: 817-830. 10.1016/0277-9536(95)00181-6.
Duncan C, Jones K, Moon G: Context, composition, and heterogeneity: using multilevel models in health research. Soc Sci Med. 1998, 46: 97-117. 10.1016/S0277-9536(97)00148-2.
Diez-Roux A: Bringing context back into epidemiology: variables and fallacies in multilevel analysis. Am J Public Health. 1998, 88: 216-222.
Krieger N: Epidemiology and the web of causation: has anyone seen the spider?. Soc Sci Med. 1994, 39: 887-903. 10.1016/0277-9536(94)90202-X.
Hillemeir M, Lynch J, Harper S, Casper M: Measuring contextual characteristics for community health. Health Serv Res. 2003, 38: 1645-1717. 10.1111/j.1475-6773.2003.00198.x.
Pickett K, Pearl M: Multilevel analyses of neighborhood sociodemographic context and health outcomes: a critical review. J Epidemiol Community Health. 2000, 5: 111-122.
Litaker D, Tomolo A: Association of contextual factors and breast cancer screening: finding new targets to promote early detection. J Womens Health. 2007, 16 (1): 36-45. 10.1089/jwh.2006.0090.
Litaker D, Koroukian S, Love T: Context and healthcare access: looking beyond the individual. Med Care. 2005, 43: 531-540. 10.1097/01.mlr.0000163642.88413.58.
Northridge M, Sclar E, Biswas P: Sorting out the connections between the built environment and health: a conceptual framework for navigating pathways and planning healthy cities. J Urban Health. 2003, 80: 556-568.
Booth S, Sallis J, Ritenbaugh C, Hill J, Birch L, Frank L, Glanz K, Himmelgreen D, Mudd M, Popkin B, Rickard K, St Jeor S, Hays N: Environmental and societal factors affect food choice and physical activity: rationale, influences, and leverage points. Nutr Rev. 2001, 59 (Suppl 3 Pt 2): S21-S39. discussion S57–S65.
Aday L, Andersen R: A framework for the study of access to medical care. Health Serv Res. 1974, 9: 208-220.
Goodman D, Mick S, Bott D, Stukel T, Chang C, Marth N, Poage J, Carretta H: Primary care service areas: A new tool for the evaluation of primary care services. Health Serv Res. 2003, 38: 287-310. 10.1111/1475-6773.00116.
Massey D, Denton N: The dimensions of residential segregation. Social Forces. 1988, 7: 281-315. 10.2307/2579183.
Williams D, Collins C: Racial residential segregation: a fundamental cause of racial disparities in health. Public Health Rep. 2001, 116: 404-416.
Schulz A, Williams D, Israel B, Lempert L: Racial and spatial relations as fundamental determinants of health in Detroit. Milbank Q. 2002, 80: 677-707. 10.1111/1468-0009.00028.
Palloni A, Arias E: Paradox lost: explaining the Hispanic adult mortality advantage. Demography. 2004, 41: 385-415. 10.1353/dem.2004.0024.
Mobley L, Root E, Finkelstein E, Khavjou O, Will J: Relationships between the built environment and other contextual factors, obesity, and cardiac risk. Am J Prev Med. 2006, 30: 327-332. 10.1016/j.amepre.2005.12.001.
Mobley L, Root E, Anselin L, Lozano-Gracia N, Koschinsky J: Spatial analysis of elderly access to primary care services. Int J Health Geogr. 5: 19-10.1186/1476-072X-5-19. 2006 May 15
Schulz A, Kannan S, Dvonch J, Israel B, Allen A, James S, House J, Lepkowski J: Social and physical environments and disparities in risk for cardiovascular disease: the healthy environments partnership conceptual model. Environ Health Perspect. 2005, 113: 1817-1825.
Baker L: Managed care spillover effects. Annu Rev Public Health. 2003, 24: 435-456. 10.1146/annurev.publhealth.24.100901.141000.
Mick S, Lee S, Wodchis W: Variations in geographical distribution of foreign and domestically trained physicians in the United States: "safety nets" or "surplus exacerbation"?. Soc Sci Med. 2000, 50: 185-202. 10.1016/S0277-9536(99)00183-5.
Loukaitou-Sideris A, Eck J: Crime prevention and active living. Am J Health Promot. 2007, 21 (Suppl 4): 380-389.
Keane C: Evaluating the influence of fear of crime as an environmental mobility restrictor on women's routine activities. Environment and Behavior. 1998, 30: 60-74. 10.1177/0013916598301003.
Meyer E, Post L: Alone at night: a feminist ecological model of community violence. Feminist Criminology. 2006, 1: 207-227. 10.1177/1557085106289919.
Warren J, Klabunde C, Schrag D, Bach P, Riley G: Overview of the SEER-medicare data: content, research applications, and generalizability to the United States elderly population. Med Care. 2002, 40 (Supp IV): IV-3-IV-18.
Smith-Bindman R, Quale C, Chu P, Rosenberg R, Kerlikowske K: Can Medicare billing claims data be used to assess mammography utilization among women ages 65 and older?. Med Care. 2006, 44: 463-470. 10.1097/01.mlr.0000207436.07513.79.
MEDPAC: Dual eligible beneficiaries: An overview. Report to the Congress: New Approaches in Medicare. 2004, Chapter 3:
Gumpertz M, Pickle L, Miller B, Bell S: Geographic patterns of advanced stage breast cancer in Los Angeles: associations with biological and sociodemographic factors. Cancer Causes Control. 2006, 17: 325-339. 10.1007/s10552-005-0513-1.
Hardin J, Hilbe J: Generalized Estimating Equations. 2003, New York: Chapmann & Hall/CRC
Horton N, Lipsitz S: Review of software to fit generalized estimation equation regression models. The American Statistician. 1999, 53: 60-169. 10.2307/2685737.
Liang K, Zeger S: Longitudinal data analysis using generalized linear models. Biometrika. 1986, 73: 13-22. 10.1093/biomet/73.1.13.
Arday S, Arday D, Monroe S, Zhang J: HCFA's racial and ethnic data: current accuracy and recent improvements. Health Care Financ Rev. 2000, 21: 107-116.
Waldo D: Accuracy and bias of race/ethnicity codes in the Medicare enrollment database. Health Care Financ Rev. 2005, 26: 61-72.
Purc-Stephenson R, Gorey K: Lower adherence to screening mammography guidelines among ethnic minority women in America: a meta-analytic review. Prev Med. 2008 Jan 16,
This work was supported by a National Cancer Institute grant (1R01CA126858-01A1), for the project "Geospatial Factors & Impacts: Measurement and Use." The content is solely the responsibility of the authors and does not necessarily represent the official views of RTI International, the National Cancer Institute, or the National Institutes of Health. Thanks to the funding support, multilevel data developed for this work are available to other researchers free of charge.
The authors declare that they have no competing interests.
LRM led the study and participated in all aspects of the study. T-MK led the multilevel analysis and contributed to the study design and writing. LC led the GIS data development and contributed to the writing. DD and LA contributed to data analysis and interpretation and the writing. LA also provided expert statistical and econometric consulting advice. All authors read and approved the final manuscript.