U.S. congressional district cancer death rates
International Journal of Health Geographics volume 5, Article number: 28 (2006)
Geographic patterns of cancer death rates in the U.S. have customarily been presented by county or aggregated into state economic or health service areas. Herein, we present the geographic patterns of cancer death rates in the U.S. by congressional district. Many congressional districts do not follow state or county boundaries. However, counties are the smallest geographical units for which death rates are available. Thus, a method based on the hierarchical relationship of census geographic units was developed to estimate age-adjusted death rates for congressional districts using data obtained at county level. These rates may be useful in communicating to legislators and policy makers about the cancer burden and potential impact of cancer control in their jurisdictions.
Mortality data were obtained from the National Center for Health Statistics (NCHS) for 1990–2001 for 50 states, the District of Columbia, and all counties. We computed annual average age-adjusted death rates for all cancer sites combined, the four major cancers (lung and bronchus, prostate, female breast, and colorectal cancer) and cervical cancer. Cancer death rates varied widely across congressional districts for all cancer sites combined, for the four major cancers, and for cervical cancer. When examined at the national level, broad patterns of mortality by sex, race and region were generally similar to those previously observed based on county and state economic area.
We developed a method to generate cancer death rates by congressional district using county-level mortality data. Characterizing the cancer burden by congressional district may be useful in promoting cancer control and prevention programs, and in persuading legislators to enact new cancer control programs and/or strengthen existing ones. The method can be applied to state legislative districts and other analyses that involve data aggregation from different geographic units.
Cancer death rates presented by geographic boundaries such as state and county, state economic areas, and health service areas have been useful in monitoring temporal trends, in allocating public health resources [1, 2], and, in some instances, in generating etiological hypotheses. These rates are less useful for communicating to legislators and policy makers whose jurisdictions are not defined by state or county boundaries. There have been no published studies that attempted to measure cancer death rates within congressional districts.
Public policy and legislation play a critically important role in efforts to reduce the burden of cancer. For example, the American Cancer Society estimates that about 170,000 of the 564,830 cancer deaths expected in 2006 will be caused by tobacco use alone. Policy measures that are proven to reduce smoking prevalence include excise taxes and funding for state comprehensive tobacco control programs [4–6]. Declines in smoking prevalence among men as a result of public health efforts have had a major influence on the declines in cancer mortality in the last decade.
We present a method to calculate cancer death rates according to congressional district that may be useful in advocating for legislative initiatives and funding for cancer research and prevention programs.
Results and discussion
Maps of cancer death rates by congressional district were prepared for men and women, for all races combined, and for African Americans, non-Hispanic whites, and Hispanics (Figures 1, 2, 3, 4, 5); Hispanics are not mutually exclusive of whites and African Americans. Regional patterns of cancer mortality for African Americans and non-Hispanic whites were compared to previously published maps based on counties and state economic areas. Although maps of cancer mortality by congressional district were also prepared for Hispanics, regional patterns are difficult to interpret because of insufficient data to calculate rates for most parts of the country. When examined at the national level, broad patterns of mortality for African Americans and non-Hispanic whites by sex and region were consistent with those previously observed. Geographic variations in cancer death rates may reflect, in part, regional variations in risk factors such as smoking and obesity, early detection and screening, and access to and utilization of medical services.
Figure 1 shows geographic patterns of death rates for all cancer sites combined by congressional district in the United States. In men, rates range from 186.3 in Utah congressional district #3 to 343.7 in the District of Columbia (Table 1) and in women, from 123.4 in Utah congressional district #1 to 217.4 in Pennsylvania congressional district #2 (Table 2). Generally, the patterns for all cancer sites combined are strikingly similar to those for lung cancer (Figure 2), reflecting the importance of lung cancer as a cause of cancer death, and the strong association of lung cancer and cancers of several other sites with tobacco smoking. Lung cancer death rates in all races combined range from 35.7 in Utah congressional district #1 to 130.3 in Kentucky congressional district #5 for men and from 14.8 in Utah congressional district #3 to 57.9 in Kentucky congressional district #5 for women. Lung cancer death rates are the highest in congressional districts in Appalachia and the south among non-Hispanic white men and in the Midwest and the south among African American men. In contrast, among women, rates are the highest in congressional districts in the Midwest among African Americans and in the west, Appalachia, and the coastal south among non-Hispanic whites. Historically, smoking was more common in the south among men and in the west among women, especially among whites. Although patterns of lung cancer mortality in the 1990s primarily reflect smoking patterns in the 1950s and 1960s, the burden of death from all cancers and lung cancer by congressional district can be used to illustrate the importance of tobacco control measures as well as to document local needs for cancer treatment and associated services.
Historically, female breast cancer death rates have been elevated in the Northeastern and North Central regions; North-South differences have diminished over time as female breast cancer death rates decreased in the Northeast but increased in the South. For all races combined, female breast cancer death rates vary from 20.6 in Hawaii to 39.4 in the District of Columbia. Among African American women, breast cancer death rates are highest in congressional districts in the south, Midwest, and west coast, while among non-Hispanic whites, breast cancer mortality is highest in congressional districts in the Northeast and west coast (Figure 4, right panel). Patterns of breast cancer mortality partly reflect the influence of known risk factors as well as access to and utilization of cancer screening and treatment. Important cancer control measures include access to mammography for the uninsured and under-insured, and availability of Medicaid coverage for diagnosis and treatment.
Colorectal cancer death rates are highest overall in the Northeast and parts of the South and Midwest. Generally, death rates range from 18.4 in Texas congressional district #15 to 37.1 in Pennsylvania congressional district #1 for men and from 11.3 in Texas congressional district #15 to 24.1 in the District of Columbia for women (Figure 3). Although a strong geographic pattern for colorectal cancer mortality has existed since the 1950s, the reasons are not well understood. The current priority for colorectal cancer control is to increase the proportion of individuals over 50 who receive recommended screening tests. Illustrating colorectal cancer mortality by legislative district may be influential in encouraging legislative support for mandated insurance coverage of colorectal screening tests and for programs to provide testing for the uninsured and under-insured.
For all races combined, prostate cancer death rates range from 23.8 in Texas congressional district #15 and Hawaii to 58.2 in the District of Columbia. Generally, rates are highest in congressional districts in the mid-Atlantic and Southern coastal areas, reflecting in large part the higher proportion of African American men in the population of these areas (Figure 4, left panel). Death rates for African American men are more than twice the rates for non-Hispanic white men, reflecting higher incidence, later stage at diagnosis and poorer survival among African American men. Among non-Hispanic whites, rates are highest in congressional districts in the Rocky Mountain region; a high rate (40.2) is also observed in Hispanics in Texas congressional district #13. A recent study suggested that 10% to 30% of the geographic variation in prostate cancer death rates might relate to variations in access to medical care. Although cancer control measures for prostate cancer are less well-defined than measures for some other cancer sites, illustrating prostate cancer mortality by congressional district may be helpful in advocating for funding of research on the prevention, early detection and treatment of prostate cancer and highlighting the importance of access to medical care for African American men.
Mortality from cervical cancer in all races combined is highest in congressional districts in Appalachia, in the South and parts of the Southwest, with rates ranging from 1.4 in Minnesota congressional district #2 to 5.7 in New York congressional district #16 (Figure 5). Among African American women, rates are highest in congressional districts in the south and southeast; among non-Hispanic whites, rates are highest in congressional districts in Appalachia; and among Hispanics, rates are highest in congressional districts in the coastal parts of California and Texas and in Colorado congressional district #3. Important cancer control measures include access to Pap tests for the uninsured and under-insured, and availability of Medicaid coverage for diagnosis and treatment.
The cancer mortality patterns by congressional district are generally similar to the patterns seen using other geographic boundaries. However, the patterns by congressional district may be useful to cancer control advocates to illustrate the importance of cancer control measures (prevention, early detection, and treatment) for their constituents. The method can be applied to state legislative districts and other analyses that involve data aggregation from different geographic units. Further research is needed to validate the estimates using mortality data geocoded to a lower geographic level, such as the census block.
Death rates for U.S. states and counties
Mortality data were obtained from the National Center for Health Statistics (NCHS). We computed annual average age-adjusted death rates for all cancer sites combined, the four major cancers (lung and bronchus, prostate, female breast, and colorectal cancer) and cervical cancer from 1990–2001 for 50 states, the District of Columbia, and all counties using SEER*Stat. Death rates, counts (number of deaths), and populations for counties were directly obtained for men and women, for all races combined, and for African Americans, non-Hispanic whites, and Hispanics. Except for the years of 1990 and 2000, the intercensal populations computed by the Census Bureau were used to obtain the total populations for the study time period. Since county designation for Alaska and Hawaii was not available from NCHS, death rates for Alaska and Hawaii reflect state rates. Rates were standardized to the 2000 U.S. population and expressed per 100,000 person-years.
Death rates for U.S. congressional districts
There are 436 federal congressional districts in the U.S. (excluding Puerto Rico). Among these, eight congressional districts followed state boundaries or their equivalent (Alaska, District of Columbia, Delaware, Montana, North Dakota, South Dakota, Vermont, and Wyoming). Further, since county-specific mortality data were not provided for Hawaii in SEER*Stat, we assigned the state death rate to both of its congressional districts. For the remaining congressional districts, whose boundaries did not follow state and county boundaries (n = 426), death rates were calculated by assigning county-level age-adjusted death rates to census blocks and then aggregating death rates over blocks by congressional district using GIS and SAS. In doing so, we assume that all blocks within a county have the same death rate.
There are three major areal interpolation methods (areal weighting, surface smoothing, and the dasymetric technique) for generating estimates for target zones from data available for source zones when the two geographic units are not comparable. Areal weighting assumes that data are homogeneously distributed across geographic units, which is generally unrealistic; it also involves the direct superimposition of source zones and target zones, which often leads to many geographic boundary-line discrepancies. Surface smoothing models data available for source zones as a continuous surface across the adjacent zones, assuming that the density declines with distance, taking into account the proximity of neighboring centroids [16, 17]. The dasymetric technique uses ancillary information to refine uneven data distributions across geographic units. Land cover from remote sensing and the street layer [15, 19] have been used as subzone ancillary information. A recent study used parish-level (the lowest administrative unit) population data to derive weights. However, there is no universal rule for constructing an areal interpolation, and the best solution depends on various factors: the variables of interest, the spatial relationships between source zones and target zones, and the availability of ancillary information related to both.
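For contrast with the approach adopted in this study, areal weighting can be illustrated with a minimal sketch. The function name, zone identifiers, and numbers below are hypothetical and are not taken from the analysis; the sketch only shows how a source-zone count is split among target zones in proportion to overlapping area, which is exactly where the homogeneity assumption enters.

```python
def areal_weighting(source_counts, overlap_areas):
    """Split each source-zone count among target zones by share of area.

    source_counts: {source_id: count}
    overlap_areas: {(source_id, target_id): area of the intersection}
    """
    # Total area of each source zone, summed over its intersecting pieces.
    source_area = {}
    for (src, _tgt), area in overlap_areas.items():
        source_area[src] = source_area.get(src, 0.0) + area

    target_counts = {}
    for (src, tgt), area in overlap_areas.items():
        # Homogeneity assumption: the count is spread evenly over the area.
        share = area / source_area[src]
        target_counts[tgt] = target_counts.get(tgt, 0.0) + source_counts[src] * share
    return target_counts


# Hypothetical county split 75/25 by area between two districts.
counts = areal_weighting(
    {"county_A": 200},
    {("county_A", "cd_1"): 30.0, ("county_A", "cd_2"): 10.0},
)
```

If most of the county's residents lived in the smaller piece, this allocation would be badly off, which is the motivation for weighting by population instead of area.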
In this study, we constructed a dasymetric method based on the hierarchical spatial relationships between blocks and counties and between blocks and congressional districts. Generally, congressional districts and counties share the census block as a common basic spatial unit (Table 3) [21, 22]. We used block-level sex- and race-specific population to devise a dasymetric approach that assigns county-level measures such as cancer death rates to census blocks and then aggregates census blocks at the congressional district level, using block population as a weighting factor. We did not use areal weighting because of its unrealistic homogeneity assumption and the boundary-line discrepancies associated with direct superimposition of two incomparable geographic units. Surface smoothing gives reliable estimates when smoothness is a real property of the density. However, the occurrence of cancer rarely follows a smooth distance-decay surface because major risk factors that affect cancer occurrence do not vary smoothly from one centroid to its neighboring centroids.
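The block-population weighting described above can be sketched in a few lines. This is an illustrative sketch only, not the GIS and SAS workflow used in the study; the function name, the tuple layout, and all populations and rates are hypothetical, and each block is assumed to have already been assigned to a single county and a single congressional district.

```python
from collections import defaultdict


def district_rates(blocks, county_rate):
    """Population-weighted mean of county rates over blocks, by district.

    blocks: iterable of (county_fips, district_id, block_population)
    county_rate: {county_fips: age-adjusted rate per 100,000}
    """
    weighted_sum = defaultdict(float)
    population = defaultdict(float)
    for county_fips, district_id, block_pop in blocks:
        # Every block inherits the rate of the county that contains it.
        rate = county_rate[county_fips]
        weighted_sum[district_id] += rate * block_pop
        population[district_id] += block_pop
    # Districts with no population in this sex/race group get no estimate.
    return {d: weighted_sum[d] / population[d] for d in population if population[d] > 0}


# Hypothetical data: two counties, each split between two districts.
blocks = [
    ("01001", "CD-1", 600),
    ("01001", "CD-2", 400),
    ("01003", "CD-1", 400),
    ("01003", "CD-2", 1600),
]
county_rate = {"01001": 250.0, "01003": 200.0}
rates = district_rates(blocks, county_rate)
```

In this toy example CD-1 draws 60% of its population from the higher-rate county and so receives a rate of 230.0, while CD-2 draws 80% from the lower-rate county and receives 210.0.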
To make the calculations, the following steps were taken:
1. The number of people living within each census block by sex and race was determined from the 2000 U.S. census (covering 42 states, 426 congressional districts). Therefore, block population is sex- and race-specific.
2. Block population was spatially assigned to congressional districts by block centroids.
3. The age-adjusted cancer death rates for counties by sex and race were assigned to blocks by county FIPS (Federal Information Processing Standards) codes; FIPS codes are a standardized set of numeric or alphabetic codes issued by the National Institute of Standards and Technology (NIST) to ensure uniform identification of geographic entities through all federal government agencies.
4. The cancer death rate for each congressional district by sex and race was calculated by aggregating sex- and race-specific cancer death rates over blocks. Taking non-Hispanic white men as an example, suppose that $r_i$ was the age-adjusted cancer death rate for block $i$ (obtained from the corresponding county rate calculated from SEER*Stat). Suppose that $a_{ij}$ was the population of block $i$ within district $j$, and that the population for district $j$, $\sum_i a_{ij}$, was known. Then the aggregated cancer death rate for district $j$, $p_j$, was the summation of $r_i$, weighted by the proportion of block population within the district, $p_j = \sum_i r_i \, a_{ij} / \sum_i a_{ij}$. Other sex- and race-specific cancer death rates were calculated similarly.
5. The number of cancer deaths for each congressional district by sex and race was calculated by aggregating the sex- and race-specific number of cancer deaths over blocks. The number of cancer deaths for a block was the product of the crude death rate for the block (inherited from the corresponding county, which is the number of deaths for the county divided by the county population) and the block population. Again, taking non-Hispanic white men as an example, suppose that $n_i$ and $c_i$ were the number of deaths and the population for the county to which block $i$ belongs; the crude death rate for block $i$ was then $n_i / c_i$. Given that $a_{ij}$ was the population of block $i$ within district $j$, the number of deaths for block $i$ within district $j$ was $(n_i / c_i) \, a_{ij}$, and the aggregated number of deaths for district $j$ was $\sum_i (n_i / c_i) \, a_{ij}$. Other sex- and race-specific numbers of cancer deaths were calculated in a similar way.
6. The aggregated cancer death rates and the number of cancer deaths for the congressional districts (n = 426) from steps 4 and 5 were exported back to GIS and linked with the other ten congressional districts (Alaska, District of Columbia, Delaware, Montana, North Dakota, South Dakota, Vermont, Wyoming, and two Hawaii districts) for producing maps. The estimates of the number of deaths were not presented separately. Instead, they were used as a criterion when mapping death rates across congressional districts. Death rates based on a small number of deaths (< 20) for the study time period were considered unreliable and thus excluded.
7. Maps were generated using ArcGIS. For all cancer sites combined and for each cancer site, the maps for all races combined were created by categorizing the rates into five groups. Cut points for the lowest and highest groups are approximately the 10th and 90th percentiles, except for cervical cancer, for which they are the 20th and 80th percentiles. Intervening groups are set at equal length between the upper bound of the lowest group (the 10th or 20th percentile) and the lower bound of the highest group (the 90th or 80th percentile). Thus each interval represents the same absolute change over the middle range of rates, while the most extreme rates fall into the first and fifth categories. For each cancer site, to allow comparison among ethnic subgroups, the cut points for all races combined are used for race-specific maps if rates are in the same range as those for all races combined. When race-specific rates fall outside the range of rates for all races combined, cut points for the portion beyond that range are set at equal intervals, each the length of the highest category for all races combined. Cancer death rates based on a small number of deaths (< 20) are considered unstable, and congressional districts with such rates are marked with hatches.
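Steps 5 through 7 can be sketched as follows. This is a minimal illustration under stated assumptions, not the ArcGIS and SAS procedure itself: the function names and all input numbers are hypothetical, percentiles are computed with Python's `statistics.quantiles` (the study describes its cut points only as approximate percentiles), and the extension of cut points for race-specific maps is omitted.

```python
import statistics


def district_deaths(blocks, county_deaths, county_pop):
    """Step 5: sum over blocks of (county crude rate) x (block population)."""
    deaths = {}
    for county_fips, district_id, block_pop in blocks:
        crude_rate = county_deaths[county_fips] / county_pop[county_fips]
        deaths[district_id] = deaths.get(district_id, 0.0) + crude_rate * block_pop
    return deaths


def is_reliable(deaths, minimum=20):
    """Step 6: rates based on fewer than `minimum` deaths are not reliable."""
    return deaths >= minimum


def class_breaks(rates, tail_percent=10):
    """Step 7: four break values that define five map classes.

    The outer breaks sit at the tail percentiles (10th/90th by default,
    20th/80th for cervical cancer); the middle classes have equal width.
    """
    cuts = statistics.quantiles(rates, n=100, method="inclusive")  # 99 percentiles
    low = cuts[tail_percent - 1]
    high = cuts[100 - tail_percent - 1]
    step = (high - low) / 3
    return [low, low + step, low + 2 * step, high]


# Hypothetical inputs: two counties, each split between two districts.
blocks = [
    ("01001", "CD-1", 600),
    ("01001", "CD-2", 400),
    ("01003", "CD-1", 400),
    ("01003", "CD-2", 1600),
]
deaths = district_deaths(
    blocks,
    county_deaths={"01001": 50, "01003": 90},
    county_pop={"01001": 1000, "01003": 2000},
)
breaks = class_breaks([float(x) for x in range(101)])
```

Because every block of a county is assigned to exactly one district, the district totals in the sketch add back up to the county totals (140 deaths here), which is a convenient check on the block-to-district assignment.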
In describing the cancer burden by congressional district, we used direct age adjustment instead of indirect age adjustment because the direct method is more statistically correct when rates are being compared. Direct age-adjusted death rates describe the cancer death rate each congressional district would have if it had the age-sex-race distribution of the U.S. in the year 2000. Insofar as congressional districts have age-sex-race compositions different from those of the U.S. in 2000, the need for resources to eliminate disparities between districts might be more or less than that suggested by the results described in this paper.
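Direct age adjustment itself amounts to a weighted sum of age-specific rates, with weights taken from the standard population. The sketch below is illustrative only: the rates in this study were obtained from SEER*Stat, and the three broad age groups, the standard population weights, and the counts shown here are hypothetical rather than the actual 2000 U.S. standard.

```python
def direct_age_adjusted_rate(deaths, person_years, standard_pop, per=100_000):
    """Weight each age-specific rate by that age group's share of the standard."""
    total_standard = sum(standard_pop.values())
    rate = 0.0
    for age_group, standard in standard_pop.items():
        age_specific = deaths[age_group] / person_years[age_group]
        rate += age_specific * (standard / total_standard)
    return rate * per


# Hypothetical area with three broad age groups.
adjusted = direct_age_adjusted_rate(
    deaths={"0-44": 10, "45-64": 100, "65+": 400},
    person_years={"0-44": 100_000, "45-64": 50_000, "65+": 40_000},
    standard_pop={"0-44": 650, "45-64": 220, "65+": 130},
)
```

Two areas adjusted to the same standard can be compared directly, since differences in their age structures no longer contribute to the difference in rates.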
The views and opinions expressed in this article do not necessarily reflect those of the National Cancer Institute.
1. Devesa SS, Grauman DJ, Bolt WJ, Pennello GA, Hoover RN, Fraumeni JFJ: Atlas of cancer mortality in the United States, 1950-94. 1999, Bethesda, MD: National Institutes of Health, National Cancer Institute, NIH Publication No. 99-4564
2. Freeman HP, Wingrove BK: Excess Cervical Cancer Mortality: A Marker for Low Access to Health Care in Poor Communities. 2005, Rockville, MD: National Cancer Institute, Center to Reduce Cancer Health Disparities, NIH Pub No. 05-5282
3. American Cancer Society: Cancer Facts and Figures 2006. 2006, Atlanta, American Cancer Society
4. Hu TW, Bai J, Keeler TE, Barnett PG, Sung HY: The impact of California Proposition 99, a major anti-smoking law, on cigarette consumption. J Public Health Policy. 1994, 15 (1): 26-36.
5. Meier KJ, Licari MJ: The effect of cigarette taxes on cigarette consumption, 1955 through 1994. Am J Public Health. 1997, 87 (7): 1126-1130.
6. Peterson DE, Zeger SL, Remington PL, Anderson HA: The effect of state cigarette tax increases on cigarette sales, 1955 to 1988. Am J Public Health. 1992, 82 (1): 94-96.
7. Haenszel W, Shimkin MB, Miller HP: Tobacco smoking patterns in the United States. Public Health Monograph 45. 1955, Washington (DC), US Government Print Off
8. Sturgeon SR, Schairer C, Grauman D, El Ghormli L, Devesa S: Trends in breast cancer mortality rates by region of the United States, 1950-1999. Cancer Causes Control. 2004, 15 (10): 987-995. 10.1007/s10552-004-1092-2.
9. Jemal A, Ward E, Wu X, Martin HJ, McLaughlin CC, Thun MJ: Geographic patterns of prostate cancer mortality and variations in access to medical care in the United States. Cancer Epidemiol Biomarkers Prev. 2005, 14 (3): 590-595. 10.1158/1055-9965.EPI-04-0522.
10. National Cancer Institute Cancer Statistics Branch DCCPS Surveillance Research Program: Surveillance, Epidemiology, and End Results (SEER) Program (www.seer.cancer.gov) SEER*Stat Database: Incidence - SEER 9 Regs Public-Use, Nov 2003 Sub (1973-2001). 2004, Bethesda, MD
11. US Census Bureau: 108th Congressional District Summary Files, United States Census 2000. [http://www.census.gov/Press-Release/www/2003/108th.html] V1-D00-C108-08-US1
12. Environmental Systems Research Institute (ESRI): ArcGIS Software for Windows [computer program]. Version 9.0. 2005, Redlands, CA, Environmental Systems Research Institute (ESRI)
13. SAS Institute Inc.: SAS-Statistical Analysis Software for Windows [computer program]. Version 9.0. 2005, Cary, NC, SAS Institute Inc.
14. Flowerdew R, Green M: Developments in areal interpolation methods and GIS. Annals of Regional Science. 1992, 26: 67-78. 10.1007/BF01581481.
15. Reibel M, Bufalino ME: Street-weighted interpolation techniques for demographic count estimation in incompatible zone systems. Environment and Planning A. 2005, 37 (1): 127-139. 10.1068/a36202.
16. Martin D: An assessment of surface and zonal models of population. International Journal of Geographical Information Systems. 1996, 10 (8): 973-989. 10.1080/026937996137684.
17. Tobler WR: Smooth Pycnophylactic Interpolation for Geographical Regions. Journal of the American Statistical Association. 1979, 74 (367): 519-530. 10.2307/2286968.
18. Mennis J: Generating surface models of population using dasymetric mapping. Professional Geographer. 2003, 55 (1): 31-42.
19. Xie YC: The overlaid network algorithms for areal interpolation problem. Computers Environment and Urban Systems. 1995, 19 (4): 287-306. 10.1016/0198-9715(95)00028-3.
20. Gregory IN, Ell PS: Breaking the boundaries: Geographical approaches to integrating 200 years of the census. Journal of the Royal Statistical Society Series A-Statistics in Society. 2005, 168: 419-437.
21. US Census Bureau: Hierarchical relationship of census geographic entities. [http://www.census.gov/geo/www/cengeoga.pdf]
22. US Census Bureau: TIGER®, TIGER/Line® and TIGER-Related Products. [http://www.census.gov/geo/www/tiger/tgrcd108/spblk108.txt]
23. US Census Bureau: Federal Information Processing Standards (FIPS) Codes. [http://www.census.gov/geo/www/fips/fips.html]
24. Pickle LW, White AA: Effects of the choice of age-adjustment method on maps of death rates. Stat Med. 1995, 14 (5-7): 615-627.
We gratefully acknowledge Dr. Lance A Waller from Rollins School of Public Health at Emory University for his comments and suggestions on the early version of the manuscript.
The author(s) declare that they have no competing interests.
YH, EMW, and AJ conceived the analysis and wrote the final version of the manuscript. LWP provided technical support on the method and critically revised the manuscript. MJT conceptualized and critically revised the manuscript.