Spatial and multidimensional visualization of Indonesia's village health statistics
- Bambang Parmanto1Email author,
- Maria V Paramita†1,
- Wayan Sugiantara†1,
- Gede Pramana†1,
- Matthew Scotch†2 and
- Donald S Burke†3
© Parmanto et al; licensee BioMed Central Ltd. 2008
Received: 18 December 2007
Accepted: 11 June 2008
Published: 11 June 2008
A community health assessment (CHA) is used to identify and address health issues in a given population. Effective CHA requires timely and comprehensive information from a wide variety of sources, such as: socio-economic data, disease surveillance, healthcare utilization, environmental data, and health resource allocation.
Indonesia is a developing country with 235 million inhabitants over 13,000 islands. There are significant barriers to conducting CHA in developing countries like Indonesia, such as the high cost of computing resources and the lack of computing skills necessary to support such an assessment.
At the University of Pittsburgh, we have developed the Spatial OLAP (On-Line Analytical Processing) Visualization and Analysis Tool (SOVAT) for performing CHA. SOVAT combines Geographic Information System (GIS) technology along with an advanced multidimensional data warehouse structure to facilitate analysis of large, disparate health, environmental, population, and spatial data.
The objective of this paper is to demonstrate the potential of SOVAT for facilitating CHA among developing countries by using health, population, healthcare resources, and spatial data from Indonesia for use in two CHA cases studies.
Bureau of Statistics administered data sets from the Indonesian Census, and the Indonesian village statistics, were used in the case studies. The data consisted of: healthcare resources (number of healthcare professionals and facilities), population (census), morbidity and mortality, and spatial (GIS-formatted) information.
The data was formatted, combined, and populated into SOVAT for CHA use. Case study 1 involves the distribution of healthcare professionals in Indonesia, while case study 2 involves malaria mortality. Screen shots are shown for both cases. The results for the CHA were retrieved in seconds and presented through the geospatial and numerical SOVAT interface.
The case studies show the potential of spatial and multidimensional analysis using SOVAT for community health assessment in developing countries. Since SOVAT is based primarily on open-source components and can be deployed using small personal computers, it is cost-effective for developing countries. Also, combining the strength in analysis and the ease of use makes tools like SOVAT ideal for healthcare professionals without extensive computer skills.
Effective community health assessment (CHA) requires timely and comprehensive information from a wide variety of sources . A CHA might be conducted in order to: identify or predict community problems; develop strategies to solve the problems; manage resources allocation; and, in turn, improve the quality of life in the community. Data sets used to conduct CHA usually come from several sources such as socio-economic data, disease surveillance, healthcare utilization, healthcare resources, environmental, and health resource allocation. By integrating these data sets, CHA can identify factors that affect the health of a population and determine the availability of resources within the community to adequately address these factors. Geographic Information System (GIS) is a technology to store, manipulate, analyze, and display geographically referenced information. Outside of healthcare, it has been shown to be valuable in a wide range of situations such as urban planning, environmental resource management, emergency planning, and transportation forecasting. CHA has also started to use GIS. A survey conducted by Canadian health professionals shows that 70% of respondents felt that community health decision making could be enhanced using a GIS application . A previous study by the authors suggests that in the United States, GIS is used in conjunction with other software to analyze health and population data and perform numerical-spatial problem solving in CHA . Numerical analysis only involves numerical data such as the calculation of a morbidity or mortality rate, while spatial analysis only involves spatial data such as geographical coordinates (such as latitude and longitude). The public health decision-making process can be considerably enhanced by the development of decision-support tools that allow spatial and multidimensional processing of information relatively quickly and easily. We have developed such an integrated tool called SOVAT (Spatial OLAP (On-Line Analytical Processing) Visualization and Analytical Tool) that allows public health professionals to conduct data linkage and perform quick multidimensional analysis visually . In this paper, we present spatial and multidimensional visualization techniques that can be used to conduct CHA in different administrative levels using data sets that are routinely collected by Indonesia's Bureau of Statistics. The case presents the potentials and challenges of using a visual decision-support system like SOVAT in developing countries such as Indonesia.
Indonesia, with its 235 million inhabitants and more than 13,000 islands, is the world's 4th largest country (in terms of population). The use of a spatial decision-support system for CHA can greatly enhance the health assessment process. Until recently, most of the visible CHAs in Indonesia were conducted by international organizations. For example:
An environmental assessment after the Aceh tsunami (December 2004) to identify the environmental and community issues related to the disaster and to prioritize reconstruction processes .
• An assessment of the poverty and damage to health facilities after the Yogyakarta earthquake (May 2006) to provide a reconstruction plan for the victims .
• These examples suggest that the challenges in integrating and analyzing data from various sources require skills and resources that can only be assembled in the face of catastrophic events and by international organizations. The availability of a visual decision-support system can simplify the process of data integration and analysis and make the CHA process affordable to local and regional government offices in Indonesia.
The results presented in this paper are case studies of CHA in Indonesia by combining numerical and spatial data sets. The source of the numerical data is primarily from the Village Potential Statistics (PODES) of 2003, while the source of the spatial data is from the Indonesian Bureau of Statistics. There were a total of 60,000 villages in Indonesia in 2003. Data are collected from each individual village and then aggregated into a sub-district level. We integrated the numerical data with spatial data consisting of three levels of administrative boundaries: provinces, counties, and sub-districts. Administrative divisions in Indonesia consist of four levels. The country is divided into 33 provinces, and each province is sudivided into counties/regencies/city, which are further subdivided into subdistricts (kecamatan), and again subdivided into villages. Since subdistrict is usually the lowest unit for planning and assessment, we use the first three administrative levels. The result of the integration is a multidimensional database of 33 provinces, 445 counties, and 4,000 sub-districts.
The village statistics data sets contain indicators that are related to environmental and health conditions. The health-related variables include disease outbreak and mortality number, primary water sources, waste-treatment facilities, number of health facilities and medical staffs, natural disaster, and pollution.
There are two study cases presented in this paper. The first one is the distribution of physicians in Indonesia. The second one is the comparison of mortality numbers due to malaria diseases in rural and urban areas.
Case Study 1: The distribution of healthcare professionals in Indonesia
Indonesia has an uneven distribution of population among its major islands. More than two-thirds of the country's population of 235 million is squeezed into the small Java island, just over 7% of its land mass. By contrast, the island of Papua in the easternmost part of the country represents 22% of the total land mass, yet has only 1% of the population. Like most countries, healthcare facilities and healthcare professionals tend to be concentrated in urban areas. Multidimensional analysis can provide information as to whether rural areas are indeed underserved by healthcare professionals and facilities. It can also identify which rural areas are in greatest need of healthcare services. The analysis of CHA can potentially be useful for planning and resource allocation. In this case study, healthcare service is represented by the number of physicians in the region.
To analyze the number of healthcare professionals, two measurements have been constructed in SOVAT: the number of healthcare professionals and the ratio of healthcare professionals/100,000 population. Healthcare professionals consist of physicians, nurses, and midwives. A dimension called healthcare professional is constructed in SOVAT with physician, nurse, and mid-wife as its members. Using this dimension, researchers can select which healthcare professionals he or she wants to analyze by choosing one of the members of the dimension.
Case Study 2: The malaria mortality cases in rural and urban areas
The mortality rate due to malaria is usually higher in rural areas. Using the multidimensional analytical feature in SOVAT, users can determine if this is the case with Indonesia. To do this analysis, "Mortality Number" is used as a measurement in conjunction with a dimension called "Diseases." Malaria is one of the diseases being tracked in the "Disease" dimension. In addition to the "Mortality Number" measurement and "Diseases" dimension, we also use another dimension called "Area", whose members comprise "Urban" and "Rural." By analyzing the data using two dimensions (disease and urban-rural areas), we can compare the mortality number of malaria in urban and rural areas.
This kind of analysis allows public health professionals to determine which areas need priority in malaria prevention. Our results conclude that the eastern part of Indonesia, especially the rural areas, is in greatest need.
The case studies show the potential of spatial and multidimensional analysis using SOVAT for CHA in developing countries.
In CHA, timely information is important for decision making. The process of data integration and analyses has been the stumbling block in the CHA process. Using an integrated tool that makes the process fast and easy will allow the assessment to be conducted rapidly. A faster CHA process, in turn, will provide timely information to support the decision-making process by public health professionals.
The strength of multidimensional and spatial analysis is that it allows the user to see the information visually on a map that is otherwise hidden in the complexity of the data and variables. Since the information is presented visually and no statistical skills are required, this type of decision-support system can potentially be used by more users and higher level executives.
Spatial and multidimensional analysis can be even more useful when extensive data from various sources are available. The case studies presented in this paper can be extended by adding more data sets, for example disease surveillance data, hospitalization data, or environmental data. The more data sets integrated into the decision-support system, the more dimensions and richer information that can potentially be uncovered by the system, hence providing higher quality of information for public health decision makers.
SOVAT is an integrated decision support system, and is not designed to replace GIS systems such as ArcGIS. The map capabilities in SOVAT include basic operations such as zoom in/out and buffering, handling of vector data such as point and line, exporting to an image, and printing the map. However, the map in the current version of SOVAT cannot be used to handle more advanced GIS operations such as map projection and map customization.
Spatial and Multidimensional Visualization using SOVAT
SOVAT (Spatial OLAP Visualization and Analytical Tool) is a novel decision-support system developed by the University of Pittsburgh . SOVAT combines two key technologies: On-Line Analytical Processing (OLAP) and a Geographic Information System (GIS) to provide advanced visualization and analyses for large multidimensional data sets. OLAP technology supports multidimensional data modeling that allows for rapid queries of multidimensional data and enables powerful analysis and discovery through a visual display on easy-to-use graphical user interfaces. With OLAP, data are represented conceptually as a multidimensional cube which enables the user to view different dimensions of multiple datasets and then query several dimensions at once. OLAP supports several distinct functions for data retrieval and analysis, such as: drill-up (decreasing granularity, for example, from data by country to data by province), drill-down (increasing granularity, for example, from province to country), and slice and dice (retrieving a sub-section of data, for example, data for May and June for only one province). All of these functions act on the multidimensional data cube and are performed almost instantaneously.
The Indonesian data sets are comprised of demographic data, health indicators, and spatial data (maps). The data sets come in different levels of detail and were collected using different collection methods. Indonesia is divided into provinces. Provinces consist of regencies (kabupaten) and cities (kota) which together are called counties in this paper. One level below counties is sub-district (kecamatan), while the lowest administrative level – that is the one below sub-district – is called village (desa). The definition of village applies to both rural and urban areas. Political changes in Indonesia since the last decade have affected the administrative division, with the tendency of a growing number of provinces and regencies. The latest data from the Ministry of Internal Affairs show that Indonesia currently has 33 provinces and 445 counties . There are more than 75 new counties since the year 2000, an increase of more than 20%.
Statistical Data: Census and Village Statistics
Data sets from the Indonesian Census and the Indonesian village statistics were used in the case studies. Similar to many countries, a census in Indonesia is conducted every decade. Indonesia conducts a series of population, agricultural, and economic censuses. The population census takes place in the years ending with "0"; the agricultural is conducted in the years ending with "3"; while the economic census is held in the years ending in "6" . Of these censuses, the population census is the most comprehensive and is aimed at gathering characteristics of the Indonesian population such as gender, age, marital status, education level, and occupation. The 2000 Population Census is the latest census and the first census conducted using complete enumeration. Since the 2000 Census was aimed at providing users with small area statistics, statistics of villages can be established from the data collected. In addition to the censuses, the Indonesian Bureau of Statistics also conducts an intercensal population survey (SUPAS) in between the two censuses . The survey is designed to collect the population statistic that is comparable to the population census. Another approach to data collection, rather than to collect data on each household and individual, is to collect statistics on villages [11, 12]. Village statistics, called Potensi Desa (PODES), are the main data source of this project. Village statistics provide information that otherwise is not available. Among the objectives of village-level data collection are:
• Providing information of potential and actual development in the village by providing socio-economic conditions and available facilities;
• Providing a database for regional planning as well as a progress report on the development at the village-level; and,
• Providing core data of the small area statistics.
The information in village-level collection includes: the number of the population and households, the housing and environmental data, the education and health-related data, socio-cultural information, recreation and sport facilities, transportation, and communication.
While demographic data are mostly from the Bureau of Statistics (BPS), more specific health indicators are available from the Ministry of Health . These data include: a general mortality rate, an infant mortality rate, life expectancy, top diagnoses for in-patients and out-patients, and morbidity of infectious diseases Although these data are not used for this project, it is a potential source to use in future works.
SOVAT uses spatial data in polygon format that consists of administrative-boundary maps. The map is rendered using certain color schemes to display the results of OLAP queries. For example, in Figure 2, the darker the color, the higher are the results of performed queries. In addition to the polygon data, SOVAT can also have additional layers using lines and point data, for example to represent rivers, streets, cities, or industrial places. The additional layers can be used to perform other spatial analysis such as buffering.
The digital map of Indonesia is provided by the BPS. The existing spatial data come from four different levels: from province level down to village level. However, due to the low accuracy of the village-level spatial data, we chose to use one level higher than the village – that is the sub-district (kecamatan) level.
Data Linkage and Multidimensional Modeling
Geographic location was used as the primary linkage variable that connects statistical and spatial data sets. The linking process is done using an administrative code that is uniquely defined for every administrative unit. Standardization is the key for the linking process. Most developed countries have a uniform identification for every geographic entity that can be used by the government and private sector. For example, in the United States there is the Federal Information Processing Standards codes (FIPS codes), a standardized code for every geographic entity in the US issued by the National Institute of Standards and Technology (NIST). This code is used by the US Census Bureau and other government agencies that generate statistical data sets.
Unfortunately, there is no such uniform identification standard for geographic entities in Indonesia. The lack of a uniform code leads to the use of geographic names (such as the name of counties and villages) as the key identifiers of the geographic entity. Geographic names are very susceptible to typographical errors and inconsistent spelling. As a result, the same geographic entities can be written differently in different reports even if the reports come from the same government institution (Bureau of Statistics). Several solutions were tried for this problem. Some of the data were corrected using a pattern-matching approach, while the remainder that could not be recognized using pattern matching were manually corrected.
The need of a uniform code is more important in light of the rapid changes of administrative boundaries in the past decade. The problem with spatial data becomes more complex since the updating process of spatial data is not as fast as the process of administrative changing. While the administrative code is easily updated with more recent changes, the map is still outdated. For example, the administrative boundaries are changed up to 2007, however the latest version of the map we use is from 2000.
Since there is no official new map released, there is no other option rather than to translate the data into an older map.
The project is supported in part by grant # 5U01GM070708-04 from the National General Medical Sciences (NGMS) through the MIDAS (Models of Infectious Disease Agent Study) – Indonesia supplement project and by NLM training grant #T15 LM07056 to MS.
- Committee for the Study of the Future of Public Health, Division of Health Care Services, Institute of Medicine: The Future of Public Health. 1988, Washington, DC: National Academy PressGoogle Scholar
- Bedard Y, Gosselin P, Jerrett M, Elliott SJ, Catelan R, Poitras P, Gingras A: GIS and OLAP in Health Surveillance: Needs Analysis for Successful Integration. 2000, Presented to the Health Protection Branch, Health CanadaGoogle Scholar
- Scotch M, Parmanto B, Gadd C, Sharma R: Exploring the role of GIS during community health assessment problem solving: experiences of public health professionals. International Journal of Health Geographics. 2006, 5: 39-10.1186/1476-072X-5-39. doi:10.1186/1476-072X-5-39.PubMedPubMed CentralView ArticleGoogle Scholar
- Scotch M, Parmanto B: Development of SOVAT: A numerical-spatial decision support system for community health assessment research. International Journal of Medical Informatics. 2006, 75 (10–11): 771-784. 10.1016/j.ijmedinf.2005.10.008.PubMedView ArticleGoogle Scholar
- United Nations Environment Programme (UNEP): National Rapid Environmental Assessment – Indonesia. Retrieved December 13, 2007,http://www.unep.org/tsunami/reports/TSUNAMI_INDONESIA_LAYOUT.pdf
- Officer for the Coordination of Humanitarian Affairs: Map of Poverty and Health Facility Damaged. Retrieved December 13, 2007.http://www.reliefweb.int/rw/fullMaps_Sa.nsf/luFullMap/52DDD154B130C0AE8525722600738274/$File/ocha_HLT_idn041006.pdf?OpenElement
- Scotch M, Parmanto B, Monaco V: Usability Evaluation of the Spatial OLAP Visualization and Analysis Tool (SOVAT). Journal of Usability Studies. 2007, 2: 76-95.PubMedPubMed CentralGoogle Scholar
- Indonesia Ministry of Internal Affairs: Code and Data Administrative Areas in Indonesia. 2004Google Scholar
- Dwijosumono S: National Statistical System: A statistical strategies in Indonesia. BPS-Statistics, Jakarta Indonesia. 2001, Retrieved December 13, 2007,http://www.singstat.gov.sg/statsres/conferences/governance/indonesia.pdfGoogle Scholar
- Data Statistik Indonesia (Statistics Indonesia). Retrieved December 13, 2007,http://www.datastatistik-indonesia.com/content/view/926/948/
- Smeru Research Report: Developing a Poverty Map for Indonesia: A Tool for Better Targeting in Poverty Reduction and Social Protection Programs. Retrieved December 13, 2003,http://www.unescap.org/pdd/projects/pov_map/5b-Indonesia%20Poverty%20Mapping%20-%20Main%20Report.doc
- Food and Agriculture Organization of United Nations: Collection of Village-Level Data Through The. 2003, Retrieved December 13, 2007,http://www.fao.org/ES/ess/meetings/download/apcas21/APCAS-06-16.pdfGoogle Scholar
- Bank Data Department Kesehatan RI (Indonesia Ministry of Health Data Repository). Retrieved December 13, 2007,http://bankdata.depkes.go.id
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.