Web GIS in practice IX: a demonstration of geospatial visual analytics using Microsoft Live Labs Pivot technology and WHO mortality data
© Kamel Boulos et al; licensee BioMed Central Ltd. 2011
Received: 4 March 2011
Accepted: 16 March 2011
Published: 16 March 2011
The goal of visual analytics is to facilitate the discourse between the user and the data by providing dynamic displays and versatile visual interaction opportunities with the data that can support analytical reasoning and the exploration of data from multiple user-customisable aspects. This paper introduces geospatial visual analytics, a specialised subtype of visual analytics, and provides pointers to a number of learning resources about the subject, as well as some examples of human health, surveillance, emergency management and epidemiology-related geospatial visual analytics applications and examples of free software tools that readers can experiment with, such as Google Public Data Explorer. The authors also present a practical demonstration of geospatial visual analytics using partial data for 35 countries from a publicly available World Health Organization (WHO) mortality dataset and Microsoft Live Labs Pivot technology, a free, general purpose visual analytics tool that offers a fresh way to visually browse and arrange massive amounts of data and images online and also supports geographic and temporal classifications of datasets featuring geospatial and temporal components. Interested readers can download a Zip archive (included with the manuscript as an additional file) containing all files, modules and library functions used to deploy the WHO mortality data Pivot collection described in this paper.
What is visual analytics?
Visual analytics is an emerging area of research and practice aimed at leveraging the excellent capabilities of humans in terms of visual information exploration. Thanks to the enormous increase in the processing power of computers and their graphics handling capabilities, we are now able to implement extremely powerful visual and interactive knowledge discovery environments  that can empower individual researchers or groups of researchers to make well-informed decisions in complex situations . The goal of visual analytics is to make the processes of data elaboration, information gathering and knowledge generation transparent to tool users. Visual analytics is an enhancement of the information visualisation concept and can be seen as an integrated approach combining visualisation, human factors and data analysis .
The basic idea of visual analytics is to visually represent the information, allowing the human to directly interact with the underpinning data to gain insight, draw conclusions, and ultimately make better decisions. The synergic integration between computation, visual representation, and interactive thinking supports intensive analysis by harnessing the human visual system to support the process of sense-making, in which information is collected, organised, and analysed to generate knowledge and plan actions. The goal is not only to permit users to detect expected events, such as might be predicted by models, but also to help users discover the unexpected--the surprising anomalies, changes, patterns, and relationships that are then examined and assessed to develop new insights .
Visual analytics is an inherently multi-disciplinary field [5–7] that aims to combine the methods and strengths of various research areas, including human-computer interaction (HCI) and usability engineering, cognitive and perceptual science, decision science, information visualisation, scientific visualisation, geospatial visualisation, databases, data mining, statistics, knowledge discovery, data management and knowledge representation, geospatial analytics, and graphics and rendering, among others.
Visual analytics comprises several areas spanning analytical reasoning techniques (to enable users to assess, plan, and make decisions), visual representations and interaction techniques (to take advantage of the human eye's broad bandwidth pathway into the mind; to allow users to see, explore, and understand large amounts of information at once), data representations and transformations (that convert all types of conflicting and dynamic data into visual and analytical representations), and techniques to support production, presentation, and dissemination of the results.
Visual analytics can easily deal with highly heterogeneous and noisy data; it is intuitive and requires no understanding of complex mathematical or statistical algorithms or parameters by its operators and it is invaluable when little is known about the data and the exploration goals are vague; and
Visual analytics can tackle hard analytical problems where neither the machine nor the human alone can efficiently and effectively find a solution.
Geospatial visual analytics as a specialised subtype of visual analytics: resources and examples
Geospatial visual analytics is an emerging multidisciplinary area which supports spatio-temporal analytical reasoning and decision-making through interactive visual interfaces (such as maps and other visual artefacts) that are linked to computational methods [9, 10]. A good online resource and starting point to learn about geospatial visual analytics is GeoAnalytics.net , a portal run by the Commission on GeoVisualization of the International Cartographic Association. Gennady and Natalia Andrienko also offer a comprehensive 114-slide tutorial that can be downloaded at .
In this paper, we demonstrate a basic geospatial visual analytics exercise using Microsoft Live Labs Pivot technology , a free, general purpose visual analytics tool that offers a fresh way to visually browse and arrange massive amounts of data (and images, e.g., ) online. Microsoft Pivot enables powerful visual zooming in and out of Web databases and the discovery of new patterns and relations in them that would otherwise be invisible in standard Web browsing of large datasets . It can also be used for geographic classifications, if the explored dataset includes a geospatial component, as we are going to show in this article.
Microsoft Pivot technology demonstration using a publicly available WHO mortality dataset
Microsoft Pivot is a data visualisation technology that can be used to easily interact with, analyse, filter, classify and cluster large amounts of image data . Pivot builds on Silverlight  and Deep Zoom  technologies, both also by Microsoft. Pivot can be created and deployed as a standalone tool using Microsoft Windows 7, Vista, or XP [29, 32, 33]. It can also be created using Microsoft Windows and deployed in a Web-enabled environment using most Web servers such as Apache, IIS (Microsoft Internet Information Services), etc. In this tutorial we will show how to create and deploy a Pivot project (also known as Pivot collection) using Microsoft Windows and a Web-enabled environment running a Linux (CentOS 5) operating system and Apache Web server.
Pivot collection creation methods
Pivot collections can be created by either automated or manual processes. In this tutorial, we will focus on the manual process using the Excel plug-in for Pivot collections that can be freely downloaded at . The manual process is both simple and easy to implement. Below are some notes about the automated and manual methods for creating Pivot collections:
- Automated process: In this method, a computer program is developed that automates the process of creating Pivot collection images by making use of already existing libraries and modules. Common modules and libraries include, but are not limited to:
- Python Imaging Library (PIL) : Version 1.1.6 or later is used in processing, creating and formatting images in various formats (JPEG--Joint Photographic Experts Group, PNG--Portable Network Graphics, etc.) for different zoom levels to produce the required visualisation; and
- Python Deep Zoom Tools: Python has a Deep Zoom library, version 0.1.0 or later , that is used for converting and subdividing images into various zoom levels and tiles (DZI--Deep Zoom Image file format or 'tiled image pyramid' ) in order to produce multi-scale zoomable high-resolution images.
- Microsoft Silverlight : Used in developing and creating the Pivot platform for standalone and Web-enabled applications; and
- PivotViewer [26, 38]: The PivotViewer Silverlight control is included in the 'PivotViewer Collection Tool for Microsoft Excel' (the above mentioned Excel plug-in ). It is used for presenting Pivot collections (data and images) on demand and for visualising Pivot collections as a group.
Essential components of a deployed Microsoft Pivot collection
Irrespective of the Pivot collection creation method used, there are some basic modules/libraries provided by Microsoft Live Labs Pivot that are needed to complete the creation and deployment of Pivot collections, namely:
- PivotSimpleDemo.xap: This module/library is provided by Microsoft Live Labs. It is in compiled Silverlight application file format (actually a renamed .zip archive that contains all the files necessary for the application). When run in a Web browser, it produces a graphical user interface (GUI) for the Pivot collection.
- Collection_files: This is a directory generated by the Pivot Collection Tool when the collection (images and data) is published using the Excel plug-in for Pivot collections. It contains the files (.cxml, .xml and .html) and images in various Deep Zoom levels necessary to launch and deploy the Pivot collection.
- Deep Zoom Images: These are the images at various zoom levels created with the Excel plug-in for Pivot collections or automatically using the Python Deep Zoom Tools and Python Imaging Library. The images are in DZI format and are stored in a subfolder ([images]) of the [collection_files] directory.
Creation and deployment of a Pivot collection with the PivotViewer Collection Tool (add-in) for Microsoft Excel using a publicly available WHO mortality dataset (1990-2008)
In this demonstration, we used a sample of World Health Organization (WHO) mortality data, downloadable at . The data provide mortality statistics and causes of death by country (WHO member countries), sex, age, and year. We used the following sample of data in creating the Pivot collection for this demonstration (the total number of records in the collection data sample is 984 out of 38215 records in the source WHO sheet):
- Country: Sample data records from 35 WHO member countries were used. The included WHO member countries are Afghanistan, Albania, Algeria, Andorra, Angola, Barbados, Belarus, Belgium, Belize, Benin, Bhutan, Cameroon, Canada, Cape Verde, Chile, China, Denmark, Djibouti, Dominica, Dominican Republic, Ecuador, Egypt, El Salvador, Equatorial Guinea, Nigeria, United Kingdom, United Republic of Tanzania, United State of America, Uruguay, Uzbekistan, Vanuatu, Venezuela, Yemen, Zambia, and Zimbabwe. Please note that only a subset of data from each country (subsets of country records and of fields in sampled records) is used in this demonstration and not the whole data for each country. Consequently, the current technology demonstrator  should not be used to perform any serious country analyses or comparisons, or to draw any mortality data-related scientific conclusions.
- Age: The age range of the population in the sample is 0-100 (0, 1, 5, 10, ..., 95, 100).
- Sex: Males (represented by blue man icon image), females (represented by pink woman icon image), and both (both sexes; represented by black man-and-woman-together icon image). Developers need to carefully decide on an appropriate unit value for each icon image instance (the numeric value represented by one icon or how many cases each icon will represent, e.g., one case, 10 cases, 100 cases, etc.), depending on the nature, distribution, and value range of the underpinning data.
- Year: The years covered by the data are 1990, 2000, and 2008.
The following steps should be followed in order to create the demonstrator Pivot collection using the WHO mortality data:
- Step 1: Download WHO mortality data at  and select the sample dataset to be used.
- Step 4: Create a project folder, name it [demodata] (could also be any other name), and ensure that PivotSimpleDemo.xap, Silverlight.js and collection.html are all placed in it.
- Step 5: Open the WHO sample data as an Excel spreadsheet.
- Step 6: Click on the Pivot Collection tab of the Excel workbook, then click on the New Collection tab to start a new Pivot collection workbook (the new workbook will look similar to the one shown in Figure 2).
- Image Location: This is the directory/file location of the images. Use the Choose Image Item of the Pivot workbook to locate where the images are stored;
- Preview: shows a preview of the images in Image Location column;
- Name: One of the columns that will accommodate the sample data source. Use the Insert function of the Pivot workbook to insert more columns that are needed and name the columns as required. The extra columns are the same as the Excel spreadsheet columns (Country, Year, Sex, Age);
- Href: Used to provide a hyperlink to data sources or image locations (can be left blank); and
- Description: This column is used to describe the images of data sources in the collection (can be left blank).
- Step 10: Finally, publish the Pivot collection to the Web (use a Web server such as Apache or IIS) by copying the whole folder generated in Step 9 (when the Pivot collection was published on the local hard drive) to the hosting Web server. Also ensure that collection.html, Silverlight.js, and PivotSimpleDemo.xap files are all copied to the same Web server folder containing the Pivot collection. Edit the collection.html file to point to the URL of the .cxml file on the Web server. Figure 5 shows a snippet of the collection.html file, with a red arrow pointing to the URL of .cxml file.
Please note that Pivot collections are (at the time of writing) only supported in 32-bit Internet Explorer 7.0 and later or Firefox 3.6 and later; there is no support for 64-bit versions of Internet Explorer, but this might change soon, with the expected release of a 64-bit runtime of Silverlight .
Discussion and conclusions
Conventional visualisations vs. visual analytics
Visualisations enable not only the intuitive representation and communication of textual and numerical information, but also the highlighting of patterns, trends and relationships in the visualised information that might be difficult to discern using written narrative and numerical forms alone . Visual analytics builds on this, but takes conventional visualisations many steps further by focusing on what a user wants to know (user > question via interactive visual interfaces > query data > data transformation > presentation), rather than merely on what data are available (data > data transformation > presentation). Indeed, the goal of visual analytics is to facilitate the discourse between the user and the data by providing dynamic displays and versatile visual interaction opportunities with the data that can support parallel lines of thought, analytical reasoning, and the exploration of data from multiple user-customisable aspects .
Collaborative and participatory aspects on the Social Web ("Web 2.0")
The rise of Web-based geospatial information visualisation has opened up many powerful opportunities. Web-based visualisations can potentially be combined into visualisation 'mash-ups' and multi-panel analytics dashboards with reusable components. The social and collaborative aspects of visualisation also seem to have taken on new importance on the Web . A number of options exist today for users to aggregate, present, and share these visualisations and analytical artefacts on their own Web sites, in online or e-mail discussions, or in social applications such as blogs, Facebook, etc. (see, for example, Figure 1). Furthermore, some research groups have explored the collaborative and participatory development of Web-based information presentation within the field of visual analytics .
Capturing and documenting analytical artefacts
The ability to document and share analytical artefacts is considered one of the essential requirements in geospatial visual analytics software. Andrienko and Andrienko  highlight the importance of the software being able to adequately capture and save analytical observations and artefacts, as well as their spatio-temporal references. There needs to be an explicit and easily manageable representation of the knowledge gained from an analysis; of how the knowledge was obtained (observations, inferences, involvement of background knowledge and additional information from other sources); of any associated assumptions, confidences, conditional judgements, inconsistencies, competing hypotheses, alternative interpretations; and of any relations between the collected analytical artefacts. The ideal tool should be able to capture and store all these details in a knowledgebase without disrupting the analytical process by the user. The knowledgebase should be made browsable and searchable, as well as editable and updatable by authorised users, who can also use it to produce reports about a given analysis and explanations of conclusions made .
Andrienko and Andrienko  also mention the scalability challenges that need to be addressed in geospatial visual analytics tools, including information scalability (capability to extract relevant information from massive data streams), visual scalability (capability to effectively display massive datasets), human scalability (scaling gracefully from single users to collaborative environments), software scalability (capability of a software system to interactively manipulate large datasets), and display scalability (effective use of everything from wall-sized to phone-sized displays). Yuan et al.  describe a visual analytics exercise on a large display wall consisting of 32 tiled 32-inch LCD (Liquid Crystal Display) panels, while Pattah et al.  discuss some techniques that can be used to overcome display limitations on small form factor mobile devices.
While the practical demonstration described in this paper might not fully address all these requirements and challenges, we hope it could still serve as a good introduction to the subject of geospatial visual analytics and stimulate wider adoption of these technologies to empower users at all levels in the public health and healthcare sectors, as well as ordinary citizens , to make the most out of the massive and ever-growing amounts of data that are available to us today.
- Aigner W, Bertone A, Miksch S: Tutorial: Introduction to Visual Analytics. In HCI and Usability for Medicine and Health Care, Third Symposium of the Workgroup Human-Computer Interaction and Usability Engineering of the Austrian Computer Society, USAB 2007, Graz, Austria, 22 November 2007, Proceedings, Lecture Notes in Computer Science--LNCS 4799. Edited by: Holzinger A. Springer-Verlag; 2007:453–456.
- Heer J, Agrawala M: Design considerations for collaborative visual analytics. Information Visualization 2008,7(1):49–62.View Article
- Keim D, Robertson G, Thomas J, van Wijk J: Guest Editorial: Special Section on Visual Analytics. IEEE Transactions on Visualization and Computer Graphics 2006,12(6):1361–1362.View Article
- Cook K, Earnshaw R, Stasko J: Discovering the Unexpected. IEEE Computer Graphics and Applications 2007,27(5):15–19.PubMedView Article
- Keim D, Andrienko G, Fekete J, Carsten G, Kohlhammer J, Melancon G: Visual Analytics: Definition, Process, and Challenges. In Information Visualization--Human-Centered Issues and Perspectives. Lecture Notes in Computer Science--LNCS 4950. Edited by: Kerren A, Stasko JT, Fekete J-D, North C. Springer-Verlag; 2008:154–175.
- Keim DA, Mansmann F, Thomas J, Ziegler H: Visual Analytics: Scope and Challenges. In Visual Data Mining--Theory, Techniques and Tools for Visual Analytics. Lecture Notes in Computer Science--LNCS 4404. Edited by: Simoff SJ, Böhlen MH, Mazeika A. Springer-Verlag; 2008:76–90.
- Thomas JJ, Cook KA, (Editors): [http://nvac.pnl.gov/agenda.stm] Illuminating the path: Research and Development Agenda for Visual Analytics. IEEE Press; 2005.
- Keim D: Information visualization and visual data mining. IEEE Transactions on Visualization and Computer Graphics 2002,8(1):1–8.View Article
- Andrienko G, Andrienko N, Jankowski P, Keim D, Kraak MJ, MacEachren A, Wrobel S: Geovisual analytics for spatial decision support: Setting the research agenda. Int J Geogr Inf Sci 2007,21(8):839–857.View Article
- De Amicis R, Stojanovic R, Conti G, (Editors): GeoSpatial Visual Analytics: Geographical Information Processing and Visual Analytics for Environmental Security (NATO Science for Peace and Security Series C: Environmental Security). Dordrecht, The Netherlands, Springer; 2009.
- Web portal for GeoSpatial Visual Analytics [http://geoanalytics.net/]
- Andrienko G, Andrienko N: Geospatial Visual Analytics Tutorial (2007, updated 2009). [http://www.peer.eu/fileadmin/user_upload/opportunities/metier/course4/c4_visual_analytics_geospatial.pdf]
- Guo D: Visual analytics of spatial interaction patterns for pandemic decision support. Int J Geogr Inf Sci 2007,21(8):859–877.View Article
- Maciejewski R, Rudolph S, Hafen R, Abusalah AM, Yakout M, Ouzzani M, Cleveland WS, Grannis SJ, Ebert DS: A visual analytics approach to understanding spatiotemporal hotspots. IEEE Trans Vis Comput Graph 2010,16(2):205–20.PubMedView Article
- Yuan X, He X, Guo H, Guo P, Kendall W, Huang J, Zhang Y: Scalable multi-variate analytics of seismic and satellite-based observational data. IEEE Trans Vis Comput Graph 2010,16(6):1413–20.PubMedView Article
- Livnat Y, Gesteland P, Benuzillo J, Pettey W, Bolton D, Drews F, Kramer H, Samore M: Epinome - a novel workbench for epidemic investigation and analysis of search strategies in public health practice. AMIA Annu Symp Proc 2010, 2010:647–51.PubMed
- Livnat Y, Samore M: Visual Analytics in Surveillance and Epidemiology: Challenges and Opportunities. [http://www.cdc.gov/osels/ph_informatics_technology/DOCS/PDF/Livnat%20Samore%20CoE%20Webinar%20Oct%207%202010.pdf] 2010.
- Chui KK, Wenger JB, Cohen SA, Naumova EN: Visual analytics for epidemiologists: understanding the interactions between age, time, and disease with multi-panel graphs. PLoS One 2011,6(2):e14683.PubMedView Article
- Epinome [http://www.sci.utah.edu/gallery2/v/software/epinome/]
- GeoVISTA Software [http://www.geovista.psu.edu/software/index.html]
- Gapminder [http://www.gapminder.org/]
- Quick guide to the Motion Chart Gadget [http://www.gapminder.org/upload-data/motion-chart/]
- Google Public Data Explorer [http://www.google.com/publicdata/home]
- Google Public Data Explorer Dataset Directory [http://www.google.com/publicdata/directory]
- Google Public Data Explorer: Upload Datasets [http://www.google.com/publicdata/admin]
- PivotViewer Home [http://www.silverlight.net/learn/pivotviewer/]
- Williams RW, Yan L, Zhou X, Lu L, Centeno A, Kuan L, Hawrylycz M, Rosen GD: Global Exploratory Analysis of Massive Neuroimaging Collections using Microsoft Live Labs Pivot and Silverlight. [http://www.neuroinformatics2010.org/images/Neuroinformatics2010_AbstractBook_web%20–2.pdf] Proceedings of Neuroinformatics 2010, 3rd INCF Congress, Kobe, Japan, 30 August - 1 September 2010162.
- Flake G: Is Pivot a turning point for Web exploration? [http://www.ted.com/talks/gary_flake_is_pivot_a_turning_point_for_web_exploration.html]
- Kniskern K: Creating the Microsoft Executive Pivot. [http://www.liveside.net/2009/11/29/creating-the-microsoft-executive-pivot-we-play-and-you-can-too] 2009.
- Microsoft Silverlight [http://www.silverlight.net/]
- Microsoft Deep Zoom [https://www.silverlight.net/learn/quickstarts/deepzoom/]
- Heuer T: Quick Steps to displaying data using PivotViewer and Silverlight. [http://timheuer.com/blog/archive/2010/07/09/data-visualization-with-silverlight-and-pivot-viewer-quickstart.aspx] 2010.
- Lin J: FAQ - What to know about Pivot (Community-powered support for Live Labs Pivot). [http://getsatisfaction.com/live_labs_pivot/topics/faq_what_to_know_about_pivot] 2010.
- PivotViewer Collection Tool for Microsoft Excel [http://www.silverlight.net/learn/pivotviewer/excel-tool/]
- Python imaging library (PIL) [http://www.pythonware.com/products/pil/]
- Open-zoom: Python Deep Zoom Tools (0.1.2) [http://code.google.com/p/open-zoom/downloads/detail?name=deep-zoom-tools-0.1.2.zip&can=2]
- Deep Zoom File Format Overview [http://msdn.microsoft.com/en-us/library/cc645077(v=vs.95).aspx]
- PivotViewer Interactive Demos [http://www.microsoft.com/silverlight/pivotviewer/]
- Silverlight.js 4.0.50401.0 [http://archive.msdn.microsoft.com/silverlightjs/Release/ProjectReleases.aspx?ReleaseId=4148]
- WHO: Health statistics and health information systems - Mortality data [http://www.who.int/healthinfo/statistics/mortality/en/index.html]
- Demonstrator Pivot collection using a WHO mortality dataset sample [http://edtech-dev.uthsc.edu/CTSI/matthewDev1/pivot/WHO/demodata/collection.html]
- Why a 64-bit runtime for Silverlight 5 Matters [http://silverlighthack.com/post/2010/12/02/Why-a-64-bit-runtime-for-Silverlight-5-Matters.aspx]
- WHO Global Infobase > Mortalities [https://apps.who.int/infobase/Mortality.aspx]
- Bing Maps [http://www.bing.com/maps/]
- Noble R: Extending PivotViewer with Bing Maps. [http://www.rogernoble.com/2011/01/04/extending-pivotviewer-with-bing-maps/] 2011.
- Farley R: Visually stunning maps and PivotViewer. [http://sqlblog.com/blogs/rob_farley/archive/2011/01/04/visually-stunning-maps-and-pivotviewer.aspx] 2011.
- Flickr Browser [http://pivot.lobsterpot.com.au/flickr]
- Hidalgo CA:[http://hdr.undp.org/en/reports/global/hdr2010/papers/HDRP_2010_39.pdf] Graphical Statistical Methods for the Representation of the Human Development Index and its Components. United Nations Development Programme, Human Development Reports Research Paper 2010/39; 2010.
- Heer J, Wattenberg M, Agrawala M: Point, Talk, Publish: Visualization and the Web. In Trends in Interactive Visualization--State-of-the-Art Survey. Edited by: Liere R, Adriaansen T, Zudilova-Seinstra E. Springer-Verlag; 2009:269–283.View Article
- Mckeon M: Harnessing the Web Information Ecosystem with Wiki-based Visualization Dashboards. IEEE Transactions on Visualization and Computer Graphics 2009,15(6):1081–1088.PubMedView Article
- Pattath A, Ebert D, Pike W, May R: Contextual Interaction for Geospatial Analytics on Mobile Devices. In : Proceedings of Multimedia on Mobile Devices 2009 Conference, San Jose, California, USA, 19–20 January 2009 (Proceedings of SPIE Volume 7256, 72560H). Edited by: Creutzburg R, Akopian D. Society of Photographic Instrumentation Engineers; 2009.
- Andrienko G, Andrienko N, Demšar U, Dransch D, Dykes J, Fabrikant S, Jern M, Kraak MJ, Schumann H, Tominski C: Space, Time, and Visual Analytics. Int J Geogr Inf Sci 2010,24(10):1577–1600.View Article
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.