Comparing the accuracy of two secondary food environment data sources in the UK across socio-economic and urban/rural divides
© Burgoine and Harrison; licensee BioMed Central Ltd. 2013
Received: 29 October 2012
Accepted: 13 January 2013
Published: 17 January 2013
Interest in the role of food environments in shaping food consumption behaviours has grown in recent years. However, commonly used secondary food environment data sources have not yet been fully evaluated for completeness and systematic biases. This paper assessed the accuracy of UK Points of Interest (POI) data, compared to local council food outlet data for the county of Cambridgeshire.
Percentage agreement, positive predictive values (PPVs) and sensitivities were calculated for all food outlets across the study area, by outlet type, and across urban/rural/SES divisions.
Percentage agreement by outlet type (29.7-63.5%) differed significantly to overall percentage agreement (49%), differed significantly in rural areas (43%) compared to urban (52.8%), and by SES quintiles. POI data had an overall PPV of 74.9%, differing significantly for Convenience Stores (57.9%), Specialist Stores (68.3%), and Restaurants (82.6%). POI showed an overall ‘moderate’ sensitivity, although this varied significantly by outlet type. Whilst sensitivies by urban/rural/SES divides varied significantly from urban and least deprived reference categories, values remained ‘moderate’.
Results suggest POI is a viable alternative to council data, particularly in terms of PPVs, which remain robust across urban/rural and SES divides. Most variation in completeness was by outlet type; lowest levels were for Convenience Stores, which are commonly cited as ‘obesogenic’.
KeywordsFood environment Secondary data Data completeness Geographic information systems
Interest in the role food environments play in shaping behaviours related to food consumption and food choice has grown in recent years. Researchers have often studied this relationship between individuals and their environments through creating metrics of environmental ‘exposure’ , for example neighbourhood availability of fast food outlets [2–6]. However, the resulting evidence base is equivocal and the degree to which the environment determines behaviour remains unknown. In terms of study design, investigations into the ‘obesogenic environment’  are frequently large scale, quantitative, often Geographical Information Systems (GIS) based [1, 8–11], and importantly, rely heavily on the use of secondary data. Despite this, relatively little is known about the accuracy of commonly used secondary food environment datasets. In creating measures of food environment exposure that hope to realistically model individual-environment relationships, having accurate food outlet location data is critical, and so data accuracy should be better understood.
Several recent studies have addressed the accuracy and reliability of secondary food outlet data sources in relation to their utility for use in health research [12–20], although most assessments have been made in the US. Whilst collecting primary food outlet data might be the ideal, primary data collection is resource and time intensive. There is therefore an important place for secondary data in the quantification of food environments, yet the quality and completeness of such data are not always clear. In the US, companies such as Dun and Bradstreet (D&B) and InfoUSA can provide a minimal-fuss, geographically large and ready classified dataset, whilst in the UK, commercial Yellow Pages data can be purchased in bulk through providers such as Experian. The use of such data represents the lowest time resource cost option for secondary data acquisition. ‘Collecting’ data from local councils (governing bodies at the local level) or state departments is more complex, requiring a substantial time and resource investment to both obtain and streamline the data prior to use . These three types of data source (‘primary’, ‘intensive secondary’ (such as council data), and ‘extensive secondary’ (such as Yellow Pages or InfoUSA data)) are all potentially important, allowing accuracy to be traded for convenience where imperatives such as research timelines prevail. However, in order to make the best decisions about which data to use, it is important to know how these different data sources compare.
Lake et al. compared online and paper editions of the Yellow Pages telephone directories to the gold standard of a ground truthed food outlet database in North East England, finding positive predictive values (PPVs) of 79.1% and 82.4%, respectively. Even better was the PPV for food outlet data from local councils’ environmental health departments as compared with reality, at 91.5% in this area . In the UK, food outlets are required to register their business with local councils by law in order to facilitate routine hygiene inspections, which may explain this accuracy. Other UK studies have re-iterated the accuracy of council records, reporting PPVs of 86.6% (1997) and 87.3% (2007) at two time points in Glasgow , and between 79-87% across urban/rural and socio-economic divides in North East England . The sensitivity of council data compared to ‘reality’ has consistently shown itself to be ‘moderate’ to ‘excellent’ [14, 15], according to a classification system developed by Paquet et al.. In North America, although the accuracy of state level data was questioned in one paper , improved PPVs and sensitivities have been found for state level food records (ground truthed data as the gold standard) as compared with the much used D&B and InfoUSA commercial datasets [18–20].
This said, most assessments of data validity have been made across entire study areas, not accounting for differences in completeness across socio-economic lines or urban/rural boundaries. There is some suggestion that the accuracy of food outlet records may vary systematically across such divides [14, 22], which do exist in the UK, albeit perhaps less overtly than in the US, for example. Whilst one small study in North East England did not find any significant differences in data validity by area SES or urban/rural status , potential differences in data integrity across these divides are important to consider as they might imbue systematic biases in downstream analyses.
In the UK, Ordnance Survey (OS) Points of Interest (POI) data are increasingly used in the literature as a source of information on environmental attributes such as the locations of food stores or physical activity facilities [23–25], and hold potential to be an accurate and useful source of ‘extensive secondary’ data due to its updateability, positional accuracy (co-ordinates are provided for environmental attributes with 1m precision), and theoretical comprehensiveness ; POI contains information from over 170 data suppliers, chosen for being “the most authoritative source…for the particular type of feature they supply and for the quality and completeness of [their] data” . Inaccuracies demonstrated in other sources of commercial data only enhance the appeal of POI , however the accuracy of these data has not yet been assessed in the published academic literature, leaving its efficacy for use in health research in question.
Using accurate council food outlet location data as the reference standard, this study aims to assess the validity of POI data for use in research into the (obesogenic) food environment for the first time, in Cambridgeshire, UK. Reliability will be assessed as the completeness of POI records as compared to council data, which has been shown to be moderately to highly accurate in other regions of the UK, with a PPV of 91.5% in North East England . We aim to undertake this assessment for all POI records across the study area and to assess whether POI completeness varies by outlet type, by urban/rural status and across socio-economic divides.
Food outlet data
Outlets were matched based on their name, address and postcode. Outlets were matched, even where spelling of business name was similar but not identical, where supporting evidence (such as the same address and/or postcode) was present. Food outlet locations for council and POI data were geocoded according to their postcodes and overlaid atop Lower Super Output Area (LSOA) boundaries for Cambridgeshire, using ArcGIS 10 (ESRI Inc., Redlands, CA). LSOAs were attributed an urban/rural status (according to Communities and Local Government guidelines, defining small towns, villages and hamlets with fewer than 10,000 residents as ‘rural’ ), with a good mix of urban and rural areas present throughout the study area, as shown in Figure 1. LSOAs were also attributed a measure of area level socio-economic status (SES) (quintiles of Index of Multiple Deprivation (IMD) scores 2010 , relative to Cambridgeshire county), as also shown in Figure 1. IMD is a compound measure of SES across seven principle domains (income deprivation, employment deprivation, crime, health deprivation and disability, education, skills and training deprivation, barriers to housing and services and living environment deprivation), with scores increasing as deprivation increases .
Completeness of POI data compared to the reference standard council data was assessed by calculating percentage agreement, positive predictive values (PPVs) and sensitivities for all outlets, and by type of food outlet, using PASW Statistics 18 (PASW Statistics Inc., Chicago, 2009). These statistics have been widely employed in the literature to date [12–14, 16, 18, 19]. Percentage agreement computes the percentage of food outlets present in both POI and council data (true positives/(true positives + false negatives + false positives)). PPVs represent the percentage of outlets listed in the POI dataset that were also present in the council data (true positives/(true positives + false positives)). Sensitivity represents the percentage of outlets listed in the council data that were also listed in the POI data (true positives/(true positives + false negatives)). As is common in the literature, accepted sensitivity cut-offs will be applied here : ‘poor’ <30%; ‘fair’ 31-50%; ‘moderate’ 51-70%; ‘good’ 71-90%; ‘excellent’ >91%. Lake et al. present a useful diagram showing how PPVs and sensitivities are calculated and relate to each other. Differences between PPVs, sensitivities and percentage agreements for all food outlets as compared to food outlets by type were assessed using Fisher’s Exact tests (preferred over chi-squared tests due to potentially small expected values). PPVs and sensitivities were calculated separately for urban and rural areas and for each IMD quintile; comparisons with PPVs and sensitivities in relation to urban and least deprived reference categories were again made using Fisher’s Exact tests. A value of p<0.05 was used as the marker of statistical significance for differences.
Descriptive statistics and percentage agreement for all food outlets, food outlets by type, and all food outlets across urban/rural divides and socio-economic status quintiles
Food outlet category
Missing POI records (%)
Percentage agreement (%)a
95% CI for difference
All Food Outlets
SES-1 (Least Deprived)
SES-5 (Most Deprived)
Positive predictive value analysis
PPVs for all food outlets, food outlets by type, and all food outlets across urban/rural divides and socio-economic status quintiles
Food outlet category
95% CI for difference
All Food Outlets
SES-1 (Least Deprived)
SES-5 (Most Deprived)
Sensitivity values for all food outlets, food outlets by type, and all food outlets across urban/rural divides and socio-economic status quintiles
Food outlet category
Sensitivity category b
95% CI for difference
All Food Outlets
SES-1 (Least Deprived)
SES-5 (Most Deprived)
This work examined the validity of a potentially important and increasingly used ‘extensive secondary’ dataset in the UK. As has been noted, despite general epidemiological concern with regards to measurement accuracy  and the determination of exposure ‘truth’ , surprisingly little is known about the validity of commonly used secondary data sources in the field. This study assessed the accuracy of POI data (at least as compared to previously validated local council records) for the first time in the published literature. Although the results of this study are therefore specific to POI data, as compared with local council records in Cambridgeshire, UK, the importance of considering the validity of secondary data in these ways and across pertinent divisions remains important across all secondary datasets; this study is novel in this respect.
In terms of concordance between the datasets, the POI data contained 524 fewer gross records than were present in the council data, with a percentage agreement of 49.9%, translating into an overall PPV of 74.9% and sensitivity of 59.9% (‘moderate’). These results are largely in line with previous studies examining the accuracy of other secondary food environment data [12–15, 18–20], the caveat being that this study did not use a ground truthed dataset as a gold standard, and instead used a reliable secondary reference dataset (demonstrated to have a PPV of 91.5% in Newcastle, UK ) to increase the scale of the investigation.
Differentiation by type of food outlet revealed PPVs between 57.9% and 82.6%, with sensitivities between 37.8% (‘fair’) and 77.2% (‘good’). These assessments by food outlet type are roughly in line with those demonstrated in the literature [12, 19], but rather below those shown for some commercial US datasets . As these statistics were largely significantly sensitive to food outlet type, this research highlights the importance of considering the accuracy of secondary data for specific types of food outlet, as has been noted elsewhere . Although we find the lowest levels of gross completeness for cafés/coffee shops (39%), in terms of the number of missing records in POI data, convenience store records are especially incomplete with regards to percentage agreement, PPVs and sensitivity. These small grocery shops are commonly cited as being ‘obesogenic’ [27, 38, 39], being less likely than larger supermarkets to sell ‘healthful’ foods . Given this potential gap in the POI data, this might be an area to focus on if future research is considering supplementing POI data with either council records or field work. It is of note that POI appears to represent a particularly robust source of data on restaurant locations.
Importantly, PPVs across socio-economic and urban/rural divides were similar, both to each other, and to the statistic for all outlets. Such similarities have been demonstrated elsewhere [14, 18]. For sensitivity and percentage agreement, there were exceptions, including significantly better estimates of both in some more deprived quintiles, although no evidence of a trend existed, and in urban areas. This said, sensitivies across urban/rural and SES divides mostly remained ‘moderate’ and as such aligned with the overall sensitivity description. Whilst the data should still be seen as ‘imperfect’ , some had suggested that substantial differences in food outlet representation across SES and urban/rural divides such as those tested here might prevail [14, 22], and whilst this hypothesis should be further tested in validation studies of other datasets, we do not believe this was the case here.
The utility of POI data may be research specific, however, if selected as a source of food outlet location data, we suggest they should be used with confidence particularly with respect to data completeness over socio-economic divides, in urban areas, and where research focuses on restaurant, supermarket or takeaway locations.
Strengths of this study include the fair comparison of contemporaneous datasets, the application of a 6 category food outlet classification scheme whose outlet types should relate directly to future deductive research, and its large geographical scale, which enabled an assessment of over 2000 food outlets in each dataset. In particular, using established statistics (percentage agreement, PPVs and sensitivies) across urban/rural and socio-economic divides allowed an assessment of the likelihood of systematic geographical differences in completeness. To our knowledge, this is the first time that such an appraisal has been made in the published literature on a large scale.
There were several key limitations to this study. In order to enable the large study area, field work was not conducted, choosing instead to use local council data as our ‘gold (reference) standard’. Local council data have been shown accurate in several other regions of the UK, however they are unlikely to be complete, resulting in a potential lack of comparability with previous studies that can relate directly to the food environment reality. Despite this limitation, the strength of results found here suggest that if council data are indeed less complete than we might hope, or are systematically incomplete (for example, across socio-economic divides) they are at least aligned in these respects with POI records. In order to maximise heterogeneity in socio-economic status throughout the study area, quintiles of SES were calculated relative to the study area only. Increased sensitivity in detecting SES differences between LSOAs was useful for these analyses, however, our findings may not be applicable to the most deprived locales, which are substantially under-represented throughout Cambridgeshire (IMD scores are positively skewed towards being lower (less deprived); mean IMD for Cambridgeshire=15.51 (SD=11.44), range of possible IMD scores for England as a whole 0.53-87.80). This potential limitation may lead to a lesser degree of generalisability outside this study area, however it does not compromise the accuracy of these results. To facilitate a fair comparison of the datasets, we attempted to obtain as contemporaneous information as possible. We asked OS and local councils for current data in January 2012 to facilitate this, however, it is possible that either dataset may not reflect the food environment at precisely the same time. Whilst some exclusions in the datasets were made based on food not sold directly to the public (food producers, for example), exclusions of market traders or mobile food stands were made predominantly because addresses were for the traders’ home addresses and not the retail sites themselves. These types of food retailers are likely important sources of food [14, 22], potentially with a socio-economic gradient of use [41, 42], and should be considered where possible in future validation work.
In terms of the POI dataset itself, the data were not without duplicates that needed to be found and removed (n=105). The classification system supplied was too general to be of real use in most health research (for details see, http://www.ordnancesurvey.co.uk/oswebsite/docs/product-schemas/points-of-interest-classifications-scheme.pdf) so a project specific classification scheme such as the one used here would almost certainly be required. POI contains records beyond simply the foodscape, making it difficult to discern whether listed establishments sold food or not. In council datasets, outlets are listed precisely because they sell food. This breadth may lead to the omission of important sources of food within the environment, for example from pharmacies, such as Boots the Chemist, a national chain that often but not always sells food items. Investigative work would be required when using POI data to determine whether or not each of these individual stores sells food.
Accurate analysis in health and policy research begins with accurate data. Ordnance Survey Points of Interest records generally compared favourably here in relation to data from local councils’ environmental health departments. We observed few notable systematic variations in POI completeness (PPV/sensitivity) over urban/rural and SES divides, however when type of outlet was considered, convenience stores appeared to be the least well represented in the POI, and consideration must therefore be given to the types of outlets being studied when selecting a dataset.
The utility of POI is boosted when its relative ease of acquisition is considered (in relation to both ‘intensive secondary’ council data, and primary data collection). However, this is not to say that by combining POI data with local council data, one might be able to build an even more accurate picture of the food environment. Future research using a ground truthed dataset over an equivalent study area is necessary to ascertain whether this is likely to be the case.
This work was undertaken by the Centre for Diet and Activity Research (CEDAR), a UK Clinical Research Collaboration (UKCRC) Public Health Research Centre of Excellence. Funding from the British Heart Foundation, Economic and Social Research Council, Medical Research Council, the National Institute for Health Research and the Wellcome Trust under the auspices of the UK Clinical Research Collaboration, is gratefully acknowledged. The digital maps used hold Crown Copyright from EDINA Digimap, a JISC supplied service. We are grateful to Cambridgeshire local councils and Ordnance Survey for kindly supplying data to enable this work.
- Charreire H, Casey R, Salze P, Simon C, Chaix B, Banos A, Badariotti D, Weber C, Oppert J-M: Measuring the food environment using geographical information systems: a methodological review. Public Health Nutrition 2010, 13:1773–1785.PubMedView Article
- Boone-Heinonen J, Gordon-Larsen P, Kiefe CI, Shikany JM, Lewis CE, Popkin BM: Fast food restaurants and food stores: longitudinal associations with diet in young to middle-aged adults: the CARDIA study. Arch Intern Med 2011, 171:1162–1170.PubMedView Article
- Maddock J: The relationship between obesity and the prevalence of fast food restaurants: state-level analysis. Am J Heal Promot 2004, 19:137–143.View Article
- Chou S-Y, Grossman M, Saffer H: An economic analysis of adult obesity: results from the behavioural risk factor surveillance system. J Heal Econ 2004, 23:565–587.View Article
- Mehta NK, Chang VW: Weight status and restaurant availability: a multilevel analysis. American Journal of Preventive Medicine 2008, 34:127–133.PubMedView Article
- Thornton LE, Bentley RJ, Kavanagh AM: Fast food purchasing and access to fast food restaurants: a multilevel analysis of VicLANES. Int J Behav Nutr Phys Act 2009, 6:1–10.View Article
- Swinburn B, Egger G: Preventive strategies against weight gain and obesity. Obes Rev 2002, 3:289–301.PubMedView Article
- Caspi CE, Sorensen G, Subramanian SV, Kawachi I: The local food environment and diet: a systematic review. Health and Place 2012, 18:1172–1187.PubMedView Article
- Kelly B, Flood VM, Yeatman H: Measuring local food environments: an overview of available methods and measures. Health and Place 2011, 17:1284–1293.PubMedView Article
- Giskes K, van Lenthe F, Avendano-Pabon M, Brug J: A systematic review of environmental factors and obesogenic dietary intakes among adults: are we getting closer to understanding obesogenic environments? Obes Rev 2011, 12:e95-e106.PubMedView Article
- Fleischhacker SE, Evenson KR, Rodriguez DA, Ammerman AS: A systematic review of fast food access studies. Obes Rev 2011, 12:460–471.View Article
- Lake AA, Burgoine T, Greenhalgh F, Stamp E, Tyrrell R: The foodscape: classification and field validation of secondary data sources. Health and Place 2010, 16:666–673.PubMedView Article
- Cummins S, Macintyre S: Are secondary data sources on the neighbourhood food environment accurate? Case study in glasgow UK. Prev Med 2009, 49:527–528.PubMedView Article
- Lake AA, Burgoine T, Stamp E, Grieve R: The foodscape: classification and field validation of secondary data sources across urban/rural and socio-economic classifications. Int J Behav Nutr Phys Act 2012, 9:3–12.View Article
- Svastisalee CM, Holstein BE, Due P: Validation of presence of supermarkets and fast-food outlets in Copenhagen: case study comparison of multiple sources of secondary data. Public Health Nutr 2012, doi:10.1017/S1368980012000845:1–4.
- Paquet C, Daniel M, Kestens Y, Léger K, Gauvin L: Field validation of listings of food stores and commercial physical activity establishments from secondary data. Int J Behav Nutr Phys Act 2008, 5:1–7.View Article
- Wang MC, Gonzalez AA, Ritchie LD, Winkleby MA: The neighbourhood food environment: sources of historical data on retail food stores. Int J Behav Nutr Phys Act 2006, 3:1–5.View Article
- Liese AD, Colabianchi N, Lamichhane AP, Barnes TL, Hibbert JD, Porter DE, Nichols MD, Lawson AB: Validation of 3 food outlet databases: completeness and geospatial accuracy in rural and urban food environments. Am J Epidemiol 2010, 172:1324–1333.PubMedView Article
- Powell LM, Han E, Zenk SN, Khan T, Quinn CM, Gibbs KP, Pugach O, Barker DC, Resnick EA, Myllyluoma J, Chaloupka FJ: Field validation of secondary commercial data sources on the retail food outlet environment in the US. Health and Place 2011, 17:1122–1131.PubMedView Article
- Bader MDM: Measurement of the local food environment: a comparison of existing data sources. Am J Epidemiol 2010, doi:10.1093/aje/kwp419:1–9.
- Burgoine T: Collecting accurate secondary foodscape data: a reflection on the trials and tribulations. Appetite 2010, 55:522–527.PubMedView Article
- Sharkey JR, Horel S: Neighbourhood socioeconomic deprivation and minority composition are associated with better potential spatial access to the ground-truthed food environment in a large rural area. J Nutr 2008, 138:620–627.PubMed
- Harrison F, Jones AP, van Sluijs EMF, Cassidy A, Bentham G, Griffin SJ: Environmental correlates of adiposity in 9–10 year old children: considering home and school neighbourhoods and routes to school. Social Science and Medicine 2011, 72:1411–1419.PubMedView Article
- Skidmore P, Welch A, van Sluijs E, Jones A, Harvey I, Harrison F, Griffin S, Cassidy A: Impact of neighbourhood food environment on food consumption in children aged 9–10 years in the UK SPEEDY (sport, physical activity and eating behaviour: environment determinants in young people) study. Public Health Nutr 2009, 13:1022–1030.PubMedView Article
- Jennings A, Welch A, Jones AP, Harrison F, Bentham G, van Sluijs EMF, Griffin S, Cassidy A: Local food outlets, weight status, and dietary intake: associations in children aged 9–10 years. American Journal of Preventive Medicine 2011, 40:405–410.PubMedView Article
- Points of interest: technical information. http://www.ordnancesurvey.co.uk/oswebsite/products/pointsofinterest/techinfo.html.
- Morland K, Wing S, Diez-Roux AV: The contextual effect of the local food environment on residents' diets: the atherosclerosis risk in communities study. Am J Public Health 2002, 92:1761–1767.PubMedView Article
- Moore LV, Diez-Roux AV, Nettleton JA, Jacobs DR: Associations of the local food environment with diet quality - a comparison of assessments based on surveys and geographic information systems. Am J Epidemiol 2008, 167:917–924.PubMedView Article
- Bodor JN, Rose D, Farley TA, Swalm C, Scott SK: Neighbourhood fruit and vegetable availability and consumption: the role of small food stores in an urban environment. Public Health Nutr 2007, 11:413–420.PubMed
- Edmonds J, Baranowski T, Baranowski J, Cullen KW, Myres D: Ecological and socioeconomic correlates of fruit, juice, and vegetable consumption among african-american boys. Prev Med 2001, 32:476–481.PubMedView Article
- Mobley LR, Root ED, Finkelstein EA, Khavjou O, Farris RP, Will JC: Environment, obesity, and cardiovascular disease risk in low-income women. Am J Prev Med 2006, 30:327–332.PubMedView Article
- Raja S, Yin L, Roemmich J, Ma C, Epstein L, Yadav P, Ticoalu AB: Food environment, built environment, and women's BMI: evidence from erie county, New york. J Plan Educ Res 2010, 29:444–460.View Article
- Black JL, Macinko J, Dixon LB, Fryer GE Jr: Neighbourhoods and obesity in New york city. Health and Place 2010, 16:489–499.PubMedView Article
- Commission for Rural Communities: What is rural?. London: Countryside Agency; 2004.
- The English indices of deprivation. 2010. http://www.communities.gov.uk/publications/corporate/statistics/indices2010.
- Indices of deprivation. 2007. http://www.communities.gov.uk/communities/neighbourhoodrenewal/deprivation/deprivation07.
- White E, Armstrong BK: Principles of measurement in epidemiology: collecting, evaluating, and improving measures of disease risk factors. 2nd edition. Oxford: Oxford University Press; 2008.View Article
- Rundle A, Neckerman KM, Freeman L, Lovasi GS, Purciel M, Quinn J, Richards C, Sircar N, Weiss C: Neighborhood food environment and walkability predict obesity in New york city. Environ Heal Perspect 2009, 117:442–447.
- Galvez MP, Hong L, Choi E, Liao L, Godbold J, Brenner B: Childhood obesity and neighbourhood food-store availability in an inner-city community. Acad Pediatr 2009, 9:339–343.PubMedView Article
- Liese AD, Weis KE, Pluto D, Smith E, Lawson A: Food store types, availability, and cost of foods in a rural environment. J Am Diet Assoc 2007, 107:1916–1923.PubMedView Article
- Odoms-Young AM, Zenk SN, Mason MM: Measuring food availability and access in african-american communities: implications for intervention and policy. Am J Prev Med 2009, 36:S145-S150.PubMedView Article
- Bagwell S: The role of independent fast-food outlets in obesogenic environments: a case study of east london in the UK. Environment and Planning A 2011, 43:2217–2236.View Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.