Incorporating geographical factors with artificial neural networks to predict reference values of erythrocyte sedimentation rate
© Yang et al; licensee BioMed Central Ltd. 2013
Received: 15 January 2013
Accepted: 6 March 2013
Published: 12 March 2013
The measurement of the Erythrocyte Sedimentation Rate (ESR) value is a standard procedure performed during a typical blood test. In order to formulate a unified standard of establishing reference ESR values, this paper presents a novel prediction model in which local normal ESR values and corresponding geographical factors are used to predict reference ESR values using multi-layer feed-forward artificial neural networks (ANN).
Methods and findings
Local normal ESR values were obtained from hospital data, while geographical factors that include altitude, sunshine hours, relative humidity, temperature and precipitation were obtained from the National Geographical Data Information Centre in China.
The results show that predicted values are statistically in agreement with measured values. Model results exhibit significant agreement between training data and test data. Consequently, the model is used to predict the unseen local reference ESR values.
Reference ESR values can be established with geographical factors by using artificial intelligence techniques. ANN is an effective method for simulating and predicting reference ESR values because of its ability to model nonlinear and complex relationships.
The erythrocyte sedimentation rate (ESR) is a well-established clinical test for diseased patients that is commonly used to estimate the body's acute phase reaction to inflammation and infection [1, 2]. For many years, physicians have found normal ESR values useful for predicting the severity of specific diseases and assessing a general sickness index, among other uses. The origin of the concept of the ESR dates back to the early 19th century, when the Greeks observed the relation between the sedimentation of red blood cells and fibrinogen. In 1918, Fahraeus discovered that erythrocyte sedimentation in plasma occurred more rapidly in pregnant women than in non-pregnant women. Since then, with minor modifications, the ESR has been used in the evaluation of a variety of diseases.
The most commonly used method of measuring the ESR is the Wintrobe method, which is performed using a 100-mm tube containing oxalate as the main anticoagulant. In order to compare the ESR values of patients with normal ESR values, reference ESR values have been measured in local hospitals and research institutes. Some studies have found that, in addition to varying with seasonal changes, reference ESR values also vary significantly with the age, gender, smoking habits and weight of patients [6–8]. One study proposed a formula for calculating the maximum normal ESR at any given age. In that study, the maximum normal ESR is calculated as (age in years)/2 for men and (age in years + 10)/2 for women. While ESR values increase as people become older, the tendency of ESR values to increase with age flattens out after age 60 [7, 9]. Other observations regarding ESR values include a general pattern of high values in spring and autumn and low values in summer, and a significant increase of mean ESR values due to smoking and obesity. In addition to age, gender, smoking habits and weight, some studies have found that normal ESR values also vary with geographical factors [10–12]. For example, some studies found that ESR is significantly correlated with altitude, latitude, relative humidity, mean annual temperature and annual precipitation [10–12]. In our study, we retain five geographical factors, as in a previous study; however, we replace latitude with annual sunshine hours because of the effects of seasonal variation on ESR values suggested by a study in which high ESR values were observed in the spring and autumn while low values were observed in the summer. The decrease of reference ESR values is significantly associated with an increase in altitude and a decrease in relative humidity, mean annual temperature and annual precipitation [10–12].
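The age rule quoted above can be written as a short function. This is a minimal sketch: the function name, the `sex` labels and the mm/h unit are assumptions added for illustration, not part of the cited study's text.

```python
def max_normal_esr(age_years: float, sex: str) -> float:
    """Maximum normal ESR (assumed unit: mm/h) from the age rule quoted in the text.

    Men:   age / 2
    Women: (age + 10) / 2
    """
    if age_years < 0:
        raise ValueError("age_years must be non-negative")
    if sex == "male":
        return age_years / 2
    if sex == "female":
        return (age_years + 10) / 2
    raise ValueError("sex must be 'male' or 'female'")


print(max_normal_esr(40, "male"))    # 20.0
print(max_normal_esr(40, "female"))  # 25.0
```

The rule depends only on age and sex, which is the gap the geographical model in this paper is meant to address.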
In order to find how such geographical factors affect the reference ESR values, some studies modeled the relationship using stepwise regression [10–12]. While geographical factors have been found to improve the prediction accuracy of local reference ESR values, the reasons why they do so are not definitive because of cross-correlation: for example, humidity, temperature and precipitation generally decrease as altitude increases, while annual sunshine hours are affected by seasonal variation of the other geographical factors at a specific altitude. Consequently, the relationship between reference ESR values and geographical factors is nonlinear and complicated in a way that limits a stepwise regression model, given the cross-correlation of the independent variables.
To address variable cross-correlation when calculating reference ESR values from geographical factors, this paper presents a new method of simulating and predicting local reference ESR values using artificial neural networks (ANN). The proposed method has a number of advantages over other methods. First, the training procedure is simple and convenient because the parameter values are obtained automatically by the neural network. Second, the method is efficient because it uses the well-developed procedure of back-propagation training [14, 15], so that it is able to deal with complex interactions among variables. Finally, the use of a neural network means that the variables do not have to be independent of each other. All in all, the proposed model structure is more robust and stable than linear regression models.
Methods and materials
Artificial Neural Network
An Artificial Neural Network (ANN) is a nonlinear regression method that is inspired by the way biological nervous systems, such as the brain, process information. ANNs have been widely applied to difficult problems in many disciplines. Neural networks, with their remarkable ability to derive meaning from nonlinear data, can be used to extract patterns and detect trends that are too complex to be evaluated by simple regression techniques. A trained neural network can be thought of as an ‘expert’ in the category of information it has been given to analyze. This ‘expert’ can then be used to provide projections for new situations of interest.
The basic processing units in a neural network are the so-called neurons or nodes, which are organized in several layers. All the neurons, except those in the input layer, perform two simple processing functions: collecting the activation of the neurons in the previous layer and generating activation as the input to the following layer. The neurons in the input layer only send signals to the next layer but do not process input data.
The signal collected by a receiver neuron is the weighted sum of the signals from the sender layer:

$$net_j = \sum_i W_{ij} I_i$$

where $I_i$ is the signal from neuron i of the sender layer, $net_j$ is the collection signal for receiver neuron j in the next layer and $W_{ij}$ is the parameter or weight applied to the signal from input node i when the signals are summed. The receiver neuron in the next layer creates activation in response to the signal $net_j$. The activation then becomes the input for its next layer. The activation is usually created in the form of a sigmoid or linear function.
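The collection and activation steps of a single receiver neuron can be sketched as follows. The function names and the toy numbers are hypothetical, and the logistic form of the sigmoid is an assumption, since the text only says "sigmoid or linear".

```python
import math


def sigmoid(z: float) -> float:
    """Logistic sigmoid, one common choice of activation function."""
    return 1.0 / (1.0 + math.exp(-z))


def neuron_activation(inputs, weights, activation=sigmoid):
    """Collect the weighted signals from the sender layer, then activate.

    inputs  -- signals I_i from the sender layer
    weights -- weights W_ij on the links into this receiver neuron j
    """
    if len(inputs) != len(weights):
        raise ValueError("inputs and weights must have the same length")
    net_j = sum(w * i for w, i in zip(weights, inputs))  # collection signal
    return activation(net_j)


# Made-up signals and weights, for illustration only.
print(neuron_activation([0.2, 0.8], [0.5, -0.5]))
# A linear activation simply passes the collection signal through.
print(neuron_activation([0.2, 0.8], [0.5, -0.5], activation=lambda z: z))
```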
The learning process of neural networks entails determining the adaptive weights, which express the strength of the interconnection between associated neurons. The values of the weights are not set by the users but rather are determined by the network during training. One of the most popular training methods is the back-propagation learning algorithm, which iteratively minimizes an error function over the network (calculated) outputs and desired outputs on the basis of a training data set [14, 15]. An advantage of the back-propagation neural network is that the learning algorithm is not programmed into the network a priori. The weights are initially set by a random process. The error, computed as the difference between calculated and desired activation for the output neuron, is propagated back through the network and used to adjust the weights. The process of adjusting the weights according to the errors is repeated over many iterations until the error rate is minimized or reaches an acceptable level. Once the optimized weights have been obtained from the training data set, the network is ready for prediction. Prediction is based on the activation level ($I_i$) in the output neuron. The activation level of a neuron ranges from 0 to 1, a scale which reflects the variation from extremely low to extremely high strength of membership, respectively.
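The weight-adjustment loop described above can be illustrated with a small pure-Python sketch. It uses plain gradient-descent back-propagation on a squared-error function with sigmoid activations; the learning rate, network size, toy data and function names are all assumptions for illustration, and this is not the Levenberg-Marquardt variant used later in the paper.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def forward(x, w_hidden, w_out):
    """Return the hidden activations and the output activation for one case."""
    hidden = [sigmoid(sum(w * xi for w, xi in zip(row, x))) for row in w_hidden]
    output = sigmoid(sum(w * h for w, h in zip(w_out, hidden)))
    return hidden, output


def mean_squared_error(data, w_hidden, w_out):
    return sum((forward(x, w_hidden, w_out)[1] - t) ** 2 for x, t in data) / len(data)


def train(data, n_hidden=3, learning_rate=0.5, epochs=500, seed=0):
    rng = random.Random(seed)
    n_in = len(data[0][0])
    # Weights are initially set by a random process.
    w_hidden = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
    w_out = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
    history = [mean_squared_error(data, w_hidden, w_out)]
    for _ in range(epochs):
        for x, target in data:
            hidden, out = forward(x, w_hidden, w_out)
            # Error at the output neuron, scaled by the sigmoid derivative.
            delta_out = (out - target) * out * (1.0 - out)
            # Error propagated back to each hidden neuron (uses the old w_out).
            delta_hidden = [delta_out * w_out[j] * hidden[j] * (1.0 - hidden[j])
                            for j in range(n_hidden)]
            for j in range(n_hidden):
                w_out[j] -= learning_rate * delta_out * hidden[j]
                for i in range(n_in):
                    w_hidden[j][i] -= learning_rate * delta_hidden[j] * x[i]
        history.append(mean_squared_error(data, w_hidden, w_out))
    return w_hidden, w_out, history


# Made-up toy cases: (inputs, desired output in the 0-1 range).
# The constant second input plays the role of a bias term.
toy_data = [([0.0, 1.0], 0.2), ([0.5, 1.0], 0.5), ([1.0, 1.0], 0.8)]
w_hidden, w_out, history = train(toy_data)
print(f"error before: {history[0]:.4f}, after: {history[-1]:.4f}")
```

The `history` list records the error before training and after every pass over the data, which makes it easy to check that the iterations are reducing the error.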
ANN-based reference ESR values training and predicting model
The input variables of the model are the following five geographical factors:
$S_1$ - Altitude (m);
$S_2$ - Annual sunshine hours (hours);
$S_3$ - Annual average relative humidity (%);
$S_4$ - Annual average temperature (°C);
$S_5$ - Annual average precipitation (mm).
The geographical data listed above were obtained from the National Geographical Data Information Center, China.
An essential task is to design the network structure for the prediction. The design of the network structure is simplified because the numbers of layers and neurons in the layers can be subjectively determined. However, an increase in the number of layers and neurons will drastically increase the computation time for the model. The principle is to use as few layers and neurons as possible without severely compromising model accuracy. Based on tests specific to our data, it is sufficient to use 3 layers in the neural network: one input layer, one hidden layer, and one output layer. The input layer has five neurons corresponding to the five geographical factors chosen for the study. There are 5 neurons in the hidden layer (Figure 1). The output layer has only one neuron which indicates normal ESR value. There are 25 (5×5) weights to be determined for the links between the input layer and the hidden layer, and 5 weights between the hidden layer and output layer. Consequently, a total of 30 parameters are used for the neural network model.
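The parameter count above follows directly from the layer sizes. A minimal sketch, assuming a fully connected network with no bias terms (which is what the stated total of 30 implies); the helper name is hypothetical:

```python
def count_weights(layer_sizes):
    """Number of link weights in a fully connected feed-forward network.

    Counts one weight per link between consecutive layers; bias terms are
    not counted, matching the tally given in the text.
    """
    return sum(n_from * n_to for n_from, n_to in zip(layer_sizes, layer_sizes[1:]))


layer_sizes = [5, 5, 1]  # input, hidden, output neurons
print(count_weights(layer_sizes))  # 30
```

Adding a sixth hidden neuron, for example, would raise the count to 5×6 + 6 = 36, which shows how quickly the number of parameters grows with network size.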
Where x is a case, $net_i(x)$ is the received signal for neuron i of case x in the input layer, and $S_k'(x)$ is the kth geographical factor of case x.
Where $net_{1,j}(x)$ is the signal received by neuron j of the hidden layer for case x, and $W_{ij}$ is the weight from neuron i of the input layer to neuron j of the hidden layer.
Where $O(x)$ is the signal received by the neuron of the output layer for case x, and $W_i$ is the weight from neuron i of the hidden layer to the neuron of the output layer.
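One set of layer equations consistent with the symbol definitions above is sketched below. This is an assumed reading and not necessarily the authors' printed form: in particular, the activation function $f$ (for example a sigmoid), the treatment of $S_i'(x)$ as the ith input signal, and the absence of bias terms are assumptions.

```latex
\begin{align}
  net_i(x)     &= S_i'(x),                                  & i &= 1,\dots,5 \\
  net_{1,j}(x) &= \sum_{i=1}^{5} W_{ij}\, net_i(x),         & j &= 1,\dots,5 \\
  O(x)         &= \sum_{i=1}^{5} W_i\, f\bigl(net_{1,i}(x)\bigr)
\end{align}
```

Under this reading the first sum uses the 25 input-to-hidden weights and the second uses the 5 hidden-to-output weights, which agrees with the total of 30 parameters.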
The network was trained using the Levenberg-Marquardt (LM) algorithm. The values of these weights are automatically determined by the learning process which is based on the back-propagation algorithm in the MATLAB Neural Network toolbox.
The whole set of samples is automatically divided into three groups of 70%, 15% and 15% for the learning process of the neural network: the training dataset, the validation dataset and the test dataset, respectively. The training dataset is used to obtain the weights for each link between a pair of neurons, the validation dataset is used to monitor how well the network generalizes during training, and the test dataset is then used to verify the learning results. A set of weights is finally obtained from the training process. One of the most important characteristics of a trained neural network is its ability to generalize beyond the training data. If the network simply memorizes or overfits the training data, it will generally perform poorly on the test data. It is therefore important to choose the number of iterations so that training stops at the right point. If training runs for too long, the network may be overtrained and consequently produce large prediction errors on the test data. In this study, this problem was averted by stopping the training when the error on the validation data began to increase.
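The 70/15/15 division and the stopping rule can be sketched in a few lines. The function names, the random seed and the `patience` parameter are assumptions for illustration; the study itself relied on the MATLAB toolbox, whose exact settings are not given in the text.

```python
import random


def split_indices(n_samples, fractions=(0.70, 0.15, 0.15), seed=0):
    """Randomly divide sample indices into training, validation and test groups."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    n_train = round(fractions[0] * n_samples)
    n_val = round(fractions[1] * n_samples)
    train = indices[:n_train]
    val = indices[n_train:n_train + n_val]
    test = indices[n_train + n_val:]  # whatever remains
    return train, val, test


def early_stopping_epoch(validation_errors, patience=1):
    """Return the epoch whose weights would be kept.

    Training stops once the validation error has failed to improve for
    `patience` consecutive epochs; the best epoch seen so far is returned.
    """
    best_epoch, best_error, bad_epochs = 0, float("inf"), 0
    for epoch, error in enumerate(validation_errors):
        if error < best_error:
            best_epoch, best_error, bad_epochs = epoch, error, 0
        else:
            bad_epochs += 1
            if bad_epochs >= patience:
                break
    return best_epoch


train_idx, val_idx, test_idx = split_indices(100)
print(len(train_idx), len(val_idx), len(test_idx))  # 70 15 15

# Made-up validation errors: they start rising at epoch 3, so epoch 2 is kept.
print(early_stopping_epoch([0.9, 0.5, 0.3, 0.35, 0.2]))  # 2
```

With `patience=1` the later, lower error at epoch 4 is never reached; a larger patience trades extra training time for robustness to noisy validation curves.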
Results and discussion
Table: The structure and specification of ANN. Rows: No. of hidden layers; No. of neurons in the input layer; No. of neurons in the hidden layer; No. of neurons in the output layer; No. of epochs; Adaption learning function (Levenberg-Marquardt (LM) algorithm).
Where meas and pred stand for measured and predicted values respectively.
Where N is the number of cases.
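The two error measures referred to here can be computed as in the sketch below. The definitions used are assumptions: ΔV is taken as measured minus predicted, and AAD% as the mean absolute relative deviation multiplied by 100, which is a common convention but may differ in sign or normalization from the authors' equations. The sample values are made up.

```python
def error_differences(measured, predicted):
    """Per-case error difference, assumed here to be V_meas - V_pred."""
    if len(measured) != len(predicted):
        raise ValueError("measured and predicted must have the same length")
    return [m - p for m, p in zip(measured, predicted)]


def aad_percent(measured, predicted):
    """Average absolute deviation in percent over N cases (assumed definition)."""
    if len(measured) != len(predicted):
        raise ValueError("measured and predicted must have the same length")
    n = len(measured)
    return 100.0 / n * sum(abs(m - p) / m for m, p in zip(measured, predicted))


# Made-up ESR values, for illustration only.
v_meas = [10.0, 20.0]
v_pred = [9.0, 22.0]
delta_v = error_differences(v_meas, v_pred)
print(delta_v, max(delta_v), min(delta_v))
print(aad_percent(v_meas, v_pred))
```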
Table: The maximum and minimum of ΔV and AAD%.
Table: Comparison of the measured ESR ($V_{meas}$) with predicted ESR ($V_{pred}$), error difference, error deviation. Columns: annual sunshine hours (hours); annual average relative humidity (%); annual average temperature (°C); annual average precipitation (mm).
Table: Predicted reference ESR values. Columns: annual sunshine hours (hours); annual average relative humidity (%); annual average temperature (°C); annual average precipitation (mm).
The original data, result data and ANN model parameters can be downloaded from additional file 1.
It is important to explore the complex relationship between local ESR values and geographical factors. Studies have shown that ESR values decrease with increasing altitude because oxygen content gradually decreases as altitude rises [10–12]. As a result, the number of red blood cells increases, inducing a fall in the ESR reference value in healthy subjects [11, 18]. The decrease in the ESR reference value is correlated with a decrease in temperature, humidity and precipitation, all of which affect annual sunshine hours through seasonal variation. Therefore, while incorporating all the aforementioned geographical factors can help explain ESR differences within similar altitudes, the relationship is nonlinear and complicated, hence the use of ANN to model the dependent relationships.
In this study, a multi-layer feed-forward neural network model has been developed to predict local reference ESR values, taking into account the corresponding local geographical factors. The network is trained using training data, after which the trained network is used to predict unseen reference ESR values. The results show that predicted values are in statistical agreement with measured values. The use of ANN is an effective method for simulating and predicting reference ESR values because of its ability to model nonlinear and complex relationships. The main advantage of our proposed method is the simplicity and stability of the model structure. Although reference ESR values can be predicted from geographical factors by using artificial neural networks, the reason why reference ESR values vary with geographical factors should be further explored.
The authors would like to thank Olaf Menzer, Arvis Sulovari and the two anonymous reviewers for their valuable comments and feedback. This study was supported by NSFC (No: 40971060).
- Zacharski L, Kyle RA: Significance of extreme elevation of erythrocyte sedimentation rate. J Amer Med Assoc. 1967, 202: 264. 10.1001/jama.1967.03130170064008.
- Wyler DJ: Diagnostic implications of markedly elevated erythrocyte sedimentation-rate - re-evaluation. Southern Med J. 1977, 70: 1428-1430. 10.1097/00007611-197712000-00015.
- Ropes MW, Rossmeisl E, Bauer W: The relationship between the erythrocyte sedimentation rate and the plasma proteins. J Clin Invest. 1939, 18: 791-798. 10.1172/JCI101096.
- Fahraeus R: The suspension stability of the blood. Physiol Rev. 1929, 9: 241-274.
- Olshaker JS, Jerrard DA: The erythrocyte sedimentation rate. J Emerg Med. 1997, 15: 869-874. 10.1016/S0736-4679(97)00197-2.
- Ansell B, Bywaters EGL: The unexplained high erythrocyte sedimentation rate. Brit Med J. 1958, 1: 372-374. 10.1136/bmj.1.5067.372.
- Pincherle G, Shanks J: Value of the erythrocyte sedimentation rate as a screening test. Brit J Prev Soc Med. 1967, 21: 133-136.
- Miller A, Green M, Robinson D: Simple rule for calculating normal erythrocyte sedimentation-rate. Brit Med J. 1983, 286: 266-266.
- Hilder F, Gunz F: The effect of age on normal values of the Westergren sedimentation rate. J Clin Path. 1964, 17: 292-293. 10.1136/jcp.17.3.292.
- Ge M, Ren ZY, Yang QS, Wei HY: The relationship between reference value of old people's erythrocyte sedimentation rate and altitude. Clin Hemorheol Micro. 2001, 24: 155-159.
- Ge MA: Reference value of younger people's erythrocyte sedimentation rate and altitude. J Lab Clin Med. 2004, 143: 367-368. 10.1016/j.lab.2004.03.006.
- Ge M, Yan Y, Ren ZY, Guo CL, Ma JF, Huang P: The relationship between normal erythrocyte sedimentation rate of Chinese young people and geographical factors. Clin Hemorheol Micro. 1999, 20: 151-157.
- Li X, Yeh AG: Neural-network-based cellular automata for simulating multiple land use changes using GIS. Int J Geogr Inf Sci. 2002, 16: 323-343. 10.1080/13658810210137004.
- Foody GM: Relating the land-cover composition of mixed pixels to artificial neural network classification output. Photogramm Eng Rem S. 1996, 62: 491-499.
- Rumelhart DE, Hinton GE, Williams RJ: Learning representations by back-propagating errors. Nature. 1986, 323: 533-536. 10.1038/323533a0.
- Hepner GF, Logan T, Ritter N, Bryant N: Artificial neural network classification using a minimal training set - comparison to conventional supervised classification. Photogramm Eng Rem S. 1990, 56: 469-473.
- Gong P: Integrated analysis of spatial data from multiple sources: using evidential reasoning and artificial neural network techniques for geological mapping. Photogramm Eng Rem S. 1996, 62: 513-523.
- Vennapusa B, Cruz LDL, Shah H, Michalski V, Zhang QY: Erythrocyte sedimentation rate (ESR) measured by the Streck ESR-Auto Plus is higher than with the Sediplast Westergren method. Am J Clin Pathol. 2011, 135: 386-390. 10.1309/AJCP48YXBDGTGXEV.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.