Chapter 4 Bayesian geostatistical modeling to assess spatio- spatio-temporal variations and elapsing time for malaria incidence
4.2.4 Determining important lags using Bayesian variable selection We applied Bayesian variable selection to determine the most important lag time
Where is the categorical variable for the covariate corresponding to coefficient and is the intercept for locations . The model describes the relationship between an independent value of and the corresponding dependent mean . This is summarized as . The model gives the expected of malaria cases given the corresponding value of each categorical variable at location s.
4.2.4 Determining important lags using Bayesian variable selection We applied Bayesian variable selection to determine the most important lag time between environmental factors and the onset of malaria incidence. We used a Stochastic Search Variable Selection method that tries to find those independent variables that are greatly associated with the outcome variable of interest thus allowing us to fit the model only for those variables that are important in the final model. The set of predictors were fixed into polynomial function describing the distribution of each set of predictors where the third power was selected following first stage testing of the different polynomials. The model was then restricted to power 4 for all the predictors comprising of LST, NDVI and Rainfall. For each of the polynomial functions we introduced a binary indicator with 50% chance of inclusion into the final model by restricting the variable selection to a Bernoulli distribution with probability of inclusion whereby the best set of covariates was indicated by the model with the highest posterior probability ranging from 0 to 1. Any variables with coefficients above 50 % were selected for the final model. An inverse gamma prior was used. To enable prediction we ran the model for 444 parameters using Markov Chain Monte Carlo (MCMC) where prior distributions were assigned to the parameters in order to complete the model formulation. We then applied Bayesian kriging to predict the malaria incidence risk at unsampled locations and produce a parasitaemia risk map at 1 km spatial resolution. We used a randomly selected sample of 150 locations as a training set for fitting the final prediction model.
4.3 Results 69 4.3 Results
4.3.1 Bayesian geostatistical modeling
Spatial variations of malaria incidence with climatic and environmental factors were estimated for each full transmission season in Swaziland. Results were predictions of malaria incidence presented in smoothed surfaces showing monthly variations in malaria incidence making it possible to visualize months of initial and peak transmission in the country. The predictions were based on bi-weeks polynomial lags of up to power 4 which were aggregated up to three months. There was a positive association in increase of temperature, rainfall and NDVI and malaria cases at polynomial lags of up to three months. For instance the current bi-week (LST day[1]) was positively associated with malaria incidence 2.18 (95% CI: 0.98 to 3.19) while the first lag or power 1 (LST day[2]) was negatively associated with malaria cases -2.63 (95% CI: -2.89 to -2.34).
Interestingly, third bi-week or power 2 (LST day[3]) and fourth bi-week or power 3 (LST day[4]) were again positively associated with malaria cases (0.1, 95% CI: 0.05 to 0.17 and 0.22, 95% CI: 0.22 to 0.2357) until the situation start to change in bi-week or power 4 (LST day[5]) which is negatively associated. In LST night, the effects of the lags was different in the sense that the current week and the first lag were not associated with malaria incidence until the second bi-week or power 2 (LST night[3]) which was negatively associated with malaria (-0.12 95% CI: -0.13 to -0.10). The third bi-week or power 4 was also positively associated with malaria which again changes in bi-week of power 4 with negative association. On the other hand NDVI was negatively associated with malaria until the third power (NDVI[4]) which was yet again positively associated (0.06, 95% CI: 0.05 to 0.06). Rain showed negative association in the power 1 (Rain[2]), power 3 rain (Rain[4] and power 4 (Rain[5]) and was only positively associated with malaria in power 2 (Rain[3]) with mean 0.23 (95% CI: 0.22 to 0.25). The results are summarized in Table 4.3 which shows the posterior probabilities of the variables polynomial functions ( 2)
4.3 Results 70 Table 4.3: Posterior estimates of the distributed lags constrained to power four
Variable( 2) mean sd val2.5pc median val97.5pc LST night[4] 0.02868 0.00142 0.02657 0.02925 0.03084 LST night[5] -0.00271 3.32E-04 -0.00326 -0.00258 -0.00221
NDVI[1] -0.3156 0.3682 -0.996 -0.2119 0.2182
NDVI[2] -0.2407 0.1474 -0.4127 -0.2853 0.008079
NDVI[3] 0.001152 0.01623 -0.01974 -0.00238 0.04276
NDVI[4] 0.06359 0.003442 0.05515 0.06441 0.06808
NDVI[5] -0.00881 2.29E-04 -0.00917 -0.00876 -0.00847
The bi-week lags were also fitted into the negative binomial model as fixed covariates in order to assess the effects of each lag on malaria cases. Other fixed non time varying covariates like altitude, waterbody and seasonality, were also added into the model. The earlier lags of the 2nd and 3rd bi-week for LST day were negatively associated with malaria compared to later lags of the 4th, 5th and 6th bi-weeks. The effects changes from a negative association to a positive one after about one month.
Interestingly LST night was positively associated with malaria cases in the 2nd and 3rd bi-week lags with means 0.49 (95% CI: 0.17 to 0.91), 1.9 (95% CI: 1.55 to 2.52) and 1.66 (95% CI: 1.39 to 1.62) respectively which jointly are also equivalent to an effect of about one month. Similarly NDVI was negatively associated with malaria in the earlier lags of about 1 month (bi-week lags 1, 2 & 3) and was positively associated in the latter lags of the 2nd and 3rd months (bi-week lags 4, 5 & 6). Rainfall was negatively associated with malaria case on the 2nd bi-week lag and positively associated in the 5th and 6th lags of the 2nd and 3rd months with means 0.35 (95% CI:
0.09 to 0.93) and 0.36 (CI: 0.16 to 0.67) respectively. The results are summarized in Table 4.4 and the resultant maps are displayed in Figures 4.3.1, 4.3.2 and 4.3.3.
4.3 Results 71
Table 4.4: Posterior probabilities for fixed bi-week lags of environmental factors
variable Lag mean sd val2.5pc median val97.5pc
LST day Lag 1 -0.1451 0.445 -1 -0.05983 0.5578
Lag 2 -1.297 0.2059 -1.739 -1.261 -0.9551
Lag 3 -0.9561 0.09027 -1.122 -0.9623 -0.7872
Lag 4 0.4922 0.2407 0.1771 0.4447 0.9126
Lag 5 1.963 0.3005 1.555 1.892 2.528
Lag 6 1.668 0.1928 1.393 1.625 2.091
Lag 7 -2.883 0.2861 -3.391 -2.884 -2.448
LST night Lag 1 0.1838 0.1798 -0.1447 0.2328 0.4541
Lag 2 0.2475 0.07481 0.1214 0.2427 0.4208
Lag 3 0.2718 0.1498 0.03271 0.2404 0.577
Lag 4 0.2662 0.199 -0.0304 0.1975 0.6114
Lag 5 0.1751 0.1532 -0.04733 0.1231 0.4599
Lag 6 -0.1222 0.1056 -0.3501 -0.1102 0.06071
Lag 7 -0.8112 0.479 -1.7 -0.6842 -0.03606
NDVI Lag 1 -0.5004 0.2157 -0.9203 -0.459 -0.1579
Lag 2 -0.4247 0.07643 -0.5662 -0.4235 -0.27 Lag 3 -0.02417 0.112 -0.1951 -0.05566 0.1894
Lag 4 0.554 0.1873 0.3336 0.4903 0.9159
Lag 5 0.9512 0.1771 0.7373 0.8958 1.299
Lag 6 0.5974 0.0683 0.4687 0.595 0.7303
Lag 7 -1.289 0.3305 -1.922 -1.184 -0.8273
Rain Lag 1 -0.1135 0.3018 -0.9562 -0.02793 0.2342
Lag 2 -0.2749 0.1329 -0.5975 -0.2561 -0.07677 Lag 3 -0.1505 0.09798 -0.3069 -0.1598 0.08106
Lag 4 0.1186 0.1824 -0.1032 0.06273 0.6572
Lag 5 0.3595 0.2065 0.09986 0.2959 0.9364
Lag 6 0.3677 0.1306 0.1619 0.3469 0.6715
Lag 7 -0.0928 0.1959 -0.5978 -0.0567 0.189
Wet season - 1.532 0.9761 0.04397 1.381 3.896
Dry season - -0.07404 9.999 -19.6 -0.07795 19.65
Altitude - -0.2638 0.1592 -0.5927 -0.2573 0.03222
Waterbody - 0.01113 0.09569 -0.1697 0.007787 0.2111
4.3 Results 72
Fi
Figure 4.3.1: Predicted malaria incidence for July-October
4.3 Results 73
Figure 4.3.2: Predicted malaria incidence for November-February .
4.3 Results 74
Figure 4.3.3: Predicted malaria incidence for March-June
4.4 Discussion 75 4.4 Discussion
This study showed a consistent spatio-temporal correlation between climatic factors and malaria incidence risk and as such the maps provide an explicit guide for resource optimization as they show the areas to be targeted with intervention to achieve high impact. While high incidence risk is predominantly in the eastern lowlands of the country its magnitude varies from month to month and the results also suggest that additional non-climatic factors including socio-economic conditions, elevation, etc.
also influence malaria transmission. Most of the areas at risk are below an altitude of 400 meters and also dominated by rural settlements. Although the influence of climatic and environmental factors on malaria transmission is known, this study is the first to identify a geostatistically significant correlation between climatic factors and malaria incidence risk in Swaziland. The maps depict a considerable month to month fluctuation in malaria incidence with highest incidence rates occurring in September and January respectively. In sharp contrast, very low incidence rates were observed for the months of February and March. It can be noted that a high proportion of cases that are seen in the months of December to March are usually a result of importation from nearby endemic countries, hence it can be concluded that local transmission essentially begin in the months April to November.
At three months lags, the study was able to quantify, the delay or buildup period for climatic factors and onset of malaria cases. Therefore, the algorithm used in this modeling can be adopted and used by the control programme for short to medium term forecasting of epidemics thus enabling intervention measures to be timely and well placed. Since the study produced the first monthly explicit maps of spatial variations in the country, these can be used as on-the-ground guide for not only timing interventions but also identifying priority areas for high impact as shown in the smoothed maps of monthly predicted malaria incidence in Figures 4.3.1, 4.3.2 and 4.3.3. These predicted maps illustrated spatial variations between months with areas of high predicted malaria incidence risk occurring in the eastern low part of Swaziland and a few locations with low predicted malaria incidence spreading towards the western part of the country. Of note is that the spread towards the west follows areas of low altitude which are channels with the same elevation as the eastern high incidence risk areas.
4.4 Discussion 76
4.4.1 Conclusions
The Bayesian modeling methods developed and used in this study can be built on and modified for rapid forecasting models to assist countries with very low malaria incidence rates understand the micro epidemiology of the disease over very small areas and therefore optimally deploy malaria interventions in accordance with the severity of the observed malaria episodes. High resolution maps are useful in analyzing factors influencing transmission for very low endemic settings like Swaziland. Results show that the country is on track towards elimination as average predicted malaria cases become less in some months and the area predicted to be at risk demonstrate clearly a case of diminishing malaria.