Lassa fever is caused by a viral haemorrhagic arenavirus that affects two to three million people in West Africa, causing a mortality of between 5,000 and 10,000 each year. The natural reservoir of Lassa virus is the multi-mammate rat Mastomys natalensis, which lives in houses and surrounding fields. With the aim of gaining more information to control this disease, we here carry out a spatial analysis of Lassa fever data from human cases and infected rodent hosts covering the period 1965–2007. Information on contemporary environmental conditions (temperature, rainfall, vegetation) was derived from NASA Terra MODIS satellite sensor data and other sources and for elevation from the GTOPO30 surface for the region from Senegal to the Congo. All multi-temporal data were analysed using temporal Fourier techniques to generate images of means, amplitudes and phases which were used as the predictor variables in the models. In addition, meteorological rainfall data collected between 1951 and 1989 were used to generate a synoptic rainfall surface for the same region.
Three different analyses (models) are presented, one superimposing Lassa fever outbreaks on the mean rainfall surface (Model 1) and the other two using non-linear discriminant analytical techniques. Model 2 selected variables in a step-wise inclusive fashion, and Model 3 used an information-theoretic approach in which many different random combinations of 10 variables were fitted to the Lassa fever data. Three combinations of absence:presence clusters were used in each of Models 2 and 3, the 2 absence:1 presence cluster combination giving what appeared to be the best result. Model 1 showed that the recorded outbreaks of Lassa fever in human populations occurred in zones receiving between 1,500 and 3,000 mm rainfall annually. Rainfall, and to a much lesser extent temperature variables, were most strongly selected in both Models 2 and 3, and neither vegetation nor altitude seemed particularly important. Both Models 2 and 3 produced mean kappa values in excess of 0.91 (Model 2) or 0.86 (Model 3), making them ‘Excellent’.
The Lassa fever areas predicted by the models cover approximately 80% of each of Sierra Leone and Liberia, 50% of Guinea, 40% of Nigeria, 30% of each of Côte d'Ivoire, Togo and Benin, and 10% of Ghana.
Previous studies on the eco-epidemiology of Lassa fever in Guinea, West Africa, have shown that the reservoir is two to three times more infected by Lassa virus in the rainy season than in the dry season. None of the intrinsic variables of the murine population, such as abundance or reproduction, was able to explain this seasonal variation in prevalence. We therefore here investigate the importance of extrinsic environmental variables, partly influenced by the idea that in the case of nephropathia epidemica in Europe contamination of the environment, and therefore survival of the pathogen outside the host, appears to be an important factor in this disease's epidemiology. We therefore made an extensive review of the literature, gathering information about the geographical location of sites where Lassa fever has been certainly identified. Environmental data for these sites (rainfall, temperature, vegetation and altitude) were gathered from a variety of sources, both satellites and ground-based meteorological stations. Several statistical treatments were applied to produce Lassa ‘risk maps’. These maps all indicate a strong influence of rainfall, and a lesser influence of temperature in defining high risk areas. The area of greatest risk is located between Guinea and Cameroon.
Citation: Fichet-Calvet E, Rogers DJ (2009) Risk Maps of Lassa Fever in West Africa. PLoS Negl Trop Dis 3(3): e388. doi:10.1371/journal.pntd.0000388
Editor: Robert Tesh, University of Texas Medical Branch, United States of America
Received: September 26, 2008; Accepted: February 5, 2009; Published: March 3, 2009
Copyright: © 2009 Fichet-Calvet, Rogers. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: We are grateful for the support of the UK's Joint Environment and Human Health Programme (funded by NERC, Defra, EA, MOD and the MRC). The statistical models were developed as part of the EU FP6 project grant code GOCE-2003-010284 EDEN. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Lassa fever (LF) is a viral haemorrhagic fever the pathogenic agent of which is an arenavirus Lassa virus (LASV) first discovered in 1969 in Nigeria, in a missionary nurse living in Lassa, a village close to the border with Cameroon . Lassa fever is widespread in West Africa, affecting 2 million persons per annum with 5,000–10,000 fatalities annually . Since its initial discovery, nosocomial outbreaks of Lassa fever have occurred repeatedly in Sierra Leone: Panguma, Kenema, 1971–83, 1997, Liberia: Zorzor, 1972; Phebe 1972, 1977, 1982; Ganta 1977, 1982 and Nigeria: Jos, 1970, 1993; Onitsha, 1974; Zonkwa, 1975; Vom, 1975–77, Imo, 1989; Lafia, 1993; and Irrua, 2004 ,,,,,,. In Guinea, some acute but isolated cases were recorded in hospitals  and a single rural outbreak was recorded on the Sierra Leone border in 1982–83 . Between these two areas, namely in Côte d'Ivoire, Ghana, Togo and Benin, no outbreak has ever been recorded, though isolated cases show evidence of viral circulation in that area ,,. Lassa fever therefore appears to have 2 geographically separate endemic areas: the Mano River region (Guinea, Sierra Leone, Liberia) in the West, and Nigeria in the East.
The reservoir host of this virus is the multimammate rat, Mastomys natalensis, which was found infected for the first time in Sierra Leone and in Nigeria in 1972 ,, and recently in Guinea . In Upper Guinea, these commensal rodents aggregate in houses during the dry season, and disperse into the surrounding fields in the rainy season, foraging in cultivated areas before harvesting . Villages where LASV-positive rodents have been trapped are all located in rain forest areas or in the transition zone between forest and savannah, within the 1500 mm rainfall isohyet. Rainfall seems to be an important ecological factor because a recent longitudinal study in rodents demonstrated that LASV infection was two to three times higher in the rainy season than in the dry season . There are no studies to date indicating that the virus can survive better in humid than in dry soil, but evidence points in this direction. For example, the recent discovery of a new arenavirus in Mus minutoides (Kodoko virus ) and of hantavirus in Hylomyscus simus (Sangassou virus) in Guinea , were both made in rodents trapped in wet habitats, swamps or along river edges. In the USA, many new hantaviruses discovered within the last 15 years are found in damp or wet places such as arroyos or canyons, i.e. Black Creek canal virus, Blue river virus, El moro Canyon virus, Limestone Canyon virus. In the case of Sin Nombre virus, responsible for hemorrhagic fever with pulmonary syndrome, high risk areas are associated with higher elevation and mesic vegetation whereas low risk areas are associated with lower elevation and xeric vegetation. Soil moisture appears to be a key factor explaining the maintenance of this virus in high risk areas ,. In Europe, the transmission and persistence of Puumala virus, responsible for nephropathia epidemica, seems possible only if indirect transmission through a contaminated environment is included in a mathematical model. The combination of viral dynamics inside and outside the host, rodent demographic patterns and humid periods seems to explain the geographical distribution of this disease . These advances all indicate the possible importance of rainfall patterns and humidity for Lassa Fever. We present our analysis of LF in West Africa in three steps: a first univariate analysis linking LF with high rainfall areas (Model 1) and the other two, multivariate analyses quantifying associations between LASV presence and a number of environmental parameters, derived from earth-observing satellites, that lead to the production of the first predictive risk maps for Lassa fever. One of these multivariate modelling approaches uses step-wise variable selection procedures (Model 2) whilst the other uses random combinations of predictor variables to identify the individual best predictors of LASV presence and absence (Model 3).
Materials and Methods
Nosocomial outbreaks and prevalences of Lassa fever in humans were derived from the dataset, and were placed on a map of West and Central Africa (see table 1 for the detailed references by country). The null prevalences recorded in Cameroon, CAR, Gabon and Congo were derived from samples taken in towns ,,, whereas the low prevalence of 5% recorded in Pool region in Congo came from samples taken in villages . Elsewhere, prevalences appear as a mean, estimated regionally from several villages or from hospital staffs. Data on human infections cover the period 1965 to 2007.
Table 1. Positive localities recorded from humans and rodents indicating the presence of Lassa virus in West Africa.doi:10.1371/journal.pntd.0000388.t001
A synoptic rainfall map of West Africa was obtained from L'Hôte&Mahé  and is shown in Figure 1. This synoptic map is derived from rainfall records for the period 1951 to 1989. In West Africa, the highest rainfall regions are located either side of the Dahomey gap, which separates the 2 great rainforest zones of Guinea and Congo, each region receiving more than 1500 mm of rainfall per year. On the western side, the region includes Guinea, Sierra Leone, Liberia, the extreme West of Côte d'Ivoire and coastal Ghana. The eastern side includes the Congolese zone and south eastern Nigeria (Figure 1).
Figure 1. West and Central Africa mean annual rainfall (1951–1989 ), Lassa fever nosocomial outbreaks (stars) and human seroprevalence (numbers in %).doi:10.1371/journal.pntd.0000388.g001
Models 2 and 3
The new Lassa fever database was developed with all indications of Lassa fever presence in West Africa in the period 1965 to 2007. These indications included sero- and virologically positive rodents and human beings. For the rodents, all the localities where M. natalensis was screened for LASV were included. Localities were defined as positive when at least one M. natalensis was positive, and negative when none was infected. Because of the heterogeneous data for humans, the database was more complicated to establish. The localities were defined as positive when clinical cases were confirmed by a laboratory test or when sampled populations had a seroprevalence ≥10%. The ‘negative’ localities were defined when seroprevalence was <10%. This cut off was defined on the basis of the combined screening of both rodents and humans in the same locality. Rodents were always negative when seroprevalence in humans was <10%. This low human prevalence could be due to the movement of infected humans into an area without infection, whereas one positive rodent always indicates local transmission of LASV. Rodent and human data were acquired from an extensive review of the literature (Table 1).
The latitude and longitude of each recorded locality were then derived from the National Geographic Agency database (http://earth-info.nga.mil/gns/html/namefiles.htm). Because data on rodent infections came mostly or only from targeted samples of these animals, whereas it is assumed that the distribution of human infections is more likely to reflect the distribution of Lassa fever in humans, only data referring to the latter were used in the models presented here. Data referring to humans and rodents were also modelled, but are not presented here because they add only 8 new points, and make little difference to the final map (all data are recorded in Table 1).
Sets of environmental data were derived from remotely sensed imagery from the MODIS instrument on board the NASA Terra satellite for the period 2001–2005  and the processed version 4 of these data were downloaded from NASA's EOS data gateway (http://edcimswww.cr.usgs.gov/pub/imswelcome/). A complete description of the MODIS satellite data used to make these maps, and their processing, is provided by Scharlemann et al 2008 . Data for daytime and night-time land surface temperature (dLST and nLST respectively; MOD11A2 datasets) are available as 8-day composites (compositing removes many of the problems associated with cloud contamination in individual images) , whilst data for the Middle Infra Red channel (MODIS band 7, 2105–2155 nm, closest spectrally to the NOAA/AVHRR Channel 3 found useful in previous distribution studies), for the Normalised Difference Vegetation Index (NDVI = [near infrared (NIR)−RED]/[NIR+RED], where NIR is MODIS band 2 and RED is band 1, 841–876 nm and 620–670 nm, respectively) and for the Enhanced Vegetation Index (EVI = (2.5 * [[NIR−RED]/[NIR+6.0 * RED−7.5 * BLUE+1.0]], where BLUE is MODIS band 3, 459–479 nm and NIR and RED are as described above for NDVI) were all derived from 16-day composites after nadir Bidirectional Reflectance Distribution Function (BRDF)-adjustment (MOD43B4 dataset) . The BRDF adjustment removes directional effects of view angle and illumination, providing reflectance values as if every pixel were viewed from nadir, an important correction especially for any channel involving human visible wavelengths. All the MODIS data were available at a nominal resolution of c. 1 km at the equator and in the Sinusoidal projection. In addition to thermal and vegetation index data from MODIS a series of monthly rainfall images was obtained from the CMORPH project that uses a variety of satellite data to generate precipitation estimates within the latitudinal range of ±60 degrees .
The MODIS and CMORPH data were then temporal Fourier processed to extract, for each channel, a mean, the amplitudes and phases of the annual, bi-annual and tri-annual cycles (i.e. the Fourier harmonics corresponding to these frequencies), the minimum and maximum of the fitted signal and the variance of the original signal. Temporal Fourier processing produces a set of orthogonal (i.e. uncorrelated) variables that capture important elements of habitat seasonality that is often an important driver of vector-borne and other diseases ,; the particular problems of temporal Fourier processing of MODIS data (and their solutions) are described in Scharlemann et al 2008 . In addition to the satellite variables, the descriptor datasets also included a digital elevation image (DEM) derived from GTOPO30 . All the Fourier variables and the GTOPO30 layer were resampled (by bi-linear interpolation) initially to a resolution of 1/120th degree in the Geographical (latitude/longitude) ‘projection’ and these were then progressively averaged (1/60th, 1/30th etc.) to a resolution of 1/15th of a degree, giving a total of 51 Fourier and other (DEM) variables for modelling purposes. All modelling was carried out at this resolution, at which there were 94 unique database records of LASV presence in humans across West Africa. This total number of datapoints is less than the number of human records in Table 1, because some of the records fell within the same pixels at the spatial resolution of the analysis.
There are many different approaches to mapping species' distributions, recently reviewed by Elith et al . The approach adopted here is described in detail in Rogers 2006  and is based on non-linear maximum likelihood discriminant analysis techniques. For this approach we needed to identify not only areas of presence of each of the cases (from the database), but also equivalent areas of absence. There were insufficient records of absence in the database itself, so an alternative approach was followed, and one thousand points no closer that 0.5 degrees and no farther than 10 degrees away from any of the presence points in the database were chosen at random across West Africa. Because the rodent hosts occur much more extensively across West Africa than does LF, many of these randomly generated absence points fell within the distribution limits of these vertebrate hosts. Thus the models constructed were designed specifically to distinguish the presence and absence of the disease in humans, and not of the hosts of the disease. All satellite and other data were then extracted for both the presence and absence points (hereafter the ‘training set’). These data were first clustered within SPSS for Windows (version 13.0, copyright SPSS Inc., 1989–2004), using the means maxima and minima of each of the MODIS channels, and also the DEM, to produce cluster assignments of the presence and absence data that ran from 1 to 8 clusters each. Within the model the user selected the required combination of numbers of presence and absence clusters at the start of each model run. The LF models described here all used two absence and either one or two presence clusters.
Because of the incomplete nature of the presence (and presumably absence) data in each dataset, it was decided to bootstrap sample the training set data one hundred times, to produce a series of modelled predictions which were averaged to produce the final output map for the disease. Each bootstrap sample contained equal numbers of presence and absence points (this tends to maximise model accuracy; ) randomly drawn from the training set, sampled with replacement. The relationship between the bootstrap sample and the training set is imagined to be the same as that between the training set and the entire real world of which the training set itself is a sample. By modelling each bootstrap sample separately, and then averaging the results, it should be possible to establish the variability of model predictions arising from the incomplete sampling of the real world that the training set represents.
Model 2 variable selection.
Each model involved step-wise inclusive selection of the predictor variables to maximise a goodness of fit criterion; kappa the index of agreement, the area under the curve (AUC) or the Akaike corrected Information Criterion (AICc), all described in Rogers 2006 ; a maximum of ten predictor variables was selected for each bootstrap model, but model efficiency (as judged by the AICc) was often highest with fewer than 10 variables; where this applied the final prediction was made using this lower total number of variables. Results for each of the 100 models were kept separate and later brought together to generate accuracy statistics, and to discover whether or not particular variables were consistently included in the predictor datasets. This was done by establishing the mean ranking of each variable in the model selections. The variable selected first in any model run was given a rank of 1, the one selected second a rank of 2, and so on, up to rank 10 for the tenth variable. All non-selected variables in that model run were given a rank of 11. By averaging the ranks of each variable across all models it was possible to establish that variable's importance in the overall predictions.
Model 3 variable selection.
The problems of step-wise variable selection are well documented; the occurrence of one variable within a dataset can exclude a closely correlated variable that may in fact be more important in determining a disease's distribution. The end product of step-wise selection is therefore a group of variables that are often not strongly correlated with each other, but which are more strongly correlated with those variables left out of the selection. The question then arises about the real importance of the individual variables in determining any particular distribution. Burnham and Anderson  suggest a way of answering this important question, and this was followed here. Many random combinations of 10 variables from the entire predictor dataset were made, sampling without replacement (i.e. no variable occurred twice in the same combination), with each variable finally occurring one thousand times across all combinations. Each combination (of 10 variables) was then used to construct a model of LASV distribution using the same bootstrap samples as before. Model accuracy was measured by the corrected Akaike Information Criterion (AICc, a smaller value indicating a better model). Once all the models had been constructed, the mean AICc value of all models containing each variable in turn was calculated, and these mean values were then finally ranked, lowest to highest. The variable giving the lowest mean AICc is then regarded as the ‘best’ predictor of LASV since, regardless of the other (random) variables with which it was associated in the full set of models in which it occurred, those models were overall better than models involving any other single variable. The variable giving the next lowest AICc was the second best individual predictor; and so on.
The difference between the step-wise selected sets of variables (Model 2) and the list of top-ten variables produced by the combination method described above (Model 3) is analogous to the difference between a team (e.g. of footballers) and the top ten runners in an Olympic race. The team players co-operate with each other to win the football match; whilst no individual player may stand out from all the rest, it is the individual's ability to work well with the others that wins the match. In contrast, each runner in a race is competing against all the others. The winner is clearly better than the one who came second who, in turn is better than the one who came third; there is no cooperation between them. They are all collectively better than all the other runners in the race, but this is a result of individual, not collective, ability. It is unlikely that the top ten runners in an Olympic race would make a very good, co-operative team of footballers (and vice versa), so the team selection and the individual selection methods explored in Models 2 and 3 are unlikely to come up with the same results. Differences between them may however be illuminating.
In both Models 2 and 3 the selected sets of predictor variables were used within each bootstrap model to generate an image of the posterior probability for each image pixel of belonging to the category of presence pixels as defined within that model. Posterior probabilities are on the scale from 0.0 to 1.0 and a probability in excess of 0.5 is taken as indicating presence. The 100 images from each set of bootstrap samples in each model run were then averaged to produce a single output risk map for the disease.
Figure 1 shows the location of LF outbreaks (or areas of high human seroprevalence) from 1951 to 1989. The Jos plateau in Nigeria receives more rainfall than the surrounding areas and is disconnected from the wet coastal area by lowland areas of lower rainfall. Only the initial case in Lassa (800 mm/year) is located outside the high rainfall area. The map in Figure 1 suggests that areas with between 1200 mm and 1500 mm of rainfall per year are at relatively low risk of LF; areas with above 1500 mm have a much higher risk and, finally, areas with in excess of 3000 mm of rainfall annually appear to be at zero risk (i.e. had no outbreaks of LF in that period), although these very high rainfall areas are not widespread.
The predictor variables chosen for the three different cluster versions of Model 2 are shown in Table 2 with their mean ranks across the 100 bootstrap models for each. The average accuracy of these models is shown in Table 3 and the mean values of the selected predictor variables for one of the top models from the 2 Absence: 1 Presence cluster combination is shown in Table 4. Figure 2 shows the mean predicted risk map of LF from the 100 bootstrap models using this same combination of absence and presence clusters. With only one cluster each, LF appeared to be over-predicted whilst with two clusters each LF appeared to be more strongly limited to the training set data points and their immediate surrounding areas (i.e. the disease was possibly under-predicted). The 2 Absence:1 Presence cluster combination was therefore considered to give the best overall result.
Figure 2. Mean predicted Lassa risk map for West Africa from the Model 2 series with two absence and one presence clusters, with positive localities indicated by stars.
The posterior probability colour scale, from 0.0 (no risk) to 1.0 (highest risk) is shown as an inset. Grey areas are either areas with no suitable imagery (because of cloud contamination; coastal Nigeria and Cameroon) or else are so far from any of the training set sites in their environmental conditions that no predictions are made for them.doi:10.1371/journal.pntd.0000388.g002
Table 2. Mean ranking of the key predictor variables (selected by minimising the AICc) across 100 bootstrap models for each Absence:Presence cluster combination for Model 2.doi:10.1371/journal.pntd.0000388.t002
Table 3. Mean accuracy statistics across 100 bootstrap models for each Absence:Presence cluster combination for Model 2 (see text for definitions) (variables selected by minimising the AICc).doi:10.1371/journal.pntd.0000388.t003
Table 4. Example of the mean values of the ten selected variables from one of the 100 bootstrap models for the 2 Absence:1 Presence cluster situation (Model 2).doi:10.1371/journal.pntd.0000388.t004
The rainfall variables were disproportionately selected by all cluster combinations in Model 1; each ‘top ten’ list in Table 2 contains four such variables, where the random expectation (5 satellite channels) is only two. At the same time, the vegetation index channels (NDVI and EVI) are under-represented, with only a single one of 20 such variables (10 Fourier variables per channel) chosen across all cluster combinations; the balance of the important predictor variables were thermal ones (either LST or MIR). The relatively high values for the average ranks of even the top variables in all cluster combinations in Table 2, however, reflects the fact that each of the 100 bootstrap samples gave rather different results in terms of the variables selected, and in their order of selection. This is a common feature of relatively small datasets.
Despite the variability in the selected predictor variables, mean model accuracies were very high (Table 3) with, as expected, model accuracy increasing with increasing cluster numbers. The mean values of kappa put all models well within the ‘Excellent’ category of the Landis and Koch  scale (where kappa<0.4 is ‘Poor’; 0.4<kappa<0.75 is ‘Good’; and kappa>0.75 is ‘Excellent’).
The mean values of the key predictor variables may differ considerably, or only by rather small amounts (Table 4). Table 4 shows that the mean values for the single clusters of presence points in the model are often intermediate between those of the two absence clusters. This applies to mean rainfall, night-time LST minimum, MIR phase 2 and daytime LST (mean and maximum). In other cases, mean values for the presence points are well outside those for either absence cluster. This applies to rainfall (amp1, amp3, phase1 and minimum) and NDVI phase 3. Concentrating on the important rainfall variables in Table 4 it is possible to suggest that LASV requires high (but not the highest) mean rainfall areas (rain mean), but with very high annual variation of this variable (rain amp1), and with peak rainfall occurring much later in the year (during August rather than during May or March, the months of peak rainfall of the absence clusters in Table 4, rain phase1). The significance of the higher amp3 rainfall value in Table 2 (the first selected variable) is unclear; often such higher harmonics act to modulate the lower frequency – annual or bi-annual – harmonics, and thus adjust the seasonal pattern of rainfall (extending or reducing high rainfall periods, depending on the timing of this tri-annual harmonic).
The predicted risk map (Figure 2) captures most of the presence points in the database (the grey areas in Figure 2 in southern Nigeria and Cameroon are regions where cloud contamination is so continuous that it was not possible to obtain either sufficient cloud-free images or their temporal Fourier derivatives for modelling; these are therefore areas where it is not possible to make predictions of risk). The predicted risk areas in Figure 2 contract towards the coast in the ‘Dahomey gap’ between the western and central forests of Africa (see Introduction) but are still more extensive than the rainfall map and data in Figure 1 suggest. In fact the satellite rainfall image (CMORPH mean, not shown) also indicates a lower mean rainfall area in this region, so that the positive LASV predictions for this area must arise from the values of other key predictor variables. The differences between Figure 1 and Figure 2 in the basin of the River Zaire, towards Central Africa, arise because these areas (though high in rainfall) are environmentally quite distinct from those of the training set area and so the risk map models classify them as ‘No prediction’ areas (coloured grey in Figure 2).
Tables 4 and 5 show results analogous to those of Tables 2 and 3 but for Model 3, where the important variables were identified using the combination method of Burnham and Anderson . This method highlights even more the importance of rainfall variables (only 8 out of the 30 variables in Table 5 are not directly rainfall related), with slightly different combinations in each case for the different cluster combinations. Overall model accuracies are still excellent (Table 6) though not quite as good as those for Model 2. Figure 3 shows the mean predicted risk map obtained by using in Model 3 the selected combination of the top 10 variables for the same 100 bootstrap samples that were used in Model 2 to generate Figure 2. Figure 3 is less equivocal about risk areas than is Figure 2 (i.e. there are fewer regions of intermediate probability of LASV risk) for the simple reason that the same 10 variables were used throughout, whereas different combinations of variables were often selected in the Model 2 models, giving more variable results. Figure 3 again captures most of the presence sites within the training set, with rather different predictions for the Dahomey Gap region than those in Figure 2.
Figure 3. Mean predicted Lassa risk map for West Africa from the Model 3 series with two absence and one presence clusters, with positive localities indicated by stars.
Other information as for Figure 2.doi:10.1371/journal.pntd.0000388.g003
Table 5. Mean ranking of the key predictor variables across 100 bootstrap models for each Absence:Presence cluster combination for Model 3.doi:10.1371/journal.pntd.0000388.t005
The question that comes immediately to mind is: why does Lassa fever occur only in West Africa, whereas the range of its vertebrate host extends into East and Southern Africa? This is a recurrent question for other rodent-borne diseases (such as plague and hemorrhagic fevers with renal or pulmonary syndrome; see  for a review), which are also much more restricted in their distributions than are their hosts. Our analyses here show quite clearly that Lassa fever requires a particular combination of high (but not the highest) rainfall, and with a particular form of variability and seasonal timing, whereas its hosts can and do occur over regions experiencing a much wider range of rainfall conditions. Temperature appears to be less important in determining LASV distribution, although there are large differences between different areas; for example the annual mean and maxima in high risk areas are 27°C and 32°C respectively, whereas in low risk areas the mean temperature was approx. 38°C. Such high temperatures are known to increase LASV decay . One curious feature of the present results is the seeming unimportance of vegetation variables in the predictor data sets. This lack of importance is not due to their strong correlation with rainfall variables (such a correlation might exclude them in step-wise inclusive variable selection), because Model 3 (using a method that avoids the problems of step-wise methods) independently and quite categorically failed to identify vegetation variables as important in determining LASV distribution. Taken together these results suggest that the survival of the virus outside of the vertebrate host might be a key to determining its distribution, and that this survival depends upon moisture or rainfall conditions above more or less all other environmental variables. This result differs from the conditions favouring other viral transmission; for example, low relative humidity and temperature favour avian influenza . In the case of Lassa, the virus appears to survive better in humid conditions, during the rainy season. Rodents will be more often contaminated during their frequent movements at this season, for mating or dispersing into the surrounding fields . Conversely, viral aerosol stability, seems to be higher when the humidity is lower , a condition that obviously occurs more frequently in the dry season. The experiments of Stephenson help to explain the numerous LF cases recorded in hospitals during the late dry season, between January and March in Sierra Leone and Nigeria (, Omilabu, pers. com.) but they do not necessarily throw much or any light on the persistence of Lassa fever in the general environment. We suggest that rainfall, within defined limits, is the single most important abiotic determinant of this persistence.
M. natalensis, the most important host of LASV, does not occur in the western part of the region, in coastal Guinea and Sierra Leone and west to the 12th meridian. Only M. erythroleucus occurs in these regions, and our surveys have always found it to be negative for LASV infections . The low human sero-prevalences recorded in these coastal areas are most likely due to the movement of people from highly endemic zones, or to human-to-human transmission. Towns and villages in these coastal areas, from Guinea to Gabon, have been invaded by the black rat Rattus rattus, and the domestic mouse, Mus musculus, probably taken there in historical times by Arab and European traders, explorers and colonisers. Absence of M. natalensis from coastal areas, for whatever reason (e.g. unsuitable habitats, or competition from other, non-Lassa-reservoir rodents), would explain the absence of Lassa fever in these areas, despite the apparently favourable (for LASV) climatic conditions (although the models suggest that some areas may be too wet for LASV). In Conakry for example, rodent sampling (330 specimens) showed that the most abundant species was M. musculus (70%), followed by R. rattus (25%) (unpublished data).
In East and South Africa, the same reservoir species is present but the virus is replaced by other Lassa-like viruses such as Ippy, Morogoro and Mopeia, found in M. natalensis in CAR, Tanzania, Mozambique and Zimbabwe (CRORA database in Pasteur Institute website, http://www.pasteur.fr/recherche/banques/CRORA/, ,,). These different Lassa –like viruses are not known to be pathogenic in humans and are considered ancestral by phylogenetic studies . The scenario of multiple infection with both Lassa-like and Lassa virus is highly unlikely, and so we consider that central and eastern Africa are Lassa free. This is supported by many negative serological studies in Cameroon, in CAR, Congo, Equatorial Guinea and Gabon ,,,. However, the situation in south-west Cameroon bordering Nigeria remains problematic because this zone appears to be at high risk according to Figure 2. This is a volcanic area, which could provide a geographic barrier (Mt Cameroon, 4100 m, and the volcano chain up to the Adamaoua plateau). Furthermore, another species of Mastomys is suspected to be present in this area, M. kollmannspergeri, which is found in Niger, NE Nigeria, N Cameroon, S. Sudan and Chad . In Zakouma National Park in Chad, some specimens were found in a village and in camps, indicating a potential synanthropy of this species . The predictive risk map in Figure 2 identifies the central parts of Cameroon and CAR as risky areas, where it is possible that other Lassa-like viruses could occur, intermediate between Ippy/Mobala and Lassa (Mobala is another Lassa-like virus found in Praomys sp., a closely related species to Mastomys spp, in CAR .).
According to the risk maps shown here, with the reservations noted above, the LF risk area covers approximately 80% of the area of each of Sierra Leone and Liberia, 50% of Guinea, 40% of Nigeria, 30% of each of Côte d'Ivoire, Togo and Benin and 10% of Ghana. Such maps help public health policies and research, in targeting disease control and studies in potentially infected areas.
Translation of the abstract into French by Elisabeth Fichet-Calvet
(0.02 MB DOC)
We are grateful for the support of the UK's Joint Environment and Human Health Programme (funded by NERC, Defra, EA, MOD and the MRC). Many thanks to Kekoura Koulemou (Guekedou hospital, Guinea) for detailed information on positive localities investigated in ter Meulen et al. 1996. The statistical models were developed as part of the EU FP6 project grant code GOCE-2003-010284 EDEN, and this article is catalogued by the EDEN Steering Committee as EDEN 129 (http://www.eden-fp6project.net/). The contents of this publication are the sole responsibility of the authors and do not reflect the views of any of the funding bodies.
Conceived and designed the experiments: EFC. Performed the experiments: EFC DJR. Analyzed the data: EFC DJR. Contributed reagents/materials/analysis tools: EFC DJR. Wrote the paper: EFC DJR. Compiled the literature source: EFC. Performed the statistical modelling: DJR.
- 1. Frame JD, Baldwin JMJ, Gocke DJ, Troup JM (1970) Lassa fever, a new virus disease of man from West Africa. Am J Trop Med Hyg 19: 670–676.
- 2. McCormick JB (1999) Lassa fever. In: Saluzzo JF, Dodet B, editors. Emergence and control of rodent-borne viral diseases. Elsevier. pp. 177–195.
- 3. Carey DE, Kemp GE, White HA, Pinneo L, Addy RF, et al. (1972) Lassa fever epidemiological aspects of the 1970 epidemic, Jos, Nigeria. T Roy Soc Trop Med H 66: 402–408.
- 4. Bowen GS, Tomori O, Wulff H, Casals J, Noonan A, et al. (1975) Lassa fever in Onitsha, East Central State, Nigeria, in 1974. B World Health Organ 52: 599–603.
- 5. Frame JD, Jahrling PB, Yalley-Ogunro JE, Monson MH (1984) Endemic Lassa fever in Liberia. II. Serological and virological findings in hospital patients. T Roy Soc Trop Med H 78: 656–60.
- 6. Fisher-Hoch SP, Tomori O, Nasidi A, Perez-Oronoz GI, Fakile Y, et al. (1995) Review of cases of nosocomial Lassa fever in Nigeria: the high price of poor medical practice. BMJ 311: 857–859.
- 7. Bajani MD, Tomori O, Rollin PE, Harry TO, Bukbuk ND, et al. (1997) A survey for antibodies to Lassa virus among health workers in Nigeria. T Roy Soc Trop Med H 91: 379–381.
- 8. McCormick JB, Fisher-Hoch SP (2002) Lassa fever. Curr Top Microbiol Immunol 262: 75–109.
- 9. Omilabu SA, Badaru SO, Okokhere P, Asogun D, Drosten C, et al. (2005) Lassa fever, Nigeria, 2003 and 2004. Emerg Infect Dis 11: 1642–4.
- 10. Bausch DG, Demby AH, Coulibaly M, Kanu J, Goba A, et al. (2001) Lassa fever in Guinea: I. Epidemiology of human disease and clinical observations. Vector Borne Zoonotic Dis 1: 269–281.
- 11. Boiro I, Lomonossov NN, Sotsinski VA, Constantinov OK, Tkachenko EA, et al. (1987) Eléments de recherches clinico-épidémiologiques et de laboratoire sur les fièvres hémorragiques en Guinée. B Soc Pathol Exot 80: 607–612.
- 12. Akoua-Koffi C, ter Meulen J, Legros D, Akran V, Aidara M, et al. (2006) Detection of anti-Lassa antibodies in the Western Forest area of the Ivory Coast. Med Trop 66: 465–8.
- 13. Gunther S, Lenz O (2004) Lassa virus. Crit Rev Clin Lab Sci 41: 339–90.
- 14. Saltzmann S (1978) La fièvre de Lassa. Neuchâtel: Editions des Groupes Missionnaires.
- 15. Monath TP, Newhouse VF, Kemp GE, Setzer HW, Cacciapuoti A (1974) Lassa virus isolation from Mastomys natalensis rodents during an epidemic in Sierra Leone. Science 185: 263–265.
- 16. Wulff H, Fabiyi A, Monath TP (1975) Recent isolation of Lassa virus from Nigerian rodents. B World Health Organ 52: 609–613.
- 17. Lecompte E, Fichet-Calvet E, Daffis S, Koulémou K, Sylla O, et al. (2006) Mastomys natalensis and Lassa fever, West Africa. Emerg Infect Dis 12: 1971–1974.
- 18. Fichet-Calvet E, Lecompte E, Koivogui L, Soropogui B, Dore A, et al. (2007) Fluctuation of Abundance and Lassa Virus Prevalence in Mastomys natalensis in Guinea, West Africa. Vector Borne Zoonotic Dis 7: 119–28.
- 19. Lecompte E, ter Meulen J, Emonet S, Daffis S, Charrel RN (2007) Genetic identification of Kodoko virus, a novel arenavirus of the African pigmy mouse (Mus Nannomys minutoides) in West Africa. Virology 364: 178–83.
- 20. Klempa B, Fichet-Calvet E, Lecompte E, Auste B, Aniskin V, et al. (2006) Hantavirus in African wood mouse, Guinea. Emerg Infect Dis 12: 838–40.
- 21. Glass GE, Cheek JE, Patz JA, Shields TM, Doyle TJ, et al. (2000) Using remotely sensed data to identify areas at risk for hantavirus pulmonary syndrome. Emerg Infect Dis 6:
- 22. Glass GE, Yates TL, Fine JB, Shields TM, Kendall JB, et al. (2002) Satellite imagery characterizes local animal reservoir populations of Sin Nombre virus in the southwestern United States. Proc Natl Acad Sci U S A 99: 16817–22.
- 23. Sauvage F, Langlais M, Yoccoz NG, Pontier D (2003) Modelling hantavirus in fluctuating populations of bank voles: the role of indirect transmission on virus persistence. J Anim Ecol 72: 1–13.
- 24. Georges AJ, Gonzalez JP, Abdul-Wahid S, Saluzzo JF, Meunier DMY, et al. (1985) Antibodies to Lassa and Lassa-like viruses in man and mammals in the Central African Republic. T Roy Soc Trop Med H 79: 78–79.
- 25. Gonzalez JP, Josse R, Johnson ED, Merlin M, Georges AJ, et al. (1989) Antibody prevalence against haemorrhagic fever viruses in randomized representative Central African populations. Res Virol 140: 319–31.
- 26. Nakounne E, Selekon B, Morvan J (2000) [Microbiological surveillance: viral hemorrhagic fever in Central African Republic: current serological data in man]. B Soc Pathol Exot 93: 340–7.
- 27. Talani P, Konongo J, Gromyko A, Nanga-Maniane J, Yala F, et al. (1999) Prévalence des anticorps anti-fièvres hémorragiques d'origine virale dans la région du Pool (Congo-Brazzaville). Médecine d'Afrique Noire 46: 424–427.
- 28. L'Hôte Y, Mahé G (1995) West and Central Africa mean annual rainfall (1951–1989). Paris: ORSTOM Editions.
- 29. Justice C, Vermote E, Townshend J, Defries R, Roy D, et al. (1998) The Moderate Resolution Imaging Spectroradiometer (MODIS): Land remote sensing for global change research. IEEE T Geosci Remote 36: 1228–1249.
- 30. Scharlemann JP, Benz D, Hay SI, Purse BV, Tatem AJ, et al. (2008) Global data for ecology and epidemiology: a novel algorithm for temporal Fourier processing MODIS data. PLoS ONE 3: e1408. doi:10.1371/journal.pone.0001408.
- 31. Wan Z, Li Z-l (1997) A physics-based algorithm for retrieving land-surface emissivity and temperature from EOS/MODIS data. IEEE T Geosci Remote 35: 980–996.
- 32. Schaaf C, Gao F, Strahler A, Lucht W, Li X, et al. (2002) First operational BRDF, albedo nadir reflectance products from MODIS. Remote Sens Environ 83: 135–148.
- 33. Joyce RJ, Janowiak JE, Arkin PA, Xie P (2004) CMORPH: a method that produces global precipitation estimates from passive microwave and infrared data at high spatial and temporal resolution. J Hydrometeorol 5: 487–503.
- 34. Rogers DJ, Williams BG (1994) Tsetse distribution in Africa: seeing the wood and the trees. In: Edwards PJ, May RM, Webb NR, editors. Large Scale Ecology and Conservation Biology. British Ecological Society Symposium. pp. 247–271.
- 35. Rogers DJ (2000) Satellites, space, time and the African trypanosomiases. Adv Parasit 47: 129–171.
- 36. Anonymous (2006) GTOPO30. Global 30 arc second elevation data. http://edc.usgs.gov/products/elevation/gtopo30/gtopo30.html. Sioux Falls, SD: U.S. Geological Survey, Center for Earth Resources Observation and Science (EROS).
- 37. Elith J, Graham CH, Anderson RP, Dudi'k M, Ferrier S, et al. (2006) Novel methods improve prediction of species' distributions from occurrence data. Ecography 29: 129–151.
- 38. Rogers D (2006) Models for vectors and vector-borne diseases. Adv Parasit 62: 1–35.
- 39. McPherson JM, Jetz W, Rogers DJ (2004) The effect of species' range sizes on the accuracy of distribution models: ecological phenomenon or statistical artefact? J Appl Ecol 41: 811–823.
- 40. Burnham KP, Anderson DR (2002) Model Selection and Multimodel Inference: a Practical Information-Theoretic Approach. New York: Springer.
- 41. Landis JR, Koch GC (1977) The measure of observer agreement for categorical data. Biometrics 33: 159–174.
- 42. Sauvage F (2004) La synergie entre dynamique des populations réservoirs de campagnols roussâtres et excrétion du hantavirus Puumala: mise en évidence du mécanisme d'émergence de la néphropathie épidémique humaine. Lyon: Lyon 1.
- 43. Stephenson EH, Larson EW, Dominik JW (1984) Effect on environmental factors on aerosol-induced Lassa virus infection. J Med Virol 14: 295–303.
- 44. Lowen AC, Mubareka S, Steel J, Palese P (2007) Influenza virus transmission is dependent on relative humidity and temperature. PLoS Pathog 3: e151. doi:10.1371/journal.ppat.0030151.
- 45. Fichet-Calvet E, Lecompte E, Koivogui L, Daffis S, ter Meulen J (2008) Reproductive characteristics of Mastomys natalensis and Lassa virus prevalence in Guinea, West Africa. Vector Borne Zoonotic Dis 8: 41–8.
- 46. McCormick JB, King IJ, Webb PA, Johnson KM, O'Sullivan R, et al. (1987) A case-control study of the clinical diagnosis and course of Lassa fever. J Infect Dis 155: 445–455.
- 47. Wulff H, McIntosh BM, Hamner DB, Johnson KM (1977) Isolation of an arenavirus closely related to Lassa virus from Mastomys natalensis in south-east Africa. B World Health Organ 55: 441–444.
- 48. Johnson KM, Taylor P, Elliott LH, Tomori O (1981) Recovery of a Lassa-related Arenavirus in Zimbabwe. Am J Trop Med Hyg 30: 1291–1293.
- 49. Vieth S, Drosten C, Lenz O, Vincent M, Omilabu S, et al. (2007) RT-PCR assay for detection of Lassa virus and related Old World arenaviruses targeting the L gene. T Roy Soc Trop Med H 101: 1253–64.
- 50. Clegg JCS (2002) Molecular phylogeny of the Arenaviruses. Curr Top Microbiol Immunol 262: 1–24.
- 51. Musser GG, Carleton MD (2005) Order Rodentia, suborder Myomorpha, superfamily Muroidea. In: Wilson DE, Reeder AM, editors. Mammal species of the world. Baltimore: The Johns Hopkins University Press. pp. 894–1531.
- 52. Granjon L, Houssin C, Lecompte E, Angaya M, Cesar J, et al. (2004) Community ecology of the terrestrial small mammals of Zakouma National Park, Chad. Acta Theriol 49: 215–234.
- 53. Gonzalez JP, McCormick JB, Saluzzo JF, Herve JP, Georges AJ, et al. (1983) An arenavirus isolated from wild-caught rodents (Praomys species) in the Central African Republic. Intervirology 19: 105–112.
- 54. Frame JD (1975) Surveillance of Lassa fever in missionaries stationed in West Africa. B World Health Organ 52: 593–598.
- 55. Lukashevich LS, Clegg JC, Sidibe K (1993) Lassa virus activity in Guinea: distribution of human antiviral antibody defined using enzyme-linked immunosorbent assay with recombinant antigen. J Med Virol 40: 210–217.
- 56. Demby AH, Inapogui A, Kargbo K, Koninga J, Kourouma K, et al. (2001) Lassa fever in Guinea: II. Distribution and prevalence of Lassa virus infection in small mammals. Vector Borne Zoonotic Dis 1: 283–296.
- 57. ter Meulen J, Lukashevich I, Sidibe K, Inapogui A, Marx M, et al. (1996) Hunting of peridomestic rodents and consumption of their meat as possible risk factors for rodent-to-human transmission of Lassa virus in the Republic of Guinea. Am J Trop Med Hyg 55: 661–666.
- 58. Frame JD, Yalley-Ogunro JE, Hanson AP (1984) Endemic Lassa fever in Liberia. V. Distribution of Lassa virus activity in Liberia: hospital staff surveys. T Roy Soc Trop Med H 78: 761–3.
- 59. Bloch A (1978) A serological survey of Lassa fever in Liberia. B World Health Organ 56: 811–3.
- 60. Frame JD, Casals J, Dennis EA (1979) Lassa virus antibodies in hospital personnel in western Liberia. T Roy Soc Trop Med H 73: 219–24.
- 61. Yalley-Ogunro JE, Frame JD, Hanson AP (1984) Endemic Lassa fever in Liberia. VI. Village serological surveys for evidence of Lassa virus activity in Lofa County, Liberia. T Roy Soc Trop Med H 78: 764–70.
- 62. Monath TP, Mertens PE, Patton R, Moser CR, Baum JJ, et al. (1973) A hospital epidemic of Lassa fever in Zorzor, Liberia, March-April 1972. Am J Trop Med Hyg 22: 773–9.
- 63. Mertens PE, Patton R, Baum JJ, Monath TP (1973) Clinical presentation of Lassa fever cases during the hospital epidemic at Zorzor, Liberia, March-April 1972. Am J Trop Med Hyg 22: 780–784.
- 64. ProMED-mail (2006) Lassa fever - Liberia (02). Number 20061001.2812.
- 65. ProMED-mail (2007) Lassa fever - Liberia (03). Number 20070430.1406.
- 66. Tomori O, Fabiyi A, Sorungbe A, Smith A, McCormick JB (1988) Viral hemorrhagic fever antibodies in Nigerian populations. Am J Trop Med Hyg 38: 407–410.
- 67. Henderson BE, Gary GW, Kissling RE, Frame JD, Carey DE (1972) Lassa fever virological and serological studies. T Roy Soc Trop Med H 66: 409–416.
- 68. ProMED-mail (2007) Lassa fever - South Africa ex Nigeria. Number 20070222.0657.
- 69. Merlin (2002) “Licking” Lassa fever (a strategic review). London: Merlin.
- 70. Bottineau MC, Kampudu F (2003) Lassa Fever, summary of the situation in Sierra Leone focussing on refugee camps and way stations (1st of January to 30th of April 2003): UNHCR.
- 71. McCormick JB, Webb PA, Krebs JW, Johnson KM, Smith ES (1987) A prospective study of the epidemiology and ecology of Lassa fever. J Infect Dis 155: 437–444.
- 72. ter Meulen J, Lenz O, Koivogui L, Magassouba N, Kaushik SK, et al. (2001) Lassa fever in Sierra Leone: UN peacekeepers are at risk. Trop Med Int Health 6: 83–84.
- 73. Allan R, Mardell S, Ladbury R, Pearce E, Skinner K (1999) The progression from endemic to epidemic Lassa fever in war-torn West Africa. In: Saluzzo JF, Dodet B, editors. Emergence and control of rodent-borne viral diseases. Annecy: Elsevier. pp. 197–205.
- 74. Grein T, Mardel S, Guardo-Martinez M, O'Bai Kamara KB, Wurie AR (2004) Lassa fever outbreak in Kenema, Sierra Leone January–April 2004. Geneva: WHO.
- 75. Fraser DW, Campbell CC, Monath TP, Goff PA, Gregg MB (1974) Lassa fever in the eastern province of Sierra Leone, 1970–1972. Am J Trop Med Hyg 23: 1131–1139.
- 76. Monath TP, Maher M, Casals J, Kissling RE, Cacciapuoti A (1974) Lassa fever in the eastern province of Sierra Leone, 1970–1972. II Clinical observations and virological studies on selected Hospital cases. Am J Trop Med Hyg 23: 1140–1149.
- 77. Keane E, Gilles HM (1977) Lassa fever in Panguma hospital, Sierra Leone, 1973–6. BMJ 1: 1399–1402.