Leishmania species of the subgenus Leishmania and especially L. donovani are responsible for a large proportion of visceral leishmaniasis cases. The debate on the mode of reproduction and population structure of Leishmania parasites remains open. It has been suggested that Leishmania parasites could alternate between different modes of reproduction, in particular clonality and frequent recombination, either between related individuals (endogamy) or between unrelated individuals (outcrossing), within strongly isolated subpopulations. To determine whether this assumption can be generalized to other species, a population genetics analysis of Leishmania donovani complex strains collected within a single village was conducted. The results suggest the existence of a mixed-mating reproduction system, substantial heterogeneity of subsamples and the coexistence of several genetic entities in Sudanese L. donovani. Indeed, results showed significant genetic differentiation between the three taxa (L. donovani, L. infantum and L. archibaldi) and between the human and canine strains of these taxa, suggesting that there may be different overlapping transmission cycles involving either dogs or humans. The results also agree with an almost strict specificity of L. donovani sensu stricto to human hosts. This empirical study demonstrates the complexity of population structure in the genus Leishmania and the need to pursue this kind of analysis at the smallest possible spatio-temporal and ecological scales.
Leishmaniases are a serious public health problem, especially in developing countries, caused by Leishmania parasites and transmitted by sandfly bites. More information is needed on the population biology of these pathogens for diagnostic and epidemiological inquiries and for drug and vaccine development. For population genetics studies, exploring the genetic patterns of such organisms at microgeographic scales is fundamental. In this context, we conducted a population genetics study, based on 20 microsatellite loci, of 61 strains of the Leishmania donovani complex collected in a Sudanese village, Barbar El Fugara, during the epidemic of 1996–2000. Results showed that considering the whole sample as a single population was not adequate because of the coexistence of several genetic entities and a genetic differentiation between human and canine strains. In addition, our findings suggested that clonality may have a strong impact on the L. donovani complex, unlike other Leishmania species. This study demonstrates the need to pursue population genetics studies of Leishmania species with sampling designs that control as many confounding factors as possible, and to carry out such analyses at the smallest possible spatio-temporal and ecological scales.
Citation: Rougeron V, De Meeûs T, Hide M, Le Falher G, Bucheton B, et al. (2011) Multifaceted Population Structure and Reproductive Strategy in Leishmania donovani Complex in One Sudanese Village. PLoS Negl Trop Dis 5(12): e1448. doi:10.1371/journal.pntd.0001448
Editor: Hechmi Louzir, Institut Pasteur de Tunis, Tunisia
Received: March 3, 2011; Accepted: November 6, 2011; Published: December 20, 2011
Copyright: © 2011 Rougeron et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The authors thank the French National Reference Center of Leishmania for providing the samples. They are grateful to the Institut de Recherche pour le Développement and the Centre National de la Recherche Scientifique for their support. This study was also supported by a French National Project, ANR 06-SEST-20 IAEL. The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Leishmaniases are worldwide vector-borne diseases of humans and domestic animals, caused by protozoan parasites of the genus Leishmania. These parasitic infections are a serious public health problem, with about 350 million persons at risk and 2,357,000 new cases per year. The genus Leishmania totals approximately 20 described species causing human infections, with a wide variety of clinical forms: cutaneous, visceral, mucocutaneous, mucosal and post-kala-azar dermal (PKDL) leishmaniases. Visceral leishmaniasis is the most severe form of the disease and can be lethal if left untreated. It is the most widespread form of leishmaniasis, especially in India, Bangladesh, Nepal, Sudan, Ethiopia and Brazil. In this study, we focused on human and canine samples collected in Sudan, where visceral leishmaniasis is endemic in the eastern and southern parts of the country and has claimed the lives of thousands of people.
Visceral leishmaniasis is mainly caused by species from the Leishmania donovani complex. Multilocus enzyme electrophoresis (MLEE) studies led to the description of three different species in this complex: L. donovani in the Old World, L. infantum in the Old World and the New World (also named L. chagasi there), and L. archibaldi in Sudan and Ethiopia. In Sudan, the taxonomic status of these three species has been challenged using several different molecular markers, such as random amplified polymorphic DNA (RAPD), restriction fragment length polymorphism (RFLP) and microsatellites. On the basis of both sequencing and microsatellite analysis, Jamjoom et al. proposed that Leishmania donovani sensu lato was the only cause of visceral leishmaniasis in East Africa, including Sudan (the three species falling in one clade). Lukes et al., using a multifactorial genetic analysis that included DNA sequences of protein-coding genes as well as noncoding segments, microsatellites, RFLPs and RAPDs, suggested that Leishmania infantum and L. donovani were the only recognized species of the L. donovani complex. It was even recently suggested that the only valid name is L. donovani.
With the development of elaborate experimental techniques and sophisticated statistical tools, our understanding of the evolutionary processes that govern the propagation of these parasites is continuously improving. Since 1990, Leishmania parasites have been recognized as having a basically clonal mode of reproduction associated with rare recombination events. However, recent population genetic analyses of Leishmania species in different environments showed high levels of homozygosity and few repeated multilocus genotypes (MLGs), an observation incompatible with a strict or predominant clonal mode of reproduction. More specifically, our team has proposed that Leishmania parasites could alternate between different modes of reproduction: clonality in both vertebrate host and insect vector, and recombination (between related or unrelated individuals, or even between species) within the vector. The need to work within different species and at finer scales was also suggested, as the study published by Rougeron et al. showed heterogeneity at the scale studied (country). Working at finer scales indeed allows much more precise inferences to be made and may reveal a predominantly sexual signature in the genetic data. The objective of the present study was to explore such issues in another taxon, Leishmania donovani sensu lato, within a sample collected in a single Sudanese village. We therefore analyzed the population structure of 61 L. donovani s.l. strains, collected in Barbar El Fugara, a village of the Atbara River region on the Sudan-Ethiopian border, at 20 polymorphic microsatellite loci. The results of this work suggest that the L. donovani complex is a heterogeneous taxon, that dogs are not infected by the same entities as human hosts and that the different units that compose this complex are probably strongly subdivided, with a significant impact of sexual recombination between related individuals.
We discuss sampling strategy issues regarding further studies and insist on the need to narrow as much as possible the spatio-temporal and ecological sampling scales.
Materials and Methods
Study site, parasites, cultures and DNA extraction
A census of the village population was conducted by Bucheton et al., making personal and clinical data available. From 1997 to 2000, 61 isolates of the Leishmania donovani complex were collected and then cultured. We obtained the samples for this study from the French National Reference Center of Leishmania, under the agreement of Dr. Alain Dessein.
The 61 strains from Sudan were isolated from dogs (ten strains) and humans (51 strains) and characterized using the MLEE technique by Dereure et al. Thirty-three strains were identified as L. donovani, 17 strains as L. infantum and 11 strains as L. archibaldi (see supplementary data Table S1). Promastigotes were cultured at 26°C by weekly subpassages in RPMI 1640 medium, buffered with 25 mM HEPES, 2 mM NaHCO3 and supplemented with 20% heat-inactivated fetal calf serum, 2 mM glutamine, 100 U/mL penicillin and 100 µg/mL streptomycin. Cultures were harvested by centrifugation and stored at −80°C until DNA extraction. Genomic DNA was extracted using the DNeasy Blood and Tissues Kit (Qiagen, Courtaboeuf, France), following the manufacturer's recommendations.
The 20 microsatellite loci investigated (15 already published and five developed in the laboratory) are listed in Supplementary data Table S1. The 61 strains under study (and M9702, as L. chagasi outgroup) were amplified under the following conditions. Each 30-µL reaction mix was composed of 1 µL of each primer (10 µM), the forward primer being labelled, 100 ng template DNA, 0.9 µL dNTP mix (5 mM), 3 µL 10× buffer and 0.3 µL Taq Polymerase (Roche Diagnostics, 5 UI/µL). Amplifications were carried out in a thermal cycler using the following reaction conditions: 35 cycles of 94°C for 30 s, the annealing temperature of each locus (see Table 1) for 1 min and 72°C for 1 min, followed by a final extension step of 72°C for 10 min. The reaction products were visualized on a 1.5% agarose gel stained with EZ VISION™ DNA Dye (Amresco). Fluorescence-labelled PCR products were sized on an Applied Biosystems Prism 310, with a Genescan 500 LIZ internal size standard. All 61 isolates were genotyped at all 20 loci.
Table 1. Description of the 20 microsatellite loci used in this study for Leishmania donovani complex. doi:10.1371/journal.pntd.0001448.t001
Data were processed through Create V 1.1 to convert them into the formats required by the different programs. We mainly analysed data with Fstat Version 2.9.3.2 (Goudet 2002, updated from Goudet), which computes estimates and tests the significance of the following population genetics parameters. Genetic polymorphism was measured by the number of alleles per locus (Na) and by Nei's unbiased estimate of genetic diversity within subsamples, Hs. We estimated Wright's F statistics with Weir and Cockerham's method: FIS measures the relative inbreeding of individuals due to the local non-random union of gametes in each subpopulation, and FST measures the relative inbreeding in subpopulations attributable to the subdivision of the total population into subpopulations of limited size. FST thus also measures genetic differentiation between subpopulations. FIS ranges between −1 and 1: a negative value corresponds to an excess of heterozygotes, a positive value to a heterozygote deficiency; 0 is expected under panmixia. The significance of the departure from 0 was tested by 10,000 randomisations of alleles within subpopulations (to test random mating) and of individuals across subsamples (for differentiation). The statistic used for random mating (Hardy-Weinberg Equilibrium) testing was simply Weir and Cockerham's estimator f (the estimator of FIS). For the genetic differentiation test, we used the log likelihood ratio G-based test of Goudet et al. summed over all loci. Confidence intervals were estimated by bootstrapping over loci or jack-knifing over populations with Fstat, as described in De Meeûs et al.
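To make the meaning of FIS concrete, the following minimal Python sketch computes a single-locus, single-subsample FIS as 1 − Ho/Hs, with Hs being Nei's unbiased gene diversity. This is a simplified illustration and not the Weir and Cockerham estimator computed by Fstat; the function name `nei_fis` and the toy genotypes are assumptions introduced here.

```python
from collections import Counter


def nei_fis(genotypes):
    """Return (Ho, Hs, FIS) for one locus in one subsample.

    genotypes: list of (allele, allele) tuples, one per diploid strain.
    Simplified sketch: FIS = 1 - Ho/Hs with Nei's unbiased Hs,
    NOT Weir and Cockerham's estimator f.
    """
    n = len(genotypes)
    if n < 2:
        raise ValueError("need at least two genotyped individuals")
    # Observed heterozygosity: share of individuals with two different alleles.
    ho = sum(a != b for a, b in genotypes) / n
    # Allele frequencies over the 2n sampled gene copies.
    counts = Counter(allele for g in genotypes for allele in g)
    sum_p2 = sum((c / (2 * n)) ** 2 for c in counts.values())
    # Nei's unbiased within-sample gene diversity.
    hs = n / (n - 1) * (1 - sum_p2 - ho / (2 * n))
    if hs == 0:
        return ho, hs, float("nan")  # monomorphic locus: FIS is undefined
    return ho, hs, 1 - ho / hs


# Toy data (hypothetical): every strain heterozygous, then every strain homozygous.
all_het = [(1, 2)] * 4
all_hom = [(1, 1), (1, 1), (2, 2), (2, 2)]
print(nei_fis(all_het))
print(nei_fis(all_hom))
```

With these toy inputs the fully heterozygous subsample sits at the lower bound of FIS and the fully homozygous one at the upper bound, matching the −1 to 1 range described above.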
Genetic diversity, as measured by Nei's Hs, can lower the maximum possible value for FST. According to the classical formulation, FST = (QS−QT)/(1−QT), where QS is the probability of sampling the same allele twice within a subpopulation and QT is the probability of sampling the same allele twice in different subpopulations. If a population were totally subdivided, the probability of sampling the same allele twice in two different subpopulations would be null (QT = 0), and FST would thus equal QS. Because Hs is the probability of sampling two different alleles within a subpopulation, QS = 1−Hs. The maximum possible value for FST in a sample with a given Hs can thus be estimated as 1−Hs, and a corrected version of FST as FST′ = FST/(1−Hs).
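The correction can be illustrated numerically with a short sketch. The function names are hypothetical; the value Hs = 0.475 is the mean genetic diversity reported for this data set, whereas the FST value of 0.2 is an arbitrary illustrative input, not a result of the study.

```python
def fst_max(hs):
    """Upper bound of FST for a given within-subsample diversity Hs."""
    if not 0.0 <= hs < 1.0:
        raise ValueError("Hs must lie in [0, 1)")
    return 1.0 - hs


def fst_corrected(fst, hs):
    """Standardized FST' = FST / (1 - Hs)."""
    return fst / fst_max(hs)


hs = 0.475   # mean Hs reported in the Results
fst = 0.2    # arbitrary illustrative value (assumption)
print(fst_max(hs))             # largest FST attainable with this diversity
print(fst_corrected(fst, hs))  # the same FST expressed relative to that maximum
```

The point of the rescaling is that a raw FST of 0.2 looks modest on a 0 to 1 scale but represents a larger fraction of the differentiation actually attainable when markers are highly polymorphic.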
Data were heterogeneous regarding Leishmania species (as recognized by MLEE typing), year of sampling and host species. To assess the possible contribution of these factors to genetic partitioning (Wahlund effect), we compared the FIS obtained with four different sampling strategies. The first strategy considered each Leishmania species–year of sampling–host species combination as a different subsample (14 subsamples, “All separated” strategy). The second strategy ignored the Leishmania species distinction (six subsamples, “Species fused” strategy). The third strategy ignored the year of sampling (six subsamples, “Years fused” strategy) and the fourth ignored the host species (10 subsamples, “Hosts fused” strategy). To test for significant differences, we undertook planned paired Wilcoxon signed rank tests between “All separated” and each of the other three strategies, ordered as above, with sequential Bonferroni correction (multiplying the P-values by 3, 2 and 1, respectively). Unilateral Wilcoxon signed rank tests (“All separated” has a smaller FIS than the other three strategies) were undertaken under R. Differentiation between the relevant units, controlling for the other factors, was then assessed with paired subsample differentiation tests (FST estimation and G-based randomisation test). When two values were obtained for the same type of differentiation (e.g. differentiation between L. archibaldi and L. infantum in 1997 and 1998), these values were combined with an unweighted mean for FST (e.g. over years) and Stouffer's Z test (Whitlock, 2005) for P-values, as recommended.
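The two P-value manipulations described above can be sketched as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, the multipliers follow the planned order given in the text (3, 2, 1) rather than a ranking by P-value, and the input P-values are made up for the example.

```python
from math import sqrt
from statistics import NormalDist


def sequential_bonferroni_planned(p_values):
    """Multiply planned, ordered P-values by k, k-1, ..., 1 (capped at 1)."""
    k = len(p_values)
    return [min(1.0, p * (k - i)) for i, p in enumerate(p_values)]


def stouffer_unweighted(p_values):
    """Combine one-sided P-values with Stouffer's unweighted Z method.

    Each P-value must lie strictly between 0 and 1.
    """
    nd = NormalDist()
    z = sum(nd.inv_cdf(1.0 - p) for p in p_values) / sqrt(len(p_values))
    return 1.0 - nd.cdf(z)


# Hypothetical P-values for the three planned comparisons, in planned order.
print(sequential_bonferroni_planned([0.01, 0.02, 0.5]))
# Hypothetical P-values for the same differentiation test in two years.
print(stouffer_unweighted([0.05, 0.05]))
```

Combining two tests that each sit at 0.05 yields a combined P-value close to 0.01, which shows why pooling consistent evidence across years can be more powerful than reading each year separately.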
Linkage disequilibrium between pairs of loci (non-random association of alleles at different loci) was assessed with a randomisation test (genotypes at two loci are re-associated at random a large number of times) using Fstat Version 2.9.3.2 (Goudet 2002, updated from Goudet). The statistic used was the log likelihood ratio G summed over all subpopulations, known to be more powerful than other combinatory procedures. Because there are as many tests as locus pairs (here 20×19/2 = 190), we expected 0.05×190 = 9.5 significant tests under the null hypothesis of no linkage disequilibrium at significance level α = 0.05. We therefore used the unilateral (“greater”) exact binomial test under R to check whether significantly more than 5% of the 190 tests were significant.
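The logic of this binomial check can be sketched in a few lines of Python. The study itself used R; the function name below is hypothetical and the code is only a plain restatement of the one-sided exact binomial tail probability.

```python
from math import comb


def binom_test_greater(k, n, p):
    """One-sided exact binomial test: P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


n_loci = 20
n_pairs = n_loci * (n_loci - 1) // 2   # number of locus pairs tested
alpha = 0.05
print(n_pairs, alpha * n_pairs)        # tests performed, significant tests expected by chance

# Probability of seeing at least k significant pairs if there is no linkage disequilibrium.
for k in (5, 19, 22):
    print(k, binom_test_greater(k, n_pairs, alpha))
```

Counts well below the chance expectation of 9.5 give a P-value close to 1, whereas counts around twice that expectation fall clearly below the 5% threshold.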
The BAPS version 5.1 software identifies hidden structure within populations (admixture) through a Bayesian analysis. This software was used to detect possible Wahlund effects and has been successfully applied to other parasites. BAPS uses stochastic optimization to infer the posterior mode of the genetic structure. To obtain the best partition of the entire population, we ran the program 50 times in order to identify the best-supported number of clusters. The same approach was applied within L. donovani 1997, L. donovani 1998 and L. infantum from humans, for which enough individuals were available. Each of these three samples was submitted to a clustering exploration by BAPS with the maximum number of clusters set to 19, 13 and 12, respectively (the number of individuals in each sample). FIS was recalculated within each best partition identified by BAPS and noted FIS_C. Then, for each of the three samples, FIS_C was compared with the initial FIS using a unilateral Wilcoxon signed-rank test for paired data (with the software R), the pairing units being the 20 loci. If FIS_C is significantly lower than FIS, it is probable that the initial subsamples were composed of several genetically distinct entities (e.g. geographical microstructure or subpopulations).
Using the prevalence data from the study of Dereure et al., we compared, for each Leishmania species, the prevalence in humans and in dogs (50 human strains and 20 dog strains). Significance was tested using Fisher's exact test under the software R.
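As an illustration of this kind of comparison, the sketch below implements a two-sided Fisher exact test for a 2×2 table using only the Python standard library. The function name is hypothetical, and the counts in the example are invented for demonstration (only the totals of 50 human and 20 dog strains come from the text); they are not the values of Table 3.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more probable than the observed one.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Probability of x in the top-left cell given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # Small relative tolerance guards against floating-point ties.
    return sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))


# Invented counts: a species typed in 30 of 50 human strains and 2 of 20 dog strains.
humans_with, humans_without = 30, 20
dogs_with, dogs_without = 2, 18
print(fisher_exact_two_sided(humans_with, humans_without, dogs_with, dogs_without))
```

An exact test is a reasonable choice here because some cells are expected to be small, a situation in which chi-square approximations become unreliable.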
A Neighbor-Joining (NJ) tree was constructed from a Cavalli-Sforza and Edwards genetic distance matrix. The robustness of the tree topology was assessed by bootstrap resampling of loci, with 500 replications per set. We used PHYLIP software (version 3.5c; J. Felsenstein, Department of Genetics, University of Washington, Seattle, 1993) and the tree was edited using TreeDyn software.
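For readers unfamiliar with this distance, the sketch below computes a chord distance between two diploid strains from their per-locus allele frequencies, averaged over loci. It follows one common formulation of the Cavalli-Sforza and Edwards chord distance; the scaling used internally by PHYLIP may differ, and the function names and toy genotypes are assumptions made for illustration.

```python
from math import pi, sqrt


def genotype_freqs(genotype):
    """Allele frequencies of one diploid genotype at one locus."""
    a, b = genotype
    return {a: 1.0} if a == b else {a: 0.5, b: 0.5}


def chord_distance(strain_x, strain_y):
    """Chord distance averaged over loci.

    strain_x, strain_y: lists of (allele, allele) tuples, one per locus,
    with loci in the same order in both strains.
    """
    total = 0.0
    for gx, gy in zip(strain_x, strain_y):
        fx, fy = genotype_freqs(gx), genotype_freqs(gy)
        # Cosine of the angle between the two square-root frequency vectors.
        cos_theta = sum(sqrt(fx.get(a, 0.0) * fy.get(a, 0.0)) for a in set(fx) | set(fy))
        total += sqrt(max(0.0, 2.0 * (1.0 - cos_theta)))
    return 2.0 / (pi * len(strain_x)) * total


# Toy two-locus genotypes (hypothetical allele sizes).
s1 = [(120, 124), (200, 200)]
s2 = [(120, 124), (200, 200)]
s3 = [(130, 130), (210, 212)]
print(chord_distance(s1, s2))  # identical genotypes
print(chord_distance(s1, s3))  # no allele shared at either locus
```

Because the distance depends only on shared allele frequencies and not on allele sizes, it makes no assumption about the stepwise mutation process of microsatellites.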
Simulations were also run with Easypop 2.0.1 (Balloux 2006, updated from Balloux 2001) to find possible sets of parameters fitting our observations.
Approval for the study of human strains was obtained from both the federal and state Ministries of Health and from the Faculty of Medicine of Khartoum. The project was also approved, for each field visit, by the village committee, which included elected delegates from all ethnic groups as well as elected citizens. Since a large proportion of the population in Barbar El Fugara was illiterate, oral informed consent was obtained after the aim of the study had been explained to participants in their own language by a translator. For child participants, oral consent was obtained from their parents. Verbal consent was also obtained in the presence of the ethnic group leader, who provided further explanations if required. Once verbal informed consent had been obtained from the patient, the clinician recorded it on a written form.
We obtained clear electrophoregrams for all genotypes at all 20 loci investigated, with only one or two alleles per strain at each locus, which excludes events of aneuploidy (for which we would have also expected individuals with no alleles, three or four alleles). The genotypes obtained are presented in supplemental Table S1. The data showed a low level of genetic diversity, with an average number of alleles per locus of 4.25±1.74, ranging from 2 (LIST7026, LIST7028 and LIST7030) to 8 (LIST7025) and a mean genetic diversity HS = 0.475±0.148 (Table 1).
Phylogenetic analysis and genetic differentiation
The dendrogram based on 20 polymorphic microsatellite loci (Figure 1) showed two main clusters. Cluster A (bootstrap support of 36%) groups strains of L. infantum and L. donovani. Cluster B (bootstrap support of 32%) corresponds to the L. archibaldi taxon and three L. infantum strains from dogs. It should be noted that other microsatellite-based studies have also observed low bootstrap values for large clusters and high bootstrap values only for small clusters, for L. braziliensis and L. infantum.
Figure 1. Genetic distance relationships among 61 strains of the Leishmania donovani complex in Sudan.
Neighbor-Joining tree depicting genetic distance relationships based on Cavalli-Sforza's chord distances among 61 strains of the Leishmania donovani complex based on 20 polymorphic microsatellite loci. Leishmania chagasi M9702 reference strain was used as outgroup. Values on the nodes represent the percentage of bootstrap replicates over loci (n = 500). Samples from the species L. archibaldi, L. donovani and L. infantum are noted “L. a.”, “L. d.”, and “L. i.” respectively. Hosts are noted “H. s.” for Homo sapiens and “C. f.” for Canis familiaris. The majority of L. archibaldi strains are grouped in cluster B. Cluster A contains almost all the L. donovani strains. L. infantum strains are distributed in the two clusters. The single L. donovani strain (LEM3785) genotyped from a dog is noted “***”. doi:10.1371/journal.pntd.0001448.g001
FIS comparisons between the “All separated” strategy and the three others gave significant differences, as illustrated in Figure 2, meaning that each factor (Leishmania species, year of sampling and host species, in order of importance) leaves a significant signature on the apportioning of genetic information. Consequently, each Leishmania species of each year and each host species must be considered as a separate subsample. It should be noted that these significant results cannot be an artefact of insufficient sample sizes. Indeed, the differences detected are statistically valid, and ignoring them might lead one to overlook important ecological processes currently involved in the population biology of these Leishmania “lineages”. Moreover, these differentiations were confirmed by paired subsample differentiation tests, as indicated in Table 2. All Leishmania species are genetically different. Species differentiation seems very pronounced between L. donovani and L. archibaldi (FST′~0.767) and smaller for the two other pairs (FST′~0.2–0.3) (Table 2). Temporal differentiation seems to affect only L. donovani in humans. Considering the host origin, a weak and marginally non-significant differentiation is found between human and dog strains for L. archibaldi, while a strong differentiation seems to separate L. infantum strains from the two host species (Table 2).
Figure 2. FIS estimations between four sampling strategies.
Comparison of FIS estimated between the “All separated” strategy and the other three sampling strategies as defined in the text. Wilcoxon signed rank test P-values between pairs of strategies, as indicated by double arrows, are provided with the corresponding sequential Bonferroni corrected value in brackets. The 95% confidence intervals were obtained by bootstrapping over loci. doi:10.1371/journal.pntd.0001448.g002
Table 2. Differentiation measures (FST) and testing (P-value) between different Leishmania donovani s.l. strains. doi:10.1371/journal.pntd.0001448.t002
Clinical forms (visceral versus PKDL in humans, see Supplementary Table S1) could only be compared for L. donovani in 1997 and 1998, and no differentiation was detected (FST~0, P-value>0.4 in both cases). Consequently, clinical forms were not considered further in our analyses.
The prevalence data from Dereure et al. were compared for each Leishmania species between humans and dogs (50 human strains and 20 dog strains). The results, presented in Table 3, show that L. donovani is clearly found in humans rather than in dogs (P-value = 0.001), that L. infantum tends to infect dogs more often (P-value = 0.04), and that the difference is not significant for L. archibaldi (P-value = 0.2). After Bonferroni adjustment, only the L. donovani test remains significant (P-value = 0.003).
Table 3. Comparison between prevalence on humans and dogs for the different species of Leishmania. doi:10.1371/journal.pntd.0001448.t003
Linkage disequilibrium study
This analysis was undertaken over all the data but considering each Leishmania species, year of sampling and host species combination as a distinct subsample. This provided 19 locus pairs out of 190 tests in significant linkage. This is far above the 5% expected under the null hypothesis (P-value = 0.0001). These significant tests involved 18 of the 20 loci. Within each Leishmania species, small subsample sizes limited the power of the test. For L. archibaldi (very small subsamples of four and seven individuals in dogs and human hosts respectively) only five tests out of 190 were significant (P-value = 1). In L. donovani 22 tests were significant (P-value = 0.0003) and in L. infantum 19 tests were significant (P-value = 0.0034). There is thus a global linkage at a genome-wide scale in the three Leishmania species populations.
Genetic diversity and heterozygote deficiency within Leishmania species
For each Leishmania species, a global and highly significant heterozygote deficit, highly variable across loci, was observed (Figure 3). These heterozygote deficits decrease significantly (P-values<0.005) in the best partitions found by BAPS for the two species for which such analyses could be done (L. donovani and L. infantum) (Table 4 and Figure 4). Simulations undertaken with EasyPop reproduced the observed pattern only for parameter sets combining very high clonal rates (minimum c = 0.99) and strong Wahlund effects (pooling one representative of each strongly isolated subpopulation into one subsample). Nevertheless, in each of these simulations, fairly numerous repeated multilocus genotypes (MLGs) appeared, in contrast to the real data, where only two repeated MLGs were observed over the whole data set (on two occasions, two samples shared the same multilocus genotype). Consequently, some additional process must be involved. Finally, using the NJ tree of Figure 1, keeping only L. donovani strains belonging to the most homogeneous clusters (no leaf longer than 0.1, see Figure 1) and subdividing them into subclusters from the same year indeed produced a lower FIS (~0.27), but still with a very strong variance across loci (ranging from −0.1 to 0.7), no significant linkage disequilibrium and a reasonable proportion of MLGs (one repeated twice and a second repeated three times), albeit with very small subsample sizes. It should be noted that the same global topology as the NJ tree based on Cavalli-Sforza distances was obtained using shared allele distances, and also with the Minimum Evolution tree using either Cavalli-Sforza or shared allele distances.
Figure 3. FIS variation across loci and mean value for the three Leishmania species.
The confidence intervals are the values obtained for dogs and humans for L. archibaldi and L. infantum and are minimum and maximum values obtained in 1997, 1998 or 1999 for L. donovani, except for FIS over all loci (All*) where confidence intervals (CI) are the 95% CI obtained after bootstrap over the loci. doi:10.1371/journal.pntd.0001448.g003
Figure 4. FIS for L. donovani and L. infantum strains in the entire population and within subdivisions.
These subdivisions have been identified by the software BAPS. The 95% confidence intervals were obtained by bootstrapping over loci. The decrease of FIS in the subdivisions suggests a Wahlund effect. doi:10.1371/journal.pntd.0001448.g004
Despite the latest studies in this area, the debate on population structure and Leishmania reproductive mode is far from being settled and therefore deserves further investigation. Recent publications on different Leishmania species and in different environments seriously challenge the view that species of the genus should display a predominantly clonal genetic signature, because of the high homozygosity levels and the rarity of MLGs observed. As suggested for L. braziliensis, these parasites could alternate between different modes of reproduction: clonality in both vertebrate host and insect vector, and sexual recombination (similar to other kinetoplastid parasites, such as Trypanosoma brucei s.l., or other Trypanosomatidae such as Crithidia bombi) between genetically related cells (endogamy), resulting in high levels of inbreeding. Most of these studies also revealed strong heterogeneities within Leishmania subsamples that probably result from Wahlund effects (mixture of differentiated true populations), because strains were collected at too large spatial and/or temporal scales. To prevent such possible biases, we selected a sample of L. donovani collected at the scale of a village, reducing the risk of hidden substructuring.
In this Sudanese village, the validity of the distinction between L. donovani sensu stricto, L. archibaldi and L. infantum, whether these are true species, subspecies or any other taxonomic level, is supported by our results, in contradiction with recent papers. As shown here, ignoring such delimitations dangerously biases the interpretation of genetic data. Nevertheless, the taxonomic distinction based on isoenzymes does not seem very clear, as can be seen from Figure 1, and it would be worth trying other kinds of markers, such as multilocus sequence typing or multilocus sequence analysis, to clarify this issue.
Another significant subdivision arose between dogs and human hosts, particularly regarding L. infantum and to a much lesser extent L. archibaldi. Gene flow appears much reduced between dogs and human hosts for L. infantum, and two different kinds of cycles must be present here, probably involving different vector species and reservoirs. For L. archibaldi the difference is much less obvious, but this may result from modest sample sizes. In L. donovani, the greater specificity of strains to humans (Table 3) and the resulting small number of strains found in dogs did not allow for such testing. Nevertheless, the single L. donovani strain (LEM3785) genotyped from a dog was not distinguishable from its human counterparts (Figure 1). This apparent strong specificity for human hosts would mean, at least for L. donovani, that dogs are not a significant reservoir for these pathogens, in agreement with the anthroponotic nature of this species.
Time also appeared as a significant subdividing factor, but only for L. donovani. The simplest interpretation is that, as patients are treated once diagnosed, a drop in subpopulation size may occur in the following year, leading to genetic differentiation as a result of either a bottleneck or the filling of vacated places by other strains. Since genetic diversities were not significantly different between 1998 and 1999 (Wilcoxon signed rank test, P-value = 0.27), the second hypothesis appears more likely.
Failing to consider all the above factors as relevant resulted in a very odd FIS distribution as illustrated by Supplementary Figure S1.
Our data, and especially the NJTree approach, also suggest that hybridization between the different taxa is not impossible, though rare enough to prevent homogenization, but frequent enough to enhance heterogeneity within each cluster that could be defined.
An interesting point is the absence of genetic differentiation between L. donovani clinical forms (visceral leishmaniasis and PKDL; FST~0, P-value>0.4 in 1997 and 1998). This result could suggest that the development of PKDL in treated patients is more likely linked to host factors than to parasite factors. This potential association between PKDL and the host has already been suggested by J.M. Blackwell's team, whose results pointed to a genetic association between polymorphism at IFNGR1 and susceptibility to PKDL (and not to visceral leishmaniasis) after treatment.
Regarding the reproductive strategy and population structure of these parasites, further studies should focus on the effect of individual hosts to detail the respective contributions of population differentiation and of clonal, endogamic and outcrossing modes of reproduction to the genotypic distribution of these parasites. Nonetheless, clonality does not totally explain the strong variance of FIS across loci, which displayed a much wider range than that observed for the much more homozygous L. braziliensis. Our simulations suggested that it was impossible to reproduce the observed FIS and its variance across loci together with as few MLGs as in the real data. The existence of a strongly structured hierarchical meta-population, with for instance the individual hosts playing the role of micro-populations for the parasites, in combination with occasional gene flow between genetically distant entities (species hybridizations) and/or different cycle types (zoonotic vs. anthroponotic), could explain the pattern observed at our microsatellite loci. However, as previously said, this requires further investigation. We cannot exclude a possible confounding role of gene conversion, which is known to occur in Leishmania, although we do not favour this hypothesis. If gene conversion were a genome-wide process in Leishmania (genomic conversion), we would have expected a much more homogeneous homozygosity across loci than what was observed. Some loci are indeed almost always homozygous, while others display substantial amounts of heterozygosity (Figure 3). If gene conversion were site specific, we would expect it to preferentially affect coding sequences and their surroundings more than non-coding zones. A glance at the chromosomal location of the markers did not suggest that microsatellite markers situated closer to coding sequences were more prone to display positive FIS than the other microsatellites.
Moreover, even if the correlation between species is good, it is not perfect, and some loci with FIS~1 in one species can display a noticeably lower FIS in another. This does not strongly support the site-specific DNA conversion hypothesis. Here again, further studies on this issue would be worthwhile.
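To make the per-locus comparison above more concrete, the following is a minimal sketch of how FIS can be computed for a single locus as 1 − Ho/He, where Ho is the observed and He the expected heterozygosity. This is a simplified estimator chosen for illustration, not the estimator implemented in the software used in this study; the function name `fis_per_locus` and the toy genotypes are hypothetical.

```python
from collections import Counter


def fis_per_locus(genotypes):
    """Simplified FIS for one locus: 1 - Ho / He.

    genotypes: list of (allele_a, allele_b) tuples, one per diploid strain.
    Returns None for a monomorphic locus, where He = 0 and FIS is undefined.
    NOTE: illustrative estimator only (no small-sample correction).
    """
    n = len(genotypes)
    if n == 0:
        raise ValueError("at least one genotype is required")

    # Observed heterozygosity: proportion of strains carrying two different alleles.
    h_obs = sum(1 for a, b in genotypes if a != b) / n

    # Expected heterozygosity under panmixia: 1 - sum of squared allele frequencies.
    counts = Counter(allele for genotype in genotypes for allele in genotype)
    total = 2 * n
    h_exp = 1.0 - sum((c / total) ** 2 for c in counts.values())

    if h_exp == 0.0:
        return None
    return 1.0 - h_obs / h_exp


# Hypothetical toy loci: one fully homozygous, one at panmictic proportions.
homozygous_locus = [(1, 1), (1, 1), (2, 2), (2, 2)]
panmictic_locus = [(1, 1), (1, 2), (1, 2), (2, 2)]
fis_values = [fis_per_locus(homozygous_locus), fis_per_locus(panmictic_locus)]
```

A locus that is always homozygous despite carrying two alleles gives FIS = 1, whereas a locus at panmictic proportions gives FIS = 0, which is the kind of between-locus contrast discussed above.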
Null alleles are often encountered in population genetics studies. They may be frequent in allozymes and in DNA markers such as microsatellites. In our data, no blank was ever observed in the genotypes (no missing data, i.e. all individuals were amplified at all loci). Given the high homozygosity encountered, which would increase the probability of blank (null) homozygotes, this makes the null allele explanation very unlikely.
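The reasoning above can be illustrated with a short sketch. Under the standard inbreeding model, a null allele at frequency r in a population with inbreeding coefficient F yields null homozygotes (blanks) at frequency r² + F·r·(1 − r). The values of r and F below are hypothetical and chosen only for illustration; the sample size of 61 is the number of strains analysed here, and the function names are assumptions.

```python
def blank_frequency(r, f):
    """Expected frequency of null homozygotes (blank genotypes).

    r: frequency of the null allele (hypothetical value).
    f: inbreeding coefficient (FIS-like homozygote excess).
    """
    return r * r + f * r * (1.0 - r)


def prob_no_blank(r, f, n):
    """Probability that none of n independently sampled strains is blank."""
    return (1.0 - blank_frequency(r, f)) ** n


# Hypothetical scenario: a null allele at 20% with strong homozygote excess.
expected_blanks = blank_frequency(0.2, 0.8)
chance_of_missing_them = prob_no_blank(0.2, 0.8, 61)
```

With such hypothetical values, observing no blank at all among 61 strains at a given locus would be extremely improbable, which is why the complete absence of missing genotypes argues against null alleles as the main cause of the heterozygote deficits.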
Rarity of MLGs, variable but globally positive FIS and strong heterogeneity within subsamples seem to be the rule for L. donovani, as such a pattern was already reported in Eastern Africa. Such results suggest the existence of strongly differentiated hidden entities. A different pattern was found in L. donovani from the Indian subcontinent, where all loci appeared weakly polymorphic and dominated by a single MLG with a few variants at one locus, and where spatially and temporally homogeneous subsamples showed no deviation from panmixia. It is as if this subcontinent had been colonised by only one of the entities we are dealing with in Africa.
The village of Babar El Fugara is characterized by an epidemic context, with the occurrence of several epidemic episodes. The genetic diversity revealed by our results is probably not due to the arrival of a new variant but more likely was already present. Indeed, during this epidemic, the whole population was exposed to the disease and only one quarter developed visceral leishmaniasis. This observation means that the majority of the population is probably asymptomatic and constitutes a reservoir for transmission. In this context, further research is needed to identify which reservoir (vectors or mammal reservoirs) could be involved in the maintenance of the diversity and of the transmission cycles.
To conclude on this population genetics study within the L. donovani complex, it clearly appears that considering the whole sample as a single population was not adequate. In addition, our findings suggested that clonality may have a stronger impact on the L. donovani complex than on L. braziliensis. They also suggested that the possibly strong impact of the individual host (sandfly or mammal) is worth exploring and represents a too often neglected factor in Leishmania population studies in particular and in studies of pathogenic microbes in general. These results demonstrate the need to pursue population genetics studies in Leishmania species with sampling designs that control for as many confounding factors as possible. These parasites indeed seem to be subdivided at very narrow spatio-temporal and ecological (host) scales.
FIS for each locus in the entire sample of the L. donovani complex. There is a large heterozygote deficiency at each locus.
STROBE checklist. Checklist of items included in this population genetic study.
Description of the data set and microsatellite genotypes. Each sample is described by sample code, species attribution by MLEE, host, clinical form (VL for visceral leishmaniasis and PKDL for post-kala-azar dermal leishmaniasis), year of collection, and microsatellite genotype obtained at each locus.
The authors acknowledge F. Prugnolle, F. Renaud and M. Choisy for helpful discussions and their assistance in analyses and interpretation of the results. We acknowledge A. Dessein's team for the strains and patient data collection. We also thank the French National Reference Center of Leishmania for providing the samples. The authors acknowledge the assistance of Dr. Linda Northup for the correction in English of the manuscript and the IFR-119 platform from Montpellier, France.
Conceived and designed the experiments: VR TDM A-LB. Performed the experiments: VR. Analyzed the data: VR TDM A-LB. Contributed reagents/materials/analysis tools: VR TDM A-LB MH BB. Wrote the paper: VR TDM A-LB. Samples providers or collectors: BB AD SHE-S JD GLF.
- 1. WHO (2002) Leishmaniases.
- 2. Bañuls AL, Hide M, Prugnolle F (2007) Leishmania and the leishmaniases: a parasite genetic update and advances in taxonomy, epidemiology and pathogenicity in humans. Advances in Parasitology 64: 1–109.
- 3. WHO (2002) Annual report. Geneva.
- 4. Centers for Disease Control and Prevention (2010) Parasites. Leishmaniasis.
- 5. Seaman J, Mercer AJ, Sondorp E (1996) The epidemic of visceral leishmaniasis in western Upper Nile, southern Sudan: course and impact from 1984 to 1994. Int J Epidemiol 25: 862–871.
- 6. Lainson R, Ryan L, Shaw JJ (1987) Infective stages of Leishmania in the sandfly vector and some observations on the mechanism of transmission. Mem Inst Oswaldo Cruz 82: 421–424.
- 7. Pratlong F, Dereure J, Bucheton B, El-Saf S, Dessein A, et al. (2001) Sudan: the possible original focus of visceral leishmaniasis. Parasitology 122: 599–605.
- 8. Rioux JA, Lanotte G, Serres E, Pratlong F, Bastien P, et al. (1990) Taxonomy of Leishmania. Use of isoenzymes. Suggestions for a new classification. Ann Parasitol Hum Comp 65: 111–125.
- 9. El Tai NO, El Fari M, Mauricio I, Miles MA, Oskam L, et al. (2001) Leishmania donovani: intraspecific polymorphisms of Sudanese isolates revealed by PCR-based analyses and DNA sequencing. Exp Parasitol 97: 35–44.
- 10. Lewin S, Schonian G, El Tai N, Oskam L, Bastien P, et al. (2002) Strain typing in Leishmania donovani by using sequence-confirmed amplified region analysis. Int J Parasitol 32: 1267–1276.
- 11. Jamjoom MB, Ashford RW, Bates PA, Chance ML, Kemp SJ, et al. (2004) Leishmania donovani is the only cause of visceral leishmaniasis in East Africa; previous descriptions of L. infantum and “L. archibaldi” from this region are a consequence of convergent evolution in the isoenzyme data. Parasitology 129: 399–409.
- 12. Lukes J, Mauricio IL, Schonian G, Dujardin JC, Soteriadou K, et al. (2007) Evolutionary and geographical history of the Leishmania donovani complex with a revision of current taxonomy. Proc Natl Acad Sci U S A 104: 9375–9380.
- 13. Schonian G, Mauricio I, Cupolillo E (2010) Is it time to revise the nomenclature of Leishmania? Trends in Parasitology 26: 466–469.
- 14. Tibayrenc M (1992) Leishmania: sex, karyotypes and population genetics. Parasitol Today 8: 305–306; author reply 306.
- 15. Tibayrenc M (1993) Clonality in leishmania. Parasitol Today 9: 58.
- 16. Tibayrenc M, Ayala FJ (2002) The clonal theory of parasitic protozoa: 12 years on. Trends Parasitol 18: 405–410.
- 17. Amro A, Schonian G, Al-Sharabati MB, Azmi K, Nasereddin A, et al. (2009) Population genetics of Leishmania infantum in Israel and the Palestinian Authority through microsatellite analysis. Microbes Infect 11: 484–492.
- 18. Chargui N, Amro A, Haouas N, Schonian G, Babba H, et al. (2009) Population structure of Tunisian Leishmania infantum and evidence for the existence of hybrids and gene flow between genetically different populations. Int J Parasitol 39: 801–811.
- 19. Gelanew T, Kuhls K, Hurissa Z, Weldegebreal T, Hailu W, et al. (2010) Inference of population structure of Leishmania donovani strains isolated from different Ethiopian visceral leishmaniasis endemic areas. PLoS Negl Trop Dis 4: e889.
- 20. Rougeron V, De Meeus T, Hide M, Waleckx E, Dereure J, et al. (2010) A battery of 12 microsatellite markers for genetic analysis of the Leishmania (Viannia) guyanensis complex. Parasitology 137: 1879–1884.
- 21. Rougeron V, De Meeûs T, Hide M, Waleckx E, Bermudez H, et al. (2009) Extreme inbreeding in Leishmania braziliensis. Proceedings of the National Academy of Sciences of the United States of America 106: 10224–10229.
- 22. De Meeûs T, Lehmann L, Balloux F (2006) Molecular epidemiology of clonal diploids: A quick overview and a short DIY (do it yourself) notice. Infection Genetics and Evolution 6: 163–170.
- 23. Rougeron V, De Meeus T, Kako Ouraga S, Hide M, Banuls AL (2010) “Everything you always wanted to know about sex (but were afraid to ask)” in Leishmania after two decades of laboratory and field analyses. PLoS Pathog 6:
- 24. Bucheton B, Kheir MM, El-Safi SH, Hammad A, Mergani A, et al. (2002) The interplay between environmental and host factors during an outbreak of visceral leishmaniasis in eastern Sudan. Microbes Infect 4: 1449–1457.
- 25. Dereure J, El-Safi SH, Bucheton B, Boni M, Kheir MM, et al. (2003) Visceral leishmaniasis in eastern Sudan: parasite identification in humans and dogs; host-parasite relationships. Microbes Infect 5: 1103–1108.
- 26. Jamjoom MB, Ashford RW, Bates PA, Kemp SJ, Noyes HA (2002) Towards a standard battery of microsatellite markers for the analysis of the Leishmania donovani complex. Ann Trop Med Parasitol 96: 265–270.
- 27. Coombs JA, Letcher BH, Nislow KH (2008) CREATE: a software to create input files from diploid genotypic data for 52 genetic software programs. Molecular Ecology Resources 8: 578–580.
- 28. Goudet J (1995) FSTAT (Version 1.2): A computer program to calculate F-statistics. Journal of Heredity 86: 485–486.
- 29. Nei M, Chesser RK (1983) Estimation of fixation indices and gene diversities. Annals of Human Genetics 47: 253–259.
- 30. Wright S (1965) The interpretation of population structure by F-statistics with special regard to system of mating. Evolution 19: 395–420.
- 31. Weir BS, Cockerham CC (1984) Estimating F-statistics for the analysis of population structure. Evolution 38: 1358–1370.
- 32. Goudet J, Raymond M, De Meeûs T, Rousset F (1996) Testing differentiation in diploid populations. Genetics 144: 1933–1940.
- 33. de Meeus T, McCoy KD, Prugnolle F, Chevillon C, Durand P, et al. (2007) Population genetics and molecular epidemiology or how to “debusquer la bete”. Infect Genet Evol 7: 308–332.
- 34. Rousset F (2004) Genetic Structure and Selection in Subdivided Populations. Princeton: Princeton University Press.
- 35. Hedrick PW (2005) A standardized genetic differentiation measure. Evolution 59: 1633–1638.
- 36. R-Development-core-team (2010) R: A Language and Environment for Statistical Computing.
- 37. De Meeus T, Guegan JF, Teriokhin AT (2009) MultiTest V.1.2, a program to binomially combine independent tests and performance comparison with other related methods on proportional data. BMC Bioinformatics 10: 443.
- 38. Corander J, Marttinen P, Siren J, Tang J (2008) Enhanced Bayesian modelling in BAPS software for learning genetic structures of populations. BMC Bioinformatics 9: 539.
- 39. Koffi M, Solano P, Barnabé C, De Meeûs T, Bucheton B, et al. (2007) Genetic characterisation of Trypanosoma brucei ssp by microsatellite typing: new perspectives for the molecular epidemiology of human African trypanosomosis. Infection Genetics and Evolution 7: 675–684.
- 40. Ravel S, De Meeûs T, Dujardin JP, Zeze DG, Gooding RH, et al. (2007) The tsetse fly Glossina palpalis palpalis is composed of several genetically differentiated small populations in the sleeping sickness focus of Bonon, Côte d'Ivoire. Infection Genetics and Evolution 7: 116–125.
- 41. Saitou N, Nei M (1987) The neighbor-joining method: a new method for reconstructing phylogenetic trees. Molecular biology and evolution 4: 406–425.
- 42. Cavalli-Sforza LL, Edwards AWF (1967) Phylogenetic analysis: model and estimation procedures. American Journal of Human Genetics 19: 233–257.
- 43. Chevenet F, Brun C, Bañuls AL, Jacq B, Christen R (2006) TreeDyn: towards dynamic graphics and annotations for analyses of trees. BMC bioinformatics 7: 439.
- 44. Balloux F (2001) EASYPOP (version 1.7): A computer program for population genetics simulations. Journal of Heredity 92: 301–302.
- 45. Oddone RSC, Schönian G, dos Santos de Sousa C, Cupolillo E, Espinosa D, Arevalo J, Noyes H, Mauricio I, Kuhls K (2009) Development of a multilocus microsatellite typing approach for discriminating strains of the Leishmania subgenus (L.) Viannia. Journal of Clinical Microbiology.
- 46. Kuhls K, Keilonat L, Ochsenreither S, Schaar M, Schweynoch C, et al. (2007) Multilocus microsatellite typing (MLMT) reveals genetically isolated populations between and within the main endemic regions of visceral leishmaniasis. Microbes Infect 9: 334–343.
- 47. Tait A, MacLeod A, Tweedie A, Masiga D, Turner CMR (2007) Genetic exchange in Trypanosoma brucei: Evidence for mating prior to metacyclic stage development. Molecular and Biochemical Parasitology 151: 133–136.
- 48. Schmid-Hempel R, Salathe R, Tognazzo M, Schmid-Hempel P (2011) Genetic exchange and emergence of novel strains in directly transmitted trypanosomatids. Infect Genet Evol 11: 564–571.
- 49. Mauricio IL, Gaunt MW, Stothard JR, Miles MA (2001) Genetic typing and phylogeny of the Leishmania donovani complex by restriction analysis of PCR amplified gp63 intergenic regions. Parasitology 122: 393–403.
- 50. Richter D, Postic D, Sertour N, Livey I, Matuschka FR, et al. (2006) Delineation of Borrelia burgdorferi sensu lato species by multilocus sequence analysis and confirmation of the delineation of Borrelia spielmanii sp nov. International Journal of Systematic and Evolutionary Microbiology 56: 873–881.
- 51. Mohamed HS, Ibrahim ME, Miller EN, Peacock CS, Khalil EA, et al. (2003) Genetic susceptibility to visceral leishmaniasis in The Sudan: linkage and association with IL4 and IFNGR1. Genes Immun 4: 351–355.
- 52. Mauricio IL, Gaunt MW, Stothard JR, Miles MA (2007) Glycoprotein 63 (gp63) genes show gene conversion and reveal the evolution of Old World Leishmania. Int J Parasitol 37: 565–576.
- 53. Gaffney D, Campbell RA (1994) A PCR based method to determine the Kalow allele of the cholinesterase gene: the E1k allele frequency and its significance in the normal population. J Med Genet 31: 248–250.
- 54. Nébavi F, Ayala FJ, Renaud F, Bertout S, Eholié S, et al. (2006) Clonal population structure and genetic diversity of Candida albicans in AIDS patients from Abidjan (Côte d'Ivoire) (vol 103, pg 3663, 2006). Proceedings of the National Academy of Sciences of the United States of America 103: 5632–5632.
- 55. Brookfield J (1996) Population genetics. Curr Biol 6: 354–356.
- 56. Paetkau D, Strobeck C (1995) The molecular basis and evolutionary history of a microsatellite null allele in bears. Mol Ecol 4: 519–520.
- 57. Pemberton JM, Slate J, Bancroft DR, Barrett JA (1995) Nonamplifying alleles at microsatellite loci: a caution for parentage and population studies. Mol Ecol 4: 249–252.
- 58. Alam MZ, Kuhls K, Schweynoch C, Sundar S, Rijal S, et al. (2009) Multilocus microsatellite typing (MLMT) reveals genetic homogeneity of Leishmania donovani strains in the Indian subcontinent. Infect Genet Evol 9: 24–31.
- 59. Prugnolle F, De Meeus T (2010) Apparent high recombination rates in clonal parasitic organisms due to inappropriate sampling design. Heredity 104: 135–140.
- 60. De Meeûs T, Prugnolle F, Agnew P (2009) Asexual reproduction in infectious diseases. In: Schön I, Martens K, van Dijk P, editors. Lost Sex: The Evolutionary Biology of Parthenogenesis. pp. 517–533.
- 61. Razakandrainibe FG, Durand P, Koella JC, De Meeûs T, Rousset F, et al. (2005) “Clonal” population structure of the malaria agent Plasmodium falciparum in high-infection regions. Proceedings of the National Academy of Sciences of the United States of America 102: 17388–17393.
- 62. Whitlock MC (2005) Combining probability from independent tests: the weighted Z-method is superior to Fisher's approach. J Evol Biol 18: 1368–1373.