A single mutation in the GSTe2 gene allows tracking of metabolically based insecticide resistance in a major malaria vector
Genome Biology volume 15, Article number: R27 (2014)
Metabolic resistance to insecticides is the biggest threat to the continued effectiveness of malaria vector control. However, its underlying molecular basis, crucial for successful resistance management, remains poorly characterized.
Here, we demonstrate that the single amino acid change L119F in an upregulated glutathione S-transferase gene, GSTe2, confers high levels of metabolic resistance to DDT in the malaria vector Anopheles funestus. Genome-wide transcription analysis revealed that GSTe2 was the most over-expressed detoxification gene in DDT and permethrin-resistant mosquitoes from Benin. Transgenic expression of GSTe2 in Drosophila melanogaster demonstrated that over-transcription of this gene alone confers DDT resistance and cross-resistance to pyrethroids. Analysis of GSTe2 polymorphism established that the point mutation is tightly associated with metabolic resistance to DDT and its geographical distribution strongly correlates with DDT resistance patterns across Africa. Functional characterization of recombinant GSTe2 further supports the role of the L119F mutation, with the resistant allele being more efficient at metabolizing DDT than the susceptible one. Importantly, we also show that GSTe2 directly metabolizes the pyrethroid permethrin. Structural analysis reveals that the mutation confers resistance by enlarging the GSTe2 DDT-binding cavity, leading to increased DDT access and metabolism. Furthermore, we show that GSTe2 is under strong directional selection in resistant populations, and a restriction of gene flow is observed between African regions, enabling the prediction of the future spread of this resistance.
This first DNA-based metabolic resistance marker in mosquitoes provides an essential tool to track the evolution of resistance and to design suitable resistance management strategies.
Insecticide-based control interventions, notably long lasting insecticide nets (LLINs) and indoor residual spraying (IRS), are key components of malaria control in Africa. Unfortunately, increasing resistance to available insecticide classes across Africa in major malaria vectors, such as the mosquito Anopheles funestus, is threatening the continued effectiveness of these control tools. Elucidating the molecular basis of insecticide resistance in these vectors is crucial to designing suitable resistance management strategies and preventing potentially devastating public health consequences .
Pyrethroids are the only class of insecticide used for LLIN impregnation, and are also the predominant insecticide class used in IRS, while dichlorodiphenyltrichloroethane (DDT) is still retained for use in IRS, due to the limited number of cost-effective alternatives. The two major causes of pyrethroid and DDT resistance are target-site insensitivity and metabolic resistance . Target-site resistance (knockdown resistance (kdr)) can be easily monitored by PCR, while metabolic resistance is not easily tracked due to its complex molecular basis, despite its greater operational impact on malaria control . The detailed molecular and structural basis through which candidate metabolic resistance genes confer resistance remains poorly characterized [3–5]. No single metabolic resistance marker has been identified in malaria vectors. Consequently, there is still no DNA-based diagnostic tool to detect metabolic resistance easily in field populations unlike target site resistance. Such tools are needed to detect and track resistance at an early stage allowing control programs to design rational, evidence-based resistance management strategies.
To address this gap in our knowledge, we dissected the molecular basis of metabolic resistance in a DDT/pyrethroid-resistant An. funestus population from Benin that is kdr-free . Using genome-wide transcriptional and functional analyses combined with structural and population genetics studies, we conclusively demonstrated that a single amino acid change in the binding pocket of the glutathione-s-transferase epsilon 2 (GSTe2) gene, coupled with increased transcription, confers a high level of DDT resistance and also cross-resistance to pyrethroids in the major African malaria vector An. funestus.
Results and discussion
DDT resistance profile of Anopheles funestusin Benin and Cameroon mosquitoes
The Pahou population (Benin) had previously been described as highly DDT resistant  with no mortality 24 hr after 1 hr of exposure. The WHO bioassays conducted in Kpome (Benin) in this study indicated that this An. funestus population, which is located approximately 100 km from Pahou, was also resistant to DDT, with only 5% mortality 24 hr after 1 hr of exposure to 4% DDT for females and 21% mortality for males. The population from Gounougou in Cameroon was also resistant to DDT, but at a moderate level, with 52% mortality for females and 65% for males.
Genome-wide transcription microarray analysis
A comparative genome-wide transcription analysis using the 44,000-probe (60-mer) custom microarray chip previously designed by Riveron et al.  was successfully used to identify the set of genes associated with DDT resistance in Pahou (Benin). A total of 1,352 probes were differentially expressed (≥ twofold at P < 0.01) between the DDT-resistant samples from Pahou and the susceptible strain FANG, with 321 upregulated and 1,031 downregulated in the Pahou samples compared with the susceptible strain FANG (Additional file 1: Figure S1A). The lists of the top probes that were differentially upregulated or downregulated are presented in Table 1 and Additional file 2: Table S1 respectively.
The most upregulated detoxification gene in Benin was a glutathione S-transferase gene, GSTe2, with a fold-change (FC) of 11.9 (P = 0.0076) (Table 1 and Additional file 3: Table S2; Additional file 1: Figure S1A). The consistency of this upregulation is further supported by the fact that both probes designed for GSTe2 were upregulated in the Pahou population (Table 1). Furthermore, orthologs of GSTe2 were also upregulated in DDT-resistant strains in other mosquito species such as An. gambiae[3, 7] and Aedes aegypti, suggesting that this gene most likely plays a key role in DDT resistance in many mosquito species.
The two duplicated cytochrome P450 genes, CYP6P9a (FC = 6.4) and CYP6P9b (FC = 3.9), which confer pyrethroid resistance in southern African populations , were also upregulated in the Benin population. However, because their encoded proteins are unable to metabolize DDT , they are most likely associated with the permethrin resistance observed in the Pahou mosquitoes. Other genes with a known association with insecticide resistance were also upregulated in Pahou mosquitoes and are detailed in Additional file 4 Supplementary text and listed in Table 1.
Validation of the microarray upregulation with quantitative RT-PCR
Quantitative reverse-transcriptase PCR (qRT-PCR) confirmed the significant upregulation of GSTe2 in Benin (FC = 44.8, P = 0.007) (Additional file 1: Figure S1B) in comparison to the FANG susceptible strain. The GSTe2 gene is significantly over-transcribed in DDT-exposed mosquitoes compared to unexposed mosquitoes (FC = 82.0 vs 44.8, P < 0.05) (Figure 1A), suggesting that induction of GSTe2 occurs in addition to constitutive over-expression in resistant mosquitoes. The expression pattern of GSTe2 across Africa significantly correlates with DDT-resistance patterns. This gene is 69.5- and 79.1-fold upregulated in highly DDT-resistant mosquitoes from Benin (P < 0.05) compared with the fully DDT-susceptible Mozambique and Malawi samples, respectively (Figure 1A). It is 35-fold upregulated (P < 0.05) compared to Ugandan samples (moderate DDT resistance ). Similarly, the two P450s CYP6P9a and CYP6P9b had 2.96 and 7.13 times more expression in mosquitoes from Pahou than in the susceptible FANG mosquitoes.
Overall, transcription analyses indicated that GSTe2 is the main detoxification gene associated with DDT resistance in Pahou mosquitoes. In other mosquito species, DDT resistance is conferred by additional mechanisms, such as the knockdown resistance (kdr) mutation (An. gambiae and Aedes aegypti), or by upregulation of cytochrome P450s, as observed for An. gambiae for CYP6Z1 and CYP6M2. None of these mechanisms are present in An. funestus because no kdr mutation was detected in this population , and the only upregulated P450s in this population (CYP6P9a and CYP6P9b) are unable to metabolize DDT . These results indicate that GSTe2 is the main detoxification gene associated with DDT resistance in this Benin population of An. funestus. Therefore, further analysis of this gene would be very good for elucidating metabolic resistance mechanisms and for understanding the detailed molecular basis through which it confers resistance to insecticides.
Transgenic expression of GSTe2 in Drosophilaflies
To establish whether the upregulation of GSTe2 alone can confer DDT and pyrethroid resistance, transgenic Drosophila melanogaster flies expressing GSTe2 cloned from Benin were generated using the GAL4/UAS system under the ubiquitous Act5C-GAL4 driver (Act5C-GSTe2). After confirming the expression of GSTe2 in transgenic F1 progeny by qRT-PCR (Additional file 1: Figure S1C), bioassays revealed that the transgenic Act5C-GSTe2 flies were resistant to 4% DDT (19.1% mortality after 24 hr exposure), whereas, all the controls that did not over-express GSTe2 were susceptible (83.5% to 97.9% mortality, P < 0.0001) (Figure 1B). These results indicate that GSTe2 upregulation alone is sufficient to confer DDT resistance. Such a transgenic expression approach was also recently successfully used to confirm the involvement of two P450s (CYP6P9a and CYP6P9b) in conferring pyrethroid resistance in An. funestus and previously in Drosophila, indicating the usefulness of this technique.
Noticeably, the upregulation of GSTe2 also conferred cross-resistance to pyrethroids. Indeed, bioassays performed with 2% permethrin revealed that the transgenic Act5C-GSTe2 flies (35.6% mortality after 24 hr exposure) but not the controls (78.9% to 97.2% mortality, P < 0.001) (Figure 1C) were permethrin resistant. A moderately reduced mortality rate (P < 0.05) was observed in transgenic flies after 0.15% deltamethrin exposure only after 24 hr (Additional file 1: Figure S1D). However, the mortality rate obtained for pyrethroids was higher than for DDT, suggesting that the resistance conferredto pyrethroids by GSTe2 is lower than to DDT. Such observations are in accordance with the resistance profile of the Pahou population, which is highly resistant to DDT but only moderately resistant to permethrin . This potential role of GSTe2 in permethrin resistance is in agreement with previous reports that suggested that orthologs of GSTe2 in other mosquitoes are associated with pyrethroid resistance. Indeed, microarray analyses of An. gambiae reported upregulation of GSTe2 in a permethrin-resistant strain, with the suggestion that the protein for this gene could be involved in permethrin resistance either by acting as a pyrethroid-binding protein and sequestering the insecticide  or by protecting mosquitoes against the oxidative stress and lipid peroxidation caused by exposure to permethrin . In the dengue fever mosquito Ae. aegypti, a partial knockdown of the ortholog of GSTe2 led to increasing mortality to permethrin, indicating that GSTe2 is also associated with permethrin resistance in this species . This cross-resistance to pyrethroids is of significant concern for malaria control as GSTe2 could protect mosquitoes against the major insecticides used in public health.
Detection of resistance markers in GSTe2
To detect resistance markers for GSTe2, we analyzed the full-length cDNA (666 bp) from resistant Benin mosquitoes, and four susceptible mosquito strains from Uganda, Malawi, Mozambique and Zambia. There were 19 polymorphic sites (8 replacement substitutions) from 24 clones. The major difference in the Benin strain was a fixed leucine (CTT) to phenylalanine (TTT) replacement (L119F). The Benin haplotypes form a unique clade compared with the haplotypes from the other countries (Figure 2A).
Analysis of the full-length GSTe2 (881 bp including introns) of six resistant and six susceptible mosquitoes from another Benin location (Kpome, 6°55′N, 2°19′E, because no susceptible mosquitoes were available from Pahou) confirmed that the GSTe2 polymorphism was associated with DDT resistance. No polymorphisms were identified for the resistant mosquitoes and there was a single haplotype (Hap1-R) bearing the resistant 119 F allele (Figure 2B,C; Additional file 3: Table S2). Five heterozygote polymorphic sites were identified for susceptible mosquitoes with a single amino acid change, L119F. The presence of a susceptible haplotype Hap2-S bearing the L119 susceptible allele was also observed (Figure 2B,C).
Correlation between the L119F mutation and DDT and pyrethroid resistance
The TaqMan assay, designed to genotype the L119F mutation, unambiguously detected the three genotypes (Additional file 5: Figure S2A) in 35 susceptible and 35 resistant mosquitoes from Gounougou (9°05′N, 13°40′E) in Cameroon (west-central Africa) (as very few susceptible mosquitoes were obtained from Benin). The distribution of the genotype frequencies between susceptible and resistant mosquitoes significantly differed (χ2 = 99.7, degrees of freedom = 2 and P < 0.0001) (Figure 3B). A significant association was observed between the L119F mutation and DDT resistance, with an odds ratio of 5.74 (P < 0.0001) when comparing the frequency of the resistant and susceptible alleles in both sample sets. When comparing the frequency of the two homozygous genotypes (T/T and C/C), the T/T mutant genotype was more highly associated with DDT resistance than the C/C wild genotype with an odds ratio of 22.3 (P < 0.0001). Indeed, 72.5% of the mosquitoes with the homozygote mutant allele (T/T) were resistant to DDT whereas approximately half of the heterozygote mosquitoes were resistant to DDT (54.5%), and only 10% of the homozygote wild-type (C/C) were resistant to DDT, suggesting co-dominance of this allele (Figure 3B). The correlation of the L119F mutation with DDT resistance indicates that this TaqMan assay could be used as a diagnostic tool to detect and map the distribution of DDT resistance in An. funestus in Africa.
This is the first report to our knowledge of a point mutation conferring metabolic resistance in a mosquito species. To date, metabolic resistance to DDT has only been associated with a single haplotype of the P450 gene CYP6G1 in D. melanogaster or to a potential allelic variation in the ortholog of GSTe2 in Ae. aegypti but not to a single point mutation as detected in this study in An. funestus. Such a point mutation conferring metabolic resistance has previously only been described for the house fly (Musca domestica) for the alpha esterase E7 gene  and for the sheep blowfly (Lucilia cuprina) for the carboxylesterase E3 gene . Detection of the L119F mutation in An. funestus, which confers high DDT resistance, indicates that similar point mutations conferring resistance to other insecticide classes could also be found in mosquitoes, notably in malaria vectors, allowing the design of reliable molecular diagnostic assays that can accurately detect and track metabolic resistance in the field. Although a similar trend was observed for the pyrethroids permethrin and lambda-cyhalothrin, the correlation between L119F and resistance to these insecticides was not significant (P = 0.08 for permethrin, P = 0.1 for lambda-cyhalothrin) (Additional file 5: Figure S2C,D). Hence, the cross-resistance to pyrethroids may be primarily conferred by the quantitative change in GSTe2 expression rather than the qualitative change from the L119F mutation.
In contrast, DDT resistance was more closely associated with the L119F mutation than the level of GSTe2 over-expression (Figure 2D). Indeed, despite a lower fold-change in the susceptible L119 genotype, no significant difference (P > 0.05) in the GSTe2 expression level was observed between the homozygote resistant RR (FC = 26.6), the heterozygote RS (FC = 23) and the homozygote susceptible SS (FC = 16.5) (Figure 2D). These results suggest that the L119F mutation could be the predominant cause of DDT resistance.
Geographical distribution of L119F across Africa
Genotyping 30 mosquitoes from each of nine African countries revealed that the geographical distribution of the 119 F resistant allele strongly correlates with the distribution of DDT resistance across Africa [19, 20] (Figure 3C; Additional file 5: Figure S2B). The 119 F resistant allele is fixed in highly DDT-resistant Benin mosquitoes but completely absent in fully susceptible southern African populations [5, 21, 22]. It also occurs in other DDT-resistant populations in West Africa with a frequency of 48.2%, 44.2% and 25%, respectively, in Cameroon, Ghana and Burkina Faso, in correlation with the previously reported prevalence of DDT resistance in these countries [19, 20]. The resistant 119 F allele has also been detected in the eastern African countries of Uganda (20.4%) and Kenya (7.8%) but with lower frequencies, reflecting the moderate level of DDT resistance that was previously reported . The L119F distribution in Africa is similar to that of the A296S resistance-to-dieldrin (RDL) mutation, which confers dieldrin resistance  and probably reflects contemporary patterns of gene flow between An. funestus populations across Africa. Indeed, as for L119F, no resistance allele was found for RDL in southern Africa in line with the full susceptibility to dieldrin in this region. Similarly, as for L119F, only a low frequency of RDL was detected in Uganda in east Africa.
Metabolic assays with heterologous GSTe2 enzyme
An assessment of the DDT dehydrochlorinase activity of the recombinant 119 F resistant enzyme (Benin) and that of susceptible L119 revealed that the 119 F allele is 3.4 times more efficient at metabolizing DDT (82.9% DDT depletion after 1 hr reaction in the presence of the cofactor glutathione (GSH); P < 0.001) than the L119 allele (24.4% depletion) (Figure 1D and Additional file 6: Figure S3A). This is confirmed by their respective kinetic parameters (Figure 1E), with the 119 F allele having a higher catalytic efficiency (kcat/K m ratio) for DDT (316.3 μM−1.s−1 vs 92.0 μM−1.s−1) (Table 2).
Additionally, significant permethrin metabolism was observed for the Benin 119 F-GSTe2 allele (45% depletion, P = 0.016) (Figure 1F; Additional file 6: Figure S3B and S3C) with the high-performance liquid chromatography (HPLC) metabolism profile showing three potential metabolite peaks. This suggests that GSTe2 confers permethrin resistance by directly metabolizing this insecticide. Similarly, recent research has shown that a glutathione S-transferase (GST) from Culex pipiens (CpGSTD1) was able to metabolize pyrethroid-like fluorescent substrates directly, including a permethrin-like substrate . The nature of the permethrin metabolites corresponding to the three peaks observed in this study remains unknown. Three potential metabolic pathways have been proposed for GST metabolism of pyrethroid-like substrates including halogen substitution, Michael addition, and thiolysis . Because none of the three metabolite peaks in this study matched the permethrin ester hydrolysis products, phenoxybenzyl alcohol or phenoxybenzoic acid (produced by the Michael addition and the thiolysis routes), it is likely that a GSH conjugate to permethrin through the halogen substitution metabolic route could be the main mechanism. However, further investigations are needed to determine the nature of these metabolites. But it is possible that GSTe2 could also confer permethrin resistance through other mechanisms such as sequestration  or protection against oxidative stress and lipid peroxidation .
In contrast, no significant metabolism was observed for deltamethrin, which is in line with the low deltamethrin resistance observed in the field in Pahou (88% mortality for deltamethrin vs 66% for permethrin), but also the lower deltamethrin resistance observed in transgenic UAS-GSTe2 Drosophila flies.
Genetic diversity of GSTe2across Africa
Signature of positive selection on GSTe2in Benin
Analysis of the 882-bp full-length sequence of GSTe2 (663 bp for the three exons and 219 bp for the two introns and part of the 3′ UTR) from field-collected mosquitoes from six countries revealed that GSTe2 is under strong directional selection in Benin in contrast to other countries (Table 3 and Additional file 7: Table S3, Additional file 8: Figure S4A). Indeed, the significant reduction of genetic diversity of GSTe2 in Benin is apparent with only one polymorphic site observed in a single mosquito out of 12 from Pahou, in contrast to the seven to nineteen polymorphic sites in the other populations. Only two haplotypes were present in Pahou mosquitoes (the minor haplotype was only present as a heterozygote in one out of twelve individuals), in contrast to the seven to ten haplotypes for the other countries (Additional file 8: Figure S4A; Table 3). Genetic diversity parameters such as haplotype diversity (hd) and nucleotide diversity for Benin are significantly lower than for the other five countries (Additional file 8: Figure S4A). This reduced diversity of GSTe2 for Benin mosquitoes suggests the presence of a selective sweep across and around this gene. A future assessment will establish the extent of this selective sweep in this genomic region on chromosome 2 L.
However, other tests of selection such as the McDonald and Kreitman (MK), Hudson–Kreitman–Aguade (HKA), dN/dS and K a /K s ratios and Tajima’s D tests did not show any signature for positive selection in Benin (Additional file 7: Table S3). This could be due to the fact that the sweep around GSTe2 in the Benin population is nearing fixation as shown by the 100% frequency of the resistant 119 F allele in this population. In a situation of near fixation of the selective sweep, directional selection is better shown by reduced levels of genetic variation . Evidence of selection through reduced genetic diversity was not observed in Cameroon and Ghana, where DDT resistance has also been detected, although not at the high level as in Benin. However, the presence of the predominant Benin-resistant haplotype in these west African countries, although at a lower frequency, could result from the combined effect of local DDT selection and migration.
Directional selection has previously been observed in this species in southern African pyrethroid-resistant populations at the two tandemly duplicated P450 genes CYP6P9a and CYP6P9b. Another P450, CYP6G1, which confers DDT resistance in D. melanogaster, is also under strong directional selection, with a single resistant haplotype containing an Accord transposable element in the 5′ UTR region distributed globally . The same CYP6G1 gene is also under directional selection in D. simulans, with a 100-kb region having extensive reduced heterozygosity due to a selective sweep around CYP6G1. These observations further suggest that insecticide resistance is an excellent example of natural selection at work. As expected, the non-coding regions exhibit greater genetic diversity than the coding regions in all countries, with more polymorphic sites and haplotypes apart from the Benin population, where the unique polymorphic site is in the coding region (Table 3).
Haplotype distribution of GSTe2 across Africa
A total of 39 haplotypes were observed for the full gene (Additional file 8: Figure S4B), 16 for coding regions only (Additional file 8: Figure S4C), and there were seven protein variants (Additional file 8: Figure S4D) across Africa. Resistant haplotypes (with the 119 F resistant allele) resolving around a predominant haplotype BN23 (nearly fixed in Benin and only present in DDT-resistant locations) all had reduced diversity, which is indicative of a recent selection with fewer haplotypes (11 out of 39), and more homogeneity (less mutational step differences ≤4). The susceptible haplotypes (with the L119 allele) resolved around a predominant haplotype MAL3 and were more diverse, with more haplotypes (29 out of 39) and greater heterogeneity (up to 13 mutational step differences). Overall, the haplotype distribution of GSTe2 across Africa (Additional file 8: Figure S4B,C,D) reveals that the L119F mutation is the main polymorphism shaping GSTe2 genetic diversity with the mutational step at this allele consistently defining a resistant and a susceptible group of haplotypes (Figure 3D and Additional file 9: Figure S5).
Population substructure of GSTe2 across Africa
Analysis of the genetic diversity of GSTe2 indicated that An. funestus populations are clearly structured according to their pattern of DDT resistance and by geographical distance. The construction of a maximum likelihood phylogenetic tree of GSTe2 sequences revealed that Cameroon and Ghana mosquitoes cluster with Benin mosquitoes (Figure 4), correlating with DDT-resistance profiles. Susceptible populations from Mozambique and Malawi have the lowest genetic differentiation (K st = 0.016) (Additional file 10: Table S4; Figure 3E). The moderately resistant east Africa Ugandan population has an intermediate differentiation to all populations. This pattern of gene flow correlates with the genetic structure patterns of An. funestus populations across Africa based on microsatellite markers , suggesting that there are barriers to gene flow between African An. funestus populations, which will affect the spread of insecticide-resistance genes.
Structural basis of DDT resistance conferred by GSTe2
To identify the structural changes responsible for the higher activity in the resistant GSTe2 Benin allele (119 F), its X-ray three-dimensional structure was determined and analyzed in comparison with that of a L119-GSTe2 susceptible allele cloned from Uganda (UG-GSTe2) (Figure 5A,B). The topology of GSTe2 indicates that the 221 amino acids of the An. funestus GSTe2 (afGSTe2) are divided into two distinct domains similar to that of An. gambiae (92% similarity between the two species, with a difference of 20 amino acids). The N-terminal domain and a C–terminal domain are connected by a short hinge loop called the linker (amino acids 80–89) (Figure 5C). The N-terminal domain (1–79) has the typical TRX-fold (βαβαββα) domain in which the central four-stranded mixed β-sheet (B1-B4) is flanked on one side by helices H1 and H3 and on the other side by H2. The C-terminal domain (90–221) is a five α-helix bundle (H4-H8) in which the long α-helix H4 is noticeably bent . The active site is located in a deep cleft formed at the interface of the two domains, specifically by the interaction of H1 and H3 from the N-terminal domain and H4, H6 and H8 from the C-terminal domain.
Overall, the structure of BN-GSTe2 and UG-GSTe2 are very similar (Figure 5C, Additional file 11: Figure S6). However, significant local differences can be observed between the alleles at the C-terminal ends of H4 (containing the L119F mutation; residues from 113 to 128) and H8 (residues from 213 to 220) (Figure 6A). These differences can be described as a concerted movement of the BN-GSTe2 H4 and H8 helices with respect to those of UG-GSTe2, which opens the active site cleft. The root mean squared deviation (RMSD) between the structures is 0.64 Å for all Cα atoms. There is a significant decrease of the RMSD between the alleles from 0.64 to 0.42 Å when excluding H4 (containing the 119 F mutation) and the RMSD is only 0.33 Å when excluding both H4 and H8. A higher RMSD is observed when the two alleles are compared only for the H4 helix (RMSD = 0.9 Å) (Figure 6A). Furthermore, the significant difference between the BN-GSTe2 and UG-GSTe2 alleles is further compounded by the observation that the UG-GSTe2 allele structure is more similar to the structure of the An. gambiae GSTe2 protein [PDB:2imi] (RMSD = 0.4 Å) than to Benin-GSTe2 (RMSD = 0.64 Å) despite a 20-amino-acid difference (92% similarity) (Figure 6B), indicating that the L119F mutation is the key factor that modifies the GSTe2 structure (Figure 6C).
The active site of GSTe2 can be divided into two sub-sites. Firstly, there is the glutathione binding site, called the G-site, where one GSH molecule is bound. Secondly, there is the H-site, which recognizes the hydrophobic substrate. The architecture of the G-site of this superfamily of proteins is well described and nearly identical to that observed in BN-GSTe2 and UG-GSTe2. Although slight variations were observed between the G-sites of both alleles, there is no significant conformational difference, and these G-binding pockets do not have any major rearrangement (Additional file 11: Figure S6). The H-site is a large and slightly open hydrophobic cavity adjacent to the G-site. It is built of residues from different loops at the N-terminal domain and noticeably includes the variable C-terminal ends of helices H4 and H8 at the C-terminal domain (Figure 5C). Consequently, the differences observed between BN-GSTe2 and UG-GSTe2 mainly affect the structure of the substrate-binding pocket (H-site) (Figure 6A).
Putative DDT-binding pocket (H-site) and the DDT metabolism reaction
Leu119 normally forms part of the solvent-inaccessible hydrophobic core that stabilizes the conformation of helices H4, H5 and H8 of Uganda-GSTe2 (Figure 6A). However, to accommodate the bulkier Phe119 side chain in the Benin-GSTe2 structure, the N-terminal end of H4 has a dramatic bend, which increases the size of the H-site in comparison to the UG allele, where the cavity is tighter (Figure 6C). The H-site surface representation of both proteins with a bound GSH confirms that the resistant BN allele has a larger putative DDT cavity than the susceptible UG allele (Figure 6C).
Attempts to co-crystallize the substrate DDT or the product DDE with BN-GSTe2 and UG-GSTe2 were unsuccessful. Therefore, to investigate the substrate-binding properties of both alleles and assess their capacity to hydrolyze DDT, a substrate molecule was docked into the H-site (Figure 6C). This docking of a DDT molecule into the H-site of both alleles demonstrated that the Benin-GSTe2 has the appropriate size and shape to accommodate DDT better into a ‘close to reactive’ conformation. This includes the proper stabilization of the DDT chloride-phenyl rings and the optimal positioning of the DDT Cα, pointing towards the thiolate from glutathione (Figure 6C) leading to increased DDT metabolism. In contrast, the smaller H-site of Uganda-GSTe2 does not allow DDT recognition in this conformation (Figure 6C) leading to reduced DDT access and metabolism. This difference at the H-site explains the increased active site accessibility, the higher activity and consequently the high DDT resistance of the Benin population compared with other populations such as those from Uganda or Malawi.
A comparison of the structure of GSTe2 for An. funestus to the structures of AgGSTe2 and agGSTd1-6 clearly indicates that the key factors in the high metabolic activity of BN-GSTe2 are the H4 helix position and the shape and size of the DDT-binding cavity (Additional file 12: Figure S7). GSTd1-6, which belongs to the delta class of GST, possesses less than 1/350 the DDT-metabolizing activity of AgGSTe2 and was previously shown to differ from AgGSTe2 mainly at the H4 and H8 helices . The H-site surface representation of these four proteins shows that the AgGSTd1-6 cavity, although larger than those of AfGSTe2s and AgGSTe2, does not have the proper shape for docking a DDT molecule inside. This explains why it has the lowest DDT-metabolizing activity. BN-GSTe2, UG-GSTe2 and agGSTe2 have a V-shaped cavity, which will fit DDT. However, BN-GSTe2 has the largest binding pocket, which is able to accommodate the DDT molecule in a ‘close to reactive’ conformation leading to the higher resistance it confers to DDT.
The proteins for the BN-GSTe2 and UG-GSTe2 alleles are separated by two amino acid changes, L119F and I131V. However, only L119F is located in the H4 helix whereas I131V is located in the H5 helix, where no conformational change is observed between the BN and UG forms. In addition, the AgGSTe2 form, which is structurally similar to UG-GSTe2, contains I131, like the BN form. Therefore, this indicates that the I131V mutation does not play a role in the DDT metabolic activity of the BN allele and confirms that the molecular basis of the high DDT resistance conferred by the BN-GSTe2 allele compared with the susceptible allele is solely explained by the significant conformational changes that are induced by the L119F mutation. This finding is in agreement with what was previously predicted by . This previous study suggested that for the An. gambiae GSTe2 to be more active against DDT, more conformational changes in the DDT-binding pocket were needed to further accommodate a DDT substrate and that such adjustments should not be on a large scale because the pocket was already well shaped. These changes are exactly what happened in the Benin form, where a single amino acid change induced the adjustment needed to accommodate more DDT in this resistant An. funestus strain, conferring a high level of DDT resistance in contrast with the UG allele, where this adjustment is absent. Overall, these structural analyses demonstrate that the L119F mutation is the causative mutation that confers DDT resistance.
This study presents a comprehensive and detailed dissection of the genetic, molecular and structural basis of the metabolic resistance to an insecticide in a major malaria vector. Firstly, we conclusively detected the main gene responsible and showed that the resistance is conferred by a qualitative and a quantitative change in the DDT metabolizing enzyme (GSTe2). Secondly, we detected, for the first time, to our knowledge, for a mosquito species, a molecular resistance marker for metabolic resistance and designed a reliable DNA-based diagnostic assay that can accurately detect and map the distribution of resistance across Africa. This diagnostic tool will be valuable for vector control programs as resistance can be detected at an early stage, allowing suitable resistance management strategies to be implemented. Thirdly, we showed that the geographical distribution of DDT resistance, its origin and future spread patterns can be established or predicted based on the patterns of GSTe2 genetic diversity, which are mainly due to the single L119F mutation.
Materials and methods
Blood-fed adult female An. funestus mosquitoes resting indoors were collected in houses between 6.00 AM and 12.00 PM in Pahou (6°23′N, 2°13′E), which is located near the Atlantic coast in southern Benin, west Africa. Several collections were conducted between July 2009 and April 2011. Another collection was conducted in Kpome (6°55′N, 2°19′E), which is located inland approximately 100 km from Pahou, in December 2011. The other samples used in this study have been previously described, and the respective references are provided when the samples are mentioned. Mosquito collection and rearing were performed as described previously [6, 9]. Briefly, F1 adults were generated from field-collected female mosquitoes (using an egg-forced laying method ) and were randomly mixed in cages for subsequent experiments. All females used for individual oviposition were morphologically and molecularly identified as An. funestus ss, as previously described .
Insecticide susceptibility assays with 4% DDT and 0.75% permethrin were conducted using 2- to 5-day-old F1 adults from pooled F1 mosquitoes, as described previously .
A custom An. funestus microarray chip containing 44,000 probes (60-mer) (A-MEXP-2245), previously described by Riveron et al. , was used to identify the set of genes associated with DDT resistance in Benin mosquitoes. Labeled complementary RNA (cRNA) was obtained from three biological replicates of DDT-resistant mosquitoes from Pahou that had been unexposed to insecticide and from susceptible unexposed mosquitoes (from the fully susceptible laboratory strain FANG). The Pahou mosquitoes were all DDT resistant as no mortality was recorded after 1 hr exposure to 4% DDT .
RNA was extracted from three batches of ten An. funestus females that were all 2 to 5 days old from the two sample sets using the Picopure RNA isolation kit (Arcturus, Applied Biosystems, Carlsbad, CA, USA). The quantity and quality of extracted RNA were assessed using the NanoDrop ND1000 spectrophotometer (Thermo Fisher, Waltham, MA, US) and Bioanalyzer (Agilent, Santa Clara, CA, USA), respectively. The cRNA from each sample was amplified using the Agilent Quick Amp labelling Kit (two-color) following the manufacturer’s protocol. The cRNA from the resistant Pahou samples was labelled with cy5 dye whereas the susceptible strain FANG (S) was labelled with the cy3 dye. The cRNA quantity and quality were assessed before labelling using the NanoDrop and the Bioanalyzer. Labelled cRNAs were hybridized to the arrays for 17 hr at 65°C, according to the manufacturer’s protocol. Five hybridizations were conducted by swapping the biological replicates.
Microarray data were analyzed using Genespring GX 12.0 software. To identify the differentially expressed genes, a twofold-change cutoff and a significance level of P < 0.01 with Benjamin-Hochberg correction for multiple testing were applied.
The genes that were most associated with resistance from the microarray analysis were assessed by qRT-PCR to validate their expression pattern using three biological replicates for Pahou resistant mosquitoes (R) (alive after 24 hr exposure to DDT), Pahou control mosquitoes (C) (not exposed to any insecticide) and susceptible FANG mosquitoes (S). The primers are listed in Additional file 13: Table S5. Next, 1 μg total RNA from each of the three biological replicates from Pahou resistant, Pahou control and susceptible FANG mosquitoes was used as the template for cDNA synthesis using Superscript III (Invitrogen, Carlsbad, CA, US) with oligo-dT20 and RNase H, according to the manufacturer’s instructions. The qRT-PCR amplification was conducted as previously described . The relative expression and fold-change of each target gene in R and C relative to S was calculated according to the 2-ΔΔCT method, incorporating PCR efficiency  after normalization with the housekeeping genes RSP7 (ribosomal protein S7, AGAP010592) and actin 5C (AGAP000651).
Transgenic expression of GSTe2 in Drosophilaflies
Construction of transgenic Drosophilalines
The full-length GSTe2 gene was amplified from the cDNA of the resistant Benin mosquitoes using Phusion High-Fidelity DNA Polymerase (Fermentas, Burlington, Ontario, Canada) and the following conditions: 1 cycle at 95°C for 5 min; 35 cycles of 94°C for 20 s, 57°C for 30 s and 72°C for 60 s; and 1 cycle at 72°C for 5 min. The primers used are listed in Additional file 13: Table S5. The PCR products were purified using the QIAquick PCR Purification Kit (Qiagen, Valencia, CA, USA) and cloned into the pJET1.2/blunt cloning vector using the CloneJETTM PCR Cloning Kit (Fermentas, Burlington, Ontario, Canada). Five positive clones from both samples were purified with a QIAprep® Miniprep kit (Qiagen, Valencia, CA, USA) and sequenced on both strands. After sequence analysis, one clone of the GSTe2 that was predominant in Benin mosquitoes was selected for constructing the transgenic flies. This clone was re-amplified as described above with primers containing restriction sites for BglII and XbaI. The purified PCR products were digested using the BglII and XbaI enzymes (Fermentas, Burlington, Ontario, Canada) , cloned into the pUASattB vector (provided by Dr J Bischof, University of Zurich), pre-digested with the same restriction enzymes and transformed into JM109 cells (Promega, Madison, Wi, US). Using the PhiC31 system, clones were injected into the germ-line of D. melanogaster carrying the attP40 docking site on chromosome 2 (y1 w67c23; P (CaryP) attP40,1;2) . One transgenic line, UAS-GSTe2, was generated and balanced. For the expression of the transgene GSTe2, ubiquitous expression was obtained in the flies using the Act5C-GAL4 strain (y1 w*; P (Act5C-GAL4-w) E1/CyO,1;2) (Bloomington Stock Center, IN, USA).
Confirmation of GSTe2 expression in the transgenic flies by quantitative RT-PCR
To confirm the expression of GSTe2 in the experimental groups and the absence of expression in the control groups, total RNA was extracted from three pools of five flies. The cDNA synthesis was conducted as described above. PCR was performed using the synthesized cDNA as a template and primers specific to GSTe2 (Additional file 13: Table S5). In addition, the relative expression levels of the GSTe2 transgene were assessed by qRT-PCR in the F1 progeny under the Act5C driver and in respective controls with normalization using the RPL11 housekeeping gene.
Drosophila contact bioassays
Females from F1 expressing GSTe2 were selected as the experimental group for the insecticide bioassays. The parental lines and the female progeny from the cross between the Act5C-GAL4 females and males that did not carry the GSTe2 transgene (but with the same attP40 background) were used as controls. A comparison of the mortality rates between the experimental groups and the control groups was used to assess whether GSTe2 was conferring resistance. In addition, 2- to 5-day-old females post-eclosion were used in a contact assay with 4% DDT and the pyrethroid insecticides permethrin (2%) and deltamethrin (0.15%), as described previously . Then 20 to 25 flies were placed in each vial, and the mortality plus knockdown was scored after 1 hr, 2 hr, 3 hr, 6 hr, 12 hr and 24 hr of exposure to the insecticide. For all assays, at least six replicates were performed. Student’s t-test was used to compare the mortality plus knockdown in the experimental group with each control group.
Analyzing GSTe2polymorphisms in relation to DDT resistance
Analysis of cDNA polymorphisms
The full-length cDNA of GSTe2 was cloned and sequenced for DDT-resistant mosquitoes from Benin and for DDT-susceptible An. funestus across Africa (Uganda, Malawi, Mozambique and Zambia) to detect potential mutations that could be associated with DDT resistance. The cDNA amplification was performed using the same cDNA synthesized for qRT-PCR with the Phusion polymerase, and the product was cloned and sequenced as described above. The primers used are listed in Additional file 13: Table S5.
Polymorphism analysis between susceptible and resistant field mosquitoes in Benin
A further assessment of the correlation of the polymorphism of GSTe2 and DDT resistance was conducted by individually amplifying and direct-sequencing the genomic full-length sequence of GSTe2 (all exons and introns) for six susceptible mosquitoes (dead after 1 hr exposure) and six resistant mosquitoes (alive after 1 hr exposure) from Kpome because no susceptible mosquitoes were obtained in Pahou. The polymorphic positions were detected through a manual analysis of sequence traces using BioEdit and as sequence differences in multiple alignments using ClustalW . dnaSP 5.10  was used to define the haplotype phase (through the Phase option) and to assess genetic parameters, such as nucleotide diversity π, haplotype diversity and the D and D* selection estimates. A maximum likelihood tree of the haplotypes for both cDNA and genomic amplifications was constructed using MEGA 5.2 , and a haplotype network was built using the TCS program  (95% connection limit, gaps treated as a fifth state) to assess the potential connection between haplotypes and resistance phenotypes.
Diagnostic assay and assessment of whether L119F correlates with DDT resistance
To genotype the GSTe2 L119F mutation in field populations of An. funestus, a custom TaqMan assay was designed after repeated failures from pyrosequencing due to the many thymine nucleotides (Ts) around the C/T mutation site. The primer and reporter sequences are provided in Additional file 14: Table S6. The TaqMan reactions were performed in a 10-μl final volume containing 1× SensiMix (Bioline, London, UK), 800 nM of each primer and 200 nM of each probe using an Agilent MX3005P machine. The following cycling conditions were used: 10 min at 95°C, 40 cycles of 15 s at 92°C and 1 min at 60°C. This assay was used to assess the correlation between the genotypes of the L119F mutation and DDT-resistant phenotypes. For this assay, due to the lack of susceptible mosquitoes from Pahou and the few susceptible mosquitoes from Kpome (in Benin), An. funestus samples were collected in northern Cameroon from Gounougou (9°05′N, 13°40′E), as described above. A WHO bioassay for DDT was conducted as described above. Then 35 mosquitoes that were dead after 1 hr of 4% DDT exposure (susceptible) and 35 alive mosquitoes (resistant) from Gounougou were genotyped for the L119F mutation using the TaqMan assay.
L119F correlation with pyrethroid resistance
To assess the correlation between the genotypes of the L119F mutation and pyrethroid-resistant phenotypes, 25 mosquitoes that were dead after 1 hr of 0.75% permethrin (type I pyrethroid) exposure (susceptible) and 25 alive mosquitoes (resistant) from Gounougou were genotyped for the L119F mutation using the TaqMan assay. The same genotyping was conducted for lambda-cyhalothrin (a type II pyrethroid). Association between resistance phenotypes and the genotypes was assessed by estimating the odds ratios and the statistical significance based on the Fisher exact probability test .
Contribution of GSTe2upregulation and the 119 F mutation to the resistance phenotype
To assess whether GSTe2 upregulation and the presence of the 119 F mutation are both necessary to confer DDT resistance, we compared the expression levels of GSTe2 for the three genotypes of the L119F mutation alongside the susceptible FANG strain using qRT-PCR. Three batches of five mosquitoes from Gounougou were used for homozygote susceptible samples (C/C; L119/L119), heterozygote samples (C/T; L119/119 F) and homozygote resistant samples (T/T; 119 F/119 F) following the protocol described above. The genotype of each mosquito was first established after a TaqMan assay using gDNA (genomic DNA) extracted from the legs, and then mosquitoes from the same genotype were pooled for RNA extraction and cDNA synthesis.
Geographical distribution of the L119Fmutation across Africa
To assess the geographical distribution of the L119F mutation across Africa, 30 field-collected An. funestus ss females from nine countries belonging to different regions of Africa were genotyped by TaqMan: Benin (Pahou, collected in 2010–2011), Ghana (Obuasi, 2009), Burkina (Bobo-Dioualasso, 2010) and Cameroon (Gounougou, 2006) in west-central Africa; Uganda (Tororo, 2009) and Kenya (Kisumu, 2012) in east Africa; and Malawi (Chikwawa, 2009), Zambia (Katete district, 2011) and Mozambique (Chokwe, 2009) in southern Africa.
Population structure of GSTe2 across African An. funestuspopulations
To assess the patterns of genetic variability of GSTe2 across African populations of An. funestus in relation to DDT resistance and to detect the potential signatures of selection on this gene, full-length GSTe2 (exons and introns) was amplified and directly sequenced for 10–15 field-collected female mosquitoes from six countries from different regions of Africa. These countries are Benin, Ghana and Cameroon in west-central Africa, Uganda in east Africa and Mozambique and Malawi in southern Africa. The patterns of genetic variability were analyzed as described above using dnaSP 5.10 . The levels of pairwise genetic differentiation between the populations were estimated in dnaSP 5.10 using the K st statistic , although F st and N st estimates were also obtained for comparison. The significance of the K st *estimates was assessed by permutation of subpopulation identities and re-calculating K st * 10,000 times, as implemented in dnaSP 5.10.
Phylogenetic tree of GSTe2haplotypes
A maximum likelihood phylogenetic tree for the coding sequences of GSTe2 across Africa was constructed using MEGA 5.2 . The best-fit substitution model was firstly assessed based on the Bayesian information criterion. This indicated that the Jukes–Cantor model best describes the GSTe2 haplotype dataset out of 24 candidate models. The Jukes–Cantor model was then used to generate the maximum likelihood tree as implemented in MEGA with 500 bootstrap replications to assess the robustness of the tree. Additionally, a haplotype network was built for both the full-length region (non-coding plus coding) and the coding region only, using the TCS program  (95% connection limit, gaps treated as a fifth state), to assess the potential connection between haplotypes and resistance phenotypes.
Test of selection on GSTe2
To test for positive selection acting on GSTe2 in relation to DDT resistance, several tests were carried out. Because reduced levels of genetic variation are an indication of positive selection particularly when there is a sweep through population nearing fixation , several genetic diversity estimates were computed using dnaSP 5.10 and compared between the six populations. These parameters included the nucleotide diversity (π), the haplotype diversity and θ, an estimate of 4N e μ, with N e the effective population size and μ the mutation rate per nucleotide. Standard deviation estimates were computed by dnaSP 5.10.
Departure from neutrality was also tested using different selection tests such as the HKA test and the MK test as implemented in dnaSP 5.10 using the GSTe2 sequence from An. gambiae (AGAP009194) for the out-group. Possible evidence of positive selection was also investigated using the K a /K s ratio (non-synonymous substitution rate/synonymous substitution rate) with K a / K s > 1 indicating positive selection, K a / K s < 1 implying purifying selection and K a / K s = 1 suggesting neutrality . Additionally, a codon-based Z test of selection was carried out to assess further the signature of positive selection of GSTe2 in resistant samples. This test uses the Nei and Gojobori method to compute the numbers of synonymous (dS) and non-synonymous (dN) substitutions per site and the numbers of potentially synonymous and potentially non-synonymous sites  as implemented in MEGA 5.2. The dN/dS ratio was calculated and the probability of rejecting the null hypothesis of strict neutrality (H0: dN = dS) in favor of the alternative positive selection hypothesis (H1: dN > dS) was estimated using the bootstrap method (1,000 replicates) in MEGA 5.2.
GSTe2 protein expression and purification
A resistant GSTe2 allele (119 F-GSTe2) from Benin (BN) and two susceptible alleles (L119-GSTe2) from Uganda (UG) and Malawi (MAL) were cloned into a pET28a expression plasmid between NdeI and XhoI sites to yield pET28a::BN-GSTe2, pET28a::UG-GSTe2 and pET28a::MAL-GSTe2 constructs. The pET28a::BN-GSTe2, pET28a::UG-GSTe2 and pET28a::MAL-GSTe2 plasmids were transformed into Escherichia coli strain BL21 (DE3) (Novagen, Madison, WI, US) for protein expression using standard protocols. A total of 5 ml of an overnight culture was sub-cultured into 500 ml of fresh 2TY broth medium plus kanamycin (50 μg/ml). The transformed cells were grown at 37°C. GSTe2 expression was then induced with 0.3 mM of isopropyl-β-D-thiogalactoside when the OD (optical density) at 600 nm was 0.6 to 0.8 overnight at 16°C. The cells were harvested by centrifugation (15 min, 4,500 g); resuspended in 25 mM TrisHCl pH 8.0, 500 mM NaCl, 20 mM imidazole and 5 mM β-mercaptoethanol; and disrupted by sonication. After centrifugation (40 min, 40,000 g), the clear supernatant was filtered, and the His-tagged GSTe2s was purified using Ni-NTA agarose (Qiagen,Valencia, CA, US) according to the manufacturer’s instructions. The filtered supernatant was mixed with the previously equilibrated beads. After incubation, a washing step with ten volumes of 25 mM TrisHCl pH 8.0, 500 mM NaCl, 20 mM imidazole and 5 mM β-mercaptoethanol buffer was performed. All constructs yielded 26.8 kDa products. After a full dialysis against 25 mM TrisHCl pH 8.0, 200 mM NaCl and 5 mM β-mercaptoethanol, the His-tag was cleaved using 7.5 units of thrombin per mg of tagged protein. A final purification step was performed using a Superdex 200 16/60 column (Amersham Biosciences Limited, London, UK) to obtain a highly purified sample (Figure 5A). The time courses of the chromatography were monitored by SDS-PAGE (Figure 5B). GSTe2 proteins were concentrated to 23 mg/ml with a 10-kDa cutoff Amicon protein concentrator (YM-10; Millipore Corporation, Bedford, MA, USA). The final protein concentration was determined spectrophotometrically using the calculated molar absorption coefficient at 280 nm. The samples were kept at 4°C.
Metabolic assay to assess the effect of GSTe2on DDT, permethrin and deltamethrin
The activity of GST was determined with a spectrophotometric assay to examine the formation of the conjugate of 1-chloro-2,4-dinitrobenzene (CDNB) and reduced GSH. One unit of enzyme is defined as the amount of enzyme that yields 1.0 μmol of conjugate product per minute at pH 6.5 and 30°C. Metabolism assays were conducted at 30°C for 60 min with shaking at 1,200 rpm, in a total volume of 0.5 ml. The buffer system was 0.1 M potassium phosphate buffer (pH 6.5), 2.5 mM GSH and 0.2 units of enzyme in the presence of 10 μg/ml DDT, 0.025 mg/ml permethrin or 0.03 mg/ml deltamethrin all in methanol (the solvent did not exceed 10% of the reaction total volume). The control sample contained the same reagent mixture with the boiled recombinant enzyme. After 1 hr of incubation, 500 μl of methanol was added to stop the reaction and the samples were then centrifuged at 13,000 rpm for 20 min at room temperature and 200 μl of the resulting supernatant were transferred to HPLC vials. The quantity of DDT, DDE and pyrethroid remaining in the samples was determined by reverse-phase HPLC with a monitoring absorbance wavelength of 232 nm (Chromeleon, Dionex, Sunnyvale, CA, US). Briefly, 100 μl of sample was injected into a 250 mm C18 column (Acclaim 120, Dionex, Sunnyvale, CA, US) at 23°C. Separation of DDT and DDE was achieved using an isocratic mobile phase of 92% methanol and 8% water with a flow rate of 1 ml/min. Kinetic studies were conducted as previously described . The results were analyzed by non-linear regression using GraphPad Prism v4.0 software (GraphPad Software, Inc, San Diego, CA, USA).
Additional GSTe2metabolism of permethrin by gradient run
To better resolve the metabolite peaks observed for permethrin with the initial isocratic run, a further analysis of the metabolism profile of permethrin by GSTe2 was carried out using a gradient run condition. The parent compound and its metabolites were separated on an C18 Acclaim column by injecting 100 μl of reaction products reconstituted in 1 ml methanol after an ethyl acetate extraction and evaporation step. The reaction mixture was 2 ml with 10 μg/ml permethrin and 0.2 unit of GSTe2. The reaction was initiated by adding 2.5 mM GSH. A bovine serum incubation mixture was used as a negative control. The mobile phases used were the solvent acetonitrile (A), methanol (B) and H2O (C). The analytes were eluted with the following gradient programs (linear increase): 0 min (0% A, 5% B, 95% C), 15 min (37% A, 5% B, 58% C), 25 min (60% A, 5% B, 35% C), 50 min (85% A, 5% B, 10% C), 51 min (95% A, 5% B, 0% C), 56 min (95% A, 5% B, 0% C), 61 min (0% A, 5% B, 95% C) and 69 min (0% A, 5% B, 95% C), at a flow rate of 1 ml/min. Peaks were detected at 232 nm. Data collection and analysis were conducted using Chromeleon software.
Determination of GSTe2structure and docking of DDT alleles
GSTe2 from Benin, Uganda and Malawi mosquitoes was purified as described in the supplementary information for the metabolic assays. The GSTe2 initial crystallization conditions were investigated by high-throughput techniques with a NanoDrop robot (Innovadyne Technologies Inc., Santa Rosa, CA, US ) using the commercial screen solutions Crystal Screen 1 and 2 (Hampton Research, Aliso Viejo, CA, US), PACT Suite and JCSG Suite (Qiagen,Valencia, CA, US). Crystallization assays were conducted using the sitting-drop vapor-diffusion method at 18°C in 96-well plates (Innovaplate SD-2 microplates, Innovadyne Technologies Inc., Santa Rosa, CA, US ). Drops of 250 nl protein at 23 mg/ml and 250 nl precipitant solution were mixed and equilibrated against 65 μl of the well solution. Preliminary crystallization conditions led to crystal clusters of thin plates. Several strategies were used to optimize the best crystallization conditions, which included adjusting the protein sample composition, the precipitant concentration and pH values, and screening with different additives (Additive Screen, Hampton Research, Aliso Viejo, CA, US) or detergents. The final conditions were scaled up on 24-well plates (Linbro plates, Hampton Research, Aliso Viejo, CA, US) through hanging-drop experiments and on a 60-well microbath under oil (Terasaki plates) at 18°C.
The crystals used in our analysis grew from a mix of 1 μl of the protein solution at 23 mg/ml and 2 μl of a precipitant solution. After the refinement of several parameters, isolated prismatic and rod-shaped crystals were obtained. BN-GSTe2 was crystallized by hanging-drop vapor-diffusion using a solution as precipitant containing 40% (v/v) PEG 300 and 0.1 M phosphate-citrate pH 4.2. However, UG-GSTe2 and MAL-GSTe2 were crystallized using a microbatch under oil technique with precipitant 25% w/v PEG 1500 and 0.1 M PCB (Propionate, Cacodylate, Bis-Tris Propane) buffer pH 6.0, and 25% w/v PEG 1500 and 0.1 M MMT (L-Malic acid, MES, Tris)buffer pH 5.0, respectively. To grow holo_GSTe2 crystals, 0.5 μl of 10 mM GSH (L-glutathione reduced)/ GSSG (L-glutathione oxidized) from Hampton Research were added to the crystallization drops. Several cryoprotectants were tested, including glycerol, MPD (2-Methyl-2,4-pentanediol) and polyethylene glycol. The best cryoprotectant solution was 20% glycerol on the crystal mother liquor.
Data collection and structure resolution
Crystals were mounted on a fiber loop, transferred to the cryoprotectant solution and flash-frozen at 100 K in a nitrogen gas steam. Preliminary diffraction data were collected using an in-house Imaging Plate Mar345dtb detector (MarResearch, Norderstedt, Germany) with Cu Kα X-rays generated by a rotating-anode generator (MicroStar, Bruker, Billerica, MA, US) with Helios mirrors (Bruker, Billerica, MA, US) operated at 45 kV and 60 mA. The apo_UG-GSTe2 and holo_MAL-GSTe2 crystals were not suitable for X-ray data analysis because their diffraction was very poor. However, the crystals of apo_BN-GSTe2, holo_BN-GSTe2 and holo_UG-GSTe2 had good diffraction patterns. A complete diffraction dataset was collected for each using the European Synchrotron Radiation Facility (Grenoble, France) (see details in Additional file 15: Table S7). Diffraction data were processed with XDS (X-ray Detector Software)  and scaled with SCALA from the CCP4 package (Collaborative Computational Project, Number 4, 1994). Molecular replacement with the program Phaser  was used to resolve the GSTe2 structures. The coordinates from the An. gambiae glutathione S-transferase epsilon 2 (agGSTe2) [PDB:2IL3] (92% sequence identity ) were used to resolve the apo_BN-GSTe2 and holo_UG-GSTe2 structures. The holo_BN-GSTe2 structure was resolved using the holo_UG-GSTe2 coordinates. Several cycles of restrained refinement with PHENIX  and iterative model building with COOT were required to obtain the final models. The water molecules were also modeled. The data collection, data processing and model refinement statistics are summarized in Additional file 15: Table S7.
The stereochemistry of the models was verified with MolProbity, and the molecular model figures were produced using PyMol (The PyMOL Molecular Graphics System, Version 188.8.131.52 Schrödinger, LLC, Portland, OR, US). The RMSDs between the proteins structures were calculated in COOT. The DALI algorithm was used for structure-based sequence alignment of BN-GSTe2 and UG-GSTe2 with other insect GSTs.
Docking of DDT to the different alleles of GSTe2 was performed using the Genetic Optimization for Ligand Docking (GOLD) software from the Cambridge Crystallographic Data Center, UK. Both DDT and DDE were used in the docking calculations. The 3D structures of these molecules were obtained from the Cambridge Structural Database with reference codes CPTCEL and DCLPEY, respectively. The stereochemistry of the ligands was confirmed with the Mercury program. Both holo_BN-GSTe2 and holo_UG-GSTe2 coordinates were used to define the binding site for molecular docking studies. All the solvent molecules in the structures were removed, and hydrogen atoms were added to the whole protein. In addition, hydrogen atoms were added to the cofactor molecule and cysteine SH group was deprotonated to obtain GS− for the docking calculations. The docking cavity was defined as a collection of amino acids enclosed within a sphere with a 10 Å radius around the GS− molecule, giving freedom of movement to F118, R112, E116, L119 or F119, F120, T165 and L207 and more restrained flexibility to M111 and F115 in the side-chain rotamers. The standard default settings were used in all calculations (number of dockings: 10), but early termination was allowed when the top three solutions were within 1.5 Å RMSD from each other.
The DNA sequences reported in this paper have been deposited in the GenBank database [GenBank:KC800340-GenBank:KC800421], the microarray data in ArrayExpress (E-MTAB-1578) and the 3D X-ray structures in the PDB database (BN-GSTe2-GSH [PDB:3zmk] and UG-GSTe2-GSH [PDB:3zml]).
Hudson, Kreitman and Aguade test
indoor residual spraying
long lasting insecticide net
McDonald and Kreitman test
polymerase chain reaction
Protein Data Bank
quantitative reverse transcriptase polymerase chain reaction
resistance to dieldrin
root mean square deviation
World Health Organization.
World Health Organization: Global Plan for Insecticide Resistance Management (GPIRM). 2012, Geneva, Switzerland
Hemingway J, Ranson H: Insecticide resistance in insect vectors of human disease. Annual Review Entomol. 2000, 45: 369-389.
David JP, Strode C, Vontas J, Nikou D, Vaughan A, Pignatelli PM, Louis C, Hemingway J, Ranson H: The Anopheles gambiae detoxification chip: a highly specific microarray to study metabolic-based insecticide resistance in malaria vectors. Proc Natl Acad Sci USA. 2005, 102: 4080-4084. 10.1073/pnas.0409348102.
Muller P, Warr E, Stevenson BJ, Pignatelli PM, Morgan JC, Steven A, Yawson AE, Mitchell SN, Ranson H, Hemingway J, Paine MJ, Donnelly MJ: Field-caught permethrin-resistant Anopheles gambiae overexpress CYP6P3, a P450 that metabolises pyrethroids. PLoS Genet. 2008, 4: e1000286-10.1371/journal.pgen.1000286.
Riveron JM, Irving H, Ndula M, Barnes KG, Ibrahim SS, Paine MJ, Wondji CS: Directionally selected cytochrome P450 alleles are driving the spread of pyrethroid resistance in the major malaria vector Anopheles funestus. Proc Natl Acad Sci USA. 2013, 110: 252-257. 10.1073/pnas.1216705110.
Djouaka R, Irving H, Tukur Z, Wondji CS: Exploring mechanisms of multiple insecticide resistance in a population of the malaria vector Anopheles funestus in Benin. PLoS One. 2011, 6: e27760-10.1371/journal.pone.0027760.
Ranson H, Rossiter L, Ortelli F, Jensen B, Wang X, Roth CW, Collins FH, Hemingway J: Identification of a novel class of insect glutathione S-transferases involved in resistance to DDT in the malaria vector Anopheles gambiae. Biochem J. 2001, 359: 295-304. 10.1042/0264-6021:3590295.
Lumjuan N, Rajatileka S, Changsom D, Wicheer J, Leelapat P, Prapanthadara LA, Somboon P, Lycett G, Ranson H: The role of the Aedes aegypti epsilon glutathione transferases in conferring resistance to DDT and pyrethroid insecticides. Insect Biochem Mol Biol. 2011, 41: 203-209. 10.1016/j.ibmb.2010.12.005.
Morgan JC, Irving H, Okedi LM, Steven A, Wondji CS: Pyrethroid resistance in an Anopheles funestus population from Uganda. PLoS One. 2010, 5: e11872-10.1371/journal.pone.0011872.
Ramphul U, Boase T, Bass C, Okedi LM, Donnelly MJ, Muller P: Insecticide resistance and its association with target-site mutations in natural populations of Anopheles gambiae from eastern Uganda. Trans R Soc Trop Med Hyg. 2009, 103: 1121-1126. 10.1016/j.trstmh.2009.02.014.
Chiu TL, Wen Z, Rupasinghe SG, Schuler MA: Comparative molecular modeling of Anopheles gambiae CYP6Z1, a mosquito P450 capable of metabolizing DDT. Proc Natl Acad Sci USA. 2008, 105: 8855-8860. 10.1073/pnas.0709249105.
Mitchell SN, Stevenson BJ, Muller P, Wilding CS, Egyir-Yawson A, Field SG, Hemingway J, Paine MJ, Ranson H, Donnelly MJ: Identification and validation of a gene causing cross-resistance between insecticide classes in Anopheles gambiae from Ghana. Proc Natl Acad Sci USA. 2012, 109: 6147-6152. 10.1073/pnas.1203452109.
Daborn PJ, Lumb C, Boey A, Wong W, Ffrench-Constant RH, Batterham P: Evaluating the insecticide resistance potential of eight Drosophila melanogaster cytochrome P450 genes by transgenic over-expression. Insect Biochem Mol Biol. 2007, 37: 512-519. 10.1016/j.ibmb.2007.02.008.
Kostaropoulos I, Papadopoulos AI, Metaxakis A, Boukouvala E, Papadopoulou-Mourkidou E: Glutathione S-transferase in the defence against pyrethroids in insects. Insect Biochem Mol Biol. 2001, 31: 313-319. 10.1016/S0965-1748(00)00123-5.
Vontas JG, Small GJ, Hemingway J: Glutathione S-transferases as antioxidant defence agents confer pyrethroid resistance in Nilaparvata lugens. Biochem J. 2001, 357: 65-72. 10.1042/0264-6021:3570065.
Daborn PJ, Yen JL, Bogwitz MR, Le Goff G, Feil E, Jeffers S, Tijet N, Perry T, Heckel D, Batterham P, Feyereisen R, Wilson TG, ffrench-Constant RH: A single p450 allele associated with insecticide resistance in Drosophila. Science. 2002, 297: 2253-2256. 10.1126/science.1074170.
Campbell PM, Newcomb RD, Russell RJ, Oakeshott JG: Two different amino acid substitutions in the ali-esterase, E3, confer alternative types of organophosphorus insecticide resistance in the sheep blowfly, Lucilia cuprina. Insect Biochem Mol Biol. 1998, 28: 139-150. 10.1016/S0965-1748(97)00109-4.
Claudianos C, Russell RJ, Oakeshott JG: The same amino acid substitution in orthologous esterases confers organophosphate resistance on the house fly and a blowfly. Insect Biochem Mol Biol. 1999, 29: 675-686. 10.1016/S0965-1748(99)00035-1.
Okoye PN, Brooke BD, Koekemoer LL, Hunt RH, Coetzee M: Characterisation of DDT, pyrethroid and carbamate resistance in Anopheles funestus from Obuasi, Ghana. Trans R Soc Trop Med Hyg. 2008, 102: 591-598. 10.1016/j.trstmh.2008.02.022.
Wondji CS, Dabire RK, Tukur Z, Irving H, Djouaka R, Morgan JC: Identification and distribution of a GABA receptor mutation conferring dieldrin resistance in the malaria vector Anopheles funestus in Africa. Insect Biochem Mol Biol. 2011, 41: 484-491. 10.1016/j.ibmb.2011.03.012.
Cuamba N, Morgan JC, Irving H, Steven A, Wondji CS: High level of pyrethroid resistance in an Anopheles funestus population of the Chokwe District in Mozambique. PLoS One. 2010, 5: e11010-10.1371/journal.pone.0011010.
Wondji CS, Coleman M, Kleinschmidt I, Mzilahowa T, Irving H, Ndula M, Rehman A, Morgan J, Barnes KG, Hemingway J: Impact of pyrethroid resistance on operational malaria control in Malawi. Proc Natl Acad Sci USA. 2012, 109: 19063-19070. 10.1073/pnas.1217229109.
Huang H, Yao H, Liu JY, Samra AI, Kamita SG, Cornel AJ, Hammock BD: Development of pyrethroid-like fluorescent substrates for glutathione S-transferase. Anal Biochem. 2012, 431: 77-83. 10.1016/j.ab.2012.09.011.
Biswas S, Akey JM: Genomic insights into positive selection. Trends Genet. 2006, 22: 437-446. 10.1016/j.tig.2006.06.005.
Schlenke TA, Begun DJ: Strong selective sweep associated with a transposon insertion in Drosophila simulans. Proc Natl Acad Sci USA. 2004, 101: 1626-1631. 10.1073/pnas.0303793101.
Michel AP, Ingrasci MJ, Schemerhorn BJ, Kern M, Le Goff G, Coetzee M, Elissa N, Fontenille D, Vulule J, Lehmann T, Sagnon N, Costantini C, Besansky NJ: Rangewide population genetic structure of the African malaria vector Anopheles funestus. Mol Ecol. 2005, 14: 4235-4248. 10.1111/j.1365-294X.2005.02754.x.
Wang Y, Qiu L, Ranson H, Lumjuan N, Hemingway J, Setzer WN, Meehan EJ, Chen L: Structure of an insect epsilon class glutathione S-transferase from the malaria vector Anopheles gambiae provides an explanation for the high DDT-detoxifying activity. J Struct Biol. 2008, 164: 228-235. 10.1016/j.jsb.2008.08.003.
Schmittgen TD, Livak KJ: Analyzing real-time PCR data by the comparative C (T) method. Nat Protoc. 2008, 3: 1101-1108. 10.1038/nprot.2008.73.
Markstein M, Pitsouli C, Villalta C, Celniker SE, Perrimon N: Exploiting position effects and the gypsy retrovirus insulator to engineer precisely expressed transgenes. Nat Genet. 2008, 40: 476-483. 10.1038/ng.101.
Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.
Rozas J: DNA sequence polymorphism analysis using DnaSP. Methods Mol Biol. 2009, 537: 337-350. 10.1007/978-1-59745-251-9_17.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
Clement M, Posada D, Crandall KA: TCS: a computer program to estimate gene genealogies. Mol Ecol. 2000, 9: 1657-1659. 10.1046/j.1365-294x.2000.01020.x.
Kwiatkowska RM, Platt N, Poupardin R, Irving H, Dabire RK, Mitchell S, Jones CM, Diabate A, Ranson H, Wondji CS: Dissecting the mechanisms responsible for the multiple insecticide resistance phenotype in Anopheles gambiae s.s., M form, from Vallee du Kou, Burkina Faso. Gene. 2013, 519: 98-106. 10.1016/j.gene.2013.01.036.
Hudson RR, Slatkin M, Maddison WP: Estimation of levels of gene flow from DNA sequence data. Genetics. 1992, 132: 583-589.
Zhang Z, Li J, Zhao XQ, Wang J, Wong GK, Yu J: KaKs_calculator: calculating Ka and Ks through model selection and model averaging. Genomics Proteomics Bioinformatics. 2006, 4: 259-263. 10.1016/S1672-0229(07)60007-2.
Nei M, Gojobori T: Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol. 1986, 3: 418-426.
Kabsch W: Xds. Acta Crystallogr D Biol Crystallogr. 2010, 66: 125-132. 10.1107/S0907444909047337.
McCoy AJ, Grosse-Kunstleve RW, Adams PD, Winn MD, Storoni LC, Read RJ: Phaser crystallographic software. J Appl Crystallogr. 2007, 40: 658-674. 10.1107/S0021889807021206.
Adams PD, Afonine PV, Bunkoczi G, Chen VB, Davis IW, Echols N, Headd JJ, Hung LW, Kapral GJ, Grosse-Kunstleve RW, McCoy AJ, Moriarty NW, Oeffner R, Read RJ, Richardson DC, Richardson JS, Terwilliger TC, Zwart PH: PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr D Biol Crystallogr. 2010, 66: 213-221. 10.1107/S0907444909052925.
This work was supported by a Wellcome Trust RCD Fellowship (083515/Z/07/Z) to CSW. JMR was supported by a Fundación Ramón Aceres fellowship. Two grants (BFU2011-25384/CSD2006-00015) to AA contributed to the crystallographic study. We thank Mark Paine for his insight and comments.
The authors declare that they have no competing interests.
CSW conceived, designed and coordinated the research. RD, BDM and CSW carried out the sample collection and bioassays. HI and CSW performed the transcription analyses and carried out the genetic variability analysis. JMR performed the transgenic expression in Drosophila. CY performed the crystallographic study with AA. HMI, SSI and JMR performed the metabolic assays of GSTe2. HR and JH contributed toward data analysis and significant insights. JMR, CY, HMI and CSW wrote the manuscript with contribution from all the authors. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Figure S1: Transcription profiling and functional analyses of GSTe2. (A) Volcano plot showing the differential expression pattern between DDT-resistant Benin mosquitoes and the susceptible FANG strain, with a twofold-change cutoff and P < 0.01. GSTe2 is highlighted as one of the most upregulated genes in Benin mosquitoes. (B) qRT-PCR validation of microarray upregulation of the main detoxification genes that were differentially expressed between resistant and susceptible DDT samples. c738 is a short-chain dehydrogenase (combined_c738) that is upregulated according to the microarray. (C) The relative expression of the GSTe2 transgene in the transgenic D. melanogaster Act5C-GSTe2 strain and the control sample with no transgene expression. The data shown are the mean ± standard error of the mean (n = 3). (D) Deltamethrin bioassay tests on transgenic Act5C-GSTe2 flies (Exp-GSTe2) and control strains (two parental (UAS-GSTe2 and GAL4-Actin) and F1 progeny that do not express the GSTe2 transgene (Cont-NO)). (DOC 146 KB)
Additional file 3: Table S2: Genetic variability parameters for GSTe2 for resistant (alive) and susceptible (dead) mosquitoes fromKpome (Benin). (DOC 27 KB)
Additional file 5: Figure S2: Detection of the 119 F GSTe2 resistance allele in An. funestus. (A) The results of the TaqMan diagnostic assay for genotyping L119F, with three genotypes unambiguously identified (three clusters). (B) The genotype distribution of L119F alleles across nine countries in Africa shows there is a strong correlation between L119F genotypes and known patterns of DDT resistance. For example, the 119 F (T/T) genotype is fixed in the highly DDT-resistant Benin population but is completely absent in the fully susceptible southern African populations (Malawi, Mozambique and Zambia). (C) Assessment of the correlation between the L119F alleles and the permethrin (type I pyrethroid)-resistant phenotype. (D) Assessment of the correlation between the L119F alleles and the lambda-cyhalothrin (type II pyrethroid)-resistant phenotype. (DOC 163 KB)
Additional file 6: Figure S3: Metabolic activity of GSTe2. (A) DDT metabolism by GSTe2 from Benin has a high peak for the DDE metabolite product. (B) There is a reduction in the peak for permethrin metabolism by 119 F-GSTe2 in the reaction with GSTe2 (blue) compared to the control (black) (three replicates) using isocratic conditions. The arrow indicates potential metabolites. (C) Additional permethrin metabolism by the Benin GSTe2 enzyme using gradient conditions. Blue peaks refer to the metabolism profile of active GSTe2 whereas black peaks refer to the bovine serum incubation mixture (negative control). The arrows indicate the three potential metabolites. The analytes were eluted with a linear increase gradient program. (DOC 171 KB)
Additional file 8: Figure S4: Polymorphism patterns of GSTe2 in Africa. (A) Plot of genetic diversity parameters of GSTe2 across Africa that indicates there is strong directional selection of GSTe2 in Benin mosquitoes. hd, haplotype diversity; π, nucleotide diversity. (B) Haplotypes of GSTe2 (coding and non-coding) across six countries in Africa with contrasting DDT phenotypes. The polymorphic positions are indicated with numbers above the nucleotide, and the haplotypes are labeled from 1 to 39 with preceding initials from the country where the haplotype is predominant. An asterisk (*) shows that the haplotype was observed in other countries. N is the number of individuals who share the haplotype. (C) The same but only considering coding regions. (D) The same but only considering non-synonymous substitutions providing the different protein variants of GSTe2 across Africa. (DOC 65 KB)
Additional file 9: Figure S5: Haplotype network for the full-length GSTe2 (coding and non-coding regions) across Africa. The frequency of each haplotype is reflected by the size of its polygon. Each node represents a segregating mutation (the polymorphic position is given above the branches). The polygons in grey are resistant haplotypes that harbor the resistant 119 F allele whereas the haplotypes in white are susceptible. The mutational position of 499 is circled to indicate that this polymorphic position, which corresponds to L119F, is key to shaping the GSTe2 haplotype distribution. (DOC 677 KB)
Additional file 11: Figure S6: The conformations of the GSH binding pocket (G-site) of the resistant BN-GSTe2 allele (purple) and the susceptible UG-GSTe2 allele (green) have no significant differences. The GSH cofactor and the protein amino acids and water molecules hydrogen-bonded to it are represented in ball-and-stick mode. (DOC 807 KB)
Additional file 12: Figure S7: Comparative analysis of the H-site structure of different GSTe2 alleles and agGSTd1-6. The size and shape of the site are key determinants of DDT metabolism capacity. (Top) The molecular surface representations of those residues constituting the H-site of Uganda-GSTe2 (green), Benin-GSTe2 (purple), agGSTe2 (white) and agGSTd1-6 (yellow). Oxygen and nitrogen atoms at the surface are depicted in red and blue, respectively. (Bottom) Overlays of the structures of Uganda-GSTe2 (green) with Benin-GSTe2 (purple), agGSTe2 (white) and agGSTd1-6 (yellow) showing that most differences are in the secondary structural elements related to the H-site. (DOC 361 KB)
Authors’ original submitted files for images
About this article
Cite this article
Riveron, J.M., Yunta, C., Ibrahim, S.S. et al. A single mutation in the GSTe2 gene allows tracking of metabolically based insecticide resistance in a major malaria vector. Genome Biol 15, R27 (2014). https://doi.org/10.1186/gb-2014-15-2-r27