- Open Access
The genome and transcriptome of Haemonchus contortus, a key model parasite for drug and vaccine discovery
Genome Biology volume 14, Article number: R88 (2013)
The small ruminant parasite Haemonchus contortus is the most widely used parasitic nematode in drug discovery, vaccine development and anthelmintic resistance research. Its remarkable propensity to develop resistance threatens the viability of the sheep industry in many regions of the world and provides a cautionary example of the effect of mass drug administration to control parasitic nematodes. Its phylogenetic position makes it particularly well placed for comparison with the free-living nematode Caenorhabditis elegans and the most economically important parasites of livestock and humans.
Here we report the detailed analysis of a draft genome assembly and extensive transcriptomic dataset for H. contortus. This represents the first genome to be published for a strongylid nematode and the most extensive transcriptomic dataset for any parasitic nematode reported to date. We show a general pattern of conservation of genome structure and gene content between H. contortus and C. elegans, but also a dramatic expansion of important parasite gene families. We identify genes involved in parasite-specific pathways such as blood feeding, neurological function, and drug metabolism. In particular, we describe complete gene repertoires for known drug target families, providing the most comprehensive understanding yet of the action of several important anthelmintics. Also, we identify a set of genes enriched in the parasitic stages of the lifecycle and the parasite gut that provide a rich source of vaccine and drug target candidates.
The H. contortus genome and transcriptome provide an essential platform for postgenomic research in this and other important strongylid parasites.
Resistance to broad spectrum anthelmintic drugs is now widespread in parasites of domestic livestock [1, 2] and there are increasing concerns about the sustainability of human parasite control programs using mass administration of the same drugs . Consequently, there is an urgent need to understand the genetic mechanisms underlying anthelmintic resistance and to discover new methods of chemical and non-chemical control. However, the genomic and genetic resources required to underpin research in parasitic nematodes are lacking [4–6]. The free-living nematode Caenorhabditis elegans is a powerful model system, but it has clear limitations for the study of parasitic species . Although the need to develop workable parasitic nematode model systems is widely recognized, most human helminth species are not amenable to experimental study. In contrast, Haemonchus contortus, a gastrointestinal parasitic nematode of small ruminants, has a successful track record in anthelmintic resistance [7, 8], drug discovery [9, 10] and vaccine [11–13] research. It is amongst the most experimentally tractable parasites for a number of reasons: adult females are relatively large and produce thousands of eggs per day, allowing the production of large amounts of biological and genetic material, the infective larvae (L3) can be viably cryopreserved, and in vivo studies, including genetic crosses, can be undertaken in the natural host . Its phylogenetic position within the most closely related group of parasites to C. elegans facilitates comparative genomics and heterologous gene expression, allowing functional studies to be performed on H. contortus genes and regulatory elements [4, 14]. As this parasite is a strongylid nematode, research on it is of particular relevance to the most economically significant parasites of grazing ruminants and to the human hookworms . H. contortus itself causes major economic loss in small ruminants worldwide; it is highly pathogenic and unsurpassed in its ability to develop resistance to every anthelmintic used in its control. Notably, these include a number of core drugs used for mass drug administration programs in humans . Anthelmintic resistance in H. contortus and related strongylids now threatens the viability of the sheep industry throughout the world . This represents both a warning and a useful model for the consequences of the widespread intensive use of anthelmintics that are now being used to control human parasites in the developing world .
H. contortus has a direct and rapid lifecycle (Figure S1 in Additional file 1); adults reside and mate in the abomasum of the ruminant host, then females produce eggs to be excreted in the feces. The eggs embryonate, develop and hatch as first stage larvae (L1), develop and molt to become second stage larvae (L2), then molt again to become third stage larvae (L3) in the feces. The L3 migrate onto pasture to be ingested by the grazing ruminant host. The L3 shed the retained L2 cuticle (exsheath) in the rumen, travel to the abomasum and develop through to the fourth larval stage (L4) then adults in two to three weeks . As voracious blood-feeders, H. contortus L4 and adults can cause severe hemorrhagic gastritis, hypoproteinemia, anemia and edema, with acute infections resulting in death of the animal host. One adult female can produce up to 10,000 eggs per day  and a single animal can harbor thousands of worms. This extremely high fecundity, under conducive host and environmental conditions, can give rise to population explosions and devastating disease outbreaks. Development occurs most rapidly in warm humid conditions, but the L4 stage can undergo arrested development within the host to survive adverse conditions such as prolonged drought or cold winters . This feature of facultative arrested development, aided by movement of domestic livestock and climate change, has ensured the worldwide distribution and success of this parasite even though its evolutionary origins were in sub-Saharan Africa .
One striking feature of H. contortus, in common with related nematode species, is the extremely high level of genetic diversity that has been reported in both laboratory and field populations; this is thought to be predominantly a function of its large effective population size [20–22]. Genetic variation may underlie both the parasite's remarkable ability to adapt to different climatic regions and host species  and its alarming propensity to develop drug resistance. This high level of genetic polymorphism has also provided a major challenge to genome assembly, necessitating the production of an inbred line from which to prepare DNA template.
In this paper, we describe the assembly and annotation of the draft genome of MHco3(ISE).N1, a genetically well characterized inbred H. contortus strain that is susceptible to all major anthelmintic drugs. The most extensive comparative transcriptomic sequencing and analysis yet described for a parasitic nematode was undertaken on representative life-stages and the nematode gut to explore different aspects of nematode biology, evolution and parasitism. This draft genome provides a platform for future research in anthelmintic resistance, drug discovery and vaccine development using H. contortus, currently the most important experimental model for the strongylid nematode group.
Results and discussion
Genome structure and content
We assembled a draft sequence of the H. contortus genome based on data from a mixture of sequencing technologies (Material and methods; Additional results and methods in Additional file 1). Our final draft assembly consists of 67,687 contigs linked into 26,044 scaffolds of total length 370 Mb, including 23.8 Mb of inferred gaps between contigs, with an average and N50 scaffold length of 14,206 and 83,287 bp, respectively (Table S1 in Additional file 1). This is a significantly larger genome size than the approximately 60 Mb predicted with flow cytometry  but it is consistent with a prediction of approximately 200 to 300 Mb inferred from a pilot annotation of two large contiguous regions of the genome , making this the largest nematode genome sequenced to date (Table S1 in Additional file 1). The overall base composition of all sequence contigs was close to neutral at a mean 41.3% GC, slightly higher than that for other clade V nematodes. Approximately 93% of conserved eukaryotic genes can be identified in the assembly, suggesting that our draft assembly represents at least that fraction of the H. contortus genome , as many of these models are incomplete or split between contigs (Table S2 in Additional file 1). Despite this, our current assembly is of similar quality to some other published draft nematode genome sequences [26–30] (Table S1 in Additional file 1).
We used extensive transcriptomic evidence from across the H. contortus lifecycle (Materials and methods; Table S3 in Additional file 1) to guide de novo gene model curation for protein-coding genes. We predict a similar number of protein-coding genes to the C. elegans genome (21,799 versus 20,532), but significantly lower gene density of 59 genes per Mb, with only 7% of the genome being protein coding, compared to 200 genes per Mb and a protein coding content of approximately 30% in the C. elegans genome (Figure 1). The average coding sequence length is similar in H. contortus and C. elegans (1,127 bp in H. contortus compared to 1,371 bp in C. elegans) but the average gene length is more than double in the parasite (6,564 bp compared to 3,010 bp in C. elegans). A closer look at a subset of 2,822 H. contortus and C. elegans one-to-one orthologs shows the discrepancy is explained by an expansion in both the number and length of introns in H. contortus (average of 10 introns per gene, average size 633 bp; relative to 6 introns per gene, average size 340 bp in C. elegans). An expansion in intronic sequence has also been reported in the closely related necromenic nematode Pristionchus pacificus but to a less dramatic extent (average of nine introns per gene, average size 309 bp) . Over 80% of H. contortus and C. elegans one-to-one orthologs present on the same scaffold occur on the same C. elegans chromosome (Table S4 in Additional file 1), suggesting widespread conservation of synteny between the two species (Figure 2). However, gene order is generally poorly conserved, which is consistent with comparisons between C. elegans and related nematodes [27, 31], but regions of conserved microsynteny are apparent and may prove useful in supporting orthology of divergent genes.
Reflecting this relatively expanded genome, approximately 29% of the draft genome assembly consists of repetitive sequence (compared to approximately 16% in C. elegans), including 2,434 copies of the characteristic HcRep element previously reported from a number of trichostrongylid species [14, 32–34]. The repetitive sequence also includes representatives of many known repeat families in other nematodes, with approximately 6% of the genome derived from LINEs, 2.5% from long terminal repeat retrotransposons and 5% from DNA transposons, including TTAA-specific, Mariner-like and MuDR-superfamily elements, together with evidence of elements related to Tc1 and Tc4 of C. elegans. Despite belonging to similar families, H. contortus repeats represent independent expansions to those in other clade V nematodes, as repeat libraries from H. contortus show little similarity to genome sequence from other species, and vice versa. Our transcriptomic data suggest that active transposition is occurring, with evidence of expression for 4 out of 26 gene models annotated with transposase domains and 49 out of 482 reverse transcriptase domain-containing proteins. Expressed proteins appear to belong to a range of DNA transposon types, and to Gypsy-related and LINE retro-elements.
In C. elegans, around 17% of genes are in operons  - tightly linked clusters of two to eight genes, which are co-transcribed from the same promoter. The resulting polycistronic pre-mRNAs are resolved by trans-splicing with spliced leader (SL) SL1 and SL2 sequences. Most frequently, SL1 is trans-spliced to the first gene in an operon and downstream genes are SL2 trans-spliced. The structure (gene complement, order and orientation) of around 23% of C. elegans operons is conserved in the H. contortus genome. The structure of a further 10% of C. elegans operons appear to be partially conserved, where at least two orthologs are present on the same scaffold in the expected order and orientation, but one or more genes are in a different order, inverted or absent. Functional constraints are thought to conserve the intergenic distance in C. elegans operons to approximately 100 bp but genes in H. contortus operons are further apart: the average intergenic distance of genes with a conserved operon structure is 992 bp (median 621 bp, largest 8,329 bp), and the operon encoding ion channel subunits Hco-deg-2H and Hco-deg-3H has an intergenic distance of 2,342 bp .
Overall, SL1 trans-splicing was detected in 6,306 H. contortus genes and SL2 trans-splicing was detected in 578 genes. Of these, 318 trans-spliced genes were in the putative conserved operons identified above (Additional methods in Additional file 1). All 126 first genes in operons were trans-spliced to SL1 (SL2 trans-splicing was detected in five putative first genes, but examination of their loci suggests they are downstream genes in new operons in this species); 119 downstream genes were trans-spliced to SL1 and 73 were trans-spliced to SL2. If SL2 trans-splicing is the definitive criterion in identifying operons, the relatively low level of SL2 trans-splicing to downstream genes suggests that either operon function is less frequently conserved than operon structure in H. contortus or that undetected, divergent SL2 sequences are present. However, the relatively high frequency of SL1 trans-splicing in genes that are also SL2 trans-spliced (approximately 77%; 56 of 73 in conserved operons, 445 of 578 in all genes identified) suggests SL1 trans-splicing of downstream genes may also be relatively common in this species.
We employed two complementary approaches to global comparison of the H. contortus gene set, using the Inparanoid algorithm to look in detail at orthologs with C. elegans and P. pacificus, and OrthoMCL for a wider view of gene family evolution with other clade V nematodes. Of 5,937 orthology groups between C. elegans and H. contortus, 5,012 are one-to-one orthologs, while an additional 899 orthologs could be identified in H. contortus and P. pacificus but not C. elegans, suggesting they have been lost in the C. elegans lineage (Table S5 in Additional file 1). A number of orthology groups are significantly expanded in H. contortus, including a family of 180 Haemonchus paralogs to a single C. elegans gene that lacks any functional annotation (Table S6 in Additional file 1). Other expanded groups include genes with likely roles in parasitism, such as cysteine-rich secreted proteins, together with a set of helicase domains that include some with predicted signal peptides. Global analysis of the evolution of entire gene families (clusters of similar genes) across the clade V nematodes confirms this pattern of significant diversity within the clade (Figure 1), and allowed us to identify H. contortus genes lacking clear orthologs in other clade IV or V nematodes (Figure S2 and Table S8 in Additional file 1). This shows that the Haemonchus-specific proteome is enriched in genes encoding polypeptides involved in proteolysis, neurotransmission and carbohydrate metabolism, and in secreted proteins and those expressed in the cuticle. While some of these genes are explored in detail in the more focused analyses below, others may represent novel candidate genes involved in the host-parasite interface.
Gene expression in parasite life-stages and gut
As H. contortus progresses through its lifecycle, it must adapt to different environments with differing food sources and energy requirements (Figure S1 in Additional file 1) and this is reflected in differential gene expression. RNA-seq was used to analyze gene expression in six parasite life-stages and the adult female gut. Samples were made in triplicate from independent batches of parasite material for every stage, allowing statistically robust comparison of relative gene expression between the stages (Materials and methods; Table S3 in Additional file 1). We found significant expression for 17,483 genes in total from the 6 life-stages studied; with between 13,962 (in L3) and 15,569 (in adult males) genes expressed in each stage. A total of 11,295 genes were significantly up- or down-regulated through the lifecycle (Figure 3 and Materials and methods) and we used annotation with Gene Ontology (GO) terms to investigate their broad functions. Metabolic enzyme expression throughout the parasite lifecycle was analyzed in more detail and will be discussed separately.
Genes up-regulated in the development from egg to the L1 stage include those associated with muscle development and motor activity, while up-regulated in the egg are genes associated with oxidoreductase activity, apoptosis, body morphogenesis and development, larval and embryo development, as well as DNA replication and chromosome organization. This pattern is consistent with the progression from a developing embryonated egg to a motile and actively feeding larval stage.
In the transition to the L3 stage, a decrease in the expression of genes associated with the myosin complex and motor activity and various metabolic processes are consistent with the nematode entering a quiescent state. Among the up-regulated genes there is an association with oxygen transport and heme binding. Oxidoreductase enzymes are over-represented and may reflect the increased need to detoxify a build up of endogenous waste, consistent with previous studies showing higher cytochrome P450 (CYP) activity in the H. contortus L3 than in L1 or adult stages . CYPs have also been shown to be up-regulated in response to reduced food intake . This is consistent with the significant increase in gluconeogenesis from the L1 to L3 - which is also increased in the C. elegans dauer  - and the up-regulation of acetyl-CoA metabolic process, likely to reflect metabolism of fat stores in the non-feeding L3. Genes associated with binding of cobalamin (vitamin B12) are also up-regulated. Cobalamin has been shown to be strongly concentrated and stored in the infective L3 of other gastrointestinal nematodes  and a ready supply may be required for rapid larval development after ingestion by the host.
The L4 is the first blood-feeding stage of H. contortus. The transition from the quiescent L3 stage to the motile L4 stage is reflected in significant up-regulation of many genes, including genes associated with motor activity, the myosin complex and locomotion as well as various metabolic processes. The binding of oxygen, lipids and sugars, possibly associated with active feeding, is also up-regulated, as are changes in the expression of genes linked to response to oxidative stress that may reflect the reactivation of the parasite from its dormant stage. A significant increase in the expression of genes associated with collagen and cuticle development and body morphogenesis, consistent with parasite growth, is also observed. Interestingly, heme-binding genes were both up- and down-regulated, perhaps reflecting an increase in heme load from blood feeding and a decrease in CYP activity.
Transition from the L4 to the adult stages is characterized by several changes. In the transition to the female stage, 1,658 genes are up-regulated and correlate with gender-specific development as well as embryogenesis (adult females contain oocytes and eggs at various developmental stages), such as genitalia development, embryo development, oogenesis, ovulation, germ-line cell cycle switching, meiosis regulation and regulation of vulval development. A significant increase in DNA replication processes is also apparent. Between the L4 and adult male stage, lower expression in the adult male of genes linked with body morphogenesis, molting, collagen and cuticle development, oxidoreductase activity, heme binding and response to oxidative stress were observed among the various alterations in expression. Increased expression of genes annotated with the GO term 'structure molecule activity' is due to up-regulation of a number of major sperm protein genes. These sex-specific gene expression patterns were confirmed by comparison between adult female and male parasites, and it is clear that many genes are highly enriched in male worms, being expressed only at low levels in other stages. The large set of genes highly expressed in both eggs and adult females were again apparent.
Finally, we investigated gene expression in the H. contortus intestine, the major organ of digestion and detoxification in the nematode, by comparing the female gut sample with the whole female worm. Consistent with data from H. contortus gut EST libraries , increased expression of genes with protein kinase, cysteine-type peptidase and cysteine-type peptidase inhibitor activities predominated. Genes associated with sugar and cobalamin binding were significantly up-regulated, as were genes associated with transport of cations, anions and oligopeptides. Oxidoreductase activity was also increased, consistent with the expression pattern of detoxification genes in C. elegans .
Metabolic pathways and chokepoint analysis
Comparisons between H. contortus and the free-living nematodes revealed 22 enzyme classifications (ECs) that were restricted to the parasite (Table S9 in Additional file 1). While more detailed analysis is required, metabolism of amino acids and carbohydrates clearly differ between these two groups. For example, lysine 6-aminotransferase (EC 22.214.171.124) catalyzes lysine to glutamate, which can be further converted to α-ketoglutarate, an intermediate of the tricarboxylic acid cycle. Lysine 6-aminotransferase was previously considered restricted to prokaryotes; thus, its activity in H. contortus needs to be confirmed.
A summary of up- and down-regulated metabolic enzymes across all life-stages is shown in Table S10 in Additional file 1. The transition through eggs, L1, L3 and L4 showed a striking pattern: from L1 to L3, most enzyme classifications were down-regulated, including those involved in carbohydrate, lipid and energy metabolism, but many of these were up-regulated again in the transition to L4. This is consistent with the L3 being a stage in which development is arrested, analogous to the dauer larva in C. elegans. Further support for this comparison is the up-regulation of two enzymes that independently convert isocitrate to 2-oxo-glutarate, while most other parts of the tricarboxylic acid cycle are down-regulated. Furthermore, 2-oxo-glutarate is an entry metabolite into the ascorbate and aldarate metabolic pathway, which is implicated in increased lifespan in Drosophila . The L4-to-male transition shows a decrease in lipid metabolism coupled with an increase in amino acid metabolism.
Metabolic chokepoints - reactions that uniquely consume or produce a metabolite - are enzymes that seem likely to be essential to the parasite and so may be potential targets for future drug development. Analysis of the H. contortus network revealed 362 chokepoint reactions, five of which passed additional criteria of lacking isoenzymes and being divergent from known human enzymes  (Table S11 in Additional file 1). Comparison of these against the Therapeutic Target Database revealed that two are known potential targets for anthelmintic drugs: trehalose-6-phosphatase (EC 126.96.36.199) is a member of the sucrose metabolism pathway, and is currently being researched as a potential drug target in the filarial nematode B. malayi , while dTDP-4-dehydrorhamnose 3,5-epimerase (EC 188.8.131.52) is currently of interest in Mycobacterium tuberculosis . This validation of the approach justifies further analysis of the other three chokepoint enzymes identified.
Neuromuscular drug targets
In nematodes, the pentameric ligand-gated ion-channel (pLGIC) family is particularly numerous, with 64 members identified in H. contortus via homology with Caenorhabditis spp., P. pacificus and RNA-seq data (Figure 4). They are of great importance in parasitic nematodes because they are targets of the majority of the currently available anthelmintic drugs (as summarized in Table S12 in Additional file 1; the β-tubulin targets of the benzimidazole group have been described previously ).
The pLGICs regulate the flow of anions, typically chloride ions, or cations, including sodium and calcium, in response to an extracellular signal in the form of an activating ligand or change in pH. They are fundamental to synaptic transmission; interference with their normal function results in paralysis and death. Drugs that activate the anionic channels, such as the macrocyclic lactone ivermectin (IVM), typically inhibit neuronal transmission and muscle contraction. Those that activate the cationic channels, such as levamisole (LEV) and monepantal, stimulate neuronal transmission and typically induce muscle contraction. Here we present the most complete picture of these channels to date and show that, as expected, this parasite is very similar to C. elegans. This supports the use of the free-living worm as a functional model for the parasite nervous system. There are, however, some important differences, most significantly in glutamate signaling, which is sensitive to the macrocyclic lactones, and acetylcholine signaling, which is sensitive to LEV, as well as characteristic loss of some receptor genes associated with biogenic amine signaling.
The macrocyclic lactones, which include IVM and moxidectin, act at several different glutamate-gated chloride channels. Some of these are found in both C. elegans and H. contortus, but it is noteworthy that two H. contortus subunit genes, glc-5 and glc-6, encode glutamate-sensitive channels that are absent from C. elegans. It is likely these were lost from the rhabtidid lineage as homologous sequences can be detected in the genome of the close relative P. pacificus. Both of these subunits are targets for the macrocyclic lactones [48, 49] and changes in either their sequence or expression have been associated with drug resistance in veterinary parasites . The archetypal target of IVM, Cel-glc-1, appears to be a duplication of avr-15, specific to C. elegans. These differences may explain the difficulties encountered in understanding IVM's mode of action and resistance in parasitic nematodes by studying the model organism . Most of the other anionic channel subunits in H. contortus have direct orthologs in C. elegans (Figure 4a). The only notable absences are lgc-48 and lgc-54. lgc-48 encodes an as yet uncharacterized member of the acc-1 acetylcholine gated chloride channels  that appears to have been lost from H. contortus, and lgc-54 encodes an uncharacterized member of the C. elegans biogenic amine-gated chloride channel family that may not be present in H. contortus .
The acetylcholine gated receptor cation channels are of particular interest as they are the targets of several anthelmintic drugs already in use (LEV, monepantel (MPTL) and derquantel) and are considered to be one of the most promising gene families for the identification of new drug targets. Comparison of the H. contortus gene family with those found in C. elegans is shown in Figure 4b. There is a one-to-one correspondence in H. contortus for subunits of the acr-16 clade , named for the homomeric nicotine-sensitive channel in C. elegans, and acr-16 expression is much higher in adult males than in other life-stages, as previously described for C. elegans . The anthelmintic LEV targets receptors composed of α subunits of the unc-38 clade and non-α subunits from the unc-29 clade. In C. elegans three different alpha and two different non-alpha subunits combine to form a channel that responds to acetylcholine, strongly to LEV and only weakly to nicotine . The α-type acr-13 (lev-8) gene, required in C. elegans for the LEV receptor, appears to have been lost in H. contortus as an ortholog of this gene is detectable in P. pacificus. The non-alpha gene lev-1 is present, but the signal peptide appears to have been lost . A LEV receptor in H. contortus can be reconstituted without either of these, requiring only UNC-38, UNC-63 and UNC-29 as in C. elegans  and replacing ACR-13 with ACR-8. Most strikingly, the non-α subunit gene unc-29 in C. elegans corresponds to four paralogous copies in H. contortus . A LEV-sensitive channel has been reconstituted containing the Hco-UNC-29.1 subunit . The degree to which the other copies may have diverged in function is currently under investigation. The deg-3 clade  encodes subunits that form channels involved in chemotaxis, and is of particular interest as channels encoded by des-2/deg-3 and acr-23 in C. elegans are targeted by the relatively new anthelmintic MPTL, with acr-23 as the principal target of MPTL action in vivo . The single subunit gene mptl-1 in H. contortus corresponds to the acr-23/acr-20 pair in C. elegans. Splice site mutations in the mptl-1 gene are associated with resistance to MPTL , which would suggest that acr-23 is functionally equivalent to mptl-1.
H. contortus possesses two acetylcholine gated receptor genes that are not present in C. elegans, acr-26 and acr-27, and these two genes seem to form a distinct clade, which suggests that they are likely to form receptors with a distinct pharmacology. Orthologs of acr-26 are found in many species of parasitic nematode and therefore they might make interesting targets for the development of novel cholinergic anthelmintics.
In summary, the majority of the pLGIC subunit genes in C. elegans have direct orthologs in H. contortus, although there is more variation in the repertoire of orphan family pLGICs (Additional results in Additional file 1). This supports the use of C. elegans as a model to study the neuromusculature of H. contortus in drug screens, but there are important differences in specific anthelminthic targets: for each of the anthelmintics IVM, LEV and MPTL, the drug targets in H. contortus are functionally different to the C. elegans model.
Drug metabolism and efflux
Parasitic nematodes are armed with a large repertoire of inducible metabolizing enzymes and transporters to protect against environmental toxins. The nematode detoxification pathway can be divided into three main phases: modification, conjugation and excretion  involving the cytochrome P450s (CYPs) and the short-chain dehydrogenase/reductases (SDRs) in phase 1, the UDP-glucoronosyl transferases (UGTs) and the glutathione S-transferases (GSTs) in phase 2 and the ATP-binding cassette (ABC) transporters in phase 3. Little is known about the impact of the parasite detoxification system on anthelminthic efficacy or resistance , but a better understanding of these pathways will allow their role in anthelmintic resistance to be assessed and should also be informative in the design of new drugs and synergistic agents . We have annotated a large number of modification and conjugation genes in the current assembly, identifying a total of 42 CYPs, 44 short-chain dehydrogenase/reductases, 34 UDP-glucoronosyl transferases and 28 glutathione S-transferases (Figure S5 in Additional file 1), but here we focus on the excretion transporters implicated in drug resistance.
Members of the ABC transporter family hydrolyze ATP and couple the energy released with the active transport of a wide range of compounds, including small organic molecules, lipids, proteins and metal ions. They are an essential component of many biological processes and are fundamental to the barrier between a nematode and its environment. These transporters consist of a basic structure of six transmembrane domains with an associated intracellular ATP binding motif. The functioning transporter complex requires two of these protein halves, but some members of this family are fusion proteins with 12 transmembrane domains and two ATP binding motifs. We find numerous differences in ABC transporter genes between C. elegans and H. contortus (Additional results and Figure S4 in Additional file 1), with a reduced complement of haf-transporters, including, for example, the loss of haf-6, which is essential for efficient RNA interference in C. elegans , a significant expansion of ced-7, which has an unknown function in amphid and phasmid sensory cells, and an extensive change in the repertoire of multidrug resistance protein genes. The P-glycoprotein (pgp) transporters are of particular interest as they have been implicated in resistance of H. contortus to IVM and other anthelmintics [49, 63], and this family has undergone significant change compared to the free living C. elegans. In total, 10 P-glycoprotein genes have been identified in the H. contortus genome and knowledge of this full complement will now allow a more systematic analysis of the role of P-glycoproteins in resistance to IVM and other anthelmintics. A cluster of four C. elegans genes, pgp-5, 6, 7 and 8, are not found in H. contortus. Cel-pgp-3 and 4 as well as Cel-pgp-12, 13 and 14 represent gene duplication events corresponding to single genes in the parasite. Cel-pgp-9 corresponds to two paralogous copies in H. contortus, and in addition, two genes not present in C. elegans, related to pgp-3 and pgp-11, have been retained in H. contortus. These are named pgp-16 and pgp-17, respectively. Changes in the sequence or expression of pgp-1, pgp-2 and pgp-9 have been reported for IVM-resistant versus-susceptible isolates of H. contortus and in pgp-9 for resistant Telodorsagia circumcincta [50, 64, 65].
Protease vaccine candidates
H. contortus is a voracious blood-feeder, with even modest infections of 1,000 worms generating losses of up to 50 ml of blood per day . Cysteine, aspartic and metallo-proteases as well as aminopeptidases have been implicated in important aspects of parasite function, including hemoglobin digestion and anticoagulant activity, and these enzymes are also important vaccine candidates. Vaccination with gut extracts enriched for these activities can confer up to 75% reduction in worm burden and 90% reduction in egg output . Protection is thought to result from the ingestion of host antibodies by the parasite during blood-feeding, which bind to gut antigens and disrupt function. Female parasites appear to be more affected than males, increasing the relative impact on egg output. Development of a commercial vaccine, however, requires identification and expression of specific proteases that, either singly or combined, induce protective immunity. Comparative genomics and transcriptome data can aid target selection by identifying potential functionally important proteases and those enriched in the gut of blood-feeding L4 and adult stages, suggesting a role in blood-feeding as well as accessibility to host antibodies.
Cathepsin B protease (cbl) genes are part of an ordered hemoglobin degradation pathway, functioning after aspartic proteases (APRs) and upstream of metalloproteases (MEPs) and aminopeptidases [68, 69]. Cathepsin B diversity may therefore be key in generating an array of substrates from ingested nutrients for efficient cleavage by downstream proteases and may be involved in the high blood digestion capacity of H. contortus. Indeed, H. contortus has a higher copy number (63 genes) of cathepsin B protease genes than related free-living nematodes, representing 80% of all cathepsin cysteine protease genes in the genome (Figure 5a). This large expansion of the cathepsin B family has resulted in H. contortus showing a greater diversity of cbl genes than known in any other parasitic nematode. Uniquely, members of each cbl type are arranged in tandem in the H. contortus genome (Figure 5b) and have the same genomic structure, suggesting that diversity has arisen through recent gene duplication. Reflecting this, most CBLs encoded in the H. contortus genome form large monophyletic groups distinct from C. elegans , hookworm  or other strongylid CBLs, suggesting that duplication and divergence has occurred separately following speciation. It is possible that gene amplification has occurred as a mechanism of increasing overall CBL expression associated with the need for H. contortus to digest the huge quantities of host blood it takes in during feeding. The cbl genes show increased expression in L4 and adult stages and many are significantly enriched in gut tissue, identifying these as potentially important control targets. These include both novel and previously identified cbl genes (AC-4, gcp-7 and hmcp-2) [72–74], while other cbl transcripts are significantly up-regulated in the L4 stage and in adult male worms, but not in the gut, suggesting a role in development and reproduction rather than feeding. The CBLs identified here contain a putative amino-terminal signal peptide and several have been identified in adult worm excretory-secretory products . It has also been proposed that sequence variation of Hc-CBLs may confer antigenic diversity , and presentation to the host immune system through secretion may therefore drive the maintenance of the diversity of Hc-cbl genes. Three-dimensional modeling of the CBL repertoire and epitope mapping will clarify this issue.
Other cathepsin cysteine protease genes are not expanded - for example, a cpr-6-like single copy gene is highly conserved in a number of parasitic and free-living nematodes, suggesting a house-keeping role, and there is no expansion found for cathepsin L, F or Z genes in H. contortus. Furthermore, there is less expansion of genes encoding other vaccine candidates - H-gal-GP composed predominantly of pepsinogen-like aspartic and metallo proteases , and H11 aminopeptidases . These are integral gut membrane proteins, considered hidden from the immune system during natural infection , and lower diversity should be advantageous in vaccination studies.
Significant expansion is, however, seen in the cathepsin aspartic protease family (Figure S6 in Additional file 1). Importantly, the genome data identify a novel, single-copy apr gene with 84% amino acid identity to APR-1 of the dog hookworm Ancylostoma caninum and that is significantly enriched in gut tissue. Vaccination with recombinant Ac-APR-1 significantly reduced fecal egg counts, worm burdens and anemia , warranting investigation of the protective potential of the Haemonchus APR-1-related protease. Other novel Hc-apr genes group phylogenetically with C. elegans asp genes (Figure S6 in Additional file 1) and are increased in the environmental L1 stage, indicating a developmental role, consistent with C. elegans data [80, 81] and ruling these out as potential vaccine targets.
H. contortus is the one of the most intensively researched parasitic nematode species and is the most established parasite species used in drug discovery, drug mode-of-action and drug resistance research. It is the first strongylid nematode to have its genome sequenced and the analysis presented here provides a first genomic insight into the biology of a gastro-intestinal strongylid and blood feeding parasitic nematode. It confirms the close relationship between H. contortus and C. elegans and provides a platform from which to apply the C. elegans biological knowledge to investigate the biology of this parasite for both basic and applied research. It also highlights specific differences between the species that will be important to inform researchers of those aspects of C. elegans biology that cannot be extrapolated. The genome sequence, gene models, transcriptome datasets and bioinformatic analyses provide a wealth of information on potential new drug and vaccine targets. They will also be an invaluable resource for the application of post-genomic technologies to this and other related strongylid parasites.
Materials and methods
Parasite material production and quality assurance
The MHco3(ISE) line is a H. contortus strain, susceptible to all the major anthelmintic classes, that has been passaged in the laboratory by serial infection for many years. It has been extensively phenotypically and genetically characterized previously . The inbred MHco3(ISE).N1 strain was derived by a genetically validated single pair mating of an adult male and an adult female MHco3(ISE) worm using a method of direct transplantation into the abomasum of a recipient parasite-free sheep (described in ). The inbred MHco3(ISE).N1 strain was subsequently passaged by oral experimental infection of parasite-free sheep and parasite material was harvested using standard methods (see Additional methods in Additional file 1 for details). All parasite material used in this study was derived from the inbred H. contortus MHco3(ISE).N1 strain except for the female gut transcriptomic data, which were generated from the MHco3(ISE)strain. The MHco3(ISE) and MHco3(ISE).N1 strains were routinely monitored for genetic integrity at each passage, as was all parasite material derived from the strains, using a standard panel of microsatellite markers and a well established 'genetic fingerprinting' methodology [8, 82]. Both the MHco3(ISE) and the MHco3(ISE).N1 are viably cryopreserved and available on request.
Genome sequencing and assembly
We assembled a draft sequence of the H. contortus genome based on data from a mixture of sequencing technologies, including 21 × coverage of paired-end and shotgun reads from a Roche454SLX platform and approximately 163 × coverage of long- and short-insert paired-end libraries from an Illumina HiSeq (coverage based on a genome size of 370 Mb). Genomic and transcriptomic sequence data were generated using largely standard molecular biology methods (Additional methods in Additional file 1), except that whole-genome amplified material was used to generate sufficient material for a large-insert Illumina library from a single male worm, using a modified protocol. The genomic reads from each technology were initially assembled independently using assembly algorithms most suited to the typical coverage and read length of each, and scaffolds from each were then merged to form an initial set of scaffolds that were then improved by automated gap filling, followed by breaking and re-scaffolding using the paired-end data from both sequencing platforms.
Protein coding gene prediction and functional annotation
After quality control and end-trimming, the transcriptome reads were mapped against the reference genome using TopHat software, v 2.0.6 (--mate-std-dev 50 -a 6 -i 10 -I 20000 --microexon-search --min-segment-intron 10 --max-segment-intron 50000) . A reference dataset of 399 H. contortus protein encoding genes was manually curated from predictions of highly conserved genes using CEGMA (version 2.4)  and RNA-seq mapping. Of those, 347 were used to train Augustus  and 52 were used to independently evaluate the accuracy of the predictions. Final gene prediction (v2.0) was performed by Augustus using H. contortus specific parameters and RNA-seq, EST and polyA mappings as evidence hints generated by TopHat2 and PASA , respectively. Gene prediction accuracy was computed at the level of nucleotides, exons and complete genes on 52 manual curated gene models as described previously  and shown in Table S7 in Additional file 1). Functional annotation information was obtained from the interpro databases using interproscan v4.5 , GO  terms were annotated via interpro2GO and from the curated C. elegans annotation in Wormbase (release 235)  by assigning all GO terms shared by all C. elegans genes in a gene family to the H. contortus members of that family. Further functional insight was obtained by BLAST searches for similar genes in the GenBank nr database, and putative signal peptides were identified by SignalP . To investigate H. contortus metabolism, a total of 828 ECs, covering 2,853 proteins, were assigned using KAAS , DETECT  and EFICAz . Of these, 563 ECs covering 1,246 proteins were assigned to a metabolic pathway; the others are non-metabolic enzymes. Similar annotation efforts were carried out on Caenorhabditis species and P. pacificus.
Gene expression and Gene Ontology analysis
The numbers of RNA-seq reads per gene model were counted using custom-made scripts making use of BEDtools and a gff file of the genome annotation, and based on the TopHat mapping described above. Analysis of gene expression was performed using the DESeq (v 1.8.1) package for Bioconductor . Read coverage was normalized to estimate the effective library sizes for each library and negative binomial tests performed between pairs of sample triplicates, using dispersion estimates from the default approach, to obtain P-values for differential expression of each gene adjusted for false discovery rate using the Benjamini-Hochberg procedure for multi-testing . Only genes with adjusted P-values ≤1e-5 were retained. GO terms enriched in the set of differentially expressed genes in each comparison were identified using the 'weight01' algorithm of the TopGO (v 2.8.0) package for Bioconductor . Only GO terms with P < 0.01 were considered for more detailed analysis. Expression data were drawn using Circos-0.62. All differentially expressed genes were clustered using MBCluster.seq with 50 clusters, and clusters then ordered based on which stage had the highest mean expression in that cluster.
All sequences described in this paper have been submitted to the GenBank database, project ID PRJEB506. Sequence data are available at  and annotation has been submitted to GenBank and Wormbase. ENA accession numbers for all genomic and RNA-seq reads are listed in Tables S3, S13 and S14 in Additional file 1. The H. contortus genome assembly and functional annotation are available at .
cathepsin B protease
expressed sequence tag
long interspersed element
pentameric ligand-gated ion-channel
Kaplan RM: Drug resistance in nematodes of veterinary importance: a status report. Trends Parasitol. 2004, 20: 477-481. 10.1016/j.pt.2004.08.001.
Kaplan RM, Vidyashankar AN: An inconvenient truth: global worming and anthelmintic resistance. Vet Parasitol. 2012, 186: 70-78. 10.1016/j.vetpar.2011.11.048.
Prichard RK, Basanez MG, Boatin BA, McCarthy JS, Garcia HH, Yang GJ, Sripa B, Lustigman S: A research agenda for helminth diseases of humans: intervention for control and elimination. PLoS Negl Trop Dis. 2012, 6: e1549-10.1371/journal.pntd.0001549.
Gilleard JS: The use of Caenorhabditis elegans in parasitic nematode research. Parasitology. 2004, 128 (Suppl 1): S49-S70.
Geary TG, Thompson DP: Caenorhabditis elegans: how good a model for veterinary parasites?. Vet Parasitol. 2001, 101: 371-386. 10.1016/S0304-4017(01)00562-3.
Hashmi S, Tawe W, Lustigman S: Caenorhabditis elegans and the study of gene function in parasites. Trends Parasitol. 2001, 17: 387-393. 10.1016/S1471-4922(01)01986-9.
Gilleard JS: Understanding anthelmintic resistance: the need for genomics and genetics. Int J Parasitol. 2006, 36: 1227-1239. 10.1016/j.ijpara.2006.06.010.
Redman E, Sargison N, Whitelaw F, Jackson F, Morrison A, Bartley DJ, Gilleard JS: Introgression of ivermectin resistance genes into a susceptible Haemonchus contortus strain by multiple backcrossing. PLoS Pathog. 2012, 8: e1002534-10.1371/journal.ppat.1002534.
Kaminsky R, Ducray P, Jung M, Clover R, Rufener L, Bouvier J, Weber SS, Wenger A, Wieland-Berghausen S, Goebel T, Gauvry N, Pautrat F, Skripsky T, Froelich O, Komoin-Oka C, Westlund B, Sluder A, Mäser P: A new class of anthelmintics effective against drug-resistant nematodes. Nature. 2008, 452: 176-180. 10.1038/nature06722.
Taylor CM, Martin J, Rao RU, Powell K, Abubucker S, Mitreva M: Using existing drugs as leads for broad spectrum anthelmintics targeting protein kinases. PLoS Pathog. 2013, 9: e1003149-10.1371/journal.ppat.1003149.
Knox DP, Redmond DL, Newlands GF, Skuce PJ, Pettit D, Smith WD: The nature and prospects for gut membrane proteins as vaccine candidates for Haemonchus contortus and other ruminant trichostrongyloids. Int J Parasitol. 2003, 33: 1129-1137. 10.1016/S0020-7519(03)00167-X.
LeJambre LF, Windon RG, Smith WD: Vaccination against Haemonchus contortus: performance of native parasite gut membrane glycoproteins in Merino lambs grazing contaminated pasture. Vet Parasitol. 2008, 153: 302-312. 10.1016/j.vetpar.2008.01.032.
Bethony JM, Loukas A, Hotez PJ, Knox DP: Vaccines against blood-feeding nematodes of humans and livestock. Parasitology. 2006, 133 (Suppl): S63-S79.
Laing R, Hunt M, Protasio AV, Saunders G, Mungall K, Laing S, Jackson F, Quail M, Beech R, Berriman M, Gilleard JS: Annotation of two large contiguous regions from the Haemonchus contortus genome using RNA-seq and comparative analysis with Caenorhabditis elegans. PLoS One. 2011, 6: e23216-10.1371/journal.pone.0023216.
Blaxter ML, De Ley P, Garey JR, Liu LX, Scheldeman P, Vierstraete A, Vanfleteren JR, Mackey LY, Dorris M, Frisse LM, Vida JT, Thomas WK: A molecular evolutionary framework for the phylum Nematoda. Nature. 1998, 392: 71-75. 10.1038/32160.
Taylor MA, Coop RL, Wall RL: Veterinary Parasitology. 2007, Oxford: Blackwell Publishing
Prichard R: Genetic variability following selection of Haemonchus contortus with anthelmintics. Trends Parasitol. 2001, 17: 445-453. 10.1016/S1471-4922(01)01983-3.
Waller PJ, Rudby-Martin L, Ljungstrom BL, Rydzik A: The epidemiology of abomasal nematodes of sheep in Sweden, with particular reference to over-winter survival strategies. Vet Parasitol. 2004, 122: 207-220. 10.1016/j.vetpar.2004.04.007.
Hoberg EP, Lichtenfels JR, Gibbons L: Phlyogeny for species of Haemonchus (Nematoda: Trichostrongyloidea): Considerations of their evolutionary history and global biogeography among Camelidae and Pecora (Artiodactyla). J Parasitol. 2004, 90: 1085-1102. 10.1645/GE-3309.
Blouin MS, Yowell CA, Courtney CH, Dame JB: Host movement and the genetic structure of populations of parasitic nematodes. Genetics. 1995, 141: 1007-1014.
Troell K, Engstrom A, Morrison DA, Mattsson JG, Hoglund J: Global patterns reveal strong population structure in Haemonchus contortus, a nematode parasite of domesticated ruminants. Int J Parasitol. 2006, 36: 1305-1316. 10.1016/j.ijpara.2006.06.015.
Gilleard JS, Beech RN: Population genetics of anthelmintic resistance in parasitic nematodes. Parasitology. 2007, 134: 1133-1147. 10.1017/S0031182007000066.
Brasil BSAF, Nunes RL, Bastianetto E, Drummond MG, Carvalho DC, Leite RC, Molento MB, Oliveira DAA: Genetic diversity patterns of Haemonchus placei and Haemonchus contortus populations isolated from domestic ruminants in Brazil. Int J Parasitol. 2012, 42: 469-479. 10.1016/j.ijpara.2012.03.003.
Leroy S, Duperray C, Morand S: Flow cytometry for parasite nematode genome size measurement. Mol Biochem Parasitol. 2003, 128: 91-93. 10.1016/S0166-6851(03)00023-9.
Parra G, Bradnam K, Korf I: CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics. 2007, 23: 1061-1067. 10.1093/bioinformatics/btm071.
Dieterich C, Clifton SW, Schuster LN, Chinwalla A, Delehaunty K, Dinkelacker I, Fulton L, Fulton R, Godfrey J, Minx P, Mitreva M, Roeseler W, Tian H, Witte H, Yang SP, Wilson RK, Sommer RJ: The Pristionchus pacificus genome provides a unique perspective on nematode lifestyle and parasitism. Nat Genet. 2008, 40: 1193-1198. 10.1038/ng.227.
Ghedin E, Wang S, Spiro D, Caler E, Zhao Q, Crabtree J, Allen JE, Delcher AL, Guiliano DB, Miranda-Saavedra D, Angiuoli SV, Creasy T, Amedeo P, Haas B, El-Sayed NM, Wortman JR, Feldblyum T, Tallon L, Schatz M, Shumway M, Koo H, Salzberg SL, Schobel S, Pertea M, Pop M, White O, Barton GJ, Carlow CK, Crawford MJ, Daub J, et al: Draft genome of the filarial nematode parasite Brugia malayi. Science. 2007, 317: 1756-1760. 10.1126/science.1145406.
Abad P, Gouzy J, Aury JM, Castagnone-Sereno P, Danchin EG, Deleury E, Perfus-Barbeoch L, Anthouard V, Artiguenave F, Blok VC, Caillaud MC, Coutinho PM, Dasilva C, De Luca F, Deau F, Esquibet M, Flutre T, Goldstone JV, Hamamouch N, Hewezi T, Jaillon O, Jubin C, Leonetti P, Magliano M, Maier TR, Markov GV, McVeigh P, Pesole G, Poulain J, Robinson-Rechavi M, et al: Genome sequence of the metazoan plant-parasitic nematode Meloidogyne incognita. Nat Biotechnol. 2008, 26: 909-915. 10.1038/nbt.1482.
Opperman CH, Bird DM, Williamson VM, Rokhsar DS, Burke M, Cohn J, Cromer J, Diener S, Gajan J, Graham S, Houfek TD, Liu Q, Mitros T, Schaff J, Schaffer R, Scholl E, Sosinski BR, Thomas VP, Windham E: Sequence and genetic map of Meloidogyne hapla: A compact nematode genome for plant parasitism. Proc Natl Acad Sci USA. 2008, 105: 14802-14807. 10.1073/pnas.0805946105.
Kikuchi T, Cotton JA, Dalzell JJ, Hasegawa K, Kanzaki N, McVeigh P, Takanashi T, Tsai IJ, Assefa SA, Cock PJ, Otto TD, Hunt M, Reid AJ, Sanchez-Flores A, Tsuchihara K, Yokoi T, Larsson MC, Miwa J, Maule AG, Sahashi N, Jones JT, Berriman M: Genomic insights into the origin of parasitism in the emerging plant pathogen Bursaphelenchus xylophilus. PLoS Pathog. 2011, 7: e1002219-10.1371/journal.ppat.1002219.
Guiliano DB, Hall N, Jones SJ, Clark LN, Corton CH, Barrell BG, Blaxter ML: Conservation of long-range synteny and microsynteny between the genomes of two distantly related nematodes. Genome Biol. 2002, 3: RESEARCH0057-
Hoekstra R, Criado-Fornelio A, Fakkeldij J, Bergman J, Roos MH: Microsatellites of the parasitic nematode Haemonchus contortus: polymorphism and linkage with a direct repeat. Mol Biochem Parasitol. 1997, 89: 97-107. 10.1016/S0166-6851(97)00108-4.
Johnson PCD, Webster LMI, Adam A, Buckland R, Dawson DA, Keller LF: Abundant variation in microsatellites of the parasitic nematode Trichostrongylus tenuis and linkage to a tandem repeat. Mol Biochem Parasitol. 2006, 148: 210-218. 10.1016/j.molbiopara.2006.04.011.
Grillo V, Jackson F, Gilleard JS: Characterisation of Teladorsagia circumcincta microsatellites and their development as population genetic markers. Mol Biochem Parasitol. 2006, 148: 181-189. 10.1016/j.molbiopara.2006.03.014.
Allen MA, Hillier LW, Waterston RH, Blumenthal T: A global analysis of C. elegans trans-splicing. Genome Res. 2011, 21: 255-264. 10.1101/gr.113811.110.
Rufener L, Maser P, Roditi I, Kaminsky R: Haemonchus contortus Acetylcholine Receptors of the DEG-3 Subfamily and Their Role in Sensitivity to Monepantel. PLoS Pathog. 2009, 5: e1000380-10.1371/journal.ppat.1000380.
Kotze AC: Cytochrome P450 monooxygenase activity in Haemonchus contortus (Nematoda). Int J Parasitol. 1997, 27: 33-40. 10.1016/S0020-7519(96)00161-0.
Laing ST, Ivens A, Butler V, Ravikumar SP, Laing R, Woods DJ, Gilleard JS: The Transcriptional Response of Caenorhabditis elegans to Ivermectin Exposure Identifies Novel Genes Involved in the Response to Reduced Food Intake. PLoS ONE. 2012, 7: e31367-10.1371/journal.pone.0031367.
Fuchs S, Bundy J, Davies S, Viney J, Swire J, Leroi A: A metabolic signature of long life in Caenorhabditis elegans. BMC Biol. 2010, 8: 14-10.1186/1741-7007-8-14.
Weinstein PP: Vitamin B12 changes in Nippostrongylus brasiliensis in its free-living and parasitic habitats with biochemical implications. J Parasitol. 1996, 82: 1-6.
Yin Y, Martin J, Abubucker S, Scott AL, McCarter JP, Wilson RK, Jasmer DP, Mitreva M: Intestinal transcriptomes of nematodes: comparison of the parasites Ascaris suum and Haemonchus contortus with the free-living Caenorhabditis elegans. PLoS Negl Trop Dis. 2008, 2: e269-10.1371/journal.pntd.0000269.
Chakrapani BP, Kumar S, Subramaniam JR: Development and evaluation of an in vivo assay in Caenorhabditis elegans for screening of compounds for their effect on cytochrome P450 expression. J Biosci. 2008, 33: 269-277. 10.1007/s12038-008-0044-5.
Lai CQ, Parnell LD, Lyman RF, Ordovas JM, Mackay TFC: Candidate genes affecting Drosophila life span identified by integrating microarray gene expression analysis and QTL mapping. Mech Ageing Dev. 2007, 128: 237-249. 10.1016/j.mad.2006.12.003.
Yeh I, Hanekamp T, Tsoka S, Karp PD, Altman RB: Computational analysis of Plasmodium falciparum metabolism: organizing genomic information to facilitate drug discovery. Genome Res. 2004, 14: 917-924. 10.1101/gr.2050304.
Kushwaha S, Singh PK, Rana AK, Misra-Bhattacharya S: Cloning, expression, purification and kinetics of trehalose-6-phosphate phosphatase of filarial parasite Brugia malayi. Acta Tropica. 2011, 119: 151-159. 10.1016/j.actatropica.2011.05.008.
Kantardjieff KA, Kim CY, Naranjo C, Waldo GS, Lekin T, Segelke BW, Zemla A, Park MS, Thomas C: Mycobacterium tuberculosis RmlC epimerase (Rv3465): a promising drug-target structure in the rhamnose pathway. Acta Crystallogr D Biol Crystallogr. 2004, 60: 895-902. 10.1107/S0907444904005323.
Saunders GI, Wasmuth JD, Beech R, Laing R, Hunt M, Naghra H, Cotton JA, Berriman M, Britton C, Gilleard JS: Characterization and comparative analysis of the complete Haemonchus contortus β-tubulin gene family and implications for benzimidazole resistance in strongylid nematodes. Int J Parasitol. 2013, 43: 465-475. 10.1016/j.ijpara.2012.12.011.
Forrester SG, Prichard RK, Dent JA, Beech RN: Haemonchus contortus: HcGluCla expressed in Xenopus oocytes forms a glutamate-gated ion channel that is activated by ibotenate and the antiparasitic drug ivermectin. Mol Biochem Parasitol. 2003, 129: 115-121. 10.1016/S0166-6851(03)00102-6.
Glendinning SK, Buckingham SD, Sattelle DB, Wonnacott S, Wolstenholme AJ: Glutamate-gated chloride channels of Haemonchus contortus restore drug sensitivity to ivermectin resistant Caenorhabditis elegans. PLoS ONE. 2011, 6: e22390-10.1371/journal.pone.0022390.
Blackhall WJ, Pouliot JF, Prichard RK, Beech RN: Haemonchus contortus: selection at a glutamate-gated chloride channel gene in ivermectin- and moxidectin-selected strains. Exp Parasitol. 1998, 90: 42-48. 10.1006/expr.1998.4316.
Beech R, Levitt N, Cambos M, Zhou S, Forrester SG: Association of ion-channel genotype and macrocyclic lactone sensitivity traits in Haemonchus contortus. Mol Biochem Parasitol. 2010, 171: 74-80. 10.1016/j.molbiopara.2010.02.004.
Putrenko I, Zakikhani M, Dent JA: A family of acetylcholine-gated chloride channel subunits in Caenorhabditis elegans. J Biol Chem. 2005, 280: 6392-6398. 10.1074/jbc.M412644200.
De Graef J, Claerebout E, Vercruysse J, Wolstenholme A, Mitreva M, Geldhof P: Gene expression and mutation analysis of the parasite-specific glutamate-gated chloride channel GLC-6 in resistant Cooperia oncophora isolates following in vivoexposure to macrocyclic lactones. Int J Parasitol.
Jones A, Davis P, Hodgkin J, Sattelle D: The nicotinic acetylcholine receptor gene family of the nematode Caenorhabditis elegans: an update on nomenclature. Invert Neurosci. 2007, 7: 129-131. 10.1007/s10158-007-0049-z.
Liu Y, LeBeouf B, Guo X, Correa PA, Gualberto DG, Lints R, Garcia LR: A cholinergic-regulated circuit coordinates the maintenance and bi-stable states of a sensory-motor behavior during Caenorhabditis elegans male copulation. PLoS Genet. 2011, 7: e1001326-10.1371/journal.pgen.1001326.
Boulin T, Gielen M, Richmond JE, Williams DC, Paoletti P, Bessereau JL: Eight genes are required for functional reconstitution of the Caenorhabditis elegans levamisole-sensitive acetylcholine receptor. Proc Natl Acad Sci USA. 2008, 105: 18590-18595. 10.1073/pnas.0806933105.
Neveu C, Charvet CL, Fauvin A, Cortet J, Beech RN, Cabaret J: Genetic diversity of levamisole receptor subunits in parasitic nematode species and abbreviated transcripts associated with resistance. Pharmacogenet Genomics. 2010, 20: 414-425.
Boulin T, Fauvin A, Charvet CL, Cortet J, Cabaret J, Bessereau JL, Neveu C: Functional reconstitution of Haemonchus contortus acetylcholine receptors in Xenopus oocytes provides mechanistic insights into levamisole resistance. Br J Pharmacol. 2011, 164: 1421-1432. 10.1111/j.1476-5381.2011.01420.x.
Lindblom TH, Dodd AK: Xenobiotic detoxification in the nematode Caenorhabditis elegans. J Exp Zool A Comp Exp Biol. 2006, 305: 720-730.
Kotze AC, Dobson RJ, Chandler D: Synergism of rotenone by piperonyl butoxide in Haemonchus contortus and Trichostrongylus colubriformis in vitro: potential for drug-synergism through inhibition of nematode oxidative detoxification pathways. Vet Parasitol. 2006, 136: 275-282. 10.1016/j.vetpar.2005.11.001.
Jones BC, Middleton DS, Youdim K: Cytochrome P450 Metabolism and Inhibition: Analysis for Drug Discovery. Progress in Medicinal Chemistry. Edited by: Lawton G, Witty DR. 2009, Amsterdam: Elsevier, 47: 239-263.
Sundaram P, Echalier B, Han W, Hull D, Timmons L: ATP-binding cassette transporters are required for efficient RNA interference in Caenorhabditis elegans. Mol Biol Cell. 2006, 17: 3678-3688. 10.1091/mbc.E06-03-0192.
Blackhall WJ, Prichard RK, Beech RN: P-glycoprotein selection in strains of Haemonchus contortus resistant to benzimidazoles. Vet Parasitol. 2008, 152: 101-107. 10.1016/j.vetpar.2007.12.001.
Williamson SM, Storey B, Howell S, Harper KM, Kaplan RM, Wolstenholme AJ: Candidate anthelmintic resistance-associated gene expression and sequence polymorphisms in a triple-resistant field isolate of Haemonchus contortus. Mol Biochem Parasitol. 2011, 180: 99-105. 10.1016/j.molbiopara.2011.09.003.
Dicker AJ, Nisbet AJ, Skuce PJ: Gene expression changes in a P-glycoprotein (Tci-pgp-9) putatively associated with ivermectin resistance in Teladorsagia circumcincta. Int J Parasitol. 2011, 41: 935-942. 10.1016/j.ijpara.2011.03.015.
Clark C, Kiesel G, Goby C: Measurements of blood loss caused by Haemonchus contortus infection in sheep. Am J Vet Res. 1962, 23: 977-980.
Newton SE: Progress on vaccination against Haemonchus contortus. Int J Parasitol. 1995, 25: 1281-1289. 10.1016/0020-7519(95)00065-A.
Gluzman IY, Francis SE, Oksman A, Smith CE, Duffin KL, Goldberg DE: Order and specificity of the Plasmodium falciparum hemoglobin degradation pathway. J Clin Invest. 1994, 93: 1602-1608. 10.1172/JCI117140.
Williamson AL, Lecchi P, Turk BE, Choe Y, Hotez PJ, McKerrow JH, Cantley LC, Sajid M, Craik CS, Loukas A: A multi-enzyme cascade of hemoglobin proteolysis in the intestine of blood-feeding hookworms. J Biol Chem. 2004, 279: 35950-35957. 10.1074/jbc.M405842200.
Larminie CG, Johnstone IL: Isolation and characterization of four developmentally regulated cathepsin B-like cysteine protease genes from the nematode Caenorhabditis elegans. DNA Cell Biol. 1996, 15: 75-82. 10.1089/dna.1996.15.75.
Ranjit N, Zhan B, Stenzel DJ, Mulvenna J, Fujiwara R, Hotez PJ, Loukas A: A family of cathepsin B cysteine proteases expressed in the gut of the human hookworm, Necator americanus. Mol Biochem Parasitol. 2008, 160: 90-99. 10.1016/j.molbiopara.2008.04.008.
Pratt D, Armes LG, Hageman R, Reynolds V, Boisvenue RJ, Cox GN: Cloning and sequence comparisons of four distinct cysteine proteases expressed by Haemonchus contortus adult worms. Mol Biochem Parasitol. 1992, 51: 209-218. 10.1016/0166-6851(92)90071-Q.
Rehman A, Jasmer DP: A tissue specific approach for analysis of membrane and secreted protein antigens from Haemonchus contortus gut and its application to diverse nematode species. Mol Biochem Parasitol. 1998, 97: 55-68. 10.1016/S0166-6851(98)00132-7.
Skuce PJ, Redmond DL, Liddel S, Stewart EM, Newlands GFJ, Smith W, Knox D: Molecular cloning and characterization of gut-derived cysteine proteinases associated with a host protective extract from Haemonchus contortus. Parasitology. 1999, 119: 405-412. 10.1017/S0031182099004813.
Yatsuda AP, Bakker N, Krijgsveld J, Knox DP, Heck AJR, de Vries E: Identification of Secreted Cysteine Proteases from the Parasitic Nematode Haemonchus contortus Detected by Biotinylated Inhibitors. Infect Immun. 2006, 74: 1989-1993. 10.1128/IAI.74.3.1989-1993.2006.
Jasmer DP, Mitreva MD, McCarter JP: mRNA sequences for Haemonchus contortus intestinal cathepsin B-like cysteine proteases display an extreme in abundance and diversity compared with other adult mammalian parasitic nematodes. Mol Biochem Parasitol. 2004, 137: 297-305. 10.1016/j.molbiopara.2004.06.010.
Smith SK, Smith WD: Immunisation of sheep with an integral membrane glycoprotein complex of Haemonchus contortus and with its major polypeptide components. Res Vet Sci. 1996, 60: 1-6. 10.1016/S0034-5288(96)90121-6.
Smith TS, Munn EA, Graham M, Tavernor AS, Greenwood CA: Purification and evaluation of the integral membrane protein H11 as a protective antigen against Haemonchus contortus. Int J Parasitol. 1993, 23: 271-280. 10.1016/0020-7519(93)90150-W.
Loukas A, Bethony JM, Mendez S, Fujiwara RT, Goud GN, Ranjit N, Zhan B, Jones K, Bottazzi ME, Hotez PJ: Vaccination with recombinant aspartic hemoglobinase reduces parasite load and blood loss after hookworm infection in dogs. PLoS Med. 2005, 2: e295-10.1371/journal.pmed.0020295.
Roy PJ, Stuart JM, Lund J, Kim SK: Chromosomal clustering of muscle-expressed genes in Caenorhabditis elegans. Nature. 2002, 418: 975-979.
Tcherepanova I, Bhattacharyya L, Rubin CS, Freedman JH: Aspartic proteases from the nematode Caenorhabditis elegans. Structural organization and developmental and cell-specific expression of asp-1. J Biol Chem. 2000, 275: 26359-26369. 10.1074/jbc.M000956200.
Redman E, Packard E, Grillo V, Smith J, Jackson F, Gilleard JS: Microsatellite analysis reveals marked genetic differentiation between Haemonchus contortus laboratory isolates and provides a rapid system of genetic fingerprinting. Int J Parasitol. 2008, 38: 111-122. 10.1016/j.ijpara.2007.06.008.
Sargison N: Development of genetic crossing methods to identify genes associated with macrocyclic lactone resistance in the sheep nematode parasite, Haemonchus contortus. PhD Thesis. 2009, University of Edinburgh
Trapnell C, Pachter L, Salzberg SL: TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009, 25: 1105-1111. 10.1093/bioinformatics/btp120.
Stanke M, Keller O, Gunduz I, Hayes A, Waack S, Morgenstern B: AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 2006, 34: W435-W439. 10.1093/nar/gkl200.
Haas B, Salzberg S, Zhu W, Pertea M, Allen J, Orvis J, White O, Buell CR, Wortman J: Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 2008, 9: R7-10.1186/gb-2008-9-1-r7.
Guigó R, Flicek P, Abril JF, Reymond A, Lagarde J, Denoeud F, Antonarakis S, Ashburner M, Bajic VB, Birney E, Castelo R, Eyras E, Ucla C, Gingeras TR, Harrow J, Hubbard T, Lewis SE, Reese MG: EGASP: the human ENCODE Genome Annotation Assessment Project. Genome Biol. 2006, 7: S2.1-31.
Zdobnov EM, Apweiler R: InterProScan - an integration platform for the signature-recognition methods in InterPro. Bioinformatics. 2001, 17: 847-848. 10.1093/bioinformatics/17.9.847.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene Ontology: tool for the unification of biology. Nat Genet. 2000, 25: 25-29. 10.1038/75556.
Yook K, Harris TW, Bieri T, Cabunoc A, Chan J, Chen WJ, Davis P, de la Cruz N, Duong A, Fang R, Ganesan U, Grove C, Howe K, Kadam S, Kishore R, Lee R, Li Y, Muller HM, Nakamura C, Nash B, Ozersky P, Paulini M, Raciti D, Rangarajan A, Schindelman G, Shi X, Schwarz EM, Ann Tuli M, Van Auken K, Wang D, et al: WormBase 2012: more genomes, more data, new website. Nucleic Acids Res. 2012, 40: D735-D741. 10.1093/nar/gkr954.
Petersen TN, Brunak S, von Heijne G, Nielsen H: SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods. 2011, 8: 785-786. 10.1038/nmeth.1701.
Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M: KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 2007, 35: W182-W185. 10.1093/nar/gkm321.
Hung SS, Wasmuth J, Sanford C, Parkinson J: DETECT - A Density Estimation Tool for Enzyme Classification and its application to Plasmodium falciparum. Bioinformatics. 2010, 26: 1690-1698. 10.1093/bioinformatics/btq266.
Arakaki A, Huang Y, Skolnick J: EFICAz2: enzyme function inference by a combined approach enhanced by machine learning. BMC Bioinformatics. 2009, 10: 107-10.1186/1471-2105-10-107.
Anders S, Huber W: Differential expression analysis for sequence count data. Genome Biol. 2010, 11: R106-10.1186/gb-2010-11-10-r106.
Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B (Methodological). 1995, 57: 289-300.
Alexa A, Rahnenfuhrer J: topGO: Enrichment analysis for Gene Ontology, R package version 2.8.0. [http://www.bioconductor.org/packages/2.12/bioc/html/topGO.html]
Haemonchus contortus strain ISE/inbred ISE, whole genome shotgun sequencing project. [http://www.ncbi.nlm.nih.gov/nuccore/CAVP000000000]
H. contortus genome assembly and functional annotation. [ftp://ftp.sanger.ac.uk/pub/pathogens/Haemonchus/contortus]
The Haemonchus contortus genome project is funded by the Wellcome Trust through their core support of the Wellcome Trust Sanger Institute (grant 098051) and a Wellcome Trust Project Grant to JSG (grant 067811). JSG also acknowledges funding from the Canadian Institutes of Health Research (CIHR) 230927. JSG and JW acknowledge support from the NSERC-CREATE Training Program in Host-Parasite Interaction at the University of Calgary (grant number 403888-2012). RL is funded through the Scottish Government by the Strategic Partnership for Animal Science Excellence (SPASE). SJ is funded through a BBSRC/KTN/Pfizer CASE studentship. We thank Nathalie Smerdon, Andrew Knights and David Willey for technical support in generating RNA-seq libraries; Coline Griffiths, Matthew Jones, Lesley Shirley, Naomi Park and Mike Quail for Illumina genomic libraries; and Richard Rance for 454 genomic libraries. Robert Davies and Quan Lin arranged for data release to the SRA. We thank Doug Jasmer for the H. contortus gut dissection protocol.
The authors declare that they have no competing interests.
MB, JAC, JSG and JDW designed research, which was coordinated by HB, MB, JSG and NH; DJB, ED, RL, AAM and NS maintained the parasite life cycle, performed the inbreeding crosses and contributed biological material; RL, ER and GIS prepared genomic DNA, RNA and transcriptome sequencing libraries; CB, RNB, DC, JAC, AG, MH, SLJ, IK, TK, KL, RL, AM, AJR, IJT, AW and JDW analyzed data; JAC, JSG and RL drafted the complete manuscript, with sections of text contributed by CB, RNB, DC, AG, SLJ, IK, TK, KL, AM, AJR, IJT, AW and JDW. All authors read and approved the final manuscript.
Electronic supplementary material
About this article
Cite this article
Laing, R., Kikuchi, T., Martinelli, A. et al. The genome and transcriptome of Haemonchus contortus, a key model parasite for drug and vaccine discovery. Genome Biol 14, R88 (2013). https://doi.org/10.1186/gb-2013-14-8-r88
- Parasitic Nematode
- Mass Drug Administration
- Macrocyclic Lactone
- Anthelmintic Resistance