The genome of the yellow potato cyst nematode, Globodera rostochiensis, reveals insights into the basis of parasitism and virulence
- Sebastian Eves-van den Akker†1Email author,
- Dominik R. Laetsch†2,
- Peter Thorpe†3,
- Catherine J. Lilley†4,
- Etienne G. J. Danchin5,
- Martine Da Rocha5,
- Corinne Rancurel5,
- Nancy E. Holroyd6,
- James A. Cotton6,
- Amir Szitenberg7,
- Eric Grenier8,
- Josselin Montarry8,
- Benjamin Mimee9,
- Marc-Olivier Duceppe9,
- Ian Boyes10,
- Jessica M. C. Marvin4,
- Laura M. Jones4,
- Hazijah B. Yusup4,
- Joël Lafond-Lapalme9,
- Magali Esquibet8,
- Michael Sabeh9,
- Michael Rott10,
- Hein Overmars11,
- Anna Finkers-Tomczak11,
- Geert Smant11,
- Georgios Koutsovoulos2,
- Vivian Blok3,
- Sophie Mantelin3,
- Peter J. A. Cock12,
- Wendy Phillips13,
- Bernard Henrissat14, 15,
- Peter E. Urwin4,
- Mark Blaxter2 and
- John T. Jones3, 16
© The Author(s). 2016
Received: 8 January 2016
Accepted: 12 May 2016
Published: 10 June 2016
The yellow potato cyst nematode, Globodera rostochiensis, is a devastating plant pathogen of global economic importance. This biotrophic parasite secretes effectors from pharyngeal glands, some of which were acquired by horizontal gene transfer, to manipulate host processes and promote parasitism. G. rostochiensis is classified into pathotypes with different plant resistance-breaking phenotypes.
We generate a high quality genome assembly for G. rostochiensis pathotype Ro1, identify putative effectors and horizontal gene transfer events, map gene expression through the life cycle focusing on key parasitic transitions and sequence the genomes of eight populations including four additional pathotypes to identify variation. Horizontal gene transfer contributes 3.5 % of the predicted genes, of which approximately 8.5 % are deployed as effectors. Over one-third of all effector genes are clustered in 21 putative ‘effector islands’ in the genome. We identify a dorsal gland promoter element motif (termed DOG Box) present upstream in representatives from 26 out of 28 dorsal gland effector families, and predict a putative effector superset associated with this motif. We validate gland cell expression in two novel genes by in situ hybridisation and catalogue dorsal gland promoter element-containing effectors from available cyst nematode genomes. Comparison of effector diversity between pathotypes highlights correlation with plant resistance-breaking.
These G. rostochiensis genome resources will facilitate major advances in understanding nematode plant-parasitism. Dorsal gland promoter element-containing effectors are at the front line of the evolutionary arms race between plant and parasite and the ability to predict gland cell expression a priori promises rapid advances in understanding their roles and mechanisms of action.
All major crops are thought to be infected by at least one species of plant-parasitic nematode, which causes damage valued at over $80 billion each year . The majority of these economic losses are attributable to the sedentary endoparasitic nematodes of the genus Meloidogyne (root-knot nematodes) and the genera Heterodera and Globodera (cyst nematodes). These sedentary endoparasites have complex biotrophic interactions with their hosts that include induction of specific feeding sites and long residence times within or on their host(s).
Potato cyst nematodes (PCN) are economically important pathogens of potato, with two major species: the white PCN Globodera pallida and the yellow PCN G. rostochiensis. These nematodes originate in South America [2, 3] and have subsequently been introduced into all major potato-growing regions of the world. Europe has acted as a secondary distribution hub for PCN; worldwide populations outside South America reflect subsequent introductions from Europe [4, 5]. Once established in a field, PCN are effectively impossible to eradicate in the short term and because they persist as long-lived cysts in soils, growing potatoes may not be economically viable for up to two decades. As a result, the US Department of Agriculture (USDA) has classified the yellow PCN as potentially more dangerous than any insect or disease affecting the potato industry (Aphis USDA 12/09/2015). Substantial effort is thus invested into keeping land free of PCN; both species are present on USDA and European Plant Protection Organisation quarantine organism lists.
PCN have been classified to pathotype based on their relative virulence on host plants harbouring different resistance loci. Most of the G. rostochiensis in UK potato-growing regions is of pathotype Ro1 and can be controlled by a single major resistance locus (H1). UK G. rostochiensis populations have therefore been suggested to originate from a genetically restricted introduction into Europe [6, 7]. Other pre-existing G. rostochiensis pathotypes (Ro 2, 3 and 5, but not 4) are able to overcome H1 resistance  and these pathotypes may be selected in response to widespread deployment of H1 plants. The corresponding nematode avirulence gene(s) has not been identified. Understanding the bases of virulence and resistance is of critical importance for agriculture.
G. rostochiensis has a complex life cycle that includes a highly resistant survival stage. Cysts, formed from the body wall of the adult female, encase hundreds of eggs that can lie dormant in the soil for over 20 years. Second stage juveniles (J2) within the eggs hatch in response to root diffusates from suitable host plants growing nearby. The J2 nematodes locate the root and migrate destructively through root tissues until they reach the inner cortex layers. Here the nematodes probe the cells, until a cell that does not respond adversely is detected . This initial syncytial cell is transformed into a large, multinucleate syncytium in response to proteins, peptides and hormones secreted by the nematode. Cell wall openings are formed between the initial syncytial cell and its neighbours, followed by fusion of the protoplasts. Syncytial cells become highly metabolically active and have enriched cytoplasm, enlarged nuclei and a greatly reduced central vacuole. Additional layers of cells are subsequently incorporated into the syncytium, which may eventually be composed of up to 300 cells . A prolonged biotrophic interaction is then maintained for a period of several weeks, while the nematode intermittently withdraws host cytoplasm to derive all food required for development to the adult stage. Each nematode can only induce a single feeding site that must therefore be maintained and protected from host defences.
The complex interactions of PCN with their hosts, like those of other plant parasites and pathogens, are mediated by effectors: secreted proteins that manipulate the host to the benefit of the pathogen. Most PCN effectors are produced in two sets of gland cells, dorsal and subventral , although some apoplastic effectors can be produced in the gland cells surrounding the main anterior sensory organs, the amphids . Effectors play important roles in all aspects of the parasite-host interaction: invasion and migration , suppression of host defences  and induction of the feeding site [14, 15]. The effector repertoire of plant-parasitic nematodes, including PCN, has been augmented by multiple Horizontal Gene Transfer (HGT) events, primarily of bacterial origin . HGT events are suspected to have played an important role in the emergence of plant parasitism in nematodes, enabling degradation of the plant cell wall, nutrient processing and manipulation of plant defences . Due to their importance in the life cycle of plant-parasitic nematodes, a great deal of effort has been put into various approaches for effector identification, including genomic and transcriptomic analyses , transcriptomic analyses of purified gland cells  and proteomic analyses . For some effectors, the likely biological functions, including host proteins targeted, have been identified [14, 20, 21].
Here, we report a high quality draft genome of a Ro1 isolate of G. rostochiensis, in combination with replicated transcriptome data from four key life stages, and genome sequence from eight populations across four pathotypes. We conducted whole genome comparisons between G. rostochiensis and related species [22–25] to explore the genomic and transcriptomic bases of pathogenicity. We discovered an unusually high frequency of well-supported non-canonical splice sites in G. rostochiensis, and found that this phenomenon was also present in related parasitic nematode species. Using an HGT analysis pipeline, we identified hundreds of genes in the G. rostochiensis genome that may have been acquired by gene transfers from non-metazoan origin, some of which likely play important roles in plant parasitism. We identified effectors in G. rostochiensis and found that they frequently grouped together into ‘effector islands’. To explore the genetic bases of virulence, we compared genetic variation in effectors and other genes between pathotypes and found that effectors, in general, contained more non-synonymous mutations. Using the identified G. rostochiensis effectors as a training set, we identified a putative ‘DOrsal Gland promoter element’, or DOG box, which was also associated with effectors in related species. We were able to use the DOG box to predict novel effectors, confirmed by in situ hybridisation, in G. rostochiensis, and to identify all putative DOG effectors from available cyst nematode genomes.
Results and discussion
The genome sequence of Globodera rostochiensis Ro1
Assembly size (Mb)
Scaffold N50 (bp)
Longest scaffold (bp)
Contig N50 (bp)
Longest contig (bp)
Span of N’s in assembly (bp)
CEGMA (Complete/Partial %)
Average CEG gene number (Complete/Partial)
Gene density (per Mb)
Proteins w/Start and Stop codon (n)
14,580 (88.81 %)
13,083 (91.43 %)
Non-canonical splice sites (%)
3.56 % (n = 4059)
3.46 % (n = 3835)
PfamA domains (cutoff 1e-5) (n)
Best BLAST hit to nematode proteins (1e-10) (n)
Collaborative manual gene refinement reveals a uniquely high frequency of non-canonical splice sites in Globodera
The frequency of GC/AG introns in G. rostochiensis was 3.46 %, the highest reported for any nematode. In addition to the GT or GC dinucleotide, 5′ donor sites are characterised by a nine-base consensus sequence, CAGG[T|C]AAGT (where the initial CAG is in the preceding exon ). Although variations in the 5′ donor site sequence were found, G. rostochiensis GC/AG introns conformed equally well, if not better, to this consensus as did GT/AG introns (Fig. 1a and b). We derived a revised 5′ donor consensus for the predicted introns for both GC and GT 5′ sites and found both intron classes to use AAGG[T|C]AAGT (where the first AAG is in the preceding exon). We identified a similarly high frequency of GC/AG introns in G. pallida (3.53 %), and Rotylenchulus reniformis (2.36 %) (PRJNA214681, Showmaker et al., unpublished), a sedentary endoparasite of multiple crop plants that is in a sister group to Globodera in the Tylenchoidea (Additional file 4: Figure S2). While GC/AG introns were apparently absent from the Meloidogyne species gene predictions, we suspect this may be due to restrictive settings during their annotation, as they are present in most species (Additional file 4: Figure S2). The elevated proportion of non-canonical GC/AG introns appear to be restricted to the Heteroderidae.
In species pairs with a low GC/AG intron frequency, such as Caenorhabditis elegans and the closely related C. briggsae, there is no obvious conservation of non-canonical splice site usage in their orthologous genes . However, for genes in G. rostochiensis with at least one GC/AG intron, ~72 % of the corresponding one-to-one orthologues in G. pallida also contained at least one GC/AG intron (n = 2148), compared to an average of 10.8 % for identically sized subsets of non-GC/AG intron containing G. rostochiensis genes (1000 iterations, stdev = 0.8 %). Within those genes, orthologous introns also tended to have conserved non-canonical splice sites. For 30 % of the G. rostochiensis GC/AG introns in one-to-one orthologues, the corresponding G. pallida intron also used GC/AG. GC/AG introns had a biased distribution within genes in both species, tending to be less common in introns in the 5′ portion of genes compared to introns in the 3′ portion (Fig. 1c).
Life stage specific transcriptome
Focusing on these four categories of orthologous clusters (all nematodes, all plant parasites, Meloidogyne plus Globodera and Globodera) we correlated the orthologue definition and transcriptional clustering data to explore possible functional roles of genes unique to subsets of the taxa analysed. Only 34 % of genes in clusters with members from all five nematodes, or clusters lacking only C. elegans, were differentially expressed, compared to 47 % differentially expressed overall (Fig. 3b), congruent with the assumption that these families are likely to include loci with roles in core physiology. Interestingly however, genes specifically upregulated in eggs contain a higher relative abundance of genes in orthologous clusters common to all plant parasites yet absent in C. elegans, compared to other orthologous gene categories (Fig. 3c).
Only 43 % of genes in orthologous clusters private to Meloidogyne and Globodera were differentially expressed. In contrast, of the genes in orthologous clusters only present in the two Globodera species, 60 % were differentially regulated, suggesting that these genes play a dynamic role in parasite development. Furthermore, over two-fifths of genes (42 %) that are differentially regulated in the infective juvenile stage of G. rostochiensis are those that are unique to the Globodera. Expression super-clusters 13 and 24, which describe those genes specifically upregulated or downregulated in the infective juvenile stage, respectively, contain a higher relative abundance of genes in orthologous clusters unique to Globodera species compared to other orthologous gene categories (Fig. 3c).
G. rostochiensis proteins in clusters private to Meloidogyne and Globodera were enriched for GO terms associated with gene silencing by miRNA (p <0.001, FDR 0.05), including nine proteins with highest similarity to worm-specific argonautes (WAGOs) in C. elegans. WAGOs are central to the RNAi pathway, being responsible for binding of small RNAs and mediation interactions with other proteins, and show an exceptional diversity within the phylum Nematoda. It has been suggested that the expansion of WAGOs within Nematoda is associated with extreme functional plasticity . Enrichment of WAGOs in the Meloidogyne and Globodera lineage, in combination with phylogenetically distinct clades of WAGOs in the Heteroderidae (Additional file 8: Figure S5), may indicate functional diversification following expansion. With the exception of GROS_g08854, all G. rostochiensis WAGOs that are differentially regulated are present in differential expression super-clusters 19, 20 and 21. All but one of these differentially expressed WAGOs are in Clades 1/2/4/5 and 10/11. Expression super-clusters 19, 20 and 21 are characterised by significant upregulation at 14 dpi, suggesting a dynamic role for WAGO clade 1/2/4/5 and 10/11 as G. rostochiensis transitions through parasitism.
Genes acquired by horizontal transfer have substantially contributed to the genome of G. rostochiensis
Horizontal gene transfer (HGT) events have played an important role in the emergence of plant-parasitism in nematodes . Numerous plant cell wall degrading enzymes, originally acquired from bacteria, are present in a wide range of tylenchomorph plant-parasitic nematode species, while diplogasterid nematodes have acquired functionally analogous genes from fungi . Using a systematic genome-wide approach, putative HGT events were identified based on the ratio of their sequence similarity to metazoan and non-metazoan sequences (Alien Index (AI), (Alienness [31–33])). Proteins with an AI >0 and more than 70 % identity to a non-metazoan sequence were considered putative contaminants (n = 18) and not included in these analyses.
Genes acquired via HGT in other cyst and root-knot nematodes also found in the genome of G. rostochiensis
G. rostochiensis genes
Cell wall degradation
Cellulase (glycosyl hydrolase family 5)
Softening of non-covalent bonds
Rare lipoprotein A (RlpA)-like double-psi beta-barrel
GH53 candidate Arabinogalactan endo-1,4-beta-galactosidase
Glycosyl hydrolase family 53
PL3 Pecate lyase
Plant defense manipulation
Glycosyl hydrolase family 18
Conversion of Chorismate into SA
Chorismate mutase type II
Conversion of Chorismate into SA
Candidate Cyanate lyase
Cyanate lyase C-terminal domain
Degradation of sucrose in glucose and fructose
Glycosyl hydrolases family 32 N-terminal domain
Vitamin B1 biosynthesis
Vitamin B1 biosynthesis
Thiamine monophosphate synthase/TENI
Vitamin B1 biosynthesis
Vitamin B1 salvage
Hydroxyethylthiazole kinase family
Vitamin B1 salvage
Vitamin B5 biosynthesis
Vitamin B6 biosynthesis
Candidate PolS Polyglutamate synthase
Bacterial capsule synthesis protein PGA_cap
Candidate GSI Glutamine synthase
Glutamine synthetase, catalytic domain
Feeding site induction
NodL - like
Bacterial transferase hexapeptide (six repeats)
Candidate L-threonine aldolase
Candidate Phosphorybosyl transferase
Phosphoribosyl transferase domain
Effectors in G. rostochiensis are sequence diverse between pathotypes
Effectors play central roles in both pathogenicity and virulence. The evolution of virulence on a particular host or variety can involve both gain and loss of effector function. Effectors may become specialised to function in a new host , while effector gene loss (or loss of expression) may allow a pathogen to evade recognition . We identified G. rostochiensis effectors by sequence similarity to effectors with experimentally verified gland cell expression in related taxa (Heterodera, Globodera). Many effectors in plant-parasitic nematodes are members of large multi-gene families, only a subset of which are effectors [10, 13, 37]. For example, in G. pallida there are ~300 SPRY (PF00622) domain containing proteins, fewer than 10 % of which are deployed as effectors . We therefore further filtered the potential effector set for the presence of a signal peptide for secretion and absence of a transmembrane domain to retain a high confidence list of 138 loci (Additional file 12: Table S4), including 101 genes similar to sequences expressed in the dorsal gland cell, 35 genes similar to those expressed in subventral gland cells and two genes similar to those expressed in the amphid sheath cells. The set included representatives of 37 different effector gene families (Additional file 12: Table S4). The vast majority of these effectors (116/138) exhibited expression profiles consistent with a role in parasitism (Additional file 12: Table S4), as would be expected for effectors. The temporal expression profiles of dorsal and subventral effectors were also consistent with the observed changes in activity of these glands throughout nematode development [38–41]. Most subventral gland effectors were primarily expressed at J2, while dorsal gland effectors were expressed at J2 and/or 14 dpi. Approximately 8.5 % of genes putatively acquired via HGT (8.47 % of those with AI >0 and 8.79 % of those with >30) are present on the stringent effector list; examples of which include putative pectate lyase, beta - 1,4 - endoglucanase and expansins.
Intra-species variation within the G. rostochiensis effectorome was examined by mapping whole genome resequencing data from nine populations across five pathotypes (Ro1, Ro2, Ro3, Ro4 and Ro5) to the reference assembly (pathotype Ro1). A total of 1,081,802 variants were detected, of which 794,505 were single nucleotide polymorphisms (SNPs) and 283,434 were insertions/deletions (indels) (Additional file 13: Table S5). Homozygous molecular markers descriptive of pathotypes 4 and 5 were identified (Additional file 14: Table S6). Interestingly, no variants were descriptive of all Ro1, Ro2 or Ro3 populations. Consistent with this, a maximum likelihood phylogeny constructed from 730,705 genome wide SNPs identifies two distinct groups of Ro1, together separate from Ro2, Ro3, Ro4 and Ro5 (Additional file 15: Figure S7A). The variation within pathotype Ro1 is as great as, if not greater than, the variation between Ro1 and the other pathotypes (Additional file 15: Figure S7B).
A total of 108 G. rostochiensis effectors (78 %) contained predicted modification of function (non-synonymous mutation) and/or predicted loss of function (frame shift indel or premature stop codon) in at least one pathotype. When accounting for gene length, G. rostochiensis effectors did not show significantly different numbers of predicted loss of function variants, but did contain significantly more total variants and more predicted modification of function variants per gene (n = 131, Mann–Whitney U test, p <0.028 and p = 0.003, respectively), compared to randomly selected non-effector genes. No individual variant was homozygous for the reference allele in all populations avirulent on H1 (Ro1 and Ro4) and homozygous for the variant allele in all populations virulent on H1 (Ro2, Ro3 and Ro5). This observation is consistent with the suggestion that distinct populations of Ro1 (Additional file 15: Figure S7 and ), in addition to Ro4, have evolved the same phenotype on H1 independently . Convergent evolution of the same phenotype by independent mutations may be explained by identifying genes which are always homozygous present for at least one predicted loss or change of function variant in populations virulent on H1 and always homozygous absent for any predicted loss or change of function variants in populations avirulent on H1. No such cases were identified from these population sequencing data. However, 190 genes were identified with at least one predicted modification or loss of function variant homozygous absent in all avirulent populations and homozygous or heterozygous present in virulent populations. When cross-referenced with the high-confidence effector list, this was reduced to two genes. Gene g13394 is similar to GLAND10 , which encodes a putative cellulose binding protein and originates from the subventral gland cell. Gene g12477 is similar to the 3H07_Ubiquitin_extension effectors that are expressed in the dorsal gland cell [44, 45], and are involved in host immune suppression . Forty-eight SNPs were identified in 19 non-effector genes with a difference in average allele frequencies of 70 % or higher between virulent and avirulent populations and a minimal difference in allele frequencies of 25 % between individual virulent and avirulent populations (Additional file 16: Table S7), of which four SNPs were located in g03129, a Ryanodine receptor-like containing three SPRY domains, and seven in g09064, a molecular chaperone from the Hsp90 family.
Effectors in the G. rostochiensis genome are compartmentalised into islands
Identifying features enriched within effector islands in G. rostochiensis remains challenging; there is no evidence for more AT-rich sequences, contig break points, polymorphisms or microsatellite repeats within islands, flanking islands or scaffolds containing islands (Fig. 5c). However, despite no difference in transposon density within islands (2.7/10 kb ±2.4), in the remainder of scaffolds containing islands (2.4/10 kb ±1.7), in entire scaffolds containing islands (2.8/10 kb ±0.9) or in scaffolds numerically adjacent to those containing islands (2.4/10 kb ±1.5, Kruskal–Wallis, p = 0.515, Fig. 5d), transposable elements are closely associated to island borders. When each island is treated as a single locus, the nearest external transposable element 5′ of the first gene, and 3′ of the last, is significantly closer than expected (ANOVA, n = 39, p = 0.028 accounting for multiple testing, Fig. 5d). Interestingly, the inverse measurement (the closest internal transposon to each island border), is not significantly closer than expected (n = 45, p = 0.116, Fig. 5d), suggesting that this may be a feature of islands as an integral whole, rather than the separate genes comprising the islands.
Identification of a putative enhancer motif associated with dorsal gland effectors
The DOG box as a predictor of effectors
Although not all secreted proteins are effectors, all effector proteins are secreted. Within the 150 G. rostochiensis genes with three or more DOG boxes and a signal peptide, there were 31 known effectors from 14 families, an approximately 100-fold enrichment. The expression patterns of these 150 genes (including newly discovered candidate effector sequences) were consistent with a role in parasitism. For G. pallida, where more comprehensive life stage expression data are available, the same association was observed (Additional file 20: Figure S10) . Despite the fact that most genes with >3 ATGCCA motifs in G. pallida and a signal peptide are expressed at J2, the number of motifs in the promoter region was not a quantitative predictor of gene expression at J2 (R2 = 0.0002, Additional file 20: Figure S10) or at any other life stage, indicating that the ATGCCA motif is not a J2 enhancer. These data most likely reflect the biology of the nematode which dictates that a substantial proportion of effectors are required in the dorsal gland during the infective juvenile stages.
The interactions between plant-parasitic nematodes and their hosts are both complex and specific. In a successful interaction, the nematodes can avoid induction of an effective host immune response, resist any immune response that is expressed and manipulate the host’s developmental and cell biology to induce and maintain a functional feeding site. These interactions are mediated by an armoury of effectors that plant-parasitic nematodes appear to have assembled from adaptation of endogenous genes and also loci acquired by horizontal gene transfer from a diverse range of other taxa. To probe and understand these interactions, genomic analyses complement more directed studies, to drive and focus future programmes. Genomics can deliver whole-system analyses that permit global recovery of likely actors in parasite-host interactions. In turn, these insights can suggest new approaches to the understanding of pathogenesis and ultimately control of parasite-induced crop losses. The expanded effector set, including new effector types, the association of presence of particular effector loci with breaking of plant resistance and the definition of shared transcriptional control systems we have reported here from genomic and transcriptomic analyses of G. rostochiensis are demonstrations of this utility.
Nematode culture and DNA isolation
G. rostochiensis populations Ro1, Ro2, Ro3, Ro4 and Ro5 from the JHI PCN collection were maintained on a mixture of susceptible varieties in glasshouse conditions. For the reference assembly (Ro1), DNA was extracted according to described methods . For population re-sequencing, DNA extraction was carried out as previously described .
Genome sequencing and assembly
Three sequencing libraries were prepared from total genomic DNA (Additional file 23: Table S11). A PCR-free 400–550 bp paired-end Illumina library was prepared using a previously described protocol , with the addition of sample clean up and size selection with Agencourt AMPure XP. DNA was precipitated onto beads after each enzymatic stage with an equal volume of 20 % Polyethylene Glycol 6000 and 2.5 M sodium chloride solution. Beads were not separated from the sample throughout the process until after the adapter ligation stage: fresh beads were then used for size selection. Two mate pair libraries with ~2 kb virtual insert size were constructed . The libraries were denatured using 0.1 M sodium hydroxide and diluted to 8 pM in hybridisation buffer for cluster amplification on the Illumina cBOT using the V3 cluster generation kit following the manufacturer’s protocol, followed by a SYBRGreen cluster density QC prior to paired-end 100 base sequencing on an Illumina HiSeq2000. Raw data were analysed using the Illumina RTA1.8 analysis pipelines.
An initial assembly was produced from a combination of short-fragment paired-end and mate-pair Illumina libraries (Additional file 23: Table S11). Short paired-end sequence reads were first corrected and initially assembled using SGA v0.9.7 30 . This draft assembly was then used to calculate the distribution of k-mers for all odd values of k between 41 and 81, using GenomeTools v.1.3.7 . The k-mer length for which the maximum number of unique k-mers were present in the SGA assembly (k = 63) was then used as the k-mer setting for de Bruijn graph construction in a second assembly with Velvet v1.2.03 32 . The mate-pair library was then used to further scaffold this Velvet assembly using SSPACE  with an iterative approach, in which the number of read-pair links required to scaffold two contigs was initially set to 50, then reduced to 30, 20 and finally set to 10 for two final iterations of SSPACE to produce assembly nGr.v0.9. The three whole genome sequencing libraries were subsequently used to gap fill the assembly (GapFiller v1.10 , 10 iterations and default values for extension parameters), producing the final assembly nGr.v1.0.
A BlobDB (Blobtools v0.9.9 (https://drl.github.io/blobtools/) [58, 59] was constructed using: (1) the assembly; (2) similarity search results against the NCBI Nucleotide database (BLASTn 2.3.0+  megablast, E-value cutoff 1e–65), Uniref90 (Diamond v0.7.12 , blastx, using the options --sensitive, -k 25 and -c 1) and the G. pallida reference genome nGp.v1.0 (BLASTn megablast, E-value cutoff 1e–65); (3) the three DNA-seq read libraries mapped back to the assembly (CLC mapper v4.21.104315 CLCBio, Copenhagen, Denmark). A Taxon-Annotated-Gc-Coverage plot (TAGC) was drawn at the rank of phylum and under taxrule ‘bestsum’. Using Blobtools view, taxonomically annotated non-nematode scaffolds with a bit-score ≥200 were inspected manually and compared against NCBI Nucleotide database (BLASTn). Twenty-three scaffolds could be excluded as contaminants based on strong similarity to Bacteria or Fungi (span = 98.2 kb). TAGC plots pre- and post-filtering are shown in Additional file 24: Figure S11. SSU/LSU rDNA screening was carried out through sequence similarity searches (BLASTn megablast) of the assembly against SILVA SSUParc and LSUParc databases. Hits were only observed against G. rostochiensis SSU (scaffolds GROS_00919, GROS_01231) and LSU (scaffold GROS_00803, GROS_00919, GROS_01231).
Genome annotation was carried out in a two-step process detailed in the Additional file 2: Supplementary information. An initial round of automated gene predictions (nGr.v0.9.auto, 13,650 models) were refined in the collaborative genome annotation editor WebApollo (v1.0.4-RC3 ). Approximately one-eighth of the gene models were manually inspected based on homology to known Globodera genes, RNA-seq evidence and WGS read coverage yielding 1566 manually curated gene models (nGr.v0.9.manual). A second round of de novo gene prediction was carried out on assembly nGr.v1.0 with the addition of manual annotations as protein homology evidence and mapped RNA-seq reads as intron-hints to train and run Augustus (v3.1 ) resulting in the final gene set nGr.v1.0 containing 14,309 protein-coding genes. Functional annotation was performed using InterProScan5 (v5.7-48.0 ) to identify motifs and domains in the proteins by comparing them against databases Gene3D, PRINTS, Pfam, Phobius, ProSitePatterns, ProSiteProfiles, SMART, SUPERFAMILY, SignalP_EUK, TIGRFAM, TMHMM, Annot8r with KEGG, GO, EC, tRNAscan and rfam. GO-Term annotation and GO-enrichment analysis was carried out using Blast2GO 3.1.3 .
Splice sites were extracted from the genomes and GFF3 files present on WormBase for the species in Additional file 4: Figure S2, using custom script extractRegionFromCoordinates.py (https://github.com/DRL/GenomeBiology2016_globodera_rostochiensis/GNU GENERAL PUBLIC LICENSE). Four base pairs up and downstream of the 5′ donor site, and 6 bp upstream of the 3′ acceptor site were used to construct a consensus sequence for all GC/AG introns, and an identical sized sample of randomly selected GT/AG introns, using MEME SUITE v4.9.1 .
Transcriptome sequencing and differential expression
RNA from two life stages (hatched second-stage juvenile and 14 dpi female) was sequenced, each in biological duplicate, with Illumina Hiseq 100 bp paired-end reads (SRA accessions ERR202479, ERR202487 and PRJEB12075). These were compared with two additional life stages (dormant cysts and hydrated eggs), similarly sequenced in biological duplicate (Genbank accessions SAMN03393004 and SAMN03393005). All RNA-seq was carried out on pathotype Ro1. Normalized gene expression values and differentially expressed genes were identified as previously described . In brief, raw reads were trimmed of adapter sequences and low quality bases (Phred <22, Trimmomatic ), mapped to the genome (Tophat2, ), counted on a per gene basis (bedtools v2.16.2 ), TMM normalised and differential expression analysis and clustering were performed using a Trinity wrapper pipeline and associated scripts for RSEM  and EdgeR  (FDR <0.001, minimum fold-change 4, ). Expression clusters were grouped based on the tree height parameter (12 %) and manually assigned to expression super-clusters.
Phylogenetic analysis of WAGO proteins
Putative G. rostochiensis (n = 23), G. pallida (n = 18) and M. hapla (n = 18) WAGOs present in OrthoMCL clusters, which contained at least one G. rostochiensis protein with highest similarity to C. elegans WAGO1, were aligned to 545 WAGO sequences from Buck and Blaxter, 2013 . This comprised WAGOs from Clade I, Clade III, Clade IV and Clade V nematodes, as well as non-Nematode argonaute sequences (http://datadryad.org/resource/doi:10.5061/dryad.5qs11). Alignment was carried out using clustal-omega 1.2.0  and alignment was trimmed to only include the core PIWI PAZ domain section of argonautes. The WAG + G + F model of amino acid sequence evolution was selected under AICC using Prottest 3.4  and phylogenetic trees were inferred using RAxML 8.1.20  (ML search + 100 rapid bootstraps).
Horizontal gene transfer
Sequences derived from species under NCBI Taxonomy’s ‘Tylenchida’ (TaxID: 6300, equivalent to Tylenchomorpha) were not included in this calculation to allow detection of HGT events which took place in an ancestor of cyst nematodes and their tylenchomorph relatives. No AI value could be calculated for proteins returning no similar sequences in the nr database. An AI >0 indicates a better hit to a non-metazoan species than to a metazoan species and thus a possible acquisition via HGT. An AI >30 corresponds to a difference of magnitude e10 between the best non-metazoan and best metazoan E-values and is estimated to be a strong indication of a HGT event . Proteins with an AI >0 and ≥70 % identity to a non-metazoan protein were considered putative contaminants and not included in further analysis.
Genes in the G. rostochiensis genome sequence similar to previously reported effectors with experimentally validated gland cell expression were identified in a two-step process. An inclusive list of effectors was generated by sequence similarity alone. For those effectors that are characterised by the presence of particular domains (e.g. the SPRY domain of SPRY-SEC effectors), hmmsearch  using the appropriate domain was used to identify all sequences predicted to contain the same domain using the gathering significance threshold. For all other effectors, BLASTp was used to identify similar sequences (E-value ≤1e–5). Cell wall degrading enzymes (CWDEs) identified as putatively acquired via HGT were included if they had known in situ localisation to either gland cell. This inclusive list was triaged by removing those without a predicted signal peptide and/or those with one or more transmembrane domain (Phobius ), producing the high-confidence effector list (Additional file 12: Table S4).
Sequence reads (Bioproject PRJNA305631) were mapped against the assembly using bwa mem v0.7.12-r1044 . Duplicated read pairs were removed using Picard (http://broadinstitute.github.io/picard). Variants were called using freebayes v0.9.20-16-g3e35e72 . Haplotypes and other complex variants were decomposed using vcflib vcfallelicprimitives v1.0.0-rc0 (https://github.com/ekg/vcflib/releases/tag/v1.0.0-rc0) followed by normalisation using vt normalize v0.57 . The resulting VCF file was filtered with the following parameters: DP > 10 & MQM > 30 & QUAL > 1 & QUAL/AO > 10 & SAF > 2 & SAR > 2 & RPR > 1 & RPL > 1 using vcffilter from vcflib. Variants were annotated using SnpEff v4.1 L . The resulting VCF file was analysed using vt peek, RTG Tools  and parse_snpeff.py. Variants (vcf file) were filtered to retain only SNPs (TYPE = snp) with no missing data, 730,705 loci were found from whole genome data. Allele frequencies at each locus was computed by dividing the reference allele observation count (RO) by the read depth (DP). In the same manner, allele frequencies for SNPs present in non-coding regions (n = 619,886) were computed. Seqboot module in PHYLIP v3.695  was used to make 100 bootstrapped datasets. Maximum likelihood phylogenetic trees of the nine populations of G. rostochiensis were calculated with the Contml module based on genome-wide SNP allele frequencies and a majority rule consensus tree was constructed using Consense. Principal component analysis (PCA) were calculated with the prcomp() function from the stats package in R based on genome-wide allele frequencies at these 730,705 loci.
Putative one-to-one orthologues between G. pallida and G. rostochiensis were identified by the reciprocal best BLAST hit method. Both proteomes were compared against each other using BLASTp (v2.2.30+) and the resulting files were processed using the script rbbh.py (https://github.com/DRL/GenomeBiology2016_globodera_rostochiensis GNU GENERAL PUBLIC LICENSE, E-value ≤1e–25 and reciprocal-query coverage >75 %). Protein clustering analysis was performed on the proteomes (retrieved from Wormbase WS248) of B. xylophilus, C. elegans, M. hapla, M. incognita, G. pallida (retrieved from WormBase ParaSite WBPS2) and G. rostochiensis (nGr.v1.0) using OrthoMCL (v2.0.9 ) (with an inflation value of 1.5) and following the guidelines specified in . Phylogenetically informative sets of clusters were plotted using UpSetR (Release v1.0.0, https://github.com/hms-dbmi/UpSetR/releases ). For each of four orthologous gene cluster categories (all nematodes tested, all plant parasites tested, Globodera and Meloidogyne and Globodera alone), the percentage of genes present in each differential expression super-cluster was determined. This value was normalised by the total number of genes present in each given differential expression super-cluster, to return a relative measure of abundance used in Fig. 3.
Effector islands, synteny and promoter analyses
The presence of effectors in adjacent (n ± 1), or neighbouring positions (up to ±9), was determined. As a negative control, a subset of 612 G. rostochiensis gene families not predicted to contain effectors was identified from the OrthoMCL. This subset contained gene families of various sizes, the distribution of which with respect to gene family size 1, 2 and ≥3 was the same as that of the effectors. Starting from this initial negative set of 612 gene families, 37 were selected at random and the presence of genes from these 37 families in adjacent (n ± 1), or neighbouring positions (up to ±9), was determined. This process was repeated for 1000 iterations to generate a robust negative for the average frequency in each neighbouring position. The observed frequency of effector occurrence at each position was compared to the average of 1000 iterations. Non-overlapping islands, delineated by furthest distance at which statistically significant enrichment was observed (±6, χ 2 goodness of fit, p <0.001), were manually identified.
Synteny between scaffolds of G. pallida and G. rostochiensis was assessed based on OrthoMCL-cluster membership of both sets of proteins using i-adhore-3.0.01 ((https://github.com/widdowquinn/Teaching/tree/master/Comparative_Genomics_and_Visualisation/Part_2/i-ADHore) type = family, tandem_gap = 10, gap_size = 15, max_gaps_in_alignment = 20, cluster_gap = 20, q_value = 0.9, alignment_method = gg2, prob_cutoff = 0.001, multiple_hypothesis_correction = bonferroni, anchor_points = 5). Syntenic blocks were visualised as clusters in a graph using parse_iadhore.py. G. rostochiensis scaffold GROS_00007 (a member of the biggest syntenic cluster) was plotted with its homologous G. pallida scaffolds using circos 0.67-7, including GC-content and BLASTn results at an E-value cutoff of 1e-65.
To analyse putative enhancer elements, sequences 500 bp upstream of genes of interest (termed the promoter regions) were extracted from the genome using get_upstream_regions.py (https://github.com/peterthorpe5 GNU GENERAL PUBLIC LICENSE). Enrichment of motifs between categories (DG versus all, DG versus SvG, etc.) was calculated using HOMER , specifying max length of six nucleotides. Instances of the motif were identified in FASTA sequences of promoter regions using the FIMO web server .
In situ hybridisation
The spatial expression patterns of two predicted G. rostochiensis dorsal gland effectors were determined in J2s by in situ hybridisation as described previously . Single-stranded digoxygenin-labelled DNA probes were synthesised from amplified cDNA fragments using primers g14226F (5′-CCGTTGAGCCGTCGACTAAT-3′) and g14226R (5′-TTTCCCGACGTCCAGTTGAC-3′) or g04707F (5′-AAGGAGCACCATCGTACCAAG-3′) and g04707R (5′-GTTCTGAGCCTTGTTGAAAG-3′).
Description of additional data files
The following additional data are available with the online version of this paper. Additional file 7: File S1 contains the data matrix of normalised expression values. Additional file 2: Supplementary information file 1 contains various supplemental methods and results.
This work benefited from interactions promoted by COST Action project FA 1208. We thank Nathalie Smerdon and Lesley Shirley for creating the sequencing libraries and Matthew Berriman for supporting this work. The authors are grateful to the Genotoul bioinformatics platform Toulouse Midi-Pyrenees for providing computing resources.
SE-vdA is supported by BBSRC grant BB/M014207/1. Sequencing was funded by BBSRC grant BB/F000642/1 to the University of Leeds and grant BB/F00334X/1 to the Wellcome Trust Sanger Institute). DRL was supported by a fellowship from The James Hutton Institute and the School of Biological Sciences, University of Edinburgh. GK was supported by a BBSRC PhD studentship. The James Hutton Institute receives funding from the Scottish Government. JAC and NEH are supported by the Wellcome Trust through its core funding of the Wellcome Trust Sanger Institute (grant 098051). This work was also supported by funding from the Canadian Safety and Security Program, project number CRTI09_462RD.
Availability of data and materials
Genome sequence data are available in the SRA repository accessions ERR114519, ERR123958 and ERR114520. Transcriptomic data are available in the SRA repository under accession ERR202479, ERR202487 and PRJEB12075 and the GenBank repository accessions SAMN03393004 and SAMN03393005. Whole genome resequencing data are available under the Bioproject PRJNA305631. Raw and parsed VCF files, transposable element prediction and gene coordinates, clusters and normalised expression tables, interproscan results and blast2go results are available in Dryad accession doi:10.5061/dryad.4s5r6. Custom scripts integral to the manuscript are available on GitHub and cited where appropriate in the text. The G. rostochiensis genome is available for query (BLAST etc) at: http://globodera.bio.ed.ac.uk/search/species_search?genomeSelect=1213186&Gid=1213187&GFFid=1213188).
NEH and JAC carried out sequencing and assembly. DRL and GK carried out automated annotations and gap filling. SEVDA, DRL, EGJD, BM, EG, MOD, IB, JL, ME, MS, MR, HO, AT, GS, SM, PJAC, WP, CJL, JMCM, LMJ, JTJ and HBY participated in manual annotation. DRL and SEVDA carried out phylum and genus comparative analyses. DRL, BM, MOD, IB and SEVDA carried out variant analysis. JM carried out microsatellite analysis. SEVDA and MDR carried out transcriptomic analyses. EGJD and CR carried out HGT calculations and analyses. BH carried out carbohydrate-active enzymes analyses. AS predicted transposable elements. SEVDA and DRL carried out splicing analyses. SEVDA, PT and CJL carried out effectors analyses. CJL carried out in situ hybridisation. SEVDA, DRL, EGJD, MB and JTJ wrote the manuscript. VB, PEU and MB critically revised the manuscript. SEVDA, DRL, JTJ, CJL, VB, PEU and MB participated in design of the study. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Ethics approval and consent to participate
Ethics approval was not needed for the study.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Nicol JM, Turner SJ, Coyne DL, den Nijs L, Hockland S, Maafi ZT. Current nematode threats to world agriculture. In: Jones J, Gheysen G, Fenoll C, editors. Genomics and molecular genetics of plant-nematode interactions. The Netherlands: Springer; 2011. p. 21–43.Google Scholar
- Evans K, Franco J, De Scurrah MM. Distribution of species of potato cyst-nematodes in South America. Nematologica. 1975;21:365–9.View ArticleGoogle Scholar
- Plantard O, Picard D, Valette S, Scurrah M, Grenier E, Mugniery D. Origin and genetic diversity of Western European populations of the potato cyst nematode (Globodera pallida) inferred from mitochondrial sequences and microsatellite loci. Mol Ecol. 2008;17:2208–18.View ArticlePubMedGoogle Scholar
- Evans K, Rowe J. Distribution and economic importance. In: Sharma SB, editor. The cyst nematodes. The Netherlands: Springer; 1998. p. 1–30Google Scholar
- Boucher AC, Mimee B, Montarry J, Bardou-Valette S, Bélair G, Moffett P, et al. Genetic diversity of the golden potato cyst nematode Globodera rostochiensis and determination of the origin of populations in Quebec, Canada. Mol Phylogenet Evol. 2013;69:75–82.View ArticlePubMedGoogle Scholar
- Phillips M, Trudgill D. Variation of virulence, in terms of quantitative reproduction of Globodera pallida populations, from Europe and South America, in relation to resistance from Solanum vernei and S. tuberosum ssp. andigena CPC 2802. Nematologica. 1998;44:409–23.View ArticleGoogle Scholar
- Hockland S, Niere B, Grenier E, Blok V, Phillips M, Den Nijs L, et al. An evaluation of the implications of virulence in non-European populations of Globodera pallida and G. rostochiensis for potato cultivation in Europe. Nematology. 2012;14:1–13.View ArticleGoogle Scholar
- Kort J, Ross H, Rumpenhorst H, Stone A. An international scheme for identifying and classifying pathotypes of potato cyst-nematodes Globodera rostochiensis and G. pallida. Nematologica. 1977;23:333–9.View ArticleGoogle Scholar
- Sobczak M, Golinowski W. Cyst nematodes and syncytia. In: Jones J, Gheysen G, Fenoll C, editors. Genomics and molecular genetics of plant-nematode interactions. The Netherlands: Springer; 2011. p. 61–82Google Scholar
- Thorpe P, Mantelin S, Cock PJ, Blok VC, Coke MC, Eves-van den Akker S, et al. Genomic characterisation of the effector complement of the potato cyst nematode Globodera pallida. BMC Genomics. 2014;15:923.View ArticlePubMedPubMed CentralGoogle Scholar
- Eves-van den Akker S, Lilley CJ, Jones JT, Urwin PE. Identification and characterisation of a hyper-variable apoplastic effector gene family of the potato cyst nematodes. PLoS Pathog. 2014;10, e1004391.Google Scholar
- Smant G, Stokkermans J, Yan YT, de Boer JM, Baum TJ, Wang XH, et al. Endogenous cellulases in animals: Isolation of beta-1,4-endoglucanase genes from two species of plant-parasitic cyst nematodes. Proc Natl Acad Sci U S A. 1998;95:4906–11.View ArticlePubMedPubMed CentralGoogle Scholar
- Mei Y, Thorpe P, Guzha A, Haegeman A, Blok VC, MacKenzie K, et al. Only a small subset of the SPRY domain gene family in Globodera pallida is likely to encode effectors, two of which suppress host defences induced by the potato resistance gene Gpa2. Nematology. 2015;17:409–24.View ArticleGoogle Scholar
- Lee C, Chronis D, Kenning C, Peret B, Hewezi T, Davis EL, et al. The novel cyst nematode effector protein 19C07 interacts with the Arabidopsis auxin influx transporter LAX3 to control feeding site development. Plant Physiol. 2011;155:866–80.View ArticlePubMedGoogle Scholar
- Wang XH, Mitchum MG, Gao BL, Li CY, Diab H, Baum TJ, et al. A parasitism gene from a plant-parasitic nematode with function similar to CLAVATA3/ESR (CLE) of Arabidopsis thaliana. Mol Plant Pathol. 2005;6:187–91.View ArticlePubMedGoogle Scholar
- Danchin EGJ, Rosso M-N, Vieira P, de Almeida-Engler J, Coutinho PM, Henrissat B, et al. Multiple lateral gene transfers and duplications have promoted plant parasitism ability in nematodes. Proc Natl Acad Sci U S A. 2010;107:17651–6.View ArticlePubMedPubMed CentralGoogle Scholar
- Haegeman A, Jones JT, Danchin EGJ. Horizontal gene transfer in nematodes: a catalyst for plant parasitism? Mol Plant-Microbe Interact. 2011;24:879–87.View ArticlePubMedGoogle Scholar
- Maier TR, Hewezi T, Peng J, Baum TJ. Isolation of whole esophageal gland cells from plant-parasitic nematodes for transcriptome analyses and effector identification. Mol Plant-Microbe Interact. 2013;26:31–5.View ArticlePubMedGoogle Scholar
- Bellafiore S, Shen Z, Rosso M-N, Abad P, Shih P, Briggs SP. Direct identification of the Meloidogyne incognita secretome reveals proteins with host cell reprogramming potential. PLoS Pathog. 2008;4:e1000192.View ArticlePubMedPubMed CentralGoogle Scholar
- Hewezi T, Juvale PS, Piya S, Maier TR, Rambani A, Rice JH, et al. The cyst nematode effector protein 10A07 targets and recruits host posttranslational machinery to mediate its nuclear trafficking and to promote parasitism in Arabidopsis. Plant Cell. 2015;27:891–907.View ArticlePubMedPubMed CentralGoogle Scholar
- Wang J, Replogle A, Hussey R, Baum T, Wang X, Davis EL, et al. Identification of potential host plant mimics of CLAVATA3/ESR (CLE)-like peptides from the plant-parasitic nematode Heterodera schachtii. Mol Plant Pathol. 2011;12:177–86.View ArticlePubMedGoogle Scholar
- Cotton JA, Lilley CJ, Jones LM, Kikuchi T, Reid AJ, Thorpe P, et al. The genome and life-stage specific transcriptomes of Globodera pallida elucidate key aspects of plant parasitism by a cyst nematode. Genome Biol. 2014;15:R43.View ArticlePubMedPubMed CentralGoogle Scholar
- Abad P, Gouzy J, Aury J-M, Castagnone-Sereno P, Danchin EGJ, Deleury E, et al. Genome sequence of the metazoan plant-parasitic nematode Meloidogyne incognita. Nat Biotechnol. 2008;26:909–15.View ArticlePubMedGoogle Scholar
- Opperman CH, Bird DM, Williamson VM, Rokhsar DS, Burke M, Cohn J, et al. Sequence and genetic map of Meloidogyne hapla: A compact nematode genome for plant parasitism. Proc Natl Acad Sci. 2008;105:14802–7.View ArticlePubMedPubMed CentralGoogle Scholar
- Consortium S. Genome sequence of the nematode C. elegans: a platform for investigating biology. Science. 1998;282:2012–8.View ArticleGoogle Scholar
- Grisi E, Burrows PR, Perry RN, Hominick WM. The genome size and chromosome complement of the potato cyst nematode Globodera pallida. Fundam Appl Nematol. 1995;18:67–70.Google Scholar
- Burset M, Seledtsov I, Solovyev V. Analysis of canonical and non-canonical splice sites in mammalian genomes. Nucleic Acids Res. 2000;28:4364–75.View ArticlePubMedPubMed CentralGoogle Scholar
- Farrer T, Roller AB, Kent WJ, Zahler AM. Analysis of the role of Caenorhabditis elegans GC‐AG introns in regulated splicing. Nucleic Acids Res. 2002;30:3360–7.View ArticlePubMedPubMed CentralGoogle Scholar
- Kikuchi T, Cotton JA, Dalzell JJ, Hasegawa K, Kanzaki N, McVeigh P, et al. Genomic insights into the origin of parasitism in the emerging plant pathogen Bursaphelenchus xylophilus. PLoS Pathog. 2011;7:e1002219.View ArticlePubMedPubMed CentralGoogle Scholar
- Buck AH, Blaxter M. Biogenesis and turnover of small RNAs: functional diversification of Argonautes in nematodes: an expanding universe. Biochem Soc Trans. 2013;41:881.View ArticlePubMedPubMed CentralGoogle Scholar
- Rancurel C, Da Rocha M, Danchin EGJ. Alienness: rapid detection of horizontal gene transfers in metazoan genomes. F1000Posters 2014. 5:1666 (poster).Google Scholar
- Flot J-F, Hespeels B, Li X, Noel B, Arkhipova I, Danchin EG, et al. Genomic evidence for ameiotic evolution in the bdelloid rotifer Adineta vaga. Nature. 2013;500:453–7.View ArticlePubMedGoogle Scholar
- Gladyshev EA, Meselson M, Arkhipova IR. Massive horizontal gene transfer in bdelloid rotifers. Science. 2008;320:1210–3.View ArticlePubMedGoogle Scholar
- Danchin EGJ, Guzeeva EA, Mantelin S, Berepiki A, Jones JT. Horizontal gene transfer from bacteria has enabled the plant-parasitic nematode Globodera pallida to feed on host-derived sucrose. Mol Biol Evol. 2016;33:1571–79.View ArticlePubMedGoogle Scholar
- Dong S, Stam R, Cano LM, Song J, Sklenar J, Yoshida K, et al. Effector specialization in a lineage of the Irish potato famine pathogen. Science. 2014;343:552–5.View ArticlePubMedGoogle Scholar
- Gilroy EM, Breen S, Whisson SC, Squires J, Hein I, Kaczmarek M, et al. Presence/absence, differential expression and sequence polymorphisms between PiAVR2 and PiAVR2‐like in Phytophthora infestans determine virulence on R2 plants. New Phytol. 2011;191:763–76.View ArticlePubMedGoogle Scholar
- Eves-van den Akker S, Lilley C, Danchin E, Rancurel C, Cock P, Urwin P, Jones J. The transcriptome of Nacobbus aberrans reveals insights into the evolution of sedentary endoparasitism in plant-parasitic nematodes. Genome Biol Evol. 2014;6:2181–94.View ArticlePubMedPubMed CentralGoogle Scholar
- Hussey R, Mims C. Ultrastructure of esophageal glands and their secretory granules in the root-knot nematode Meloidogyne incognita. Protoplasma. 1990;156:9–18.View ArticleGoogle Scholar
- Endo B. Ultrastructure of subventral gland secretory granules in parasitic juveniles of the soybean cyst nematode, Heterodera glycines. J Helminthol Soc Wash. 1993;60:22–34.Google Scholar
- Endo BY. Ultrastructure of esophageal gland secretory granules in juveniles of Heterodera glycines. J Nematol. 1987;19:469.PubMedPubMed CentralGoogle Scholar
- Maule AG, Curtis R. Parallels between plant and animal parasitic nematodes. In: Jones J, Gheysen G, Fenoll C, editors. Genomics and molecular genetics of plant-nematode interactions. The Netherlands: Springer; 2011. p. 221–51.Google Scholar
- Mimee B, Duceppe MO, Véronneau PY, Lafond‐Lapalme J, Jean M, Belzile F, et al. A new method for studying population genetics of cyst nematodes based on Pool‐Seq and genomewide allele frequency analysis. Mol Ecol Resour. 2015;15:1356–65.View ArticlePubMedGoogle Scholar
- Noon JB, Hewezi TAF, Maier TR, Simmons C, Wei J-Z, Wu G, et al. Eighteen new candidate effectors of the phytonematode Heterodera glycines produced specifically in the secretory esophageal gland cells during parasitism. Phytopathology. 2015;105:1362–72.View ArticlePubMedGoogle Scholar
- Gao BL, Allen R, Maier T, Davis EL, Baum TJ, Hussey RS. The parasitome of the phytonematode Heterodera glycines. Mol Plant-Microbe Interact. 2003;16:720–6.View ArticlePubMedGoogle Scholar
- Jones JT, Kumar A, Pylypenko LA, Thirugnanasambandam A, Castelli L, Chapman S, et al. Identification and functional characterization of effectors in expressed sequence tags from various life cycle stages of the potato cyst nematode Globodera pallida. Mol Plant Pathol. 2009;10:815–28.View ArticlePubMedGoogle Scholar
- Chronis D, Chen S, Lu S, Hewezi T, Carpenter SC, Loria R, et al. A ubiquitin carboxyl extension protein secreted from a plant-parasitic nematode Globodera rostochiensis is cleaved in planta to promote plant parasitism. Plant J. 2013;74:185–96.View ArticlePubMedGoogle Scholar
- Raffaele S, Win J, Cano LM, Kamoun S. Analyses of genome architecture and gene expression reveal novel candidate virulence factors in the secretome of Phytophthora infestans. BMC Genomics. 2010;11:637.View ArticlePubMedPubMed CentralGoogle Scholar
- Vens C, Rosso M-N, Danchin EG. Identifying discriminative classification-based motifs in biological sequences. Bioinformatics. 2011;27:1231–8.View ArticlePubMedGoogle Scholar
- Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell. 2010;38:576–89.View ArticlePubMedPubMed CentralGoogle Scholar
- Espada M, Silva AC, Eves van den Akker S, Cock PJ, Mota M, Jones JT. Identification and characterization of parasitism genes from the pinewood nematode Bursaphelenchus xylophilus reveals a multi‐layered detoxification strategy. Mol Plant Pathol. 2016;17:286–95.View ArticlePubMedGoogle Scholar
- Kozarewa I, Ning Z, Quail MA, Sanders MJ, Berriman M, Turner DJ. Amplification-free Illumina sequencing-library preparation facilitates improved mapping and assembly of (G+ C)-biased genomes. Nat Methods. 2009;6:291–5.View ArticlePubMedPubMed CentralGoogle Scholar
- Park N, Shirley L, Gu Y, Keane TM, Swerdlow H, Quail MA. An improved approach to mate-paired library preparation for Illumina sequencing. Methods Next Generation Seq. 2013;1:10–20.Google Scholar
- Simpson JT, Durbin R. Efficient de novo assembly of large genomes using compressed data structures. Genome Res. 2012;22:549–56.View ArticlePubMedPubMed CentralGoogle Scholar
- Gremme G, Steinbiss S, Kurtz S. GenomeTools: a comprehensive software library for efficient processing of structured genome annotations. IEEE/ACM Trans Comput Biol Bioinform. 2013;10:645–56.View ArticlePubMedGoogle Scholar
- Zerbino DR, Birney E. Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18:821–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics. 2011;27:578–9.View ArticlePubMedGoogle Scholar
- Boetzer M, Pirovano W. Toward almost closed genomes with GapFiller. Genome Biol. 2012;13:R56.View ArticlePubMedPubMed CentralGoogle Scholar
- Kumar S, Blaxter ML. Simultaneous genome sequencing of symbionts and their hosts. Symbiosis. 2011;55:119–26.View ArticlePubMedGoogle Scholar
- Kumar S, Jones M, Koutsovoulos G, Clarke M, Blaxter M. Blobology: exploring raw genome data for contaminants, symbionts and parasites using taxon-annotated GC-coverage plots. Front Genet. 2013;4:237.View ArticlePubMedPubMed CentralGoogle Scholar
- Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10:421.View ArticlePubMedPubMed CentralGoogle Scholar
- Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015;12:59–60.View ArticlePubMedGoogle Scholar
- Lee E, Helt GA, Reese JT, Munoz-Torres MC, Childers CP, Buels RM, et al. Web Apollo: a web-based genomic annotation editing platform. Genome Biol. 2013;14:R93.View ArticlePubMedPubMed CentralGoogle Scholar
- Stanke M, Waack S. Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics. 2003;19:ii215–25.View ArticlePubMedGoogle Scholar
- Jones P, Binns D, Chang H-Y, Fraser M, Li W, McAnulla C, et al. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014;30:1236–40.View ArticlePubMedPubMed CentralGoogle Scholar
- Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21:3674–6.View ArticlePubMedGoogle Scholar
- Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, et al. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res. 2009;37:W202–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.View ArticlePubMedPubMed CentralGoogle Scholar
- Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14:R36.View ArticlePubMedPubMed CentralGoogle Scholar
- Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–2.View ArticlePubMedPubMed CentralGoogle Scholar
- Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323.View ArticlePubMedPubMed CentralGoogle Scholar
- Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–40.View ArticlePubMedGoogle Scholar
- Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J, et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat Protoc. 2013;8:1494–512.View ArticlePubMedGoogle Scholar
- Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, et al. Fast, scalable generation of high‐quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011;7:539.View ArticlePubMedPubMed CentralGoogle Scholar
- Darriba D, Taboada GL, Doallo R, Posada D. ProtTest 3: fast selection of best-fit models of protein evolution. Bioinformatics. 2011;27:1164–5.View ArticlePubMedGoogle Scholar
- Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–3.View ArticlePubMedPubMed CentralGoogle Scholar
- Finn RD, Clements J, Arndt W, Miller BL, Wheeler TJ, Schreiber F, et al. HMMER web server: 2015 update. Nucleic Acids Res. 2015;43:W30–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Käll L, Krogh A, Sonnhammer EL. Advantages of combined transmembrane topology and signal peptide prediction—the Phobius web server. Nucleic Acids Res. 2007;35:W429–32.View ArticlePubMedPubMed CentralGoogle Scholar
- Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25:1754–60.View ArticlePubMedPubMed CentralGoogle Scholar
- Garrison E, Marth G. Haplotype-based variant detection from short-read sequencing. Preprint at arXiv 2012:1207.3907v2 [q-bio.GN].Google Scholar
- Tan A, Abecasis GR, Kang HM. Unified representation of genetic variants. Bioinformatics. 2015;31:2202–4.View ArticlePubMedGoogle Scholar
- Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6:80–92.View ArticlePubMedPubMed CentralGoogle Scholar
- Cleary JG, Braithwaite R, Gaastra K, Hilbush BS, Inglis S, Irvine SA, et al. Comparing variant call files for performance benchmarking of next-generation sequencing variant calling pipelines. bioRxiv 2015. doi:10.1101/023754.
- Plotree D, Plotgram D. PHYLIP-phylogeny inference package (version 3.2). Cladistics. 1989;5:163–6.View ArticleGoogle Scholar
- Fischer S, Brunk BP, Chen F, Gao X, Harb OS, Iodice JB, et al. Using OrthoMCL to assign proteins to OrthoMCL‐DB groups or to cluster proteomes into new Ortholog groups. Curr Protoc Bioinformatics. 2011;Chapter 6:Unit 6.12.1–19.Google Scholar
- Lex A, Gehlenborg N. Points of view: sets and intersections. Nat Methods. 2014;11:779–9.View ArticleGoogle Scholar
- Grant CE, Bailey TL, Noble WS. FIMO: scanning for occurrences of a given motif. Bioinformatics. 2011;27:1017–8.View ArticlePubMedPubMed CentralGoogle Scholar
- de Boer JM, Yan Y, Smant G, Davis EL, Baum TJ. In situ hybridization to messenger RNA in Heterodera glycines. J Nematol. 1998;30:309–12.PubMedPubMed CentralGoogle Scholar
- Lunt DH, Kumar S, Koutsovoulos G, Blaxter ML. The complex hybrid origins of the root knot nematodes revealed through comparative genomics. Peer J. 2014;2, e356.View ArticlePubMedPubMed CentralGoogle Scholar
- Blaxter ML, De Ley P, Garey JR, Liu LX, Scheldeman P, Vierstraete A, et al. A molecular evolutionary framework for the phylum Nematoda. Nature. 1998;392:71–5.View ArticlePubMedGoogle Scholar
- Qin L, Kudla U, Roze EH, Goverse A, Popeijus H, Nieuwland J, et al. Plant degradation: a nematode expansin acting on plants. Nature. 2004;427:30.View ArticlePubMedGoogle Scholar
- Vanholme B, Haegeman A, Jacob J, Cannoot B, Gheysen G. Arabinogalactan endo-1, 4-galactosidase: a putative plant cell wall-degrading enzyme of plant-parasitic nematodes. Nematology. 2009;11:739–47.View ArticleGoogle Scholar
- Kudla U, Milac AL, Qin L, Overmars H, Roze E, Holterman M, et al. Structural and functional characterization of a novel, host penetration‐related pectate lyase from the potato cyst nematode Globodera rostochiensis. Mol Plant Pathol. 2007;8:293–305.View ArticlePubMedGoogle Scholar
- De Boer J, Davis E, Hussey R, Popeijus H, Smant G, Baum T. Cloning of a putative pectate lyase gene expressed in the subventral esophageal glands of Heterodera glycines. J Nematol. 2002;34:9.PubMedPubMed CentralGoogle Scholar
- Gao B, Allen R, Maier T, McDermott JP, Davis EL, Baum TJ, et al. Characterisation and developmental expression of a chitinase gene in Heterodera glycines. Int J Parasitol. 2002;32:1293–300.View ArticlePubMedGoogle Scholar
- Jones JT, Furlanetto C, Bakker E, Banks B, Blok V, Chen Q, et al. Characterization of a chorismate mutase from the potato cyst nematode Globodera pallida. Mol Plant Pathol. 2003;4:43–50.View ArticlePubMedGoogle Scholar
- Bauters L, Haegeman A, Kyndt T, Gheysen G. Analysis of the transcriptome of Hirschmanniella oryzae to explore potential survival strategies and host–nematode interactions. Mol Plant Pathol. 2014;15:352–63.View ArticlePubMedGoogle Scholar
- Craig JP, Bekal S, Niblack T, Domier L, Lambert KN. Evidence for horizontally transferred genes involved in the biosynthesis of vitamin B1, B5, and B7 in Heterodera glycines. J Nematol. 2009;41:281.PubMedPubMed CentralGoogle Scholar
- Craig JP, Bekal S, Hudson M, Domier L, Niblack T, Lambert KN. Analysis of a horizontally transferred pathway involved in vitamin B6 biosynthesis from the soybean cyst nematode Heterodera glycines. Mol Biol Evol. 2008;25:2085–98.View ArticlePubMedGoogle Scholar
- Veronico P, Jones J, Di Vito M, De Giorgi C. Horizontal transfer of a bacterial gene involved in polyglutamate biosynthesis to the plant-parasitic nematode Meloidogyne artiellia. Febs Lett. 2001;508:470–4.View ArticlePubMedGoogle Scholar
- Scholl EH, Thorne JL, McCarter JP, Bird DM. Horizontally transferred genes in plant-parasitic nematodes: a high-throughput genomic approach. Genome Biol. 2003;4:R39.View ArticlePubMedPubMed CentralGoogle Scholar
- Paganini J, Campan-Fournier A, Da Rocha M, Gouret P, Pontarotti P, Wajnberg E, et al. Contribution of lateral gene transfers to the genome composition and parasitic ability of root-knot nematodes. PLoS One. 2012;7:e50875.View ArticlePubMedPubMed CentralGoogle Scholar
- Clifton W, Bird DM, Waterston RH. Analysis and functional classification of transcripts from the nematode Meloidogyne incognita. Genome Biol. 2003;4:R26.View ArticlePubMedPubMed CentralGoogle Scholar