Divergent evolution of arrested development in the dauer stage of Caenorhabditis elegans and the infective stage of Heterodera glycines
Genome Biology volume 8, Article number: R211 (2007)
The soybean cyst nematode Heterodera glycines is the most important parasite in soybean production worldwide. A comprehensive analysis of large-scale gene expression changes throughout the development of plant-parasitic nematodes has been lacking to date.
We report an extensive genomic analysis of H. glycines, beginning with the generation of 20,100 expressed sequence tags (ESTs). In-depth analysis of these ESTs plus approximately 1,900 previously published sequences predicted 6,860 unique H. glycines genes and allowed a classification by function using InterProScan. Expression profiling of all 6,860 genes throughout the H. glycines life cycle was undertaken using the Affymetrix Soybean Genome Array GeneChip. Our data sets and results represent a comprehensive resource for molecular studies of H. glycines. Demonstrating the power of this resource, we were able to address whether arrested development in the Caenorhabditis elegans dauer larva and the H. glycines infective second-stage juvenile (J2) exhibits shared gene expression profiles. We determined that the gene expression profiles associated with the C. elegans dauer pathway are not uniformly conserved in H. glycines and that the expression profiles of genes for metabolic enzymes of C. elegans dauer larvae and H. glycines infective J2 are dissimilar.
Our results indicate that hallmark gene expression patterns and metabolism features are not shared in the developmentally arrested life stages of C. elegans and H. glycines, suggesting that developmental arrest in these two nematode species has undergone more divergent evolution than previously thought and pointing to the need for detailed genomic analyses of individual parasite species.
Heterodera glycines, the soybean cyst nematode, is the economically most important pathogen in soybean production and causes estimated annual yield losses of $800 million in the USA alone . H. glycines completes its life cycle in about one month . The first molt of the larvae takes place inside the eggs, and, after hatching, infective second-stage juveniles (J2) migrate through the soil and invade soybean roots to become parasitic J2. Once inside host roots, J2 move intracellulary through the root tissue to the central cylinder, where they initiate the formation of feeding sites (syncytia) and become sedentary. Only after feeding commences do nematodes molt and pass through two more juvenile stages (J3, J4) and, after a final molt, develop into adults. The enlarging body of the female, which remains sedentary for the remainder of the life cycle, breaks through the root cortex into the rhizosphere. Males regain motility and leave the root to fertilize females. After fertilization, females produce eggs, the majority of which are retained inside the uterus. Upon death of the adult female, its outer body layers harden and form a protective cyst (hence the name cyst nematodes) around the eggs until the environment is favorable again for a new generation of nematodes [2, 3]. Even though eggs in their cysts are the primary dispersal stage of this nematode in an epidemiological sense, the J2 stage is mobile and, thus, comparable to the dispersal stage of Caenorhabditis spp.
In the past, numerous reports on cyst nematodes (Heterodera spp. and Globodera spp.) focused on selected genes, rather than taking a genomic approach, to elucidate nematode biology or the host-pathogen interactions between these nematodes and their host plants. Many of these studies dealt with so-called parasitism genes that are expressed in the dorsal and subventral esophageal glands during parasitic stages of cyst nematodes. The products of these genes are thought to be secreted into the host tissue to mediate successful plant parasitism [4–13]. However, a comprehensive genomic analysis beyond this limited group of genes has been lacking to date. To fill this gap, we generated 20,100 H. glycines expressed sequence tags (ESTs). Analyses of these ESTs plus approximately 1,900 sequences already in public databases produced a grouping into 6,860 unique genes. We assigned putative functions to these genes based upon sequence homology and established their expression profiles throughout the major life stages of H. glycines. Our data sets and results now represent a comprehensive resource for molecular studies of H. glycines.
Genomic analyses provide powerful tools to elucidate relationships between plant-parasitic nematodes and their hosts. Previous reports focused on the analysis of ESTs of plant-parasitic nematodes [14–17] or used differential display [18, 19] and microarrays [20–22] to study gene expression changes in Arabidopsis and soybean in response to cyst nematode infection. Only recently, the advent of the Affymetrix Soybean Genome Array GeneChip enabled a parallel analysis of gene expression changes in both soybean and soybean cyst nematode during the early stages of infection . The Affymetrix Soybean Genome Array GeneChip contains 37,500 probesets from soybean plants and additionally 15,800 probesets from the oomycete Phythophthora sojae and 7,530 probesets from the soybean cyst nematode H. glycines, two of the most important soybean pathogens. The H. glycines sequences used for the GeneChip have been generated in the study presented here.
The completion of the C. elegans genome sequence  was a milestone for biology at large, but it especially set the stage for comparisons to other (for example, parasitic) nematode species and has ushered in an era of comparative genomics in nematology [25–29]. One question of particular interest is whether the dauer larva, a facultative stage in the free-living species C. elegans, is homologous to the obligate dauer stage in parasitic nematodes. Dauer larvae were first described  as an adaptation to parasitism to overcome adverse environmental conditions and facilitate dispersal, but have been best studied in C. elegans. Genetic analysis has revealed the pathway controlling entry to and exit from the dauer stage . This biochemical pathway, which is highly conserved across the animal kingdom, including humans , assesses and allocates energy resources to nematode development, ageing and fat deposition. The dauer pathway is primarily neuronally mediated, but presumably communicates with endocrine functions.
There is no strict definition of a 'dauer', but these larvae share the properties of being developmentally arrested, motile, non-feeding, non-ageing and hence long-lived [31, 33, 34]. Dauer stages have been well-documented for some plant-associated genera, including Anguina  and Bursaphelenchus , and it has been proposed that the infective stages of the sedentary endo-parasitic forms, including H. glycines, function as dauers . In addition to the developmental attributes of the dauer, H. glycines J2 exhibit detergent resistance , intestinal morphology with sparse luminal microvilli  and numerous lipid storage vesicles characteristic of C. elegans dauers.
The dauer larva stage in C. elegans has distinct metabolic hallmarks . Enzymes involved in the citrate cycle (with the exception of malate dehydrogenase) are less active in dauer larvae relative to adult C. elegans. Dauers show an increased level of phosphofructokinase activity and, therefore, glycolysis relative to adults . The citrate cycle is less active than the glyoxylate cycle in dauer larvae compared to adults, consistent with the important role of lipids in energy storage in the dauer stage . Also, heatshock protein 90 (Hsp90) is up-regulated fifteen-fold in dauer larvae relative to other stages , and superoxide dismutase and catalase activities show significant increases as well [43, 44]. Although it is widely assumed that the dauer pathway per se is utilized to regulate dauer entry/exit in various animal-parasitic [45, 46] and plant-parasitic species , little is known in these diverse species about the nature of the biochemistry that is regulated by the dauer pathways (that is, the effectors). Intriguingly, one experimental study in the human-parasitic nematode Strongyloides stercoralis  could not find clear evidence for a conserved dauer gene-expression signature, suggesting that the effectors of dauer biology might be diverged across nematode species.
However, a common feature among the dauer stage of C. elegans and the infective stage of parasitic nematode species seems to be the down-regulation of collagens, which make up a major portion of the nematode cuticle . Collagens share a high degree of sequence identity due to numerous repeats, but they are not functionally redundant and often are developmentally regulated [47–50]. Previous EST studies found just three collagen transcripts in the infective stage of Meloidogyne incognita  and none in the infective stage of S. stercoralis . In C. elegans, collagens could not be identified among dauer-specific transcripts .
Determining whether developmental arrest in C. elegans and parasitic nematodes like H. glycines is executed via the same mechanisms is a fundamental question of nematode biology. Of more than just academic interest, it may have important ramifications for potential control strategies that focus on the dauer pathway as a promising biochemical target to disrupt parasitic nematode life cycles. For example, it is very appealing to envision a strategy to induce dauer exit and concomitant resumption of ageing and development in the absence of a suitable host.
Here, we analyze and compare for the first time global gene-expression changes throughout all major life stages (eggs, infective J2, parasitic J2, J3, J4, virgin females) except adult males of a parasitic nematode and compare expression profiles to those of the model nematode C. elegans, with a particular focus on developmental arrest using EST and microarray data. Taken together, the sequence generation, sequence analyses and expression profiling work presented in this paper represent the most comprehensive and informative genomic resource available for the study of cyst nematode development and parasitism to date.
EST generation and sequence analysis
Life stage-specific (eggs, infective J2, J3, J4, virgin females) cDNA libraries of H. glycines, the soybean cyst nematode, were generated to provide templates for EST sequencing, totaling 20,100 5' ESTs or almost 10 million nucleotides (GC content 48.9%). Sequences from all five developmental stages were represented in approximately equal proportions (Table 1). In addition to these stage-specific libraries, 1,858 H. glycines sequences previously deposited in GenBank were included in the dataset for this study, bringing the total number of sequences analyzed here to 21,958. This dataset was used by Affymetrix (Santa Clara, CA, USA) to form 6,860 unique contigs (average length 552 nucleotides, average size 3 ESTs), which then were represented by 7,530 probesets on the Affymetrix Soybean Genome Array GeneChip (gene discovery rate 31%; 6,860/21,958). Of the 6,860 unique contigs, 3,499 consisted of only one EST, so-called singletons (16% of all ESTs analyzed). On the other extreme, contig number HgAffx.13905.2 was formed by 599 ESTs. Furthermore, the 40 contigs that contained the largest number of ESTs represented 8.3% of all ESTs studied (Table 2).
In order to determine sequence similarities of our contigs and in particular to identify genes that are conserved between different nematode species, we BLAST searched the 6,860 H. glycines contigs versus three databases (Figure 1). About half of the contigs (44%) matched sequences in at least one of these three databases at a threshold value of E = 1e-20. Examination of the BLAST match distribution revealed that 19% of the contigs that matched all three databases are most likely representing highly conserved genes involved in fundamental housekeeping processes in metazoans, while the 31% of contigs exclusively matching sequences in the cyst nematode database contained genes that likely are important for specific host adaptations of Heterodera spp.
When assessing BLAST hit identities, the cluster that contained the most ESTs (HgAffx.13905.2; 599 ESTs) belonged to a gene coding for a putative cuticular collagen. Identities of other highly represented contigs were actin, tropomyosin and myosin, as well as additional house-keeping genes like ribosomal components, ubiquitin, arginine kinase, synaptobrevin and heat shock proteins. Interestingly, four putative parasitism gene sequences from the esophageal gland cells were among the 40 contigs with the highest EST constituents: three of unknown function (AAO33474.1, AAL78212.1, AAP30835.1) [7, 8] and a β-1,4-endoglucanase-2 precursor.
We further determined which H. glycines genes showed the highest degree of conservation when compared to C. elegans. BLASTX searches of all 6,860 soybean cyst nematode contigs against the Wormpep database revealed that 34.9% matched C. elegans entries at a threshold value of E = 1e-20. The products of the 25 most conserved genes were heat shock proteins, proteins related to transcription and translation (for example, elongation and splicing factors and RNA polymerase II) and structural proteins, including tubulin and actin, as well as enzymes, including guanylate cyclase (Additional data file 3).
A survey and functional classification of developmentally regulated genes
In order to identify H. glycines genes that are developmentally regulated and to document their expression profiles, we designed a microarray experiment using three complete and independent biological replications (that is, three independent sample series representing three complete life cycles). We identified 6,695 probesets (Additional data file 4) as described in Materials and methods that were differentially expressed with a false discovery rate (FDR) of 5% when observed over the entire life cycle of H. glycines. This group of probesets equals 89% of all H. glycines probesets on the microarray. In other words, the vast majority of H. glycines genes represented on the GeneChip significantly changed expression during the nematode life cycle. We then grouped these 6,695 probesets into 10 clusters based on their expression profiles (Figure 2). As an exemplary gene family, we analyzed the expression pattern of FMRF (Phe-Met-Arg-Phe-NH2)-related neuropeptide (FaRP)-encoding genes. This group encodes a specific class of neuronally expressed tetrapeptides that are potent myoactive transmitters in nematode neuromusculature [52–56], which are expressed in motor neurons that act on body wall muscle cells [57–59]. Based on our BLAST searches against various databases as detailed above, we identified five probesets for genes encoding FaRPs (HgAffx.23446.1.S1_at, HgAffx.23636.1.S1_at, HgAffx63.1.S1_at, HgAffx20469.1.S1_at, HgAffx.24161.1.S1_at). All five probesets were co-expressed with each other and showed an expression peak in the infective J2 stage (Additional data file 1). These FaRP probesets were differentially expressed when observed over the entire life cycle of the nematode, and, with the exception of HgAffx.20469.1.S1_at, which was found in cluster 4, all probesets were grouped in cluster 7 (Figure 2). The general profile of cluster 4 showed an expression peak in infective J2 and fell steadily in later life stages, while cluster 7 demonstrated the same overall pattern but showed a more pronounced increase from egg to infective J2.
Furthermore, we formed expression clusters for all 15 possible pairwise comparisons of all six life stages under study, as well as for comparisons of groups of life stages, that is: all pre-penetration (egg, infective J2) versus all post-penetration (parasitic J2, J3, J4, virgin females) life stages; and motile (pre-penetration J2) versus all non-motile parasitic (parasitic J2, J3, J4, virgin females) life stages. A summary displaying the number of probesets showing differential expression (FDR 5%) in these comparisons is given in Table 3.
We used InterProScan  to conduct functional classification for all 6,860 contigs in all expression clusters of each comparison. The relative abundance of the 25 InterPro domains with the highest representation in each of the 10 clusters for contigs that showed differential expression (FDR 5%) throughout the entire life cycle is summarized in Additional data file 5. While most clusters contained a wide range of genes represented by diverse InterPro domains, collagen domains stood out, in that they accumulated in cluster 2 at a high frequency relative to other InterPro domains. Since it has been suggested that down-regulation of collagens might be a common feature in dauer and infective stages of nematodes , we analyzed the expression profiles of H. glycines collagens in more detail. Using a reciprocal BLAST strategy as described in Materials and methods, we identified eight H. glycines probesets representing seven unique contigs orthologous to C. elegans collagens (Table 4). The temporal expression pattern of these seven orthologs was very similar (Figure 3) and congruent with observations in other nematode species [14, 51], which supports the hypothesis that down-regulation of collagen transcription is a conserved characteristic of non-molting infective and dauer-stage nematodes .
Heterodera glycines orthologs of dauer-enriched C. elegansgenes are more likely to be down-regulated upon transition from infective J2 to parasitic J2 and J3 than other genes
In addition to providing a comprehensive gene characterization and expression resource, we wished to demonstrate the applicability and power of our data by addressing the question of whether the infective J2 stage of H. glycines is biochemically analogous to the C. elegans dauer larva stage. We compiled a list of 1,839 C. elegans genes that were identified by Wang and Kim  as so-called dauer-regulated genes by conducting microarray experiments comparing gene expression in C. elegans larvae that were in transition from dauer to non-dauer with that of freshly fed L1 larvae that had been starved. Dauer-regulated genes showed significant expression changes during a dauer exit time course that were not related to the introduction of food . Reciprocal BLAST searches resulted in the identification of 438 H. glycines probesets that could be categorized unambiguously as orthologs of these C. elegans dauer-regulated genes (Additional data file 6). Because of the deliberate redundancy of the Affymetrix GeneChip, these 438 probesets corresponded to 396 unique H. glycines gene predictions. In other words, we identified H. glycines orthologs for 22% of the 1,839 C. elegans dauer-regulated genes (396/1,839).
We also compiled a list of H. glycines genes that are orthologous to the 488 C. elegans gene subset of the C. elegans dauer-regulated genes that Wang and Kim  determined to be up-regulated during the dauer stage and down-regulated upon dauer exit, a group that was called dauer-enriched. These genes presumably define dauer-specific properties, including stress resistance and longevity. These dauer-enriched genes were of particular interest to us because up-regulation of orthologous genes in the H. glycines infective J2 stage would suggest involvement in developmental arrest of these genes not only in C. elegans, but also in H. glycines. Using the same reciprocal BLAST search strategy, we identified 74 H. glycines probesets corresponding to 69 unique H. glycines contigs or genes that are orthologous to 57 unique C. elegans dauer-enriched genes (Table 5), which represent 14%.
To test whether the frequency of H. glycines orthologs to C. elegans dauer-regulated and dauer-enriched genes is similar to that of other, randomly chosen genes, we randomly selected 1,000 C. elegans proteins from the Wormpep database (v. 157) and repeated the reciprocal BLAST searches. In these searches, we identified 159 unique H. glycines contigs that fulfilled our criteria (data not shown). In other words, 16% of these randomly selected C. elegans genes have H. glycines orthologs. These analyses showed that C. elegans dauer-regulated genes have a slightly higher frequency (22%) of having orthologs in H. glycines than either dauer-enriched (14%) or random (16%) genes, both having about the same rate.
Following the identification of H. glycines orthologs for C. elegans dauer-regulated and dauer-enriched genes, we clustered these genes according to their expression profiles throughout the life cycle. Clustering the 438 probesets for the dauer-regulated orthologs led to their placement into nine groups (Additional data file 2), while clustering of the 74 H. glycines probesets for dauer-enriched gene orthologs resulted in seven distinct groups (Figure 4). It is obvious that not all H. glycines dauer-enriched orthologs were down-regulated from infective J2 to parasitic J2. Indeed, only 41% out of 74 probesets were significantly down-regulated, whereas 22% were up-regulated and 38% did not exhibit a statistically significant change in expression. Similarly, when comparing the infective J2 stage with the J3 stage of H. glycines, only 47% out of 74 probesets were down-regulated. Twenty-two percent were up-regulated, and 31% did not exhibit a statistically significant change in expression. In other words, in both comparisons, the majority of H. glycines genes that are orthologous to C. elegans genes down-regulated upon dauer exit were up-regulated or did not exhibit a statistically significant change in expression.
To determine whether H. glycines genes orthologous to C. elegans dauer-enriched genes behave differently from other H. glycines genes, we compared the dauer-enriched H. glycines orthologs with the entire set of 7,530 H. glycines probesets on the Affymetrix Soybean Genome Array, as well as to the 159 H. glycines genes that we determined to be orthologous to 1,000 randomly chosen C. elegans genes. We found that out of 7,530 H. glycines probesets, 19% were down-regulated when infective J2 are compared to parasitic J2, 21% were up-regulated and 60% did not exhibit a statistically significant change. Similarly, when infective J2 are compared to J3, 26% of all probesets were down-regulated, 20% were up-regulated and 54% did not exhibit a statistically significant change. The 159 H. glycines genes that are orthologous to 1,000 randomly chosen C. elegans genes are represented by 181 probesets on the Affymetrix GeneChip. Of those 181, 27% were down-regulated from infective J2 to parasitic J2, 24% were up-regulated and 50% did not exhibit a statistically significant change. When infective J2 were compared to J3, 26% out of 181 were down-regulated, 20% up-regulated and 54% did not exhibit a statistically significant change. We used a Fisher's exact test  to examine whether the proportion of down-regulated genes among the set of dauer-enriched H. glycines orthologs was significantly different from all other genes on the array or from the H. glycines probesets that were orthologous to the 1,000 randomly chosen C. elegans genes, respectively. We found that the observed differences between the proportions of down-regulated probesets between dauer-enriched H. glycines orthologs and the entire set of probesets on the microarray were significant at the 0.05 level in comparisons of both infective J2 versus parasitic J2 (P = 0.000015) and infective J2 versus J3 (P = 0.007160). Similarly, the differences between dauer-enriched H. glycines orthologs and the H. glycines probesets orthologous to random C. elegans genes were significant for comparisons of infective J2 versus parasitic J2 (P = 0.0219190) and for infective J2 versus J3 (P = 0.0094261). If a Bonferroni correction is used to control the overall type I error rate for this family of four tests, all comparisons would remain significant at the 0.05 level except the comparison between dauer-enriched H. glycines orthologs and the H. glycines probesets orthologous to random C. elegans genes for infective J2 versus parasitic J2. In other words, while the majority of H. glycines genes that are orthologous to C. elegans dauer-enriched genes was not down-regulated upon transition to parasitic J2 or J3, the proportion of H. glycines orthologs that were in fact down-regulated was statistically significantly enriched about two times compared to all H. glycines genes on the microarray or to orthologs to random C. elegans genes.
The identities of H. glycines genes that followed the expression pattern of their dauer-enriched C. elegans orthologs (that is, they were down-regulated upon transition to infective J2 or J3) reflect a wide range of effector functions and biochemical pathways, including peptidases, epoxide and glycoside hydrolases, phosphate transporters and neuropeptide-like proteins. H. glycines genes that did not follow the C. elegans pattern of expression (that is, they were not down-regulated) span an equally diverse group of genes and include carbohydrate kinase, catalase and glutathione peroxidase (Table 5).
Metabolism in C. elegans dauer larvae and H. glycinesinfective J2 is dissimilar
To investigate whether the infective J2 stage in H. glycines shows an expression profile of metabolic pathway genes similar to that of C. elegans dauer larvae, we conducted a BLAST search (threshold E = 1e-20) against the Wormpep database (v. 152) to search for Affymetrix probesets coding for H. glycines enzymes active in the citrate cycle, glycolysis and other pathways that undergo marked changes during the dauer state . We identified 37 probesets coding for 24 proteins active in six different pathways (Table 6). We then compared the expression levels of these H. glycines probesets in the assayed H. glycines life stages and determined differential expression (FDR 5%). While phosphofructokinase has been found to be up-regulated in dauer larvae relative to adults in C. elegans , we could not find differential expression between infective J2 and adult females in H. glycines. The citrate cycle is down-regulated in the C. elegans dauer stage and active at a lower level than the glyoxylate pathway [40, 41]. In H. glycines, out of eight genes for citrate cycle enzymes found, all but one (fumarase) showed differential expression in at least one out of three stage-by-stage comparisons (egg/infective J2, infective J2/feeding J2, infective J2/female). However, while pyruvate dehydrogenase and nucleoside diphosphate kinase were down-regulated in infective J2 (which supports similar metabolic patterns in H. glycines J2 and C. elegans dauer larvae), isocitrate dehydrogenase, citrate synthase, succinyl-CoA synthetase and succinate dehydrogenase were up-regulated in this stage when compared to the other life stages tested (which points to significant differences between H. glycines and C. elegans). The gene encoding malate dehydrogenase was up-regulated in infective J2, which is concordinant with observations of high malate dehydrogenase enzyme activity in C. elegans dauer larvae relative to adults. Of genes encoding three enzymes of the glyoxylate pathway, two (citrate synthase and malate dehydrogenase) were differentially expressed between infective J2 and eggs, feeding J2 or adult females. Both enzymes are shared with the citrate cycle. Even though both citrate synthase and malate dehydrogenase transcripts were up-regulated in infective J2, their expression level did not support observations of a higher activity of the glyoxylate pathway, as described for C. elegans dauer larvae  in infective J2 when compared to other citrate cycle enzymes. Apart from these pathways, the dauer stage in C. elegans is known to have high expression levels of DAF-21/Hsp90, superoxide dismutase and catalase to counter environmental stressors . In H. glycines, none of these dauer metabolism hallmarks could be confirmed. The gene encoding Hsp90 was differentially expressed in the infective J2 versus parasitic J2 comparison and was down-regulated in infective J2. The superoxide dismutase gene was differentially expressed and down-regulated in infective J2 when compared to eggs, parasitic J2 and adult females. Catalase-2 was differentially expressed when compared to eggs (up-regulated in infective J2) and females (down-regulated in infective J2), and catalase-3 was differentially expressed and down-regulated in infective J2 when compared to eggs. In summary, we conclude from these data that the physiological and biochemical landscape of developmentally arrested C. elegans dauer larvae must be different from that of developmentally arrested H. glycines infective J2.
Validation of microarray results by quantitative RT-PCR
Quantitative real-time reverse transcription PCR (qRT-PCR) was used to validate selected microarray results. We analyzed the expression patterns of six genes representing different expression patterns for each of the five consecutive pairs of life stages (egg/infective J2, infective J2/parasitic J2, parasitic J2/J3, J3/J4, J4/female), giving a total of 30 different genes. The qRT-PCR template was the same biological material used for hybridization of the microarrays, and the reactions were performed in technical triplicates. Additional data file 7 shows that there is qualitative agreement between our microarray approach and qRT-PCR for 28 out of 30 genes.
This study describes the generation of the most exhaustive EST collection of the soybean cyst nematode H. glycines to date and the analyses and characterization of this EST collection. The Affymetrix Soybean Genome Array GeneChip contains a significant portion of probesets for H. glycines genes and became a possibility only because of the extensive EST collection generated in this project. This GeneChip now is the first commercially available microarray for a nematode other than C. elegans. We analyzed the expression profiles of all 6,860 genes throughout all major developmental stages, excluding the adult male. Furthermore, we classified all genes by predicted function and conducted stage-wise comparisons to identify differentially expressed genes. Finally, these data now represent a resource for any molecular project targeting H. glycines, and we have demonstrated the versatility of this genomics resource by advancing our understanding of arrested development in the infective stage.
It has been proposed that the C. elegans genome can serve as a guide to examine aspects of the biology of other nematode species, particularly those that are parasitic , and we have shown that this comparative genomics approach has great power. In particular, we examined whether the biochemistry underpinning the developmentally arrested, infective J2 stage of H. glycines is functionally analogous to that of the dauer stage in C. elegans. For this purpose, we exploited published microarray expression data obtained from C. elegans during dauer exit . These down-regulated genes, termed 'dauer-enriched,' exhibit high mRNA abundance during the dauer stage. We asked if these genes were conserved in H. glycines, both in sequence and expression pattern. While our reciprocal BLAST searches suggest that a portion of C. elegans dauer-enriched genes is indeed conserved in the H. glycines genome with about the same frequency as randomly selected C. elegans genes, we did not find that H. glycines orthologs of C. elegans dauer-enriched genes are uniformly down-regulated upon transition to the parasitic J2 or J3 stages. While in C. elegans dauer-enriched genes are down-regulated upon dauer exit , we found that only 41% of their H. glycines orthologs are down-regulated upon transition to the feeding J2 stage, which we hypothesize is a developmental transition equivalent to dauer exit in C. elegans. Nevertheless, H. glycines dauer-enriched orthologs are more likely to be down-regulated than: all genes represented on the microarray; and H. glycines orthologs for randomly selected C. elegans genes. In other words, while our data do not support the idea of a broadly conserved gene expression signature between the dauer stage in C. elegans and infective J2 in H. glycines, they indicate that dauer-enriched orthologs are more likely to share a common expression profile than other genes. A similar preliminary observation was made in an EST study  that compared the infective stage of the human-parasitic nematode S. stercoralis and dauer-specific transcripts that were identified in serial analysis of gene expression (SAGE) in C. elegans . Mitreva et al.  found that dauer-specific genes were conserved in S. stercoralis, but that there was no evidence of a broadly conserved expression signature. However, in this study, we were able to compare our exhaustive microarray data for H. glycines with microarray data for C. elegans, which enables us to draw more compelling conclusions regarding dauer-regulated genes in both species.
We further compared the expression of metabolic pathway genes of C. elegans dauer larvae with infective J2 of H. glycines. Our data for H. glycines genes whose products are active in the glyoxylate pathway or citrate cycle, both of which undergo marked gene expression changes in C. elegans dauer larvae, as well as for genes encoding Hsp90 or superoxide dismutase, show dramatic differences between H. glycines infective J2 and C. elegans dauer larvae. Our findings suggest that the C. elegans dauer larva and the H. glycines infective J2 do not share similar expression profiles of metabolic pathway genes.
Although based upon inferences from transcript levels, our data point to striking differences in the underlying biochemistry and physiology of developmentally arrested and recovering C. elegans dauers and H. glycines J2. Does the phenomenon of developmental arrest in these stages therefore reflect a substantially different regulatory pathway, or does it reflect life-style differences between the parasitic and free-living species? Until the genome of H. glycines is complete, it will not be possible to determine if more putative orthologs of the daf and sma genes that compose the C. elegans dauer pathway are present than we were able to identify here. Many of these regulators of dauer entry/exit  are typically expressed at low levels, and thus are not common in EST sets.
Numerous nematode species are carried passively to new habitats as eggs and achieve active dispersion as dauer larvae or infective stage. This active dispersal stage in many nematodes is thought to have evolved from a common ancestral stage, and it has been assumed that there has been sufficient conservation of gene expression over time that the expression patterns in different species would appear similar. Our findings of marked differences in gene expression between C. elegans dauer larvae and H. glycines infective J2 could mean that these two developmentally arrested stages have evolved independently from each other and that all apparent similarities are based on convergent evolution. Alternatively, and we believe more likely, is that these two stages could in fact have a common origin but molecular evolution could have been sufficiently fast that any broadly conserved gene expression patterns would have been lost since the last common ancestor. Perhaps the best example of rapid diversification of genetic and biochemical processes underlying an analogous biological process comes from comparisons of vulval induction in C. elegans and Pristionchus pacificus . In both species, vulva induction occurs post-embryonically via an identical cellular process (inductive signaling to P5.p, P6.p and P7.p cells from the anchor cell). Despite the obvious homology of these processes, the underlying regulation is strikingly distinct . Similarly, other studies have demonstrated that gene expression patterns between mouse and human, which split only about 25 million years ago, changed rapidly [64–66].
Concerning the C. elegans dauer stage and potentially analogous stages in parasitic nematodes, it will be interesting to define the mechanisms that lead to developmental arrest in H. glycines and to compare them further to processes in C. elegans. The comprehensive dataset generated here will be a valuable resource for the field of nematology and sets the stage for many more comparative studies. In particular, a comprehensive analysis of H. glycines parasitism-associated gene expression profiles has been conducted by these authors and will be published later.
Our data indicate that expectations of a conserved phylum-wide dauer expression signature shared across nematodes may not be realistic. Such general expectations severely underestimated the degree to which expression profiles change and should be replaced by careful analyses of dauer-like stages of closely related nematode species instead. For example, one could ask whether the dauer expression profiles of C. elegans and Caenorhabditis briggsae are the same or whether the expression profiles of H. glycines infective J2 and M. incognita infective J2 are conserved.
Materials and methods
Nematode cultivation, cDNA library generation, sequencing and clustering
H. glycines strain OP-50  was cultivated under greenhouse conditions and isolated as described previously . Unidirectional Uni-Zap lambda libraries (Stratagene, La Jolla, CA, USA) were generated for H. glycines strain OP-50 eggs, infective J2, J3, J4 and virgin females. The mRNA concentration used ranged between 1.0 μg (J3) and 4.6 μg (eggs). The libraries were sequenced at the Washington University Medical School Genome Sequencing Center (St Louis, MO, USA), and the sequencing results were deposited in GenBank. Contigs were formed by Affymetrix for the design of the H. glycines group of probesets of the Affymetrix Soybean Genome Array GeneChip and all consensus sequences and contig size details can be accessed at Affymetrix .
Nematode cultivation for microarray experiments
For each replication, 40 pots with 10 seeds each of Kenwood 94 soybeans were planted in a 2:1 sand:soil mixture in the greenhouse. Two weeks after planting, each pot, containing an average of 7 to 8 germinated seedlings, was inoculated with 15,000 to 20,000 H. glycines strain OP-50  infective J2. The inoculum was collected by setting up two hatch chambers, each containing about two million H. glycines OP-50 eggs, and allowing the eggs to hatch for four days. From the same batch of eggs used in the hatch chamber, 50,000 eggs were collected and flash frozen in liquid nitrogen for use as the egg stage of the replication. After 4 days, the hatched infective J2 were collected and counted, and an aliquot of 50,000 larvae was flash frozen in liquid nitrogen for use as the infective J2 stage of the replication. The remainder of hatched infective J2 larvae was divided up among the 40 pots for seedling inoculation. Four days after infection, 12 pots were collected, and the soil was washed away from the root systems of these pots to isolate the parasitic J2 stage. Eight days after infection, another 12 pots were harvested for collection of J3 juveniles, and, 14 days after infection, a further 10 pots were used to isolate J4 juveniles. Finally, 21 days after infection, the final 6 pots were harvested for collection of adult females. These stages were isolated as published previously . All stages were divided in about 30 mg aliquots in 1.5 ml screw cap tubes, flash frozen in liquid nitrogen and stored at -80°C.
RNA extraction for microarray experiments and GeneChip procedures
Frozen (-20°C) 1.0 mm zirconia beads (BioSpec Products, Bartlesville, OK, USA) were added to frozen nematode tissue in a 1.5 ml screw cap tube in a 1:1 tissue:bead ratio. RNA was isolated using the Versagene kit from Gentra Systems (Minneapolis, MN, USA). On ice, 800 μl lysis buffer was mixed with 8 μl tri 2-carboxyethyl phosphine (TCEP), and 50 μl was added to one screw cap tube with about 30 mg nematode tissue. Tissue was homogenized in a beadbeater (BioSpec Products) at setting 48 for 10 s and chilled on ice for 60 s. This step was repeated three times. Two more tubes for the same life stage were treated in the same way. The suspension was collected by pipetting and transferred into a new tube. Beads were rinsed with remaining buffer, and the standard Versagene protocol was followed from then on. For all stages and all repetitions, we obtained 113 to 320 mg frozen tissue and 7.85 to 173.38 μg total RNA. The RNA concentration and quality of each sample were determined by a NanoDrop spectrophotometer (NanoDrop Technologies, Wilmington, DE, USA) and by RNA Nanochip on a 2100 Bioanalyzer (Agilent Technologies Inc., Palo Alto, CA, USA). RNA was submitted to the Iowa State University GeneChip Facility, where standard procedures recommended by Affymetrix were followed for reverse transcription and labeling of the probes and for hybridization and scanning of the GeneChips.
Experimental design of microarray experiments and GeneChip data analysis
Expression was measured using a total of 18 Affymetrix Soybean Genome Array GeneChips (three replications × six life stages) using a randomized complete block design with replications as blocks. Prior to performing the analysis, the Affymetrix signal data were transformed using the natural log (ln) and normalized by median centering (that is, the median ln signal from each particular chip was subtracted from all ln signals on the chip). The normalized data for each gene were analyzed separately using a standard linear model with fixed effects for replications and stages. F tests, resulting in p-values, were performed to test for a difference in expression between the life stages for each probeset. A q-value was computed for each p-value using the method described by Storey and Tibshirani . These q-values can be used to identify differential expression while maintaining approximate control of the FDR. For example, FDR is controlled at approximately 5% if the sets of tests with q-values at or below 0.05 are declared significant. See  for a discussion of linear modeling and FDR control in the context of plant microarray experimentation.
Clustering was used to organize and summarize the observed expression patterns of differentially expressed genes. For each probeset, the mean normalized expression level was estimated for all six life stages. The six estimated values were standardized to have mean 0 and standard deviation 1 within each probeset. The Euclidian distance between any pair of standardized expression profiles was used as a measure of dissimilarity in all clustering algorithms. This approach considers genes with similar expression patterns to be close in six-dimensional space and is equivalent to using (1-r)0.5 as the measure of dissimilarity, where r is the Pearson correlation coefficient between non-standardized expression profiles.
K-medoids clustering  was performed on those probesets with the 1,000 lowest p-values (< 0.0000143) for the overall test for expression change across the life cycle. Using the Gap statistic , we determined the number of clusters to be 10. Probesets with q-values less than 0.05 were added to these 10 base clusters to produce the clustering depicted in Figure 2. All other clusters were produced using hierarchical agglomerative clustering, using average linkage to measure the dissimilarity between clusters.
All clusters and related figures were generated using the free open-source statistical software package R . Hierarchical clustering was carried out using the R function hclust; K-medoids clustering was carried out using the function pam from the R cluster library.
BLAST searches of H. glycinescontigs
We built three nematode-specific nucleotide databases as follows: cyst nematodes (13,643 sequences) - all sequences from the GenBank query "Globodera [ORGN] or Heterodera [ORGN] not Heterodera glycines [ORGN]"; non-cyst nematodes (933,882 sequences) - results of the GenBank query "Nematoda [ORGN] not Globodera [ORGN] not Heterodera [ORGN]"; and non-nematodes (3,607,410 sequences) - GenBank 'nt' database (nucleotide version of 'nr') minus all Nematoda [ORGN] sequences. The BLASTN parameters were set to expect a 75% target frequency as follows: M = 1, N = -1, Q = 3, R = 3, B = 10, V = 10, lcmask, golmax = 10, topcomboN = 1, filter = seg. The following parameters were used for BLASTX searches of all 6,860 contigs against non-redundant GenBank (downloaded 29 November 2005) and Wormpep v. 152: filter = seg, lcfilter, W = 4, T = 20, E = 100, B = 25, V = 25, topcomboN = 1, golmax = 10.
InterProScan was run using InterPro data files  dated November 2005 (iprscan_PTHR_DATA_12.0.tar). InterProScan translated all 6,860 unique contig sequences in six frames and then ran its suite of domain-finding tools. We required a minimum translation length of 20 amino acids to be considered by InterProScan, and we used the EGC.0 translation table. Due to the six-frame translation, each contig typically had several alignments amongst the significant open reading frame (ORF) found in the translation. We kept, as representative of each contig, the single longest aligning ORF that contained an InterPro domain, even though InterProScan may have found several ORFs for each contig with alignments to some domain or motif. Results were parsed into files representing expression clusters.
Identification of C. elegansdauer and collagen orthologs
A previous microarray study identified 1,984 so-called dauer-regulated and 540 dauer-enriched genes in C. elegans . We manually compiled a list of 1,839 dauer-regulated and 488 dauer-enriched C. elegans genes using these data (the remaining genes were either deleted or merged with other genes in Wormbase). To isolate H. glycines orthologs, we conducted BLAST searches between these C. elegans genes and all 7,530 H. glycines probeset nucleotide sequences (threshold values 1e-15, bit score at least 50, identity at least 30%). We then conducted an additional BLAST search between the H. glycines probesets that passed those criteria and the entire Wormpep (v. 159) database using the same cutoff criteria as above. Only those ortholog pairs in which the C. elegans dauer-regulated or dauer-enriched hit was either the best one compared to C. elegans hits of the entire Wormpep database or within 5% of the best hit's bit score were kept. These searches generated a list of 438 H. glycines probesets orthologous to C. elegans dauer-regulated genes and 74 H. glycines probesets orthologous to C. elegans dauer-enriched genes.
To identify collagen orthologs, we downloaded the sequences for collagens listed at the Sanger Institute web site  and followed basically the same BLAST strategy, with the exception that we required a higher stringency with a bit score of at least 100.
The transcript abundance of 30 differentially expressed cDNA clones was analyzed by qRT-PCR to confirm microarray results. Gene-specific primers were designed, and the sequences are shown in Additional data file 7. DNase-treated RNA (10 ng) was used for cDNA synthesis and PCR amplification using an iScript One-Step RT-PCR kit (BIO-RAD, Hercules, CA, USA) according to manufacturer's protocol. The PCR reactions were performed using an iCycler (BIO-RAD) under the following conditions: 50°C for 10 minutes, 95°C for 5 minutes and 40 cycles of 95°C for 30 s and 60°C for 30 s. Following PCR amplification, the reactions were subjected to temperature ramp to create the dissociation curve, measured as changes in fluorescence as a function of temperature, by which the non-specific products can be detected. The dissociation program was 95°C for 1 minute, 55°C for 10 s, followed by a slow ramp from 55°C to 95°C. Three replicates of each reaction were performed, and constitutively expressed Actin 1 (AF318603) was used as internal control to normalize gene expression levels. Quantifying the relative changes in gene expression was performed using the 2-ΔΔCTmethod as described by Livak and Schmittgen .
The Affymetrix Soybean Genome Array GeneChip raw and normalized data files were deposited in the ArrayExpress database  under accession number E-MEXP-1110. This database is MIAME-compliant.
GenBank accession numbers for EST generated in this study: BF013452-BF014867, BF249436-BF249525, BG310659-BG310919, BI396450-BI397002, BI451481-BI451765, BI704104-BI704170, BI749028-BI749665, BI773548-BI773559, CA939105-CA940989, CB238504-CB238733, CB238735-CB238737, CB238740, CB238743-CB238746, CB238750-CB238751, CB238755-CB238759, CB238763, CB238766-CB238769, CB238772-CB238774, CB238776-CB238778, CB238780-CB238782, CB238784, CB238788, CB238790-CB238793, CB278389-CB280465, CB281104-CB281884, CB299031-CB299936, CB373779-CB376363, CB377726-CB380382, CB824093-CB826589, CB934836-CB935636, CD747803-CD749135.
Additional data files
The following additional data are available with the online version of this paper. Additional data file 1 is a figure showing the temporal expression pattern of H. glycines probesets for FaRP-encoding genes. Additional data file 2 is a figure showing temporal expression patterns and clusters of 438 H. glycines probesets orthologous to C. elegans dauer-regulated genes. Additional data file 3 is a table listing the 25 most conserved H. glycines contigs compared to C. elegans. Additional data file 4 is a table of identities and cluster membership of differentially expressed (FDR 5%) probesets over the entire life cycle. Additional data file 5 is a table listing the 25 most abundant InterPro domains for the 10 expression clusters for H. glycines genes that are differentially expressed over the entire life cycle. Additional data file 6 is a table listing H. glycines probesets orthologous to dauer-regulated C. elegans genes and their cluster memberships. Additional data file 7 is a table listing qRT-PCR results, oligonucleotide primers used and probeset identities. Additional data file 8 provides transformed and normalized Affymetrix probeset mean data for each probeset in each life stage and q values for each tested combination of life stages as described in Materials and methods.
expressed sequence tag
FMRF (Phe-Met-Arg-Phe-NH2)-related neuropeptide
false discovery rate
open reading frame
Quantitative real-time PCR.
Wrather JA, Stienstra WC, Koenning SR: Soybean disease loss estimates for the United States from 1996 to 1998. Can J Plant Pathol. 2001, 23: 122-131.
Endo BY: Penetration and development of Heterodera glycines in soybean roots and related anatomical changes. Phytopathol. 1964, 54: 79-88.
Lilley CJ, Atkinson HJ, Urwin PE: Molecular aspects of cyst nematodes. Mol Plant Pathol. 2005, 6: 577-588.
Smant G, Stokkermans JPWG, Yan Y, De Boer JM, Baum TJ, Wang X, Hussey RS, Gommers FJ, Henrissat B, Davis EL, et al: Endogenous cellulases in animals. Isolation of β-1,4-endoglucanase genes from two species of plant-parasitic cyst nematodes. Proc Natl Acad Sci USA. 1998, 95: 4906-4911.
Davis EL, Hussey RS, Baum TJ, Bakker J, Schots A, Rosso MN, Abad P: Nematode parasitism genes. Annu Rev Phytopathol. 2000, 38: 365-396.
Popeijus H, Overmars H, Jones J, Blok V, Goverse A, Helder J, Schots A, Bakker J, Smant G: Degradation of plant cell walls by a nematode. Nature. 2000, 406: 36-37.
Gao B, Allen R, Maier T, Davis EL, Baum TJ, Hussey RS: Identification of putative parasitism genes expressed in the esophageal gland cells of the soybean cyst nematode Heterodera glycines. Mol Plant Microbe Interact. 2001, 14: 1247-1254.
Gao B, Allen R, Maier T, Davis EL, Baum TJ, Hussey RS: The parasitome of the phytonematode Heterodera glycines. Mol Plant Microbe Interact. 2003, 16: 720-726.
Davis EL, Hussey RS, Baum TJ: Getting to the roots of parasitism by nematodes. Trends Parasitol. 2004, 20: 134-141.
Qin L, Kudla U, Roze EH, Goverse A, Popeijus H, Nieuwland J, Overmars H, Jones JT, Schots A, Smant G, et al: Plant degradation: a nematode expansin acting on plants. Nature. 2004, 427: 30-
Vanholme B, De Meutter J, Tytgat T, Van Montagu M, Coomans A, Gheysen G: Secretions of plant-parasitic nematodes: a molecular update. Gene. 2004, 332: 13-27.
Baum TJ, Hussey RS, Davis EL: Root-knot and cyst nematode parasitism genes: the molecular basis of plant parasitism. Genet Eng (N Y). 2007, 28: 17-43.
Elling AA, Davis EL, Hussey RS, Baum TJ: Active uptake of cyst nematode parasitism proteins into the plant nucleus. Int J Parasitol. 2007, 37: 1269-1279.
McCarter JP, Mitreva MD, Martin J, Dante M, Wylie T, Rao U, Pape D, Bowers Y, Theising B, Murphy CV, et al: Analysis and functional classification of transcripts from the nematode Meloidogyne incognita. Genome Biol. 2003, 4: R26-
Mitreva M, Elling AA, Dante M, Kloek AP, Kalyanaraman A, Aluru S, Clifton SW, Bird DM, Baum TJ, McCarter JP: A survey of SL1-spliced transcripts from the root-lesion nematode Pratylenchus penetrans. Mol Genet Genomics. 2004, 272: 138-148.
Vanholme B, Mitreva M, Van Criekinge W, Logghe M, Bird D, McCarter JP, Gheysen G: Detection of putative secreted proteins in the plant-parasitic nematode Heterodera schachtii. Parasitol Res. 2006, 98: 414-424.
Alkharouf NW, Klink VP, Matthews BF: Identification of Heterodera glycines (soybean cyst nematode [SCN]) cDNA sequences with high identity to those of Caenorhabditis elegans having lethal mutant or RNAi phenotypes. Exp Parasitol. 2007, 115: 247-258.
Hermsmeier D, Mazarei M, Baum TJ: Differential display analysis of the early compatible interaction between soybean and soybean cyst nematode. Mol Plant Microbe Interact. 1998, 11: 1258-1263.
Hermsmeier D, Hart JK, Byzova M, Rodermel SR, Baum TJ: Changes in mRNA abundance within Heterodera schachtii -infected roots of Arabidopsis thaliana. Mol Plant Microbe Interact. 2000, 13: 309-315.
Puthoff DP, Nettleton D, Rodermel SR, Baum TJ: Arabidopsis gene expression changes during cyst nematode parasitism revealed by statistical analyses of microarray expression profiles. Plant J. 2003, 33: 911-921.
Alkharouf N, Khan R, Matthews B: Analysis of expressed sequence tags from roots of resistant soybean infected by the soybean cyst nematode. Genome. 2004, 47: 380-388.
Khan R, Alkharouf N, Beard H, MacDonald M, Chouikha I, Meyer S, Grefenstette J, Knap H, Matthews B: Microarray analysis of gene expression in soybean roots susceptible to the soybean cyst nematode two days post invasion. J Nematol. 2004, 36: 241-248.
Ithal N, Recknor J, Nettleton D, Hearne L, Maier T, Baum TJ, Mitchum MG: Parallel genome-wise expression profiling of host and pathogen during soybean cyst nematode infection of soybean. Mol Plant Microbe Interact. 2007, 20: 293-305.
The C. elegans Sequencing Consortium: Genome sequence of the nematode C. elegans : a platform for investigating biology. Science. 1998, 282: 2012-2018.
Blaxter ML, De Ley P, Garey JR, Liu LX, Scheldeman P, Vierstraete A, Vanfleteren JR, Mackey LY, Dorris M, Frisse LM, et al: A molecular evolutionary framework for the phylum Nematoda. Nature. 1998, 392: 71-75.
Bird DM, Opperman CH, Jones SJ, Baillie DL: The Caenorhabditis elegans genome: a guide in the post genomics age. Annu Rev Phytopathol. 1999, 37: 247-265.
Mitreva M, McCarter JP, Martin J, Dante M, Wylie T, Chiapelli B, Pape D, Clifton SW, Nutman TB, Waterston RH: Comparative genomics of gene expression in the parasitic and free-living nematodes Strongyloides stercoralis and Caenorhabditis elegans. Genome Res. 2004, 14: 209-220.
Parkinson J, Mitreva M, Whitton C, Thomson M, Daub J, Martin J, Schmid R, Hall N, Barrell B, Waterston RH, et al: A transcriptomic analysis of the phylum Nematoda. Nat Genet. 2004, 36: 1259-1267.
Mitreva M, Blaxter ML, Bird DM, McCarter JP: Comparative genomics of nematodes. Trends Genet. 2005, 21: 573-581.
Fuchs AG: Die Naturgeschichte der Nematoden und einiger anderer Parasiten. I. Des Ips typographus L. 2. Des Hylobius abietis L. Zoologische Jahrbücher, Abteilung für Systematik Ökologie und Geographie der Tiere, Jena. 1915, 38: 109-122.
Riddle DL, Albert PS: Genetic and environmental regulation of dauer larva development. C. elegans II. Edited by: Riddle DL, Blumenthal T, Meyer BJ, Priess JR. 1997, Cold Spring Harbor: Cold Spring Harbor Laboratory Press, 739-768.
Wolcow CA, Munoz MJ, Riddle DL, Ruvkun G: IRS and p55 orthologous adapter proteins function in the C. elegans daf-2 /insulin-like signaling pathway. J Biol Chem. 2002, 277: 41591-41597.
Cassada RC, Russell RL: The Dauerlarva, a post-embryonic developmental variant of the nematode Caenorhabditis elegans. Dev Biol. 1975, 46: 326-342.
Klass M, Hirsh D: Non-ageing developmental variant of Caenorhabditis elegans. Nature. 1976, 260: 523-525.
Bird AF, Buttrose MS: Ultrastructural changes in the nematode Anguina tritici associated with anhydrobiosis. J Ultrastruct Res. 1974, 48: 177-189.
Fuchs AG: Neue parasitische und halbparasitische Nematoden bei Borkenkäfern und einige andere Nematoden. I. Teil die Parasiten der Waldgärtner Myelophilus piniperda L. und minor Hartig und die Genera Rhabditis Dujardin, 1845 und Aphelenchus Bastian, 1865. Zoologische Jahrbücher, Abteilung für Systematik Ökologie und Geographie der Tiere, Jena. 1937, 70: 291-380.
Riddle DL, Georgi LL: Advances in research on Caenorhabditis elegans : Application to plant parasitic nematodes. Annu Rev Phytopathol. 1990, 28: 247-269.
Opperman CH, Bird DM: The soybean cyst nematode, Heterodera glycines : a genetic model system for the study of plant-parasitic nematodes. Curr Opin Plant Biol. 1998, 1: 342-346.
Endo BY: Ultrastructure of the intestine of second and third juvenile stages of Heterodera glycines. Proc Helminth Sco Wash. 1988, 55: 117-131.
O'Riordan V, Burnell AM: Intermediary metabolism in the dauer larva of the nematode C. elegans. I. Glycolysis, gluconeogenesis, oxidate phosphorylation and the tricarboxylic acid cycle. Comp Biochem Physiol. 1989, 92B: 233-238.
O'Riordan V, Burnell AM: Intermediary metabolism in the dauer larva. II. The glyoxylate cycle and fatty acid oxidation. Comp Biochem Physiol. 1990, 95B: 125-130.
Dalley BK, Golomb M: Gene expression in the Caenorhabditis elegans dauer larva: Developmental regulation of Hsp90 and other genes. Dev Biol. 1992, 151: 80-90.
Larsen PL: Aging and resistance to oxidative damage in Caenorhabditis elegans. Proc Natl Acad Sci USA. 1993, 90: 8905-8909.
Vanfleteren JR, De Vreese A: Rate of aerobic metabolism and superoxide production rate potential in the nematode Caenorhabditis elegans. J Exp Zool. 1996, 274: 93-100.
Blaxter M, Bird D: Parasitic nematodes. C. elegans II. Edited by: Riddle DL, Blumenthal T, Meyer BJ, Priess JR. 1997, Cold Spring Harbor: Cold Spring Harbor Laboratory Press, 851-878.
Bürglin TR, Lobos E, Blaxter M: Caenorhabditis elegans as a model for parasitic nematodes. Int J Parasitol. 1998, 28: 395-411.
Levy AD, Yang J, Kraemer JM: Molecular and genetic analyses of the Caenorhabditis elegans dpy-2 and dpy-10 collagen genes: A variety of molecular alternations affect organismal morphology. Mol Biol Cell. 1993, 4: 803-817.
Johnstone IL: The cuticle of the nematode Caenorhabditis elegans : A complex collagen structure. Bioessays. 1994, 16: 171-178.
Johnstone IL, Barry JD: Temporal reiteration of a precise gene expression pattern during nematode development. EMBO J. 1996, 15: 3633-3639.
Johnstone IL: Cuticle collagen genes. Expression in Caenorhabditis elegans. Trends Genet. 2000, 16: 21-27.
Jones SJ, Riddle DL, Pouzyrev AT, Velculescu VE, Hillier L, Eddy SR, Stricklin SL, Baillie DL, Waterston R, Marra MA: Changes in gene expression associated with developmental arrest and longevity in Caenorhabditis elegans. Genome Res. 2001, 11: 1346-1352.
Cowden C, Stretton AOW: Eight novel FMRFamide-like neuropeptides isolated from the nematode Ascaris suum. Peptides. 1995, 16: 491-500.
Maule AG, Geary TG, Bowman JW, Marks NJ, Halton DW, Shaw C, Thompson DP: Inhibitory effects of nematode FMRFamide-related peptides (FaRPs) on muscle strips from Ascaris suum. Invert Neurosci. 1995, 1: 255-265.
Davis RE, Stretton AOW: The motornervous system of Ascaris: electrophysiology and anatomy of the neurones and their control by neuromodulators. Parasitol. 1996, 113: S97-S117.
Geary TG, Marks NJ, Maule AG, Bowman JW, Alexander-Bowman SJ, Day TA, Larsen MJ, Kubiak TM, Davis JP, Thompson DP: Pharmacology of the FMRFamide-related peptides in helminths. Ann N Y Acad Sci. 1999, 897: 212-227.
Li C, Kim K, Nelson LS: FMRFamide-related neuropeptide gene family in Caenorhabditis elegans. Brain Res. 1999, 848: 26-34.
Brownlee DJA, Holden-Dye L, Fairweather I, Walker RJ: The action of serotonin and the nematode neuropeptide KSAYMRFamide on the pharyngeal muscle of the parasitic nematode, Ascaris suum. Parasitol. 1995, 111: 379-384.
Brownlee DJA, Fairweather I, Holden-Dye L, Walker RJ: Nematode neuropeptides: Localization, isolation, and functions. Parasitol Today. 1996, 12: 343-351.
Kim K, Li C: Expression and regulation of an FMRFamide-related neuropeptide gene family in Caenorhabditis elegans. J Comp Neurol. 2004, 475: 540-550.
Zdobnov EM, Apweiler R: InterProScan - an integration platform for the signature-recognition methods in InterPro. Bioinformatics. 2001, 17: 847-848.
Wang J, Kim SK: Global analysis of dauer gene expression in Caenorhabditis elegans. Development. 2003, 130: 1621-1634.
Fisher RA: The Design of Experiments. 1966, Edinburgh: Oliver and Boyd, 8
Zheng M, Messerschmidt D, Jungblut B, Sommer RJ: Conservation and diversification of Wnt signaling function during evolution of nematode vulva development. Nat Genet. 2005, 37: 300-304.
Liao BY, Zhang J: Evolutionary conservation of expression profiles between human and mouse orthologous genes. Mol Biol Evol. 2006, 23: 530-540.
Liao BY, Zhang J: Low rates of expression profile divergence in highly expressed genes and tissue-specific genes during mammalian evolution. Mol Biol Evol. 2006, 23: 1119-1128.
Gu X, Su Z: Tissue-driven hypothesis of genomic evolution and sequence-expression correlations. Proc Natl Acad Sci USA. 2007, 104: 2779-2784.
Dong K, Opperman CH: Genetic analysis of parasitism in the soybean cyst nematode Heterodera glycines. Genetics. 1997, 146: 1311-1318.
de Boer JM, Yan Y, Wang X, Smant G, Hussey RS, Davis EL, Baum TJ: Developmental expression of secretory β-1,4-endoglucanases in the subventral esophageal glands of Heterodera glycines. Mol Plant Microbe Interact. 1999, 12: 663-669.
Storey JD, Tibshirani R: Statistical significance for genome-wide studies. Proc Natl Acad Sci USA. 2003, 100: 9440-9445.
Nettleton D: A discussion of statistical methods for design and analysis of microarray experiments for plant scientists. Plant Cell. 2006, 18: 2112-2121.
Kaufman I, Rousseeuw PJ: Finding Groups in Data: An Introduction to Cluster Analysis. 1990, Wiley InterScience New York
Tibshirani R, Walther G, Hastie T: Estimating the number of clusters in a dataset via the Gap statistic. J Roy Statist Soc B. 2001, 63: 411-423.
The R Project. [http://www.R-project.org]
The Wellcome Trust Sanger Institute C. elegans project. [http://www.sanger.ac.uk/Projects/C_elegans/ANNOTATION/]
Livak KJ, Schmittgen TD: Analysis of relative gene expression data using real-time quantitative PCR and the 2ΔΔCT Method. Methods. 2001, 25: 402-408.
This is a Journal Paper of the Iowa Agriculture and Home Economics Station, Ames, Iowa and supported by Hatch Act and State of Iowa funds. This study was funded by a grant from the United Soybean Board to TJB, ELD and RSH and USDA-NRI award #2005-35604-15434. Heterodera glycines EST sequencing at Washington University was supported by NSF Plant Genome award 0077503 to DMB. AAE was in part supported by a Storkan-Hanes-McCaslin Research Foundation fellowship. MM and JM were funded by NIH-NIAID grant AI46593. We thank Steve Whitham and Rico Caldo for helpful discussions and Jiqing Peng for expert technical assistance.
AAE planned and coordinated the study, prepared tissue for microarrays, analyzed the data and wrote the manuscript. MM and JPMcC oversaw sequencing and sequence analyses, analyzed data and edited the manuscript. JR assisted in statistical analyses. XG and JM assisted in sequence analyses. TRM generated tissue samples. JPMcD constructed cDNA libraries. TH performed qRT-PCR. DMB analyzed data and edited the manuscript. ELD provided material. RSH edited the manuscript. DN performed and directed statistical analyses and edited the manuscript. TJB planned the study, analyzed data and wrote the manuscript.
Electronic supplementary material
Additional data file 1: BLAST searches identified seven H. glycines probesets orthologous to C. elegans FaRP-encoding genes (mean expression pattern indicated in red). For visualization purposes, each probeset's estimated mean log-scale expression profile was standardized to have mean 0 and variance 1.5 prior to plotting. infJ2, infective J2; parJ2, parasitic J2. (EPS 7 KB)
Additional data file 2: Reciprocal BLAST searches identified 438 H. glycines probesets as orthologous to C. elegans dauer-regulated genes. These probesets were grouped into nine clusters based on their temporal expression profiles. The average expression pattern of the probesets in each cluster is indicated by a bold line. infJ2, infective J2; parJ2, parasitic J2. (EPS 27 MB)
Additional data file 4: Identities and cluster membership of differentially expressed (FDR 5%) probesets over the entire life cycle. (XLS 614 KB)
Additional data file 5: The 25 most abundant InterPro domains for the 10 expression clusters for H. glycines genes that are differentially expressed over the entire life cycle. (XLS 22 KB)
Additional data file 6: H. glycines probesets orthologous to dauer-regulated C. elegans genes and their cluster memberships. (XLS 81 KB)
Additional data file 8: Transformed and normalized Affymetrix probeset mean data for each probeset in each life stage and q values for each tested combination of life stages as described in Materials and methods. (CSV 2 MB)
About this article
Cite this article
Elling, A.A., Mitreva, M., Recknor, J. et al. Divergent evolution of arrested development in the dauer stage of Caenorhabditis elegans and the infective stage of Heterodera glycines. Genome Biol 8, R211 (2007). https://doi.org/10.1186/gb-2007-8-10-r211