Genome of Acanthamoeba castellanii highlights extensive lateral gene transfer and early evolution of tyrosine kinase signaling
- Michael Clarke†1,
- Amanda J Lohan†1,
- Bernard Liu2,
- Ilias Lagkouvardos3,
- Scott Roy4,
- Nikhat Zafar5,
- Claire Bertelli6,
- Christina Schilde7,
- Arash Kianianmomeni8,
- Thomas R Bürglin9,
- Christian Frech10,
- Bernard Turcotte11,
- Klaus O Kopec12,
- John M Synnott1,
- Caleb Choo10,
- Ivan Paponov13,
- Aliza Finkler14,
- Chris Soon Heng Tan15,
- Andrew P Hutchins16,
- Thomas Weinmeier17,
- Thomas Rattei17,
- Jeffery SC Chu18,
- Gregory Gimenez19,
- Manuel Irimia20,
- Daniel J Rigden21,
- David A Fitzpatrick22,
- Jacob Lorenzo-Morales23,
- Alex Bateman24,
- Cheng-Hsun Chiu25,
- Petrus Tang26,
- Peter Hegemann8,
- Hillel Fromm14,
- Didier Raoult19,
- Gilbert Greub6,
- Diego Miranda-Saavedra16,
- Nansheng Chen10,
- Piers Nash27,
- Michael L Ginger28,
- Matthias Horn3,
- Pauline Schaap7,
- Lis Caler5 and
- Brendan J Loftus1
© Clarke et al.; licensee BioMed Central Ltd. 2013
Received: 19 July 2012
Accepted: 1 February 2013
Published: 1 February 2013
The Amoebozoa constitute one of the primary divisions of eukaryotes, encompassing taxa of both biomedical and evolutionary importance, yet their genomic diversity remains largely unsampled. Here we present an analysis of a whole genome assembly of Acanthamoeba castellanii (Ac), the first representative from a solitary free-living amoebozoan.
Ac encodes 15,455 compact intron-rich genes, a significant number of which are predicted to have arisen through inter-kingdom lateral gene transfer (LGT). A majority of the LGT candidates have undergone a substantial degree of intronization and Ac appears to have incorporated them into established transcriptional programs. Ac manifests a complex signaling and cell communication repertoire, including a complete tyrosine kinase signaling toolkit and a comparable diversity of predicted extracellular receptors to that found in the facultatively multicellular dictyostelids. An important environmental host of a diverse range of bacteria and viruses, Ac utilizes a diverse repertoire of predicted pattern recognition receptors, many with predicted orthologous functions in the innate immune systems of higher organisms.
Our analysis highlights the important role of LGT in the biology of Ac and in the diversification of microbial eukaryotes. The early evolution of a key signaling facility implicated in the evolution of metazoan multicellularity strongly argues for its emergence early in the Unikont lineage. Overall, the availability of an Ac genome should aid in deciphering the biology of the Amoebozoa and facilitate functional genomic studies in this important model organism and environmental host.
Acanthamoeba castellanii (Ac) is one of the predominant soil organisms in terms of population size and distribution, where it acts both as a predator and an environmental reservoir for a number of bacterial, fungal and viral species. Selective grazing by Ac in the rhizosphere alters microbial community structure and is an important contributor to the development of root architecture and nutrient uptake by plants. Ac can also be isolated from almost any body of water and manifests in a wide variety of man-made water systems, including potable water sources, swimming pools, hot tubs, showers and hospital air conditioning units [3, 4]. Acanthamoebae are frequently associated with a diverse range of bacterial symbionts [5, 6]. A subset of the microbes that serve as prey for Ac have evolved virulence stratagems to use Ac as both a replicative niche and as a vector for dispersal and are important human intracellular pathogens [7, 8]. These pathogens utilize analogous strategies to infect and persist within mammalian macrophages, illustrating the role of environmental hosts such as Ac in the evolution and maintenance of virulence [9, 10]. Commonalities at the level of host response between amoebae and macrophages to such pathogens have led to the use of both Dictyostelium discoideum (Dd) and Ac as model systems to study pathogenesis [11, 12].
Published Amoebozoa genomes from the obligate parasite Entamoeba histolytica (Eh) and the facultatively multicellular Dd have both highlighted unexpected complexities at the level of cell motility and signaling [13, 14]. As the only solitary free-living representative, the genome of Ac establishes a unique reference point for interpreting other amoebozoan genomes. Experimentally, Ac has been more thoroughly studied than most other free-living amoebae, acting as a model organism for studies on the cytoskeleton, cell movement, and aspects of gene regulation, with a large body of literature supporting its molecular interactions [15–18].
Results and discussion
Lateral gene transfer
Genetic exchange is also thought to occur between phylogenetically disparate organisms that reside within the same amoebal host cell [20, 21]. Ac contains three copies of a miniature transposable element (ISSoc2) of the IS607 family of insertion sequences related to those present in genomes of thermophilic cyanobacteria and several giant nucleocytoplasmic large DNA viruses (NCLDVs). In the Mimivirus genome the IS elements are found within islands of genes of bacterial origin, some of which appear to have been contributed by a cyanobacterial donor. These data underscore the complex intermediary role that Ac, as host to both NCLDVs and cyanobacteria, may play in facilitating genetic transfer between sympatric species.
Comparison of predicted LGT across amoeboid genomes
To compare the impact and scale of LGT across Ac and other amoebae, we applied the same phylogenomic approach used to identify LGT in the Ac genome to published genomes of other amoeboid protists, including Dd, Eh, Entamoeba dispar (Ed) and Naegleria gruberi (Ng). Our findings predict that Ac and the excavate Ng encode a notably higher number of laterally acquired bacterial genes than either of the more closely related parasitic Entamoeba or the social Dd amoebozoans (Figure 2a). The taxonomic distribution of putative LGT donors is broadly similar for both Entamoeba species, but surprisingly also between Ac and Ng (Figure 2b,c; Section 2 of Additional file 1). The genomes of both Eh and Ed are predicted to have experienced a proportionately higher influx from anaerobic and host-associated microbes than their free-living counterparts Ac and Ng (Figure 2c; Additional file 2), likely reflecting the composition of microbes within their habitats. Many of the LGT candidates across all of the amoebae have predicted metabolic functions, suggesting that LGT in amoebae is reflective of trophic strategy and driven by the selective pressure of new ecological niches. Our data illustrating LGT as a contributing factor in shaping the biology of a diversity of amoeboid genomes provide further evidence supporting an underappreciated role for LGT in the diversification of microbial eukaryotes.
Intron-exon structures exhibit complex phylogenetic patterns with orders-of-magnitude differences across eukaryotic lineages, which imply frequent transformations during eukaryotic evolution. Some researchers have argued that intron gain is episodic with long periods of stasis punctuated by periods of rapid gain while others argue for generally higher rates. Strikingly, Ac genes have an average of 6.2 introns per gene, among the highest known in eukaryotes. Genes predicted to have arisen through LGT have slightly lower but broadly comparable intron densities, offering an opportunity to study the evidence for proposed mechanisms underpinning post-LGT intron gain. An analysis of LGT introns, however, did not provide support for any of the proposed mechanisms of intron gain (Section 2 of Additional file 1). Thus, while the preponderance of introns in LGTs clearly indicates substantial intron gain at some point, it appears that, for Ac, these events have been very rare in recent times, consistent with a punctate model of intron gain.
As a unicellular sister grouping to the multicellular Dictyostelids, Ac provides a unique point of comparison to gain insight into the molecular underpinnings of multicellular development in Amoebozoa. Cell-cell communication is a hallmark of multicellularity and we looked at putative receptors for extracellular signals and their downstream targets. G-protein-coupled receptors (GPCRs) represent one of the largest families of sensors for extracellular stimuli. Overall, Ac encodes 35 GPCRs (compared to 61 in Dd), representing 4 out of the 6 major families of GPCRs while lacking metabotropic glutamate-like GPCRs or fungal pheromone receptors. We identified three predicted fungal-associated glucose-sensing Git3 GPCRs and an expansion in the number of frizzled/smoothened receptors (Figure S3.1.1 in Additional file 1). We identified seven G-protein alpha subunits and a single putative target, phospholipase C, for GPCR-mediated signaling. The number and diversity of receptors in Ac raises the question of what they are likely to be sensing. Nematodes employ many of their GPCRs in detecting molecules secreted by their bacterial food sources, and given the diversity of Ac's feeding environments, many of the Ac GPCRs may fulfill a similar role.
The Ac PTK domains are highly conserved in key catalytic residues, resembling dedicated PTKs found in metazoans (Figure S4.2.1 in Additional file 1), and are distinct from Dd and Eh PTKs that are more tyrosine kinase-like (TKL) (Figure S4.2.2 in Additional file 1). Ac PTK homologues are present in the apusomonad Thecamonas trahens and have also recently been described in two filasterean species, Capsaspora owczarzaki and Ministeria vibrans. One unusual feature of the pTyr machinery in Ac is the 2:1 ratio of SH2 to PTK domains, as comparisons across opisthokonts show a strong correlation and co-expansion of these two domains with a ratio close to 1:1 (Figure 4c,d). This increased ratio in Ac indicates either an expansion to handle the cellular requirements of pTyr signaling or that aspects of PTK function are accomplished by TKL or dual specificity kinases, as appears to be the case in Dd. We also found that Ac has fewer tyrosine residues in its proteome in comparison to Dd, which lacks PTKs (Figure S4.3.1 in Additional file 1). This result is in line with recent analysis of metazoan genomes, suggesting increased pressure for selection against disadvantageous phosphorylation of tyrosine residues in genomes with extensive pTyr signaling.
Domain organization and composition of pTyr components reveal the selective pressures for adapting pTyr signaling into various pathways. Seven PTKs have predicted transmembrane domains and may function as receptor tyrosine kinases, hinting at their potential for intercellular communication. The majority of PTKs in Ac, however, show unique domain combinations; six PTKs contain a sterile alpha motif (SAM) domain, which is found in members of the ephrin receptor family (Figure S4.4.3 in Additional file 1). The Ac SH2 proteins are conserved within the pTyr binding pocket and resemble SH2 domains from the SOCS, RIN, CBL and RASA families (Figure S4.4.2 in Additional file 1); however, the domain composition within these proteins differs from those of Monosiga brevicollis and metazoans (Figure S4.4.3A in Additional file 1). Approximately half of the Ac SH2 proteins share domain architectures with Dd, including the STAT family of transcription factors (Figure S4.4.3B in Additional file 1). The presence of homologous SH2 proteins in Dd coupled with the complete facility in Ac predicts an emergence of the complete machinery for pTyr early in the Unikont lineage. This finding is in contrast with models that posit a complete pTyr signaling machinery emerging late in the Unikont lineage and has important implications for understanding the relationship between pTyr signaling and the evolution of multicellularity. The lack of clear metazoan orthologues makes it difficult to trace the evolutionary paths of pTyr signaling networks or to accurately predict the cellular functions and adaptations of pTyr in Ac. However, with phosphoproteomics and sequence analysis, insights into ancient pTyr signaling circuits may be revealed through future studies in Ac (Figure S4.5.1 in Additional file 1).
Ac is not known to participate in social activity yet must adhere to a diversity of surfaces within the soil and practice discrimination between self and prey during phagocytosis. Ac shares some adhesion proteins with Dd (Table S5.1.1 in Additional file 1) but homologues of the calcium-dependent, integrin-like Sib cell-adhesion proteins are absent. Surprisingly, Ac contains a number of bacterial-like integrin and hemagglutinin domain adhesion proteins that may improve its ability to attach to bacterial cells or biofilms. Ac encodes two MAM domain-containing proteins, a domain found in functionally diverse receptors with roles in cell-cell adhesion. Ac has a copy of the laminin-binding protein (AhLBP) first identified in Acanthamoeba healyi, which has been shown to act as a non-integrin laminin binding receptor. Remarkably, Ac also encodes proteins containing cell adhesion immunoglobulin domains (Section 5 of Additional file 1). Both show affinity to the I-set subfamily and contain weakly predicted transmembrane domains (Figure S5.1.1 in Additional file 1).
Microbial recognition through pattern recognition receptors
Receptor-mediated endocytosis of Legionella pneumophila in Ac is mediated by the c-type lectin mannose binding protein (MBP). MBP also represents the principal virulence factor in pathogenic Acanthamoebae. In addition to MBP, the Ac genome encodes two paralogues of MBP with similarity to the amino-terminal region of the protein. Rhamnose-binding lectins serve a variety of functions in invertebrates, one of which is their role as germline-encoded PRRs in innate immunity. They are absent from other Amoebozoa, although Ac encodes 11 D-galactoside/L-rhamnose binding (SUEL) lectin domain-containing proteins. Approximately half of the SUEL lectin domain proteins harbour epidermal growth factor domains, a combination reminiscent of the selectin family of adhesion proteins found exclusively in vertebrates. An L-rhamnose synthesis pathway thought to contribute to biosynthesis of the lipopolysaccharide-like outer layer of the virus particle has recently been identified in Mimivirus that may facilitate its uptake by Ac [56, 57]. Ac also encodes a protein where multiple copies of H-type lectin are joined with an inhibitor of apoptosis domain. The H-lectin domain is predicted to bind to N-acetylgalactosamine (GalNAc) and is found in Dictyostelium discoidin I & II and other invertebrates where it plays a role in antibacterial defense. In the brown alga Ectocarpus, leucine-rich repeat (LRR)-containing GTPases of the ROCO family and NB-ARC-TPR proteins have been proposed to represent PRRs that are involved in immune response. Ac encodes a NB-ARC-TPR homologue with a disease resistance domain (IPR000767) and an LRR-ROCO GTPase.
Ac encodes proteins with potential roles in antiviral defense including homologues of NCLDV major capsid proteins as well as homologues of Dicer and Piwi, both of which have been implicated in RNA-mediated antiviral silencing. Our data also illustrate early evolution of a number of interferon-inducible innate immunity proteins absent from other sequenced Amoebozoa. These include a homologue of the interferon-γ-inducible lysosomal thiol reductase enzyme (GILT), an important host factor targeted by Listeria monocytogenes during infection in macrophages. In addition, Ac encodes two interferon-inducible GTPase homologues, which in vertebrates promote cell-autonomous immunity to vacuolar bacteria, including Mycobacteria and Legionella species. Ac also contains a natural resistance-associated macrophage protein (NRAMP) homologue, which has been implicated in protection against L. pneumophila and Mycobacterium avium infection in both macrophages and Dd.
Ac has traditionally been considered to be an obligate aerobe, although the recent identification of the oxygen-labile enzymes pyruvate:ferredoxin oxidoreductase and FeFe-hydrogenase perhaps pointed towards a cryptic capacity for anaerobic ATP production. Predictions for nitrite and fumarate reduction, hydrogen fermentation, together with a likely mechanism for acetate synthesis, coupled to ATP production indicate a considerable capacity for anaerobic ATP generation. This clearly sets Ac apart from Dd, which hunts within the aerobic leaf litter, but provides parallels with Ng, the alga Chlamydomonas reinhardtii and other soil-dwelling protists that are likely to experience considerable variation in local oxygen tensions. These protists achieve their flexible, facultative anaerobic metabolism, however, using different pathways (Figure S7.1 in Additional file 1). In addition, the classic anaerobic twists on glycolysis provided by pyrophosphate-dependent phosphofructokinase and pyruvate phosphate dikinase are absent from Ac. This suggests that although multiple pathways are available for oxidation of NADH to NAD+ in the absence of oxygen, including a capacity for anaerobic respiration in the presence of nitrite (NO₂⁻), a shift to a more ATP-sparing form of glycolysis is not necessary under low oxygen-tension. Given genome-led predictions of facultative anaerobic ATP metabolism, as well as extensive use of receptors and signaling pathways classically associated with animal biology, we also considered the possibility of a hypoxia-inducible factor (HIF)-dependent system for oxygen sensing, similar to that seen across the animal kingdom, including the simple animal Trichoplax adhaerens [69, 70]. However, despite conservation of a Skp1/HIFα-related prolyl hydroxylase in Ac, we found no genes encoding proteins with the typical domain architecture of animal HIFα or HIFβ. Currently, therefore, HIF-dependent oxygen sensing remains restricted to metazoan lineages.
Ac also retains biosynthetic pathways involved in anabolic metabolism that are absent in Dd (for example, the shikimic acid pathway and a classic type I pathway for fatty acid biosynthesis; Table S7.1 in Additional file 1), although investment in extensive polyketide biosynthesis is not evident. An autophagy pathway, as defined by genetic studies of yeast, Dd and other organisms, is present in Ac with little paralogue expansion or loss of known autophagy-related (ATG) genes evident (Figure 7.2 in Additional file 1) and likely contributes to both intracellular re-modeling in response to environmental cues and the interaction with phagocytosed microbes.
Ac shares a broadly comparable repertoire of transcription factors with Dd excepting a number of lineage-specific expansions (Table S8.1 in Additional file 1). Ac encodes 22 zinc cluster transcription factors compared to the 3 in Dd (Figure S8.2.1 in Additional file 1). It has almost double the number of predicted homeobox genes (25) compared to the 13 in Dd. Two are of the MEIS and PBC class respectively, with an expansion in a homologue of Wariai, a regulator of anterior-posterior patterning in Dictyostelium, comprising most of the additional members (Figure S8.3.2 in Additional file 1). Strikingly, we also identified 22 Regulatory factor X (RFX) genes, the first identified in an Amoebozoan. The Ac RFX repertoire is the earliest branching yet identified and forms an out-group to other known RFX genes (Section 8 of Additional file 1). Ac has been proposed to affect plant root branching in the rhizosphere via its effects on auxin balance in plants. It encodes a number of genes involved in auxin biosynthesis as well as those involved in free auxin (indole-3-acetic acid (IAA)) de-activation via formation of IAA conjugates (Table S9.1 in Additional file 1). These data suggest that Ac plays a role in altering the level of IAA in the rhizosphere through a strategy of alternative biosynthesis and sequestration. Ac may also respond transcriptionally to auxin as it encodes a member of the calmodulin-binding transcription activator (CAMTA) family (Figure S8.4.1 in Additional file 1), which in plants co-ordinate stress responses via effects on auxin signaling [78, 79].
Comparative genomics of the Amoebozoa has until now been restricted to comparisons between the multicellular dictyostelids and the obligate parasite Eh [80, 81]. Ac, while sharing many of their features, enriches the repertoire of amoebozoan genomes in a number of important areas, including signaling and pattern recognition. LGT has significantly contributed to both the genome and transcriptome of Ac, whose accessory genome shares unexpected similarities with a phylogenetically distant amoeba. The presence of prokaryotic TEs in Ac illustrates its role in the evolution of some of the earth's most unusual organisms as well as a number of important human pathogens [7, 8].
Ac has adopted bacterial-like adhesion proteins to facilitate adherence to biofilms and H-NOX based nitric oxide signaling which likely aids in their dispersal. Overall the adaptive value conferred by LGT is highlighted by the expression of the large majority in Ac across multiple conditions, which points to their adoption into novel transcriptional networks. Given the feeding behavior of Ac, it seems plausible that eukaryote-to-eukaryote gene transfers may also have provided adaptive benefits. Increased sampling will be necessary to establish the extent to which such gene transfers made their way into the Ac genome and whether 'you are what you eat' equally applies to a diet of eukaryotes.
Ac participates in a myriad of as yet unexplored interactions, as reflected in the diversity of genes devoted to sensory perception and signal transduction of extracellular stimuli. Ac's survival in the rhizosphere is likely contingent on interactions not only with other microbes but also on a cross-talk with plant roots through manipulation of the levels of the plant hormone auxin. LGT may also have provided Ac with some of its recognition and environmental sensing components. An interesting parallel is the planktonic protozoan Oxyrrhis marina, which utilizes both MBP and LGT-derived sensory rhodopsins to enable selective feeding behavior through prey detection and biorecognition. We predict that host response of Ac to pathogens and symbionts is likely modulated via a diversity of predicted PRRs that act in an analogous manner to effectors of innate immunity in higher organisms. Given the close association of Ac with a number of important intracellular pathogens, it will be interesting to determine which host-pathogen interactions can trace their origins to encounters with primitive cells such as Ac.
Ac shares protein family expansions in signal transduction with other Amoebozoa while introducing new components based on novel domain architectures (nucleotidyl cyclases). The presence of the complete pTyr signaling toolkit, especially when contrasted with its absence in the multicellular dictyostelids, is a remarkable finding of the Ac genome analysis. However, the role of tyrosine kinase signaling in both amoebozoan and mammalian phagocytosis [87–89] indicates that it likely represents an ancestral function. The most parsimonious interpretation predicts the supplanting of functions originally carried out by tyrosine kinases by other kinases in the Amoebozoa. This emphasizes the importance of representative sampling and in its absence the inherent difficulties in re-constructing ancestral signaling capacities.
Transcriptional response networks can be re-programmed either through expansion of transcription factors or their target genes. Ac and Dd share a conserved core of transcription factors with any differences between them largely accounted for by lineage-specific amplifications. These may result in sub- or neo-functionalization contributing to the adaptive radiation of Acanthamoebae into new ecological niches.
Comparison of Ac with Dd highlights a broadly similar apparatus for environmental sensing and cell-cell communication and implies that the molecular elements underpinning the transition to a multicellular lifestyle may be widespread. Such transitions would likely have involved co-option of ancestral functions into multicellular programs and have occurred multiple times. Our analysis suggests that many signal processing and regulatory modules of higher animals and plants likely have deep origins and are balanced with subsequent losses in certain lineages including tyrosine kinases in fungi, plants and many protists.
The availability of an Ac genome offers the first opportunity to initiate functional genomics in this important constituent of a variety of ecosystems and should foster a better understanding of the amoebic lifestyle. Utilizing the genome as a basis for unraveling the molecular interactions between Ac and a variety of human pathogens will provide a platform for understanding the contributions of environmental hosts to the evolution of virulence.
Materials and methods
Ac strain Neff (ATCC 30010) was grown at 30°C with moderate shaking to an OD550 of approximately 1.0. Total nucleic acid preparations were depleted of mitochondrial DNA contamination via differential centrifugation of cell extracts. High molecular weight DNA was extracted from nuclear pellets either on cesium chloride-Hoechst 33258 dye gradients, as previously described, or by using the Qiagen Genomic-tip 20/G kit (Qiagen, Hilden, Germany).
Genomic DNA library preparation and sequencing
All genomic DNA libraries were generated according to the Illumina protocol Genomic DNA Sample Prep Guide - Oligo Only Kit (1003492 A); sonication using a Bioruptor™ (Diagenode, Liège, Belgium) was substituted for the recommended nebulization as the method for DNA fragmentation. The library preparation methodology of end repair to create blunt-ended fragments, addition of a 3'-A overhang for efficient adapter ligation, ligation of the adapters, and size selection of adapter-ligated material was carried out using enzymes indicated in the protocol. Adapters and amplification primers were purchased from Illumina (Illumina, San Diego, CA, USA); both Single Read Adapters (FC-102-1003) and Paired End Adapters (catalogue number PE-102-1003) were used in library construction. All enzymes for library generation were purchased from New England Biolabs (Ipswich, MA, USA). A limited 14-cycle amplification of size-selected libraries was carried out. To eliminate adapter-dimers, libraries were further size selected on 2.5% TAE agarose gels. Purified libraries were quantified using a Qubit™ fluorometer (Invitrogen, Carlsbad, CA, USA) and a Quant-iT™ double-stranded DNA High-Sensitivity Assay Kit (Invitrogen). Clustering and sequencing of the material was carried out as per the manufacturer's instructions on the Illumina GAII platform in the UCD Conway Institute (UCD, Dublin, Ireland).
RNA extraction and RNA.seq library preparation and sequencing
For all tested conditions (Table S1.6.1 in Additional file 1) except the infection series, RNA was extracted from a minimum of 1 × 10⁶ cells using TRIzol® (Invitrogen/Life Technologies, Paisley, UK). For infection material the detailed protocol has been published elsewhere. Strand-specific RNA.seq libraries were generated from total RNA using a modified version of a previously published method. Briefly, total RNA was poly(A) selected, fragmented, reverse transcribed and second strand cDNA marked with the addition of dUTP. Standard Illumina methodology was followed - end-repair, A-addition, adapter ligation and library size selection - with the exception of the use of 'home-brew 6-nucleotide indexed' adapters as per Craig et al. Prior to limited amplification of the libraries, the dUTP marked second strand was removed via Uracil DNA-Glycosylase (Bioline, London, UK) digestion. Final libraries were quantified using the High Sensitivity DNA Quant-iT™ assay kit and Qubit™ Fluorometer (Invitrogen/Life Technologies). All sequencing was carried out in UCD Conway Institute on an Illumina GAII as per the manufacturer's instructions.
Sequencing and assembly
Genome assembly was carried out using a two-step process. Firstly, the Illumina reads were assembled using the Velvet short read assembler to generate a series of contigs. These assembled contigs were used to generate a set of pseudo-reads 400 bp in length. These pseudo-reads were then assembled in conjunction with the 454 FLX and Sanger sequences using version 2.3 of the GS De Novo Assembler with default parameters (Table S1.1.1 in Additional file 1). The assembly contained 45.1 Mb of scaffold sequence, of which 3.4 Mb (7.5%) represents gaps, and 75% of the genome is contained in fewer than 100 scaffolds. For assembly statistics see Table S1.2.1 in Additional file 1. To determine the coverage of the transcriptome, we aligned our genome assembly to a publicly available EST dataset from GenBank (using the Entrez query (acanthamoeba EST) AND 'Acanthamoeba castellanii'[porgn:txid5755]). Of the 13,784 EST sequences downloaded, 12,975 (94%) map over 50% of their length with an average percent identity of 99.2% and 12,423 (90%) map over 70% of their length with an average percent identity of 99.26%.
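The pseudo-read step described above can be illustrated with a short sketch. The text specifies only the pseudo-read length (400 bp); the 200 bp step between successive pseudo-reads, the treatment of contigs shorter than one read, and the function name make_pseudo_reads are assumptions made for illustration and are not taken from the authors' pipeline.

```python
from typing import Iterator, Tuple


def make_pseudo_reads(contig_id: str, seq: str, read_len: int = 400,
                      step: int = 200) -> Iterator[Tuple[str, str]]:
    """Yield (name, sequence) pseudo-reads tiled across one contig.

    read_len=400 follows the text; step=200 is an assumed overlap setting.
    """
    if len(seq) <= read_len:
        # Assumption: contigs no longer than one read are emitted whole.
        yield f"{contig_id}_0", seq
        return
    last_start = len(seq) - read_len
    starts = list(range(0, last_start + 1, step))
    # Add a final window flush with the contig end so no bases are dropped.
    if starts[-1] != last_start:
        starts.append(last_start)
    for start in starts:
        yield f"{contig_id}_{start}", seq[start:start + read_len]


# Toy 1,000 bp contig standing in for a Velvet contig.
contig = "ACGT" * 250
reads = list(make_pseudo_reads("contig1", contig))
```

Overlapping windows are one plausible way to let a long-read assembler rejoin pseudo-reads derived from the same contig; a different step size would change only the depth of the pseudo-read layer, not the underlying sequence.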
Gene structure prediction
Gene finding was carried out on the largest 384 scaffolds of the Ac assembly using an iterative approach: gene models were first generated directly from RNA.seq data and used to train a gene-finding algorithm within a genome annotation pipeline, and the resulting predictions were then manually curated. Firstly, predicted transcripts were generated using RNA.seq data from a variety of conditions (Table S1.4.1 in Additional file 1) in conjunction with the G.Mo.R-Se algorithm (Gene Modelling using RNA.seq), an approach aimed at building gene models directly from RNA.seq data, running with default parameters. This algorithm generated 20,681 predicted transcripts. We then used these predicted transcripts to train the genefinder SNAP using the MAKER genome annotation pipeline [99, 100]. MAKER is used for the annotation of prokaryotic and eukaryotic genome projects. It identifies repeats, aligns ESTs (in this case the transcripts generated by the G.Mo.R-Se algorithm) and proteins (from nr) to a genome, produces ab-initio gene predictions and automatically synthesizes these data into gene annotations. The 17,013 gene predictions generated by MAKER were then manually annotated using the Apollo genome annotation curation tool [101, 102]. Apollo allows the deletion of gene models, the creation of gene models from annotations and the editing of gene starts, stops, and 3' and 5' splice sites. Models were manually annotated examining a variety of evidence, including expressed sequence data and matches to protein databases (Section 1 of Additional file 1). Out of a total of 113,574 exons, 32,836 are exactly covered and 64,724 are partially covered by transcripts, and 7,193 genes have at least 50% of their entire lengths covered by transcript data.
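The exon coverage figures at the end of this paragraph depend on how "exactly covered" and "partially covered" are defined, which the text does not spell out. The sketch below shows one possible reading, under stated assumptions: an exon is "exact" when a transcript alignment block has identical start and end coordinates, "partial" when any block overlaps it, and gene coverage is measured over exonic bases only. The function names and the toy coordinates are hypothetical.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # (start, end), inclusive coordinates on one scaffold


def classify_exon(exon: Interval, transcript_blocks: List[Interval]) -> str:
    """Return 'exact', 'partial' or 'none' for one exon (assumed definitions)."""
    if exon in transcript_blocks:
        return "exact"
    start, end = exon
    for b_start, b_end in transcript_blocks:
        if b_start <= end and b_end >= start:  # any overlap at all
            return "partial"
    return "none"


def covered_fraction(exons: List[Interval],
                     transcript_blocks: List[Interval]) -> float:
    """Fraction of a gene's exonic bases overlapped by any transcript block."""
    total = sum(end - start + 1 for start, end in exons)
    covered = set()
    for start, end in exons:
        for b_start, b_end in transcript_blocks:
            lo, hi = max(start, b_start), min(end, b_end)
            covered.update(range(lo, hi + 1))  # empty range when lo > hi
    return len(covered) / total if total else 0.0


# Toy gene with two exons and two transcript alignment blocks.
exons = [(100, 199), (300, 399)]
blocks = [(100, 199), (350, 450)]
labels = [classify_exon(e, blocks) for e in exons]
fraction = covered_fraction(exons, blocks)
passes_50_percent = fraction >= 0.5
```

For scale, the counts reported in the text correspond to roughly 29% of exons exactly covered (32,836 of 113,574) and roughly 57% partially covered (64,724 of 113,574).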
Functional annotation assignments
Functional annotation assignments were carried out using a combination of automated annotation as described previously, followed by manual annotation. Briefly, gene level searches were performed against protein, domain and profile databases, including JCVI in-house non-redundant protein databases, Uniref, Pfam, TIGRfam HMMs, Prosite, and InterPro. After the working gene set had been assigned an informative name and a function, each name was manually curated and changed where it was felt a more accurate name could be applied. Predicted genes were classified using Gene Ontology (GO). GO assignments were attributed automatically, based on other assignments from closely related organisms using Pfam2GO, a tool that allows automatic mapping of Pfam hits to GO assignments.
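The Pfam-to-GO step can be sketched as a simple table lookup. The example below assumes the mapping is supplied in the line format commonly used for Gene Ontology cross-reference files ("Pfam:ACCESSION name > GO:term name ; GO:ID", with comment lines starting with "!"); the gene identifiers, the function names and the handful of mapping lines are illustrative only and do not reproduce the authors' annotation pipeline.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set


def parse_pfam2go(lines: Iterable[str]) -> Dict[str, Set[str]]:
    """Build {Pfam accession: {GO IDs}} from pfam2go-style lines."""
    mapping: Dict[str, Set[str]] = defaultdict(set)
    for line in lines:
        line = line.strip()
        if not line or line.startswith("!"):
            continue  # skip blanks and header comments
        left, _, right = line.partition(" > ")
        if not right:
            continue  # not a mapping line
        pfam_acc = left.split()[0].replace("Pfam:", "")
        go_id = right.rsplit(";", 1)[-1].strip()
        mapping[pfam_acc].add(go_id)
    return mapping


def assign_go(gene_to_pfam: Dict[str, List[str]],
              pfam2go: Dict[str, Set[str]]) -> Dict[str, List[str]]:
    """Give each gene the union of GO IDs mapped from its Pfam hits."""
    result = {}
    for gene, accessions in gene_to_pfam.items():
        go_ids: Set[str] = set()
        for acc in accessions:
            go_ids |= pfam2go.get(acc, set())
        result[gene] = sorted(go_ids)
    return result


example_lines = [
    "!example header line",
    "Pfam:PF00001 7tm_1 > GO:G protein-coupled receptor activity ; GO:0004930",
    "Pfam:PF00001 7tm_1 > GO:G protein-coupled receptor signaling pathway ; GO:0007186",
    "Pfam:PF00069 Pkinase > GO:protein kinase activity ; GO:0004672",
]
# Hypothetical gene identifiers and Pfam hits; PF99999 has no mapping here.
hits = {"gene_A": ["PF00001"], "gene_B": ["PF00069", "PF99999"]}
go_assignments = assign_go(hits, parse_pfam2go(example_lines))
```

Because a lookup of this kind transfers every GO term attached to a domain, the manual curation step described in the text remains the safeguard against over-annotation of multi-domain proteins.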
This whole genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession AHJI00000000. The version described in this paper is the first version, AHJI01000000. The RNA.seq data are available under accessions SRA061350 and SRA061370-SRA061379.
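Abbreviations
- Ac: Acanthamoeba castellanii
- Dd: Dictyostelium discoideum
- Ed: Entamoeba dispar
- Eh: Entamoeba histolytica
- EST: expressed sequence tag
- H-NOX: heme-nitric oxide/oxygen binding
- LGT: lateral gene transfer
- MAMP: microbe-associated molecular pattern
- MAPK: mitogen-activated protein kinase
- MBP: mannose binding protein
- Ng: Naegleria gruberi
- PTK: tyrosine kinase 'writer'
- PTP: tyrosine phosphatase 'eraser'
- RFX: Regulatory factor X
- SH2: Src homology 2 'reader' domain
- TKL: tyrosine kinase like.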
This work was funded by Science Foundation Ireland (SFI) grants 05/RP1/B908 and 05/RP1/908/EC07 awarded to BJL. The authors thank the Broad Institute and the investigators of 'the Origins of Multicellularity Sequencing Project, Broad Institute of Harvard and MIT' for making data publicly available and the Liverpool Centre for Genomic Research for provision of 454 sequencing data. IL and MH were funded through grants from the Austrian Research Fund (Y277-B03), the European Research Council (Starting Grant EVOCHLAMY 281633), and the University of Vienna (Graduate School 'Symbiotic Interactions'). BT was supported by a grant from the Natural Sciences and Engineering Research Council of Canada (grant #184053). TRB is funded by the Centre for Biosciences. EST sequencing was supported by grants (CMRPD1A0581, CMRPG450131, and SMRPD160011) from Chang Gung Memorial Hospital to PT and CHC. JLM was supported by the Ramón y Cajal Subprogramme of the Spanish Ministry of Economy and Competitiveness (RYC-2011-08863).
- De Jonckheere JF: Ecology of Acanthamoeba. Rev Infect Dis. 1991, 13 (Suppl 5): S385-387.
- Rosenberg K, Bertaux J, Krome K, Hartmann A, Scheu S, Bonkowski M: Soil amoebae rapidly change bacterial community composition in the rhizosphere of Arabidopsis thaliana. ISME J. 2009, 3: 675-684. 10.1038/ismej.2009.11.
- Nwachuku N, Gerba CP: Health effects of Acanthamoeba spp. and its potential for waterborne transmission. Rev Environ Contam Toxicol. 2004, 180: 93-131. 10.1007/0-387-21729-0_2.
- Thomas V, McDonnell G, Denyer SP, Maillard JY: Free-living amoebae and their intracellular pathogenic microorganisms: risks for water quality. FEMS Microbiol Rev. 2009.
- Horn M, Wagner M: Bacterial endosymbionts of free-living amoebae. J Eukaryot Microbiol. 2004, 51: 509-514. 10.1111/j.1550-7408.2004.tb00278.x.
- Horn M: Chlamydiae as symbionts in eukaryotes. Annu Rev Microbiol. 2008, 62: 113-131. 10.1146/annurev.micro.62.081307.162818.
- Greub G, Raoult D: Microorganisms resistant to free-living amoebae. Clin Microbiol Rev. 2004, 17: 413-433. 10.1128/CMR.17.2.413-433.2004.
- Molmeret M, Horn M, Wagner M, Santic M, Abu Kwaik Y: Amoebae as training grounds for intracellular bacterial pathogens. Appl Environ Microbiol. 2005, 71: 20-28. 10.1128/AEM.71.1.20-28.2005.
- Salah IB, Ghigo E, Drancourt M: Free-living amoebae, a training field for macrophage resistance of mycobacteria. Clin Microbiol Infect. 2009, 15: 894-905. 10.1111/j.1469-0691.2009.03011.x.
- Winiecka-Krusnell J, Linder E: Free-living amoebae protecting Legionella in water: the tip of an iceberg?. Scand J Infect Dis. 1999, 31: 383-385. 10.1080/00365549950163833.
- Dallaire-Dufresne S, Paquet VE, Charette SJ: [Dictyostelium discoideum: a model for the study of bacterial virulence]. Can J Microbiol. 2011, 57: 699-707. 10.1139/w11-072.
- Sandström G, Saeed A, Abd H: Acanthamoeba-bacteria: a model to study host interaction with human pathogens. Curr Drug Targets. 2011, 12: 936-941. 10.2174/138945011795677845.
- Eichinger L, Pachebat JA, Glockner G, Rajandream MA, Sucgang R, Berriman M, Song J, Olsen R, Szafranski K, Xu Q, Tunggal B, Kummerfeld S, Madera M, Konfortov BA, Rivero F, Bankier AT, Lehmann R, Hamlin N, Davies R, Gaudet P, Fey P, Pilcher K, Chen G, Saunders D, Sodergren E, Davis P, Kerhornou A, Nie X, Hall N, Anjard C, et al: The genome of the social amoeba Dictyostelium discoideum. Nature. 2005, 435: 43-57. 10.1038/nature03481.
- Loftus B, Anderson I, Davies R, Alsmark UC, Samuelson J, Amedeo P, Roncaglia P, Berriman M, Hirt RP, Mann BJ, Nozaki T, Suh B, Pop M, Duchene M, Ackers J, Tannich E, Leippe M, Bruchhaus I, Willhoeft U, Bhattacharya A, Chillingworth T, Churcher C, Hance Z, Harris B, Harris D, Jagels K, Moule S, Mungall K, Ormond D, et al: The genome of the protist parasite Entamoeba histolytica. Nature. 2005, 433: 865-868. 10.1038/nature03291.
- Bowers B, Korn ED: The fine structure of Acanthamoeba castellanii (Neff strain). II. Encystment. J Cell Biol. 1969, 41: 786-805. 10.1083/jcb.41.3.786.
- Marciano-Cabral F, Cabral G: Acanthamoeba spp. as agents of disease in humans. Clin Microbiol Rev. 2003, 16: 273-307. 10.1128/CMR.16.2.273-307.2003.
- Pollard TD, Korn ED: Acanthamoeba myosin. I. Isolation from Acanthamoeba castellanii of an enzyme similar to muscle myosin. J Biol Chem. 1973, 248: 4682-4690.
- Ulsamer AG, Smith FR, Korn ED: Lipids of Acanthamoeba castellanii. Composition and effects of phagocytosis on incorporation of radioactive precursors. J Cell Biol. 1969, 43: 105-114. 10.1083/jcb.43.1.105.
- Keeling PJ, Palmer JD: Horizontal gene transfer in eukaryotic evolution. Nat Rev Genet. 2008, 9: 605-618. 10.1038/nrg2386.
- Merhej V, Notredame C, Royer-Carenzi M, Pontarotti P, Raoult D: The rhizome of life: the sympatric Rickettsia felis paradigm demonstrates the random transfer of DNA sequences. Mol Biol Evol. 2011, 28: 3213-3223. 10.1093/molbev/msr239.
- Thomas V, Greub G: Amoeba/amoebal symbiont genetic transfers: lessons from giant virus neighbours. Intervirology. 2010, 53: 254-267. 10.1159/000312910.
- Nelson WC, Bhaya D, Heidelberg JF: Novel miniature transposable elements in thermophilic Synechococcus and their impact on an environmental population. J Bacteriol. 2012, 194: 3636-3642. 10.1128/JB.00333-12.
- Andersson JO: Gene transfer and diversification of microbial eukaryotes. Annu Rev Microbiol. 2009, 63: 177-193. 10.1146/annurev.micro.091208.073203.
- Lynch M, Conery JS: The origins of genome complexity. Science. 2003, 302: 1401-1404. 10.1126/science.1089370.
- Roy SW, Fedorov A, Gilbert W: Large-scale comparison of intron positions in mammalian genes shows intron loss but no gain. Proc Natl Acad Sci USA. 2003, 100: 7158-7162. 10.1073/pnas.1232297100.
- Li W, Tucker AE, Sung W, Thomas WK, Lynch M: Extensive, recent intron gains in Daphnia populations. Science. 2009, 326: 1260-1262. 10.1126/science.1179302.
- Roy SW: Intron-rich ancestors. Trends Genet. 2006, 22: 468-471. 10.1016/j.tig.2006.07.002.
- Roy SW, Irimia M, Penny D: Very little intron gain in Entamoeba histolytica genes laterally transferred from prokaryotes. Mol Biol Evol. 2006, 23: 1824-1827. 10.1093/molbev/msl061.
- Fredriksson R, Schiöth HB: The repertoire of G-protein-coupled receptors in fully sequenced genomes. Mol Pharmacol. 2005, 67: 1414-1425. 10.1124/mol.104.009001.
- Hoffman CS: Glucose sensing via the protein kinase A pathway in Schizosaccharomyces pombe. Biochem Soc Trans. 2005, 33: 257-260.
- Huang HC, Klein PS: The Frizzled family: receptors for multiple signal transduction pathways. Genome Biol. 2004, 5: 234. 10.1186/gb-2004-5-7-234.
- Gaillard I, Rouquier S, Giorgi D: Olfactory receptors. Cell Mol Life Sci. 2004, 61: 456-469. 10.1007/s00018-003-3273-7.
- Iyer LM, Anantharaman V, Aravind L: Ancient conserved domains shared by animal soluble guanylyl cyclases and bacterial signaling proteins. BMC Genomics. 2003, 4: 5. 10.1186/1471-2164-4-5.
- Fitzpatrick DA, O'Halloran DM, Burnell AM: Multiple lineage specific expansions within the guanylyl cyclase gene family. BMC Evol Biol. 2006, 6: 26. 10.1186/1471-2148-6-26.
- Jekely G: Evolution of phototaxis. Phil Trans R Soc Lond B Biol Sci. 2009, 364: 2795-2808. 10.1098/rstb.2009.0072.
- Dudley R, Jarroll EL, Khan NA: Carbohydrate analysis of Acanthamoeba castellanii. Exp Parasitol. 2009, 122: 338-343. 10.1016/j.exppara.2009.04.009.
- Segall JE, Kuspa A, Shaulsky G, Ecke M, Maeda M, Gaskins C, Firtel RA, Loomis WF: A MAP kinase necessary for receptor-mediated activation of adenylyl cyclase in Dictyostelium. J Cell Biol. 1995, 128: 405-413. 10.1083/jcb.128.3.405.
- Suga H, Dacre M, de Mendoza A, Shalchian-Tabrizi K, Manning G, Ruiz-Trillo I: Genomic survey of premetazoans shows deep conservation of cytoplasmic tyrosine kinases and multiple radiations of receptor tyrosine kinases. Sci Signal. 2012, 5: ra35. 10.1126/scisignal.2002733.
- Lim WA, Pawson T: Phosphotyrosine signaling: evolving a new cellular communication system. Cell. 2010, 142: 661-667. 10.1016/j.cell.2010.08.023.
- Liu BA, Shah E, Jablonowski K, Stergachis A, Engelmann B, Nash PD: The SH2 domain-containing proteins in 21 species establish the provenance and scope of phosphotyrosine signaling in eukaryotes. Sci Signal. 2011, 4: ra83. 10.1126/scisignal.2002105.
- Tan JL, Spudich JA: Developmentally regulated protein-tyrosine kinase genes in Dictyostelium discoideum. Mol Cell Biol. 1990, 10: 3578-3583.
- Tan CS, Pasculescu A, Lim WA, Pawson T, Bader GD, Linding R: Positive selection of tyrosine loss in metazoan evolution. Science. 2009, 325: 1686-1688. 10.1126/science.1174301.
- Li L, Tibiche C, Fu C, Kaneko T, Moran MF, Schiller MR, Li SS, Wang E: The human phosphotyrosine signaling network: evolution and hotspots of hijacking in cancer. Genome Res. 2012, 22: 1222-1230. 10.1101/gr.128819.111.
- Stuart LM, Ezekowitz RA: Phagocytosis: elegant complexity. Immunity. 2005, 22: 539-550. 10.1016/j.immuni.2005.05.002.
- Watnick PI, Fullner KJ, Kolter R: A role for the mannose-sensitive hemagglutinin in biofilm formation by Vibrio cholerae El Tor. J Bacteriol. 1999, 181: 3606-3609.
- Beckmann G, Bork P: An adhesive domain detected in functionally diverse receptors. Trends Biochem Sci. 1993, 18: 40-41. 10.1016/0968-0004(93)90049-S.
- Hong YC, Lee WM, Kong HH, Jeong HJ, Chung DI: Molecular cloning and characterization of a cDNA encoding a laminin-binding protein (AhLBP) from Acanthamoeba healyi. Exp Parasitol. 2004, 106: 95-102. 10.1016/j.exppara.2004.01.011.
- Harpaz Y, Chothia C: Many of the immunoglobulin superfamily domains in cell adhesion molecules and surface receptors belong to a new structural set which is close to that containing variable domains. J Mol Biol. 1994, 238: 528-539. 10.1006/jmbi.1994.1312.
- Ausubel FM: Are innate immune signaling pathways in plants and animals conserved?. Nat Immunol. 2005, 6: 973-979. 10.1038/ni1253.
- Fujita T: Evolution of the lectin-complement pathway and its role in innate immunity. Nat Rev Immunol. 2002, 2: 346-353. 10.1038/nri800.
- Tissières P, Pugin J: The role of MD-2 in the opsonophagocytosis of Gram-negative bacteria. Curr Opin Infect Dis. 2009, 22: 286-291. 10.1097/QCO.0b013e32832ae2fc.
- Alsam S, Sissons J, Dudley R, Khan NA: Mechanisms associated with Acanthamoeba castellanii (T4) phagocytosis. Parasitol Res. 2005, 96: 402-409. 10.1007/s00436-005-1401-z.
- Garate M, Cubillos I, Marchant J, Panjwani N: Biochemical characterization and functional studies of Acanthamoeba mannose-binding protein. Infect Immun. 2005, 73: 5775-5781. 10.1128/IAI.73.9.5775-5781.2005.
- Watanabe Y, Tateno H, Nakamura-Tsuruta S, Kominami J, Hirabayashi J, Nakamura O, Watanabe T, Kamiya H, Naganuma T, Ogawa T, Naudé RJ, Muramoto K: The function of rhamnose-binding lectin in innate immunity by restricted binding to Gb3. Dev Comp Immunol. 2009, 33: 187-197. 10.1016/j.dci.2008.08.008.
- Hynes RO, Zhao Q: The evolution of cell adhesion. J Cell Biol. 2000, 150: F89-96. 10.1083/jcb.150.2.F89.
- Ghigo E, Kartenbeck J, Lien P, Pelkmans L, Capo C, Mege JL, Raoult D: Ameobal pathogen Mimivirus infects macrophages through phagocytosis. PLoS Pathogens. 2008, 4: e1000087. 10.1371/journal.ppat.1000087.
- Parakkottil Chothi M, Duncan GA, Armirotti A, Abergel C, Gurnon JR, Van Etten JL, Bernardi C, Damonte G, Tonetti M: Identification of an L-rhamnose synthetic pathway in two nucleocytoplasmic large DNA viruses. J Virol. 2010, 84: 8829-8838. 10.1128/JVI.00770-10.
- Poole S, Firtel RA, Lamar E, Rowekamp W: Sequence and expression of the discoidin I gene family in Dictyostelium discoideum. J Mol Biol. 1981, 153: 273-289. 10.1016/0022-2836(81)90278-3.
- Sanchez JF, Lescar J, Chazalet V, Audfray A, Gagnon J, Alvarez R, Breton C, Imberty A, Mitchell EP: Biochemical and structural analysis of Helix pomatia agglutinin. A hexameric lectin with a novel fold. J Biol Chem. 2006, 281: 20171-20180. 10.1074/jbc.M603452200.
- Zambounis A, Elias M, Sterck L, Maumus F, Gachon CM: Highly dynamic exon shuffling in candidate pathogen receptors... What if brown algae were capable of adaptive immunity?. Mol Biol Evol. 2012, 29: 1263-1276. 10.1093/molbev/msr296.
- Koonin EV: Taming of the shrewd: novel eukaryotic genes from RNA viruses. BMC Biol. 2010, 8: 2. 10.1186/1741-7007-8-2.
- Aliyari R, Ding SW: RNA-based viral immunity initiated by the Dicer family of host immune receptors. Immunol Rev. 2009, 227: 176-188. 10.1111/j.1600-065X.2008.00722.x.
- Singh R, Jamieson A, Cresswell P: GILT is a critical host factor for Listeria monocytogenes infection. Nature. 2008, 455: 1244-1247. 10.1038/nature07344.
- MacMicking JD: Interferon-inducible effector mechanisms in cell-autonomous immunity. Nat Rev Immunol. 2012, 12: 367-382. 10.1038/nri3210.
- Peracino B, Wagner C, Balest A, Balbo A, Pergolizzi B, Noegel AA, Steinert M, Bozzaro S: Function and mechanism of action of Dictyostelium Nramp1 (Slc11a1) in bacterial infection. Traffic. 2006, 7: 22-38. 10.1111/j.1600-0854.2005.00356.x.
- Hug LA, Stechmann A, Roger AJ: Phylogenetic distributions and histories of proteins involved in anaerobic pyruvate metabolism in eukaryotes. Mol Biol Evol. 2010, 27: 311-324. 10.1093/molbev/msp237.
- Ginger ML, Fritz-Laylin LK, Fulton C, Cande WZ, Dawson SC: Intermediary metabolism in protists: a sequence-based view of facultative anaerobic metabolism in evolutionarily diverse eukaryotes. Protist. 2010, 161: 642-671. 10.1016/j.protis.2010.09.001.
- Slamovits CH, Keeling PJ: Pyruvate-phosphate dikinase of oxymonads and parabasalia and the evolution of pyrophosphate-dependent glycolysis in anaerobic eukaryotes. Eukaryot Cell. 2006, 5: 148-154. 10.1128/EC.5.1.148-154.2006.
- Loenarz C, Coleman ML, Boleininger A, Schierwater B, Holland PW, Ratcliffe PJ, Schofield CJ: The hypoxia-inducible transcription factor pathway regulates oxygen sensing in the simplest animal, Trichoplax adhaerens. EMBO Rep. 2011, 12: 63-70. 10.1038/embor.2010.170.
- Rytkönen KT, Storz JF: Evolutionary origins of oxygen sensing in animals. EMBO Rep. 2011, 12: 3-4. 10.1038/embor.2010.192.
- Sucgang R, Kuo A, Tian X, Salerno W, Parikh A, Feasley CL, Dalin E, Tu H, Huang E, Barry K, Lindquist E, Shapiro H, Bruce D, Schmutz J, Salamov A, Fey P, Gaudet P, Anjard C, Babu MM, Basu S, Bushmanova Y, van der Wel H, Katoh Kurasawa M, Dinh C, Coutinho PM, Saito T, Elias M, Schaap P, Kay RR, Henrissat B, et al: Comparative genomics of the social amoebae Dictyostelium discoideum and Dictyostelium purpureum. Genome Biol. 2011, 12: R20. 10.1186/gb-2011-12-2-r20.
- Duszenko M, Ginger ML, Brennand A, Gualdrón-López M, Colombo MI, Coombs GH, Coppens I, Jayabalasingham B, Langsley G, de Castro SL, Menna-Barreto R, Mottram JC, Navarro M, Rigden DJ, Romano PS, Stoka V, Turk B, Michels PA: Autophagy in protists. Autophagy. 2011, 7: 127-158. 10.4161/auto.7.2.13310.
- MacPherson S, Larochelle M, Turcotte B: A fungal family of transcriptional regulators: the zinc cluster proteins. Microbiol Mol Biol Rev. 2006, 70: 583-604. 10.1128/MMBR.00015-06.
- Bürglin TR: Homeodomain subtypes and functional diversity. Sub-Cell Biochem. 2011, 52: 95-122. 10.1007/978-90-481-9069-0_5.
- Han Z, Firtel RA: The homeobox-containing gene Wariai regulates anterior-posterior patterning and cell-type homeostasis in Dictyostelium. Development. 1998, 125: 313-325.
- Piasecki BP, Burghoorn J, Swoboda P: Regulatory Factor X (RFX)-mediated transcriptional rewiring of ciliary genes in animals. Proc Natl Acad Sci USA. 2010, 107: 12969-12974. 10.1073/pnas.0914241107.
- Krome K, Rosenberg K, Dickler C, Kreuzer K, Ludwig-Müller J, Ullrich-Eberius C, Scheu S, Bonkowski M: Soil bacteria and protozoa affect root branching via effects on the auxin and cytokinin balance in plants. Plant Soil. 2010, 328: 191-201. 10.1007/s11104-009-0101-3.
- Finkler A, Ashery-Padan R, Fromm H: CAMTAs: calmodulin-binding transcription activators from plants to human. FEBS Lett. 2007, 581: 3893-3898. 10.1016/j.febslet.2007.07.051.
- Galon Y, Aloni R, Nachmias D, Snir O, Feldmesser E, Scrase-Field S, Boyce JM, Bouché N, Knight MR, Fromm H: Calmodulin-binding transcription activator 1 mediates auxin signaling and responds to stresses in Arabidopsis. Planta. 2010, 232: 165-178. 10.1007/s00425-010-1153-6.
- Eichinger L, Noegel AA: Comparative genomics of Dictyostelium discoideum and Entamoeba histolytica. Curr Opin Microbiol. 2005, 8: 606-611. 10.1016/j.mib.2005.08.009.
- Song J, Xu Q, Olsen R, Loomis WF, Shaulsky G, Kuspa A, Sucgang R: Comparing the Dictyostelium and Entamoeba genomes reveals an ancient split in the Conosa lineage. PLoS Comput Biol. 2005, 1: e71. 10.1371/journal.pcbi.0010071.
- Raoult D, Boyer M: Amoebae as genitors and reservoirs of giant viruses. Intervirology. 2010, 53: 321-329. 10.1159/000312917.
- Lurie-Weinberger MN, Gomez-Valero L, Merault N, Glöckner G, Buchrieser C, Gophna U: The origins of eukaryotic-like proteins in Legionella pneumophila. Int J Med Microbiol. 2010, 300: 470-481. 10.1016/j.ijmm.2010.04.016.
- Plate L, Marletta MA: Nitric oxide modulates bacterial biofilm formation through a multicomponent cyclic-di-GMP signaling network. Mol Cell. 2012, 46: 449-460. 10.1016/j.molcel.2012.03.023.
- Martel CM: Conceptual bases for prey biorecognition and feeding selectivity in the microplanktonic marine phagotroph Oxyrrhis marina. Microbial Ecol. 2009, 57: 589-597. 10.1007/s00248-008-9421-8.
- Anantharaman V, Iyer LM, Aravind L: Comparative genomics of protists: new insights into the evolution of eukaryotic signal transduction and gene regulation. Annu Rev Microbiol. 2007, 61: 453-475. 10.1146/annurev.micro.61.080706.093309.
- Aderem A, Underhill DM: Mechanisms of phagocytosis in macrophages. Annu Rev Immunol. 1999, 17: 593-623. 10.1146/annurev.immunol.17.1.593.
- Boettner DR, Huston CD, Linford AS, Buss SN, Houpt E, Sherman NE, Petri WA: Entamoeba histolytica phagocytosis of human erythrocytes involves PATMK, a member of the transmembrane kinase family. PLoS Pathogens. 2008, 4: e8. 10.1371/journal.ppat.0040008.
- Sun T, Kim L: Tyrosine phosphorylation-mediated signaling pathways in Dictyostelium. J Signal Transduction. 2011, 2011: 894351.
- Turkarslan S, Reiss DJ, Gibbins G, Su WL, Pan M, Bare JC, Plaisier CL, Baliga NS: Niche adaptation by expansion and reprogramming of general transcription factors. Mol Systems Biol. 2011, 7: 554.
- Lohan AJ, Gray MW: Analysis of 5'- or 3'-terminal tRNA editing: mitochondrial 5' tRNA editing in Acanthamoeba castellanii as the exemplar. Methods Enzymol. 2007, 424: 223-242.
- Spencer DF, Schnare MN, Gray MW: Isolation of wheat mitochondrial DNA and RNA. Modern Methods of Plant Analysis New Series. Edited by: Linskens HF, Jackson JF. Springer-Verlag, Berlin. 1992, 14: 347-360.
- Weissenmayer BA, Prendergast JG, Lohan AJ, Loftus BJ: Sequencing illustrates the transcriptional response of Legionella pneumophila during infection and identifies seventy novel small non-coding RNAs. PLoS One. 2011, 6: e17570. 10.1371/journal.pone.0017570.
- Parkhomchuk D, Borodina T, Amstislavskiy V, Banaru M, Hallen L, Krobitsch S, Lehrach H, Soldatov A: Transcriptome analysis by strand-specific sequencing of complementary DNA. Nucleic Acids Res. 2009, 37: e123. 10.1093/nar/gkp596.
- Craig DW, Pearson JV, Szelinger S, Sekar A, Redman M, Corneveaux JJ, Pawlowski TL, Laub T, Nunn G, Stephan DA, Homer N, Huentelman MJ: Identification of genetic variants using bar-coded multiplexed sequencing. Nat Methods. 2008, 5: 887-893. 10.1038/nmeth.1251.
- Zerbino DR, Birney E: Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008, 18: 821-829. 10.1101/gr.074492.107.
- Denoeud F, Aury JM, Da Silva C, Noel B, Rogier O, Delledonne M, Morgante M, Valle G, Wincker P, Scarpelli C, Jaillon O, Artiguenave F: Annotating genomes with massive-scale RNA sequencing. Genome Biol. 2008, 9: R175. 10.1186/gb-2008-9-12-r175.
- Korf I: Gene finding in novel genomes. BMC Bioinformatics. 2004, 5: 59. 10.1186/1471-2105-5-59.
- Cantarel BL, Korf I, Robb SM, Parra G, Ross E, Moore B, Holt C, Sanchez Alvarado A, Yandell M: MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 2008, 18: 188-196.
- MAKER. [http://www.yandell-lab.org/software/maker.html]
- Lewis SE, Searle SM, Harris N, Gibson M, Iyer V, Richter J, Wiel C, Bayraktaroglu L, Birney E, Crosby MA, Kaminker JS, Matthews BB, Prochnik SE, Smith CD, Tupy JL, Rubin GM, Misra S, Mungall CJ, Clamp ME: Apollo: a sequence annotation editor. Genome Biol. 2002, 3: RESEARCH0082.
- Apollo Genome Annotation Curation Tool. [http://apollo.berkeleybop.org/current/index.html]
- Lorenzi HA, Puiu D, Miller JR, Brinkac LM, Amedeo P, Hall N, Caler EV: New assembly, reannotation and analysis of the Entamoeba histolytica genome reveal new genomic features and protein content information. PLoS Negl Trop Dis. 2010, 4: e716. 10.1371/journal.pntd.0000716.
- Uniref. [http://www.ebi.ac.uk/uniref/]
- Pfam. [http://pfam.sanger.ac.uk/]
- TIGRFAMs. [http://www.jcvi.org/cgi-bin/tigrfams/index.cgi]
- Prosite. [http://prosite.expasy.org/]
- InterPro. [http://www.ebi.ac.uk/interpro/]
- Camon E, Magrane M, Barrell D, Lee V, Dimmer E, Maslen J, Binns D, Harte N, Lopez R, Apweiler R: The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology. Nucleic Acids Res. 2004, 32: D262-266. 10.1093/nar/gkh021.
- Broad Institute. [http://www.broadinstitute.org/]
- Rattei T, Tischler P, Gotz S, Jehl M-A, Hoser J, Arnold R, Conesa A, Mewes H-W: SIMAP--a comprehensive database of pre-calculated protein sequence similarities, domains, annotations and clusters. Nucleic Acids Res. 2010, 38: D223-226. 10.1093/nar/gkp949.
- Frickey T, Lupas AN: PhyloGenie: automated phylome generation and analysis. Nucleic Acids Res. 2004, 32: 5231-5238. 10.1093/nar/gkh867.
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.