- Open Access
Comparative analyses of Legionella species identifies genetic features of strains causing Legionnaires’ disease
Genome Biology volume 15, Article number: 505 (2014)
The genus Legionella comprises over 60 species. However, L. pneumophila and L. longbeachae alone cause over 95% of Legionnaires’ disease. To identify the genetic bases underlying the different capacities to cause disease we sequenced and compared the genomes of L. micdadei, L. hackeliae and L. fallonii (LLAP10), which are all rarely isolated from humans.
We show that these Legionella species possess different virulence capacities in amoeba and macrophages, correlating with their occurrence in humans. Our comparative analysis of 11 Legionella genomes belonging to five species reveals highly heterogeneous genome content with over 60% representing species-specific genes; these comprise a complete prophage in L. micdadei, the first ever identified in a Legionella genome. Mobile elements are abundant in Legionella genomes; many encode type IV secretion systems for conjugative transfer, pointing to their importance for adaptation of the genus. The Dot/Icm secretion system is conserved, although the core set of substrates is small, as only 24 out of over 300 described Dot/Icm effector genes are present in all Legionella species. We also identified new eukaryotic motifs including thaumatin, synaptobrevin or clathrin/coatomer adaptine like domains.
Legionella genomes are highly dynamic due to a large mobilome mainly comprising type IV secretion systems, while a minority of core substrates is shared among the diverse species. Eukaryotic like proteins and motifs remain a hallmark of the genus Legionella. Key factors such as proteins involved in oxygen binding, iron storage, host membrane transport and certain Dot/Icm substrates are specific features of disease-related strains.
Among the many pathogens provoking severe pneumonia, the Gram-negative bacteria Legionella pneumophila and Legionella longbeachae are responsible for Legionnaires’ disease, a severe pneumonia that can be deadly if not treated promptly . Although several of the more than 60 species described in the genus Legionella may cause disease, L. pneumophila is the major agent, responsible for nearly 90% of all cases worldwide. L. longbeachae comes second, causing around 2 to 7% of cases with the exception of Australia and New Zealand, where it is associated with 30% of Legionnaires’ disease cases . Legionella micdadei, Legionella bozemanii, Legionella dumoffii, Legionella anisa, Legionella wadsworthii and Legionella feelei are rarely found in humans and the remaining Legionella species have never or only once been isolated from humans . This highly significant difference in disease incidence among Legionella species may be due to different environmental distributions and/or to different virulence potential for humans. Few studies have analyzed the environmental distribution of Legionella, although one survey in France showed that L. pneumophila, which had a prevalence of 95.4% in clinical isolates, was found in only 28.2% of the environmental samples tested, whereas L. anisa was isolated in 13.8% of the environmental samples but found only once (0.8%) in a clinical isolate . Similarly, a more recent report from Denmark showed that only 4.5% of clinical cases were due to non-L. pneumophila strains and reported a strong discrepancy in the occurrence of different Legionella species in clinical and environmental isolates . For example, L. anisa was highly abundant in the environment, but never found in clinical isolates. In contrast, L. bozemanni, L. longbeachae and L. micdadei were identified in clinical samples but never or rarely in environmental samples . Furthermore, different Legionella species also seem to have a different host range and different capacities to infect human cells ,. Taken together, independently of the environmental distribution, different Legionella species seem also to possess different abilities to infect eukaryotic cells and to cause disease in humans.
After publication of the L. pneumophila genome sequence in 2004 , and that of L. longbeachae in 2010 , several additional L. pneumophila strains have been sequenced - as well as a few draft genome sequences of other species. However, apart from Legionella oakridgensis  none has been analyzed in detail. Thus the vast majority of comprehensively analyzed genome sequences are from the major human pathogens L. pneumophila (eight genomes) and L. longbeachae (two genomes). In order to deepen our insight into species never or rarely found in human disease, we completely sequenced and analyzed the genomes of three Legionella species, L. micdadei, Legionella hackeliae and Legionella fallonii (LLAP10), selected based on their different epidemiological characteristics compared with L. pneumophila and L. longbeachae. L. micdadei is found in less than 1% of community-acquired pneumonia , L. hackeliae has been isolated from humans only once , and L. fallonii has never been reported to cause disease. L. fallonii was originally designated LLAP10 for `legionella-like amoebal pathogen 10’ , a term coined by Rowbotham for bacteria that caused legionella-like infections in amoebae, but could not be grown on agar media.
Here we analyze and compare the L. micdadei, L. hackeliae and L. fallonii genomes and compare them with seven previously completely sequenced L. pneumophila (Paris, Philadelphia, Lens, Corby, Alcoy, Lorraine and HL06041035) ,,, and one L. longbeachae NSW150 genome sequence . We confirm that the presence of 'eukaryotic-like proteins' (ELPs) is indeed a specific feature of the genus Legionella and extend the knowledge of these proteins further by identifying additional eukaryotic motifs. Analyses of the virulence of the different Legionella species in protozoan and human cells correlated with the genetic content and allowed us to identify specific features of human pathogenic Legionella and to define a core set of 24 type IV secretion system (T4SS) effectors present in the Legionella species examined to date.
Results and discussion
L. micdadei, L. hackeliae and L. falloniishow different virulence in amoeba or macrophages
Little to nothing is known about the environmental distribution and the virulence of different Legionella species for human cells. Similarly, it is not known why L. pneumophila and L. longbeachae are so predominant in human disease compared with other Legionella species. As a first step to understand these differences we analyzed the capacity of L. micdadei, L. hackeliae and L. fallonii to infect the protozoan species Acanthamoeba castellanii and the human monocytic cell line THP-1. As shown in Figure 1A, L. micdadei replicated in THP-1 cells, similar to L. pneumophila, while L. fallonii and L. hackeliae were unable to replicate in these cells, although they are phagocytosed efficiently as seen from the higher numbers entering the cells after one hour of infection (Figure 1A). In contrast, L. fallonii was able to replicate in A. castellanii (Figure 1B). However, neither L. hackeliae nor L. micdadei replicated in this amoeba. Thus, additional experiments are necessary to analyze whether A. castellani is their environmental host or not (Figure 1B). Similar results have been obtained using Dictyostelium discoideum as a host where L. micdadei can replicate in this model amoeba but L. hackeliae cannot . In contrast, it was reported that L. micdadei is able to replicate in A. castellani ,. Puzzled by these contradicting results we further analyzed the infection capacity of L. micdadei. Our infection assays had been carried out at 20°C whereas Hägele and colleagues  performed their infections at 30°C. We thought that the different results might be due to the different temperatures used. We thus carried out infection assays at 30°C and also used amoeba plate testing  at 37°C and 30°C (Figure 1C). Indeed, L. micdadei was able to replicate in A. castellani at 37°C and also at 30°C, although to a lesser extent compared with L. pneumophila (Additional file 1). This suggested that the replication capacity of L. micdadei in A. castellanii is temperature dependent.
Taken together the replication capacity of the different Legionella species in amoeba and human cells differed in a way similar to the epidemiological data for these species. This suggests that common as well as species-specific mechanisms might be involved in Legionella infection and replication in human cells.
The Legionellagenomes have similar genomic features but very different genome content
At approximately 3.5 Mb, the genome sizes of L. hackeliae and L. micdadei are similar to that of L. pneumophila whereas that of L. fallonii is similar to that of L. longbeachae at approximately 4 Mb (Table 1). The GC content is highly homogenous (approximately 39%) and the gene order is relatively well conserved. Apart from L. micdadei, each strain contained one or two plasmids between 14 and 238 kb in size (Table 1). When five different L. pneumophila genomes were compared the pan-genome comprised 2,957 genes, the core-genome of the species L. pneumophila contained 1,979 genes and the calculation of the rarefraction curves indicated that L. pneumophila has an open pan-genome . This held true when we analyzed 11 Legionella genomes here (seven L. pneumophila strains and one strain each of L. longbeachae, L. micdadei, L. hackeliae and L. fallonii); the Legionella pan-genome increased considerably to 9,194 genes and the core genome was 1,388 genes (Figure 2A) or 1,415 genes when comparing one strain of each sequenced species (L. pneumophila Paris as representative) (Figure 2B). Thus, the core genome of Legionella represents only about 15% of the pan-genome, indicating that the Legionella accessory genome is large. The complete annotation of these three newly sequenced genomes is available in the LegionellaScope database  and at the Institut Pasteur, LegioList .
To establish a whole genome-based phylogeny of these Legionella species we used either 29 housekeeping genes or 816 orthologous genes shared among the 11 Legionella strains analyzed. Coxiella burnetii was used as outgroup. Phylogenetic reconstructions using either the nucleotide or the amino acid sequences gave the same tree topology for the different species. In contrast, the tree topology of the L. pneumophila strains was different depending on the data set or the phylogenetic method used, probably due to the high recombination rate of this species ,. Our phylogenetic analyses showed that L. pneumophila, L. fallonii and L. longbeachae group together, with L. fallonii being phylogenetically the closest to L. pneumophila. L. micdadei and L. hackeliae formed a second cluster (Figure 3). Except for the place of L. fallonii, this is in agreement with previous phylogenies of the genus Legionella ,. In previous work L. pneumophila was described as phylogenetically closer to L. longbeachae than to L. fallonii  or L. fallonii closer to L. longbeachae than to L. pneumophila . However, these studies are based on 16S RNA sequences and bootstrap values associated with the corresponding nodes to evaluate its statistical support are not provided.
In conclusion, the general features of the Legionella genomes are very similar but each Legionella species has a distinctive genomic content with about 60% of genes being species-specific. Interestingly, human pathogenic and non-pathogenic species were mixed in the phylogeny, which indicates that virulent traits favoring human infection have been acquired independently during the evolution of the genus.
Type II and IVB secretion systems are part of the core genome of Legionella
Like in other bacterial genera, the core genome of Legionella contains the genes encoding fundamental metabolic pathways and the ribosomal machinery. In addition, the Dot/Icm type IVB secretion system (T4BSS) as well as the Lsp type II secretion system (T2SS), both indispensable for intracellular replication, also belong to the core genome of this genus. The chromosomal organization of the Dot/Icm and the Lsp secretion system is also conserved, except for the genes icmD and icmC, which are duplicated in L. fallonii. Interestingly, the degree of conservation of the different Dot/Icm proteins is very variable, ranging from >90% for DotB to proteins without any homology such as IcmR. Surprisingly, DotA, an integral inner membrane protein  indispensable for intracellular growth , is one of the least conserved proteins of the Dot/Icm T4SS (Additional file 2). Unexpectedly, the sequenced L. hackeliae strain (ATCC35250) had a stop codon in the gene coding for DotA, splitting it into 984 and 2,040 nucleotide fragments. Resequencing of the dotA gene confirmed the presence of the stop codon. As this strain was not able to replicate in A. castellanii, we thought that this might be due to the mutated dotA gene leading to a non-functional T4SS. To verify if this mutation was specific for the sequenced strain, we analyzed the dotA gene in a second L. hackeliae strain (ATCC35999). In this strain the dotA gene was intact. Thus, the dotA gene fragmentation in the sequenced strain probably occurred during storage. However, when testing the virulence of both L. hackeliae strains in A. castellanii using the amoeba plate test, neither were able to replicate at 30°C or at 37°C (data not shown). To analyze if the Dot/Icm secretion system was functional in the sequenced strains, we used the calmodulin-dependent adenylate cyclase (CyaA) gene fusion approach  and RalF from L. pneumophila  for L. hackeliae, L. micdadei and L. fallonii. However, several attempts to show secretion of RalF in one of these strains failed, as RalF was never expressed in them despite testing under several different conditions. Thus, further experiments are necessary to adapt this assay to the here newly sequenced Legionella species.
Another particularity of the Dot/Icm system is the icmR gene. Indeed, similar to what was reported for L. hackeliae and L. micdadei where icmR was replaced by a non-homologous gene with functional equivalence ,, a gene encoding a protein with no similarity to any previously described protein is present in the position of icmR in L. fallonii, possibly serving as a functional equivalent of icmR of L. pneumophila. Other variable genes include icmX and icmG. IcmG has been described as a component that interacts with the effector proteins , which may explain the high variability in different species. In contrast, the components dotB, icmS, icmW and icmP are highly conserved. Indeed, these four genes can functionally replace their homologues in C. burnetii .
The L. micdadei, L. hackeliae and L. falloniigenomes encode surprising functions
L. fallonii is able to synthesize cellulose
Enzymes degrading cellulose have been described in L. longbeachae and were also found in L. fallonii. However, in addition the L. fallonii genome encodes a complete machinery for the synthesis of cellulose (Figure 4A). Although the bacterial need for cellulose may be surprising, cellulose has been reported as a common component of biofilms of several bacterial species such as Salmonella enterica or Escherichia coli . The bacterial genes for cellulose synthesis are called bcsABZC. In S. enterica and E. coli a second operon necessary for cellulose biosynthesis named bcsEFG is present ,. Both clusters (from lfa3354 to lfa3363 and lfa2987 to lfa2988) are present in L. fallonii, although with some differences in organization (Figure 4A). To analyze whether L. fallonii is able to synthesize cellulose, we used agar plates containing calcofluor, which binds cellulose and leads to fluorescence under UV radiation. Indeed, L. fallonii showed strong fluorescence under long-wave UV light, in contrast to L. pneumophila (Figure 4B), demonstrating cellulose biosynthesis in the genus Legionella for the first time. A blast search identified genes homologous to the L. fallonii cellulose operon (except bcsE and bcsF) also in the draft genome sequences of L. anisa and L. dumoffii (Figure 4A). This suggests that several Legionella species have the capacity to synthesize cellulose.
L. fallonii possesses genes encoding hopanoid biosynthesis and antibiotic resistance
L. fallonii encodes genes for hopanoid biosynthesis currently not found in any other Legionella species. About 10% of all sequenced bacteria contain genes for hopanoid synthesis, in particular cyanobacteria, acetobacter, streptomycetes, methylotrophs and purple non-sulfur bacteria. Hopanoids have been proposed to enhance membrane stability and to decrease membrane permeability , similar to sterols in eukaryotic cell membranes . In Burkholderia cenocepacia these genes are involved in sensitivity to low pH, detergent and antibiotics, and are related to motility . In Streptomyces coelicolor, this cluster has been well studied. Although not all genes of the S. coelicolor cluster are conserved in L. fallonii (Additional file 3), to date all bacteria carrying the gene for squelene-hopene-cyclase produce hopanoids . As L. fallonii also carries this gene, we expect this species is able to synthesize hopanoids, although their function in this species remains unknown.
Another peculiarity of L. fallonii is that it contains several antibiotic resistance genes not previously described in Legionella, including one encoding a chloramphenicol acetyltransferase (lfa0269) that is predicted to catalyze the acetyl-CoA-dependent acetylation of chloramphenicol. Furthermore, we identified a gene likely involved in erythromycin resistance, ereA (lfa1884) that is present also in L. drancourtii and L. dumoffii. This gene is located in gene clusters related to DNA mobility, such as integrases or prophage-related genes, and are rich in ELPs and repeats. These features indicate that these regions are putative genomic islands (Additional file 4).
L. hackeliae and L. fallonii encode chitin deacetylase activity
L. hackeliae and L. fallonii each contain a different gene coding for a chitin deacetylase (lha3256/lfa0697), an enzyme involved in deacetylation of chitin. An in vitro test described by Vadake  suggests that L. fallonii does have chitin deacetylase activity whereas it was not possible to demonstrate this clearly for L. hackeliae (Additional file 5). Chitin, a homopolymer of N-acetyl-glucosamine, is one of the most abundant polymers in the Earth’s biomass, especially in marine environments. Interestingly it is also a component of the cyst wall of Entamoeba invadens, and enzymes responsible for chitin synthesis have been found in Entamoeba genomes . The presence of chitin or chitin synthases has not been described in other protozoan genomes, but very few genomes of this group have been sequenced yet. Thus, chitin may be a common component of protozoa that are able to encyst. Although the other Legionella genomes analyzed here do not encode chitin deacetylase activity, all Legionella genomes encode chitinases. Chitinases are chitin-degrading enzymes leading to low molecular weight chito-oligomers whereas chitin decetylase degrades chitin to chitosan. Both products are of interest for industry and there is growing interest in organisms that produce chitosan. Legionella may be a new possible source of chitosan production.
L. micdadei contains the first putative complete prophage identified in a Legionella genome
The analysis of the unique genes from L. micdadei identified a specific region encoding 73 proteins, at least 16 of which are phage-associated proteins that represent a putative complete prophage (Additional file 6). This region contains genes encoding the phage capsid tail and replication proteins. Complete prophages have never been described in Legionella despite the frequent presence of phage-related proteins scattered in their genomes. Most attempts to isolate prophages that exclusively infect Legionella have also failed, until recently when two groups isolated Legionella bacteriophages , from environmental water samples and organs of guinea pigs. Thus, Legionella do have phages, but they seem to be rare.
L. fallonii and L. micdadei contain additional flagella operons
The comparison of the L. pneumophila and L. longbeachae genomes revealed that L. longbeachae does not contain genes allowing flagella biosynthesis . As recognition of flagellin by Naip5 initiates host immune responses that control L. pneumophila infection in certain eukaryotic cells ,, the presence or absence of flagella is important for intracellular replication of Legionella. L. hackeliae, L. fallonii and L. micdadei also contain three flagella operons homologous to those described in L. pneumophila (Figure S5A-C in Additional file 7). Interestingly L. fallonii and L. micdadei encode a fourth region not previously described in any sequenced Legionella species that might also code for flagella (Figure 5).
A highly dynamic mobilome characterizes the Legionellagenomes
Genomic elements like plasmids, genomic islands or transposons constitute the mobilome of a genome. All Legionella species analyzed contain many of these mobile elements. For example, L. hackeliae possesses a plasmid of 129.88 kb whereas L. fallonii (LLAP10) contains two plasmids of 238.76 kb and 14.57 kb, respectively (Table 1). Furthermore, the plasmid present in L. hackeliae is identical to the L. pneumophila strain Paris plasmid (100% nucleotide identity over the entire length except for two transposases in the strain Paris plasmid; Additional file 8). This suggests that this plasmid has recently moved horizontally between both species, which is a new example of the high rate of gene transfer among Legionella genomes ,.
In addition to the plasmids identified and their evident exchange among strains and species, a hallmark of the Legionella mobilome is the presence of many different type IVA secretion system-encoding regions in the plasmids as well as in genomic island-like regions on the chromosome. Interestingly, these regions often encode tra-like genes with considerable homology among the different strains. However, each new strain analyzed contained new regions, underlining the high diversity of these systems in the Legionella genomes. Predominant are F-type and P-type IVA systems that code for conjugative pili that allow mating. F-type IVA secretion systems are present on all L. pneumophila plasmids, the L. hackeliae plasmid, the 238 kb L. fallonii plasmid (two systems) and on the chromosomes of L. pneumophila strain Philadelphia, L. longbeachae and L. fallonii (Additional file 9). Each encodes a homologue of the global regulator CsrA, named LvrC, which when present in the chromosome also encodes the lvrRAB gene cluster. This was recently described as being involved in the regulation of excision of the ICE Trb1 of L. pneumophila strain Corby . Thus, conjugative exchange of DNA has an important role in Legionella and is one key factor enabling Legionella to rapidly adapt to changing conditions.
The mobility and horizontal transfer of these different regions are further shown when studying the distribution of these systems. For example, the lvh cluster, a type IVA system involved in virulence under conditions mimicking the spread of Legionnaires’ disease from environmental niches , is also present in L. micdadei, in one of the two completely sequenced L. longbeachae strains and in five of the completely sequenced L. pneumophila strains (Table 2). In addition, the so-called GI-T4SS recently described in strain L. pneumophila 130b , and first recognized in Haemophilus influenzae as a T4SS involved in propagation of genomic islands , is believed to play an important role in the evolution and adaptation of Legionella . GI-T4SS clusters were found to be conserved in L. pneumophila, with two clusters each in strains Corby, Paris, 130b and HL06041035, and one in each of Alcoy, Philadelphia, Lens and Lorraine , as well as in strains of L. longbeachae, L. hackeliae, L. micdadei and L. fallonii (Table 2). Thus, a heterogeneous distribution among species and strains testifies to the continuous exchange of these elements among Legionella, contributing to the plasticity and dynamic nature of their genomes.
L. micdadeistrains from different geographical regions are highly similar except for their mobilome
To investigate the genomic diversity of the species L. micdadei, we determined the draft genome sequence of a clinical isolate obtained from the Microbiological Diagnostic Unit Public Health Laboratory (MDU), Australia and compared it with the completely sequenced strain L. micdadei ATCC 33218. The genome size and GC content of the two L. micdadei strains were highly similar (Figure 6). The main differences between the two L. micdadei strains were mobile genetic elements. Furthermore, the number of SNPs (1,985 SNPs) was very low, similar to serogroup 1 strains of L. longbeachae (1,611 SNPs) . This is strikingly different to L. pneumophila where two different strains may contain more than 30 000 SNPs. This suggests that L. micdadei and L. longbeachae evolved more recently compared to the L. pneumophila. Three large regions of the L. micdadei ATCC 33218 genome are absent from the Australian isolate (Figure 6). One is a genomic island encoding a GI-T4SS (36 kb), one is the predicted prophage we identified in this study, and another is a smaller cluster of approximately 9 kb that is flanked by three tRNA genes and that contains phage-related genes and a gene associated with abortive infection system (Figure 6). Similarly, in the Australian isolate a cluster absent from the completely sequenced L. micdadei strain corresponds to a P-type IVA secretion system. Interestingly, the Lvh region, encoding a T4ASS that is highly conserved among all strains and species analyzed to date, is divergent in the two L. micdadei strains with a high number of SNPs (Additional file 10). Thus, the main genetic differences between these two closely related L. micdadei strains are mobile genetic elements, further underlining the large extent of horizontal gene transfer that is present in the genus Legionella.
The core set of Dot/Icm effectors is small with only 24 conserved substrates
L. pneumophila encodes over 300 proteins that are translocated into the host cell by the Dot/Icm T4SS (Additional file 11). Their conservation is high among different L. pneumophila strains, as 77% of these substrates are present in all L. pneumophila strains sequenced to date. Interestingly, when the Dot/Icm substrates of L. pneumophila and L. longbeachae are compared, only 35% (101) are present in both species . Interestingly, the L. longbeachae and L. pneumophila genomes contain the highest number of common substrates, although L. fallonii is phylogenetically closer to L. pneumophila than to L. longbeachae (Figure 3). When investigating the presence of these substrates in five Legionella species by adding the L. hackeliae, L. micdadei and L. fallonii genomes, this revealed that their conservation is very low (Figure 3). With 33 conserved substrates, the lowest number is shared between L. micdadei and L. pneumophila. This result suggests that the shared substrates might relate to similar environmental niches or virulence properties (L. pneumophila and L. longbeachae) than to a closer phylogenetic relationship.
The Dot/Icm substrates conserved in all Legionella species are probably those indispensable for intracellular replication and are important players in host-pathogen interactions. Most surprisingly, only 24 of the 300 described substrates of L. pneumophila are present in all five Legionella species and most of these are of yet unknown function (Table 3). However, a third of the conserved substrates contain eukaryotic motifs like ankyrin or Sel-1 domains or TPR repeats. Others were previously defined as ELPs, such as the sphingomyelinase-like phosphodiesterase. Among the substrates that have been investigated further are VipF, which causes growth defects in S. cerevisae, and several of the ankyrin repeat motif proteins. VipF inhibits lysosomal protein trafficking  and AnkH was shown to play a role in intracellular replication of L. pneumophila in macrophages and protozoa and in intrapulmonary proliferation in mice . The function of MavBFNQ and RavC is not known, but they have been recovered in screens for vacuolar localization and have been shown to co-localize with SidC at the L. pneumophila vacuole .
SdhA, a L. pneumophila effector that is necessary for full virulence of this species, is a particular case. It is present in all Legionella analyzed but the similarity with L. longbeachae is small and thus below the cutoff established for our orthologous search (at least 65% of the length of the compared protein). However, given that homologues with a significant similarity are present in all species in synteny (except for L. hackeliae), and coiled-coil motifs are detected in all, SdhA was also defined as a core effector. Moreover, SdhA has been shown to be necessary for infection of mice and in Galleria mellonella ,. Surprisingly, the effector SidJ is not part of the core set of Legionella substrates, although its deletion led to a strong replication defect in eukaryotic cells. However, SidJ is present in L. pneumophila and L. longbeachae, the major human pathogens.
Interestingly, the growth defect of strains lacking SdhA and SidJ seems more important in mice and human macrophages than in amoeba. Replication of the sdhA mutant is severely impaired in mouse bone marrow-derived macrophages but less in the amoeba Dictyostelium discoideum . Similarly, a ΔsidJ strain shows significant growth defects in both macrophages and amoebae, but replication in macrophages is affected from the start of the infection, whereas the growth defect in amoebae is evident only after 72 h of infection and was less pronounced . These data may suggest that effectors important in human infection are not necessarily essential in the protozoan hosts and thus certain effectors might be important for human infection even though no growth defect in protozoan infection is detectable.
Eukaryotic-like proteins are a specific feature of the genus Legionella
One feature shared by many of the substrates of the Dot/Icm secretion system is the presence of eukaryotic motifs (EMs). Indeed, of 55 proteins of L. pneumophila Philadelphia encoding EMs, 45 (82%) are confirmed substrates of the Dot/Icm secretion system (Additional file 12). Thus, we searched for proteins containing EMs in all sequenced genomes. In the five Legionella species we identified 218 proteins with eukaryotic domains (Additional file 13). The genomes of L. longbeachae and L. fallonii contain nearly twice as many proteins with EMs as the other genomes, probably due to their larger genome size. The ankyrin motif is the most frequent one, followed by long coiled-coil domains. Some EMs that were described remain specific for L. longbeachae, such as the PPR repeats, PAM2 domain or the phosphatidylinositol-4-phosphate 5-kinase, indicating that they are probably related to its particular habitat in soil . In contrast, proteins with tubulin-tyrosine ligase domains (LLo2200), probably involved in the posttranslational modification of tubulin , are absent only from L. pneumophila. With the aim to analyze whether additional eukaryotic motifs not yet identified are present in the Legionella genomes, we developed a strategy allowing for a comprehensive scan of all genomes. First we searched the Interpro database for all motifs, which occur in at least 85% of proteins from eukaryotic genomes and only 15% or less in proteins from prokaryotic genomes. Using this criterion, we obtained 8,329 motifs that were considered as eukaryotic (see Materials and methods). All predicted Legionella proteins were scanned for these motifs. This approach allowed us to identify 10 EMs not described before in Legionella, including thaumatin, RhoGTPase and DM9 domains (Table 4). Interestingly, thaumatin-like proteins accumulate in plants in response to infection by pathogens and possess antifungal activity , and a Drosophila DM9-containing protein is strongly up-regulated after infection of Drosphila larvae by Pseudomonas species . Many of these new EMs are only present in the newly sequenced genomes, such as synaptobrevin, an intrinsic membrane protein of small synaptic vesicles  or the clathrin/coatomer adaptine-like domain that is associated with transport between the endoplasmic reticulum and Golgi . Given their function in eukaryotic organisms, these protein domains might indeed be important in host-pathogen interactions.
Many eukaryotic proteins are indeed transferred horizontally from eukaryotes
Not all proteins we defined as ELPs possess EMs, but certain ones are also considered eukaryotic-like as they show a high homology to eukaryotic proteins over their whole length. One of the best known examples of this type of ELP is the sphingosine-1-phosphate lyase (encoded by the gene lpp2128), an enzyme that in eukaryotes catalyzes the irreversible cleavage of sphingosine-1-phosphate, and that has most likely been transferred horizontally from eukaryotes ,,. With the aim to detect proteins with higher similarity to eukaryotic proteins than to prokaryotic ones and for which we can suggest a eukaryotic origin through phylogenetic analysis, we have developed a pipeline that automatically extracts those proteins from the Legionella pan-genome with high similarity to eukaryotic proteins (for details see Materials and methods). Using this pipeline we identified 465 proteins as putative ELPs. For each of these proteins we constructed a phylogenetic tree that was curated and analyzed manually. However, for many of the ELPs a phylogenetic reconstruction did not allow clear demonstration of eukaryotic origin. Some aligned too poorly with their eukaryotic homologues or on just a small domain. This might be due to the fact that genomes of ciliated protozoa and amoeba, the known hosts of Legionella from which these ELPs are most likely acquired, are underrepresented in current databases. However, for 40 of the 465 proteins that are suggested to be of eukaryotic origin, the phylogenetic reconstruction clearly showed that they had been acquired by Legionella through horizontal gene transfer from eukaryotes (Table 5; Figure S9A-C in Additional file 14).
Among these proteins 27 had not been described before and 15 were identified in the newly sequenced species. A clear case of horizontal gene transfer from eukaryotes is GamA (Lpp0489), a glucoamylase that allows Legionella to degrade glycogen during intracellular replication in A. castellanii . In addition to already characterized proteins, we identified promising candidates for host-pathogen interactions in this study - for example, a L. longbeachae protein containing a tubulin-tyrosine ligase domain (Llo2200; Figure S9A in Additional file 14), a motif involved in addition of a carboxy-terminal tyrosine to α-tubulin as part of a tyrosination-detyrosination cycle that is present in most eukaryotic cells. This tyrosination process regulates the recruitment of microtubule-interacting proteins . It is thus tempting to assume that Legionella is able to interfere with or to modulate the recruitment of microtubule-interacting proteins in the host. Another example is the serine carboxypeptidase S28 family protein (Llo0042/Lfa0022; Figure 7). These proteins have been identified exclusively in eukaryotes and are active at low pH, suggesting a function in the phagosome .
Taken together, each Legionella genome contains many different ELPs and proteins carrying eukaryotic domains that help Legionella to establish its intracellular niche. Some of these proteins are specific to one or other Legionella species but most are present in all of them, although these proteins are rarely real orthologues. This suggests that the acquisition of these proteins is important for Legionella to manipulate the host but that their horizontal acquisition has taken place on multiple occasions.
Linking virulence properties and gene content
When using THP-1 cells as a model for infection of human macrophages, not all Legionella species were able to infect and replicate (Figure 1A). These results correlated with the epidemiology of legionellosis where only certain Legionella species are isolated from human disease. With the aim of identifying the genetic bases conferring these differences, we searched for genes that were present in the strains that cause disease but absent in the ones that had not been isolated from humans. This comparative analysis showed that L. pneumophila, L. longbeachae and L. micdadei share 40 genes that are not present in any of the other species. Among those we identified the hyp operon (hypABFCDE - lpg2171-75), necessary for hydrogenase activity in E. coli and the cyanobacterium Synechocystis . Legionella has additional downstream genes encoding for hydrogenases that are unique to these three species. This region is flanked by tRNA genes in L. micdadei and L. longbeachae, suggesting its acquisition by horizontal gene transfer.
Furthermore, a gene encoding a truncated hemoglobin (lpp2601) of group I called trHbN was identified as specific to the human pathogenic strains. Truncated hemoglobins are a family of small oxygen-binding heme proteins  that are ubiquitous in plants and present in many pathogenic bacteria such as Mycobacterium tuberculosis. Mycbacteria missing trHbNs are severely impaired for nitric oxide detoxification , and the expression of this gene is required for M. tuberculosis during macrophage infection . The proteins of M. tuberculosis and L. pneumophila share 30% identity and the important TrHbN residues are conserved in both, indicating a similar biochemical function. Furthermore, the M. tuberculosis trHbN shows 40% identity to its eukaryotic homologue in Tetrahymena thermophila and the Legionella protein 44% to the T. thermophila and 46% to the Paramecium tetraurelia protein. However, according to an in-depth phylogenetic analyses of truncated hemoglobins in prokaryotic and eukaryotic organisms, it seems that trHbNs are of prokaryotic origin and might have been transferred to eukaryotes . Interestingly, the Lvh system is not part of the genes unique to L. pneumophila, L. longbeachae and L. micdadei as not all L. pneumophila strains contain it, but it is uniquely present only in these three species. Finally, of the more than 300 proteins described as translocated by the Dot/Icm secretion system, only two, CegC4 (lpp2150/lpg2200) and Lem25 (lpp2487/lpg2422), are exclusive to the three species found in human disease, but their function is not known yet.
Comparing L. pneumophila and L. longbeachae, the two species responsible for over 95% of human infections, to all other Legionella species, showed that 124 genes are specific to these human pathogenic Legionella. Among them are 38 substrates of the Dot/Icm secretion system, including RalF (lpp1932/lpg1950), SidJ (lpp2094/lpg2155), SidI (lpp2572/lpg2504), SdeC (lpp2092/lpg2153), SidE (lpp2572/lpg2504), SdcA (lpp2578/lpg2510) and CegC7 (lpp0286/lpg0227). In addition to the secreted substrates, iron availability seems to be important for the human pathogens as among the specific proteins several are related to iron scavenging or iron storage. These are homologues of PvcA and PvcB (lpp0236-lpp0237), the siderophore pyoverdine that is involved in virulence and biofilm formation in the cystic fibrosis pathogen Pseudomonas aeuroginosa . In Legionella these genes are highly expressed in sessile cells, suggesting their involvement in sessile growth . Furthermore, a bacterioferritin (lpp2460) that is present also in L. micdadei but highly divergent is specific for the human pathogenic Legionella. Bacterioferritin plays a role in iron storage and is involved in protecting cellular components from oxidative damage, thereby playing a role in oxidative stress relief ,. Furthermore, a gene coding for a homologue of the Yersinia pestis plasminogen activator (lpp2452) that was shown to create transient plasmin activity  and the phospholipase C (lpp1411) implicated in host killing in a G. mellonella model  are specific to L. pneumophila and L. longbeachae.
The first comprehensive analyses of five species of the genus Legionella and the comparison of the genomes of human disease-related strains with non-disease-related strains have provided new insights into the genomic specificities related to adaptation and host-pathogen interactions of this fascinating intracellular bacterium and have identified specific features of the major human pathogenic Legionella. Highly dynamic genomes that evolve through frequent horizontal gene transfer, mediated by many and diverse T4SSs and acquisition of different eukaryotic proteins and protein domains at multiple times and stages of their evolution that allow host subversion are a hallmark of this amoeba-associated bacterial genus. The major human-related Legionella species, L. pneumophila and L. longbeachae, contain a set of genes that seems to increase their successful infection of mammalian cells. The key to their success may be a better capacity to subvert host functions to establish a protective niche for intracellular replication due to a specific set of secreted effectors and a higher ability to acquire iron and to resist oxidative damage. The analysis of additional Legionella genomes and other intracellular pathogens may allow the future definition of the major common strategies used by intracellular pathogens to cause disease and to understand how environmental pathogens may evolve to become human pathogens.
Materials and methods
Bacterial strains and sequence accession numbers
The strains sequenced in this study were L. hackeliae strain ATCC35250 (EMBL accession number chromosome: PRJEB7321), L. micdadei ATCC 33218 (EMBL accession number chromosome: PRJEB7312) and L. fallonii strain LLAP-10 (ATCC700992; EMBL accession number chromosome: PRJEB7322) . We obtained also the draft genome sequence of L. micdadei strain 02/42 (SRA accession number SRP047311), a clinical isolate from the Victorian Infectious Disease Research Laboratory (VIDRL). In addition, the genomes of Legionella species/strains that had been completely sequenced and published previously were included in the comparative analysis: L. pneumophila (strains Paris, Lens, Philadelphia, Corby, Lorraine and HL 0604 1035, Alcoy) ,,, and L. longbeachae strain NSW150 .
Sequencing and assembly
Strain L. micdadei 02/42 was sequenced using the Roche 454 GS-FLX platform, with Titanium chemistry and paired-end reads with an average insert size of 8.9 kb. The resultant reads, with an average length of 215 bp, were assembled using Newbler 2.5.3 (Roche/454) into three scaffolds with a total genome size of 3,266,670 bp (largest scaffold 3,261,115 bp) and an average read coverage of 26. L. micdadeii ATCC33218, L. hackeliae and L. fallonii sequences were determined using a Sanger/Illumina hybrid approach. For the Sanger approach sequencing reactions were performed using the ABI PRISM BigDye Terminator cycle sequencing ready reaction kit and a 3700 or a 3730 Xl Genetic Analyzer (Applied Biosystems, Saint Aubin, Ille de France, France). For L. micdadei ATCC33218, L. hackeliae and L. fallonii, 33,042, 33,042, and 36,240 sequences, respectively, from two libraries were determined. Assembly of the Sanger reads was done with the STADEN package in an iterative manner. We attempted to close remaining gaps with PCR products spanning repeats and regions recalcitrant to sequencing by testing several primer combinations for each gap. The final assemblies consisted of 36,084 reads and PCR products for L. micdadei ATCC33218, 33,085 for L. hackeliae, and 36,242 for L. fallonii. To finish the genome assembly each genome was in addition sequenced to a 60× coverage using an Illumina 2000 HiSeq sequencer and 36 bp reads. The Illumina reads and the programme Icorn  were used to correct the assembly and finish the genome.
Annotation and genome comparison
The newly sequenced genomes of L. fallonii, L. hackeliae and L. micdadei were integrated into the MicroScope platform  to perform automatic and expert annotation of the genes, and comparative analysis with the already sequenced and integrated L. pneumophila strains. MicrosScope annotation is based on a number of integrated bioinformatic tools: Blast on UniProt and specialized genomic data, InterPro, COG, PRIAM, synteny group computation using the complete bacterial genomes available at NCBI RefSeq, and so on (for more details see ). Orthologous groups were established using the program PanOCT  with the following parameters: e-value 1e-5, percent identity ≥30, and length of match ≥65. The programs Easyfig and BRIG , were used for graphical representation of genome regions compared using BLAST. MAUVE  was used for aligning and comparing the L. micdadei genomes.
A. castellanii and THPinfection assays
In brief, cultures of A. castellanii were grown in PYG712 medium (2% proteose peptone, 0.1% yeast extract, 0.1 M glucose, 4 MM MgSO4, 0.4 M CaCl2, 0.1% sodium citrate dihydrate, 0.05 MM Fe(NH4)2(SO4)2 × 6H2O, 2.5 MM NaH2PO3, 2.5 MM K2HPO3) at 20°C for 3 days. Then amoeba were washed in infection buffer (PYG 712 medium without proteose peptone, glucose, and yeast extract) and adjusted to 105 to 106 cells/ml. Stationary phase Legionella grown on BCYE (Buffer Charcoal Yeast Extract) agar and diluted in water were mixed with A. castellanii at a multiplicity of infection MOI of 0.1. After allowing invasion for 1 h at 20°C the A. castellanii layer was washed twice with infection buffer (start point of time-course experiment). Intracellular multiplication was monitored using a 300 μl sample, which was centrifuged (14,000 rpm) and vortexed to break up amoeba. The number of colony forming units (CFU) of Legionella was determined by plating on BCYE agar. The infections were carried out in duplicates.
The human monocytic cell line THP-1 was maintained in RPMI 1640 medium GlutaMAX medium (Gibco, Invitrogen, Saint Aubin, Ille de France, France), supplemented with 10% fetal bovine serum (BIOWEST, France Nuaille, Maine et Loire , France), in 5% CO2 at 37°C. For THP-1 infection, cells were seeded into 24-well tissue culture trays (Falcon, BD lab ware, Altrincham, Manchester, United Kingdom, England) at a density of 1.5 × 105 cells/well and pretreated with 10-8 M phorbol 12-myristate 13-acetate (PMA) for 72 h in 5% CO2 at 37°C to induce differentiation into macrophage-like adherent cells. Stationary phase Legionella were resuspended in RPMI 1640 serum free medium and added to THP-1 cell monolayers at an MOI of 10. After 1 h of incubation cells were treated with 100 μg Ml-1 gentamycin for 1 h to kill extracellular bacteria. Infected cells were then washed with phosphate-buffered saline (PBS) before incubation with serum-free medium. At 24, 48 and 72 h THP-1 cells were lysed with 0.1% TritonX-100. The amount of Legionella was monitored by counting the number of CFU determined by plating on BCYE agar. The infections were carried out in triplicate.
Cyclase translocation assay
The vector containing RalF-CyaA  was transformed into L. micdadei, L. hackeliae and L. fallonii and strain Paris wild type and its isogenic ΔdotA::Km mutant were used as positive and negative controls. Transformant strains were used to infect THP-1 cells previously plated at 1 × 105 cells/well in 24-well tissue culture dishes and pre-treated with 10-8 M PMA. After 1 h and 30 Minutes following infection cells were washed three times with cold PBS and lysed in 50 MM HCl, 0.1% Triton X-100. Lysates were boiled 5 Minutes and neutralized with 0.5 M NaOH. We then added 95% cold ethanol and samples were spun for 5 Minutes at maximum speed in a microcentrifuge. Supernatants were transferred in new 1.5 Ml tubes and vacuum dried, and cAMP concentrations were measured using the cAMP Biotrak Enzyme immunoassay System (Amersham, United Kingdom, England). Each value was calculated as means of two independent infections ± standard deviations.
Amoebae plate test
Samples of suspended amoeba were applied to BCYE agar plates as described previously . Stationary-phase bacterial cultures (OD600 > 4.5) were adjusted to an identical OD600 (2.5), series of 10-fold dilutions in sterile H2O were prepared and 3 μl of each dilution were spotted onto CYE plates both with amoeba and without amoeba (control plates) and incubated for 3 to 5 days at 30°C or 37°C.
Detection of new eukaryotic motifs in Legionellaproteins
To better define the term 'eukaryotic motifs' we searched for the already known EMs in all proteins present in the Pfam database and calculated their occurrence in eukaryotic proteins or prokaryotic proteins. The previously described EMs in Legionella showed an occurrence of about 99% in eukaryotic proteins and only 1% in prokaryotic ones, with the ankyrin repeats being the less restricted to eukaryotic proteins (85%). The only exception is Sel-1 domains, which were considered as EMs. Sel-1 domains have now been shown to be highly present also in prokaryotes. However, since this domain is present in many substrates of the Dot/Icm system and it was shown to be implicated in host-pathogen interactions , it was taken into account. Based on the frequencies of the typical EMs present in Legionella we searched the Interpro database for all motifs that occur in eukaryotes at least to 85%. Using this criterion we obtained 8,329 motifs that can be considered as eukaryotic. These motifs were searched in all proteins predicted in the different Legionella genomes. This approach identified 10 eukaryotic motifs previously not described in Legionella proteins.
Detection of genes transferred from eukaryotes to Legionella
To detect genes with putative eukaryotic origin we developed a pipeline based on several step filters. This pipeline was applied to one protein of each of the orthologous groups of the pan-proteome of the five studied species to avoid redundancy in the detection process with proteins of the same orthologous group. The first step consisted of discarding the protein families without significant similarity to eukaryotic sequences. This was achieved by a homology search using Blastp with an e-value cutoff of ≤10e-4 and a BLOSUM62 matrix with a representative protein of each group of orthologous families of the Legionella pan-genome against a database containing 83 genomes representative of all major eukaryotic phyla and certain viruses. In particular, members of Amoebozoa and other protist lineages that may be hosts for Legionella were included in this database. The results of the first filter led to the recovery of 2,669 proteins of the Legionella pan-genome with significant homology to eukaryotic sequences in the database. Then, among these 2,669 protein families those that have closer homologues in bacteria were discarded by searching for homologues against a database containing both eukaryotic and prokaryotic sequences using the same criteria. Only those that had at least a hit against a eukaryotic sequence among the first 25 hits were further selected. This step led to the selection of 465 protein families of the Legionella pan-genome representing ELP candidates. Finally, we carried out automatic phylogenetic reconstruction of these 465 proteins and their bacterial and eukaryotic homologues. The different steps of the pipeline were: (1) for each selected putative ELP the corresponding orthologues in other Legionella species analyzed where added if present; (2) each group of homologous sequences was aligned with MUSCLE ; (3) unambiguously aligned positions were automatically selected using the multiple alignment trimming program BMGE with low stringency parameters ; (4) preliminary maximum likelihood trees were obtained using FastTree . We applied a strict filter to select only very likely ELPs. Then each of the 465 trees was manually inspected to select those where the Legionella sequences were branching within eukaryotes or were closer to eukaryotic sequences than to prokaryotic ones. This allowed identification of 40 Legionella proteins that aligned well with their eukaryotic homologues. For those having a sufficient number of eukaryotic homologues and a sufficient number of positions that could be selected after trimming, we proceeded to phylogenetic analysis by maximum likelihood using LG +4 gamma as the evolutionary model. Then, we selected those trees where the Legionella sequences were branching within eukaryotes or were closer to eukaryotic sequences than to prokaryotes. Finally, in order to verify the eventual existence of closer bacterial homologues or additional eukaryotic homologues from representatives not present in our local database, we performed a Blast on the non-redundant database at the NCBI. Alignments were obtained and trimmed, and trees reconstructed as described above.
For phylogenetic reconstruction two different data sets were created: one based on the concatenated alignment of 29 housekeeping genes (lpp0086 (uvrB), lpp0152 (pgk), lpp0419 (rpoA), lpp0467 (ffh), lpp0575 (serS), lpp0749 (pros), lpp0791 (glyA), lpp1020 (lig), lpp1271 (cysS), lpp1399 (trpS), lpp1434 (aspD), lpp1534 (ruvB), lpp1738 (nrdA), lpp1765 (recA), lpp1830 (tig), lpp1837 (lepA), lpp2004 (metK), lpp2006 (dnaJ), lpp2013 (argS), lpp2020 (eno), lpp2662 (ftsZ), lpp2698 (uvrC), lpp2802 (dnaX), lpp2877 (recN), lpp2941 (metG), lpp3002 (rho), lpp3053 (atpD), lpp3055 (atpA), lpp3073 (thdF)) and another one based on all ortholgous genes among the studied species and C. burnetii as outgroup (816 genes). With these data sets the alignment of amino acids and the alignment of nucleotides based on the amino acid alignment were carried out. Individual genes/proteins were aligned with muscle and concatenated. The nucleotide alignments were cleaned using Gblocks . Trees were constructed using both a distance method (neighbor-joining) implemented in the program MEGA  and a likelihood method using the software RaxML . Bootstrap support was determined using 1,000 bootstrap replicates.
Test for chitinase degradating activity
According to Vadake , Whatman filter paper strips were cut to 5°Cm × 1°Cm. These strips were immersed and air-dried in a solution of p-nitroacetanilide (5 g in 100 Ml of ethanol 100%). The procedure was repeated three times to impregnate well the strips with p-nitroacetanilide. L. fallonii and L. pneumophila (used as negative control) were grown in liquid medium for 24 h and 2 Ml of these cultures were transferred to a new sterile tube containing 2 Ml of fresh liquid media and the diagnostic strips. These cultures were grown for 2 days at 30°C for L. fallonii and 37°C for L. pneumophila. After 2 days the development of yellow color on the strip indicated the presence of deacetylase in the corresponding bacterial culture.
Cellulose detection assays
To visualize the production of cellulose, plates containing Legionella BCYE medium supplemented with calcofluor (5%; fluorescent brightener 28; Sigma-Aldrich, Oakville, Ontario, Canada) were prepared. Drops of 5 μl of liquid media containing L. fallonii grown for 72 h were spread on the plates and incubated at 30°C for 48 h. The same procedure was carried out for L. pneumophila at 37°C as negative control. After incubation plates were visualized under a UV light source.
LGV, CB, MS and KH designed the study. SJ, NKP and EH supplied material and expertise; GG and RJM performed genome sequencing; LGV and CR performed the genome annotation and analysis work; MR and JD undertook experiments; MN and SG performed phylogenetic analyses; CM set up the LegioScope database. LGV and CB drafted and wrote the manuscript. All authors contributed to and approved the final manuscript.
Buffer Charcoal Yeast Extract
colony forming units
multiplicity of infection
phorbol 12-myristate 13-acetate
type IV secretion system
Newton HJ, Ang DK, van Driel IR, Hartland EL: Molecular pathogenesis of infections caused by Legionella pneumophila . Clin Microbiol Rev. 2010, 23: 274-298. 10.1128/CMR.00052-09.
Yu VL, Plouffe JF, Pastoris MC, Stout JE, Schousboe M, Widmer A, Summersgill J, File T, Heath CM, Paterson DL, Chereshsky A: Distribution of Legionella species and serogroups isolated by culture in patients with sporadic community-acquired legionellosis: an international collaborative survey. J Infect Dis. 2002, 186: 127-128. 10.1086/341087.
Doleans A, Aurell H, Reyrolle M, Lina G, Freney J, Vandenesch F, Etienne J, Jarraud S: Clinical and environmental distributions of Legionella strains in France are different. J Clin Microbiol. 2004, 42: 458-460. 10.1128/JCM.42.1.458-460.2004.
Svarrer CW, Uldum SA: The occurrence of Legionella species other than Legionella pneumophila in clinical and environmental samples in Denmark identified by mip gene sequencing and matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Clin Microbiol Infect. 2012, 18: 1004-1009. 10.1111/j.1469-0691.2011.03698.x.
Alli OA, Zink S, von Lackum NK, Abu-Kwaik Y: Comparative assessment of virulence traits in Legionella spp. Microbiology. 2003, 149: 631-641. 10.1099/mic.0.25980-0.
Hägele S, Kohler R, Merkert H, Schleicher M, Hacker J, Steinert M: Dictyostelium discoideum: a new host model system for intracellular pathogens of the genus Legionella . Cell Microbiol. 2000, 2: 135-171. 10.1046/j.1462-5822.2000.00044.x.
Cazalet C, Rusniok C, Bruggemann H, Zidane N, Magnier A, Ma L, Tichit M, Jarraud S, Bouchier C, Vandenesch F, Kunst F, Etienne J, Glaser P, Buchrieser C: Evidence in the Legionella pneumophila genome for exploitation of host cell functions and high genome plasticity. Nat Genet. 2004, 36: 1165-1173. 10.1038/ng1447.
Chien M, Morozova I, Shi S, Sheng H, Chen J, Gomez SM, Asamani G, Hill K, Nuara J, Feder M, Rineer J, Greenberg JJ, Steshenko V, Park SH, Zhao B, Teplitskaya E, Edwards JR, Pampou S, Georghiou A, Chou IC, Iannuccilli W, Ulz ME, Kim DH, Geringer-Sameth A, Goldsberry C, Morozov P, Fischer SG, Segal G, Qu X, Rzhetsky A, et al: The genomic sequence of the accidental pathogen Legionella pneumophila . Science. 2004, 305: 1966-1968. 10.1126/science.1099776.
Cazalet C, Gomez-Valero L, Rusniok C, Lomma M, Dervins-Ravault D, Newton HJ, Sansom FM, Jarraud S, Zidane N, Ma L, Bouchier C, Etienne J, Hartland EL, Buchrieser C: Analysis of the Legionella longbeachae genome and transcriptome uncovers unique strategies to cause Legionnaires’ disease. PLoS Genet. 2010, 6: e1000851-10.1371/journal.pgen.1000851.
Kozak NA, Buss M, Lucas CE, Frace M, Govil D, Travis T, Olsen-Rasmussen M, Benson RF, Fields BS: Virulence factors encoded by Legionella longbeachae identified on the basis of the genome sequence analysis of clinical isolate D-4968. J Bacteriol. 2010, 192: 1030-1044. 10.1128/JB.01272-09.
D'Auria G, Jimenez-Hernandez N, Peris-Bondia F, Moya A, Latorre A: Legionella pneumophila pangenome reveals strain-specific virulence factors. BMC Genomics. 2010, 11: 181-10.1186/1471-2164-11-181.
Gomez-Valero L, Rusniok C, Jarraud S, Vacherie B, Rouy Z, Barbe V, Medigue C, Etienne J, Buchrieser C: Extensive recombination events and horizontal gene transfer shaped the Legionella pneumophila genomes. BMC Genomics. 2011, 12: 536-10.1186/1471-2164-12-536.
Schroeder GN, Petty NK, Mousnier A, Harding CR, Vogrin AJ, Wee B, Fry NK, Harrison TG, Newton HJ, Thomson NR, Beatson SA, Dougan G, Hartland EL, Frankel G: Legionella pneumophila strain 130b possesses a unique combination of type IV secretion systems and novel Dot/Icm secretion system effector proteins. J Bacteriol. 2010, 192: 6001-6016. 10.1128/JB.00778-10.
Steinert M, Heuner K, Buchrieser C, Albert-Weissenberger C, Glockner G: Legionella pathogenicity: genome structure, regulatory networks and the host cell response. Int J Med Microbiol. 2007, 297: 577-587. 10.1016/j.ijmm.2007.03.009.
Brzuszkiewicz E, Schulz T, Rydzewski K, Daniel R, Gillmaier N, Dittmann C, Holland G, Schunder E, Lautner M, Eisenreich W, L°Ck C, Heuner K: Legionella oakridgensis ATCC 33761 genome sequence and phenotypic characterization reveals its replication capacity in amoebae. Int J Med Microbiol. 2013, 303: 514-528. 10.1016/j.ijmm.2013.07.003.
Wilkinson HW, Thacker WL, Steigerwalt AG, Brenner DJ, Ampel NM, Wing EJ: Second serogroup of Legionella hackeliae isolated from a patient with pneumonia. J Clin Microbiol. 1985, 22: 488-489.
Birtles RJ, Rowbotham TJ, Raoult D, Harrison TG: Phylogenetic diversity of intra-amoebal Legionellae as revealed by 16S rRNA gene sequence comparison. Microbiology. 1996, 142: 3525-3530. 10.1099/13500872-142-12-3525.
Neumeister B, Reiff G, Faigle M, Dietz K, Northoff H, Lang F: Influence of Acanthamoeba castellanii on intracellular growth of different Legionella species in human monocytes. Appl Environ Microbiol. 2000, 66: 914-919. 10.1128/AEM.66.3.914-919.2000.
Albers U, Reus K, Shuman HA, Hilbi H: The amoebae plate test implicates a paralogue of lpxB in the interaction of Legionella pneumophila with Acanthamoeba castellanii . Microbiology. 2005, 151: 167-182. 10.1099/mic.0.27563-0.
MicroScope Microbial Genome Annotation & AnalysisPlatform containing the sequenced and annotated Legionella genomes , [https://www.genoscope.cns.fr/agc/microscope/about/collabprojects.php?P_id=27]
LegioList Integrated Genome database, containing the annotated genome sequences , [http://genolist.pasteur.fr/LegioList/]
Coscolla M, Comas I, Gonzalez-Candelas F: Quantifying nonvertical inheritance in the evolution of Legionella pneumophila . Mol Biol Evol. 2011, 28: 985-1001. 10.1093/molbev/msq278.
Grattard F, Ginevra C, Riffard S, Ros A, Jarraud S, Etienne J, Pozzetto B: Analysis of the genetic diversity of Legionella by sequencing the 23S-5S ribosomal intergenic spacer region: from phylogeny to direct identification of isolates at the species level from clinical specimens. Microbes Infect. 2006, 8: 73-83. 10.1016/j.micinf.2005.05.022.
Rubin CJ, Thollesson M, Kirsebom LA, Herrmann B: Phylogenetic relationships and species differentiation of 39 Legionella species by sequence determination of the RNase P RNA gene rnpB . Int J Syst Evol Microbiol. 2005, 55: 2039-2049. 10.1099/ijs.0.63656-0.
Adeleke A, Pruckler J, Benson R, Rowbotham T, Halablab M, Fields B: Legionella-like amebal pathogens-phylogenetic status and possible role in respiratory disease. Emerg Infect Dis. 1996, 2: 225-230. 10.3201/eid0203.960311.
Adeleke AA, Fields BS, Benson RF, Daneshvar MI, Pruckler JM, Ratcliff RM, Harrison TG, Weyant RS, Birtles RJ, Raoult D, Halablab MA: Legionella drozanskii sp. nov., Legionella rowbothamii sp. nov. and Legionella fallonii sp. nov.: three unusual new Legionella species. Int J Syst Evol Microbiol. 2001, 51: 1151-1160. 10.1099/00207713-51-3-1151.
Roy CR, Isberg RR: Topology of Legionella pneumophila DotA: an inner membrane protein required for replication in macrophages. Infect Immun. 1997, 65: 571-578.
Berger KH, Merriam JJ, Isberg RR: Altered intracellular targeting properties associated with mutations in the Legionella pneumophila dotA gene. Mol Microbiol. 1994, 14: 809-822. 10.1111/j.1365-2958.1994.tb01317.x.
Bardill JP, Miller JL, Vogel JP: IcmS-dependent translocation of SdeA into macrophages by the Legionella pneumophila type IV secretion system. Mol Microbiol. 2005, 56: 90-103. 10.1111/j.1365-2958.2005.04539.x.
Nagai H, Kagan JC, Zhu X, Kahn RA, Roy CR: A bacterial guanine nucleotide exchange factor activates ARF on Legionella phagosomes. Science. 2002, 295: 679-682. 10.1126/science.1067025.
Feldman M, Segal G: A specific genomic location within the icm/dot pathogenesis region of different Legionella species encodes functionally similar but nonhomologous virulence proteins. Infect Immun. 2004, 72: 4503-4511. 10.1128/IAI.72.8.4503-4511.2004.
Feldman M, Zusman T, Hagag S, Segal G: Coevolution between nonhomologous but functionally similar proteins and their conserved partners in the Legionella pathogenesis system. Proc Natl Acad Sci U S A. 2005, 102: 12206-12211. 10.1073/pnas.0501850102.
Luo ZQ, Isberg RR: Multiple substrates of the Legionella pneumophila Dot/Icm system identified by interbacterial protein transfer. Proc Natl Acad Sci U S A. 2004, 101: 841-846. 10.1073/pnas.0304916101.
Zusman T, Yerushalmi G, Segal G: Functional similarities between the icm/dot pathogenesis systems of Coxiella burnetii and Legionella pneumophila . Infect Immun. 2003, 71: 3714-3723. 10.1128/IAI.71.7.3714-3723.2003.
Solano C, Garcia B, Valle J, Berasain C, Ghigo JM, Gamazo C, Lasa I: Genetic analysis of Salmonella enteritidis biofilm formation: critical role of cellulose. Mol Microbiol. 2002, 43: 793-808. 10.1046/j.1365-2958.2002.02802.x.
Zogaj X, Nimtz M, Rohde M, Bokranz W, Romling U: The multicellular morphotypes of Salmonella typhimurium and Escherichia coli produce cellulose as the second component of the extracellular matrix. Mol Microbiol. 2001, 39: 1452-1463. 10.1046/j.1365-2958.2001.02337.x.
Welander PV, Hunter RC, Zhang L, Sessions AL, Summons RE, Newman DK: Hopanoids play a role in membrane integrity and pH homeostasis in Rhodopseudomonas palustris TIE-1. J Bacteriol. 2009, 191: 6145-6156. 10.1128/JB.00460-09.
Volkman JK: Sterols in microorganisms. Appl Microbiol Biotechnol. 2003, 60: 495-506. 10.1007/s00253-002-1172-8.
Schmerk CL, Bernards MA, Valvano MA: Hopanoid production is required for low-pH tolerance, antimicrobial resistance, and motility in Burkholderia cenocepacia . J Bacteriol. 2011, 193: 6712-6723. 10.1128/JB.05979-11.
Vadake RS: Biotransformation of Chitin to Chitosan. United States Patent 5739015. 1998. United States Patent.
Campos-Gongora E, Ebert F, Willhoeft U, Said-Fernandez S, Tannich E: Characterization of chitin synthases from Entamoeba . Protist. 2004, 155: 323-330. 10.1078/1434461041844204.
Grigor’ev AA, Bondarev VP, Borisevich IV, Darmov IV, Mironin AV, Zolotarev AG, Pogorel’skii IP, Ianov DS: Temperate Legionella bacteriophage: discovery and characteristics. Zh Mikrobiol Epidemiol Immunobiol. 2008, 4: 86-88.
Lammertyn E, Vande Voorde J, Meyen E, Maes L, Mast J, Anné J: Evidence for the presence of Legionella bacteriophages in environmental water samples. Microb Ecol. 2008, 56: 191-197. 10.1007/s00248-007-9325-z.
Molofsky AB, Byrne BG, Whitfield NN, Madigan CA, Fuse ET, Tateda K, Swanson MS: Cytosolic recognition of flagellin by mouse macrophages restricts Legionella pneumophila infection. J Exp Med. 2006, 17: 1093-1104. 10.1084/jem.20051659.
Ren T, Zamboni DS, Roy CR, Dietrich WF, Vance RE: Flagellin-deficient Legionella mutants evade caspase-1- and Naip5-mediated macrophage immunity. PLoS Pathog. 2006, 2: e18-10.1371/journal.ppat.0020018.
Gomez Valero L, Runsiok C, Cazalet C, Buchrieser C: Comparative and functional genomics of Legionella identified eukaryotic like proteins as key players in host-pathogen interactions. Front Microbiol. 2011, 2: 208-10.3389/fmicb.2011.00208.
Gomez-Valero L, Buchrieser C: Genome dynamics in Legionella: the basis of versatility and adaptation to intracellular replication. Cold Spring Harb Perspect Med. 2013, 3: a009993-10.1101/cshperspect.a009993.
Lautner M, Schunder E, Herrmann V, Heuner K: Regulation, integrase-dependent excision, and horizontal transfer of genomic islands in Legionella pneumophila . J Bacteriol. 2013, 195: 1583-1597. 10.1128/JB.01739-12.
Bandyopadhyay P, Liu S, Gabbai CB, Venitelli Z, Steinman HM: Environmental mimics and the Lvh type IVA secretion system contribute to virulence-related phenotypes of Legionella pneumophila . Infect Immun. 2007, 75: 723-735. 10.1128/IAI.00956-06.
Juhas M, Crook DW, Dimopoulou ID, Lunter G, Harding RM, Ferguson DJ, Hood DW: Novel type IV secretion system involved in propagation of genomic islands. J Bacteriol. 2007, 189: 761-771. 10.1128/JB.01327-06.
Wee BA, Woolfit M, Beatson SA, Petty NK: A distinct and divergent lineage of genomic island-associated type IV Secretion Systems in Legionella . PLoS One. 2013, 8: e82221-10.1371/journal.pone.0082221.
Shohdy N, Efe JA, Emr SD, Shuman HA: Pathogen effector protein screening in yeast identifies Legionella factors that interfere with membrane trafficking. Proc Natl Acad Sci U S A. 2005, 102: 4866-4871. 10.1073/pnas.0501315102.
Habyarimana F, Al-Khodor S, Kalia A, Graham JE, Price CT, Garcia MT, Kwaik YA: Role for the Ankyrin eukaryotic-like genes of Legionella pneumophila in parasitism of protozoan hosts and human macrophages. Environ Microbiol. 2008, 10: 1460-1474. 10.1111/j.1462-2920.2007.01560.x.
Huang L, Boyd D, Amyot WM, Hempstead AD, Luo ZQ, O'Connor TJ, Chen C, Machner M, Montminy T, Isberg RR: The E Block motif is associated with Legionella pneumophila translocated substrates. Cell Microbiol. 2011, 13: 227-245. 10.1111/j.1462-5822.2010.01531.x.
Harding CR, Schroeder GN, Reynolds S, Kosta A, Collins JW, Mousnier A, Frankel G: Legionella pneumophila pathogenesis in the Galleria mellonella infection model. Infect Immun. 2012, 80: 2780-2790. 10.1128/IAI.00510-12.
Laguna RK, Creasey EA, Li Z, Valtz N, Isberg RR: A Legionella pneumophila-translocated substrate that is required for growth within macrophages and protection from host cell deat. Proc Natl Acad Sci U S A. 2006, 103: 18745-18750. 10.1073/pnas.0609012103.
Liu Y, Luo ZQ: The Legionella pneumophila effector SidJ is required for efficient recruitment of endoplasmic reticulum proteins to the bacterial phagosome. Infect Immun. 2007, 75: 592-603. 10.1128/IAI.01278-06.
Eiserich JP, Estévez AG, Bamberg TV, Ye YZ, Chumley PH, Beckman JS, Freeman BA: Microtubule dysfunction by posttranslational nitrotyrosination of alpha-tubulin: a nitric oxide-dependent mechanism of cellular injury. Proc Natl Acad Sci U S A. 1999, 96: 6365-6370. 10.1073/pnas.96.11.6365.
Acharya K, Pal AK, Gulati A, Kumar S, Singh AK, Ahuja PS: Overexpression of Camellia sinensis thaumatin-like protein, CsTLP in potato confers enhanced resistance to Macrophomina phaseolina and Phytophthora infestans infection. Mol Biotechnol. 2013, 54: 609-622. 10.1007/s12033-012-9603-y.
Monteiro S, Barakat M, Picarra-Pereira MA, Teixeira AR, Ferreira RB: Osmotin and thaumatin from grape: a putative general defense mechanism against pathogenic fungi. Phytopathology. 2003, 93: 1505-1512. 10.1094/PHYTO.2003.93.12.1505.
Vodovar N, Vinals M, Liehl P, Basset A, Degrouard J, Spellman P, Boccard F, Lemaitre B: Drosophila host defense after oral infection by an entomopathogenic Pseudomonas species . Proc Natl Acad Sci U S A. 2005, 102: 11414-11419. 10.1073/pnas.0502240102.
Sudhof TC, Baumert M, Perin MS, Jahn R: A synaptic vesicle membrane protein is conserved from mammals to Drosophila . Neuron. 1989, 2: 1475-1481. 10.1016/0896-6273(89)90193-1.
McMahon HT, Mills IG: COP and clathrin-coated vesicle budding: different pathways, common approaches. Curr Opin Cell Biol. 2004, 16: 379-391. 10.1016/j.ceb.2004.06.009.
Degtyar E, Zusman T, Ehrlich M, Segal G: A Legionella effector acquired from protozoa is involved in sphingolipids metabolism and is targeted to the host cell mitochondria. Cell Microbiol. 2009, 11: 1219-1235. 10.1111/j.1462-5822.2009.01328.x.
Lurie-Weinberger MN, Gomez-Valero L, Merault N, Glockner G, Buchrieser C, Gophna U: The origins of eukaryotic-like proteins in Legionella pneumophila . Int J Med Microbiol. 2010, 300: 470-481. 10.1016/j.ijmm.2010.04.016.
Herrmann V, Eidner A, Rydzewski K, Bladel I, Jules M, Buchrieser C, Eisenreich W, Heuner K: GamA is a eukaryotic-like glucoamylase responsible for glycogen- and starch-degrading activity of Legionella pneumophila . Int J Med Microbiol. 2011, 301: 133-139. 10.1016/j.ijmm.2010.08.016.
Szyk A, Piszczek G, Roll-Mecak A: Tubulin tyrosine ligase and stathmin compete for tubulin binding in vitro. J Mol Biol. 2013, 425: 2412-2414. 10.1016/j.jmb.2013.04.017.
Tripathi LP, Sowdhamini R: Cross genome comparisons of serine proteases in Arabidopsis and rice. BMC Genomics. 2006, 7: 200-10.1186/1471-2164-7-200.
Jacobi A, Rossmann R, Bock A: The hyp operon gene products are required for the maturation of catalytically active hydrogenase isoenzymes in Escherichia coli . Arch Microbiol. 1992, 158: 444-451. 10.1007/BF00276307.
Wittenberg JB, Bolognesi M, Wittenberg BA, Guertin M: Truncated hemoglobins: a new family of hemoglobins widely distributed in bacteria, unicellular eukaryotes, and plants. J Biol Chem. 2002, 277: 871-874. 10.1074/jbc.R100058200.
Ouellet H, Ouellet Y, Richard C, Labarre M, Wittenberg B, Wittenberg J, Guertin M: Truncated hemoglobin HbN protects Mycobacterium bovis from nitric oxide. Proc Natl Acad Sci U S A. 2002, 99: 5902-5907. 10.1073/pnas.092017799.
Pawaria S, Lama A, Raje M, Dikshit KL: Responses of Mycobacterium tuberculosis hemoglobin promoters to in vitro and in vivo growth conditions. Appl Environ Microbiol. 2008, 74: 3512-3522. 10.1128/AEM.02663-07.
Vuletich DA, Lecomte JT: A phylogenetic and structural analysis of truncated hemoglobins. J Mol Evol. 2006, 62: 196-210. 10.1007/s00239-005-0077-4.
Wiens JR, Vasil AI, Schurr MJ, Vasil ML: Iron-regulated expression of alginate production, mucoid phenotype, and biofilm formation by Pseudomonas aeruginosa . MBio. 2014, 5: e01010-e01013. 10.1128/mBio.01010-13.
Hindre T, Bruggemann H, Buchrieser C, Hechard Y: Transcriptional profiling of Legionella pneumophila biofilm cells and the influence of iron on biofilm formation. Microbiology. 2008, 154: 30-41. 10.1099/mic.0.2007/008698-0.
Bou-Abdallah F, Lewin AC, Le Brun NE, Moore GR, Chasteen ND: Iron detoxification properties of Escherichia coli bacterioferritin. Attenuation of oxyradical chemistry. J Biol Chem. 2002, 277: 37064-37069. 10.1074/jbc.M205712200.
Carrondo MA: Ferritins, iron uptake and storage from the bacterioferritin viewpoint. EMBO J. 2003, 22: 1959-1968. 10.1093/emboj/cdg215.
Jarvinen HM, Laakkonen L, Haiko J, Johansson T, Juuti K, Suomalainen M, Buchrieser C, Kalkkinen N, Korhonen TK: Human single-chain urokinase is activated by the omptins PgtE of Salmonella enterica and Pla of Yersinia pestis despite mutations of active site residues. Mol Microbiol. 2013, 89: 507-517. 10.1111/mmi.12293.
Aurass P, Schlegel M, Metwally O, Harding CR, Schroeder GN, Frankel G, Flieger A: The Legionella pneumophila Dot/Icm-secreted effector PlcC/CegC1 together with PlcA and PlcB promotes virulence and belongs to a novel zinc metallophospholipase C family present in bacteria and fungi. J Biol Chem. 2013, 288: 11080-11092. 10.1074/jbc.M112.426049.
Otto TD, Sanders M, Berriman M, Newbold C: Iterative Correction of Reference Nucleotides (iCORN) using second generation sequencing technology. Bioinformatics. 2010, 26: 1704-1707. 10.1093/bioinformatics/btq269.
Vallenet D, Belda E, Calteau A, Cruveiller S, Engelen S, Lajus A, Le Fevre F, Longin C, Mornico D, Roche D, Rouy Z, Salvignol G, Scarpelli C, Thil Smith AA, Weiman M, Médigue C: MicroScope-an integrated microbial resource for the curation and comparative analysis of genomic and metabolic data. Nucleic Acids Res. 2013, 41: D636-D647. 10.1093/nar/gks1194.
MicroScope Microbial Genome Annotation and Analysis Platform. , [https://www.genoscope.cns.fr/agc/microscope/home/index.php]
Fouts DE, Brinkac L, Beck E, Inman J, Sutton G: PanOCT: automated clustering of orthologs using conserved gene neighborhood for pan-genomic analysis of bacterial strains and closely related species. Nucleic Acids Res. 2012, 40: e172-10.1093/nar/gks757.
Alikhan NF, Petty NK, Ben Zakour NL, Beatson SA: BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons. BMC Genomics. 2011, 12: 402-10.1186/1471-2164-12-402.
Sullivan MJ, Petty NK, Beatson SA: Easyfig: a genome comparison visualizer. Bioinformatics. 2011, 27: 1009-1010. 10.1093/bioinformatics/btr039.
Darling AC, Mau B, Blattner FR, Perna NT: Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004, 14: 1394-1403. 10.1101/gr.2289704.
Newton HJ, Sansom FM, Dao J, McAlister AD, Sloan J, Cianciotto NP, Hartland EL: Sel1 repeat protein LpnE is a Legionella pneumophila virulence determinant that influences vacuolar trafficking. Infect Immun. 2007, 75: 5575-5585. 10.1128/IAI.00443-07.
Edgar RC: MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics. 2004, 5: 113-10.1186/1471-2105-5-113.
Criscuolo A, Gribaldo S: BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evol Biol. 2010, 10: 210-10.1186/1471-2148-10-210.
Price MN, Dehal PS, Arkin AP: FastTree 2-approximately maximum-likelihood trees for large alignments. PLoS One. 2010, 5: e9490-10.1371/journal.pone.0009490.
Castresana J: Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000, 17: 540-552. 10.1093/oxfordjournals.molbev.a026334.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
Stamatakis A, Aberer AJ, Goll C, Smith SA, Berger SA, Izquierdo-Carrasco F: RAxML-Light: a tool for computing terabyte phylogenies. Bioinformatics. 2012, 28: 2064-2066. 10.1093/bioinformatics/bts309.
We would like to acknowledge P Glaser for critical reading of the manuscript and help from J Bernal with the calcoflour assay. The work in CB's laboratory received financial support from the Institut Pasteur, the Centre National de la Recherche (CNRS), the Fondation pour la Recherche Médicale (FRM) grant number DEQ20120323697, the French Region Ile de France (DIM Malinf) and from the ANR-10-PATH-004 project, in the frame of ERA-Net PathoGenoMics. The MicroScope platform received financial support from GIS IBiSA. MS acknowledges support from the Deutsche Forschungsgemeinschaft (DFG). ELH was supported by the Australian Research Council.
The authors declare that they have no competing interests.
Electronic supplementary material
Additional file 1: Figure S1.: The ability of L. micdadei to replicate intracellularly in A. castellanii is dependent on the temperature. L. pneumophila strain Paris wild type (wt) and ΔdotA were used as positive and negative controls, respectively. Intracellular replication of L. micdadei at 30°C was determined by recording the number of colony-forming units (CFU) through plating on BCYE agar. Blue, wild-type L. pneumophila strain Paris; red, ΔdotA; orange, L. micdadei. Results are expressed as Log10 ratio CFU Tn/T0 and each point represents the mean ± standard deviation of two independent experiments. The error bars represent standard deviation, but some were too small to clearly appear in the figure. (TIFF 4 MB)
Additional file 2: Figure S2.: The Dot/Icm-encoding genes are present in all Legionella species with variable homology. Graphical representation of Blastp comparisons of the genes encoding the Dot/Icm T4BSS in all strains/species used in this study. L. pneumophila strain Philadelphia genes were taken as query. Each color ring represents a species/strain, and each segment in the ring a different gene. The intensity of the color indicates the percentage of amino acid identity. The percentages of identity and their correlation with the color intensity is given in the left panel. Gene names are given in the outside circle. Gene names in red indicate 100% amino acid identity. (TIFF 8 MB)
Additional file 3: Figure S3.: Genomic organization of the L. fallonii hopanoid encoding gene cluster. Blastx comparisons between clusters encoding genes for hopanoid biosynthesis in the species Streptomyces coelicolor and L. fallonii (LLAP10). The gray color code represents the Blast matches; the darker the gray the better the blast match. Protein names and their predicted functions are indicated below. (TIFF 7 MB)
Additional file 4: Table S1.: Putative mobile regions present in L. fallonii (LLAP10) containing a chloramphenicol acetyltransferase gene and a erythromycin esterase gene. (DOCX 102 KB)
Additional file 5: Figure S4.: L. fallonii shows chitinase degradation activity. Whatman paper strips dipped in a p-nitroacetanilide solution were introduced in liquid cultures of the species L. hackeliae, L. fallonii and L. pneumophila (used as negative control). After 2 days of growth the development of yellow color indicates the presence of chitin deacetylase activity in the bacterial culture. (TIFF 2 MB)
Additional file 6: Table S2.: Putative prophage region in the genome of L. micdadei strain ATCC33218. (DOCX 128 KB)
Additional file 7: Figure S5.: Genomic organization and comparison of the three conserved flagella gene clusters among the five Legionella species. Blastx comparison of these clusters in L. pneumophila, L. longbeachae, L. hackeliae, L. fallonii (LLAP10) and L. micdadei. (A) Flagellar region 1. (B) Flagellar region 2. (C) Flagellar region 3. The gray color code represents the Blast matches; the darker the gray the better the blast match. (TIFF 15 MB)
Additional file 8: Figure S6.: The L. pneumophila strain Paris plasmid is quasi-identical to the L. hackeliae plasmid. Chromosomal organization and Blastn comparison shows 100% nucleotide identity except for the insertion of two transposase-encoding genes in strain Paris. The gray color code represents the Blast matches; the darker the gray the better the blast match. (TIFF 5 MB)
Additional file 9: Figure S7.: Comparison of F-type IV secretion systems present in the different Legionella species. Genomic organization and Blastx comparison between clusters encoding the F-type T4ASS, localized in the chromosome and on plasmids. Presence of the gene lvrC is indicated by a red dot. The gray color code represents the Blast matches; the darker the gray the better the blast match. Numbers indicate the chromosomal location. (TIFF 9 MB)
Additional file 10: Figure S8.: The Lvh-encoding region of the two sequenced L. micdadei strains is highly divergent. The Lvh T4ASS-encoding genes are highly conserved among all Legionella Blastx comparisons between the region encoding the Lvh system and the flanking genes in the two L. micdadei strains compared in this study. Blast matches are represented with gray lines that are darker as the blast match improves. (TIFF 4 MB)
Additional file 11: Table S3.: List of confirmed substrates of the Dot/Icm secretion system. (DOCX 136 KB)
Additional file 12: Table S4.: List of eukaryotic organisms and viruses whose genomes have been selected for the construction of the eukaryotic genome database for Blast searches. (DOCX 81 KB)
Additional file 13: Table S5: Orthologous genes encoding eukaryotic motifs in five Legionella species. (DOCX 94 KB)
Additional file 14: Figure S9.: Phylogenetic reconstruction for Legionella proteins that were probably transferred horizontally from eukaryotic organisms. Groups of homologous sequences were aligned with MUSCLE , unambiguously aligned positions were automatically selected using the multiple alignment trimming program BMGE  with low stringency parameters. After trimming, phylogenetic analysis was done using maximum likelihood. (ZIP 27 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
About this article
Cite this article
Gomez-Valero, L., Rusniok, C., Rolando, M. et al. Comparative analyses of Legionella species identifies genetic features of strains causing Legionnaires’ disease. Genome Biol 15, 505 (2014). https://doi.org/10.1186/s13059-014-0505-0
- Legionnaires’ disease
- Eukaryotic like proteins