Genomic and genetic analyses of diversity and plant interactions of Pseudomonas fluorescens
- Mark W Silby†1,
- Ana M Cerdeño-Tárraga†2,
- Georgios S Vernikos†2,
- Stephen R Giddens3,
- Robert W Jackson3, 4,
- Gail M Preston3,
- Xue-Xian Zhang5,
- Christina D Moon3, 12,
- Stefanie M Gehrig3,
- Scott AC Godfrey3, 13,
- Christopher G Knight3, 14,
- Jacob G Malone3, 15,
- Zena Robinson3,
- Andrew J Spiers3, 16,
- Simon Harris2,
- Gregory L Challis6,
- Alice M Yaxley7,
- David Harris2,
- Kathy Seeger2,
- Lee Murphy2,
- Simon Rutter2,
- Rob Squares2,
- Michael A Quail2,
- Elizabeth Saunders8,
- Konstantinos Mavromatis9,
- Thomas S Brettin8,
- Stephen D Bentley2,
- Joanne Hothersall10,
- Elton Stephens10,
- Christopher M Thomas10,
- Julian Parkhill2,
- Stuart B Levy1,
- Paul B Rainey5, 11 and
- Nicholas R Thomson2Email author
© Silby et al.; licensee BioMed Central Ltd. 2009
Received: 6 March 2009
Accepted: 11 May 2009
Published: 11 May 2009
Pseudomonas fluorescens are common soil bacteria that can improve plant health through nutrient cycling, pathogen antagonism and induction of plant defenses. The genome sequences of strains SBW25 and Pf0-1 were determined and compared to each other and with P. fluorescens Pf-5. A functional genomic in vivo expression technology (IVET) screen provided insight into genes used by P. fluorescens in its natural environment and an improved understanding of the ecological significance of diversity within this species.
Comparisons of three P. fluorescens genomes (SBW25, Pf0-1, Pf-5) revealed considerable divergence: 61% of genes are shared, the majority located near the replication origin. Phylogenetic and average amino acid identity analyses showed a low overall relationship. A functional screen of SBW25 defined 125 plant-induced genes including a range of functions specific to the plant environment. Orthologues of 83 of these exist in Pf0-1 and Pf-5, with 73 shared by both strains. The P. fluorescens genomes carry numerous complex repetitive DNA sequences, some resembling Miniature Inverted-repeat Transposable Elements (MITEs). In SBW25, repeat density and distribution revealed 'repeat deserts' lacking repeats, covering approximately 40% of the genome.
P. fluorescens genomes are highly diverse. Strain-specific regions around the replication terminus suggest genome compartmentalization. The genomic heterogeneity among the three strains is reminiscent of a species complex rather than a single species. That 42% of plant-inducible genes were not shared by all strains reinforces this conclusion and shows that ecological success requires specialized and core functions. The diversity also indicates the significant size of genetic information within the Pseudomonas pan genome.
Pseudomonas fluorescens is a physiologically diverse species of opportunistic bacteria (gamma-proteobacteria) found throughout terrestrial habitats. The species contributes greatly to the turnover of organic matter and, while present in soil, is abundant on the surfaces of plant roots and leaves. Of the plant-colonizing strains, some, such as isolates SBW25 and Pf-5, positively affect plant health and nutrition [1–3]. The mechanistic bases of these effects remain unclear, but are known to include the production of plant growth hormones, the suppression of pathogens (especially fungi and oomycetes) detrimental to plant health via competitive and/or allelopathic effects, and the direct elicitation of plant defense responses .
It has been argued that exploitation of these plant growth promoting bacteria in agriculture requires an improved understanding of the determinants of ecological performance, particularly persistence . To this end, in vivo expression technology (IVET) promoter trapping strategies were devised and implemented to identify plant-induced and soil-induced genes [5–9]. In these early studies a number of coding sequences (CDSs) of ecological relevance were found to be up-regulated, including a type III secretion system [10, 11], a cellulose biosynthetic locus  and a number of CDSs involved in metabolism and protective responses [12–17]. However, the ability to comprehensively identify ecologically important sequences was limited in these previous studies by the use of incomplete genome libraries and the lack of whole genome sequences.
The genome sequence of a single isolate of P. fluorescens, Pf-5, has been reported . Although a large number of genes involved in nutrient uptake/degradation and biocontrol were identified in Pf-5, the true diversity within this species was not revealed. To address this issue and to enhance our understanding of the functional ecology of P. fluorescens, we have determined the complete nucleotide sequences of two strains from different environmental origins.
SBW25 was isolated in 1989 from the leaf surface of a sugar beet plant grown at the University Farm, Wytham, Oxford, UK . In addition to its use in the study of microbe-plant-soil interactions, SBW25 has become an important model organism for studies on evolutionary processes (for example, [20, 21]). Pf0-1 was isolated in 1987 from loam soil in Sherborn, Massachusetts, USA .
Here we report the genome sequences of SBW25 and Pf0-1 and the results of a comparative analysis of P. fluorescens that includes isolate Pf-5. Our data reveal hitherto unrecognized diversity , with the three strains sharing only 61.4% of genes. We also identify highly abundant families of repetitive DNA sequences and describe more than 100 genes that show elevated levels of expression in the plant environment. These plant-induced genes provide a snapshot of how P. fluorescens perceives and responds to the plant environment and reveals conservation of strategies among strains for the enhancement of ecological performance.
Results and discussion
P. fluorescensSBW25 and Pf0-1 genome architecture
General characteristics of the genomes of P. fluorescens strains SBW25, Pf0-1 and Pf-5
P. fluorescens strain
Number of bases
Number of CDSs
rRNA genes (clusters)
Intergenic repeat families
6 (with a total of 1,199 repeats)
9 (with a total of 231 repeats)
5 (with a total of 748 repeats)
Intra- and inter-species variation among Pseudomonas genomes
AAIs of the orthologous CDSs of P. fluorescens and P. aeruginosa strains and P. syringae pathovars
pv. syringae/pv. phaseolicola
pv. syringae/pv. tomato
pv. phaseolicola/pv. tomato
Based on the genomic criteria provided by Goris et al.  for defining species, the three P. fluorescens strains could indeed be different species. In fact, our analysis is in agreement with previous studies that have shown - based on gyrB and rpoD nucleotide sequences - P. fluorescens to be a complex composed of two major lineages , with Pf-5 and Pf0-1 belonging in the P. chlororaphis and SBW25 in the P. fluorescens lineage. Similar conclusions have come from DNA-DNA hybridization and average nucleotide identity scores  and the genome signature (genome-specific relative frequencies of dinucleotides) . Given the small sample of genomes, it seems premature to redefine the species 'P. fluorescens' at this time. It should also be noted that our analysis shows the three P. fluorescens strains to group more closely to each other than to any other member of the Pseudomonas genus (Figure 3; Table 2).
Functional analysis of the SBW25 gene complement
Analysis of the conserved genes present in the three P. fluorescens strains provides results that are typical of other soil dwelling bacteria [36, 37]. For example, SBW25 and Pf0-1 carry an abundance of regulatory genes (>300 each), and genes encoding motility and chemotaxis-related functions (>100 each) as well as genes specifying membrane and transport functions (>1,000 each).
Also typical for pseudomonads, the genomes of SBW25, Pf0-1 and Pf-5 lack 6-phosphofructokinase, required for conversion of β-D-fructose 6-phosphate to β-D-fructose 1,6-bisphosphate (although the gene for 1-phosphofructokinase is present) and these strains are therefore unlikely to carry out glycolysis. Nonetheless, each genome possesses genes predicted to specify the enzymes phosphogluconate dehydratase and 2-keto-3-deoxygluconate 6-phophate aldolase, which are necessary for utilization of glucose via the phosphorylative Entner-Doudoroff pathway.
EIL were identified by screening a library consisting of 33,000 clones (62 independent ligation reactions) and analyzed in pools of 250 on Beta vulgaris (sugar beet) seedlings. Given a genome of 6.7 Mbp, a random library of 3 to 5 kb fragments, and assuming 3,000 promoters in the SBW25 genome, then the chance of a promoter not being included in this study is less than 0.01 (based on the Poisson distribution).
The plant-inducibility of the EIL-'dapB fusion strains recovered by IVET selection was verified for each of the 125 IVET fusion strains by their inability to grow on M9 (glucose) minimal medium in the absence of DAP (thus demonstrating that the fusions are transcriptionally silent in vitro). The ability of each fusion strain to colonize both the rhizosphere and the phyllosphere of non-sterile sugar beet seedlings was then re-checked (strains colonizing these environments contain fusions to genes that are transcriptionally activated in the plant environment) [6, 11]. SBW25ΔdapB and an IVET negative-control strain, PBR393 , were used as controls and no colony forming units of either strain were recovered from either the rhizosphere or phyllosphere. Every putative SBW25ΔdapB strain carrying an EIL-'dapB fusion grew in the rhizosphere (the size of the initial inoculum more than tripled in the rhizosphere over the course of 3 weeks); 90 of these IVET fusion strains were also able to grow in the phyllosphere (cells recovered from the phyllosphere underwent at least 3 doublings in 3 weeks). Growth of all EIL-fusion strains was significantly impaired in M9 (glucose) minimal medium. These tests verify that the EIL fusions are expressed by SBW25 on plant surfaces, and that the EIL promoters are dependent on the plant environment for expression. Further studies to determine the precise function of individual EIL in the plant environment are on going.
The 125 genes shown to be specifically up-regulated in planta represent all major classes of genes found in SBW25: Pf0-1 and Pf-5 each have orthologues of 83 of the 125 IVET-identified genes. Of these, 73 genes are common to all three P. fluorescens strains (Supplementary Table 1 in Additional data file 1). These data confirm the importance of previously recognized activities , and those predicted from genome sequence analysis, including nutrient acquisition and scavenging, cell envelope function, metabolism, stress response, and detoxification. Interestingly, when compared with the results of a previously conducted (small scale) study using the DAP-based IVET strategy , only 4 of the 25 EIL recovered in that study were identified here. These included the cellulose biosynthetic locus wss (recovered on six independent occasions in this study), fliF, glcA and fadE (Supplementary Table 1 in Additional data file 1). The reasons for the relatively low overlap between the two studies is unclear, but perhaps reflects subtle differences in conditions for plant propagation, although the differences are more likely to reflect the particularly stringent criteria applied in this study in order for a putative plant-induced locus to qualify as an EIL. Nonetheless, of importance is the fact that genes of the same functional classes were obtained in both screens.
No validated 'dapB IVET fusions were obtained for genes within the Rsp type III secretion system, which was previously identified by a different (pantothenate-based) IVET selection strategy . Its low level of expression in the rhizosphere  is likely to be insufficient to restore competitive growth in the DAP-based promoter trapping strategy used in this study.
Regulators form a large class of EIL: the 17 predicted regulatory components include a sigma factor, LysR-type regulators, two component sensing systems, a di-guanylate cyclase, and a phosphodiesterase. Also included in this collection is an operon defined by EIL037 (PFLU1114-1111) whose four CDSs show remarkable complexity: a compound GGDEF/EAL/CheY protein (PFLU1114), followed by a predicted cytochrome C551 peroxidase precursor (PFLU1113), followed by two compound histidine kinases (PFLU1112 and PFLU1111). That this and other regulatory loci are actively transcribed outside of the laboratory environment supports the generally held assumption that the abundance of regulatory genes in Pseudomonas is important for life in complex environments .
Another notable 'dapB IVET fusion is EIL082, which falls within a previously unrecognized non-ribosomal peptide synthetase (NRPS) biosynthetic gene cluster (PFLU3215 to PFLU3228) present in the non-core region of SBW25. The non-ribosomal peptide produced by this enzyme complex is specific to SBW25, because no orthologues of these NRPS genes exist in Pf0-1 or Pf-5. Interestingly, Pf0-1 also possesses a novel NRPS gene cluster (Pfl01_2265-2267) not present in SBW25 or Pf-5. There are three NRPS biosynthetic clusters in SBW25, four in Pf0-1 and three in Pf-5, including the pyoverdine biosynthesis cluster. The similarity shared amongst these clusters is limited to conservation of the functional domains, with no full length identities. There is little conservation of the order of the functional domains. The production of pyoverdine is one of the defining characteristics for P. fluorescens and yet the genome has shown great diversity both in the pyoverdine clusters and in the other non-ribosomal peptides that are made by P. fluorescens.
Genes with no significant matches to DNA or protein sequences in public databases comprise approximately 10% of the IVET fusions. On the basis of in silico analyses, the majority of these genes are predicted to encode membrane-associated proteins, suggesting their interaction with the external environment through uptake, export, or signaling.
A large class of EIL are fusions of non-predicted genes, oriented in the opposite direction to transcription of predicted CDSs (40 in total; see class XI, Supplementary Table 1 in Additional data file 1). 'Antisense' fusions of this type have been described previously [5, 40]. It is possible that some of these fusions highlight in silico gene prediction errors; however, careful examination of these fusions renders this unlikely. Moreover, in a previous study both the sense CDS and antisense IVET fusion were found to encode proteins; furthermore, the IVET-identified 'antisense' gene was shown to be important for efficient colonization of soil . We refrain from further speculation as to the significance of these 'antisense' fusions, but such a substantial number suggests there is much yet to learn about the potential role of these genes in the function of bacteria in their natural environments.
Despite evidence for a highly variable accessory region towards the terminus of replication, the distribution of EIL in SBW25 appears to have little or no bias toward any particular genomic location (Figure 5a, b). The 31 genes defined by EIL055 to EIL096 are within the variable region (see below) of the SBW25 genome while the remainder are within the core region. This even distribution indicates that many of the mechanisms favoring success in natural environments are conserved, while individual strains appear to possess accessory traits that are likely to confer niche-specificity.
Whole genome alignments of SBW25, Pf0-1, and Pf-5 showed evidence for extensive within-genome recombination. In many bacteria this is driven by recombination between repeat sequences. However, in none of the P. fluorescens genome rearrangements were the recombined sequences flanked by rRNAs, tRNAs or known insertion sequence elements. To identify repetitive DNA sequences that may explain this intragenomic recombination, an exhaustive search for such sequences in SBW25, Pf0-1, and Pf-5 was performed.
Analysis of SBW25 revealed the presence of 4,357 repeat sequences representing 11.91% of the genome. These repetitive sequences ranged in size from 24 to 357 bps and comprised 1,199 intergenic repeats, 922 inverted repeats (IR1_g), and 2,236 intragenic repeats. This type of repeat expansion has been seen in other systems, where it is associated with a relaxation of selection on the genome. This can be associated with a recent change in niche, and the resulting evolutionary bottleneck [42, 43], or with reduced selection because of small effective population size and absence of recombination . However, as discussed below, this is not thought to apply here. The intragenic repeat families represent coding sequences for conserved protein domains within over-represented protein families; 1,293 represented just 4 protein domain families (as defined by Pfam; see Materials and methods) - ABC transporter, AMP-binding enzyme, response regulator receiver domain and the GGDEF domain.
Characteristics of the intergenic repeat elements present in P. fluorescens SBW25, Pf0-1, and Pf-5
IRs: 24 bp
IRs: 30 bp
IRs: 43 bp
IRs: 34 bp
IRs: 11 bp (internal)
Structural organization of the P. fluorescensintergenic repeats
Detailed analysis of the repeat sequences revealed that five families possess a complex structure consisting of two identical inverted repeats (IRs) that flank a variable size core region (Table 3). The IRs generally show a higher average G+C content than the genome as a whole (64.7%; the genome average is 60.5%), while the G+C% content of the variable core region sequences is closer to the genome average. Structural predictions made with these repeat sequences show that they readily form hairpin secondary structures, with the IRs forming the stem and the variable core region forming the loop.
Three repeat families, R0, R2 and IR1_g, are of particular interest given their disproportionately high numbers in SBW25 relative to Pf0-1 and Pf-5 (Table 3). The IRs of R0 and R2 are identical to those found flanking two different insertion sequence elements unique to strain SBW25 at locations 50373465038275 (PFLU4572A) and 63871926388340 (PFLU5832), respectively. It is possible that the IRs of repeat families R0 and R2 are recognized by the two insertion sequence element-encoded transposases in trans, which might explain why the elements have become over-represented in the SBW25 genome. If this is true, then these repeats are likely to represent miniature inverted-repeat transposable elements (MITEs), only very few of which have been reported in bacteria .
In addition to the ability to form stem-loop structures, the IR1_g repeats also possess the consensus sequence for the repetitive extragenic palindromic repeats (REP) family, which were originally thought to be specific to P. putida KT2440. The functional significance of the Pseudomonas REPs awaits elucidation, but they may play a role in transcription termination or provide binding sites for the DNA gyrase .
Since many of these repeat families can form stem-loop structures, they have the potential to act as transcriptional terminators. We therefore examined the transcription orientation of the genes flanking repeat elements to look for bias. In describing this analysis we use 'Head' to refer to the 5' end of a CDS and 'Tail' to refer to the 3' end. Using this nomenclature there are four transcriptional orientation states (including CDSs on both the forward and reverse DNA strands) for the CDSs that lie on either side of a repeat element: Tail-repeat-Head (forward strand) (→ →), Tail-repeat-Tail (→ ←), Tail-repeat-Head (reverse strand) (← ←) and Head-repeat-Head (← →). We compared the frequency of each of the four states with all CDS pairs that lacked an intervening repeat element. The frequency of the four orientation states among CDS pairs that flank repeat elements was significantly different from that of CDS pairs that do not (SBW25, P < 0.0005; Pf0-1, P = 0.016; Pf-5, P < 0.0005). For those CDS pairs that do not flank repeat elements the Tail-Head (forward and reverse strand) orientation is predominant; for CDS pairs flanking repeats the most frequent orientation is the Tail-repeat-Tail (Supplementary Figure 2 in Additional data file 2). The Tail-repeat-Tail bias is prevalent for the largest three of the six intergenic repeat families present in SBW25 and for five of the nine repeat families in Pf0-1 (Supplementary Figure 3 in Additional data file 2). The selective pressure for the non-random distribution of repeats may derive from the predicted stem-loop (transcription terminator-like) structure; insertion of a repeat with a stem-loop structure between Tail-Head oriented CDSs within an operon would cause termination, thus disrupting these transcriptional units. The Tail-repeat-Tail biased distribution of these repeats probably reflects a 'least worst' location as insertion is less likely to cause aberrant transcription termination since termination of convergent transcription is likely to occur anyway. In addition, the Head-repeat-Head state, which could potentially disrupt promoters for one or both genes, occurs at a low frequency, particularly in SBW25 and Pf-5. These data would also suggest that the expansion of the intergenic repeats has been subject to selection. Consequently, it is unlikely that the repeat expansion seen in P. fluorescens results from the organism having been through an evolutionary bottleneck (this scenario is generally associated with random distribution of repetitive sequences)  and more likely that it is linked to a lack of selection against increased genome size.
P. fluorescensrepeat deserts
Evident from the genome analysis are large regions of the SBW25 genome that lack any complex repeat families (R-family repeats; Table 3). We refer to these as repeat deserts (ReDs; Figure 5a). The SBW25 genome harbors 60 ReDs, which range in size from an arbitrary lower limit of 15.8 kb up to 176 kb and encode a total of 2,475 CDSs (40% of the coding capacity), of which 93.7% are unique to SBW25 compared to Pf0-1 and Pf-5 (Supplementary Table 2 in Additional data file 3). Because of the density of repeats in SBW25, the identification of ReDs was straightforward. In contrast, the lower number of repeats in Pf0-1 and Pf-5 makes definition of similar regions more difficult.
Two, not mutually exclusive, explanations for the lack of repeats in these regions exist: first, the ReDs comprise mostly essential genes that normally experience high purifying selection [47, 48]; and second, the ReDs might have been recently acquired from a donor lacking repeat sequences. Indeed, examples of the former include the rRNA clusters, the ribosomal proteins cluster, the wss cluster (PFLU0300 to PFLU0309), which directs production of an acetylated cellulose-like polymer involved in formation of a microbial mat [49, 50], and cell division proteins (PFLU0940 to PFLU0953, amongst others).
Recently acquired ReDs that have different dinucleotide frequencies to the above group contain CDS clusters that might confer niche specificity. One such example is the anthranilate synthase cluster (PFLU1381 to PFLU1386), which is unique to SBW25. Other examples found within ReDs include 'atypical' regions of the SBW25 genome, which show limited phylogenetic distribution, aberrant G+C% content or dinucleotide frequency compared with the genome average for Pseudomonas species (Supplementary Table 3 in Additional data file 3). These may reflect sequences acquired through recent gene transfer events . While ReDs are not evident in Pf0-1, several such atypical regions have been identified (Supplementary Table 4 in Additional data file 3), and these are also free of repeats, as are all but one of the mobile genetic elements recently described in Pf-5 . For example, SBW25 and Pf0-1 each carry multiple prophage-like elements, and both genomes have one probable integrative conjugative element (ICE)-like genomic island, SBW_GI-1 and the related island Pf0-1_GI-1, which have similarity to the genomic island PFGI-2 in Pf-5 . SBW_GI-1 is located between partially duplicated tRNAval and is over 101 kb in length. Strengthening the possibility that this region is a hotspot for insertions, comparison of approximately 5 kb of unpublished sequences flanking the mupirocin biosynthetic cluster of P. fluorescens NCIMB10586 , which based on DNA sequence identity (generally 93% to 96%) and synteny is more closely related to SBW25 than Pf0-1 or Pf-5, indicates that the mup cluster is inserted adjacent to the same tRNAval tRNAasp tandem cluster as SBW_GI-1. Pf0-1_GI-1 defines a slightly smaller locus than SBW_GI-1 and lacks flanking insertion site duplications. These islands are related in structure to a family of ICEs, which include those found in other pseudomonads [54, 55] as well as wider members of the gamma-proteobacteria such as Yersinia (YAPI [56, 57]) and Salmonella (SPI-7 ). These elements are defined as having a conserved core carrying a type IV pilus operon and plasmid-related functions as well as a highly variable region, which carries genes involved in resistance and host adaptation. The reduction of the type IV pilus genes, and breakdown of the flanking regions in Pf0-1_GI-1, suggest these ICEs may be undergoing fixation in the genome, perhaps attributable to an important function of the cargo genes. The variable cargo regions of SBW_GI-1 and Pf0-1_GI-1 are summarized in Supplementary Tables 3 and 4 in Additional data file 3.
P. fluorescens is an opportunistic species long recognized for its genetic, physiological and functional diversity . The previously sequenced genome of isolate Pf-5 offered a glimpse of genome content and organization, but in the absence of comparative data sheds little insight into the extent of genomic diversity. The genome sequences of the two additional strains (SBW25 and Pf0-1) have provided the opportunity for comparative studies and show an unexpectedly high degree of among-genotype diversity. Typically, different isolates of the same species would be expected to show substantial overlap among core genes of the genome. For example, five sequenced genomes of P. aeruginosa share 80% to 90% of their gene content , whereas the three P. fluorescens genomes share just 61% of their genes, and have low average nucleotide identity  and AAI (this study), leading Goris et al. to suggest that these three isolates cannot be members of the same species. With further genome sequences, it will become possible to strengthen the species criteria using whole genome characteristics. The fact that these three strains group more closely to each other than to other members of the genus makes it tempting to describe P. fluorescens strains as members of a complex until more DNA sequence analyses provide a deeper understanding of the genetic structure of these populations.
The ecological significance of the genes specific to each strain also awaits further study, but the IVET-based analysis shows that at least some of the SBW25 genes are likely to be important in the plant environment. The fact that EIL fusions identify both core and accessory genes as ecologically relevant comes as little surprise given both the diverse range of core metabolic functions and the diversity of niches within which P. fluorescens exists. That a subset of the IVET-identified genes corresponds to orthologues in Pf0-1 and Pf-5 indicates conserved strategies for ecological success, and also the diversity of mechanisms employed.
The lack of synteny among the three strains marks a further defining feature of the species P. fluorescens. Previous studies of this species using restriction fragment length polymorphism showed a bewildering range of patterns - even amongst strains that were phenotypically indistinguishable . The presence of numerous repeat sequences, particularly the intergenic MITE-like elements, provides a probable explanation. While the evolutionary origin of these elements is unclear, one likely consequence of the presence of numerous repeated sequences (between genes) is elevated levels of intragenic recombination. Although recombination between repeat sequences is to be expected, it seems that P. fluorescens can tolerate significant rearrangements without sacrificing performance. One striking example in SBW25 comes from the arrangement of genes involved in pyoverdine biosynthesis. In SBW25 these genes are distributed across seven different regions of the genome ; in Pf-5 and Pf0-1 (with fewer MITE-like elements) these genes are distributed across three  and five regions, respectively; in P. aeruginosa PAO1 (and other sequenced isolates) these are in two clusters separated by 11.5 kb; in P. syringae they reside within a single cluster .
Whole genome sequencing - particularly when combined with functional studies such as IVET - provides unprecedented insight into the functional activity of microbes. Despite their environmental significance, common saprophytic bacteria, such as P. fluorescens, have been the subject of relatively few genome-based projects. The addition of SBW25 and Pf0-1 to the list of genome-sequenced saprophytes is an important advance. It reveals the gene content of soil/plant saprophytes and shows that our prior appreciation of the diversity of the Pseudomonas pan genome was restricted. Since many isolates pathogenic to humans, animals, and plants are thought to have their origins in non-pathogenic environmental isolates, understanding the genomes of these saprophytes has implications for our ability to predict, monitor and understand the evolution of these pathogenic strains.
Materials and methods
Bacterial strains and sequencing
P. fluorescens strain SBW25 is an environmental isolate taken from the leaf surfaces of a sugar beet plant. A single colony of SBW25 was grown on LB agar and then grown overnight in LB broth with shaking at 28°C. Cells were collected and total DNA was extracted with a Gentra Puregene extraction kit (Qiagen, West Sussex, UK) according to the manufacturer's instructions. The DNA was fragmented by sonication, and several libraries were generated in plasmid vectors using size fractions ranging from 2 to 9 kb. The whole genome was sequenced to a depth of 9× coverage from 2 to 3 kb, 3 to 4 kb and 6 to 9 kb in pOTW12 and pMAQ1Sac_BstXI libraries using dye terminator chemistry on ABI3730 automated sequencers. End sequences from larger insert bacterial artificial chromosome (pBACehr 5 to 15 kb insert size) libraries were used as a scaffold. The sequence was assembled, finished and annotated as described previously , using the program Artemis  to collate data and facilitate annotation.
P. fluorescens strain Pf0-1 was isolated from bulk loam soil. It was grown overnight in LB broth with shaking at 30°C. Total DNA was extracted using a Wizard Genomic DNA Purification Kit (Promega, Madison, WI, USA). The genome of Pf0-1 was sequenced at the Joint Genome Institute using a combination of 3.7, 9.4, and 37 kb DNA libraries. Draft assemblies were based on 114,960 total sequence reads. All three libraries provided 5× coverage of the genome. A total of 470 additional reactions, 3 shatter libraries from PCR products, and 20 transposon bombs (in vitro transposon mutagenesis (EZ::TN<kan2>Insertion Kit; Epicentre, Madison, WI, USA) of plasmids to generate new primer sites for DNA sequencing) were necessary to close gaps and to raise the quality of the finished sequence. All general aspects of library construction, sequencing and gene prediction performed at the Joint Genome Institute were as previously described .
The sequences of SBW25 and Pf0-1 can be accessed using the accession numbers [EMBL:AM181176] and [GenBank:CP000094], respectively.
The genome sequences of P. fluorescens strains SBW25, Pf0-1 and Pf-5 were compared pairwise using TBLASTX analyses loaded on the Artemis Comparison Tool .
Orthologous CDSs in the three genomes were defined after comparing all-against-all running a reciprocal FASTA search of translated DNA with a 30% identity over 80% of the length of the CDSs as minimum similarity score. The results were used to calculate the average amino acid identities.
Pseudogenes were defined as CDSs that had one or more mutations that would ablate expression and/or lack start and/or stop codon; each of these possible inactivating mutations was subsequently checked against the original sequencing data.
Circular diagrams were plotted using DNAplotter .
Identification and analysis of orthologues in Pseudomonasgenomes
Fourteen Pseudomonas species (P. fluorescens SBW25, Pf0-1, and Pf-5;P. aeruginosa PAO1, PA14 and PA7; P. syringae pv. syringae B728a, pv. phaseolicola 1448A and pv. tomato DC3000; P. putida strains KT2440, W619, F1, and GB1; and P. stutzeri A1501) were compared all-against-all using a reciprocal FASTA approach (30% identity over 80% of the length as minimum similarity), yielding a set of 1,705 core genes shared between all these genomes. In a second step, the amino acid sequences of these core gene products were aligned (gene-wise) using MUSCLE version 3.52  and poorly aligned regions were removed with Gblocks . Maximum likelihood analysis of each alignment was carried out in RAxML version 7.0.0  using the JTT+gamma model. A majority rule consensus of the 1,705 individual trees was built using the consense module of Phylip to assess the agreement between the individual trees.
Identification and analysis of repetitive sequences in P. fluorescens
In order to analyze the repeat elements and their distribution in the genome of SBW25, we firstly concatenated three P. fluorescens genomic sequences (SBW25, Pf0-1 and Pf-5). Running the Repeatscout  algorithm on the concatenated sequence yielded 122 repeat families, of which 103 include intragenic repeats, mostly Pfam domains, and 19 intergenic repeat families. For each of the 122 families we built a multiple sequence alignment using CLUSTAL  and manually curated the alignments using JalView . Using each of the multiple alignments obtained, we built a profile hidden Markov model (HMM) using the HMMER package version 1.8.4. The 122 HMMs were searched against the concatenated sequence (leading and lagging strand). HMMs can be trained on a dataset of sequences and can predict, in a probabilistic framework, more distant members of this sequence family. The results obtained were manually curated to infer the number of distinct repeat families. The consensus of the intergenic repeat families and their HMM logos are provided in Supplementary Figure 1 in Additional data file 2. The HMM logos where produced using the LogoMat-M application .
Intergenic repeat families were initially predicted using the default parameters of RepeatScout: minimum number of copies per repeat family, 20; minimum repeat length, 50 bp; low complexity repeats were filtered out prior to the repeat prediction. In a second step, the predicted repeats were manually curated and very similar repeat families were merged under the same family, where possible. A multiple sequence alignment for each repeat family was used to train HMMs specific for each family. Each query genome was searched against those HMMs, using the HMMER package. Once the repeat families were built, using the HMM-based approach, the structure of each family was determined with visual inspection of the multiple sequence alignment; in case of complex repeat structure, with IRs being part of a repeat family, new HMMs were built to model the IRs of each family (if applicable) and used to search the three query genomes.
A computer-based search through the SBW25 and Pf0-1 genomes using the Alien Hunter program  resulted in identification of several regions within these genomes that were termed 'atypical' due to differences in nucleotide features such as G+C% and dinucleotide frequency. A manual curation of the results is shown in Figure 5, and Supplementary Tables 3 and 4 in Additional data file 3.
In vivoexpression technology
Identification of EIL from SBW25 was based on the IVET strategy as described previously [5, 6]. Libraries were constructed in pIVETD by cloning partial Sau3AI digested genomic DNA. Libraries were maintained in Escherichia coli and moved into P. fluorescens SBW25ΔdapB by conjugation. Library screening took place on non-sterile sugar beet seedlings maintained in non-sterile vermiculite pots . Fusions were recovered after 3 weeks of selection (rather than the 2 weeks used previously ) by plating homogenized plant material on selective plates. Integrated genomic fusions from strains recovered from the plant environment were mobilized into E. coli by conjugative cloning . The identity of recovered fusions was determined by sequencing inserts from recovered plasmids (see [5, 6] for details).
Additional data files
The following additional data are available with the online version of this paper: Supplementary Table 1, listing environmentally induced loci in SBW25, and orthologues in Pf0-1 and Pf-5 (Additional data file 1); Supplementary Figures 1-3 (Additional data file 2); Supplementary Tables 2-4 (Additional data file 3).
amino acid identity
environmentally induced loci
hidden Markov model
integrative conjugative element
in vivo expression technology
miniature inverted repeat transposable element
non-ribosomal peptide synthetase
We thank the core sequencing and informatics teams at the Sanger Institute for their assistance, The Wellcome Trust for its support of the Sanger Institute Pathogen Genomics and Pathogen Informatics groups, and Andrew Berry and Alex Bateman for their role in the repeat analysis. The P. fluorescens SBW25 work was carried out at The Wellcome Trust Sanger Institute supported by a BBSRC grant (number 104/P16729) to PBR and CMT. Pf0-1 genome sequencing was performed under the auspices of the US Department of Energy's Office of Science, Biological and Environmental Research Program and by the University of California, Lawrence Livermore National Laboratory under contract number W-7405-Eng-48, Lawrence Berkeley National Laboratory under contract number DE-AC03-76SF00098 and Los Alamos National Laboratory under contract number W-7405-ENG-36. The P. fluorescens Pf0-1 work was supported by USDA grant 2006-35604-16673 to SBL. MWS was supported in part by a postdoctoral fellowship from the New Zealand Foundation for Research Science and Technology. JH and ES were supported by BBSRC grant P15257 for studies on mupirocin biosynthesis in P. fluorescens NCIMB10586.
- Naseby DC, Way JA, Bainton NJ, Lynch JM: Biocontrol of Pythium in the pea rhizosphere by antifungal metabolite producing and non-producing Pseudomonas strains. J Appl Microbiol. 2001, 90: 421-429. 10.1046/j.1365-2672.2001.01260.x.PubMedView ArticleGoogle Scholar
- Rodriguez F, Pfender WF: Antibiosis and antagonism of Sclerotinia homoeocarpa and Drechslera poae by Pseudomonas fluorescens Pf-5 in vitro and in planta. Phytopathology. 1997, 87: 614-621. 10.1094/PHYTO.1922.214.171.1244.PubMedView ArticleGoogle Scholar
- de Bruijn I, de Kock MJ, Yang M, de Waard P, van Beek TA, Raaijmakers JM: Genome-based discovery, structure prediction and functional analysis of cyclic lipopeptide antibiotics in Pseudomonas species. Mol Microbiol. 2007, 63: 417-428. 10.1111/j.1365-2958.2006.05525.x.PubMedView ArticleGoogle Scholar
- Haas D, Defago G: Biological control of soil-borne pathogens by fluorescent pseudomonads. Nat Rev Microbiol. 2005, 3: 307-319. 10.1038/nrmicro1129.PubMedView ArticleGoogle Scholar
- Rainey PB: Adaptation of Pseudomonas fluorescens to the plant rhizosphere. Environ Microbiol. 1999, 1: 243-257. 10.1046/j.1462-2920.1999.00040.x.PubMedView ArticleGoogle Scholar
- Gal M, Preston GM, Massey RC, Spiers AJ, Rainey PB: Genes encoding a cellulosic polymer contribute toward the ecological success of Pseudomonas fluorescens SBW25 on plant surfaces. Mol Ecol. 2003, 12: 3109-3121. 10.1046/j.1365-294X.2003.01953.x.PubMedView ArticleGoogle Scholar
- Silby MW, Levy SB: Use of in vivo expression technology to identify genes important in growth and survival of Pseudomonas fluorescens Pf0-1 in soil: discovery of expressed sequences with novel genetic organization. J Bacteriol. 2004, 186: 7411-7419. 10.1128/JB.186.21.7411-7419.2004.PubMedPubMed CentralView ArticleGoogle Scholar
- Yang S, Perna NT, Cooksey DA, Okinaka Y, Lindow SE, Ibekwe AM, Keen NT, Yang CH: Genome-wide identification of plant-upregulated genes of Erwinia chrysanthemi 3937 using a GFP-based IVET leaf array. Mol Plant Microbe Interact. 2004, 17: 999-1008. 10.1094/MPMI.2004.17.9.999.PubMedView ArticleGoogle Scholar
- Rediers H, Bonnecarrere V, Rainey PB, Hamonts K, Vanderleyden J, De Mot R: Development and application of a dapB-based in vivo expression technology system to study colonization of rice by the endophytic nitrogen-fixing bacterium Pseudomonas stutzeri A15. Appl Environ Microbiol. 2003, 69: 6864-6874. 10.1128/AEM.69.11.6864-6874.2003.PubMedPubMed CentralView ArticleGoogle Scholar
- Preston GM, Bertrand N, Rainey PB: Type III secretion in plant growth-promoting Pseudomonas fluorescens SBW25. Mol Microbiol. 2001, 41: 999-1014. 10.1046/j.1365-2958.2001.02560.x.PubMedView ArticleGoogle Scholar
- Jackson RW, Preston GM, Rainey PB: Genetic characterization of Pseudomonas fluorescens SBW25 rsp gene expression in the phytosphere and in vitro. J Bacteriol. 2005, 187: 8477-8488. 10.1128/JB.187.24.8477-8488.2005.PubMedPubMed CentralView ArticleGoogle Scholar
- Giddens SR, Jackson RW, Moon CD, Jacobs MA, Zhang XX, Gehrig SM, Rainey PB: Mutational activation of niche-specific genes provides insight into regulatory networks and bacterial function in a complex environment. Proc Natl Acad Sci USA. 2007, 104: 18247-18252. 10.1073/pnas.0706739104.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhang XX, Rainey PB: Regulation of copper homeostasis in Pseudomonas fluorescens SBW25. Environ Microbiol. 2008, 10: 3284-3294. 10.1111/j.1462-2920.2008.01720.x.PubMedView ArticleGoogle Scholar
- Zhang XX, Scott K, Meffin R, Rainey PB: Genetic characterization of psp encoding the DING protein in Pseudomonas fluorescens SBW25. BMC Microbiol. 2007, 7: 114-10.1186/1471-2180-7-114.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhang XX, George A, Bailey MJ, Rainey PB: The histidine utilization (hut) genes of Pseudomonas fluorescens SBW25 are active on plant surfaces, but are not required for competitive colonization of sugar beet seedlings. Microbiology. 2006, 152: 1867-1875. 10.1099/mic.0.28731-0.PubMedView ArticleGoogle Scholar
- Howden AJ, Harrison CJ, Preston GM: A conserved mechanism for nitrile metabolism in bacteria and plants. Plant J. 2009, 57: 243-253. 10.1111/j.1365-313X.2008.03682.x.PubMedView ArticleGoogle Scholar
- Moon CD, Zhang XX, Matthijs S, Schafer M, Budzikiewicz H, Rainey PB: Genomic, genetic and structural analysis of pyoverdine-mediated iron acquisition in the plant growth-promoting bacterium Pseudomonas fluorescens SBW25. BMC Microbiol. 2008, 8: 7-10.1186/1471-2180-8-7.PubMedPubMed CentralView ArticleGoogle Scholar
- Paulsen IT, Press CM, Ravel J, Kobayashi DY, Myers GS, Mavrodi DV, DeBoy RT, Seshadri R, Ren Q, Madupu R, Dodson RJ, Durkin AS, Brinkac LM, Daugherty SC, Sullivan SA, Rosovitz MJ, Gwinn ML, Zhou L, Schneider DJ, Cartinhour SW, Nelson WC, Weidman J, Watkins K, Tran K, Khouri H, Pierson EA, Pierson LS, Thomashow LS, Loper JE: Complete genome sequence of the plant commensal Pseudomonas fluorescens Pf-5. Nat Biotechnol. 2005, 23: 873-878. 10.1038/nbt1110.PubMedView ArticleGoogle Scholar
- Rainey PB, Bailey MJ: Physical and genetic map of the Pseudomonas fluorescens SBW25 chromosome. Mol Microbiol. 1996, 19: 521-533. 10.1046/j.1365-2958.1996.391926.x.PubMedView ArticleGoogle Scholar
- Rainey PB, Travisano M: Adaptive radiation in a heterogeneous environment. Nature. 1998, 394: 69-72. 10.1038/27900.PubMedView ArticleGoogle Scholar
- Bantinaki E, Kassen R, Knight CG, Robinson Z, Spiers AJ, Rainey PB: Adaptive divergence in experimental populations of Pseudomonas fluorescens. III. Mutational origins of wrinkly spreader diversity. Genetics. 2007, 176: 441-453. 10.1534/genetics.106.069906.PubMedPubMed CentralView ArticleGoogle Scholar
- Compeau G, Al-Achi BJ, Platsouka E, Levy SB: Survival of rifampin-resistant mutants of Pseudomonas fluorescens and Pseudomonas putida in soil systems. Appl Environ Microbiol. 1988, 54: 2432-2438.PubMedPubMed CentralGoogle Scholar
- Mathee K, Narasimhan G, Valdes C, Qiu X, Matewish JM, Koehrsen M, Rokas A, Yandava CN, Engels R, Zeng E, Olavarietta R, Doud M, Smith RS, Montgomery P, White JR, Godfrey PA, Kodira C, Birren B, Galagan JE, Lory S: Dynamics of Pseudomonas aeruginosa genome evolution. Proc Natl Acad Sci USA. 2008, 105: 3100-3105. 10.1073/pnas.0711982105.PubMedPubMed CentralView ArticleGoogle Scholar
- Hughes D: Evaluating genome dynamics: the constraints on rearrangements within bacterial genomes. Genome Biol. 2000, 1: REVIEWS0006-10.1186/gb-2000-1-6-reviews0006.PubMedPubMed CentralView ArticleGoogle Scholar
- Tett A, Spiers AJ, Crossman LC, Ager D, Ciric L, Dow JM, Fry JC, Harris D, Lilley A, Oliver A, Parkhill J, Quail MA, Rainey PB, Saunders NJ, Seeger K, Snyder LA, Squares R, Thomas CM, Turner SL, Zhang XX, Field D, Bailey MJ: Sequence-based analysis of pQBR103; a representative of a unique, transfer-proficient mega plasmid resident in the microbial community of sugar beet. ISME J. 2007, 1: 331-340.PubMedPubMed CentralGoogle Scholar
- Lilley AK, Bailey MJ: The acquisition of indigenous plasmids by a genetically marked pseudomonad population colonizing the sugar beet phytosphere is related to local environmental conditions. Appl Environ Microbiol. 1997, 63: 1577-1583.PubMedPubMed CentralGoogle Scholar
- Bentley SD, Chater KF, Cerdeño-Tárraga AM, Challis GL, Thomson NR, James KD, Harris DE, Quail MA, Kieser H, Harper D, Bateman A, Brown S, Chandra G, Chen CW, Collins M, Cronin A, Fraser A, Goble A, Hidalgo J, Hornsby T, Howarth S, Huang CH, Kieser T, Larke L, Murphy L, Oliver K, O'Neil S, Rabbinowitsch E, Rajandream MA, Rutherford K, et al: Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2). Nature. 2002, 417: 141-147. 10.1038/417141a.PubMedView ArticleGoogle Scholar
- Daubin V, Ochman H: Bacterial genomes as new gene homes: the genealogy of ORFans in E. coli. Genome Res. 2004, 14: 1036-1042. 10.1101/gr.2231904.PubMedPubMed CentralView ArticleGoogle Scholar
- Dykhuizen DE, Green L: Recombination in Escherichia coli and the definition of biological species. J Bacteriol. 1991, 173: 7257-7268.PubMedPubMed CentralGoogle Scholar
- Curran B, Jonas D, Grundmann H, Pitt T, Dowson CG: Development of a multilocus sequence typing scheme for the opportunistic pathogen Pseudomonas aeruginosa. J Clin Microbiol. 2004, 42: 5644-5649. 10.1128/JCM.42.12.5644-5649.2004.PubMedPubMed CentralView ArticleGoogle Scholar
- Haubold B, Rainey PB: Genetic and ecotypic structure of a fluorescent Pseudomonas population. Mol Ecol. 1996, 5: 747-761. 10.1111/j.1365-294X.1996.tb00371.x.View ArticleGoogle Scholar
- Konstantinidis KT, Tiedje JM: Prokaryotic taxonomy and phylogeny in the genomic era: advancements and challenges ahead. Curr Opin Microbiol. 2007, 10: 504-509. 10.1016/j.mib.2007.08.006.PubMedView ArticleGoogle Scholar
- Goris J, Konstantinidis KT, Klappenbach JA, Coenye T, Vandamme P, Tiedje JM: DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol. 2007, 57: 81-91. 10.1099/ijs.0.64483-0.PubMedView ArticleGoogle Scholar
- Yamamoto S, Kasai H, Arnold DL, Jackson RW, Vivian A, Harayama S: Phylogeny of the genus Pseudomonas: intrageneric structure reconstructed from the nucleotide sequences of gyrB and rpoD genes. Microbiology. 2000, 146: 2385-2394.PubMedView ArticleGoogle Scholar
- van Passel MW, Kuramae EE, Luyf AC, Bart A, Boekhout T: The reach of the genome signature in prokaryotes. BMC Evol Biol. 2006, 6: 84-10.1186/1471-2148-6-84.PubMedPubMed CentralView ArticleGoogle Scholar
- Cases I, de Lorenzo V, Ouzounis CA: Transcription regulation and environmental adaptation in bacteria. Trends Microbiol. 2003, 11: 248-253. 10.1016/S0966-842X(03)00103-3.PubMedView ArticleGoogle Scholar
- Ren Q, Paulsen IT: Comparative analyses of fundamental differences in membrane transport capabilities in prokaryotes and eukaryotes. PLoS Comput Biol. 2005, 1: e27-10.1371/journal.pcbi.0010027.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhang XX, Lilley AK, Bailey MJ, Rainey PB: The indigenous Pseudomonas plasmid pQBR103 encodes plant-inducible genes, including three putative helicases. FEMS Microbiol Ecol. 2004, 51: 9-17. 10.1016/j.femsec.2004.07.006.PubMedView ArticleGoogle Scholar
- Stover CK, Pham XQ, Erwin AL, Mizoguchi SD, Warrener P, Hickey MJ, Brinkman FS, Hufnagle WO, Kowalik DJ, Lagrou M, Garber RL, Goltry L, Tolentino E, Westbrock-Wadman S, Yuan Y, Brody LL, Coulter SN, Folger KR, Kas A, Larbig K, Lim R, Smith K, Spencer D, Wong GK, Wu Z, Paulsen IT, Reizer J, Saier MH, Hancock RE, Lory S, Olson MV: Complete genome sequence of Pseudomonas aeruginosa PA01, an opportunistic pathogen. Nature. 2000, 406: 959-964. 10.1038/35023079.PubMedView ArticleGoogle Scholar
- Silby MW, Rainey PB, Levy SB: IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes. Microbiology. 2004, 150: 518-520. 10.1099/mic.0.26871-0.PubMedView ArticleGoogle Scholar
- Silby MW, Levy SB: Overlapping protein-encoding genes in Pseudomonas fluorescens Pf0-1. PLoS Genet. 2008, 4: e1000094-10.1371/journal.pgen.1000094.PubMedPubMed CentralView ArticleGoogle Scholar
- Parkhill J, Wren BW, Thomson NR, Titball RW, Holden MT, Prentice MB, Sebaihia M, James KD, Churcher C, Mungall KL, Baker S, Basham D, Bentley SD, Brooks K, Cerdeño-Tárraga AM, Chillingworth T, Cronin A, Davies RM, Davis P, Dougan G, Feltwell T, Hamlin N, Holroyd S, Jagels K, Karlyshev AV, Leather S, Moule S, Oyston PC, Quail M, Rutherford K, et al: Genome sequence of Yersinia pestis, the causative agent of plague. Nature. 2001, 413: 523-527. 10.1038/35097083.PubMedView ArticleGoogle Scholar
- Parkhill J, Sebaihia M, Preston A, Murphy LD, Thomson N, Harris DE, Holden MT, Churcher CM, Bentley SD, Mungall KL, Cerdeño-Tárraga AM, Temple L, James K, Harris B, Quail MA, Achtman M, Atkin R, Baker S, Basham D, Bason N, Cherevach I, Chillingworth T, Collins M, Cronin A, Davis P, Doggett J, Feltwell T, Goble A, Hamlin N, Hauser H, et al: Comparative analysis of the genome sequences of Bordetella pertussis, Bordetella parapertussis and Bordetella bronchiseptica. Nat Genet. 2003, 35: 32-40. 10.1038/ng1227.PubMedView ArticleGoogle Scholar
- Wu M, Sun LV, Vamathevan J, Riegler M, Deboy R, Brownlie JC, McGraw EA, Martin W, Esser C, Ahmadinejad N, Wiegand C, Madupu R, Beanan MJ, Brinkac LM, Daugherty SC, Durkin AS, Kolonay JF, Nelson WC, Mohamoud Y, Lee P, Berry K, Young MB, Utterback T, Weidman J, Nierman WC, Paulsen IT, Nelson KE, Tettelin H, O'Neill SL, Eisen JA: Phylogenomics of the reproductive parasite Wolbachia pipientis wMel: a streamlined genome overrun by mobile genetic elements. PLoS Biol. 2004, 2: E69-10.1371/journal.pbio.0020069.PubMedPubMed CentralView ArticleGoogle Scholar
- Delihas N: Small mobile sequences in bacteria display diverse structure/function motifs. Mol Microbiol. 2008, 67: 475-481.PubMedPubMed CentralView ArticleGoogle Scholar
- Aranda-Olmedo I, Tobes R, Manzanera M, Ramos JL, Marques S: Species-specific repetitive extragenic palindromic (REP) sequences in Pseudomonas putida. Nucleic Acids Res. 2002, 30: 1826-1833. 10.1093/nar/30.8.1826.PubMedPubMed CentralView ArticleGoogle Scholar
- Jordan IK, Rogozin IB, Wolf YI, Koonin EV: Essential genes are more evolutionarily conserved than are nonessential genes in bacteria. Genome Res. 2002, 12: 962-968.PubMedPubMed CentralView ArticleGoogle Scholar
- Shakhnovich BE, Koonin EV: Origins and impact of constraints in evolution of gene families. Genome Res. 2006, 16: 1529-1536. 10.1101/gr.5346206.PubMedPubMed CentralView ArticleGoogle Scholar
- Spiers AJ, Bohannon J, Gehrig SM, Rainey PB: Biofilm formation at the air-liquid interface by the Pseudomonas fluorescens SBW25 wrinkly spreader requires an acetylated form of cellulose. Mol Microbiol. 2003, 50: 15-27. 10.1046/j.1365-2958.2003.03670.x.PubMedView ArticleGoogle Scholar
- Spiers AJ, Kahn SG, Bohannon J, Travisano M, Rainey PB: Adaptive divergence in experimental populations of Pseudomonas fluorescens. I. Genetic and phenotypic bases of wrinkly spreader fitness. Genetics. 2002, 161: 33-46.PubMedPubMed CentralGoogle Scholar
- Lawrence JG, Ochman H: Molecular archaeology of the Escherichia coli genome. Proc Natl Acad Sci USA. 1998, 95: 9413-9417. 10.1073/pnas.95.16.9413.PubMedPubMed CentralView ArticleGoogle Scholar
- Mavrodi DV, Loper JE, Paulsen IT, Thomashow LS: Mobile genetic elements in the genome of the beneficial rhizobacterium Pseudomonas fluorescens Pf-5. BMC Microbiol. 2009, 9: 8-10.1186/1471-2180-9-8.PubMedPubMed CentralView ArticleGoogle Scholar
- El-Sayed AK, Hothersall J, Cooper SM, Stephens E, Simpson TJ, Thomas CM: Characterization of the mupirocin biosynthesis gene cluster from Pseudomonas fluorescens NCIMB 10586. Chem Biol. 2003, 10: 419-430. 10.1016/S1074-5521(03)00091-7.PubMedView ArticleGoogle Scholar
- Burrus V, Waldor MK: Shaping bacterial genomes with integrative and conjugative elements. Res Microbiol. 2004, 155: 376-386. 10.1016/j.resmic.2004.01.012.PubMedView ArticleGoogle Scholar
- Pitman AR, Jackson RW, Mansfield JW, Kaitell V, Thwaites R, Arnold DL: Exposure to host resistance mechanisms drives evolution of bacterial virulence in plants. Curr Biol. 2005, 15: 2230-2235. 10.1016/j.cub.2005.10.074.PubMedView ArticleGoogle Scholar
- Collyn F, Billault A, Mullet C, Simonet M, Marceau M: YAPI, a new Yersinia pseudotuberculosis pathogenicity island. Infect Immun. 2004, 72: 4784-4790. 10.1128/IAI.72.8.4784-4790.2004.PubMedPubMed CentralView ArticleGoogle Scholar
- Thomson NR, Howard S, Wren BW, Holden MT, Crossman L, Challis GL, Churcher C, Mungall K, Brooks K, Chillingworth T, Feltwell T, Abdellah Z, Hauser H, Jagels K, Maddison M, Moule S, Sanders M, Whitehead S, Quail MA, Dougan G, Parkhill J, Prentice MB: The complete genome sequence and comparative genome analysis of the high pathogenicity Yersinia enterocolitica strain 8081. PLoS Genet. 2006, 2: e206-10.1371/journal.pgen.0020206.PubMedPubMed CentralView ArticleGoogle Scholar
- Pickard D, Wain J, Baker S, Line A, Chohan S, Fookes M, Barron A, Gaora PO, Chabalgoity JA, Thanky N, Scholes C, Thomson N, Quail M, Parkhill J, Dougan G: Composition, acquisition, and distribution of the Vi exopolysaccharide-encoding Salmonella enterica pathogenicity island SPI-7. J Bacteriol. 2003, 185: 5055-5065. 10.1128/JB.185.17.5055-5065.2003.PubMedPubMed CentralView ArticleGoogle Scholar
- Stanier RY, Palleroni NJ, Doudoroff M: The aerobic pseudomonads: a taxonomic study. J Gen Microbiol. 1966, 43: 159-271.PubMedView ArticleGoogle Scholar
- Rainey PB, Bailey MJ, Thompson IP: Phenotypic and genotypic diversity of fluorescent pseudomonads isolated from field-grown sugar beet. Microbiology. 1994, 140: 2315-2331.PubMedView ArticleGoogle Scholar
- Ravel J, Cornelis P: Genomics of pyoverdine-mediated iron uptake in pseudomonads. Trends Microbiol. 2003, 11: 195-200.PubMedView ArticleGoogle Scholar
- Parkhill J, Achtman M, James KD, Bentley SD, Churcher C, Klee SR, Morelli G, Basham D, Brown D, Chillingworth T, Davies RM, Davis P, Devlin K, Feltwell T, Hamlin N, Holroyd S, Jagels K, Leather S, Moule S, Mungall K, Quail MA, Rajandream MA, Rutherford KM, Simmonds M, Skelton J, Whitehead S, Spratt BG, Barrell BG: Complete DNA sequence of a serogroup A strain of Neisseria meningitidis Z2491. Nature. 2000, 404: 502-506. 10.1038/35006655.PubMedView ArticleGoogle Scholar
- Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, Rajandream MA, Barrell B: Artemis: sequence visualization and annotation. Bioinformatics. 2000, 16: 944-945. 10.1093/bioinformatics/16.10.944.PubMedView ArticleGoogle Scholar
- Pierce E, Xie G, Barabote RD, Saunders E, Han CS, Detter JC, Richardson P, Brettin TS, Das A, Ljungdahl LG, Ragsdale SW: The complete genome sequence of Moorella thermoacetica (f. Clostridium thermoaceticum). Environ Microbiol. 2008, 10: 2550-2573. 10.1111/j.1462-2920.2008.01679.x.PubMedPubMed CentralView ArticleGoogle Scholar
- Carver TJ, Rutherford KM, Berriman M, Rajandream MA, Barrell BG, Parkhill J: ACT: the Artemis Comparison Tool. Bioinformatics. 2005, 21: 3422-3423. 10.1093/bioinformatics/bti553.PubMedView ArticleGoogle Scholar
- Carver T, Thomson N, Bleasby A, Berriman M, Parkhill J: DNAPlotter: circular and linear interactive genome visualization. Bioinformatics. 2009, 25: 119-120. 10.1093/bioinformatics/btn578.PubMedPubMed CentralView ArticleGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.PubMedPubMed CentralView ArticleGoogle Scholar
- Castresana J: Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000, 17: 540-552.PubMedView ArticleGoogle Scholar
- Stamatakis A: RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006, 22: 2688-2690. 10.1093/bioinformatics/btl446.PubMedView ArticleGoogle Scholar
- Price AL, Jones NC, Pevzner PA: De novo identification of repeat families in large genomes. Bioinformatics. 2005, 21 (Suppl 1): i351-358. 10.1093/bioinformatics/bti1018.PubMedView ArticleGoogle Scholar
- Higgins DG, Sharp PM: CLUSTAL: a package for performing multiple sequence alignment on a microcomputer. Gene. 1988, 73: 237-244. 10.1016/0378-1119(88)90330-7.PubMedView ArticleGoogle Scholar
- Clamp M, Cuff J, Searle SM, Barton GJ: The Jalview Java alignment editor. Bioinformatics. 2004, 20: 426-427. 10.1093/bioinformatics/btg430.PubMedView ArticleGoogle Scholar
- Schuster-Bockler B, Schultz J, Rahmann S: HMM Logos for visualization of protein families. BMC Bioinformatics. 2004, 5: 7-10.1186/1471-2105-5-7.PubMedPubMed CentralView ArticleGoogle Scholar
- Vernikos GS, Parkhill J: Interpolated variable order motifs for identification of horizontally acquired DNA: revisiting the Salmonella pathogenicity islands. Bioinformatics. 2006, 22: 2196-2203. 10.1093/bioinformatics/btl369.PubMedView ArticleGoogle Scholar
- Rainey PB, Heithoff DM, Mahan MJ: Single-step conjugative cloning of bacterial gene fusions involved in microbe-host interactions. Mol Gen Genet. 1997, 256: 84-87. 10.1007/s004380050548.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.