Gene functionalities and genome structure in Bathycoccus prasinos reflect cellular specializations at the base of the green lineage
- Hervé Moreau1, 2Email author,
- Bram Verhelst3, 4,
- Arnaud Couloux5,
- Evelyne Derelle1, 2,
- Stephane Rombauts3, 4,
- Nigel Grimsley1, 2,
- Michiel Van Bel3, 4,
- Julie Poulain5,
- Michaël Katinka5,
- Martin F Hohmann-Marriott6,
- Gwenael Piganeau1, 2,
- Pierre Rouzé3, 4,
- Corinne Da Silva5,
- Patrick Wincker†5,
- Yves Van de Peer†3, 4 and
- Klaas Vandepoele3, 4
© Moreau et al.; licensee BioMed Central Ltd. 2012
Received: 11 April 2012
Accepted: 24 August 2012
Published: 24 August 2012
Bathycoccus prasinos is an extremely small cosmopolitan marine green alga whose cells are covered with intricate spider's web patterned scales that develop within the Golgi cisternae before their transport to the cell surface. The objective of this work is to sequence and analyze its genome, and to present a comparative analysis with other known genomes of the green lineage.
Its small genome of 15 Mb consists of 19 chromosomes and lacks transposons. Although 70% of all B. prasinos genes share similarities with other Viridiplantae genes, up to 428 genes were probably acquired by horizontal gene transfer, mainly from other eukaryotes. Two chromosomes, one big and one small, are atypical, an unusual synapomorphic feature within the Mamiellales. Genes on these atypical outlier chromosomes show lower GC content and a significant fraction of putative horizontal gene transfer genes. Whereas the small outlier chromosome lacks colinearity with other Mamiellales and contains many unknown genes without homologs in other species, the big outlier shows a higher intron content, increased expression levels and a unique clustering pattern of housekeeping functionalities. Four gene families are highly expanded in B. prasinos, including sialyltransferases, sialidases, ankyrin repeats and zinc ion-binding genes, and we hypothesize that these genes are associated with the process of scale biogenesis.
The minimal genomes of the Mamiellophyceae provide a baseline for evolutionary and functional analyses of metabolic processes in green plants.
Marine phytoplankton is responsible for about half of the photosynthetic activity on the planet , the second half being carried out by terrestrial plants. Two major traits differentiate these two classes of organisms. First, phytoplankton is essentially composed of unicellular organisms that have a high turnover; whereas terrestrial plants are renewed, on average, once every 9 years, the global phytoplankton population is replaced approximately every week . Second, while photosynthesis is confined to specific organs of plants, often only a minor component of the plant biomass, in phytoplankton, photosynthesis essentially takes place in each cell. Phytoplankton populations are thus highly dynamic and may be able to adapt rapidly to changing environments. Even so, a global decline of photosynthetic micro-organisms over the past century has recently been reported , motivating research aimed at better understanding the global diversity of phytoplankton and how these species adapt to changing marine environment.
Phytoplankton is usually pragmatically classified according to size, from pico- (below 3 µm), nano- (3 to 8 µm) to micro-algae (above 5 to 8 µm), although these categories have no evolutionary significance. The eukaryotic fraction of picophytoplankton accounts for a modest part of the oceanic biomass, but nevertheless contributes an important part to primary production in many oceanic waters [3, 4]. Among these picoeukaryotes, environmental diversity studies based on ribosomal gene sequences showed that small green algae, and notably the three genera Bathycoccus, Micromonas and Ostreococcus, are distributed worldwide and are numerically important in coastal areas. These three genera are characterized by their small size (1 to 2 µm), their rudimentary cellular organization (one mitochondrion and one chloroplast) and their small genomes (from 13 to 22 Mb). Micromonas  is a naked cell with one long flagellum whereas the two other genera are non-motile. Ostreococcus [6, 7] is naked whereas Bathycoccus  is covered with scales. The complete genome sequences of two Micromonas , two Ostreococcus [10, 11] and a low-light adapted strain of Ostreococcus (strain RCC809, available on the Joint Genome Institute web site) have been analyzed. The three genera belong to the order Mamiellales, in the class Mamiellophyceae [12, 13], a monophyletic group in the phylum Chlorophyta. The ancestors of these micro-organisms emerged at the base of the green lineage and knowledge about them provides a baseline for exploring the evolution of this lineage, which also gave rise to terrestrial plants. Given their small cellular and genome sizes, they may reveal the 'bare limits' of life as a free-living photosynthetic eukaryotes, thus presenting a simple organization with very little non-coding sequences .
Here we report the analysis of the genome of one Mediterranean strain belonging to the genus Bathycoccus and its comparison with Mamiellales and other green algae, allowing a survey of the genome organization at the base of the green lineage. Although Bathycoccus was initially isolated from deep water (100 meters) , it has been frequently reported in various marine environments and seems an important component of the picoeukaryote compartment [15–18]. The availability of this genome, coupled to the development of new sequencing possibilities for metagenomes [19, 20] from various marine environments, opens the way for comparative studies and to a better understanding of the adaptations of this organism to its environment(s).
Results and discussion
Characterization and phylogenetic position of the Bathycoccus prasinos RCC1105 strain
Global characteristics of the Bathycoccus genome
Nuclear genome characteristics of green algae
Genome size (Mb)
Bathycoccus sp. RCC1105
Micromonas sp. RCC299
Micromonas sp. CCMP1545
Ostreococcus lucimarinus clade A
Ostreococcus sp. RCC809 clade B
Ostreococcus tauri clade C
Chlorella sp. NC64A
Based on the annotated gene sets of different land plants and green algae, sequence similarity searches were performed to group homologous genes into families (a family being defined as a set of two or more homologous genes; see Materials and methods). Subsequently, pan and core genome plots were built to quantify the number of shared and unique genes and families between different species (Figure S5 in Additional file 1). Comparing the set of core genes between different algal groups reveals that the smaller genome sizes of Mamiellales, as well as the lower number of genes, correspond both with the decrease of the average number of genes per family and with the number of families conserved within a specific clade. For example, whereas the number of gene families shared between all land plants, Chlamydomonales, and Trebouxiophyceae is 2,692, this number drops to 1,959 when including all Mamiellales species. Similarly, based on a set of core gene families conserved in both land plants and algae, the average gene family size is smaller for Mamiellales compared to Trebouxiophyceae or Chlamydomonales (average of 1.63, 1.78 and 1.93 genes per family, respectively). More than 500 gene families were found that were conserved between land plants and green algae but that were lost in all Mamiellales species (Figure S6 in Additional file 1). These families were enriched for functions related to zinc ion-binding and transport (ten families), UDP-glucosyltransferase activity (six families), vitamin ion binding (eight families) and sucrose and fatty acid metabolism (eight families) (Table S2 in Additional file 1). Although this pattern suggests a reduction of the functional gene repertoire, we also found more than 400 gene families that are specific to Mamiellales and found in all Mamiellales species. Whereas many of these Mamiellales-specific genes have unknown functions, three families related to drug transport and ten families including genes related to zinc ion binding were found (Table S2 in Additional file 1). Although rapid sequence evolution can interfere with the accurate detection of homologs using similarity searches, the observed pattern indicates a high turnover of zinc ion binding-related genes.
Biological role and evolution of the big and small outlier chromosomes in Bathycoccus and in the Mamiellales
Characteristics of the small outlier chromosomes for Bathycoccus and one Micromonas and one Ostreococcus species
Gene densities (bp/gene)
Micromonas sp. RCC299
The big outlier chromosome in Bathycoccus
The BOC1 region also displays structural specificities that are absent in the rest of the genome. Besides its low GC content, genes in the BOC1 region are split by many small (40 to 65 bp) AT-rich introns [10, 11]. This feature is present in all of the sequenced Mamiellales genomes (Figure S10 in Additional file 1) and absent from the genomes of other green algae (Figure S10 in Additional file 1). There is no universal RNA-fold for these introns and no conserved sequence motifs (for example, branch points, splice sites) could be detected. Although the only intrinsic indication from their DNA sequences that they are introns comes from their AT-richness relative to the surrounding GC rich exons, their existence is clear from EST data. Consequently, the BOC1 region includes a high proportion of multiple exon genes, a feature absent in the rest of the genome (Table S2 in Additional file 1). In Bathycoccus, 103 of the 214 BOC1 genes harbor 330 introns, an intron content tenfold higher than in the rest of the genome (average of 1.54 and 0.15 introns per BOC1 and non-BOC1 gene, respectively).
In conclusion, the BOC1 region in the Mamiellales has unique structural characteristics: it represent one contiguous low(er) GC content region in the chromosome, flanked by two high(er) GC content regions at the extremities and carries between 193 and 633 genes depending on the species examined. The gene order within the region shows little colinearity between species and it encodes a high proportion of often vital housekeeping genes with elevated expression levels clustered together in a pattern unique to the Mamiellales (Figure 4). The biological reason for the existence of this region remains obscure, although its structural characteristics (shuffling of genes, small introns, low GC content) concur with the hypothesis that it may be a sex or species-barrier chromosome [27, 28].
The small outlier chromosome in Bathycoccus
The size of the Bathycoccus small outlier chromosome is 146,238 bp. compared to around 150 kb in Ostreococcus lucimarinus and 200 to 250 kb in Micromonas. The SOC average gene density in B. prasinos is slightly lower than that observed in the other chromosomes (72 genes with an average of 2.0 kb per gene in SOC compared to 1.7 kb per gene in the global genome), with a similar expression level based on EST counts. Only 44% of the genes in SOC have a potentially identified function compared to 77% in other chromosomes. Furthermore, up to 75% of the SOC genes have no known plant orthologs, in sharp contrast to most other chromosomes, where most genes share green lineage descent. Last but not least, in Bathycoccus, 24 of the 34 SOC genes having an identified function group in two categories. The first group encodes enzymes involved in metabolism of glycoconjugates (17 genes), mainly glycosyltransferase (12 genes), and the second is related to methyl transferases (7 genes). These features are globally similar in the other known Mamiellales SOCs, where the same two dominant gene functions were found (Table 2). However, despite their common function, no synteny and almost no orthologous relationships could be established between the SOCs of the different Mamiellales' species, suggesting a more functional convergence than a common phylogenetic origin. To explain the presence of such genes in SOCs, an alien origin of these chromosomes was proposed, which could have yielded some selective advantages in cell surface processes, potentially related, for example, to defense against pathogens or other environmental interactions . However, since SOCs and BOCs have now been found in all sequenced mamiellophycean genomes, it is likely that their lower GC composition, higher proportion of specific genes and higher evolution rates [9, 29] are being maintained by the same evolutionary pressure in all of these species. Interestingly, a paper on the cyanobacteria Prochlorococcus describes how variable genomic islands showing similar characteristics to those found in SOCs (low number of orthologs, a high level of horizontal gene transfer (HGT) and a high fraction of sugar-modifying enzymes, methyl transferases and membrane associated proteins) are involved in resistance to viruses . The viral resistance determined by these genomic islands induced a fitness cost measured either by a reduced growth rate and/or a more rapid infection by other viruses. The three genera Bathycoccus, Micromonas and Ostreococcus are the microalgae tested, which are among the most attacked by viruses , and viral resistance phenomena showing similar characteristics to what is reported for Prochlorococcus (reduced growth rate and higher infection rate by other viruses) have been reported to occur frequently . It is tempting to link this unusual high viral sensitivity and the ability to develop rapid and frequent resistance to these attacks to the presence of SOCs. Interestingly, two other Mamiellales species (Mamiella sp. or Mantoniella squamata) were tested recently and did not show this high viral sensitivity (N Simon, personal communication). It can be predicted that if our hypothesis on the link between SOC and viral hypersensitivity/resistance is correct, these species should not present a SOC-like structure in their genome.
Phylogenomics suggests many horizontal gene transfers
In line with a recent report about the acquisition of ice-binding proteins in sea ice diatoms from prokaryotic origin , it is tempting to speculate that the HGT genes contribute new functional properties to the Bathycoccus genome. The analysis of a large DNA virus in Ostreococcus tauri suggested that the capture of host DNA in viral genomes could represent a mechanism for the transfer of genes between eukaryotic cells . This idea was confirmed by the additional sequencing of four double-stranded DNA marine prasinovirus genomes (infecting Bathycoccus, Micromonas, and Ostreococcus), showing that these viruses encode a gene repertoire of certain amino acid biosynthesis pathways never previously observed in viruses that are likely to have been acquired from lateral gene transfer from their host or from bacteria . A similar eukaryotic phytoplankton-virus system was also described in Emiliania huxleyi, mediating the transfer of seven genes related to sphingolipid biosynthesis .
To verify whether specific genomic regions or chromosomes would be more likely to harbor genes arriving via HGT, we estimated the number of HGT genes per chromosome. We observed that transferred genes were more or less equally distributed over the different chromosomes, except for the low GC outlier chromosomes, which contained higher fractions of HGT genes (BOC1 and SOC contain 1.63 and 1.54 times more HGT genes compared to the genome-wide average; Figure 5b). Different possibilities for the increased abundance of HGT on the outliers include, for example: (1) they may have specific sequence features that can serve to integrate HGT genes that are subsequently re-arranged and embedded in other locations in the genome; (2) it may reflect a lower density of essential gene functionalities in outliers, which could thus support a higher density of random insertions; or (3) there might be a lower level of recombination on these chromosomes, reducing the rate of removal of deleterious alleles via sexual recombination. None of these scenarios are mutually exclusive.
Expanded gene families in the Bathycoccus genome
Copy number in Bathycoccus prasinos
Copy number in Micromonassp. CCMP1545
Copy number in Micromonassp. RCC299
Copy number in Ostreococcus lucimarinus
Copy number in Ostreococcussp. RCC809
Copy number in Ostreococcus tauri
Glycosyl transferase, family 29 (IPR001675)
Ankyrin repeats (IPR020683)
Zinc finger, C2H2
Sialic acid metabolism in Bathycoccus
The second gene family includes sialidases (or neuraminidases), which are enzymes cleaving the terminal sialic acid residues from glycoproteins or glycolipids. Again, this gene expansion is specific to Bathycoccus. In contrast to the previous family, no clear domain organization could be defined in sialidases, but some key amino acids known to be involved in the catalytic activity are conserved in the Bathycoccus family. The taxonomic origin of the sialidases is less clear than that for the sialyltransferases discussed above, and could correspond to either metazoans or bacteria. For sialidases, scores are globally weak and best blast hits are found mostly with hypothetical proteins either from the choanoflagellate Monosiga brevicolis or from the green alga Chlorella variabilis (where only one sialidase has been annotated).
The expansion of these two enzyme families prompted us to look for specific potential 'sialic acid' metabolism in Bathycoccus. The composition of flagellar scales in Scherffelia dubia (phylum Chlorophyta, class Chlorodendrophyceae) was found to be a mix of acidic polysaccharides having similar structures to sialic acids . Although the chemical nature of the scales covering the Bathycoccus cell membrane is unknown, it is tempting to establish a correlation between the potential biosynthetic pathway of these scales and the expansion of gene families involved in the metabolism of sialic acids. Furthermore, we confirmed previous electron microscopy studies [8, 47] showing that, in Bathycoccus as in other Mamiellophyceae, scale biosynthesis occurs inside intracellular vesicles with striking resemblance to Golgi vesicles (Figure 6b-d); that is, in agreement with the notion that they might be produced by sialyltransferases located at the luminal side of intracellular vesicles. Scales almost identical to those of B. prasinos are observed in the more closely related Mantoniella squamata , where they are also extruded to the surface after transport via the Golgi body [49–51].
Other Bathycoccus expanded gene families
One of the two other highly expanded gene families in the Bathycoccus genome are ankyrin-repeat proteins (149 gene copies). This family is also expanded, although to a lesser extent, in the Micromonas strain CCMP1545 (56 copies), whereas only very few copies were detected in other Mamiellales (Table 3). These genes have ankyrin repeats located in the carboxy-terminal part of the protein whereas the amino-terminal part has no hit in GenBank. There are also many other ankyrin repeats containing genes in Bathycoccus as in both plants and microalgae, but associated with different protein domains that often have predicted functionalities. Indeed, the ankyrin repeat is considered as one of the most common protein-protein interaction motifs in nature . The 149 Bathycoccus-specific genes were not distributed randomly among chromosomes, with the bigger chromosomes having few copies, whereas chromosomes 12 or 19 bear many tandem duplicated genes (Figure S12 in Additional file 1). No obvious function can be attributed to these genes. However, by analogy with the human membrane-associated ankyrin, which is responsible for the attachment of the cytoskeleton to the plasma membrane, it is possible that a number of these genes might function in some way to bind extracellular scales to the plasmic membrane, although experimental evidence is lacking. It has been shown, however, by electron microscopy coupled to immunogold that scales in Scherffelia dubia are linked to the membrane by glycoproteins . In addition, in Tetraselmis striata (Chlorodendrophyceae) some scale-associated glycoproteins may provide connections between scales and the underlying flagellar membrane .
The last group of expanded genes in Bathycoccus are zinc finger proteins. There are many zinc finger proteins in microalgae and in plants, but the family specifically expanded in Bathycoccus is most related to the C2H2-type zinc finger DNA-binding domain of certain integrases, which share a common alpha/beta two-layer sandwich core structure. The typical organization of the 48 copies identified in the Bathycoccus genome (Table 3) includes a short amino-terminal part (around 20 to 40 amino acids) followed by a strongly acidic region (10 to 20 amino acids) and by 2 to 6 C2H2 domains. Zinc finger proteins were originally identified as DNA-binding domains, although a growing body of evidence suggests an important and widespread role for these domains in protein binding. There are even examples of zinc fingers that support both DNA and protein interactions, and, globally, C2H2 protein-protein interactions are proving to be more abundant than previously appreciated .
The most parsimonious explanation for the abundance of the four expanded gene families would be an initial single HGT event followed by expansion in the Bathycoccus genome. The potential function of these four gene families and their expansion only in Bathycoccus also suggest that they could all be involved in the biosynthesis, exportation and fixation of the scales around the external membrane, and possibly for protection of the cell. Several other members of the Mamiellales have morphologically similar scales around the cells, but they are absent in the two genera Micromonas and Ostreococcus. The most parsimonious evolutionary scenario to explain these observations is that the scale synthesis pathway was acquired by the ancestor of the Mamiellales (or even before) and has been lost in the two naked genera. This scenario predicts that similar gene family expansions should be found in the genomes of other scaled Mamiellaophyceae but not in Micromonas and Ostreococcus. This is the case for Micromonas and Ostreococcus, but the genome sequences of other scaled species are not yet available.
Mamiellophyceae, and more particularly the three genera Bathycoccus, Micromonas and Ostreococcus, are dominant in different marine areas, where they can play an important role in the primary biomass production. However, the ecological importance of Bathycoccus has probably been overlooked these past years, although it was sporadically mentioned in several studies [5, 16–18]. The availability of this genome, coupled to the development of new sequencing possibilities for metagenomes [19, 20] from various marine environments, opens the door to future comparative studies and to a better understanding of the adaptations of the organisms to their environment.
Materials and methods
B. prasinos RCC1105 genome and EST sequencing and annotation
The sequenced strain B. prasinos RCC1105 was isolated in the bay of Banyuls sur mer at the SOLA station (Additional file 1). The genomic DNA was extracted from cell pellets containing a collective total of 6.4 × 1010 cells, using a cetyl trimethyl ammonium bromide protocol (adapted from ). The Bathycoccus genome was sequenced using Sanger technology on three independent shot-gun libraries with insert sizes of 3 (TK0AAA, vector pcdna2.1 (BstXI)), 10 (TK0AAB, vector pCNS (BstXI)) and 50 kb (TK0ACA, vector pBeloBAC11 (HindIII) and TK0ACB, vector pBeloBAC11 (BamHI)), resulting in 230,496 reads (180 Mb), 118,070 reads (152 Mb) and 10,368 reads (14 Mb), respectively. After trimming, read numbers were 223,577 reads (174 Mb) for the 3 kb library, 112,842 reads (145 Mb) for the 10 kb library and 8,189 reads (11 Mb) for the 50 kb library, and represented a coverage of 22-fold from 330 Mb of sequenced DNA. The data were assembled using the Genoscope pipe-line that includes the software Arachne 3.0 . ESTs were sequenced from a Bathycoccus culture grown to log phase (107 cells/ml), harvested by centrifugation and the cell pellets were immediately flash frozen in liquid nitrogen. The total RNA was extracted using the TriReagent (Sigma-Aldrich, Saint-Quentin, France) protocol and mRNAs purified using Poly(A)Purist (Ambion-Applied Bioystems, Saint Aubin, France). Complementary DNAs were constructed and cloned using the CloneMiner procedure (InvitroGen, Saint Aubin, France) with some minor modifications. EST sequences were obtained using pyrosequencing technology developed by Roche (Boulogne-Billancourt, France). A total of 253,791 EST reads were processed through the Genoscope EST pipeline. Short (<60 bp) and low complexity sequences were identified and removed. Clustering and assembly of all 251,875 filtered EST reads resulted in 8,370 EST consensus sequences.
The genome was annotated using the EuGene [56, 57] gene finding system with Splice- Machine  signal sensor components trained specifically on Bathycoccus datasets. The functional annotation resulted from the synthesis of InterPro and the BLASTP hits against the non-redundant UniProt database. Gene Ontology assignments were derived from the InterPro results. Gene Ontology enrichment analysis was performed using the hypergeometric distribution with Bonferonni correction for multiple hypothesis testing and corrected P-values <0.05 were retained as significant. The resulting database is publicly available at  in a format that includes browse and query options and the genome has been submitted to GenBank.
Comparative sequence and expression analysis
Starting from all protein-coding genes from the included species (Table 1), only retaining the longest transcript if alternative splicing variants exist, protein sequences were used to construct gene families by applying sequence-based protein clustering. First, an all against all sequence comparison was performed using BLASTP, applying an E-value threshold of 1e-05 and retaining the best 500 hits . Next, the complete sequence similarity graph was processed using Tribe-MCL (mclblastline, default parameters except I = 2 and scheme = 4) to identify gene families. A set of 154 single-copy core gene families was used to construct the phylogenetic tree depicted in Figure S1 in Additional file 1 (see also Additional files 2 and 3).
The boundaries of all Mamiellales BOC1 regions were manually delineated based on gene coordinates, gene family information and GC content (Table S2 in Additional file 1). For non-Mamiellales, a 'virtual' BOC1 region was created by taking the best BLASTP hit for each B. prasinos RCC1105 BOC1 gene. Putative BOC1 Mamiellales core gene families (Figure 6, blue lines) were identified by first retaining only those families that contain at least one protein for each Mamiellales species. Next, each family was aligned and manually curated. This was done by inspecting and correcting, if necessary, the structural and functional annotation (NCBI BLAST results plus InterProScan) of all cluster members. For Ostreococcus sp. RCC809 no SOC could be identified in the current draft genome assembly (Additional file 1).
To detect co-linearity within and between species, i-ADHoRe 3.0 was used (Additional file 1)  and all chromosomes from all species were compared against each other and significant colinear regions were identified. All gene colinearity can be browsed using the pico-PLAZA comparative genomics platform . i-ADHoRe was run with the following settings: alignment_method gg, gap_size 30, cluster_gap 35, q_value 0.9, prob_cutoff 0.0001, anchor_points 5 and level_2_only false.
EST databases were retrieved from their respective public repositories and mapped on the Mamiellales genomes using GenomeThreader  with a minimum alignment score threshold of 0.95 and minimum transcript coverage of 0.89. Only uniquely mapped ESTs were retained and assigned to genes. When an EST with no strand information overlapped with two adjacent genes, it was assigned to the gene with the highest overlap. For the BOC expression analysis global gene, EST counts were first summarized per functional category. In a second stage, expression enrichment was determined by comparing for each functional category the fraction of BOC expressed genes against the overall fraction of BOC expressed genes (denoted 'relative BOC expression enrichment' in Figure S8 in Additional file 1).
Analysis of potential horizontal gene transfer
For each protein-coding gene a BLAST sequence similarity search was performed against the NCBI protein database, which contains the proteins of all sequenced Ostreococcus, Micromonas and Chlamydomonas species (E-value <1e-05). Starting from a selection of BLAST hits a phylogenetic approach was used to identify the putative origin of all genes. Briefly, good hits (20% top hits relative to the best Bit score excluding query self-hits) were retained per gene, protein sequences and detailed taxonomic information was retrieved, a multiple sequence alignment was generated using MUSCLE and a maximum likelihood phylogenetic tree was constructed using PhyML (100 bootstrap sets, WAG model, kappa estimated, 4 substitution rate categories, gamma distribution parameter estimated, BIONJ starting tree, no topology, branch lengths and rate parameter optimization). For each query gene the corresponding tree topology was investigated to identify the nearest neighbor gene/clade, including bootstrap support, and determine the nearest neighbor taxonomic information. Genes showing complex punctuate patterns  (that is, clustering with homologs from different phyla outside the Viridiplantae; labeled 'multi-kingdom' in Table S5 in Additional file 1) were excluded. Singletons refer to genes for which no phylogenetic analysis could be done because they only have a single BLAST hit based on the 20% top hits. Nearest neighbors with bootstrap support >90% and gene coverage of 50% or more in the multiple alignment were scored as reliable HGT genes to estimate the fraction of eukaryotic origin. Although the low number of HGT genes found in Arabidopsis does not serve as a perfect negative control for the detection of HGT in unicellular green algae, it suggests that, when applied to a full set of proteins of a specific organism, this approach gives a conservative estimate of putative transfer events with a low number of false positives. To verify if, for some HGT genes, homologous genes exist in other algae that were missed during the process of gene annotation, a systematic sequence similarity search (using tblastn, E-value threshold 1e-05 against intergenic sequences of O. tauri, O. lucimarinus, Ostreococcus RCC809, M. pusilla and C. reinhardtii) revealed that, on average, no homologous locus could be found for 93% of the HGT genes. A list of all HGT genes together with protein alignment and phylogenetic tree statistics is available in Additional file 5.
Four functional categories (two types with two subdivisions each) were defined and genes were assigned to each class, if applicable. The first type of functional category describes the expression state of a gene (based on uniquely mapped ESTs; is a gene expressed (number of ESTs >0) or highly expressed (number of ESTs >2)) while the second type describes the intron content of a gene (contains an intron (number of introns >0) or contains a 'lot' of introns (number of introns >2)). C-hunter  software was used to identify, in all genomes, significant clusters of genes belonging to one of the four functional categories. The C-hunter thresholds for each category subdivision were determined by reviewing the average expression and intron content of all Mamiellales genes. C-hunter was run with the following parameters: <C-hunter categories.go genome.index genome.go 2 80 80 0.001 50 T chunter output>.
Sequence data from this article (the genome of B. prasinos RCC1105) can be found in the EMBL/GenBank data libraries under accession number [FO082258] (mitochondrion), [FO082259] (chloroplast), [FO082278] (chromosome 1), [FO082277] (chromosome 2), [FO082276] (chromosome 3), [FO082275] (chromosome 4), [FO082274] (chromosome 5), [FO082273] (chromosome 6), [FO082272] (chromosome 7), [FO082271] (chromosome 8), [FO082270] (chromosome 9), [FO082269] (chromosome 10), [FO082268] (chromosome 11), [FO082267] (chromosome 12), [FO082266] (chromosome 13), [FO082265] (chromosome 14), [FO082264] (chromosome 15), [FO082263] (chromosome 16), [FO082262] (chromosome 17), [FO082261] (chromosome 18), [FO082260] (chromosome19). The annotation of the genome can be found at the BOGAS web site . EST data are available at the ENA database (accession number ERA148021) and raw genome sequencing data are available at the Trace archive database of the NCBI under the query: species_code="BATHYCOCCUS SP. BAN7".
big outlier chromosome
Center for Culture of Marine Phytoplankton
expressed sequence tag
horizontal gene transfer
National Center for Biotechnology Information
Roscoff Culture Collection
small outlier chromosome.
We are very grateful to L Fibla and L Subirana for technical help, R Cooke and M Laudié for sequencing and S Proost for technical assistance during the colinearity analysis. Sequencing of the Bathycoccus genome has been supported by the Genoscope (Evry, France) and this research was supported by the CNRS and the University Pierre et Marie Curie (UMR 7232) and the Agence Nationale de la Recherche grants PHYTADAPT n° NT09_567009 and Tara-Girus 09-PCS-GENM-218. We thank Igor Grigoriev, Brian Palenik and the Joint Genome Institute for the access to the Ostreococcus strain RCC809 genome. BV, KV and YVdP acknowledge the support of Ghent University (Multidisciplinary Research Partnership 'Bioinformatics: from nucleotides to networks') and the Interuniversity Attraction Poles Programme (IUAP P6/25), initiated by the Belgian State, Science Policy Office (BioMaGNet).
- Field BC, Behrenfeld MJ, Randerson JT, Falkowski P: Primary production of the biosphere: integrating terrestrial and oceanic components. Science. 1998, 281: 237-240.PubMedView ArticleGoogle Scholar
- Boyce DG, Lewis MR, Worm B: Global phytoplankton decline over the past century. Nature. 2010, 466: 591-596. 10.1038/nature09268.PubMedView ArticleGoogle Scholar
- Li WKW: Primary productivity of prochlorophytes cyanobacteria, and eucaryotic ultraphytoplankton: measurements from flow cytometric sorting. Limnol Oceanogr. 1994, 39: 169-175. 10.4319/lo.1994.39.1.0169.View ArticleGoogle Scholar
- Worden AZ, Nolan JK, Palenik B: Assessing the dynamics and ecology of marine picophytoplankton: the importance of the eukaryotic component. Limnol Oceanogr. 2004, 49: 168-179. 10.4319/lo.2004.49.1.0168.View ArticleGoogle Scholar
- Knight-Jones EW, Walne PR: Chromulina pusilla Butcher; a dominant member of the ultraplankton. Nature. 1951, 167: 445-10.1038/167445a0.PubMedView ArticleGoogle Scholar
- Courties C, Vaquer A, Trousselier M, Lautier J, Chrétiennot-Dinet MJ, Neveux J, Machado MC, Claustre H: Smallest eukaryotic organism. Nature. 1994, 370: 255-View ArticleGoogle Scholar
- Chrétiennot-Dinet MJ, Courties C, Vaquer A, Neveux J, Claustre H, Lautier J, Machado MC: A new marine picoeukaryote Ostreococcus tauri gen et sp. nov (Chlorophyta, Prasinophyceae). Phycologia. 1995, 4: 285-292.View ArticleGoogle Scholar
- Eikrem W, Throndsen J: The ultrastructure of Bathycoccus gen nov and B. prasinos sp Nov, a non-motile picoplanktonic alga (Chlorophyta, Prasinophyceae) from the Mediterranean and Atlantic. Phycologia. 1990, 29: 344-350. 10.2216/i0031-8884-29-3-344.1.View ArticleGoogle Scholar
- Worden AZ, Lee JH, Mock T, Rouzé P, Simmons MP, Aerts AL, Allen AE, Cuvelier ML, Derelle E, Everett MV, Foulon E, Grimwood J, Gundlach H, Henrissat B, Napoli C, McDonald SM, Parker MS, Rombauts S, Salamov A, Von Dassow P, Badger JH, Coutinho PM, Demir E, Dubchak I, Gentemann C, Eikrem W, Gready JE, John U, Lanier W, Lindquist EA, Lucas S, et al: Green evolution and dynamic adaptations revealed by genomes of the marine picoeukaryotes Micromonas. Science. 2009, 324: 268-272. 10.1126/science.1167222.PubMedView ArticleGoogle Scholar
- Derelle E, Ferraz C, Rombauts S, Rouzé P, Worden AZ, Robbens S, Partensky F, Degroeve S, Echeynié S, Cooke R, Saeys Y, Wuyts J, Jabbari K, Bowler C, Panaud O, Piégu B, Ball SG, Ral JP, Bouget FY, Piganeau G, De Baets B, Picard A, Delseny M, Demaille J, Van de Peer Y, Moreau H: Genome analysis of the smallest free-living eukaryote Ostreococcus tauri unveils many unique features. Proc Natl Acad Sci USA. 2006, 103: 11647-11652. 10.1073/pnas.0604795103.PubMedPubMed CentralView ArticleGoogle Scholar
- Palenik B, Grimwood J, Aerts A, Rouzé P, Salamov A, Putnam N, Dupont C, Jorgensen R, Derelle E, Rombauts S, Zhou K, Otillar R, Merchant SS, Podell S, Gaasterland T, Napoli C, Gendler K, Manuell A, Tai V, Vallon O, Piganeau G, Jancek S, Heijde M, Jabbari K, Bowler C, Lohr M, Robbens S, Werner G, Dubchak I, Pazour GJ, Ren Q, Paulsen I, et al: The tiny eukaryote Ostreococcus provides genomic insights into the paradox of plankton speciation. Proc Natl Acad Sci USA. 2007, 104: 7705-7710. 10.1073/pnas.0611046104.PubMedPubMed CentralView ArticleGoogle Scholar
- Guillou L, Eikrem W, Chrétiennot-Dinet MJ, Le Gall F, Massana R, Romari K, Pedrós-Alió C, Vaulot D: Diversity of picoplanktonic prasinophytes assessed by direct nuclear SSU rDNA sequencing of environmental samples and novel isolates retrieved from oceanic and coastal marine ecosystems. Protist. 2004, 155: 193-214. 10.1078/143446104774199592.PubMedView ArticleGoogle Scholar
- Marin B, Melkonian M: Molecular phylogeny and classification of the Mamiellophyceae class nov (Chlorophyta) based on sequence comparisons of the nuclear- and plastid-encoded rRNA operons. Protist. 2010, 161: 304-336. 10.1016/j.protis.2009.10.002.PubMedView ArticleGoogle Scholar
- Peers GK, Niyogi K: Pond scum genomics: The genomes of Chlamydomonas and Ostreococcus. Plant Cell. 2008, 20: 502-507. 10.1105/tpc.107.056556.PubMedPubMed CentralView ArticleGoogle Scholar
- Johnson PW, McSieburth J: In-situ morphology and occurrence of eukaryotic phototrophs of bacterial size in the picoplankton of estuarine and oceanic waters. J Phycol. 1982, 18: 318-327. 10.1111/j.1529-8817.1982.tb03190.x.View ArticleGoogle Scholar
- Marie D, Zhu F, Balagué V, Ras J, Vaulot D: Eukaryotic picoplankton communities of the Mediterranean Sea in summer assessed by molecular approaches (DGGE, TTGE, QPCR). FEMS Microbiol Ecol. 2006, 55: 403-415. 10.1111/j.1574-6941.2005.00058.x.PubMedView ArticleGoogle Scholar
- Monier A, Welsh RM, Gentemann C, Weinstock G, Sodergren E, Armbrust EV, Eisen JA, Worden AZ: Phosphate transporters in marine phytoplankton and their viruses: cross-domain commonalities in viral-host gene exchanges. Environ Microbiol. 2011, 14: 162-176.PubMedView ArticleGoogle Scholar
- Treusch AH, Demir-Hilton E, Vergin KL, Worden AZ, Carlson CA, Donatz MG, Burton RM, Giovannoni SJ: Phytoplankton distribution patterns in the northwestern Sargasso Sea revealed by small subunit rRNA genes from plastids. ISME J. 2011, 6: 481-492.PubMedPubMed CentralView ArticleGoogle Scholar
- Cheung MK, Au CH, Chu KH, Kwan HS, Wong CK: Composition and genetic diversity of picoeukaryotes in subtropical coastal waters as revealed by 454 pyrosequencing. ISME J. 2010, 4: 1053-1059. 10.1038/ismej.2010.26.PubMedView ArticleGoogle Scholar
- Marie D, Shi X L, Rigaut-Jalabert F, Vaulot D: Use of flow cytometric sorting to better assess the diversity of small photosynthetic eukaryotes in the English Channel. FEMS Microbiol Ecol. 2010, 72: 165-178. 10.1111/j.1574-6941.2010.00842.x.PubMedView ArticleGoogle Scholar
- Massana R, Balagué V, Guillou L, Pedrós-Alió C: Picoeukaryotic diversity in an oligotrophic coastal site studied by molecular and culturing approaches. FEMS Microbiol Ecol. 2004, 50: 231-243. 10.1016/j.femsec.2004.07.001.PubMedView ArticleGoogle Scholar
- Haugen PD, Simon D, Bhattacharya D: The natural history of group I introns. Trends Genet. 2005, 21: 111-119. 10.1016/j.tig.2004.12.007.PubMedView ArticleGoogle Scholar
- Piganeau G, Grimsley N, Moreau H: Genome diversity in the smallest marine photosynthetic eukaryotes. Res Microbiol. 2011, 162: 570-577. 10.1016/j.resmic.2011.04.005.PubMedView ArticleGoogle Scholar
- Blanc G, Duncan G, Agarkova I, Borodovsky M, Gurnon J, Kuo A, Lindquist E, Lucas S, Pangilinan J, Polle J, Salamov A, Terry A, Yamada T, Dunigan DD, Grigoriev IV, Claverie JM, Van Etten JL: The Chlorella variabilis NC64A genome reveals adaptation to photosymbiosis, coevolution with viruses, and cryptic sex. Plant Cell. 2010, 22: 2943-2955. 10.1105/tpc.110.076406.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhang X: The epigenetic landscape of plants. Science. 2008, 320: 489-492. 10.1126/science.1153996.PubMedView ArticleGoogle Scholar
- Wong S, Wolfe KH: Birth of a metabolic gene cluster in yeast by adaptive gene relocation. Nat Genet. 2005, 37: 777-782. 10.1038/ng1584.PubMedView ArticleGoogle Scholar
- Lee SC, Ni M, Li W, Shertz C, Heitman J: The evolution of sex: a perspective from the fungal kingdom. Microbiol Mol Biol Rev. 2010, 74: 298-340. 10.1128/MMBR.00005-10.PubMedPubMed CentralView ArticleGoogle Scholar
- Soo Chan Lee SC, Ni M, Li W, Shertz C, Heitman J: The evolution of sex: a perspective from the fungal kingdom. Microbiol Mol Biol Rev. 2010, 74: 298-340. 10.1128/MMBR.00005-10.PubMedPubMed CentralView ArticleGoogle Scholar
- Jancek S, Gourbiere S, Moreau H, Piganeau G: Clues about the genetic basis of adaptation emerge from comparing the proteomes of two Ostreococcus ecotypes (Chlorophyta, Prasinophyceae). Mol Biol Evol. 2008, 25: 2293-2300. 10.1093/molbev/msn168.PubMedView ArticleGoogle Scholar
- Avrani S, Wurtzel O, Sharon I, Sorek R, Lindell D: Genomic island variability facilitates Prochlorococcus-virus coexistence. Nature. 2011, 474: 604-608. 10.1038/nature10172.PubMedView ArticleGoogle Scholar
- Bellec L, Grimsley N, Derelle E, Moreau H, Desdevises Y: Abundance, spatial distribution and genetic diversity of Ostreococcus tauri viruses in two different environments. Env Microbiol Reports. 2010, 2: 313-321. 10.1111/j.1758-2229.2010.00138.x.View ArticleGoogle Scholar
- Thomas R, Grimsley N, Escande ML, Subirana L, Derelle E, Moreau H: Acquisition and maintenance of resistance to viruses in eukaryotic phytoplankton populations. Env Microbiol. 2011, 13: 1412-1420. 10.1111/j.1462-2920.2011.02441.x.View ArticleGoogle Scholar
- Ragan MA, Harlow TJ, Beiko RG: Do different surrogate methods detect lateral genetic transfer events of different relative ages?. Trends Microbiol. 2006, 14: 4-8. 10.1016/j.tim.2005.11.004.PubMedView ArticleGoogle Scholar
- Kurland CG, Canback B, Berg OG: Horizontal gene transfer: a critical view. Proc Natl Acad Sci USA. 2003, 100: 9658-9662. 10.1073/pnas.1632870100.PubMedPubMed CentralView ArticleGoogle Scholar
- Keeling PJ, Palmer JD: Horizontal gene transfer in eukaryotic evolution. Nat Rev Genet. 2008, 9: 605-618. 10.1038/nrg2386.PubMedView ArticleGoogle Scholar
- Bowler C, Allen AE, Badger JH, Grimwood J, Jabbari K, Kuo A, Maheswari U, Martens C, Maumus F, Otillar RP, Rayko E, Salamov A, Vandepoele K, Beszteri B, Gruber A, Heijde M, Katinka M, Mock T, Valentin K, Verret F, Berges JA, Brownlee C, Cadoret JP, Chiovitti A, Choi CJ, Coesel S, De Martino A, Detter JC, Durkin C, Falciatore A, Fournet J, et al: The Phaeodactylum genome reveals the evolutionary history of diatom genomes. Nature. 2008, 456: 239-244. 10.1038/nature07410.PubMedView ArticleGoogle Scholar
- Moustafa A, Beszteri B, Maier UG, Bowler C, Valentin K, Bhattacharya D: Genomic footprints of a cryptic plastid endosymbiosis in diatoms. Science. 2009, 324: 1724-1726. 10.1126/science.1172983.PubMedView ArticleGoogle Scholar
- Raymond JA, Kim HJ: Possible role of horizontal gene transfer in the colonization of sea ice by algae. PLoS One. 2012, 7: e35968-10.1371/journal.pone.0035968.PubMedPubMed CentralView ArticleGoogle Scholar
- Derelle E, Ferraz C, Escande ML, Eychenie S, Cooke R, Piganeau G, Desdevises Y, Bellec L, Moreau H, Grimsley N: Life-cycle and genome of OtV5, a large DNA virus of the pelagic marine unicellular green alga Ostreococcus tauri. PLoS One. 2008, 3: e2250-10.1371/journal.pone.0002250.PubMedPubMed CentralView ArticleGoogle Scholar
- Moreau H, Piganeau G, Desdevises Y, Cooke R, Derelle E, Grimsley N: Marine prasinovirus genomes show low evolutionary divergence and acquisition of protein metabolism genes by horizontal gene transfer. J Virol. 2010, 84: 12555-12563. 10.1128/JVI.01123-10.PubMedPubMed CentralView ArticleGoogle Scholar
- Monier A, Pagarete A, de Vargas C, Allen MJ, Read B, Claverie JM, Ogata H: Horizontal gene transfer of an entire metabolic pathway between a eukaryotic alga and its DNA virus. Genome Res. 2009, 19: 1441-1449. 10.1101/gr.091686.109.PubMedPubMed CentralView ArticleGoogle Scholar
- Cantarel BL, Coutinho PM, Rancurel C, Bernard T, Lombard V, Henrissat B: The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucleic Acids Res. 2009, D233-238. 37 Database
- Jeanneau C, Chazalet V, Augé C, Soumpasis DM, Harduin-Lepers A, Delannoy P, Imberty A, Breton C: Structure-function analysis of the human sialyltransferase ST3GalI. J Biol Chem. 2004, 279: 13461-13468. 10.1074/jbc.M311764200.PubMedView ArticleGoogle Scholar
- Harduin-Lepers A, Mollicone R, Delannoy P, Oriol R: The anormal sialyltransferase-related genes: a phylogenetic approach. Glycobiology. 2005, 15: 805-817. 10.1093/glycob/cwi063.PubMedView ArticleGoogle Scholar
- Harduin-Lepers A, Mollicone R, Delannoy P, Oriol R: The animal sialyltransferases and sialyltransferase-related genes: a phylogenetic approach. Glycobiology. 2005, 15: 805-817. 10.1093/glycob/cwi063. A published erratum appears in Glycobiology 2005, 15:21GPubMedView ArticleGoogle Scholar
- Melkonian M, Preisig HR: A light and electron microscopic study of Scherffelia dubia, a new member of the scaly green flagellates (Prasinophyceae). Nord J Bot. 1986, 6: 235-256. 10.1111/j.1756-1051.1986.tb00876.x.View ArticleGoogle Scholar
- Moestrup O, Walne PL: Studies on scale morphogenesis in the Golgi apparatus of Pyramimonas tetrarhynchus (Prasinophyceae). J Cell Sci. 1979, 36: 437-459.PubMedGoogle Scholar
- Moestrup O: Scale structure in Mantoniella squamata, with some comments on the phylogeny of the Prasinophyceae (Chlorophyta). Phycologia. 1990, 29: 437-442. 10.2216/i0031-8884-29-4-437.1.View ArticleGoogle Scholar
- Melkonian M, Becker B, Becker D: Scale formation in algae. J Electron Microscopy Technique. 1991, 17: 165-178. 10.1002/jemt.1060170205.View ArticleGoogle Scholar
- Becker D, Melkonian M: N-linked glycoproteins associated with flagellar scales in a flagellate green alga: characterization of interactions. Eur J Cell Biol. 1992, 57: 109-116.PubMedGoogle Scholar
- Becker B: Anterograde transport of algal scales through the Golgi complex is not mediated by vesicles. Trends Cell Biol. 1995, 5: 305-307. 10.1016/S0962-8924(00)89047-9.PubMedView ArticleGoogle Scholar
- Al-Khodor S, Price CT, Kalia A, Kwaik A: Functional diversity of ankyrin repeats in microbial proteins. Trends Microbiol. 2010, 18: 132-139. 10.1016/j.tim.2009.11.004.PubMedPubMed CentralView ArticleGoogle Scholar
- Brayer KJ, Segal DJ: Keep your fingers off my DNA: protein-protein interactions mediated by C2H2 zinc finger domains. Cell Biochem Biophys. 2008, 50: 111-131. 10.1007/s12013-008-9008-5.PubMedView ArticleGoogle Scholar
- Winnepenninckx B, Backeljau T, De Wachter R: Extraction of high molecular weight DNA from molluscs. Trends Genet. 1993, 9: 407-PubMedView ArticleGoogle Scholar
- Batzoglou S, Jaffe DB, Stanley K, Butler J, Gnerre S, Mauceli E, Berger B, Mesirov JP, Lander ES: ARACHNE: a whole-genome shotgun assembler. Genome Res. 2002, 12: 177-189.52. 10.1101/gr.208902.PubMedPubMed CentralView ArticleGoogle Scholar
- Schiex T, Moisan A, Rouzé P: EUGÉNE: an eukaryotic gene finder that combines several sources of evidence. Lect Notes Comput Sci. 2001, 2066: 111-125. 10.1007/3-540-45727-5_10.View ArticleGoogle Scholar
- Foissac S, Gouzy J, Rombauts S, Mathé C, Amselem J, Sterck L, Van de Peer Y, Rouzé P, Schiex T: Genome annotation in plants and fungi: EuGène as a model platform. Curr Bioinformatics. 2008, 3: 87-97. 10.2174/157489308784340702.View ArticleGoogle Scholar
- Degroeve S, Saeys Y, De Baets B, Rouzé P, Van de Peer Y: SpliceMachine: predicting splice sites from high-dimensional local context representations. Bioinformatics. 2005, 21: 1332-1338. 10.1093/bioinformatics/bti166.PubMedView ArticleGoogle Scholar
- Genome Sequences and Annotations at Ghent University. [http://bioinformatics.psb.ugent.be/genomes]
- Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-33402. 10.1093/nar/25.17.3389.PubMedPubMed CentralView ArticleGoogle Scholar
- Proost S, Fostier J, De Witte D, Dhoedt B, Demeester P, Van de Peer Y, Vandepoele K: i-ADHoRe 3 0 - fast and sensitive detection of genomic homology in extremely large data sets. Nucleic Acids Res. 2012, 40: e11-10.1093/nar/gkr955.PubMedPubMed CentralView ArticleGoogle Scholar
- pico-PLAZA: an integrative resource for cross-species genome analysis in algae. [http://bioinformatics.psb.ugent.be/pico-plaza/]
- Gremme G, Brendel V, Sparks ME, Kurtz S: Engineering a software tool for gene structure prediction in higher organisms. Information Software Technol. 2005, 47: 965-978. 10.1016/j.infsof.2005.09.005.View ArticleGoogle Scholar
- Rogers MB, Watkins RF, Harper JT, Durnford DG, Gray MW, Keeling PJ: A complex and punctate distribution of three eukaryotic genes derived by lateral gene transfer. BMC Evol Biol. 2007, 7: 89-10.1186/1471-2148-7-89.PubMedPubMed CentralView ArticleGoogle Scholar
- Yi G, Sze SH, Thon MR: Identifying clusters of functionally related genes in genomes. Bioinformatics. 2007, 23: 1053-1060. 10.1093/bioinformatics/btl673.PubMedView ArticleGoogle Scholar
- Bathycoccus Genome Annotation Database at Ghent University. [http://bioinformatics.psb.ugent.be/webtools/bogas/]
- Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA: Circos: an information aesthetic for comparative genomics. Genome Res. 2009, 19: 1639-1645. 10.1101/gr.092759.109.PubMedPubMed CentralView ArticleGoogle Scholar
- Merchant SS, Prochnik SE, Vallon O, Harris EH, Karpowicz SJ, Witman GB, Terry A, Salamov A, Fritz-Laylin LK, Maréchal-Drouard L, Marshall WF, Qu LH, Nelson DR, Sanderfoot AA, Spalding MH, Kapitonov VV, Ren Q, Ferris P, Lindquist E, Shapiro H, Lucas SM, Grimwood J, Schmutz J, Cardol P, Cerutti H, Chanfreau G, Chen CL, Cognat V, Croft MT, Dent R, Dutcher S, et al: The Chlamydomonas genome reveals the evolution of key animal and plant functions. Science. 2007, 318: 245-250. 10.1126/science.1143609.PubMedPubMed CentralView ArticleGoogle Scholar
- Prochnik SE, Umen J, Nedelcu AM, Hallmann A, Miller SM, Nishii I, Ferris P, Kuo A, Mitros T, Fritz-Laylin LK, Hellsten U, Chapman J, Simakov O, Rensing SA, Terry A, Pangilinan J, Kapitonov V, Jurka J, Salamov A, Shapiro H, Schmutz J, Grimwood J, Lindquist E, Lucas S, Grigoriev IV, Schmitt R, Kirk D, Rokhsar DS: Genomic analysis of organismal complexity in the multicellular green alga Volvox carteri. Science. 2010, 329: 223-226. 10.1126/science.1188800.PubMedPubMed CentralView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.