New evolutionary frontiers from unusual virus genomes
© BioMed Central Ltd 2005
Published: 2 March 2005
The sequences of two giant viral genomes, Mimivirus and a polydnavirus, have recently been published. Mimivirus has the largest known viral genome and encodes an unprecedented number of proteins, whereas the polydnavirus genome has an extremely low coding density and does not encode DNA-replication proteins. These and other unusual features challenge the way we view the evolution and definition of viruses.
If an alien landed on Earth and studied the biology here, it might justifiably conclude that viruses run the planet. They are numerically the most abundant biological entities, and they are profoundly important in shaping the ecology and evolution of just about every species on Earth. Yet viruses are not considered to be alive by most biologists, and they have arguably fallen by the wayside in the genomics revolution. The recent publication of the genome sequences of two unusual viruses, however, highlights the wealth of information that remains to be discovered through viral genomics. Here, we discuss Mimivirus and Cotesia congregata Bracovirus (CcBV) and the interesting questions they raise concerning the biology and evolution of viruses.
Table 1. Characteristics of Mimivirus and CcBV and their genomes
Column headings: CcBV (in caterpillar host); CcBV (in wasp host)
Row labels: Genome size (base-pairs); G + C composition (%); Coding density (%); Number of genes; Genes containing introns; Genes with assigned function; Obligate intracellular parasite; Viral DNA replication in host; Virion assembly in host; Transmission via virion; Viral gene expression dependent on host cellular machinery
Only surviving cell value: 30 closed circles
CcBV differs from Mimivirus and other viruses in many fundamental aspects. As a member of the Polydnaviridae, the transmission and replication cycle of this Bracovirus is unconventional. The Polydnaviridae - pronounced polyd-na-viridae by the research community and named after the unique segmented structure of the packaged genome - consists of two subgroups, Bracoviruses and Ichnoviruses, which associate with braconid and ichneumonid wasps, respectively. These wasps are parasitoids (parasites that kill their hosts) that attack caterpillars and are of particular interest for their use as biological control agents. In the wasp host, polydnaviruses exist in a benign state, integrated into the wasp genome as a provirus. Amplification of segments from the provirus and production of virions (particles containing viral DNA encased within a capsid) occur only in the ovaries of a female wasp, and virions are co-injected with eggs during parasitization of caterpillars. The viral particles are replication-deficient in both hosts; the virus can increase in number only through genome amplification in wasp ovaries but is transmitted from wasp to wasp by vertical transmission of the provirus. Viral gene expression in caterpillars interferes with the latter's immune response and developmental cycle, promoting survival of the parasitoid and therefore of the provirus. Thus, polydnaviruses depend on vertical transmission in a tripartite relationship that includes both mutual and parasitic symbioses.
The genome of CcBV - whose wasp host is C. congregata - totals 568 kilobase-pairs (kbp) and is composed of 30 circles ranging in size from about 5 kbp to 40 kbp. Although the cumulative genome size of CcBV would place it in the category of a giant virus, segments appear to be packed into individual capsids, with several capsids being enveloped by a single membrane (Figure 1b). In contrast to the high coding density of most viruses, the CcBV genome encodes very few proteins, and the smallest segment consists entirely of non-coding DNA. Almost 70% of the protein-coding genes are predicted to contain introns dependent on spliceosomal excision; it is unusual for viruses to have introns. This high rate of intron prediction remains to be confirmed by cDNA sequence data, however. About 40% of the proteins with assigned functions fall into four gene families: protein tyrosine phosphatases, inhibitors of NF-κB, cystatins and cysteine-rich proteins; these proteins may modulate the responses of lepidopteran caterpillars to infection. In contrast with Mimivirus and other viruses, the CcBV genome contains very few recognizable homologs of components of the transcriptional, translational and replication machinery, although it does encode a homolog of the chromatin protein histone H4.
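To make the notion of coding density concrete, the following is a minimal Python sketch, not the method used in the genome paper. It assumes coding density means the percentage of genome positions covered by at least one protein-coding interval. The function name, the half-open interval convention and the toy intervals are all hypothetical; only the 568 kbp total and the 30 segments come from the text above. Genes that span the origin of a circular segment are not handled.

```python
def coding_density(genome_length, coding_intervals):
    """Return the percentage of the genome covered by coding intervals.

    Intervals are (start, end) pairs, 0-based and half-open (an assumed
    convention). Overlapping intervals are counted only once.
    """
    covered = 0
    current_end = 0  # right edge of the region already counted
    for start, end in sorted(coding_intervals):
        start = max(start, current_end)  # clip the part already counted
        if end > start:
            covered += end - start
            current_end = end
    return 100.0 * covered / genome_length


# Toy segment of 1,000 bp with three made-up genes, two of which overlap.
toy_density = coding_density(1000, [(0, 100), (50, 200), (500, 600)])
print(f"toy coding density: {toy_density:.1f}%")

# Figures from the text: 568 kbp spread over 30 circular segments.
mean_segment_kbp = 568 / 30
print(f"mean CcBV segment size: {mean_segment_kbp:.1f} kbp")
```

The mean segment size of roughly 19 kbp sits within the reported range of about 5 kbp to 40 kbp.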
Mimivirus is the sole member of Mimiviridae and is classified as a nucleocytoplasmic large DNA virus (NCLDV) on the basis of the presence/absence pattern of 'core' genes defined for NCLDVs. By contrast, the viral origins of polydnaviruses are less certain. It has been hypothesized from virion morphology that bracoviruses may be related to baculoviruses [7, 9], but only three genes in the CcBV genome are similar to genes found in free-replicating viruses, two to a baculovirus and one to an ascovirus. Thus, even though we have a genome sequence, the origin of bracoviruses remains unclear. Many genes that are typical of viruses are absent from the CcBV genome and may have been transferred to the wasp genome, as is the case for a gene coding for a major structural protein in Campoletis sonorensis Ichnovirus.
Inferring viral phylogenies is often difficult, as high rates of viral evolution make it hard to identify conserved genes between viruses. The ultimate origin of viruses - where viruses should be placed on the Tree of Life - is even more vexing. Many theories abound [2, 12–14]: that they evolved before the first cells; that because they infect all domains they arose from cellular life before the last universal common ancestor; or that they evolved from cells at a later point in evolution. In principle, it should be possible to distinguish among these theories through careful genome analysis. For example, if viruses have a separate origin from 'living' organisms, their gene content should overlap very little, if at all, with that of bacteria, archaea and eukaryotes. In contrast, if viruses evolved from a bacterial parasite, their gene content should resemble that of bacteria more than that of archaea, and their genes should branch in evolutionary trees with genes from bacteria, as is the case for organelles.
But there are several complications in determining viral origins. First, it is possible that different types of viruses arose independently. Second, and more confounding, there has unquestionably been gene flow between viruses and their hosts, which means that any one gene might not reflect the phylogeny of the virus itself. The authors of the Mimivirus genome sequence paper try to address this by making a concatenated alignment of multiple genes shared by Mimivirus and living organisms. They report that the viral genes branch as a sister group to the eukaryotes, potentially identifying Mimivirus as the basal member of a major branch on the Tree of Life. But in a separate phylogenetic analysis of the RNA polymerase β' subunit (one of the proteins used in the concatenated analysis and also encoded in other NCLDVs), Mimivirus did not group with the NCLDVs. The ramifications of this conflict are still unclear: is Mimivirus an NCLDV that acquired many of its genes through lateral transfer, although differences in codon usage between viruses and amoebae would indicate otherwise? Is it a sister-group to the eukaryotes and not an NCLDV, despite the presence of NCLDV core genes? Resolving such questions will have to wait until other members or close relatives of Mimiviridae are discovered and their genome sequences analyzed.
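The idea of a concatenated alignment can be illustrated with a short Python sketch. This is a generic illustration, not the pipeline used by the authors of the Mimivirus paper: the function name, the gene and taxon labels and the toy sequences are all hypothetical. It assumes each gene has already been aligned separately, and it fills in gap characters for any taxon that lacks a given gene so that every row of the combined matrix has the same length.

```python
def concatenate_alignments(gene_alignments, gap="-"):
    """Join per-gene alignments into one combined row per taxon.

    gene_alignments maps gene name -> {taxon: aligned sequence}.
    A taxon missing from a gene is padded with gap characters.
    """
    taxa = sorted({t for aln in gene_alignments.values() for t in aln})
    rows = {t: [] for t in taxa}
    for gene, aln in gene_alignments.items():
        lengths = {len(seq) for seq in aln.values()}
        if len(lengths) != 1:
            raise ValueError(f"{gene}: sequences are not aligned to equal length")
        width = lengths.pop()
        for t in taxa:
            rows[t].append(aln.get(t, gap * width))
    return {t: "".join(parts) for t, parts in rows.items()}


# Made-up protein fragments for illustration only.
toy = {
    "geneA": {"virus": "MKT-L", "eukaryote": "MKTAL", "bacterium": "MRTAL"},
    "geneB": {"virus": "GGS", "eukaryote": "GAS"},
}
supermatrix = concatenate_alignments(toy)
for taxon, seq in supermatrix.items():
    print(taxon, seq)
```

A tree built from the combined matrix averages the signal of all included genes, which dampens the effect of any single laterally transferred gene but does not remove it. That is one reason a single-gene tree, such as the RNA polymerase β' analysis mentioned above, can still disagree with the concatenated result.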
As CcBV has very few features associated with other viruses, and its coding content and gene structure resemble those of the wasp host more than those of viruses, Espagne et al. raise questions about the viral ancestry of bracoviruses. They propose that bracoviruses may have evolved from mobile DNA that acquired the ability to be packaged into capsids (perhaps from a virus); the existence of remnants of transposon and retrovirus-like elements in the CcBV genome provides additional support for such an argument. Genome sequencing and phylogenetic analysis of additional polydnaviruses, their proviral sequences, and genes associated with virion formation will be required to shed light on the question of whether bracoviruses are the product of reductive viral evolution or not.
Mimivirus expands our definition of viruses quantitatively to accommodate bigger genomes and larger particle size. Although Raoult et al. point out that Mimivirus has more components of the cellular machinery than any other virus, in our opinion it does not yet seem to stretch the definition of viruses in any fundamental way. It is just a more complicated virus. By contrast, CcBV appears to differ qualitatively from many definitions of a virus (Table 1), but it could still be classified as a highly defective one. Clearly, these two viruses present some interesting problems regarding viral phylogeny and classification that remain to be resolved. Considering the importance of viruses in evolution, we believe that we need to direct more effort to systematically characterize the genomes and biology of diverse viruses, as this will further our understanding of how and where they fit into the Tree of Life.