Skip to main content

The origins and genomic diversity of American Civil War Era smallpox vaccine strains


Vaccination has transformed public health, most notably including the eradication of smallpox. Despite its profound historical importance, little is known of the origins and diversity of the viruses used in smallpox vaccination. Prior to the twentieth century, the method, source and origin of smallpox vaccinations remained unstandardised and opaque. We reconstruct and analyse viral vaccine genomes associated with smallpox vaccination from historical artefacts. Significantly, we recover viral molecules through non-destructive sampling of historical materials lacking signs of biological residues. We use the authenticated ancient genomes to reveal the evolutionary relationships of smallpox vaccination viruses within the poxviruses as a whole.


Smallpox epidemics were caused by variola virus (VARV), a human-specific member of the Orthopoxvirus (OPXV) genus of the Poxviridae, and resulted in high mortality and morbidity with survivors frequently disabled or disfigured [1,2,3]. Smallpox remains the only human infectious disease eradicated, a global accomplishment achieved through widespread coordinated vaccination [2, 3]. Despite these profound public health benefits, the origins and diversity of the viruses used in the early vaccination programs remain uncertain. The World Health Organization’s success in eradicating smallpox using vaccinia virus (VACV) (1980) was in part due to the broad protective immunity induced by infection with one OPXV against subsequent infection by another.

The lack of standardisation in vaccination practices and propagation throughout most of its history means that historical vaccine strains may be any one of several OPXVs. On the basis of Edward Jenner’s work [4], cowpox virus (CPXV) was assumed to have been involved in historical vaccination, although horsepox virus (HSPV) and ‘equination’ are also cited [2, 4,5,6,7]. Both are thought to produce comparatively self-limiting infections in humans with negligible mortality rates [1, 8]. However, ‘cowpox’ and ‘horsepox’ are likely misnomers, for neither cows nor horses are considered the natural reservoirs of these viruses, and the absence of endemic CPXV or HSPV outside of Europe suggests geographically restricted hosts [9,10,11]. In 1939, it was recognised that the smallpox vaccine strains being used in the twentieth century were distinct from CPXV [12, 13] and these VACV strains had become the predominant smallpox vaccines [2, 14,15,16,17,18]. However, both the origin of VACV and its natural host or reservoir are also unknown [19].

Vaccination ‘kits’ and their biological contents (scabs, lymph) provide evidence of early vaccination methods and materials and remain in medical collections/archives across the globe. Kits found in collections relating to the American Civil War correspond to a time of known medical crisis and intervention to prevent smallpox outbreaks [20,21,22,23].

To better characterise the origins of smallpox vaccination, we investigated the origin, diversity and propagation of early smallpox vaccine strains by extracting and sequencing total DNA and analysing both the viriome and metagenome from these kits. The results reported herein are an attempt to begin to survey viruses that were in use for smallpox vaccination and circulating in Philadelphia in the mid-to-late nineteenth century, during or just after the conclusion of the American Civil War.

Results and discussion

We were kindly granted access to five historical kits from the Mütter Museum of the College of Physicians of Philadelphia that date to the mid-to-late nineteenth century (likely circa 1859–1873) and are associated with medical practices of the American Civil War era (Fig. 1a). Of the five vaccination kits, four were leather roll-ups containing one or two folding lancets, small glass plates for mixing lymph (fluid collected from blisters of infected patients [23]), and tin boxes with sliding lids to contain scab (or crust) material (Fig. 1a). The fifth kit only contained ‘The Automatic Vaccinator’, a tool designed for use with lymph or scabs smeared into a mixture on glass plates. Museum records, donor history, and manufacturer data regarding the kits’ contents were used to determine date ranges (Additional File 1: Supplementary Materials and Methods, Fig. 1a, Additional File 2: Table S1). Initial evaluation by the Poxvirus Laboratory at the US Centers for Disease Control and Prevention indicated that there was no presence of VARV but identified OPXV DNA within the materials from three of the kits.

Fig. 1
figure 1

Mütter Museum vaccination kits. a Left panel, Mütter catalogue # 17090.29 representative vaccination kit containing two vaccination lancets, a small metal box to hold scabrous material and glass slides to hold lymph. Upper right panel, scabrous material from Mütter catalogue # MISC-1090, subsamples of this material were used to produce library VK01. Lower right panel, metal box from Mutter catalogue # 17831.42.16, it is internally divided into four quadrants. The lower two quadrants are filled with a thick hardened residue, and a portion of the same residue remains in the upper left quadrant. This hardened substance, which we believe may be dried lymph, was used to produce library VK08. Photos courtesy of the Mütter Museum of The College of Physicians of Philadelphia. b Relative metagenomic composition of the VK01, VK02, VK05, VK08 and VK12 libraries from shotgun sequencing data and c post orthopoxvirus targeted enrichment. Blue portions represent reads classified as eukaryotic in origin, with the proportion specifically identified as human highlighted. Red portions represent reads classified as viral in origin with a distinction between those assigned specifically to VACV and other OPXV. Orange portions represent reads not classified as either eukaryotic or viral. Post blast analysis, assignments were visualised in Krona [24] and simplified as pie charts produced using ggplot2 [25]

The metagenomic profiles of the shotgun libraries generated from these kits were overwhelmingly eukaryotic, with most reads identified as human (~ 80%), yet contained a significant (0.3–2.0%) proportion of reads mapping to VACV with unexpectedly few bacterial DNA reads (Fig. 1b). Post enrichment for OPXV molecules (Additional File 1: Supplementary Materials and Methods), the roles were reversed, with viral reads representing the majority of the molecules sequenced (Fig. 1c, Additional File 2: Table S1). Importantly, both the endogenous human and viral sequences recovered from these historical artefacts have the characteristic signatures of ancient DNA, that is, short fragment lengths and terminal C to T damage (Additional File 2: Table S1, Additional File 3: Fig. S1).

The total human constituent obtained in the shotgun sequence data enabled us to reconstruct the mitochondrial genomes for the human donors from three of our samples VK01, VK02 and VK08 (Additional File 4: Table S2). The haplogroups H1b, T2b4f and U5b1 are most frequently found in west Eurasia, suggesting that the vaccine donors were likely of European ancestry and not African American even though African American children were frequently used for vaccine propagation in the southern states during the American Civil War [23]. Using an algorithm that compares the ratio of the total number of reads that map to both X- and Y-chromosomes [26], we concluded that VK01 and VK02 were clearly derived from female sources (Additional File 4: Table S2). While the algorithm could not assign a definitive sex for VK08, an order of magnitude more reads mapped to the X-chromosome than to the Y-chromosome suggesting a low-level male contamination in that one sample (Additional File 1: Supplementary Materials and Methods, Additional File 4: Table S2). Thus, these three samples suggest that vaccine propagation was still occurring via human to human transfer.

We de novo assembled a nearly complete virus genome (~ 95%) from our shotgun sequencing reads from the VK01 library that had significant read depth when mapped against the VACV Copenhagen strain reference (Additional File 2: Table S1). This draft genome totalled 184,677 bp in length, approximately 95% the length of most VACV genomes though they vary considerably in length due to terminal repetitive motifs. We believe that our reconstructed contig represents the central core of the VK01 strain and one, perhaps partial, of the inverted terminal repeats. Repetitive regions pose serious difficulties for genome reconstruction from aDNA libraries as read lengths are extremely short and prohibit scaffold-building sections that span these regions [27]. Our other shotgun libraries did not have adequate read length and/or depth for successful de novo assembly (Additional File 2: Table S1, Additional File 5: Table S3). We generated consensus sequences for the remaining four samples by mapping both the shotgun and enriched data to our VK01 contig and calling consensus sequences for positions that had at least 10x coverage with variants present at ≥ 0.9 frequency. This methodology produced consensus sequences for VK02, VK05, VK08 and VK12 that are between 97.0 and 99.9% complete at > 10x coverage relative to the VK01 assembly.

To determine the evolutionary relationships of the five Mütter vaccine strains, we placed them within an expansive OPXV phylogeny that includes representative viruses described as CPXV, HSPV, VACV and VARV (Fig. 2). The vaccination kit viruses sit firmly within the VACV clade, indicating that VACV was indeed circulating prior to the twentieth century. To ease concerns that the phylogenetic positioning of our consensus sequences was dictated by the reference, VK01, we repeated the mapping process with an additional three VACV reference sequences and generated new consensus sequences; the positioning of VK01, VK02, VK05, VK08 and VK12, within the larger OPXV tree remained unchanged (Additional File 1: Supplementary Materials and Methods, Additional File 6: Fig. S2). Within the VACV clade, all of the vaccination kit strains recovered from the Mütter collection cluster tightly, differing from the VK01 assembly at 20–352 polymorphic positions, suggesting that there may have been little diversity in vaccination strains circulating amongst Philadelphia physicians at this time (Fig. 2). Notably, these strains group closely with a commercially produced vaccination strain from 1902, also manufactured in Philadelphia [6]. Importantly, that the later vaccine strain was sequenced by another group acts to validate the genomic data obtained here. Interestingly, the strain identified as HSPV, isolated from a horse in an 1976 outbreak in Mongolia [28], also clusters closely with these vaccine strains. Given the age of the sample and its phylogenetic position within the larger VACV group, it is more reasonable to re-classify the 1976 Mongolian HSPV isolate as either a VACV vaccine escape strain or a VACV virus introduced into horses from an unknown animal reservoir. Similar occurrences have been noted previously in both buffalo and rabbit [9,10,11], and this observation fits with the Jenner-era assertion that horsepox was not found outside of Europe [9]. In addition to the paraphyletic nature of strains described as ‘cowpox’, that the ‘horsepox’, ‘buffalopox’ and ‘rabbitpox’ viruses all fall within the VACV clade further demonstrates the imprudence of naming these viruses after the host of isolation (Additional File 7: Fig. S3).

Fig. 2
figure 2

Maximum likelihood phylogenetic analysis of historical vaccine strains in relation to other OPXVs. a Position of the VACV clade and Mütter Museum vaccination strains within a broader OPXV phylogeny rooted using Ectromelia (ECTV) as an outgroup. b Position of the Mütter Museum vaccination strains within the phylogeny of available VACV strains, rooted using VACV Tashkent KM044310 as an outgroup. Nodes with > 95% bootstrap support are indicated with asterisks. All horizontal branch lengths are scaled according to the number of nucleotide substitutions per site

Unfortunately, there is no temporal structure across the OPXV phylogeny, including within the VACV clade, making it impossible to reliably estimate either a rate of nucleotide substitution or divergence times (Additional File 8: Fig. S4). This lack of temporal structure likely reflects the differences in evolutionary rates within different host species [29, 30] and that many of the available genomes have gone through repeated passaging and/or cell culturing after they were originally isolated (Additional File 7: Fig. S3, Additional File 9: Table S4).

Our phylogenetic analysis robustly places the historical vaccination strains within the diversity of viruses labelled ‘vaccinia’. The most closely related strains of VACV include known North American vaccine strains of the twentieth century as well as many strains currently circulating in Brazil, including Cantagalo virus—a virus circulating amongst dairy cows thought to represent a historical escape of smallpox vaccine [31, 32] (Fig. 2, Additional File 7: Fig. S3, Additional File 10: Fig. S5). Although the relationship of the Mütter vaccine strains and the Brazilian and North American vaccine strains varies depending on the inclusion/exclusion of other sequences (Fig. 2), their close relationship to the Cantagalo and IOC strains from Brazil may tentatively suggest a tie to the Beaugency lymph strain of the late nineteenth century that had arrived in the New England area by the 1870s [33].

Historical context

During the American Civil War (1861–1865), Philadelphia functioned as the second largest hospital city in the northern states, after Washington, D.C. The birthplace of American medicine, Philadelphia fostered the creation in 1787 of the College of Physicians of Philadelphia, a professional-fraternal institution to advance medical practice and research, its members having founded the first hospital and medical school in the USA. The 1860s wartime influx of wounded and sick soldiers constituted such a large and growing population of medical patients that army authorities created specialty hospitals centring on the treatment of particular disorders. The work in these hospitals effectively created medical disciplines of cardiology and neurology. Wartime Philadelphia physicians employed state-of-the-art medical technology, much of it furnished by local firms, as the city had become a leading centre for the manufacture of scientific instruments. The College created the Mütter Museum, which opened during the war in 1863 as a repository of artefacts and specimens for teaching and research. The museum began to acquire vaccination material and related tools at this time. During the war, most of the College members served either in the army or as contract physicians and, as such, administered smallpox vaccination according to standard army protocol. Vaccination was required of all military recruits in northern and southern armies. It is not surprising, therefore, that the Mütter Museum possesses vaccination tools and remnants of early vaccine during Philadelphia’s concentration of medical assets during the war and into the 1870s and 1880s.


Within the historical context of American medical practices in the 1860s and 1870s, we note that vaccination was a uniquely human process. Vaccination material was still being produced within humans and transferred directly from donors to patients, a process that changed in the following decades in response to public health concerns over iatrogenic disease spread and the for-profit industrialisation of vaccine production through animals. The similarities between the construction of the kits and their contents, which were not available in catalogues but seemingly constructed through a bespoke manner, suggest that there was a common wisdom in how vaccination was practised in this particular era. As part of this project, we surveyed instrument catalogues available in the USA during the last half of the nineteenth century and found no advertised vaccination kits. Instrument firms, however, advertised that custom-built cases would be created to order. Indeed, the similarity in the virus strains, not only from these five Civil War Era kits, but from the 1902 Mulford’s strain [6], suggests that there may have been a common source for material in the Philadelphia area.

This work highlights the value of research involving historical medical collections, by presenting a novel, non-destructive methodology to recover DNA, thereby preserving these artefacts for continued display and study. Indeed, this project was only feasible as a result of the foresight and meticulous and continued conservation of museum collections by dedicated curators and collection management. The clear identification and reconstruction of near-complete genomes of VACV from these vaccination kits, which were in use during the American Civil War era, indicates that these strains were circulating within humans and via physician networks prior to the twentieth century.

Materials and methods

Five vaccination kits dating to the mid-to-late nineteenth century were found within the collection of the Mütter Museum of the College of Physicians of Philadelphia. The kits were first sent to the US Centers for Disease Control and Prevention in Atlanta, USA, where nine specimens from the five kits were tested with amplification assays to detect VARV [34], contemporary VACV (CDC, unpublished) and generic OPXV sequences [35]. There was no amplification of the VARV-specific or VACV-specific assays. Homogenates of the scabs from Mütter collection # MISC-1090 and # 17090.33 as well as a swab from the glass plates of Mütter collection # 17831.42.16 tested positive for the generic OPXV assay and were further placed into culture: no growth was observed. The kits were then transported to the McMaster Ancient DNA Centre and processed in dedicated clean room facilities through both destructive analysis of organic materials (crusts and lymph) and non-destructive sampling of inorganic materials (lancets, boxes and glass slides). Full details of destructive and non-destructive sampling techniques and sequencing conditions are described in the Supplementary Materials and Methods (Additional File 1).

We attempted to de novo assemble genomes from the pooled shotgun reads of VK01, VK02 and VK08. From the VK01 library, we were able to assemble a 184-kbp contig representing approximately 95% the total length of a VACV (Additional File 5: Table S3). For the remaining libraries, shotgun and enriched datasets were separately mapped with a modified version of BWA ( [36] to the VK01 de novo contig. Mapped reads from separate sequencing runs and the shotgun and enriched libraries were then filtered of PCR duplicates and restricted to a minimum length of 35 bp and a minimum mapping quality of 30. The endocaller program of schmutzi v1.0 (qual -60) was then used to generate consensus base calls [37]. In addition to reconstructing the genomes of VK01, VK02 and VK08, we also produced the mitochondrial genomes of their hosts [37,38,39] and further attempted to determine the sex of the human hosts using an algorithm developed specifically for shotgun sequencing data from ancient human remains [26].

The five Mütter viral genomes were aligned with 76 representative OPXV genomes including ectromelia virus as an outgroup (Additional File 9: Table S4) using MAFFT v7.205 [40]. The resultant alignment (263,227 bp) was cleaned of poorly aligned regions and indels using Gblocks v0.91b [41] and subsequently utilised as the input for a maximum likelihood phylogeny (134,607 bp) using PhyML [42] (Fig. 2). Metadata including host of virus isolation was overlain on the ML phylogeny and visualised using Grapetree [43] (Additional File 7: Fig. S3). The ML phylogeny produced from the data set including the additional 76 OPXV genomes, as well as those restricted to the VACV clade, were used as input along with either year of strain collection (if known) or year of genome sequencing in root-to-tip regressions on the ML trees to determine the extent of temporal structure in the data and hence the level of support for a molecular clock of evolutionary change (Additional File 8: Fig. S4). Complete details of computational methods are further described in the Supplementary Materials and Methods (Additional File 1).

Availability of data and materials

The de novo assembly for VK01 is available on GenBank, accession MN369532 (63). Complete sequencing data are available through the SRA accession PRJNA561155 (64).


  1. Castro R, Casanas B. Orthopoxviruses and human disease. New York: In: Global Virology II-HIV and NeuroAIDS. Springer; 2017. p.689–697.

  2. Fenner F, Henderson DA, Arita I, Jezek Z, Ladnyi ID. Smallpox and its eradication. Vol. 6. World Health Organization Geneva; 1988.

  3. Hopkins DR. The greatest killer: smallpox in history. Chicago: University of Chicago Press; 2002.

  4. Jenner E. An inquiry into the causes and effects of the variolae vaccinae: a disease discovered in some of the western counties of England, particularly Gloucestershire, and known by the name of cow pox. London: Printed for the author by Samson Low, No. 7, Berwick Street, Soho.; 1798.

  5. Esparza J, Schrick L, Damaso CR, Nitsche A. Equination (inoculation of horsepox): an early alternative to vaccination (inoculation of cowpox) and the potential role of horsepox virus in the origin of the smallpox vaccine. Vaccine. 2017;35(52):7222–30.

    Article  CAS  PubMed  Google Scholar 

  6. Schrick L, Tausch SH, Dabrowski PW, Damaso CR, Esparza J, Nitsche A. An early American smallpox vaccine based on horsepox. N Engl J Med. 2017;377(15):1491–2.

    Article  PubMed  Google Scholar 

  7. Martin HA. Report on animal vaccination. Trans Am Med Assoc. 1877;XXVII:187–248.

  8. Funkhouser WK. Edward Jenner and vaccination: the road to elimination of epidemic smallpox. Am Soc Investigative Pathol. 2010;5(3):8–10.

  9. Baxby D, Gaskell C, Gaskell R, Bennett M. Ecology of orthopoxviruses and use of recombinant vaccinia vaccines. Lancet. 1986;328(8511):850–1.

    Article  Google Scholar 

  10. Baxby D, Bennett M, Getty B. Human cowpox 1969–93: a review based on 54 cases. Br J Dermatol. 1994;131(5):598–607.

    Article  CAS  PubMed  Google Scholar 

  11. Esparza J. Has horsepox become extinct? Vet Rec. 2013;173(11):272–3.

    Article  PubMed  Google Scholar 

  12. Downie AW. A study of the lesions produced experimentally by cowpox virus. J Pathol Bacteriol. 1939;48:361–79.

    Article  Google Scholar 

  13. Downie AW. Immunological relationship of the virus of spontaneous cowpox to vaccinia virus. Br J Exp Pathol. 1939;20:158–76.

    PubMed Central  Google Scholar 

  14. Baxby D. Poxvirus hosts and reservoirs. Arch Virol. 1977;55(3):169–79.

    Article  CAS  PubMed  Google Scholar 

  15. Baxby D. The origins of vaccinia virus. J Infectious Dis. 1977;136(3):453–5.

    Article  CAS  Google Scholar 

  16. Qin L, Upton C, Hazes B, Evans DH. Genomic analysis of the vaccinia virus strain variants found in Dryvax vaccine. J Virol. 2011;85(24):13049–60.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Qin L, Favis N, Famulski J, Evans DH. Evolution of and evolutionary relationships between extant vaccinia virus strains. J Virol. 2015;89(3):1809–24.

    Article  PubMed  Google Scholar 

  18. Sánchez-Sampedro L, Perdiguero B, Mejías-Pérez E, García-Arriaza J, Di Pilato M, Esteban M. The evolution of poxvirus vaccines. Viruses. 2015;7(4):1726–803.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Baxby D. Jenner’s smallpox vaccine. The riddle of the origin of vaccinia virus. Heinemann Lond. 1981;12:997–1035.

    Google Scholar 

  20. Harris E. Vaccination in the army. In: Flint A, editor. Contributions relating to the causation and prevention of disease. New York: US Sanitary Commission/Hurd and Houghton; 1867.

    Google Scholar 

  21. United States Durgeon-General’s Office. In: The medical and surgical history of the war of rebellion. Washington, D.C. US Government Printing Office. 1888. Available from:

  22. Jones J. Researches upon “Spurious Vaccination,”: or the abnormal phenomena accompanying and following vaccination in the confederate army during the recent American Civil War, 1861–1865. Vol. 1376. Nashville: University Medical Press; 1867.

  23. Hicks R. Scabrous matters: spurious vaccination in the confederacy. In: Cashin J, editor. Chapel Hill: War matters: material culture of the Civil War Era. University of North Carolina Press; 2018:123–150.

  24. Ondov BD, Bergman NH, Phillippy AM. Interactive metagenomic visualization in a Web browser. BMC Bioinformatics. 2011;12(1):385.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Wickham H. ggplot2: elegant graphics for data analysis. New York: Springer-Verlag; 2016. Available from:

    Book  Google Scholar 

  26. Skoglund P, Storå J, Götherström A, Jakobsson M. Accurate sex identification of ancient human remains using DNA shotgun sequencing. J Archaeol Sci. 2013;40(12):4477–82.

    Article  CAS  Google Scholar 

  27. Devault AM, Mortimer TD, Kitchen A, Kiesewetter H, Enk JM, Golding GB, Southon J, Kuch M, Duggan AT, Aylward W, Gardner SN, Allen JE, King AM, Wright G, Kuroda M, Kato K, Briggs DEG, Fornaciari G, Holmes EC, Poinar HN, Pepperell CS. A molecular portrait of maternal sepsis from Byzantine Troy. Elife. 2017;6:e20983.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Tulman E, Delhon G, Afonso C, Lu Z, Zsak L, Sandybaev N, Kerembekova UZ, Zaitsec VL, Kutish GF, Rock DL. Genome of horsepox virus. J Virol. 2006;80(18):9244–58.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Alcamí A, Smith GL. Receptors for gamma-interferon encoded by poxviruses: implications for the unknown origin of vaccinia virus. Trends Microbiol. 1996;4(8):321–6.

    Article  PubMed  Google Scholar 

  30. Gubser C, Hue S, Kellam P, Smith GL. Poxvirus genomes: a phylogenetic analysis. J Gen Virol. 2004;85(1):105–17.

    Article  CAS  PubMed  Google Scholar 

  31. Damaso CR, Esposito JJ, Condit RC, Moussatché N. An emergent poxvirus from humans and cattle in Rio de Janeiro State: Cantagalo virus may derive from Brazilian smallpox vaccine. Virology. 2000;277(2):439–49.

    Article  CAS  PubMed  Google Scholar 

  32. Medaglia MLG, Moussatché N, Nitsche A, Dabrowski PW, Li Y, Damon IK, Lucas CG, Arruda LB, Damaso CR. Genomic analysis, phenotype, and virulence of the historical Brazilian smallpox vaccine strain IOC: implications for the origins and evolutionary relationships of vaccinia virus. J Virol. 2015;89(23):11909–25.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Damaso CR. Revisiting Jenner’s mysteries, the role of the Beaugency lymph in the evolutionary path of ancient smallpox vaccines. Lancet Infectious Disease. 2017;18(2):e55–63.

    Article  Google Scholar 

  34. Kondas AV, Olson VA, Li Y, Abel J, Laker M, Rose L, Wilkins K, Turner J, Kline R, Damon IK. Variola virus specific diagnostic assays: characterization, sensitivity, and specificity. J Clin Microbiol. 2015;53(4):1406–10.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Li Y, Olson VA, Laue T, Laker MT, Damon IK. Detection of monkeypox virus with real-time PCR assays. J Clin Virol. 2006;36(3):194–203.

    Article  CAS  PubMed  Google Scholar 

  36. Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25(14):1754–60.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Renaud G, Slon V, Duggan AT, Kelso J. Schmutzi: estimation of contamination and endogenous mitochondrial consensus calling for ancient DNA. Genome Biol. 2015;16(1):224.

    Article  PubMed  PubMed Central  Google Scholar 

  38. van Oven M. PhyloTree Build 17: growing the human mitochondrial DNA tree. Forensic Sci Int. 2015;5:e392–4.

    Google Scholar 

  39. Weissensteiner H, Pacher D, Kloss-Brandstätter A, Forer L, Specht G, Bandelt H-J, Kronenberg F, Salas A, Schönherr. HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing. Nucleic Acids Res. 2016;44(W1):W58–63.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Talavera G, Castresana J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol. 2007;56(4):564–77.

    Article  CAS  PubMed  Google Scholar 

  42. Guindon S, Dufayard J-F, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59(3):307–21.

    Article  CAS  PubMed  Google Scholar 

  43. Zhou Z, Alikhan N-F, Sergeant MJ, Luhmann N, Vaz C, Francisco AP, Carriço JA, Achtman M. GrapeTree: visualization of core genomic relationships among 100,000 bacterial pathogens. Genome Res. 2018;28(9):1395–404.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references


The authors thank Gabriel Renaud and Nicholas Waglechner for their helpful discussions regarding consensus building and assembly evaluation. We thank the Mütter Museum of the College of Physicians of Philadelphia and the Mütter Research Institute for access to their collections and financial contributions.

The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of US Centers for Disease Control and Prevention.

Peer review information

Andrew Cosgrove was the primary editor of this article and managed its editorial process and peer review in collaboration with the rest of the editorial team.

Review history

The review history is available as Additional file 17.


ATD is supported by a Banting Fellowship and by the Government of Canada through an Insight Development Grant from the Social Sciences and Humanities Research Council of Canada (430-21018-00662), Genome Canada, and the Ontario Genomics Institute (OGI-170). HNP is supported by a Canada Research Chair, NSERC, SSHRC Insight Grant, Genome Canada, Red Wilson and McMaster University. ECH is supported by an ARC Australian Laureate Fellowship. GLS is supported by a Wellcome Trust Principal Research Fellowship. GBG is supported by a Natural Sciences and Engineering Research Council grant (RGPIN-2015-04477). AGM holds a Cisco Research Chair in Bioinformatics, supported by Cisco Systems Canada, Inc. The Comprehensive Antibiotic Resistance Database is funded by the Canadian Institutes of Health Research (PJT-156214 to AGM). Computer resources were supplied, in part, by the McMaster Service Lab and Repository computing cluster, funded in part by the Canadian Foundation for Innovation Grants to AGM. TTYL was supported by a M.G. DeGroote Institute for Infectious Disease Research Summer Student Fellowship.

Author information

Authors and Affiliations



ATD, JK, AND, RH and HNP conceived of the research idea. JK performed all wet-lab experimentation. ATD, AFP and ECH performed genomic analyses. AND, RH, MH, AB, HP, LF, DP and GLS provided historical research and context. AMC, WBD, KW and YL performed early analyses as part of the CDC. ARR, TTYL, BA and AGM examined the low-coverage S. aureus genome for antimicrobial resistance. GBG, ECH and HNP provided materials and reagents. ATD, JK, RH, GLS, ECH and HNP wrote the manuscript. All co-authors provided edits and reviews of the final manuscript. The authors read and approved the final manuscript.

Corresponding author

Correspondence to Ana T. Duggan.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

JK declares financial interest in Arbor Biosciences, which produced the baits for targeted capture.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1.

Supplementary Material and Methods.

Additional file 2: Table S1.

Details of the libraries generated from the Mütter Museum vaccination kits and vaccinia virus mapping statistics.

Additional file 3: Figure S1.

Terminal damage rates for VK01, VK02, VK05, VK08 and VK12 libraries mapped to VACV strain Copenhagen reference (M35027) and human mitochondrial reference (rCRS).

Additional file 4: Table S2.

Mapping statistics for human component of libraries VK01, VK02 and VK08.

Additional file 5: Table S3.

Summary statistics produced by QUAST for SPAdes de novo assemblies of VK01, VK02 and VK08 from shotgun data.

Additional file 6: Figure S2.

Maximum likelihood trees for Mutter vaccine strains called in reference to alternative VACV genomes.

Additional file 7: Figure S3.

GrapeTree analysis of 79 OPXV.

Additional file 8: Figure S4.

Regression analyses of root-to-tip genetic distance on the ML phylogeny against either year of collection or year of sequencing.

Additional file 9: Table S4.

OPXV genomes used for comparison in the phylogenetic analyses.

Additional file 10: Figure S5.

Partitioned maximum likelihood analysis of OPXV.

Additional file 11: Figure S6.

Mütter catalogue # 17090.29 before and after non-destructive sampling.

Additional file 12: Table S5.

OPXV genomes used for bait design.

Additional file 13: Figure S7.

Comparison of terminal damage patterns of VK01.

Additional file 14: Table S6.

BLASTX, Bowtie2, and taxonomic k-mer annotation of VK02 reads against CARD reference sequences.

Additional file 15: Figures S8–10.

VK01 coverage relative to Copenhagen VACV strain M35027, GRI90 CPXV strain X93455, and Horsepox strain DQ792504.

Additional file 16: Figure S11.

Maximum likelihood phylogenetic analysis of historical vaccine strains in relation to other OPXV using largest de novo assembled contig for each sample.

Additional file 17.

Review history.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Duggan, A.T., Klunk, J., Porter, A.F. et al. The origins and genomic diversity of American Civil War Era smallpox vaccine strains. Genome Biol 21, 175 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: