© BioMed Central Ltd 2005
Published: 1 December 2005
Skip to main content
© BioMed Central Ltd 2005
Published: 1 December 2005
Ever since its invention, the polymerase chain reaction has been the method of choice for work with ancient DNA. In an application of modern genomic methods to material from the Pleistocene, a recent study has instead undertaken to clone and sequence a portion of the ancient genome of the cave bear.
Nearly 20 years ago a researcher from down the hall in the Department of Genetics at Cambridge University brought in a home-built contraption for us to try. It consisted of a metal box containing a single halogen lamp with a copper sheet on top with eppendorf-tube-shaped indentations, together with a BBC Micro computer to run it. It was a proto-PCR machine, and along with their home-brewed Taq polymerase it worked quite well. This was soon after Saiki et al.  had described the full method, including the use of a thermostable polymerase, and before long manufacturers were offering a dazzling array of PCR machines (see ), including temperature-cycling ovens and water baths (neither of which survived the test of time). A few months after we had incorporated the copper contraption into our methodology, someone from Archaeology brought along a stingless bee, trapped in amber for millions of years. At the time, many people saw that PCR could provide a window into the past, and there was considerable excitement about its potential: even if the number of surviving intact DNA templates was low, they could, in theory, be amplified by PCR. But as with the design of thermo-cycling machines, there was a learning curve. Early results from material more than 1 million years old could not be replicated, and it began to appear that there are limits to how long we can expect intact DNA to survive. While amplifications of material up to hundreds of thousands of years old have been confirmed (for example, from bacteria in permafrost ), amplifications of older material remain controversial (reviewed in ).
DNA degrades by processes such as oxidation and hydrolysis, leading to lesions that eventually break DNA down into smaller and smaller fragments [5, 6]. As this reduces the available template, and given that PCR has the potential to amplify from even a single copy of a genome, contamination with modern DNA template becomes a major problem. One recent study used quantitative PCR to assess the relative proportion of true template to contaminating human DNA in extracts from 5,000-year-old canine bones . Standard precautions were taken, such as the isolation of the work in a separate, ultraviolet-irradiated lab, and cleaning the exterior of the sample before drilling into the bone. Even so, a large proportion of extracts were contaminated with human DNA. Malmström et al.  found that although the proportion of human template in the extracts varied among the 29 dog bones analyzed, human DNA was always more abundant than canine DNA, in some cases by two orders of magnitude. The template being amplified in this study was the mito-chondrial genome (mtDNA). From the earliest ancient amplifications that have stood the test of time through to recent applications this has tended to be the genome of choice. The reason is obvious: there are just two copies of each 'single-copy' nuclear gene in diploid organisms, but there are 2-10 mitochondrial genomes in each mitochon-drion, and hundreds or thousands (depending on the cell type) of mitochondria per cell. Given so many copies, the chance of finding intact mtDNA is greater. Contaminating material will, however, also have abundant mtDNA.
Studies of ancient DNA have often involved the amplification of short mtDNA segments for either phylogenetic or, less frequently, population studies. The PCR primers can be designed to amplify specifically from the target species, thereby helping to reduce the risk of contamination (unless the ancient material is human). A major break with this trend was published by Willerslev et al. . The chosen template was from the 18S gene in the ribosomal DNA (rDNA) family. This sequence is highly conserved, and primers were designed to amplify from as broad a range of taxa as possible. The material for amplification was from 2,000- and 4,000-year-old segments of an ice core, from which total DNA was extracted. The amplified DNA was cloned, and the clones sequenced to assess the diversity of organisms found in the two layers of ancient ice. A remarkable diversity of species was revealed in what was a significant departure from earlier applications, combining PCR and cloning strategies.
Now, in the modern era of genomics, there has been a return to basics. The first ancient DNAs to be examined came from ample material that was cloned directly into a vector (no PCR); this material came from a 150-year-old extinct horse called the quagga  and from an Egyptian mummy . This was after the concept of PCR had been invented by Kary Mullis in 1983, but before the first patent application by Cetus (where Mullis worked) and the first publication . In a study just published in Science by Noonan et al. , an approach exclusively based on cloning has been used again. This time two metagenomic libraries were constructed by anonymously cloning all DNA present in the samples. One library was from a 44,000-year-old bone and the other from a 42,000-year-old tooth from the cave bear (Ursus spelaeus). Creating a genomic library from sub-fossil material is not a very efficient process. In true genomics style, 9,035 clones (1.06 Mb) and 4,992 clones (1.03 Mb) were sequenced from the bone and tooth libraries, respectively. Among these sequences, 1.1% from the tooth library and 5.8% from the bone library were identified as cave bear. The rest were a mixture of unknown (62-66% - the largest portion), local environmental contaminants (11-17%), prokaryotic sequences (17-26%) and other eukaryotic sequences (0.7%).
After database screening, 6,775 bp from the tooth library and 20,086 bp from the bone library were attributed to the cave bear. Among these, 4-6% represented reference exon sequences, 4-6% were conserved noncoding, 35-45% were repetitive DNA, and the majority (45-55%) were unanno-tated sequences. These proportions are roughly comparable to those seen in modern mammalian genomes. There were no matches to mtDNA among the 389 putative cave bear clones, but this is not surprising given the difference in the size of the two genomes (16.5 kb mitochondrial versus 3 Gb nuclear), in spite of the high copy number of the mitochon-drial genome.
A phylogenetic reconstruction was then generated using 3,201 bp of the sequences assigned to the cave bear and comparing them with modern black bear, brown bear and polar bear sequences. The estimated substitution rate for the cave bear, based on this phylogenetic tree, was higher than expected. The probable cause was an increased occurrence of GC-AT transitions, given the propensity for deamination to convert cytosine to uracil in ancient DNA. Most of the problem seemed to lie with a few damaged clones in one of the libraries, and when they were removed the substitution rate appeared more consistent with expectation. The same four species (among others) had previously been compared by another group using the mtDNA control region and the cytochrome B (cytb) loci , and that tree was topologically equivalent to the nuclear DNA tree. This to some extent begs a question: why clone and sequence 2 million base pairs of nuclear DNA when direct amplification of mtDNA (the 'traditional' way) gives you the same result, especially for such a straightforward phylogenetic question? I think the answer is in large part that the construction of this phylogeny was not the only object of the exercise. Instead, these researchers have set their sights on a contemporary of the cave bear, and this project illustrates the potential and scope of the cloning method towards that future objective.
In July of this year, collaborators in the cave bear project from the Max-Planck Institute for Evolutionary Anthropology, and the University of California at Berkeley, announced a joint venture to sequence the genome of Homo nean-derthalensis. Such a project will raise special problems of its own. The closeness of the human and Neanderthal genomes will make distinguishing Neanderthal from contaminating human clones that much harder. Krings et al.  showed that human and Neanderthal share 92.5% of their mtDNA control region sequence. Furthermore, some studies have demonstrated the difficulty of identifying specific mutations in ancient samples, given the propensity for mutagenesis in ancient DNA . If successful, however, the Neanderthal genome project should teach us a lot about what it is to be human, and together with the results of the human genome project, something about how we got there.