Analysis of the platypus genome suggests a transposon origin for mammalian imprinting
- Andrew J Pask†1, 2Email author,
- Anthony T Papenfuss†3,
- Eleanor I Ager1,
- Kaighin A McColl3,
- Terence P Speed3 and
- Marilyn B Renfree1
© Pask et al.; licensee BioMed Central Ltd. 2009
Received: 21 November 2008
Accepted: 2 January 2009
Published: 2 January 2009
Genomic imprinting is an epigenetic phenomenon that results in monoallelic gene expression. Many hypotheses have been advanced to explain why genomic imprinting evolved in mammals, but few have examined how it arose. The host defence hypothesis suggests that imprinting evolved from existing mechanisms within the cell that act to silence foreign DNA elements that insert into the genome. However, the changes to the mammalian genome that accompanied the evolution of imprinting have been hard to define due to the absence of large scale genomic resources between all extant classes. The recent release of the platypus genome has provided the first opportunity to perform comparisons between prototherian (monotreme; which appear to lack imprinting) and therian (marsupial and eutherian; which have imprinting) mammals.
We compared the distribution of repeat elements known to attract epigenetic silencing across the entire genome from monotremes and therian mammals, particularly focusing on the orthologous imprinted regions. There is a significant accumulation of certain repeat elements within imprinted regions of therian mammals compared to the platypus.
Our analyses show that the platypus has significantly fewer repeats of certain classes in the regions of the genome that have become imprinted in therian mammals. The accumulation of repeats, especially long terminal repeats and DNA elements, in therian imprinted genes and gene clusters is coincident with, and may have been a potential driving force in, the development of mammalian genomic imprinting. These data provide strong support for the host defence hypothesis.
Genomic imprinting is an epigenetic phenomenon that results in monoallelic gene expression. Amongst mammals, it has only been identified in the therians (marsupials and eutherians). Many hypotheses have been advanced to explain why genomic imprinting evolved in mammals, but few have examined how it arose . The retention of genomic imprinting must confer an evolutionary advantage since the resulting haploinsufficiency is frequently associated with increased susceptibility to disease . The most widely accepted hypothesis to explain why mammalian imprinting may have been retained is the 'kinship hypothesis' [3, 4]. This suggests that imprinting evolved to regulate nutrient exchange between the mother and the developing fetus . Indeed, almost all the imprinted genes identified thus far are widely expressed in the eutherian placenta , a primary site of nutrient exchange. Genomic imprinting is, therefore, thought to be absent in the egg-laying monotremes, as it is in other egg laying, non-mammalian amniotes , where maternal-fetal nutrient exchange is minimal. Furthermore, investigations of four imprinted therian genes have failed to detect any evidence for genomic imprinting in the monotremes: IGF2 , IGF2R  and UBE3A  are biallelically expressed in the platypus while PEG10  is absent.
Until now, no one has been able to examine at the genome level how imprinting may have evolved due to the absence of large scale genomic resources available for all classes of mammals. Genomic imprinting may have evolved from the same mechanisms that silence transposable elements and invading foreign DNA within the genome. This is referred to as the host defence hypothesis  and is supported by the observation that most imprinted genes in eutherians are associated with repeat sequences and endogenous retroviruses [11, 12]. The recently sequenced platypus genome  provides the key resource to examine how imprinting evolved, since it is thought to have arisen after the divergence of this group from the therian mammals. While a number of imprinted gene orthologues have now been mapped in the platypus , with the exception of the DLK1 locus , there has been limited detailed analyses of their surrounding genomic context.
Comparative analyses of the PEG10 locus between therian mammals, the platypus and chicken provided the first evidence that retrotransposition is directly involved in the acquisition of genomic imprinting . Insertion of PEG10 in the therian genome was coincident with its differential methylation, established by host defence mechanisms. This was then selected for and maintained in the therian genome . The host defence hypothesis predicts that an accumulation of foreign DNA elements would have occurred in all imprinted regions in therians. To gain a greater understanding of how imprinted regions have evolved and to comprehensively test the host defence hypothesis, we have examined, on a genome scale, the conservation of synteny and accumulation of repeats and retrotransposed elements within therian-imprinted regions by comparison with the entire platypus genome.
Imprinted region conservation
Repeat distribution across the entire genome and in orthologous imprinted regions
CpG island distribution
In the eutherians, the predominant mechanism of gene silencing is due to differential methylation of CpG islands, located in or near imprinted genes [19–21]. Therefore, we also examined the distribution of CpG islands within the orthologous imprinted clusters from all species (Figure 2; Additional data file 1). Given the overall high G-C content of the platypus genome compared to that of other mammals (45.5% in platypus versus 40% in eutherians ), it is surprising that the platypus gene clusters have relatively few CpG islands compared to all other mammalian species. This suggests that, in addition to an increase in repeat elements, the accumulation of CpG islands was also coincident with the acquisition of imprinting in the therian mammals and may have evolved as a secondary mechanism to stabilize the silencing mechanism.
Our platypus genome analyses have confirmed that eutherian imprinted clusters are not recent assemblages, but instead reside in ancient syntenic mammalian groups, as previously suggested (based on analysis of orthologues of eight imprinted genes in the platypus) . In fact, the arrangement of most clusters appears to predate the divergence of birds and mammals as shown by the analysis of 61 genes over 12 clusters in non-mammalian vertebrate genomes . Despite the conservation of gene arrangement within most orthologous imprinted gene clusters between all species examined, the regions have expanded greatly in the therian mammals compared to the platypus and chicken (Additional data file 3). This is particularly noticeable in the IGF2 and SDHD imprinted regions (Additional data file 2a, b), which show a rapid expansion in the therian mammals after divergence from the monotremes. The IGF2 region is the best-characterised imprinted domain among the mammals and is imprinted in both marsupials and eutherian mammals , but not in the monotremes . The expansion of repeat classes within this cluster unequivocally coincides with the acquisition of imprinting to this region.
Analysis of the change in copy number of specific repeat classes showed that the platypus genome has significantly fewer LTRs and DNA elements within the gene clusters that became imprinted in the therian mammals. We suggest that the accumulation of LTRs and DNA elements in the therian genome is coincident with, and may have been the driving force in, the development of mammalian genomic imprinting.
LTRs comprise a particularly interesting class of repeat, as they are almost entirely absent from the platypus genome. Likewise, DNA elements are substantially lower in most, but not all, orthologous imprinted regions in the platypus and throughout the entire genome.
While the change in the incidence of repeats between platypus and therian mammals is only significant for LTRs and DNA elements across all regions combined, examination of each region individually indicates significant changes in other repeat classes within several specific regions. For example, the GNAS locus in eutherians has levels of DNA elements that are well below those in the platypus. However, the proportion of simple repeats for this region is dramatically higher in eutherians than in platypus. Similarly, low-complexity repeats are almost absent from the RASGRF1 locus in platypus, but increase rapidly in all other mammalian groups (Additional data file 2). This suggests that genomic imprinting may not be induced by a single class of repeat elements in all regions but rather an increase in any repeat type at a given locus.
Host defence mechanisms would also be attracted to repeats that move within the genome. However, our statistical analyses are limited to the detection of accumulation (insertion and expansion) of repeats but not their movement within clusters or the genome. The comparative spatial distribution of repeats (Additional data file 1) clearly demonstrates the different distribution of repeats within orthologous regions. Again, using the GNAS locus as an example, while the total percentage of SINEs is identical for both the platypus and human locus, in human the SINEs are distributed mainly within the GNAS gene, while in platypus they are found mainly within the 5' intergenic region. This could result from the movement of repeats or independent insertions in different lineages. Since our analyses are unable to discriminate between these events, our statistics are an underestimate of the changes occurring in the genome that may have attracted host defence silencing. A more detailed spatial examination of repeats could also help to explain the acquisition imprinting at some loci.
Whole genome repeat distribution analyses were also performed to determine if repeat expansion was a general feature of the mammalian genome or specific to just the orthologous imprinted regions. Our findings show that, as expected, repeat expansion is not restricted to certain regions, but a general feature of the mammalian genome. The random invasion and expansion of LTRs and DNA elements would have attracted host defence silencing mechanisms to many regions throughout the entire therian genome. This phenomenon would occasionally lead to the silencing of surrounding genes, resulting in a phenotypic effect. Only where this effect conferred an evolutionary advantage (in genes such as those that control fetal growth and maternal nutrient supply) would it have been selected for and maintained, creating an imprinted allele. This imprint can then spread to neighbouring genes, resulting in the characteristic clusters of silenced (or imprinted) genes in the genome. This suggestion is supported by the spread of imprinting observed in the PEG10 locus  and the rapid accumulation of repeat elements within the IGF2 imprinted gene cluster in therian mammals compared to the repeat deprived, non-imprinted orthologous domain in the platypus.
This is the first complete analysis of the repeat distribution in the entire genome and imprinted clusters across all extant mammalian lineages. Since imprinting arose only in the viviparous (therian) mammals and is suggested to have occurred through the cooption of host defence mechanisms, comparisons of eutherian and marsupials with the newly available platypus genome provide the first opportunity for testing this hypothesis. Our findings provide strong, genome-wide support for the host defence hypothesis to explain the evolution of genomic imprinting in therian mammals. Our analyses show that the platypus has significantly fewer repeats of certain classes in the regions of the genome that have become imprinted in therian mammals. The accumulation of repeats, especially LTRs and DNA elements, is not specific to the orthologous imprinted regions but has occurred throughout the therian genome. Host defence mechanisms such as DNA methylation would have been attracted to silence newly inserted foreign elements. This occasionally led to the silencing (imprinting) of adjacent genes. This 'imprint' was selected for, and maintained where it conferred an evolutionary advantage - for example, in genes that had functions in fetal growth, placentation or nutrient exchange - leading to the evolution of mammalian genomic imprinting.
Materials and methods
Repeat annotations were obtained for the human, mouse, dog, opossum, platypus and chicken genomes from the UCSC genome browser. The proportion of sequence in repetitive elements in imprinted gene clusters (including 20 kb flanking sequences) was then calculated. The proportion of sequence in repetitive elements was also calculated in 700 kb blocks across all genomes (this was the average size of imprinted clusters in human).
The repeats shown in Figure 2 were identified using RepeatMasker  for all species except platypus, where the whole-genome repeat analysis  was used. CpG islands (defined as more than 200 bp of continuous sequence with a C-G percentage greater than 60%) that attract methylation in imprinted regions in eutherian mammals were identified using a modified version of the CpGLH program by G Miklem and L Hillier .
To test for differences between the proportions of each repeat family in each species, all pairwise two-tailed t-tests were performed. The Holm method of correction for multiple testing was applied . In all tests, n = 19 (gene clusters) and the significance level was α = 0.05. For comparisons between platypus and therians or eutherians, the p-values quoted in the text are the largest of the adjusted p-values for all tests between platypus and those species considered (therian or eutherian). Complete results are provided in Additional data file 5.
Additional data files
The following additional data are available with the online version of this paper. Additional data file 1 shows the comparison of the spatial distribution of repeats for seven of the regions examined in our analysis. Additional data file 2 shows the analysis of percent sequence comprised by each class of repeat element separated by each region. Additional data file 3 shows a comparative gene map of the IGF2R imprinted region. Additional data file 4 shows the conservation of imprinted gene orthologues and regions within the human, mouse, dog, opossum, platypus and chicken genomes. Additional data file 5 shows the adjusted p-values from all pairwise t-tests comparing the transformed proportion of sequence in each repeat class between each species for the 19 genes and regions shown in Additional data file 4 and throughout the entire genome.
long terminal repeat
short interspersed nuclear element.
AJP is supported by a National Health and Medical Research Council RD Wright Fellowship, and MBR is supported by an Australian Research Council Federation Fellowship. ATP, TPS and KMcC are supported by the National Health and Medical Research Council of Australia.
- O'Neill MJ, Ingram RS, Vrana PB, Tilghman SM: Allelic expression of IGF2 in marsupials and birds. Dev Genes Evol. 2000, 210: 18-20. 10.1007/PL00008182.PubMedView ArticleGoogle Scholar
- Jiang YH, Bressler J, Beaudet AL: Epigenetics and human disease. Annu Rev Genomics Hum Genet. 2004, 5: 479-510. 10.1146/annurev.genom.5.061903.180014.PubMedView ArticleGoogle Scholar
- Wilkins JF, Haig D: Parental modifiers, antisense transcripts and loss of imprinting. Proc Biol Sci. 2002, 269: 1841-1846. 10.1098/rspb.2002.2096.PubMedPubMed CentralView ArticleGoogle Scholar
- Wilkins JF, Haig D: What good is genomic imprinting: the function of parent-specific gene expression. Nat Rev Genet. 2003, 4: 359-368. 10.1038/nrg1062.PubMedView ArticleGoogle Scholar
- Fowden AL, Sibley C, Reik W, Constancia M: Imprinted genes, placental development and fetal growth. Horm Res. 2006, 65: 50-58. 10.1159/000091506.PubMedView ArticleGoogle Scholar
- Killian JK, Nolan CM, Stewart N, Munday BL, Andersen NA, Nicol S, Jirtle RL: Monotreme IGF2 expression and ancestral origin of genomic imprinting. J Exp Zool. 2001, 291: 205-212. 10.1002/jez.1070.PubMedView ArticleGoogle Scholar
- Killian JK, Byrd JC, Jirtle JV, Munday BL, Stoskopf MK, MacDonald RG, Jirtle RL: M6P/IGF2R imprinting evolution in mammals. Mol Cell. 2000, 5: 707-716. 10.1016/S1097-2765(00)80249-X.PubMedView ArticleGoogle Scholar
- Rapkins RW, Hore T, Smithwick M, Ager E, Pask AJ, Renfree MB, Kohn M, Hameister H, Nicholls RD, Deakin JE, Graves JA: Recent assembly of an imprinted domain from non-imprinted components. PLoS Genet. 2006, 2: e182-10.1371/journal.pgen.0020182.PubMedPubMed CentralView ArticleGoogle Scholar
- Suzuki S, Ono R, Narita T, Pask AJ, Shaw G, Wang C, Kohda T, Alsop AE, Marshall Graves JA, Kohara Y, Ishino F, Renfree MB, Kaneko-Ishino T: Retrotransposon silencing by DNA methylation can drive mammalian genomic imprinting. PLoS Genet. 2007, 3: e55-10.1371/journal.pgen.0030055.PubMedPubMed CentralView ArticleGoogle Scholar
- Barlow DP: Methylation and imprinting: from host defense to gene regulation?. Science. 1993, 260: 309-310. 10.1126/science.8469984.PubMedView ArticleGoogle Scholar
- McDonald JF, Matzke MA, Matzke AJ: Host defenses to transposable elements and the evolution of genomic imprinting. Cytogenet Genome Res. 2005, 110: 242-249. 10.1159/000084958.PubMedView ArticleGoogle Scholar
- Hore TA, Rapkins RW, Graves JA: Construction and evolution of imprinted loci in mammals. Trends Genet. 2007, 23: 440-448. 10.1016/j.tig.2007.07.003.PubMedView ArticleGoogle Scholar
- Warren WC, Hillier LW, Marshall Graves JA, Birney E, Ponting CP, Grützner F, Belov K, Miller W, Clarke L, Chinwalla AT, Yang SP, Heger A, Locke DP, Miethke P, Waters PD, Veyrunes F, Fulton L, Fulton B, Graves T, Wallis J, Puente XS, López-Otín C, Ordóñez GR, Eichler EE, Chen L, Cheng Z, Deakin JE, Alsop A, Thompson K, Kirby P, et al: Genome analysis of the platypus reveals unique signatures of evolution. Nature. 2008, 453: 175-183. 10.1038/nature06936.PubMedPubMed CentralView ArticleGoogle Scholar
- Edwards CA, Rens W, Clarke O, Mungall AJ, Hore T, Graves JA, Dunham I, Ferguson-Smith AC, Ferguson-Smith MA: The evolution of imprinting: chromosomal mapping of orthologues of mammalian imprinted domains in monotreme and marsupial mammals. BMC Evol Biol. 2007, 7: 157-10.1186/1471-2148-7-157.PubMedPubMed CentralView ArticleGoogle Scholar
- Edwards CA, Mungall AJ, Matthews L, Ryder E, Gray DJ, Pask AJ, Shaw G, Graves JA, Rogers J, SAVOIR consortium, Dunham I, Renfree MB, Ferguson-Smith AC: The evolution of the DLK1-DIO3 imprinted domain in mammals. PLoS Biol. 2008, 6: e135-10.1371/journal.pbio.0060135.PubMedPubMed CentralView ArticleGoogle Scholar
- Hubbard TJ, Aken BL, Beal K, Ballester B, Caccamo M, Chen Y, Clarke L, Coates G, Cunningham F, Cutts T, Down T, Dyer SC, Fitzgerald S, Fernandez-Banet J, Graf S, Haider S, Hammond M, Herrero J, Holland R, Howe K, Howe K, Johnson N, Kahari A, Keefe D, Kokocinski F, Kulesha E, Lawson D, Longden I, Melsopp C, Megy K, et al: ENSEMBL 2007. Nucleic Acids Res. 2007, 35: D610-617. 10.1093/nar/gkl996.PubMedPubMed CentralView ArticleGoogle Scholar
- Brudno M, Do CB, Cooper GM, Kim MF, Davydov E, NISC Comparative Sequencing Program, Green ED, Sidow A, Batzoglou S: LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA. Genome Res. 2003, 13: 721-731. 10.1101/gr.926603.PubMedPubMed CentralView ArticleGoogle Scholar
- Margulies EH, NISC Comparative Sequencing Program, Maduro VV, Thomas PJ, Tomkins JP, Amemiya CT, Luo M, Green ED: Comparative sequencing provides insights about the structure and conservation of marsupial and monotreme genomes. Proc Natl Acad Sci USA. 2005, 102: 3354-3359. 10.1073/pnas.0408539102.PubMedPubMed CentralView ArticleGoogle Scholar
- Gardiner-Garden M, Frommer M: CpG islands in vertebrate genomes. J Mol Biol. 1987, 196: 261-282. 10.1016/0022-2836(87)90689-9.PubMedView ArticleGoogle Scholar
- Rakyan VK, Preis J, Morgan HD, Whitelaw E: The marks, mechanisms and memory of epigenetic states in mammals. Biochem J. 2001, 356: 1-10. 10.1042/0264-6021:3560001.PubMedPubMed CentralView ArticleGoogle Scholar
- Reik W, Dean W: DNA methylation and mammalian epigenetics. Electrophoresis. 2001, 22: 2838-2843. 10.1002/1522-2683(200108)22:14<2838::AID-ELPS2838>3.0.CO;2-M.PubMedView ArticleGoogle Scholar
- Dünzinger U, Haaf T, Zechner U: Conserved synteny of mammalian imprinted genes in chicken, frog, and fish genomes. Cytogenet Genome Res. 2007, 117: 78-85. 10.1159/000103167.PubMedView ArticleGoogle Scholar
- Repeat Masker. [http://www.repeatmasker.org/]
- R. [http://www.r-project.org/]
- Holm S: Simple sequentially rejective multiple test procedure. Scandinavian J Stat. 1979, 6: 65-70.Google Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.