- Open Access
Species-specific shifts in centromere sequence composition are coincident with breakpoint reuse in karyotypically divergent lineages
Genome Biology volume 8, Article number: R170 (2007)
It has been hypothesized that rapid divergence in centromere sequences accompanies rapid karyotypic change during speciation. However, the reuse of breakpoints coincident with centromeres in the evolution of divergent karyotypes poses a potential paradox. In distantly related species where the same centromere breakpoints are used in the independent derivation of karyotypes, centromere-specific sequences may undergo convergent evolution rather than rapid sequence divergence. To determine whether centromere sequence composition follows the phylogenetic history of species evolution or patterns of convergent breakpoint reuse through chromosome evolution, we examined the phylogenetic trajectory of centromere sequences within a group of karyotypically diverse mammals, macropodine marsupials (wallabies, wallaroos and kangaroos).
The evolution of three classes of centromere sequences across nine species within the genus Macropus (including Wallabia) was compared with the phylogenetic history of a mitochondrial gene, Cytochrome b (Cyt b), a nuclear gene, selenocysteine tRNA (TRSP), and the chromosomal histories of the syntenic blocks that define the different karyotype arrangements. Convergent contraction or expansion of predominant satellites is found to accompany specific karyotype rearrangements. The phylogenetic history of these centromere sequences includes the convergence of centromere composition in divergent species through convergent breakpoint reuse between syntenic blocks.
These data support the 'library hypothesis' of centromere evolution within this genus as each species possesses all three satellites yet each species has experienced differential expansion and contraction of individual classes. Thus, we have identified a correlation between the evolution of centromere satellite sequences, the reuse of syntenic breakpoints, and karyotype convergence in the context of a gene-based phylogeny.
The centromere paradox posits that the DNA at centromeres is conserved for function, but not sequence . Within the murine and primate lineages, centromeric DNA sequences are species specific and different chromosomes within a species sometimes contain divergent centromeric DNA sequences . In stark contrast, the gross structure of the centromere and the associated kinetochore proteins are conserved across eukaryotes [3, 4]. Such functional conservation in the apparent absence of sequence conservation, combined with the identification of functional centromeres at non-centromere locations (that is, neocentromeres), has led to the hypothesis that centromeres are largely determined by epigenetic modifications, such as histone variants [5, 6] (reviewed in ). In humans, it has been suggested that segmental duplication events around the centromere ultimately lead to the high degree of variability within centric sequences [8, 9]. However, studying the evolution of human centromere sequences in the context of karyotypic change has been difficult because the family of great apes has experienced little gross chromosome change between species .
Several marsupial families have experienced extensive karyotypic change, deriving from the rearrangement of a basic complement of 19 chromosome blocks through centric shifts (centromere repositioning), fissions, fusions and translocations [11–13]. Extant marsupial karyotypes exhibit a bimodal distribution between 2n = 14 and 2n = 22 [14, 15]. While the 2n = 14 karyotype is homologous in several extant lineages, the 2n = 22 karyotype is highly divergent, suggesting independent derivation through breakpoint reuse. Rens et al.  traced the history of the rearrangements of these 19 syntenic blocks across several marsupial families, demonstrating frequent convergent breakpoint reuse within marsupials at breaks of synteny between these chromosome segments [11, 12].
The recent radiation of karyotypically diverse species within the marsupial subfamily Macropodinae (kangaroos, wallaroos and wallabies)  affords the opportunity to study centromere evolution in the context of karyotypic change within a relatively short evolutionary time frame. Across the approximately 58 macropodine species diploid numbers range from 2n = 10(XX)/11(XY1Y2) (Wallabia bicolor) to 2n = 24 (Lagostrophus fasciatus), all derived through different suites of centric fusions (Robertsonian translocations), centric shifts (centromere repositioning) and pericentric inversions [16, 17]. Within the Macropodinae, the genus Macropus (14 species including W. bicolor) has undergone a recent (4-11 million years ago) [18–20] and rapid karyotypic radiation. However, phylogenetic studies within this genus, relying on DNA-DNA hybridization , chromosome evolution based on G-banding studies  and serology-based studies [22, 23] have failed to provide well-supported concordant phylogenies for species within this genus.
Confounding efforts to reconstruct phylogenetic relationships based on chromosome evolution is the observation that several species within Macropus have experienced breakpoint reuse between syntenic blocks, each at active centromere locations, in the derivation of novel karyotypes (reviewed in ). For example, the karyotype of the model species Macropus eugenii (tammar wallaby) is derived from the ancestral 2n = 22 through a series of fusions and translocations resulting in a reduction in chromosome number to 2n = 16. In fact, three different 2n = 16 karyotypes are seen within Macropus, each resulting from different fusions and translocations at the centromeres of the same syntenic blocks. The reuse of the breaks of synteny within this genus occurs exclusively at centromeric sites, allowing commensurate tracking of syntenic boundaries and centromeric sequences.
We hypothesize that the reuse of breaks of synteny involving centromeric sequences, active or inactive, leads not to an increase in the variability of involved DNA sequences, but instead leads to their conservation. The 'library hypothesis' posits that a suite of satellite sequences may be shared between closely related species . Different satellite families may experience different evolutionary processes, such as concerted evolution, intrachromosomal sequence conversion and unequal crossing over . Species-specific 'turnover' of these sequences may occur when a satellite family becomes selected as a major centromere satellite capable of attracting centromere proteins, and is thus functional with respect to cell division . We posit that the conservation of centromeric sequences between lineages is reinforced the more often the breakpoints associated with these centromere locations are reused. This hypothesis can be tested in models where centromere reuse is high, as the evolution of centromeric repeats is predicted to accompany the evolution of syntenic block rearrangements. The data presented herein were used to determine whether the evolutionary trajectory of centromeric sequence composition has paralleled chromosome evolution, and thus syntenic block arrangement, or whether it follows species evolution.
Previous work on the macropodine species Macropus rufogriseus (red-necked wallaby, Mrb) detailed the sequence and karyotypic distribution of three centromeric sequence classes, a functional 178 bp centromere satellite (typified by the sequence Mrb-sat23), a repeat derived from a simple 7-mer (typified by the sequence Mrb-B29), and a degenerate pericentric satellite (typified by the sequence Mrb-sat1). In the present study, we examined the karyotypic distribution of these three centromeric constituents across Macropus and have identified patterns of chromosome distributions. A gene phylogeny based on mitochondrial and nuclear sequences was constructed for nine species within Macropus. This phylogeny was then tested for concordance with a phylogeny derived from syntenic block arrangement as determined by GRIMM (Genome Rearrangements In Man and Mouse) algorithms within the Multiple Genome Rearrangements (MGR) program . Comparative analyses of these datasets showed that the evolution of centromeric sequence composition has paralleled chromosome, and thus syntenic block, evolution but has not strictly followed species evolution as measured by phylogenetic metrics.
Results and discussion
The entire mitochondrial Cytochrome b (Cyt b) gene was sequenced from eight Macropus species (M. robustus, M. antilopinus, M. rufus, M. giganteus, M. eugenii, M. rufogriseus, M. agilis, M. parma) and W. bicolor. One Petrogale (P. xanthopus) and one Thylogale (T. thetis) species were sequenced as outgroups (reviewed in ), representing two other macropodine genera (Additional data file 1). Thylogale carries the ancestral karyotype of all Macropodidae and shares a common ancestor with Macropus and Petrogale [16, 18], rendering it an ideal outgroup taxon for all datasets of our study. The macropodine Cyt b is 1,146 bp in length and was included in its entirety for these analyses. The polymorphic sites (362 variable sites and 200 parsimony informative sites) across the 11 species analyzed in this study were evenly distributed throughout the gene (data not shown).
Due to the potential for natural interspecies hybridization within the Macropus genus that may skew mitochondrial gene sequence towards one species, albeit a very rare occurrence in this clade , a nuclear gene, selenocysteine tRNA (TRSP), and its flanking regions were included in these analyses. TRSP is the gene region that includes the selenocysteine tRNA gene, an alternative tRNA for the UGA termination codon (also called the opal suppressor) in selenoproteins. Similar in structure to cysteine, selenocysteine substitutes sulfur with selenium. The TRSP region was selected because previous studies indicated that regions flanking the TRSP transcription unit carried sufficient informative sites for phylogenetic resolution of closely related species within the Canidae . The time since species divergence within the Canidae (0.3-12 million years ago) [31, 32] is similar to that of Macropus (4-11 million years ago) [18, 19]. Moreover, extensive karyotype rearrangement has also been documented across Canid species . Bardeleben et al.  found the evolutionary rates of the 5' and 3' flanking regions of TRSP to be faster than that of introns of some nuclear genes, making the choice of this gene more attractive for our intra-genus study.
The 87 bp TRSP gene and its 5' (340 bp) and 3' (261 bp) flanking regions were sequenced from 11 species (Additional data file 1). Across the dataset, the TRSP region (688 bp) had 213 variable sites and 103 parsimony informative sites. Unlike the informative sites identified in Cyt b that show an even distribution across the entire gene (data not shown), the informative sites within TRSP are clustered in the regions flanking the TRSP coding sequence (Figure 1). Nine of the variable sites were intragenic, of which eight were transversions. Five indels were present in the 5' region and six in the 3' region. Within the 5' flanking region, though they do not have good sequence identity, the proximal and distal sequence elements can be found at the same positions as their eutherian counterparts. While the proximal and distal sequence elements are not well conserved, other regions, such as -182 bp to -145 bp upstream, are (Figure 1), but no established functionality has been attributed to them.
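For readers less familiar with the terminology, a transition exchanges one purine for another (A and G) or one pyrimidine for another (C and T), whereas a transversion exchanges a purine for a pyrimidine. A minimal sketch of that classification follows; the function name `classify_substitution` is hypothetical and the example calls are illustrative only, not sites from the TRSP alignment.

```python
PURINES = frozenset("AG")
PYRIMIDINES = frozenset("CT")
VALID_BASES = PURINES | PYRIMIDINES


def classify_substitution(base_a, base_b):
    """Classify a nucleotide difference as a 'transition' or a 'transversion'."""
    a, b = base_a.upper(), base_b.upper()
    if a not in VALID_BASES or b not in VALID_BASES:
        raise ValueError("bases must be one of A, C, G, T")
    if a == b:
        raise ValueError("identical bases are not a substitution")
    pair = {a, b}
    # Both bases in the same chemical class means a transition.
    if pair <= PURINES or pair <= PYRIMIDINES:
        return "transition"
    return "transversion"


print(classify_substitution("A", "G"))  # transition
print(classify_substitution("A", "C"))  # transversion

# Segment lengths quoted in the text sum to the stated region length.
print(87 + 340 + 261)  # 688
```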
Previous studies have failed to clarify the phylogenetic relationships amongst the five groups of Macropus species included in this study: grey kangaroo (M. giganteus), red kangaroo (M. rufus), the wallaroos (M. antilopinus and M. robustus), swamp wallaby (W. bicolor), and the 'true' wallabies (M. parma, M. eugenii, M. agilis, M. rufogriseus). The Cyt b analysis places W. bicolor within the 'true' wallabies, to the exclusion of M. giganteus. The p-distances within the Cyt b dataset for W. bicolor are low and comparable to those of the other wallabies (Figure 2a). The two wallaroo species maintain a close association and define a group unto themselves. In contrast, the TRSP tree places M. giganteus with the 'true' wallabies, to the exclusion of W. bicolor (Figure 2b), with more significant clade credibility values. The tree derived from analysis of a concatenation of both sequences maintains the topology of the TRSP tree with respect to these two species and has strong supporting credibility values (Figure 2c). Given the individual or concatenated datasets, we used the Shimodaira-Hasegawa test  (as implemented using TREE-PUZZLE 5.2 ) to explore the confidence set of phylogenies. Shimodaira-Hasegawa testing of single and combined datasets did not significantly reject phylogenies in which M. giganteus and W. bicolor formed a distinct clade; that grouping therefore cannot be excluded.
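The p-distance referred to above is the proportion of compared sites at which two aligned sequences differ. A minimal sketch of the calculation is given here, assuming that sites with a gap or ambiguity code in either sequence are excluded from the comparison (pairwise deletion); the function name `p_distance` and the example strings are hypothetical.

```python
def p_distance(seq_a, seq_b):
    """Proportion of differing sites among positions where both bases are unambiguous."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        # Pairwise deletion: skip sites with a gap or ambiguity code in either sequence.
        if a in "ACGT" and b in "ACGT":
            compared += 1
            if a != b:
                differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Hypothetical eight-site example: two differences over eight compared sites.
print(p_distance("ACGTACGT", "ACGTTCGA"))  # 0.25
```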
The association supported by the TRSP and concatenated datasets places M. rufus as sister taxon to all other Macropus and Wallabia. Surprisingly, in trees produced from both Cyt b and TRSP, the wallaroos are sister taxa to Wallabia and the rest of Macropus, though overall sequence differences of M. rufus outweigh that association in the concatenated tree (Figure 2a,b versus 2c). The concatenated tree logically places M. rufus on the ancestral Macropus node, places the two wallaroo species together and places all the 'true' wallabies together (see below and Figure 3). Neither the Cyt b nor TRSP analysis alone resolves the species within the 'true' wallabies. It is likely that there were not enough phylogenetically informative sites to resolve this group due to their recent derivation, though the combination of both genes does provide some resolution. However, the phylogeny presented is statistically robust (see Materials and methods), and thus provides a sound topological basis from which to examine the pattern of karyotypic evolution in this group.
Multiple genome rearrangement analysis
Basic marsupial karyotypes can be defined by the arrangement of 19 conserved chromosome segments, or syntenic blocks derived from a common ancestor [11, 12] (Figure 3a key). These syntenic blocks can be used to trace the history of chromosome rearrangements across Macropus species. The syntenic blocks are arranged in six different configurations among the eleven species examined. Though the karyotypes of most Macropus species have been defined by G-banding , deriving the most parsimonious chromosome phylogeny has produced conflicting trees [11, 13] largely due to a lack of consensus on the amount of convergent breakpoint reuse and the number of translocations across species with a 2n = 16 diploid number. Resolution has been further confounded by the lack of a comprehensive gene phylogeny to use as a guide from which to study the order and pattern of rearrangement of affected chromosomes.
As a general measure of chromosome evolution, a karyotype phylogeny has been generated for Macropus. The MGR program  was used to reconstruct the most parsimonious lineage for Macropus based on syntenic block organization. For this analysis, each chromosomal segment was coded by syntenic number relative to the ancestral karyotype, represented by T. thetis, and oriented relative to the centromere position (Figure 3a key). The MGR program allows break-point reuse, fissions, fusions, inversions and translocations to occur in any direction necessary to achieve maximum parsimony . Our analysis of the phylogenetic history of the 19 synteny blocks across this group of mammals supports a tree for the taxa that reduces the number of rearrangements suggested by previous phylogenies [11, 13] and supports convergent breakpoint reuse at the centromeres among syntenic blocks C1 (Figure 3a,b, dark pink), C2 (grey), C8 (light yellow), C10 (aqua), C15 (dark orange), and C18 (orange). These six blocks, each with boundaries at the centromere, are involved in the various suites of chromosome rearrangements within this genus.
The MGR analysis of syntenic block rearrangement produced an unrooted tree with three ancestral nodes from which the input karyotypes derive (Figure 3a). Nine steps (one step = one fission, fusion, or translocation) were extrapolated along the total length of the tree. This tree also infers three ancestral karyotypes, denoted as α, β and γ. The ancestral α karyotype, presumed to be the oldest from its proximity to T. thetis (an outgroup and a species carrying the 2n = 22 karyotype ancestral to all Macropodidae), is 2n = 20 and equivalent to M. rufus. In the ancestral α (and M. rufus) karyotype, syntenic block C8 has undergone a fusion with block C1. Ancestral α is inferred to have undergone two autosomal fusions (denoted by '^') to create the 2n = 16 karyotype of ancestral β (C10^C18, C15^C2), equivalent to that of M. eugenii. From ancestral β, a further translocation (denoted as '*'; C15*C10, C2*C18) achieved a karyotype equivalent to that of M. robustus. A different translocation (C8*C18, C10*C1) occurred to form the ancestral γ karyotype, still a 2n = 16 form. The karyotype of M. giganteus is derived by yet a different translocation event (C8*C2, C15*C18), still preserving the 2n = 16 form. The karyotype of W. bicolor is achieved from the ancestral γ by three fusions, including the fusions of two autosomes to the X.
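The diploid numbers and step count described above can be checked with simple bookkeeping. The sketch below is an illustration only and is not the MGR algorithm: it assumes that each fusion removes one chromosome pair, each fission adds one, and each reciprocal translocation leaves the count unchanged, and it assumes that ancestral γ derives from ancestral β, which is one reading of the text. The names `PAIR_CHANGE`, `diploid_number` and `BRANCHES` are hypothetical, and the W. bicolor value applies to the female (XX) complement.

```python
# Assumed change in the number of chromosome pairs per rearrangement type.
PAIR_CHANGE = {"fusion": -1, "fission": 1, "translocation": 0}


def diploid_number(start_2n, events):
    """Apply a list of rearrangement events to a starting diploid number."""
    pairs = start_2n // 2
    for event in events:
        pairs += PAIR_CHANGE[event]
    return 2 * pairs


# Branches of the karyotype tree as read from the text (assumed topology).
BRANCHES = {
    "T. thetis -> alpha": ["fusion"],                 # C8^C1
    "alpha -> beta": ["fusion", "fusion"],            # C10^C18, C15^C2
    "beta -> M. robustus": ["translocation"],
    "beta -> gamma": ["translocation"],
    "gamma -> M. giganteus": ["translocation"],
    "gamma -> W. bicolor": ["fusion", "fusion", "fusion"],
}

alpha = diploid_number(22, BRANCHES["T. thetis -> alpha"])
beta = diploid_number(alpha, BRANCHES["alpha -> beta"])
gamma = diploid_number(beta, BRANCHES["beta -> gamma"])
bicolor = diploid_number(gamma, BRANCHES["gamma -> W. bicolor"])
total_steps = sum(len(events) for events in BRANCHES.values())

print(alpha, beta, gamma, bicolor, total_steps)  # 20 16 16 10 9
```

Under these assumptions the bookkeeping reproduces 2n = 20 for ancestral α, 2n = 16 for ancestral β and γ, 2n = 10 for the W. bicolor female, and a total of nine steps along the tree.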
The MGR derived phylogeny is not concordant with the gene tree (Figure 2c) as it places M. giganteus and W. bicolor in a separate monophyletic group derived from the γ ancestral karyotype. When the derivation of these arrangements, including the inferred ancestors, is mapped along the tree derived from our sequence analyses, convergent breakpoint reuse is implicated (Figures 3, 4, 5). The breaks between blocks C1, C2, C8, C10, C15 and C18 are used in the derivation of the β ancestral karyotype in the lineage leading to the M. eugenii group as well as in the derivation of the γ ancestral karyotype in the lineages leading to M. giganteus and W. bicolor.
Fluorescence in situ hybridization of centromere satellites
Our MGR analysis, taken in the context of species evolution, indicated that convergent breakpoint reuse has occurred several times within this genus. Moreover, the rearrangements within Macropus that distinguish each karyotype involve a breakpoint at a centromere. With the exception of the W. bicolor X, which has two autosomes fused to it , this karyotypic lability is derived from intrachromosomal rearrangements of six syntenic blocks (C1, C2, C8, C10, C15 and C18) that border active centromeres in all macropodine species (Figure 3b). Each species within Macropus also carries a unique X chromosome arrangement and structure (see below and Figure 6), yet is composed of only one syntenic block (C19). For these reasons, we wanted to investigate genetic markers that would prove informative for analyses of sex chromosome evolution in addition to karyotypic evolution of the autosomal complement.
To track the evolution of centromere sequences, with attention towards breakpoint reuse and chromosome rearrangement, as well as sex chromosome structure, we analyzed the chromosome distribution of large blocks of the three centromere satellite classes, sat1, sat23 and B29 . Fluorescence in situ hybridization (FISH) of these three satellite classes previously proved informative in the identification of functional centromere sequences within the 2n = 16 M. rufogriseus karyotype .
A representative of each of these satellite classes was used as a probe for FISH on metaphase chromosome preparations from eight Macropus species (M. eugenii, M. agilis, M. rufogriseus, M. parma, M. giganteus, M. robustus, M. antilopinus, M. rufus), W. bicolor and P. xanthopus (Figure 4, Additional data files 1-3). The Mrb-sat1 probe is a degenerate pericentric satellite from M. rufogriseus, containing just over two 342 bp tandem repeats of 71% homology to one another. The 410 bp Mrb-B29 probe contains simple, tandem 6- and 7-mer repeat variants of GGAATTT. The Mrb-sat23 probe contains one and a half units of a 178 bp alphoid satellite containing a functional CENP-B-box .
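To illustrate what a simple tandem repeat such as the B29 7-mer looks like at the sequence level, the following minimal sketch reports the longest uninterrupted run of exact motif copies in a sequence. The function name `longest_tandem_run` and the toy sequence are hypothetical; the toy sequence is not the Mrb-B29 probe, and exact matching deliberately ignores the 6-mer variants mentioned above.

```python
import re


def longest_tandem_run(sequence, motif="GGAATTT"):
    """Return the largest number of consecutive exact copies of motif in sequence."""
    pattern = re.compile(f"(?:{re.escape(motif)})+")
    # Each match is a maximal run of back-to-back copies; convert its length to a copy count.
    runs = [len(match.group(0)) // len(motif) for match in pattern.finditer(sequence.upper())]
    return max(runs, default=0)


# Hypothetical toy sequence: three 7-mer copies, one 6-mer variant, then two more copies.
toy_sequence = "ACGT" + "GGAATTT" * 3 + "GGAATT" + "GGAATTT" * 2 + "TTAA"
print(longest_tandem_run(toy_sequence))  # 3
```

The 6-mer variant interrupts the exact-match run, which is why the toy sequence yields three copies rather than five; a tolerant matcher would be needed to score degenerate arrays.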
Figure 4 shows the FISH data for satellites found at all centromeres within a karyotype while Figure 6 summarizes the distribution of all three satellites to the X and Y chromosomes of each species. Within M. rufogriseus, W. bicolor and M. giganteus, sat1 is centromeric on the X chromosomes. M. giganteus and M. rufus are unique in sharing the localization of sat1 to nearly all autosomal centromeres (Figures 4 and 6). Hybridization of this satellite to M. rufus chromosome 7 could not be detected, perhaps because of the small size of the centromere on this chromosome. The ubiquitous presence of sat1 at the centromeres of M. giganteus indicates this sequence may be acting as a functional centromere in this species.
The sat1 hybridization to wallaroo M. robustus chromosomes is a prominent indicator of the influence that rearrangements of the six syntenic blocks (C1, C2, C8, C10, C15 and C18) may have over centromere sequences (Figure 5). The only centromeres to contain sat1 (chromosomes 1, 5 and 6) are the centromeres that separate syntenic blocks C1, C2, C8, C10, C15 and C18. These are the only actively rearranging blocks in the karyotype of this lineage, indicating that the use of these centromeres as sites of rearrangement has led to the conservation of this satellite sequence. M. antilopinus, which is closely related to M. robustus, shows a similar distribution of sat1, although the hybridization signal is more dispersed. Sat23 signal is also reduced in M. antilopinus, perhaps an indication of a lesser amount of heterochromatic material. In fact, hybridization signal of centromere satellites in general is expected to be less detectable in M. antilopinus, as its centromeres are generally smaller than those of M. robustus.
Traces of B29, the simple repeat, are seen at the centromeres of P. xanthopus, indicating this satellite is not restricted to the Macropus (Figure 4). Within the other wallabies (for example M. eugenii) this sequence is lost at the X centromere core, but is present pericentromerically (Figure 6, Additional data files 3-5). Remarkably, B29 is conserved as a major component of the Y chromosome in every species included in this study (Figure 6). Its predominance in the P. xanthopus and presence in the Macropus Y chromosomes implies that once this repeat has been introduced to the Y chromosome, it has not been removed over the evolutionary time period analyzed herein (approximately 15 million years).
Sat23, which contains a CENP-B binding domain in the M. rufogriseus variant, does not appear to be present in large enough tandem arrays in P. xanthopus to be detected via FISH analyses (Additional data files 3-5), though Southern analyses detect its presence in lower copy number (Additional data file 6). Sat23 is present at every autosomal centromere except for those of W. bicolor and M. giganteus (Figure 4). However, this satellite is present at the centromere of the X and Y1 of W. bicolor and the centromere of the Y of M. giganteus (Figures 4 and 6). The presence of sat23 in tandem arrays exclusively at all centromeres of most Macropus species (Figure 4, Additional data files 3-5) indicates that this genus is characterized by a conserved centromeric sequence. It is important to note that given the low stringency applied to our FISH assays, these sequences are likely not identical but are homologous and only detectable in large repeated blocks.
The pattern of satellite expansion and contraction, taken in the context of tree topology and chromosome rearrangement (Figure 4), indicates that the amounts of both sat1 and sat23 have each grown and diminished in divergent lineages that have experienced different types of chromosome rearrangements. For example, the contraction of sat23 in W. bicolor and M. giganteus coincides with the formation of the ancestral γ karyotype through the same translocation of (C8*C18) and (C10*C1) (Figure 3). Subsequent to this, sat1 experienced another expansion specific to the M. giganteus lineage and coincident with one more translocation between (C8*C2) and (C15*C18) (Figure 3). Prior to the divergence of M. rufus, an expansion of sat23 and the reduction of B29 coincide with a fusion (C1^C8). The wallaroos (M. robustus and M. antilopinus) experience a partial increase of sat1 accompanying a translocation (C15*C10, C2*C18) in their karyotype lineage, only in the centromeres participating in common translocation sites (the centromeres between C1 and C8, C15 and C10, and C2 and C18) (Figures 3, 4, 5).
Every change in karyotypic evolution in this genus has been accompanied by a corresponding change in predominant centromeric sequence composition. From our data it appears that contractions of centromere satellites are most often associated with fusion events during chromosome rearrangement, and expansions of centromere satellites are most often associated with translocation events. The diminution of satellite B29 coincides with a fusion at arrow 1 in Figure 4. Reductions of satellites sat1 and sat23 coincide with fusions at arrows 3 and 4, respectively. Arrows 2 and 5 indicate that sat1 accretion accompanies translocation events. On the lineage to W. bicolor (arrow 4) a translocation also occurs. While we have been able to identify a centromeric repeat that has undergone diminution associated with the fusion, we have not been able to identify a predominant centric sequence, or its accompanying accretion, in this lineage as predicted by the presence of the translocation. It is likely that an as yet unidentified sequence exists, and we predict it has undergone an expansion to become more prevalent at the centromeres of this species.
Phylogenetic history of Macropus
While several studies have attempted to refine a phylogeny for the genus Macropus, these studies were lacking in either species coverage or support. Previous morphological and serological studies of Macropus adequately sampled the genus, though were lacking in statistical support [22, 23, 37]. Previous genetic studies, while statistically supported, lacked adequate representation of the genus . The choice of two genes, one mitochondrial and one nuclear, provides for a sound phylogenetic analysis of this group of species [38–58]. The Cyt b and TRSP gene phylogenies reported herein include 9 of the 13 extant species (approximately 70% coverage), encompassing a more comprehensive dataset for developing a Macropus phylogeny. Our phylogenetic analyses derived from the Cyt b/TRSP concatenated dataset show high Bayesian clade credibility values and maximum likelihood (ML) bootstrap values (Figure 2c), providing a robust phylogeny on which to analyze the pattern of centromere and chromosome evolution across this group of mammals.
The placement of W. bicolor in relation to the Macropus phylogeny has previously been debated . W. bicolor is the only extant member of its genus. The status of this species as a sister genus to Macropus has been historically supported by morphological , immunological , and limited gene phylogeny studies . However, this placement has been challenged by serology  and DNA-DNA hybridization [19, 20, 37], providing support for inclusion of Wallabia species within Macropus. Our gene based phylogenies (Figure 2) and MGR karyotype analysis (Figure 3) find Macropus to be monophyletic including Wallabia.
Another conclusion from these phylogenetic analyses is the relative position of M. rufus (red kangaroo) and M. giganteus (grey kangaroo). The Cyt b/TRSP tree (Figure 2c) excludes M. rufus from the rest of Macropus, while placing M. giganteus and W. bicolor with the 'true' wallabies. Previous taxonomists placed M. rufus with the wallaroos in a separate genus, Osphranter . The analyses presented herein do not place these three species into one monophyletic group and support their inclusion within Macropus.
Chromosomal history of Macropus
Comparison of the Cyt b/TRSP phylogeny (Figure 2c) overlaid with the karyotype analysis derived from the MGR phylogeny (Figure 3) indicates ten different rearrangements are needed to form every karyotype derived from the ancestral α karyotype (Figure 4). Though the representation of chromosome rearrangements with respect to the gene phylogeny is less parsimonious than the MGR analysis by one step, it is probably more reflective of the natural history of the clade as measured by the Cyt b/TRSP analysis and FISH analyses.
In our analyses we examined the evolution of centromere satellite repeats across Macropus species to determine whether the path of centromere evolution has paralleled chromosome evolution or species evolution, as measured by gene histories. The distribution of predominant centromeric sequences across these species is not informative when mapped onto the gene phylogeny alone (Figure 4). When the history of syntenic block rearrangement is considered, the contractions and expansions of predominant satellites are found to consistently accompany specific karyotype rearrangements of syntenic blocks C1, C2, C8, C10, C15 and C18 (see Figure 5 for an example). Thus, there is a strong correlation between changes in predominant satellite sequences, with respect to homogeneous distribution across all centromeres within a karyotype, and chromosome rearrangement events.
Of significance is the demonstration that convergent breakpoint reuse between C1, C2, C8, C10, C15 and C18 results in convergent centromere restructuring. Other studies have identified retention of low-copy numbers of sat23 satellite sequence at the breaks of synteny between most of the 19 conserved chromosome segments within M. eugenii . Based on evidence of convergent centromere sequence expansion of sat1 among M. rufus, M. robustus, and M. giganteus (Figure 4), we hypothesize that retention of these sequences at breaks of synteny in low copy provides the sequence targets for centromere satellite expansion.
Our data suggest that 'new' satellite sequences have not been repeatedly introduced into the macropodine genome to become predominant centromeric sequences as predicted by centromere drive . Rather, these centromeric satellites remain in the genome, likely at latent centromere locations , and undergo recurrent repeat copy number expansion and contraction in divergent lineages. This analysis does not imply de novo adoption of previously non-centromeric sequences at centromere locations following chromosome rearrangement, but indicates the same sequences can undergo convergent expansion across all centromeres in different lineages.
Salser et al.  proposed the 'library hypothesis' of satellite evolution in which related lineages share a collection of heterochromatic repeat sequences that may become preferentially amplified in any of the given species during the normal events of centromere evolution. In Macropus the 'library' of satellite sequences, including sat1, sat23 and B29, is involved in the creation of large satellite arrays. This conservation implies that centromeric sequences are not created de novo, but recycled from the existing library. Mestrovic et al.  found support for the 'library hypothesis' in examining satellite sequence predominance across congeneric species of Palorus insects. PCR assays showed that, although all Palorus species examined possessed all satellites examined, a different single satellite was greatly amplified in each of the different species, demonstrating that all species shared a common satellite library from which the amplifications occurred. We have found the same to be true by FISH (Figures 4 and 5, Additional data files 3-5) and Southern analyses of our repeats across Macropus (Additional data file 6). Lin and Li  identified similar inter-genera evidence of centromeric heterochromatin conservation among cervid deer.
Within Macropus, the recurrent pattern of detectable repeat presence or absence by FISH in the autosomes versus the sex chromosomes (Figures 4 and 6) could be indicative of the rate at which these two types of chromosomes accumulate and dissipate centromeric material. The sex chromosomes of this group appear to retain tandem arrays of ancestral centromeric material for longer periods of time. For example, the presence of B29 on the sex chromosomes of all species examined indicates an origin for this satellite predating Macropus diversification. While most Macropus species carry all three satellites on their sex chromosomes, subsequent reduction of sat1 on the sex chromosomes has occurred in the lineage leading to M. robustus/M. antilopinus as well as within M. parma. Most species within Macropus carry a suite of satellites (B29, sat1) on their sex chromosomes that are no longer found as expanded satellites on their autosomal counterparts. Evidence from cervid deer and muntjac also shows retention of tandem arrays of satellites in the sex chromosomes that are not found in the autosomes [61, 62].
Mechanistic processes inherent to fusion and translocation events may be responsible for the observed contractions and expansions of the satellite arrays. Diminution of satellite arrays by excision may occur as a result of subtractive processes occurring during fusion events. Prior to Robertsonian fusions, chromosome breaks within the centric satellites remove the p-arms, and a portion of the centromere and pericentromere, to expose the fusion sites [63, 64]. The centric position of the break sites leads to overall reduction of satellite sequences as a result of fusion events.
In contrast, duplication of centromere material leading to accretion of satellite arrays may be the result of arm-swapping translocations. Studies of patterns of segmental duplications in humans indicate that segmental duplications precede and accompany translocation events. Segmental duplications show a concentration near centromeres and occur more often interchromosomally. As such, they are hypothesized to aid in centromere sequence convergence [9, 65, 66]. Segmental duplications preceding a translocation event would increase sequence identity between sites, making such translocation events more likely. At the centromeres, such duplication events would also serve to distribute satellite sequences to centromeres throughout the genome, increasing the likelihood of the adoption of a predominant satellite sequence. Propagation of satellites via segmental duplication events also supports the 'library hypothesis' of centromere satellite origination, as it represents a recycling process inherent to the hypothesis.
After predominant centromere satellites were identified in a majority of the genus, these sequences were used to track centromere evolution. Comparison of the karyotype phylogeny with the gene-tree topology indicates that, while Macropus species possess several divergent karyotypes, there is reuse of satellite sequences as a result of breakpoint reuse at specific syntenic block boundaries coincident with centromeres. This study shows that the 'library hypothesis' describes the patterns of centromere sequence convergence within this mammalian lineage. Thus, satellite sequence evolution is found to strictly follow chromosomal evolution, likely as a result of the dynamic role the centromere plays in karyotype change.
Materials and methods
Cyt b sequence analyses
Mitochondrial Cyt b was amplified from 11 species (Additional data file 1) using the primers Mr1/Mr2 (Additional data file 7) that flank the gene. Internal gene sequencing was done with combinations of primers (Additional data file 7). Products were direct sequenced with ABI BigDye3.1 on an ABI 3130 sequencer as per the manufacturer's instructions (ABI: Foster City, CA, USA). Cyt b was sequenced from two individuals per species to confirm species identity. However, two individuals each were unavailable for M. antilopinus and M. parma. Thus, branch lengths based on the number of substitutions per site were calculated to evaluate intra- versus interspecies diversity across the first 406 bp of Cyt b from all Macropodine sequences available from GenBank and our dataset (Additional data file 1). All interspecies branch lengths were larger than all intraspecies branch lengths. The distances between M. antilopinus and M. parma and their respective nearest neighbors exceeded all intraspecies branch length values; these samples were thus concluded to represent distinct species, or at least to share identity with none of the other species included in this study. P. xanthopus and T. thetis were sequenced as outgroups.
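The intra- versus interspecies comparison described above can be illustrated with a minimal Python sketch. It is not the procedure used in this study: it substitutes a simple uncorrected p-distance (proportion of differing sites) for model-based branch lengths, and the function names and the toy sequences are hypothetical, chosen only to show the logic of checking that every interspecies distance exceeds every intraspecies distance.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Sites with a gap ('-') or ambiguous base ('N') in either sequence are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = differing = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in "-N" or y in "-N":
            continue
        compared += 1
        differing += x != y
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def intra_inter_summary(seqs):
    """seqs: list of (species_label, aligned_sequence) pairs.

    Returns (largest intraspecies distance, smallest interspecies distance).
    """
    intra, inter = [], []
    for (sp1, s1), (sp2, s2) in combinations(seqs, 2):
        (intra if sp1 == sp2 else inter).append(p_distance(s1, s2))
    return max(intra), min(inter)


# Made-up 20 bp sequences for illustration only (not data from this study).
toy = [
    ("species_A", "ACGTACGTACGTACGTACGT"),
    ("species_A", "ACGTACGTACGTACGTACGA"),
    ("species_B", "ACGTTCGTACCTACGTAGGT"),
    ("species_B", "ACGTTCGTACCTACGTAGGA"),
]
max_intra, min_inter = intra_inter_summary(toy)
species_are_separable = min_inter > max_intra
print(max_intra, min_inter, species_are_separable)
```

Under this simplification, a sample with no conspecific partner can still be assessed by asking whether its distance to its nearest neighbor exceeds the largest intraspecies distance observed elsewhere in the dataset.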
Mus musculus (mouse), Oryctolagus cuniculus (rabbit) and Gallus gallus (chicken) sequences of the 87 bp TRSP transcription unit were used to search the NCBI Trace Archives of M. eugenii. The Trace Archives' sequences of M. eugenii with the highest percent identity to the mouse, rabbit and chicken TRSP were aligned with VectorNTI version 10 (Invitrogen: Carlsbad, CA, USA) to find the largest contiguous trace sequence containing TRSP.
The M. eugenii trace sequence accession number [Genbank:976005645] was found to have the highest percent identity to both the other trace sequences from M. eugenii as well as TRSP from mouse and rabbit. Primer3 was then used to construct primers from [Genbank:976005645] spanning the TRSP gene region (Additional data file 7). Primers to the TRSP gene itself were designed using the Ornithorhynchus anatinus (platypus) trace sequence [Genbank:188164072] (Additional data file 7).
The nuclear gene region of TRSP (688 bp) was sequenced from two individuals of each species. Direct sequencing was performed as above. Within W. bicolor, M. robustus and M. antilopinus, some regions of this gene were not amenable to direct sequencing and, thus, were subcloned into pGEM-T Easy (Promega: Madison, WI, USA) and then sequenced in triplicate from the plasmid clones. The TRSP gene from P. xanthopus and T. thetis was PCR amplified and direct sequenced as an outgroup for each species.
Identical tree topologies were generated using MrBayes [72, 73], MEGA and PhyML. These analyses were used to infer phylogenetic relationships for Cyt b, TRSP, and Cyt b-TRSP concatenated together. T. thetis and P. xanthopus are outgroups to the Macropus dataset. Within MrBayes, the general time reversible (GTR) model gave the greatest ML and clade credibility values across all three datasets. Five Markov Chain Monte Carlo chains were run for 1,000,000 generations, sampling every generation with a burn-in of 1,000 generations. Potential scale reduction factor values all converged on 1.000 by the conclusion of the runs. For each of the Cyt b and TRSP datasets, tree topology did not change with respect to model choice. When the two datasets were concatenated to form a contiguous sequence, only the supported tree topology within the wallabies changed with respect to the model.
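For readers unfamiliar with the potential scale reduction factor used above as a convergence diagnostic, the following Python sketch computes it for a single scalar parameter sampled by several chains. It uses the standard Gelman-Rubin form and is an assumption-laden illustration: the exact computation performed inside MrBayes may differ in detail, the function name `psrf` is hypothetical, and the simulated chains are random numbers rather than output from this study.

```python
import random
from statistics import mean, variance


def psrf(chains, burn_in=0):
    """Potential scale reduction factor for one parameter.

    chains: list of equal-length lists of sampled values, one list per chain.
    burn_in: number of initial samples discarded from every chain.
    """
    kept = [chain[burn_in:] for chain in chains]
    n = len(kept[0])
    if len(kept) < 2 or n < 2 or any(len(c) != n for c in kept):
        raise ValueError("need at least two chains of equal length >= 2")
    within = mean([variance(c) for c in kept])           # W: mean within-chain variance
    between_over_n = variance([mean(c) for c in kept])   # B/n: variance of chain means
    pooled = (n - 1) / n * within + between_over_n       # pooled variance estimate
    return (pooled / within) ** 0.5


rng = random.Random(0)

# Five simulated chains sampling the same distribution: PSRF should be close to 1.
converged = [[rng.gauss(0.0, 1.0) for _ in range(3000)] for _ in range(5)]
psrf_converged = psrf(converged, burn_in=1000)

# Five simulated chains stuck around different means: PSRF is well above 1.
stuck = [[rng.gauss(float(i), 1.0) for _ in range(3000)] for i in range(5)]
psrf_stuck = psrf(stuck, burn_in=1000)

print(round(psrf_converged, 3), round(psrf_stuck, 3))
```

Values near 1 indicate that between-chain variation is negligible relative to within-chain variation, which is the sense in which the reported values "converged on 1.000".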
ML trees were generated in PhyML using the following parameters: GTR nucleotide substitution model, discrete gamma model with four categories, and shape parameter, proportion of invariant sites and nucleotide frequencies estimated from the data. Bootstrap values were also generated in PhyML (shown in Figure 2). The ML trees, and an additional 18 tree topologies obtained through rearrangement, were used for the Shimodaira-Hasegawa test in TREE-PUZZLE 5.2.
Multiple genome rearrangements tool
The web-based software MGR [28, 75], designed for constructing phylogenies based on gene order for multichromosomal rearrangements, was used to construct a phylogeny based on syntenic block rearrangement across the clade (Figure 3). Only fissions, fusions, inversions and translocations are considered significant. Rearrangement events between syntenic blocks or chromosomes occur one at a time. As per the rules of this program, no unitary associations were locked, meaning breakpoint reuse was allowed. The input was oriented relative to the Thylogale syntenic arrangement and the output was an unrooted parsimony tree.
Thylogale, a macropodid possessing the ancestral familial karyotype and diploid number of 2n = 22, was coded such that each syntenic block was oriented and numbered according to . T. thetis also shares its autosomal karyotype with P. xanthopus. All other species included in this analysis experienced reductions in chromosome number relative to T. thetis and were coded relative to this ancestral form. The 2n = 16 karyotype of M. eugenii is shared with M. agilis, M. rufogriseus and M. parma. The 2n = 16 karyotype of M. robustus, defined by a different suite of fusions, is shared with the other wallaroo species, M. antilopinus.
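The coding of karyotypes as oriented, numbered syntenic blocks can be sketched in Python as follows. This is not the MGR algorithm and does not reproduce its input format; it only illustrates, under stated assumptions, how derived block adjacencies (fusion points) can be compared between lineages to flag candidate breakpoint reuse. The block numbers and the three toy karyotypes are hypothetical and do not correspond to the syntenic blocks or species analyzed here.

```python
def adjacencies(karyotype):
    """Return the set of block-end adjacencies in a karyotype.

    karyotype: list of chromosomes, each a tuple of signed block numbers.
    A positive block is read tail -> head, a negative block head -> tail, so
    a chromosome and its reverse complement give the same adjacencies.
    """
    result = set()
    for chrom in karyotype:
        for x, y in zip(chrom, chrom[1:]):
            right_end_of_x = (abs(x), "h" if x > 0 else "t")
            left_end_of_y = (abs(y), "t" if y > 0 else "h")
            result.add(frozenset((right_end_of_x, left_end_of_y)))
    return result


def shared_derived_adjacencies(ancestor, lineage_1, lineage_2):
    """Adjacencies absent from the ancestor but present in both lineages."""
    ancestral = adjacencies(ancestor)
    return (adjacencies(lineage_1) - ancestral) & (adjacencies(lineage_2) - ancestral)


# Hypothetical ancestor: five blocks, each on its own chromosome.
ancestor = [(1,), (2,), (3,), (4,), (5,)]
# Hypothetical lineage A: fusions joining blocks 1+2 and 4+5.
lineage_a = [(1, 2), (3,), (4, 5)]
# Hypothetical lineage B: the same 1+2 fusion (written in reverse), plus 3+4.
lineage_b = [(-2, -1), (3, 4), (5,)]

reused = shared_derived_adjacencies(ancestor, lineage_a, lineage_b)
print(reused)
```

In this toy case only the junction between blocks 1 and 2 is reported, because it is the single derived adjacency present in both lineages; whether such a shared junction reflects common ancestry or independent reuse has to be judged against the gene-based phylogeny.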
Cross-species fluorescence in situ hybridization
Mrb-sat1, Mrb-B29, and Mrb-sat23 clones, isolated following microdissection of the M. rufogriseus X chromosome, were PCR labeled with biotin-16-dUTP (Roche: Basel, Switzerland) as per the manufacturer's instructions. M. rufogriseus FISH were performed as per . T. thetis chromosomes were unattainable for this study, though this species has a karyotype configuration equivalent to P. xanthopus. All cross-species FISH experiments were hybridized at 37°C in 50% hybridization solution (50% formamide, 2 × SSC, 500 ng/ml salmon sperm DNA, 200 ng probe) and washed at room temperature (three 50% formamide/2 × SSC washes for 5 minutes each, followed by three 2 × SSC rinses at room temperature). Slides were blocked with 4 × SSC/0.2% Tween-20/5% bovine serum albumin before avidin rhodamine (Texas Red; Invitrogen: Carlsbad, CA, USA) incubation at 37°C for 30 minutes. When needed, antibody layering consisted of, first, avidin Texas Red; second, anti-rhodamine biotin; and third, avidin Texas Red.
For each species, we determined the most stringent conditions required to obtain FISH signal without losing signal integrity. Hybridization time and layering varied as follows: M. eugenii with all three probes, and M. agilis with Mrb-B29 and Mrb-sat23 were hybridized for two nights and detected with one layer; M. parma, M. rufus, and M. giganteus with all three probes, W. bicolor with Mrb-B29, M. robustus with Mrb-B29 and Mrb-sat23, and M. antilopinus with Mrb-B29 were hybridized for three nights and detected with one layer; W. bicolor with Mrb-sat1 and Mrb-sat23, P. xanthopus with Mrb-B29 and Mrb-sat23, and M. antilopinus with Mrb-sat23 were hybridized for three nights and detected with three layers.
Because of low signal strength from the sat1 probes, due to cross species variation, sat1 FISH to M. agilis, M. antilopinus, M. robustus and P. xanthopus used pooled sat1 probes derived from M. robustus, M. parma, W. bicolor, and M. rufus species (probes named Mrob-sat1, Mpm-sat1, Wbi-sat1 and Mrfs-sat1, respectively). The pooled sat1 probes from these species were PCR amplified with Mrb-sat1 primers (Additional data file 7). PCR products were cloned and sequenced, with a range of 74.8-91.5% identity to Mrb-sat1, verifying PCR product identity given the average sequence identity observed for satellites both within one genome (50-100% between different monomers) and between species [76, 77]. Clones were PCR labeled for FISH as above. Probes were pooled during precipitation (200 ng of each) prior to rehydration in the hybridization solution. Pooled probes were hybridized for four nights, and detected with three layers, as above. All FISH conditions are described in Additional data file 2.
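The hybridization conditions stated in the two preceding paragraphs can be collected into a lookup table, which makes it easier to confirm that every species and probe combination is covered exactly once. The Python sketch below is only a convenience transcription of those sentences; the short probe labels, the `record` helper and the tuple layout are illustrative choices, and Additional data file 2 remains the authoritative listing. M. rufogriseus, the source species of the probes, is not included.

```python
PROBES = ("sat1", "B29", "sat23")

# (species, probe) -> (nights hybridized, detection layers, pooled sat1 probe used)
conditions = {}


def record(species_list, probes, nights, layers, pooled=False):
    """Store one condition for every species/probe pair, rejecting duplicates."""
    for species in species_list:
        for probe in probes:
            key = (species, probe)
            if key in conditions:
                raise ValueError(f"duplicate entry for {key}")
            conditions[key] = (nights, layers, pooled)


# Two nights, one layer
record(["M. eugenii"], PROBES, 2, 1)
record(["M. agilis"], ("B29", "sat23"), 2, 1)
# Three nights, one layer
record(["M. parma", "M. rufus", "M. giganteus"], PROBES, 3, 1)
record(["W. bicolor"], ("B29",), 3, 1)
record(["M. robustus"], ("B29", "sat23"), 3, 1)
record(["M. antilopinus"], ("B29",), 3, 1)
# Three nights, three layers
record(["W. bicolor"], ("sat1", "sat23"), 3, 3)
record(["P. xanthopus"], ("B29", "sat23"), 3, 3)
record(["M. antilopinus"], ("sat23",), 3, 3)
# Pooled sat1 probes: four nights, three layers
record(["M. agilis", "M. antilopinus", "M. robustus", "P. xanthopus"],
       ("sat1",), 4, 3, pooled=True)

species = sorted({sp for sp, _ in conditions})
missing = [(sp, p) for sp in species for p in PROBES if (sp, p) not in conditions]
print(len(species), len(conditions), missing)
```

Transcribed this way, the text accounts for nine species and all three probes with no combination left unassigned.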
Slides were mounted with DAPI/Vectashield (Vector Laboratories: Burlingame, CA, USA) mounting media. Images were captured with a Leica DM6000B microscope with a DFC350FX-R2 digital camera and analyzed with Leica CW4000 Cytogenetics Karyotype software (Leica Microsystems: Bannockburn, IL, USA).
Additional data files
The following additional data are available with the online version of this paper. Additional data file 1 is a list of all species names and corresponding accession numbers used in phylogenetic studies. Additional data file 2 lists the cross-species FISH hybridization conditions. For each species (left), probes used are indicated (top). For the pooled probe set, a combination of sat1 sequences from Mrob, Mpm, Wbi, Mrfs were used in one hybridization reaction. Hybridization time is indicated by the number (hyb #) of days probe is incubated at 37°C. The number of antibody detection layers is also indicated. All other conditions are described in the Materials and methods. Additional data files 3, 4 and 5 are the FISH for each satellite (sat1, B29 and sat23, respectively) to each species used in these analyses. Probe images are in red and metaphase chromosomes are inverted DAPI. Additional data file 6 shows the Southern analyses of each satellite (sat1, B29 and sat23) to each species used in these analyses. Additional data file 7 is a list of all primer sequences used in sequence and phylogenetic analyses. Cyt b nucleotide positions are numbered according to M. robustus numbering [GenBank:Y10524]; Cyt b spans 14,184 bp to 15,329 bp.
Abbreviations
- Cyt b: Cytochrome b
- FISH: fluorescence in situ hybridization
- GTR: general time reversible
- MGR: multiple genome rearrangement
Henikoff S, Ahmad K, Malik HS: The centromere paradox: stable inheritance with rapidly evolving DNA. Science. 2001, 293: 1098-1102. 10.1126/science.1062939.
Choo KHA: The Centromere. 1997, Oxford, New York: Oxford University Press
Meraldi P, McAinsh AD, Rheinbay E, Sorger PK: Phylogenetic and structural analysis of centromeric DNA and kinetochore proteins. Genome Biol. 2006, 7: R23-10.1186/gb-2006-7-3-r23.
Vos LJ, Famulski JK, Chan GK: How to build a centromere: from centromeric and pericentromeric chromatin to kinetochore assembly. Biochem Cell Biol. 2006, 84: 619-639. 10.1139/O06-078.
Warburton PE: Chromosomal dynamics of human neocentromere formation. Chromosome Res. 2004, 12: 617-626. 10.1023/B:CHRO.0000036585.44138.4b.
Ferreri GC, Liscinsky DM, Mack JA, Eldridge MD, O'Neill RJ: Retention of latent centromeres in the mammalian genome. J Hered. 2005, 96: 217-224. 10.1093/jhered/esi029.
Amor DJ, Choo KH: Neocentromeres: role in human disease, evolution, and centromere study. Am J Hum Genet. 2002, 71: 695-714. 10.1086/342730.
Ventura M, Mudge JM, Palumbo V, Burn S, Blennow E, Pierluigi M, Giorda R, Zuffardi O, Archidiacono N, Jackson MS, et al: Neocentromeres in 15q24-26 map to duplicons which flanked an ancestral centromere in 15q25. Genome Res. 2003, 13: 2059-2068. 10.1101/gr.1155103.
Bailey JA, Baertsch R, Kent WJ, Haussler D, Eichler EE: Hotspots of mammalian chromosomal evolution. Genome Biol. 2004, 5: R23-10.1186/gb-2004-5-4-r23.
Ferguson-Smith MA, Yang F, Rens W, O'Brien PC: The impact of chromosome sorting and painting on the comparative analysis of primate genomes. Cytogenet Genome Res. 2005, 108: 112-121. 10.1159/000080809.
O'Neill RJ, Eldridge MD, Metcalfe CJ: Centromere dynamics and chromosome evolution in marsupials. J Hered. 2004, 95: 375-381. 10.1093/jhered/esh063.
Rens W, O'Brien PC, Fairclough H, Harman L, Graves JA, Ferguson-Smith MA: Reversal and convergence in marsupial chromosome evolution. Cytogenet Genome Res. 2003, 102: 282-290. 10.1159/000075764.
Rofe R: G-banded chromosomes and the evolution of Macropodidae. Aust Mammol. 1979, 2: 53-63.
Hayman D: Marsupial cytogenetics. Aust J Zool. 1990, 37: 331-349. 10.1071/ZO9890331.
Sharman GB, Close RL, Maynes M: Chromosomal evolution, phylogeny and speciation of rock wallabies (Petrogale: Macropodidae). Aust J Zool. 1989, 37: 351-363. 10.1071/ZO9890351.
Eldridge MD, Johnston PG: Chromosomal rearrangements in rock wallabies, Petrogale (Marsupialia: Macropodidae). VIII. An investigation of the nonrandom nature of karyotypic change. Genome. 1993, 36: 524-534.
Hayman DL, Martin PG: Mammalia I: Monotremata and Marsupialia. Animal Cytogenetics. Edited by: Bernard J. 1974, Berlin-Stuttgart: Gebruder Borntraeger, 4: 1-110.
Burk A, Springer M: Intergeneric relationships among Macropodoidea (Metatheria: Diprotodontia) and the chronicle of kangaroo evolution. J Mamm Evol. 2000, 7: 213-237. 10.1023/A:1009488431055.
Flannery TF: Phylogeny of the Macropodoidae: A study in convergence. Kangaroos, Wallabies and Rat-Kangaroos. Edited by: Grigg PJ, Hume I. 1989, Chipping Norton, Australia: Surrey Beatty & Sons, 1-46.
Kirsch JA, Lapointe F, Foeste A: Resolution of portions of the kangaroo phylogeny (Marsupialia : Macropodidae) using DNA hybridization. Biol J Linn Soc. 1995, 55: 309-328.
Kirsch JA, Lapointe F, Springer MS: DNA-hybridization studies of marsupials and their implications for metatherian classification. Aust J Zool. 1997, 45: 211-280. 10.1071/ZO96030.
Baverstock PR, Krieg M, Birrell J: Evolutionary relationships of Australian marsupials as assessed by albumin immunology. Aust J Zool. 1990, 37: 273-287. 10.1071/ZO9890273.
Kirsch JA: The comparative serology of Marsupialia, and a classification of marsupials. Aust J Zool. 1977, 1-152. Supplementary series 52
Salser W, Bowen S, Browne D, el-Adli F, Fedoroff N, Fry K, Heindell H, Paddock G, Poon R, Wallace B, et al: Investigation of the organization of mammalian chromosomes at the DNA sequence level. Fed Proc. 1976, 35: 23-35.
Schindelhauer D, Schwarz T: Evidence for a fast, intrachromosomal conversion mechanism from mapping of nucleotide variants within a homogeneous alpha-satellite DNA array. Genome Res. 2002, 12: 1815-1826. 10.1101/gr.451502.
Pons J, Bruvo B, Petitpierre E, Plohl M, Ugarkovic D, Juan C: Complex structural features of satellite DNA sequences in the genus Pimelia (Coleoptera: Tenebrionidae): random differential amplification from a common 'satellite DNA library'. Heredity. 2004, 92: 418-427. 10.1038/sj.hdy.6800436.
Bulazel K, Metcalfe C, Ferreri GC, Yu J, Eldridge MD, O'Neill RJ: Cytogenetic and molecular evaluation of centromere-associated DNA sequences from a marsupial (Macropodidae: Macropus rufogriseus) X chromosome. Genetics. 2006, 172: 1129-1137. 10.1534/genetics.105.047654.
Bourque G, Pevzner PA: Genome-scale evolution: reconstructing gene orders in the ancestral species. Genome Res. 2002, 12: 26-36.
Close RL, Lowry PS: Hybrids in marsupial research. Aust J Zool. 1990, 37: 259-267. 10.1071/ZO9890259.
Bardeleben C, Moore RL, Wayne RK: Isolation and molecular evolution of the selenocysteine tRNA (Cf TRSP) and RNase P RNA (Cf RPPH1) genes in the dog family, Canidae. Mol Biol Evol. 2005, 22: 347-359. 10.1093/molbev/msi022.
Bininda-Emonds OR, Gittleman JL, Purvis A: Building large trees by combining phylogenetic information: a complete phylogeny of the extant Carnivora (Mammalia). Biol Rev Camb Philos Soc. 1999, 74: 143-175. 10.1017/S0006323199005307.
Wayne RK, Van Valkenburgh B, O'Brien SJ: Molecular distance and divergence time in carnivores and primates. Mol Biol Evol. 1991, 8: 297-319.
Graphodatsky AS, Yang F, O'Brien PC, Serdukova N, Milne BS, Trifonov V, Ferguson-Smith MA: A comparative chromosome map of the Arctic fox, red fox and dog defined by chromosome painting and high resolution G-banding. Chromosome Res. 2000, 8: 253-263. 10.1023/A:1009217400140.
Shimodaira H, Hasegawa M: Multiple comparisons of Log-likelihoods with applications to phylogenetic inference. Mol Biol Evol. 1999, 16: 1114-1116.
Schmidt HA, Strimmer K, Vingron M, von Haeseler A: TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics. 2002, 18: 502-504. 10.1093/bioinformatics/18.3.502.
Toder R, O'Neill RJ, Wienberg J, O'Brien PC, Voullaire L, Marshall-Graves JA: Comparative chromosome painting between two marsupials: origins of an XX/XY1Y2 sex chromosome system. Mamm Genome. 1997, 8: 418-422. 10.1007/s003359900459.
Archer M: Vertebrate Zoogeography and Evolution in Australasia. 1984, Marrickville, New South Wales: Southwood Press
Barnett R, Yamaguchi N, Barnes I, Cooper A: The origin, current diversity and future conservation of the modern lion (Panthera leo). Proc Biol Sci. 2006, 273: 2119-2125. 10.1098/rspb.2006.3555.
Bickham JW, Patton JC, Schlitter DA, Rautenbach IL, Honeycutt RL: Molecular phylogenetics, karyotypic diversity, and partition of the genus Myotis (Chiroptera: Vespertilionidae). Mol Phylogenet Evol. 2004, 33: 333-338. 10.1016/j.ympev.2004.06.012.
Bowen BW, Muss A, Rocha LA, Grant WS: Shallow mtDNA coalescence in Atlantic pygmy angelfishes (genus Centropyge) indicates a recent invasion from the Indian Ocean. J Hered. 2006, 97: 1-12. 10.1093/jhered/esj006.
Bunch TD, Wu C, Zhang YP, Wang S: Phylogenetic analysis of snow sheep (Ovis nivicola) and closely related taxa. J Hered. 2006, 97: 21-30. 10.1093/jhered/esi127.
Feng J, Lajia C, Taylor DJ, Webster MS: Genetic distinctiveness of endangered dwarf blue sheep (Pseudois nayaur schaeferi): evidence from mitochondrial control region and Y-linked ZFY intron sequences. J Hered. 2001, 92: 9-15. 10.1093/jhered/92.1.9.
Hiendleder S, Kaupe B, Wassmuth R, Janke A: Molecular analysis of wild and domestic sheep questions current nomenclature and provides evidence for domestication from two different subspecies. Proc Biol Sci. 2002, 269: 893-904. 10.1098/rspb.2002.1975.
Huchard E, Martinez M, Alout H, Douzery EJ, Lutfalla G, Berthomieu A, Berticat C, Raymond M, Weill M: Acetylcholinesterase genes within the Diptera: takeover and loss in true flies. Proc Biol Sci. 2006, 273: 2595-2604. 10.1098/rspb.2006.3621.
Huo G, Jiang G, Sun Z, Liu D, Zhang Y, Lu L: Phylogenetic reconstruction of the family acrypteridae (orthoptera: acridoidea) based on mitochondrial cytochrome B gene. J Genet Genomics. 2007, 34: 294-306. 10.1016/S1673-8527(07)60031-9.
Jansa SA, Weksler M: Phylogeny of muroid rodents: relationships within and among major lineages as determined by IRBP gene sequences. Mol Phylogenet Evol. 2004, 31: 256-276. 10.1016/j.ympev.2003.07.002.
Kawai K, Nikaido M, Harada M, Matsumura S, Lin LK, Wu Y, Hasegawa M, Okada N: The status of the Japanese and East Asian bats of the genus Myotis (Vespertilionidae) based on mitochondrial sequences. Mol Phylogenet Evol. 2003, 28: 297-307. 10.1016/S1055-7903(03)00121-0.
Larson G, Cucchi T, Fujita M, Matisoo-Smith E, Robins J, Anderson A, Rolett B, Spriggs M, Dolman G, Kim TH, et al: Phylogeny and ancient DNA of Sus provides insights into neolithic expansion in Island Southeast Asia and Oceania. Proc Natl Acad Sci USA. 2007, 104: 4834-4839. 10.1073/pnas.0607753104.
Pavlova A, Rohwer S, Drovetski SV, Zink RM: Different post-Pleistocene histories of Eurasian parids. J Hered. 2006, 97: 389-402. 10.1093/jhered/esl011.
Reed LK, Nyboer M, Markow TA: Evolutionary relationships of Drosophila mojavensis geographic host races and their sister species Drosophila arizonae. Mol Ecol. 2007, 16: 1007-1022. 10.1111/j.1365-294X.2006.02941.x.
Roberts E, Shoureshi P, Kozak K, Szynskie L, Baron A, Lecaude S, Dores RM: Tracking the evolution of the proenkephalin gene in tetrapods. Gen Comp Endocrinol. 2007, 153: 189-197. 10.1016/j.ygcen.2007.02.023.
Ruedi M, Mayer F: Molecular systematics of bats of the genus Myotis (Vespertilionidae) suggests deterministic ecomorphological convergences. Mol Phylogenet Evol. 2001, 21: 436-448. 10.1006/mpev.2001.1017.
Sato JJ, Hosoda T, Wolsan M, Suzuki H: Molecular phylogeny of arctoids (Mammalia: Carnivora) with emphasis on phylogenetic and taxonomic positions of the ferret-badgers and skunks. Zoolog Sci. 2004, 21: 111-118. 10.2108/zsj.21.111.
Sato JJ, Hosoda T, Wolsan M, Tsuchiya K, Yamamoto M, Suzuki H: Phylogenetic relationships and divergence times among mustelids (Mammalia: Carnivora) based on nucleotide sequences of the nuclear interphotoreceptor retinoid binding protein and mitochondrial cytochrome b genes. Zoolog Sci. 2003, 20: 243-264. 10.2108/zsj.20.243.
Slamovits CH, Cook JA, Lessa EP, Rossi MS: Recurrent amplifications and deletions of satellite DNA accompanied chromosomal diversification in South American tuco-tucos (genus Ctenomys, Rodentia: Octodontidae): a phylogenetic approach. Mol Biol Evol. 2001, 18: 1708-1719.
Sullivan JP, Lundberg JG, Hardman M: A phylogenetic analysis of the major groups of catfishes (Teleostei: Siluriformes) using rag1 and rag2 nuclear gene sequences. Mol Phylogenet Evol. 2006, 41: 636-662. 10.1016/j.ympev.2006.05.044.
Tserenbataa T, Ramey RR, Ryder OA, Quinn TW, Reading RP: A population genetic comparison of argali sheep (Ovis ammon) in Mongolia using the ND5 gene of mitochondrial DNA; implications for conservation. Mol Ecol. 2004, 13: 1333-1339. 10.1111/j.1365-294X.2004.02123.x.
van Rheede T, Bastiaans T, Boone DN, Hedges SB, de Jong WW, Madsen O: The platypus is in its place: nuclear genes and indels confirm the sister group relation of monotremes and Therians. Mol Biol Evol. 2006, 23: 587-597. 10.1093/molbev/msj064.
Malik HS, Bayes JJ: Genetic conflicts during meiosis and the evolutionary origins of centromere complexity. Biochem Soc Trans. 2006, 34: 569-573. 10.1042/BST0340569.
Mestrovic N, Plohl M, Mravinac B, Ugarkovic D: Evolution of satellite DNAs from the genus Palorus - experimental evidence for the "library" hypothesis. Mol Biol Evol. 1998, 15: 1062-1068.
Lin CC, Li YC: Chromosomal distribution and organization of three cervid satellite DNAs in Chinese water deer (Hydropotes inermis). Cytogenet Genome Res. 2006, 114: 147-154. 10.1159/000093331.
Li YC, Cheng YM, Hsieh LJ, Ryder OA, Yang F, Liao SJ, Hsiao KM, Tsai FJ, Tsai CH, Lin CC: Karyotypic evolution of a novel cervid satellite DNA family isolated by microdissection from the Indian muntjac Y-chromosome. Chromosoma. 2005, 114: 28-38. 10.1007/s00412-005-0335-7.
Imai HT, Satta Y, Takahata N: Integrative study on chromosome evolution of mammals, ants and wasps based on the minimum interaction theory. J Theor Biol. 2001, 210: 475-497. 10.1006/jtbi.2001.2327.
Slijepcevic P: Telomeres and mechanisms of Robertsonian fusion. Chromosoma. 1998, 107: 136-140. 10.1007/s004120050289.
Samonte RV, Eichler EE: Segmental duplications and the evolution of the primate genome. Nat Rev Genet. 2002, 3: 65-72. 10.1038/nrg705.
She X, Horvath JE, Jiang Z, Liu G, Furey TS, Christ L, Clark R, Graves T, Gulden CL, Alkan C, et al: The structure and evolution of centromeric transition regions within the human genome. Nature. 2004, 430: 857-864. 10.1038/nature02806.
Eichler EE: Repetitive conundrums of centromere structure and function. Hum Mol Genet. 1999, 8: 151-155. 10.1093/hmg/8.2.151.
Metcalfe CJ: Telomeres and Chromosome Evolution in Marsupials. 2003, Sydney, Australia: Macquarie University
Rozen S, Skaletsky H: Primer3 on the WWW for general users and for biologist programmers. Methods Mol Biol. 2000, 132: 365-386.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 25: 4876-4882. 10.1093/nar/25.24.4876.
Kumar S, Tamura K, Nei M: MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment. Brief Bioinform. 2004, 5: 150-163. 10.1093/bib/5.2.150.
Huelsenbeck JP, Ronquist F: MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics. 2001, 17: 754-755. 10.1093/bioinformatics/17.8.754.
Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19: 1572-1574. 10.1093/bioinformatics/btg180.
Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52: 696-704. 10.1080/10635150390235520.
Rudd MK, Wray GA, Willard HF: The evolutionary dynamics of alpha-satellite. Genome Res. 2006, 16: 88-96. 10.1101/gr.3810906.
Schueler MG, Higgins AW, Rudd MK, Gustashaw K, Willard HF: Genomic and genetic definition of a functional human centromere. Science. 2001, 294: 109-115. 10.1126/science.1065042.
Thank you to the staff of the Macquarie University Fauna Park for their help in cell and DNA sample collection and to the UNSW Fauna Park and Wonderland Sydney for their contributed samples. Special thanks to Dr Peter Gogarten for advice on phylogenetic analyses, Dr Robert Friedman for his advice on bioinformatics, Dr Cushla Metcalfe for her FISH advice and Dr Mike O'Neill for critical evaluation of the manuscript. KB was supported in Australia by a J William Fulbright scholarship. RJO, KB, and GF were supported by a grant from the NSF (0093250) and the University of Connecticut Research Foundation. MDBE was supported by the Australian Research Council and Macquarie University.
KVB performed all experiments and analyses herein and wrote the manuscript, GCF performed Southern analyses, MDBE provided tissue and DNA samples and edited the manuscript, and RJO provided general project oversight and co-wrote the manuscript.
Cite this article
Bulazel, K.V., Ferreri, G.C., Eldridge, M.D. et al. Species-specific shifts in centromere sequence composition are coincident with breakpoint reuse in karyotypically divergent lineages. Genome Biol 8, R170 (2007). https://doi.org/10.1186/gb-2007-8-8-r170