Escape from X inactivation in mice and humans
© BioMed Central Ltd 2010
Published: 24 June 2010
Skip to main content
© BioMed Central Ltd 2010
Published: 24 June 2010
A subset of X-linked genes escapes silencing by X inactivation and is expressed from both X chromosomes in mammalian females. Species-specific differences in the identity of these genes have recently been discovered, suggesting a role in the evolution of sex differences. Chromatin analyses have aimed to discover how genes remain expressed within a repressive environment.
Regulation of the X chromosome in eutherian mammals
X upregulation is the process by which the active X chromosome (Xa) is upregulated in both sexes to balance expression between the X and the autosomes. The molecular mechanisms of the process are unknown.
Random X inactivation is the process by which one X chromosome is silenced in female embryos in order to avoid X hyperexpression due to X upregulation and to balance gene dosage between the sexes. Random X inactivation silences either the paternal or maternal X chromosome at the blastocyst stage and persists into adulthood. The noncoding RNA transcribed from Xist is essential for the onset of silencing. Xist RNA coats the X chromosome in cis and recruits a protein complex to establish repressive epigenetic modifications and implement gene silencing. Escape from random X inactivation affects about 15% of human genes and 3% of mouse genes, most of these genes being protein coding.
Imprinted paternal X inactivation is the process by which the paternal X chromosome is silenced in early female embryos before implantation. This paternal X inactivation persists in extraembryonic tissues (as shown in mice, but not well studied in humans) but is reversed in the inner cell mass before random X inactivation. This silencing process is Xist dependent, although it is controversial whether Xist is necessary for initiation. Escape from imprinted paternal X inactivation has been observed for some genes (which may differ from those that escape random X inactivation), but no complete survey is available.
Meiotic sex-chromosome inactivation (MSCI) is the process of silencing of both the X and Y chromosomes in male meiosis and occurs in almost all organisms that have differentiated sex chromosomes, including humans. As in somatic X inactivation, Xist RNA coats the X chromosome during MSCI. However, Xist is not required for silencing. MSCI is associated with recruitment of DNA repair proteins such as the histone variant H2AX and MDC1 (mediator of DNA damage checkpoint 1). Escape from MSCI characterizes a set of miRNA genes such as mir-221, mir-374, mir-470 and mir-741, which may be important for spermatogenesis.
Initiation of X inactivation in female embryos depends on the transcription of the long noncoding RNA XIST/Xist (X-inactive specific transcript) from one chromosome (which will become the inactive X (Xi)) and recruitment of a protein complex important for X-chromosome silencing and heterochromatin formation [7, 8]. In humans, XIST (17 kb in size) is located in the long arm of the X chromosome, whereas in mice where there is only one arm, Xist (15 kb in size) is in the middle of the chromosome. Xist RNA spreads along the X chromosome in cis and recruits a protein complex responsible for deposition of repressive histone modifications onto the Xi [9–11]. As a result the Xi becomes heterochromatic, silent and condensed. Before implantation, X inactivation is imprinted, with the paternal X chromosome always being silenced. At the blastocyst stage, the paternal X reactivates and random X inactivation takes place (see Table 1).
Although most genes on the Xi are silenced, some genes remain expressed from both the Xa and the Xi. Not surprisingly, genes that retain a Y-linked copy - for example, Kdm5c and Kdm5d (which encode histone demethylases) - escape X inactivation and thus have two expressed alleles in both male and female somatic tissues. However, not all 'escaping' genes have a Y copy, for example Car5b (carbonic anhydrase). Recent reports have shown striking differences between human and mouse regarding the identity and number of these 'escape' genes in somatic tissues [12, 13]. Why are there such species differences? Structural differences between the X chromosomes may play a role as well as selective pressure to maintain sex differences.
Escape from X inactivation is not limited to female somatic cells. Indeed, another type of silencing of the X takes place in male germ cells and is known as meiotic sex chromosome inactivation (MSCI; see Table 1). MSCI results in silencing of protein-coding messenger RNAs from the X chromosome, but a majority of the X-linked microRNAs (miRNAs) escape MSCI, suggesting that they play a role in male meiosis . How do genes escape silencing on the heterochromatic X chromosome, whether in somatic or germ cells? Many studies have shown that epigenetics plays a crucial role in X inactivation and escape [7, 15]. In this review, we will summarize recent progress made in the field of escape from X inactivation, compare the number and distribution of human and mouse escape genes, and discuss possible molecular mechanisms involved in genes escaping X inactivation.
Escape from X inactivation in other mammalian species has not been extensively characterized. Nonetheless, escape genes have been identified in marsupials, which differ from eutherian mammals in terms of key features of X inactivation - Xist is absent and the paternal X always silenced. At least four X-linked genes encoding glucose-6-phosphate dehydrogenase (G6PD), hypoxanthine guanine phosphoribosyl transferase (HPRT), phosphoglycerate kinase (PGK1), and a monocarboxylic acid transporter (SLC16A2) show incomplete silencing in a tissue- and species-dependent manner in marsupial females [16, 17].
In both human and mouse, many of the genes that escape X inactivation are expressed more strongly in females. In fact, one study has identified escape genes on the basis of expression levels in women with different numbers of X chromosomes . However, in both humans and mice, differences in levels of expression of the escape genes between males and females are small, indicating partial repression of the escape genes on the Xi [21, 22]. This was confirmed by measuring allele-specific expression of escape genes in humans and in mice [12, 13]. We hypothesize that the Xi allele is either partially silenced by adjacent repressive modifications or might lack modifications associated with X upregulation of the Xa. As we do not know yet what these modifications are, this hypothesis remains to be tested. It is expected that, compared with mice, men and women would demonstrate greater sex differences in X-linked gene expression as a result of the large number of escape genes. Whether such sex differences provide an evolutionary advantage remains to be explored. Possible evolutionary advantages would be, for example, higher expression in female reproductive organs or in neurological tissues, which could influence behavior. It should be noted that most studies about escape from X inactivation have been done using cell lines; thus, tissue-specific effects have not been fully addressed.
Escape genes play important roles in human diseases as women with a single X chromosome (X-chromosome monosomy; 45,X) have Turner syndrome, with severe phenotypes including ovarian dysgenesis, short stature, webbed neck, and other physical abnormalities . In addition, as many as 99% of 45,X embryos die in utero . Deficiency in escape genes is thought to play a major role in phenotypes observed in Turner patients . Because the Y chromosome protects men from these deficiencies, the most likely candidate genes would have a Y copy, except for genes that control female-specific phenotypes such as ovarian failure and thus, by definition, would not affect men. So far, the pseudoautosomal gene SHOX (SHORT STATURE HOMEBOX), which encodes a homeodomain transcription factor, is the only gene directly implicated in the short-stature phenotype . Interestingly, early lethality of 45,X embryos may be due to a defect in placenta differentiation, which is supported by the finding that many placental genes have much higher expression in 46,XX versus 45,X cells in differentiated human embryonic stem (ES) cells . Notably, the pseudoautosomal gene CSF2RA (colony-stimulating factor 2 receptor, alpha), which encodes a receptor for a hematopoietic differentiation factor, has more than ninefold higher expression in 46,XX versus 45,X cells, suggesting that this gene may be involved in placenta differentiation defects . In contrast, X0 mice have a near-normal phenotype and are fertile, although the number of oocytes is reduced, potentially as a result of the lack of sex-chromosome pairing . Meiotic arrest due to lack of pairing could be attenuated in mouse compared with human single-X oocytes because of self-pairing of the X in mouse .
The fact that few escape genes exist in the mouse is consistent with the significant differences in the impact of X-chromosome monosomy in female mice and in women . Genes that escape from X inactivation in humans but are subject to X inactivation in the mouse may be good candidates for genes responsible for Turner syndrome severe phenotypes. Pseudoautosomal genes may play a prominent role in these phenotypes, as already demonstrated for SHOX, and possibly for CSF2RA. Indeed, the mouse pseudoautosomal region contains only one gene, Sts (steroid sulfatase) , whereas all genes located in the pseudoautosomal region in humans are autosomal in the mouse and thus are not affected in X0 mice .
Another potential role for escape from X inactivation is in aging. Inappropriate reactivation of an X-linked gene, Otc, which encodes a urea cycle enzyme called ornithine transcarbamoylase, has been reported in mouse tissues . Furthermore, a recent study has found epigenetic alterations including X reactivation in a mouse model of accelerated aging due to telomere shortening . So far, no such reactivation of X-linked genes has been observed in humans. It will be important to determine whether environmental factors could cause inappropriate escape from X inactivation due to changes in epigenetic marks.
The Xi is distinguishable from its active counterpart by its epigenetic marks, including coating with Xist RNA. This is the earliest event in X inactivation during embryogenesis, and gene silencing follows within one or two cell cycles . Interestingly, Xist-induced silencing can only be achieved in early differentiating ES cells, and reaches a point of irreversibility. Just how Xist RNA is spread along the Xi is still not fully understood. One hypothesis suggests that long interspersed repetitive elements (L1) repeats are overrepresented on the X and may serve as 'booster' elements by anchoring Xist RNA to the chromosome, thus aiding spreading . Consistent with this hypothesis, human genes that escape X inactivation have fewer L1 repeats [6, 35, 36]. These genes are also enriched in specific sequence motifs such as Alu repeats and short motifs containing ACG/CGT at their 5' ends . In the mouse, another type of repeat - long terminal repeats (LTRs) - appears to be depleted on escape genes . These observations imply that Xist RNA coating could be deficient at genes escaping X inactivation. This was recently demonstrated in mouse myoblasts using RNA tagging and recovery of associated DNA (modified TRAP) method for identification of targets . In this study, escapees Kdm5c and Kdm6a, which encode chromatin-modifying histone lysine demethylases, were shown to be devoid of Xist RNA coating over their promoters and transcribed regions. Conversely, genes subjected to X inactivation, and L1 repeat elements themselves, recruited Xist RNA  (Figure 2b). Taken together, these studies support the idea that specific DNA sequence motifs are involved in recruitment of Xist RNA to the Xi.
While Xist RNA coating is important in the initiation of X inactivation, many other epigenetic modifications follow to silence the X and maintain silencing. An early repressive chromatin mark, tri-methylation of lysine 27 on histone H3 (H3K27me3), is recruited by the Polycomb complex of chromatin-modifying proteins, resulting in compaction of the silenced portion of the Xi (Figure 2a). Other repressive marks include H3K9me3 and the histone variant macroH2A1, which are also enriched on the Xi (Figure 2b) [7, 39]. Concomitantly, 'active' marks such as acetylation of histone H3 and H4 are lost from the silenced chromatin [7, 40]. Modifications characteristic of silenced genes contrast with those within escape genes, which remain euchromatic and harbor histone H3 and H4 acetylation [7, 41]. H3K4me3, another mark associated with transcriptional activity, is absent from most of the Xi except at discrete regions corresponding to areas of escape, as shown in female lymphoblasts  (Figure 2b). We recently demonstrated a lack of H3K27me3 at escape genes in mouse, which shows complete concordance in the cell line used to assay allelic expression .
The existence of discrete areas of 'escape chromatin' adjacent to silenced chromatin suggests the need for boundary elements, such as insulator sequences, that may block the spreading of heterochromatin into escape regions or prevent repressive marks from being added to escape domains (Figure 2). Supporting this idea are our findings that the insulator protein CTCF (CCCTC-binding factor), which binds known insulator sequences, binds to the transition region between the escape gene Kdm5c and the inactivated gene Iqsec2 (IQ motif and SEC7 domain-containing protein 2) in mouse, whereas in humans, the corresponding region between the same genes, which both escape X inactivation, does not bind CTCF . Furthermore, we have found that the CpG island at the 5' end of Kdm5c remains hypomethylated throughout mouse development, possibly because it is rendered inaccessible to DNA methyltransferases by CTCF binding (Figure 2b). CTCF-binding sites were also identified in other transition areas between escape and inactivated genes, suggesting that CTCF may play a role in the insulation of escape domains . However, a subsequent study showed that insertion of CTCF-binding sites from the HS4 insulator site (from the chicken β-globin gene cluster) at each end of a short reporter gene was not sufficient to protect it from silencing when inserted within an inactivated gene on the Xi in mouse cells . A more recent study reported that a bacterial artificial chromosome clone containing Kdm5c and its flanking regions retains its properties of escape even when inserted at other sites that are normally inactivated on the Xi in mouse cells . CTCF-binding sites may turn out not to be sufficient for insulation, and other elements within or around escape genes may be important.
In particular, the structure of chromatin may have an important role in insulation by looping specific regions out of the condensed Xi (Figure 2a) . Our recent X-chromatin profiles show a discontinuous distribution of the repressive chromatin mark H3K27me3 along the Xi, consistent with the presence of insulator elements and/or specific attachment sites for looped chromatin . However, in human × mouse hybrid cell lines, where the human X can be distinguished from the rodent background, repressive chromatin marks were found to be progressively diminished in the intergenic region between the inactivated RBM10 (RNA-binding motif protein 10) and the escape gene UBA1/UBE1 (ubiquitin-like modifier activating enzyme). Specifically, H3K9me3 and another histone modification associated with gene silencing, H4K20me3, were enriched in the last RBM10 exon but were already depleted approximately 2 kb upstream of UBA1/UBE1 .
Escape from X inactivation can vary between different tissues and/or individuals and the escape status can also be developmentally regulated. In humans, about 10% of X-linked genes show variation in escape in different tissues and/or individuals [12, 47]. Some escape genes may have a different chromatin structure throughout development, as suggested by the lack of promoter-restricted H3K4me2 in undifferentiated ES cells before X inactivation . Other escape genes may be initially silenced, and only reactivate in some tissues or with aging . Individual cells may also vary: in an analysis of single-cell allelic expression of Kdm5c in mouse, significant silencing in individual embryonic cells was observed in contrast to consistent expression from both alleles in adult cells . Differences in H3K27me3 enrichment on some genes in a tissue and developmental-stage-specific manner also suggest variability in escape . For example, enrichment in H3K27me3 along Mid1 (midline 1) in mouse embryos but not in adult liver suggests removal of the repressive mark in a tissue-specific manner. It is possible that the recently identified histone demethylases KDM6A and KDM6B may facilitate the removal of H3K27me3 at escape genes [50–52].
Imprinted X inactivation silences the paternal X during the preimplantation stage (see Table 1). This imprinting is reversed in the inner cell mass, and is followed by random X inactivation . It is not known whether imprinted X inactivation occurs in humans and the mechanisms for imprinted X inactivation in mice are still unclear. Are there genes that escape the initial imprinted X inactivation? Several recent studies have addressed this question by profiling transcriptional activity from the paternal X during early development. A specific set of genes apparently does escape imprinted X inactivation at the two-cell stage [53, 54]. However, another subset of genes shows a variable escape status during development and in a lineage-specific manner. For example, Huwe1 (HECT, UBA and WWE domain containing 1) shows no evidence of silencing during pre-implantation stages but is efficiently silenced after implantation, whereas Kdm5c is partially inactivated during the preimplantation stage but escapes fully throughout the rest of development, and Atrx (alpha thalassemia/mental retardation syndrome X-linked) is expressed from both alleles in extraembryonic ectoderm but not in trophectoderm (the precursor of some extraembryonic tissues in the preimplantation embryo), or in later embryos [13, 49, 53].
In male spermatogenesis, yet another type of X-chromosome silencing takes place - MSCI  (see Table 1). Unlike X inactivation in female somatic cells, where extensive analyses have catalogued the proportion of genes that escape silencing, no such study has been done so far for MSCI. However, the permissive mark H3K4me3 is present in discrete regions of the X in mouse pachytene spermatocytes. Furthermore, immunofluorescence staining for RNA polymerase II in these cells revealed several regions of transcriptional activity, suggesting areas of escape from MSCI . Another study revealed that up to 86% of the 72 known X-encoded miRNAs escape MSCI at different times during spermatogenesis. Some of the miRNAs were upregulated during MSCI and either downregulated or maintained in the context of postmeiotic sex chromatin . Recent evidence suggests that repression of the X chromosome due to MSCI persists, at least in part, into the mature sperm , which could be important for suppression of oogenesis-specific genes and/or dosage compensation by potentially enabling transmission of a partially inactivated paternal X . However, not all sex-linked genes remain inactivated following MSCI and evidence points to maintenance of post-meiotic X-chromosome repression being incomplete. In fact, about 18% of X-linked genes, especially multicopy genes, are expressed in postmeiotic cells .
X inactivation is an important process required to balance gene dosage in males and females. Equally important are those genes that escape X inactivation. Why is there a far greater number of X-linked genes that escape X inactivation in humans than in mice? Not only does the number of escape genes differ but also their location. Human escape genes exist in large domains of escape whereas mouse escape genes are scattered along the X chromosome. Their location in recent evolutionary strata in humans suggests a major role of sex chromosome evolution in the retention of escape genes. However, their retention may also be linked to their inherent ability to cause sex-specific differences in gene expression levels. We propose that the complexity of dosage compensation in mammals, which involves X upregulation, X inactivation, and escape from X inactivation, may have specific advantages in providing opportunities to modulate gene expression between the sexes in specific tissues. This may be especially advantageous in reproductive organs. Whether sex differences do lead to physiological effects remains to be determined. Specific epigenetic mechanisms may have evolved to ensure maintenance of escape from X inactivation. These may include the accumulation of repeats and DNA motifs to recruit or repel the silencing complex, as well as specific boundary elements. Future studies are needed to further characterize the chromatin structure of escape domains and to understand their role in evolution.
This work was supported by grants from the National Institutes of Health to JBB (HD060402) and to CMD (GM046883 and GM079537).