Epigenetic signatures associated with imprinted paternally expressed genes in the Arabidopsis endosperm
Genome Biology volume 20, Article number: 41 (2019)
The Correction to this article has been published in Genome Biology 2019 20:182
Imprinted genes are epigenetically modified during gametogenesis and maintain the established epigenetic signatures after fertilization, causing parental-specific gene expression.
In this study, we show that imprinted paternally expressed genes (PEGs) in the Arabidopsis endosperm are marked by an epigenetic signature of Polycomb Repressive Complex 2 (PRC2)-mediated H3K27me3 together with heterochromatic H3K9me2 and CHG methylation, which specifically mark the silenced maternal alleles of PEGs. The co-occurrence of H3K27me3 and H3K9me2 on defined loci in the endosperm differs drastically from the strict separation of both pathways in vegetative tissues, revealing tissue-specific employment of repressive epigenetic pathways in plants. Based on the presence of this epigenetic signature on maternal alleles, we are able to predict known PEGs with high accuracy and identify several new PEGs that we confirm using INTACT-based transcriptomes generated in this study.
The presence of the three repressive epigenetic marks, H3K27me3, H3K9me2, and CHG methylation on the maternal alleles in the endosperm serves as a specific epigenetic signature that allows prediction of genes with parental-specific gene expression. Our study reveals that there are substantially more PEGs than previously identified, indicating that paternal-specific gene expression is of higher functional relevance than currently estimated. The combined activity of PRC2-mediated H3K27me3 together with the heterochromatic H3K9me3 has also been reported to silence the maternal Xist locus in mammalian preimplantation embryos, suggesting convergent employment of both pathways during the evolution of genomic imprinting.
Genomic imprinting is an epigenetic phenomenon causing maternal and paternal alleles to be differentially expressed after fertilization. In plants, genomic imprinting is mainly confined to the endosperm, an ephemeral nutritive tissue supporting embryo growth, similar to the placenta in mammals. The endosperm is the product of a double fertilization event, where one of the haploid sperm cells fertilizes the haploid egg cell giving rise to the diploid embryo, while the second sperm cell fertilizes the diploid central cell to give rise to the triploid endosperm. Imprinted genes are epigenetically modified during gamete formation, and the established epigenetic asymmetry is maintained after fertilization. Differential DNA methylation is established by the DNA glycosylase DEMETER (DME) that removes methylated cytosine residues in the central cell of the female gametophyte. DME is not active in sperm, leading to differential DNA methylation between male and female genomes in the endosperm. DME acts on small transposable elements (TEs) in the vicinity of genes, and its activity has been connected to the expression of maternally expressed imprinted genes (MEGs). Hypomethylation can furthermore cause repression [5, 6], possibly by exposing binding sites for the Fertilization-Independent Seed (FIS)-Polycomb Repressive Complex 2 (PRC2) [7, 8], an evolutionarily conserved chromatin-modifying complex that applies a trimethylation mark on histone H3 at lysine 27 (H3K27me3). The Arabidopsis FIS-PRC2 consists of the subunits MEDEA (MEA), FIS2, FERTILIZATION-INDEPENDENT ENDOSPERM (FIE), and MULTICOPY SUPPRESSOR OF IRA1 (MSI1) and is specifically active in the central cell of the female gametophyte and in the endosperm. Repression of the maternal alleles of PEGs is mediated by the activity of the FIS-PRC2, consistent with maternal PEG alleles being marked by H3K27me3. In this manuscript, we addressed the mechanism of maternal allele repression in PEGs.
Surprisingly, we found that PEGs are regulated by two otherwise largely exclusive epigenetic repressive pathways, the FIS-PRC2 and the pathway establishing the heterochromatin-localized H3K9me2 modification. We demonstrate that both modifications, H3K27me3 and H3K9me2, overlap on the maternal alleles of the majority of PEGs. Our data suggest that FIS-PRC2 most likely acts first and is required to establish H3K9me2. Furthermore, we find maternal alleles of PEGs to be marked by CHG methylation in the central cell, indicating that repressive pathways establishing H3K27me3, H3K9me2, and CHG methylation act in the central cell of the female gametophyte. Finally, we use the presence of the three modifications to predict novel PEGs and propose that the number of PEGs predicted based on expression data strongly underestimates the real number of PEGs.
The maternal alleles of PEGs are marked by H3K27me3, H3K9me2, and CHG methylation
In a previous study, we revealed that H3K27me3 and H3K9me2 overlapped in the endosperm at pericentromeric heterochromatic regions of the paternal genome, suggesting a partial functional redundancy of both modifications. We now addressed the question whether this redundancy extends to other regions of the genome. We found that about one third of genes marked by H3K27me3 on the maternal alleles were also marked by H3K9me2 (Fig. 1a, hypergeometric test, P = 0 (Col × Ler crosses); Additional file 1: Figure S1A, P = 0 (Ler × Col crosses)). Genes containing both modifications on their maternal alleles had significantly higher levels of H3K27me3 compared to those only marked by H3K27me3 (Fig. 1b, Additional file 1: Figure S1B). The majority of double-marked genes contained both modifications specifically on the maternal but not the paternal alleles (maternal-specific marks) (Fig. 1a, Additional file 1: Figure S1A) and increasing levels of H3K27me3 on maternal alleles correlated with increasing levels of H3K9me2 on the maternal but not the paternal alleles (Additional file 1: Figure S2 and S3). We thus proposed that the presence of both modifications on the maternal alleles correlates with paternally biased expression. Consistent with this notion, we found that genes previously identified as PEGs were substantially enriched for both modifications on their maternal alleles, while MEGs did not show enrichment of both marks on either maternal or paternal alleles (Fig. 1c, d, Additional file 1: Figure S1C-D). This trend was independent of the direction of the cross and similarly observed in Col × Ler and Ler × Col crosses (Fig. 1 and Additional file 1: Figure S1). The presence of H3K27me3 is generally confined to gene bodies, consistent with the observed enrichment of H3K27me3 in gene bodies of PEG maternal alleles (Fig. 1e and Additional file 1: Figure S1E). Interestingly, H3K9me2 was similarly restricted to gene bodies of PEGs (Fig. 1e and Additional file 1: Figure S1E), contrasting with the exclusion of H3K9me2 from genic regions in sporophytic tissues. The maternal alleles of PEGs were also strongly enriched for CHG methylation (Fig. 1f, Additional file 1: Figure S1 and S2), consistent with the chromomethylase 3 (CMT3) acting in a positive feedback loop with the histone methyltransferase proteins KYP/SUVH4, SUVH5, and SUVH6. Levels of CHH methylation were generally higher on maternal than on paternal alleles (Fig. 1f, Additional file 1: Figure S1F), consistent with depletion of CHH methylation in sperm [4, 17].
Maternal-specific CHG methylation is established in the central cell and depends on FIS-PRC2
PEGs had increased levels of CHG methylation in the central cell, but low CHG methylation in sperm and vegetative cells of pollen (Fig. 2a), revealing that differences in CHG methylation are established before fertilization. The FIS-PRC2 had been proposed to promote non-CG methylation , raising the hypothesis that increased CHG methylation on the maternal alleles of PEGs depends on the activity of FIS-PRC2 in the central cell. Consistently, we observed strongly decreased CHG methylation on the maternal alleles of PEGs in seeds lacking maternal FIE activity (Fig. 2b), suggesting that FIS-PRC2 activity is required to recruit CHG methylation. The DNA glycosylase DME is required for the activation of MEA and FIS2, both encoding subunits of the FIS-PRC2 [18, 19]. Maternal PEG alleles had reduced levels of CHG methylation in seeds lacking maternal DME activity (Fig. 2b), supporting the notion that FIS-PRC2 activity is required for CHG methylation establishment on maternal PEG alleles.
Previous work revealed that loss of FIS-PRC2 function caused activation of maternal PEG alleles. We addressed the question whether loss of CMT3 and the redundantly acting SUVH4,5,6 similarly affects silencing of maternal PEG alleles. Based on published expression studies, CMT3, SUVH4, and SUVH6 are expressed in the central cell of the female gametophyte and in the endosperm [21, 22]. The maternal alleles of seven tested PEGs remained silenced in reciprocal crosses of wild type with cmt3 and suvh4,5,6 triple mutants (Additional file 1: Figure S4A-B), indicating that stable silencing of the maternal alleles of PEGs does not depend on CMT3 and SUVH4,5,6 activity before fertilization. We tested the requirement of CMT3 and SUVH4,5,6 for PEG regulation after fertilization by monitoring PEG expression in homozygous mutant cmt3 and suvh4,5,6 seeds. None of the seven tested genes was significantly upregulated in seeds of cmt3 or suvh4,5,6 mutants (Additional file 1: Figure S4C), indicating that CMT3 and SUVH4,5,6 are not required for repression of maternal PEG alleles after fertilization. In the suvh4,5,6 triple mutant, H3K9me2 is eliminated genome-wide in vegetative tissues; however, whether H3K9me2 is similarly eliminated in the central cell and endosperm remains to be shown. Therefore, deciphering the role of H3K9me2 in repressing the maternal alleles of PEGs remains a subject for future investigation.
Paternally biased expression coincides with the combination of H3K27me3, H3K9me2, and CHG methylation
We addressed the question whether the presence of H3K9me2 and CHG methylation is functionally relevant for maternal allele repression by testing whether the presence of both modifications either alone or in combination with H3K27me3 correlates with maternal allele repression. Strikingly, while the presence of either CHG methylation, H3K9me2, or H3K27me3 on maternal alleles did not shift the allelic balance, the combination of more than one modification shifted the balance towards preferential paternal allele expression (Fig. 3a, Additional file 1: Figure S5A), suggesting that the combined presence of H3K27me3, H3K9me2, and CHG methylation is a hallmark for maternal allele repression. Consistently, increased bias towards paternal allele expression correlated with increased levels of H3K27me3, H3K9me2, and CHG methylation on maternal alleles (Fig. 3b, Additional file 1: Figure S5B).
We tested which combination of marks would most reliably predict paternally biased genes. We assigned scores for the allele-specific presence of the three modifications (see Additional file 2: Table S1) and tested whether highly scoring genes were enriched for previously predicted PEGs. The combination of CHG methylation in the central cell together with maternal-specific H3K27me3 and H3K9me2 predicted the highest number of previously described PEGs in Col and Ler accessions (24 out of 42 (57%)) in relation to the number of genes in the category with the highest score (Fig. 4a, Additional file 2: Table S1) and was chosen for further analysis. The category with the highest score was significantly enriched for PEGs (hypergeometric test, P = 1.0e−32), while categories with lower scores contained only few PEGs (Fig. 4a). Similarly, out of 64 PEGs that had been predicted by a recent study re-evaluating previously published imprintome datasets, 40 (62%) PEGs were present in the highest score category (Fig. 4a). Nearly half (96 genes, 46.4%) of those genes in the highest score category were significantly paternally biased (chi-square test, P < 0.05, Bonferroni corrected, Additional file 3: Table S2), which was significantly more than the 8% paternally biased genes identified among all genes tested (hypergeometric test, P = 8.9e−52). We thus conclude that the presence of the three modifications, CHG in the central cell and H3K27me3 and H3K9me2 on maternal alleles in the endosperm, allows the prediction of genes with paternally biased expression. Paternally biased genes were particularly involved in regulation of transcription (P = 7.54e−5) and chromatin modification (P = 1.57e−3) (Additional file 1: Table S3), consistent with previous reports on the functional role of PEGs [14, 25].
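The enrichment tests reported above can be illustrated with a minimal sketch of an upper-tail hypergeometric test. The function name `hypergeom_upper_tail` is hypothetical, and the population size and category size used in the example are placeholders rather than values from this study; only the counts of 42 known PEGs and 24 PEGs in the top category come from the text.

```python
from math import comb

def hypergeom_upper_tail(population, successes, draws, observed):
    """P(X >= observed) when `draws` genes are sampled without replacement
    from `population` genes, of which `successes` are known PEGs."""
    max_k = min(successes, draws)
    total = comb(population, draws)
    tail = sum(
        comb(successes, k) * comb(population - successes, draws - k)
        for k in range(observed, max_k + 1)
    )
    return tail / total

# 42 known PEGs with 24 in the top category are taken from the text.
# The population size (10000) and category size (200) are ASSUMED
# placeholders for illustration, not values from the study.
p_value = hypergeom_upper_tail(population=10000, successes=42,
                               draws=200, observed=24)
print(f"P(X >= 24) = {p_value:.3g}")
```

Exact integer binomial coefficients are used so that very small tail probabilities are not lost to rounding before the final division.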
Maternal seed coat contamination restricts the identification of PEGs
Published endosperm transcriptome data contain a substantial fraction of transcripts from the maternal seed coat, which may limit the correct prediction of paternally biased genes . We hypothesized that there are several genes that based on their epigenetic modifications (score 12, Fig. 4a and Additional file 3: Table S2) are likely to be PEGs but based on available expression data are not correctly classified. A prediction of this hypothesis is that genes in the highest score category that have been classified as biallelically expressed or maternally biased are more highly expressed in the seed coat than those with paternally biased expression. We tested this hypothesis using available expression data of seed coat tissue  at a similar developmental stage as published endosperm expression data . Indeed, genes with paternally biased expression were significantly less expressed in the seed coat compared to genes that based on available expression data are either predicted to be biallelically expressed or maternally biased (Wilcoxon rank sum test, P < 0.005; Fig. 4b), strongly suggesting that maternal tissue contamination in available expression data limits the prediction of PEGs. To further test this hypothesis, we generated transgenic reporter lines containing the promoter (≃ 2 kb) and downstream genic regions fused to the green fluorescent protein (GFP) of seven genes belonging to the highest score category but predicted to be maternally (AT2G33620, AT1G43580, AT1G47530, AT1G64660, AT2G30590, AT4G15390) or biallelically expressed (AT5G53160). We detected a GFP signal in the endosperm only for construct AT1G64660 (Fig. 5, Additional file 1: Table S4). For construct AT1G47530, a signal was detected in the seed coat, while for the other constructs no GFP signal was detected in seeds, indicating that the regulatory elements required for the expression of those genes are located outside the promoter and genic regions used to generate the reporter lines. 
Reciprocal crosses using the AT1G64660 reporter lines revealed that this gene is indeed a PEG and strongly expressed in the endosperm when paternally, but not when maternally inherited (Fig. 5). These data support the hypothesis that seed coat contamination limits the transcriptome-based identification of PEGs.
To test whether we could confirm additional PEGs that have been predicted based on their epigenetic signatures, we applied INTACT (isolation of nuclei tagged in specific cell types) to purify endosperm nuclei at 4 days after pollination (DAP) from Col × Ler and Ler × Col reciprocal crosses. Isolated RNA was sequenced and profiled for allele-specific gene expression. By analyzing the maternal to total reads ratio in each epigenetic category, we confirmed that the genes in the highest score category (group with score 12) indeed showed a clear trend towards paternally biased expression (Fig. 6a, Additional file 4: Table S5). Following previously established criteria , we predicted 148 PEGs that were reciprocally imprinted in both directions of the crosses. There was a significantly higher number of PEGs present in the highest score category compared to a representative random sample of genes with informative reads (Fig. 6b). Furthermore, the highest score category had the highest frequency of PEGs, with other categories having significantly fewer PEGs (Fig. 6c). Of the 148 genes that we predicted as PEGs based on our RNA sequencing data, 45 were present in the highest score category, which is significantly more than expected by chance (P = 2.008e−55, Fig. 6d). Out of those, 24 were previously predicted based on published data [12, 14, 26, 27], while 21 genes are likely new high-confidence PEGs, revealing that PEGs are more common than previously estimated.
In this study, we identified the concomitant presence of maternal-specific CHG methylation, H3K27me3, and H3K9me2 as an epigenetic signature for paternally biased expression in the endosperm. We furthermore predict that there are substantially more PEGs than previously reported in Arabidopsis [12, 14, 26], suggesting that in Arabidopsis the number of PEGs exceeds the number of MEGs. Recent re-evaluation of published imprintome data of Arabidopsis revealed that a large number of previously predicted MEGs were seed coat expressed genes, while the number of PEGs was underestimated . Our study supports and extends this notion by showing that there is a large number of paternally biased genes that likely failed to be identified in previous studies because of maternal seed coat contamination or early stage-specific expression.
A previous study reported that the maternal alleles of PEGs in A. lyrata are marked by CHG methylation and suggested that closely related species use different mechanisms to regulate imprinted gene expression. Our study reveals that, similar to A. lyrata, the maternal alleles of many PEGs are also marked by CHG methylation in A. thaliana, highlighting that epigenetic mechanisms employed to maintain monoallelic expression are rather conserved between related species. Interestingly, while the maternal alleles of PEGs in maize are marked by H3K27me3, they are not marked by CHG methylation, indicating that diverged species may use a different mechanism in maintaining maternal allele repression.
How H3K9me2 and CHG methylation are established at PRC2 target genes remains to be studied; however, the strong activation of maternal PEG alleles upon loss of FIS-PRC2 function suggests that H3K9me2 and CHG methylation require FIS-PRC2 function. PRC2 generally targets genes with specific roles during development, while complexes establishing H3K9me2 mainly target TEs localized in heterochromatic regions of the genome. This functional division of PRC2 and machineries establishing heterochromatic marks is conserved in plants as well as in mammals; however, there are notable exceptions to this rule in both groups of organisms. In rice seedlings, about one third of H3K27me3-marked genes are also marked by CHG and CHH methylation. Similar to the findings reported in our study, higher levels of H3K27me3 correlate with higher CHG and CHH methylation in gene bodies of rice. The rice H3K27me3 methyltransferase SDG711 physically interacts with the CHH methyltransferase OsDRM2 and the SRA-domain containing SUVH protein SDG703, uncovering a mechanistic connection between PRC2 and non-CG methylation.
Which methyltransferases establish H3K9me2 in the central cell of the female gametophyte and in the endosperm of Arabidopsis remains to be investigated. SUVH4,5,6 are the main H3K9me2 methyltransferases in sporophytic tissues of Arabidopsis, and SUVH4 and SUVH6 are expressed in the central cell and in the endosperm [21, 22]. However, the imprinted genes SUVH7 and SUVH8 encode two potential H3K9me2 methyltransferases that, despite lack of in vitro activity, may be active in the endosperm. Increased expression of SUVH7 is detrimental in triploid seeds, indicating that SUVH7 is functionally active in the endosperm. Similarly, CMT3 is expressed in the central cell and in the endosperm [21, 22]; however, functional redundancies with other CMT genes cannot be ruled out based on available data. Identifying the H3K9me2 and CHG methyltransferases acting in the endosperm will be a major step to address the functional role of H3K9me2 and CHG methylation in the stable repression of maternal PEG alleles.
In mammalian cells, H3K9 methyltransferases colocalize with PRC2 [35,36,37], revealing crosstalk between these two major epigenetic silencing pathways that likely is required for stable gene silencing. The imprinted maternal Xist locus encoding an X-linked long-noncoding RNA is covered by H3K27me3 and H3K9me3 in preimplantation embryos [38, 39]. Importantly, loss of maternal H3K27me3 induces Xist activation, indicating that maternal H3K27me3 is the major imprinting mark of Xist . This is strikingly similar to findings made in this study revealing H3K27me3 as the major repressive mark for PEGs. Recent work revealed that maternal H3K27me3 controls DNA methylation-independent imprinting in mammalian preimplantation embryos . While imprinted expression of most genes is lost in the embryonic cell lineage, few genes maintain their imprinted expression in the extra-embryonic cell lineage . The Xist locus that is marked by H3K27me3 and H3K9me3 is among those loci that remain imprinted in extra-embryonic tissues . Whether the presence of both marks distinguishes those genes that maintain their imprinted expression from those that become biallelically expressed remains to be tested, but we consider this a very attractive hypothesis. We speculate that the presence of both marks in certain tissue types of mammals and flowering plants is a conserved epigenetic signature marking stably repressed genes.
We discovered the co-occurrence of the PRC2-mediated H3K27me3 and heterochromatic H3K9me2 and CHG methylation as an epigenetic signature marking the silenced maternal alleles of PEGs. This signature can be used to predict PEGs at high accuracy, and based on this prediction, we estimate that the number of PEGs is substantially larger than previously estimated. We hypothesize that the common use of PRC2 and H3K9 methylation to silence target loci during reproduction has convergently evolved in flowering plants and mammals to ensure stable silencing during this sensitive life stage.
Plant material and growth conditions
All seeds were surface sterilized (5% sodium hypochlorite and 0.01% Triton X-100), stratified for 2 days at 4 °C, and germinated on half-strength Murashige and Skoog medium containing 1% sucrose under long-day conditions (16 h light/8 h darkness, 21 °C). Plants were transferred to soil after 10 to 12 days and grown under long-day conditions. The cmt3-11 (SALK_148381) and suvh456 mutants (kindly provided by Judith Bender) used in this study are in the Col-0 background.
Imprinting assays and expression analysis
To generate siliques of indicated crosses, three to five flowers were emasculated, hand-pollinated, and harvested at 4 DAP. RNA extraction was performed using the MagJET Plant RNA Purification Kit (Thermo Scientific) following the manufacturer’s instructions. Residual DNA was removed using Invitrogen DNase I (Amplification Grade), and cDNA was synthesized using the Fermentas first-strand cDNA synthesis kit according to the manufacturer’s instructions. Quantitative PCR was performed using a MyiQ5 real-time PCR detection system (Bio-Rad) and Solis BioDyne-5x Hot FIREPol EvaGreen qPCR Mix Plus (ROX, Solis BioDyne). For the imprinting-by-sequencing assay, the PCR products were purified and analyzed by Sanger sequencing. For the imprinting-by-restriction enzyme digestion assay, the PCR products were purified and digested. Restriction enzymes and primers used are listed in the Additional file 1: Table S6.
We made use of endosperm-specific ChIP-seq data that have been previously generated in our group . Data correspond to pooled biological triplicates with ChIP signals being normalized with H3 ChIP data by calculating the log2 ratio in 150-bp bins across the genome. Data were standardized and normalized with a z-score transformation . Metagene plots over genes were constructed between − 2 and + 2 kb by calculating mean levels of methylation signals in 100-bp bins in the flanks of the genes and in 40 equally long bins between the transcriptional start and stop. Gene z-scores were calculated as an average of z-scores over the gene body. DNA methylation data of fie and dme and their corresponding wild types are from . DNA methylation data of the central cell, sperm, and vegetative cells are from [3, 17]. Endosperm expression data are from  and seed coat expression data from .
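The normalization steps described above (log2 ratio of ChIP over H3 signal per bin, z-score standardization, and averaging over the gene body) can be sketched as follows. This is a simplified illustration, not the pipeline used in the study: the function names, the pseudocount, and the example read counts are all assumptions.

```python
from math import log2
from statistics import mean, pstdev

def log2_ratio_bins(chip_counts, h3_counts, pseudocount=1.0):
    """log2(ChIP / H3) per bin. The pseudocount is an ASSUMPTION added here
    to avoid taking the logarithm of zero; the source does not specify one."""
    return [log2((c + pseudocount) / (h + pseudocount))
            for c, h in zip(chip_counts, h3_counts)]

def z_scores(values):
    """Standardize values to mean 0 and standard deviation 1.
    Fails with ZeroDivisionError if all values are identical."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / sigma for v in values]

def gene_z_score(bin_z, bin_size, gene_start, gene_end):
    """Average z-score of all bins overlapping the gene body
    [gene_start, gene_end), with coordinates in base pairs."""
    first = gene_start // bin_size
    last = (gene_end - 1) // bin_size
    return mean(bin_z[first:last + 1])

# Hypothetical read counts in six consecutive 150-bp bins.
chip = [10, 40, 80, 20, 5, 60]
h3 = [20, 20, 20, 20, 20, 20]
z = z_scores(log2_ratio_bins(chip, h3))
print(round(gene_z_score(z, bin_size=150, gene_start=150, gene_end=450), 3))
```

In this toy example the gene spans the second and third bins, so its score is the mean of those two standardized values.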
Endosperm nuclei isolation and RNA sequencing
We performed Col × Ler reciprocal crosses using Arabidopsis lines expressing PHE1::NTF and PHE1::BirA (hereafter referred to as INT lines). To facilitate the crosses, we used the male sterile mutants pistillata (pi-1, in Ler accession) and dde2 (in Col accession containing INT) as female parents and pollinated them with the INT line (Col accession) and Ler wild type, respectively. A total of 500 mg of siliques for the first replicates and 250 mg of siliques for the second and third replicates were collected at 4 DAP. Tissue homogenization, nuclei purification, RNA extraction, and library preparation were performed from three biological replicates as previously described [43, 44]. Samples were sequenced at the National Genomic Infrastructure (NGI) from SciLife Laboratory (Uppsala, Sweden) on an Illumina HiSeq2500 in paired-end 125 bp read length. Mapping and discrimination of parental reads was done as previously described. We calculated contamination levels based on the deviation of the read counts from the expected 2:1 maternal/paternal genome ratio in the endosperm (Additional file 1: Table S6). Based on this analysis, the first replicates of both cross directions Col × Ler and Ler × Col were not included in the downstream analyses. To increase the statistical power to detect parentally biased genes, we merged libraries from two replicates of Col × Ler and three replicates of Ler × Col for downstream analysis.
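One simple way to turn a deviation from the expected 2:1 maternal/paternal ratio into a contamination estimate is sketched below. The exact formula used in this study is not given in the text, so the model here is an assumption: contaminating reads are taken to be purely maternal, and pure endosperm is taken to contribute exactly two thirds maternal reads. The function name is hypothetical.

```python
def maternal_contamination(maternal_reads, paternal_reads):
    """Estimate the fraction of reads coming from maternal tissue.

    ASSUMED model: with contamination fraction c, the observed maternal
    fraction is m = c + (1 - c) * 2/3, which rearranges to
    c = 3m - 2 = (maternal - 2 * paternal) / (maternal + paternal).
    Negative estimates (a paternal excess) are clamped to zero.
    """
    total = maternal_reads + paternal_reads
    if total == 0:
        raise ValueError("no informative reads")
    return max(0.0, (maternal_reads - 2 * paternal_reads) / total)

# Hypothetical read counts for illustration only.
print(maternal_contamination(200, 100))  # exactly 2:1
print(maternal_contamination(80, 20))    # maternal excess
```

Under this model, a library at exactly 2:1 yields an estimate of zero, and any maternal excess is attributed entirely to seed coat or other maternal tissue.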
Allele-specific expression analysis
We followed our previously published analysis pipeline to define imprinted genes. Briefly, we defined a minimum threshold of 20 informative reads for Col × Ler (2 replicates) and Ler × Col (2 replicates) crosses, respectively. Statistical differences between maternal and paternal read counts for each gene were calculated using a chi-square test, considering genes with a false discovery rate adjusted P value of less than 0.01. Additionally, MEGs were required to have at least 85% maternal informative reads in both directions of the reciprocal cross and PEGs were required to have at least 50% paternal informative reads in both directions of the reciprocal cross, following previously defined conditions. Quality of sequencing samples is shown in Additional file 1: Table S7.
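The filtering criteria above can be sketched as a small classification routine. This is an illustration under stated assumptions rather than the published pipeline: the chi-square test is assumed to compare read counts against the 2 maternal : 1 paternal endosperm expectation, the false discovery rate adjustment is assumed to be Benjamini-Hochberg, and all function names, gene names, and read counts are hypothetical. The thresholds (20 reads, adjusted P < 0.01, 85% maternal, 50% paternal) are those stated in the text.

```python
from math import erfc, sqrt

def chi2_pvalue_2to1(maternal, paternal):
    """Chi-square goodness-of-fit P value (1 d.f.) against an ASSUMED
    2 maternal : 1 paternal expectation."""
    total = maternal + paternal
    exp_m, exp_p = total * 2 / 3, total / 3
    stat = (maternal - exp_m) ** 2 / exp_m + (paternal - exp_p) ** 2 / exp_p
    return erfc(sqrt(stat / 2))  # chi-square survival function for 1 d.f.

def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in input order."""
    n = len(pvalues)
    order = sorted(range(n), key=lambda i: pvalues[i])
    adjusted = [0.0] * n
    running_min = 1.0
    for rank in range(n, 0, -1):  # walk from the largest P value down
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * n / rank)
        adjusted[i] = running_min
    return adjusted

def classify(counts, min_reads=20, fdr=0.01, meg_cut=0.85, peg_cut=0.50):
    """counts maps gene -> (maternal, paternal) reads for one cross direction."""
    genes = [g for g, (m, p) in counts.items() if m + p >= min_reads]
    adj = benjamini_hochberg([chi2_pvalue_2to1(*counts[g]) for g in genes])
    calls = {}
    for gene, q in zip(genes, adj):
        m, p = counts[gene]
        maternal_fraction = m / (m + p)
        if q < fdr and maternal_fraction >= meg_cut:
            calls[gene] = "MEG"
        elif q < fdr and 1 - maternal_fraction >= peg_cut:
            calls[gene] = "PEG"
        else:
            calls[gene] = "not imprinted"
    return calls

def reciprocal_calls(direction_1, direction_2):
    """Keep genes that receive the same MEG or PEG call in both directions."""
    c1, c2 = classify(direction_1), classify(direction_2)
    return {g: c for g, c in c1.items()
            if c in ("MEG", "PEG") and c2.get(g) == c}

# Hypothetical (maternal, paternal) read counts.
col_x_ler = {"geneA": (10, 90), "geneB": (95, 5), "geneC": (66, 34), "geneD": (5, 5)}
ler_x_col = {"geneA": (8, 72), "geneB": (60, 40), "geneC": (70, 30), "geneD": (6, 4)}
print(reciprocal_calls(col_x_ler, ler_x_col))
```

Requiring the same call in both directions of the cross separates parent-of-origin effects from accession-specific expression differences, which would flip between Col × Ler and Ler × Col.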
Generation of reporter constructs and transgenic lines
For the generation of reporter constructs, we used the ClonExpress® MultiS One Step Cloning kit. Promoters (≃ 2 kb) and genic sequences of AT2G33620, AT1G43580, AT1G47530, AT1G64660, AT2G30590, AT4G15390, and AT5G53160 were amplified from WT Col-0 genomic DNA using primers specified in Additional file 1: Table S6 and cloned into vector pB7FWG.0. Constructs were transformed into Agrobacterium tumefaciens strain GV3101, and Arabidopsis plants were transformed using the floral dip method . Ten transgenic lines per construct were generated and analyzed.
For reciprocal crosses, designated female partners were emasculated at 1–2 days prior to anthesis. Two days after emasculation, pistils were hand-pollinated with respective pollen donors. Seeds were dissected from the siliques and mounted on a microscope slide for imaging and counting at 2 and 4 DAP. For fluorescence analyses, seeds were stained with 0.1 mg/mL propidium iodide (PI) solution in 7% glucose. Seeds of reciprocal crosses of reporter lines were analyzed under confocal microscopy on a Zeiss 800 Inverted Axio Observer with a supersensitive GaASp detector. Images were acquired, analyzed, and exported using Zeiss ZEN software. For each reporter construct, 50–60 seeds per line were analyzed.
Rodrigues JA, Zilberman D. Evolution and function of genomic imprinting in plants. Genes Dev. 2015;29(24):2517–31.
Li J, Berger F. Endosperm: food for humankind and fodder for scientific discoveries. New Phytol. 2012;195(2):290–305.
Park K, Kim MY, Vickers M, Park JS, Hyun Y, Okamoto T, Zilberman D, Fischer RL, Feng X, Choi Y, et al. DNA demethylation is initiated in the central cells of Arabidopsis and rice. Proc Natl Acad Sci U S A. 2016;113(52):15138–43.
Ibarra CA, Feng X, Schoft VK, Hsieh TF, Uzawa R, Rodrigues JA, Zemach A, Chumak N, Machlicova A, Nishimura T, et al. Active DNA demethylation in plant companion cells reinforces transposon methylation in gametes. Science. 2012;337(6100):1360–4.
Villar C, Erilova A, Makarevich G, Trösch R, Köhler C. Control of PHERES1 imprinting in Arabidopsis by direct tandem repeats. Mol Plant. 2009;2:654–60.
Pignatta D, Novitzky K, Satyaki PR, Gehring M. A variably imprinted epiallele impacts seed development. PLoS Genet. 2018;14(11):e1007469.
Weinhofer I, Hehenberger E, Roszak P, Hennig L, Köhler C. H3K27me3 profiling of the endosperm implies exclusion of polycomb group protein targeting by DNA methylation. PLoS Genet. 2010;6(10):e1001152.
Moreno-Romero J, Jiang H, Santos-Gonzalez J, Kohler C. Parental epigenetic asymmetry of PRC2-mediated histone modifications in the Arabidopsis endosperm. EMBO J. 2016;35:1298–311.
Simon JA, Kingston RE. Occupying chromatin: polycomb mechanisms for getting to genomic targets, stopping transcriptional traffic, and staying put. Mol Cell. 2013;49(5):808–24.
Mozgova I, Hennig L. The polycomb group protein regulatory network. Annu Rev Plant Biol. 2015;66:269–96.
Luo M, Bilodeau P, Dennis ES, Peacock WJ, Chaudhury A. Expression and parent-of-origin effects for FIS2, MEA, and FIE in the endosperm and embryo of developing Arabidopsis seeds. Proc Natl Acad Sci U S A. 2000;97(19):10637–42.
Hsieh TF, Shin J, Uzawa R, Silva P, Cohen S, Bauer MJ, Hashimoto M, Kirkbride RC, Harada JJ, Zilberman D, et al. Regulation of imprinted gene expression in Arabidopsis endosperm. Proc Natl Acad Sci U S A. 2011;108(5):1755–62.
Law JA, Jacobsen SE. Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nat Rev Genet. 2010;11(3):204–20.
Pignatta D, Erdmann RM, Scheer E, Picard CL, Bell GW, Gehring M. Natural epigenetic polymorphisms lead to intraspecific variation in Arabidopsis gene imprinting. Elife. 2014;3:e03198.
Shu H, Nakamura M, Siretskiy A, Borghi L, Moraes I, Wildhaber T, Gruissem W, Hennig L. Arabidopsis replacement histone variant H3.3 occupies promoters of regulated genes. Genome Biol. 2014;15(4):R62.
Roudier F, Ahmed I, Berard C, Sarazin A, Mary-Huard T, Cortijo S, Bouyer D, Caillieux E, Duvernois-Berthet E, Al-Shikhley L, et al. Integrative epigenomic mapping defines four main chromatin states in Arabidopsis. EMBO J. 2011;30(10):1928–38.
Calarco JP, Borges F, Donoghue MT, Van Ex F, Jullien PE, Lopes T, Gardner R, Berger F, Feijo JA, Becker JD, et al. Reprogramming of DNA methylation in pollen guides epigenetic inheritance via small RNA. Cell. 2012;151(1):194–205.
Choi Y, Gehring M, Johnson L, Hannon M, Harada JJ, Goldberg RB, Jacobsen SE, Fischer RL. DEMETER, a DNA glycosylase domain protein, is required for endosperm gene imprinting and seed viability in Arabidopsis. Cell. 2002;110:33–42.
Jullien PE, Kinoshita T, Ohad N, Berger F. Maintenance of DNA methylation during the Arabidopsis life cycle is essential for parental imprinting. Plant Cell. 2006;18(6):1360–72.
Ebbs ML, Bender J. Locus-specific control of DNA methylation by the Arabidopsis SUVH5 histone methyltransferase. Plant Cell. 2006;18(5):1166–76.
Wuest SE, Vijverberg K, Schmidt A, Weiss M, Gheyselinck J, Lohr M, Wellmer F, Rahnenfuhrer J, von Mering C, Grossniklaus U, et al. Arabidopsis female gametophyte gene expression map reveals similarities between plant and animal gametes. Curr Biol. 2010;20(6):506–12.
Belmonte MF, Kirkbride RC, Stone SL, Pelletier JM, Bui AQ, Yeung EC, Hashimoto M, Fei J, Harada CM, Munoz MD, et al. Comprehensive developmental profiles of gene activity in regions and subregions of the Arabidopsis seed. Proc Natl Acad Sci U S A. 2013;110(5):435–44.
Stroud H, Greenberg MV, Feng S, Bernatavichute YV, Jacobsen SE. Comprehensive analysis of silencing mutants reveals complex regulation of the Arabidopsis methylome. Cell. 2013;152(1–2):352–64.
Schon MA, Nodine MD. Widespread contamination of Arabidopsis embryo and endosperm transcriptome data sets. Plant Cell. 2017;29(4):608–17.
Waters AJ, Bilinski P, Eichten SR, Vaughn MW, Ross-Ibarra J, Gehring M, Springer NM. Comprehensive analysis of imprinted genes in maize reveals allelic variation for imprinting and limited conservation with other species. Proc Natl Acad Sci U S A. 2013;110(48):19639–44.
Wolff P, Weinhofer I, Seguin J, Roszak P, Beisel C, Donoghue MT, Spillane C, Nordborg M, Rehmsmeier M, Köhler C, et al. High-resolution analysis of parent-of-origin allelic expression in the Arabidopsis endosperm. PLoS Genet. 2011;7(6):e1002126.
Gehring M, Missirian V, Henikoff S. Genomic analysis of parent-of-origin allelic expression in Arabidopsis thaliana seeds. PLoS One. 2011;6(8):e23687.
Klosinska M, Picard CL, Gehring M. Conserved imprinting associated with unique epigenetic signatures in the Arabidopsis genus. Nat Plants. 2016;2:16145.
Zhang M, Xie S, Dong X, Zhao X, Zeng B, Chen J, Li H, Yang W, Zhao H, Wang G, et al. Genome-wide high resolution parental-specific DNA and histone methylation maps uncover patterns of imprinting regulation in maize. Genome Res. 2014;24(1):167–76.
Mozgova I, Köhler C, Hennig L. Keeping the gate closed: functions of the polycomb repressive complex PRC2 in development. Plant J. 2015;83(1):121–32.
Du J, Johnson LM, Jacobsen SE, Patel DJ. DNA methylation pathways and their crosstalk with histone methylation. Nat Rev Mol Cell Biol. 2015;16(9):519–32.
Alabert C, Groth A. Chromatin replication and epigenome maintenance. Nat Rev Mol Cell Biol. 2012;13(3):153–67.
Zhou S, Liu X, Zhou C, Zhou Q, Zhao Y, Li G, Zhou DX. Cooperation between the H3K27me3 chromatin mark and non-CG methylation in epigenetic regulation. Plant Physiol. 2016;172(2):1131–41.
Wolff P, Jiang H, Wang G, Santos-Gonzalez J, Köhler C. Paternally expressed imprinted genes establish postzygotic hybridization barriers in Arabidopsis thaliana. Elife. 2015;4. https://doi.org/10.7554/eLife.1.
Bilodeau S, Kagey MH, Frampton GM, Rahl PB, Young RA. SetDB1 contributes to repression of genes encoding developmental regulators and maintenance of ES cell state. Genes Dev. 2009;23(21):2484–9.
Yuan P, Han J, Guo G, Orlov YL, Huss M, Loh YH, Yaw LP, Robson P, Lim B, Ng HH, et al. Eset partners with Oct4 to restrict extraembryonic trophoblast lineage potential in embryonic stem cells. Genes Dev. 2009;23(21):2507–20.
Mozzetta C, Pontis J, Fritsch L, Robin P, Portoso M, Proux C, Margueron R, Ait-Si-Ali S. The histone H3 lysine 9 methyltransferases G9a and GLP regulate polycomb repressive complex 2-mediated gene silencing. Mol Cell. 2014;53(2):277–89.
Fukuda A, Tomikawa J, Miura T, Hata K, Nakabayashi K, Eggan K, Akutsu H, Umezawa A. The role of maternal-specific H3K9me3 modification in establishing imprinted X-chromosome inactivation and embryogenesis in mice. Nat Commun. 2014;5:5464.
Inoue A, Jiang L, Lu F, Zhang Y. Genomic imprinting of Xist by maternal H3K27me3. Genes Dev. 2017;31(19):1927–32.
Inoue A, Jiang L, Lu F, Suzuki T, Zhang Y. Maternal H3K27me3 controls DNA methylation-independent imprinting. Nature. 2017;547(7664):419–24.
Zhang X, Yazaki J, Sundaresan A, Cokus S, Chan SW, Chen H, Henderson IR, Shinn P, Pellegrini M, Jacobsen SE, et al. Genome-wide high-resolution mapping and functional analysis of DNA methylation in Arabidopsis. Cell. 2006;126(6):1189–201.
Cheadle C, Vawter MP, Freed WJ, Becker KG. Analysis of microarray data using Z score transformation. J Mol Diagn. 2003;5(2):73–81.
Moreno-Romero J, Santos-Gonzalez J, Hennig L, Köhler C. Applying the INTACT method to purify endosperm nuclei and to generate parental-specific epigenome profiles. Nat Protoc. 2017;12(2):238–54.
Del Toro-De Leon G, Köhler C. Endosperm-specific transcriptome analysis by applying the INTACT system. Plant Reprod. 2018. https://doi.org/10.1007/s00497-018-00356-3.
Clough SJ, Bent AF. Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 1998;16(6):735–43.
Moreno-Romero J, Del Toro-De León G, Yadav VK, Santos-González J, Köhler C. Epigenetic signatures associated with imprinted paternally-expressed genes in the Arabidopsis endosperm. Datasets. Gene Expression Omnibus. 2019. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE119915. Accessed 7 Feb 2019.
Sequencing was performed by the SNP&SEQ Technology Platform, Science for Life Laboratory at Uppsala University, a national infrastructure supported by the Swedish Research Council (VRRFI) and the Knut and Alice Wallenberg Foundation.
This research was supported by a European Research Council Starting Independent Researcher grant (to C.K.), a grant from the Swedish Science Foundation (to C.K.), a grant from the Knut and Alice Wallenberg Foundation (to C.K.), the Göran Gustafsson Foundation for Research in Natural Sciences and Medicine (to C.K.), and an EMBO fellowship (to G.D.T.D.L.).
Availability of data and materials
The RNA-seq data generated in this study are publicly available through GEO (GSE119915) at https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE119915. We furthermore used previously published endosperm RNA expression data (GSE52814), seed coat expression data (GSE12404), parental-specific histone and DNA methylation data (GSE66585), central cell DNA methylation profiles (GSE89789), and DNA methylation data from sperm cells, vegetative cells, and endosperm of fie and dme mutants (GSE38935).
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. H3K27me3, H3K9me2, and CHG methylation overlap on maternal alleles of PEGs in crosses Ler × Col. Figure S2. Total and parental-specific H3K27me3, H3K9me2, and CHG methylation levels in the endosperm of previously described PEGs. Figure S3. Increasing levels of H3K27me3 on maternal alleles correlate with increasing levels of H3K9me2. Figure S4. Maternal alleles remain silenced in suvh4,5,6 and cmt3 mutants. Figure S5. Paternally biased expression coincides with the combination of H3K27me3, H3K9me2, and CHG methylation in crosses Ler × Col (for reciprocal cross direction see Fig. 3). Table S3. GO term enrichment of genes in the highest score category. Table S4. Fluorescence analysis of 4 DAP seeds after reciprocal crosses. Table S6. List of primers used in this study. Table S7. Quality of sequencing samples. (PDF 1499 kb)
Table S1. Scores assigned to genes based on the presence of CHG methylation in the central cell as well as H3K9me2 and H3K27me3 on maternal alleles in the endosperm. (XLSX 1744 kb)
Table S2. Seed coat expression of genes with maximum score (12) based on Additional file 2: Table S1. (XLSX 45 kb)
Table S5. Parent-of-origin RNA-seq dataset of 4 DAP INTACT-purified endosperm of Col × Ler reciprocal crosses. (XLSX 904 kb)