- Open Access
Genome-wide signatures of differential DNA methylation in pediatric acute lymphoblastic leukemia
Genome Biologyvolume 14, Article number: r105 (2013)
Although aberrant DNA methylation has been observed previously in acute lymphoblastic leukemia (ALL), the patterns of differential methylation have not been comprehensively determined in all subtypes of ALL on a genome-wide scale. The relationship between DNA methylation, cytogenetic background, drug resistance and relapse in ALL is poorly understood.
We surveyed the DNA methylation levels of 435,941 CpG sites in samples from 764 children at diagnosis of ALL and from 27 children at relapse. This survey uncovered four characteristic methylation signatures. First, compared with control blood cells, the methylomes of ALL cells shared 9,406 predominantly hypermethylated CpG sites, independent of cytogenetic background. Second, each cytogenetic subtype of ALL displayed a unique set of hyper- and hypomethylated CpG sites. The CpG sites that constituted these two signatures differed in their functional genomic enrichment to regions with marks of active or repressed chromatin. Third, we identified subtype-specific differential methylation in promoter and enhancer regions that were strongly correlated with gene expression. Fourth, a set of 6,612 CpG sites was predominantly hypermethylated in ALL cells at relapse, compared with matched samples at diagnosis. Analysis of relapse-free survival identified CpG sites with subtype-specific differential methylation that divided the patients into different risk groups, depending on their methylation status.
Our results suggest an important biological role for DNA methylation in the differences between ALL subtypes and in their clinical outcome after treatment.
Methylation of cytosine (5 mC) residues in CpG dinucleotides across the genome is an epigenetic modification that plays a pivotal role in the establishment of cellular identity by influencing gene expression during development . In somatic mammalian cells, the majority of CpG sites are methylated. However, CpG sites located in regions of increased CG density, known as CpG islands, generally have low levels of CpG methylation . On the molecular level, it is well known that CpG methylation leads to X-chromosome inactivation, genomic imprinting, and suppression of transposable elements. Disruption of DNA methylation patterns is associated with diseases, and particularly with cancer . Key regulators that are essential for establishing and maintaining the epigenomic landscape are frequently mutated and can drive cancer development via alterations of DNA methylation and histone modifications .
Pediatric acute lymphoblastic leukemia (ALL) originates from the malignant transformation of lymphocyte progenitor cells into leukemic cells in the B-cell and T-cell lineages. ALL is a heterogeneous disease, in which patients are stratified into subtype groups based on their cellular immunophenotype and recurrent cytogenetic aberrations, such as aneuploidies and translocations, acquired by the leukemic cells [5, 6]. In the Nordic countries, the five-year survival rate for pediatric ALL patients exceeds 80%, but one-fifth of the patients relapse despite continued chemotherapy . Although the cytogenetic aberrations are indicative of better or poorer relapse-free survival rates, relapses occur in all cytogenetic subtypes .
We and others have previously observed differential patterns of CpG site methylation in ALL cells compared to non-leukemic bone marrow [7, 8], in subtypes of ALL [9–12], and between diagnosis and relapse . However, the genome-wide DNA methylation patterns have not yet been comprehensively described for all subtypes of ALL and the synergy between DNA methylation, leukemogenesis, drug resistance, and relapse in ALL is poorly understood. Increased understanding of the role of aberrant DNA methylation is of considerable interest, especially in lieu of the possible application of epigenetic treatment in combination chemotherapy [14, 15]. In the present study we provide a comprehensive, genome-wide map of de novo DNA methylation changes in ALL cells at diagnosis and relapse by interrogating the methylation levels of 435,941 CpG sites distributed genome-wide in a large collection of pediatric ALL cells of diverse cytogenetic backgrounds.
The DNA methylation landscape in ALL
HumanMethylation 450k BeadChips were used for quantitative DNA methylation analysis of leukemic blasts from pediatric ALL patients in the Nordic countries. This large collection includes samples from patients with T-cell ALL (T-ALL; n = 101) and B-cell precursor ALL (BCP ALL; n = 663), including multiple samples from rare subtypes of BCP ALL (Table 1). To determine signatures of differential methylation that are characteristic for ALL, we compared the CpG site methylation levels in ALL cells to those in blood cells from non-leukemic individuals. To represent the different stages in lymphoid cell development, we included CD19+ B cells, CD3+ T cells, and CD34+ hematopoietic stem cells isolated from healthy adult blood donors. We also included age-matched bone marrow (BM) samples collected at remission from 86 of the ALL patients as control samples. This set of non-leukemic reference cells includes multipotent progenitor cells (CD34+) and mature lymphoid cells (CD19+, CD3+), which allows the distinction of lineage- and cell type-specific differences from de novo methylation in the ALL cells.
To obtain an initial view of the variation in CpG site methylation in our dataset, we subjected the complete set of methylation data to principal component analysis (PCA). T-ALL, BCP ALL, and non-leukemic samples formed separated clusters using the principal components 1 and 2 (Figure 1A). Although only two components were needed to capture >60% of the variation in the dataset (Figure 1B), higher order components separated the subtypes of BCP ALL from each other (not shown). Although the non-leukemic reference samples originated from different blood cell populations, they clustered together, clearly separated from the ALL samples. Unsupervised cluster analysis across all of the CpG sites revealed distinct methylation patterns that separated ALL cells according to their cytogenetic and immunophenotypic subtype. The evident difference between ALL cells and the non-leukemic blood cells, and the similarity between the non-leukemic cells in the heatmap (Figure 1C) provide the rationale to use these cells as a non-leukemic reference cell panel to detect differential methylation.
Differential DNA methylation
We searched for differentially methylated CpG sites (DMCs) in the ALL cells by comparing the β-values (methylation values ranging from 0.0 to 1.0) in non-leukemic reference samples to the ALL samples of each individual subtype. CD19+, CD34+, and BM samples were used as the reference panel for BCP ALL and CD3+, CD34+, and BM were used as the reference panel for T-ALL. For calling a CpG site as differentially methylated, we required a minimum absolute ∆β-value of 0.2 and a false discovery rate (FDR)-adjusted Wilcoxon rank-sum P-value of <0.01 for the difference. This analysis revealed between 21,799 and 58,157 DMCs in the ALL subtypes, distributed across 5,956 to 8,245 gene regions (Table 2; in Additional file 1: Table S1). In total, 9,406 of the DMCs annotated to 2,023 gene regions and 2,979 CpG islands were observed across all the ALL subtypes and were thereby considered 'constitutive' (Additional file 2: Table S2). The vast majority of the constitutive DMCs (98.6%) were hypermethylated in the ALL cells compared with the non-leukemic reference cells (Figure 2A). The number of DMCs that were unique for each ALL subtype according to the applied criteria varied independently of the number of samples in a subtype, from 16,841 CpG sites in 895 unique gene regions in T-ALL to 271 CpG sites in 36 unique gene regions in the t(9;22) subtype (Table 2). As expected, the heterogeneous BCP ALL samples with unknown cytogenetic aberrations labeled as 'undefined' and those with 'non-recurrent' abnormalities did not display unique differential methylation patterns. The methylation patterns between BCP ALL subtypes differed substantially, with high methylation levels in samples harboring MLL rearrangements, which is opposite to a recent finding of predominant hypomethylation in adult ALL with MLL rearrangements , while the high hyperdiploid (HeH) samples were predominantly hypomethylated in our study (Table 2; Figure 2B-I), as has been previously described in pediatric BCP ALL for HeH . The distribution between hyper- and hypomethylation between the subtypes of pediatric BCP ALL in our study is in agreement with the findings in a recent study of 50,000 CpG sites that used an alternative method for DNA methylation analysis . For the DMCs, the absolute average β-value difference between ALL cells and reference cells for the subtype-specific DMCs was approximately 0.50, which is in agreement with allele-specific gains or losses of DNA methylation in ALL compared to normal cells (Figures 2A-I; Additional file 3: Figures S1A-F).
Functional genomic distribution of differentially methylated CpG sites
The hypermethylated DMCs were enriched in CpG islands, while hypomethylated DMCs were primarily annotated to 'open sea' regions, independent of whether they were constitutive or subtype-specific (Figure 3A). The subtype-specific differences were more frequently observed in CpG island 'shores' and 'shelves', which display a large variation in β-value between ALL samples (Additional file 3: Figure S2). Both constitutive and subtype-specific DMCs in proximal promoter regions (transcription start sites and 5’ untranslated regions) of genes were commonly hypermethylated, but a greater enrichment of subtype-specific hypomethylation was observed in gene bodies and in intergenic regions (Figure 3B). To explore putative functional roles for the DMCs, we intersected the genomic coordinates of the constitutive and subtype-specific DMCs with regions defined by chromatin-immunoprecipitation of six histone marks and DNase1 hypersensitivity (DHS) assays in relevant primary cell types such as CD19+, CD3+, and CD34+ cells [17, 18]. Although the histone code in normal blood cells may not reflect that in ALL cells, the genomic distribution of histone marks is useful for annotating functional regions of the genome. This analysis revealed differences in enrichment between constitutive and subtype-specific DMCs to functional genomic regions with marks of repressed or active chromatin (Figure 3C). The 9,406 constitutive DMCs were enriched more than two-fold in regions marked by repressive H3K9me3 and H3K27me3, or bivalently by H3K27me3 and H3K4me3, which marks active chromatin (P < 0.001). On the contrary, the subtype-specific DMCs were enriched more than two-fold in regions of active chromatin marked by DHS, H3K4me3, and H3K4me1 (P < 0.001; Figure 3C). These observations suggest that subtype-specific methylation of CpG sites has specific functional roles.
The constitutive DMCs were enriched in genes in the transcriptional regulatory network in embryonic stem cells (P = 3.53 × 10-3) and in genes that regulate or are regulated by transcription factors involved in embryonic development: NANOG (P = 9.7 × 10-6), OCT4 (P = 4.9 × 10-5), SOX2 (P = 2.3 × 10-6), and REST (P = 4.75 × 10-13) (Additional file 2: Table S3). While no enrichment to known pathways was observed for the subtype-specific DMC signatures, all of the DMC signatures were enriched for genes with biological functions in cancer, cellular development, cellular growth and proliferation, and cell-to-cell signaling (P < 0.05).
DMCs as regulators of gene expression
To investigate whether the DMCs influence gene expression and to determine which of the annotation classes of DMCs are involved in the regulation of gene expression, we compared the DNA methylation levels of each constitutive and subtype-specific DMC with gene expression data. First, we determined the correlation between the methylation levels of constitutive DMCs and mRNA expression levels obtained using digital gene expression sequencing of 28 ALL samples, including T-ALL and five BCP ALL subtypes, and five reference samples  (Additional file 2: Table S4). The β-values of only a small proportion (<1%) of the constitutive DMCs (n = 85) correlated with up- or down-regulation of the mRNA expression levels of 41 genes (permuted P ≤ 0.05 and fold change ≥2) (Additional file 2: Table S5). This observation was expected since 79% of the constitutive DMCs were annotated to regions containing the repressive H3K27me3 or H3K9me3 marks in healthy blood cells and thus genes in these regions were presumably not widely expressed (Figure 3C). Secondly, we determined which of the subtype-specific DMCs correlated with microarray-based gene expression data for 93 of the ALL samples of the t(12;21), HeH, t(1;19), t(9;22), dic(9;20), MLL/11q23 and T-ALL subtypes (Additional file 2: Table S6). We found that, on average, 15% (range 10 to 21%) of the β-values for the subtype-specific DMCs annotated to genes correlated with gene expression levels (permuted P ≤ 0.05 and fold change ≥2) (Additional file 2: Tables S7 to S13). The proportion of DMCs and gene annotations in t(12;21) that were correlated with gene expression in our study were highly similar to those in a recent, small methylation study on the t(12;21) BCP ALL subtype . Ten of the 17 genes suggested in the earlier study based on their correlation to be drivers of leukemogenesis were also highlighted in our study (Additional file 2: Table S14).
We used the functional annotation of the DMCs correlated with gene expression to explore their putative functional roles, and found hypermethylated DMCs that correlated with down-regulation of gene expression to be enriched in DHS regions, active promoters (H3K4me3), and enhancers (H3K27ac/H3K4me1) (Figure 4A; Additional file 3: Figure S3). On the contrary, hypomethylation of gene bodies was highly correlated with either up- or down-regulation of gene expression. DMCs that were highly correlated with gene expression included genes with functions in epigenetic regulation and previously known subtype-specific gene expression in ALL (Figure 4B). For example, we observed an inverse correlation between the β-value and gene expression for the UHRF1 gene, which encodes a methyl CpG binding protein that has high affinity for hemi-methylated DNA and was highly expressed in the ALL samples, independent of their subtype, while it was not expressed in reference samples . DNA methylation of NCOR2, which is a transcriptional co-repressor that acts through covalent modification of histones , was positively correlated with gene expression in T-ALL. We also show up-regulation of known subtype-specific genes such as BIRC7 in t(12;21) [12, 22] and DDIT4L in HeH , and previously unobserved subtype-specific expression of PHACTR3 in t(1;19) and UAP1 in the dic(9;20) subtype.
Differential DNA methylation in relapsed ALL
Next we compared the genome-wide DNA methylation levels between paired samples at diagnosis and relapse from 27 patients, and in five of the patients after a second relapse (Additional file 2: Table S15). We used PCA to visualize the genome-wide methylation patterns of the sample pairs. Plots of the first two principal components showed similar changes in DNA methylation levels between diagnosis, first, and second relapse in all patients (Figure 5A; Additional file 3: Figure S4). We observed a similar pattern in 10 paired BCP ALL samples at diagnosis and relapse from the Quebec childhood ALL (QcALL) cohort that were included for verification of our results (Figure 5B).
In total, we identified 6,612 DMCs in 1,854 gene regions in the 27 paired diagnosis-relapse ALL samples (Additional file 2: Table S16). Although only 773 (12%) DMCs at relapse overlapped with the constitutive DMCs, the gene region annotations of both signatures were remarkably similar, and included 1,186 (64%) of overlapping gene regions. Hence, like the genes in the constitutive signature, the genes in the relapse signature were enriched for the transcriptional regulatory network in embryonic stem cells and in the Wnt/β-catenin signaling pathways (P = 2.8 × 10-7, 1.8 × 10-4; Additional file 3: Figures S5 and S6), to genes regulated by REST, SOX2, NANOG and OCT4 (P < 6.6 × 10-10), and to regions with the repressive H3K27me3 mark or bivalent H3K4me3/H3K27me3 marks (P < 0.001; Figure 5C).
The methylation levels of each of the relapse DMCs increased in each of the ALL pairs, with the highest levels after the second relapse (Figure 5D). The β-values of the CpG sites in the relapse signature were highly similar in the Nordic Society of Pediatric Hematology and Oncology (NOPHO) and QcALL sample sets (Figure 5E), suggesting that this signature of DMCs is common to relapsed ALL samples, regardless of subtype and treatment protocol. To visualize individual β-value changes in the paired samples, the top 25 ranking DMCs from the relapse signature are plotted in the paired samples (Figure 5F). Regional analysis surrounding CpG sites in each of the top 25 genes showed that nearby CpG sites displayed concordant (increased) methylation changes at relapse (Additional file 3: Figure S7).
DNA methylation for predicting relapse-free survival in ALL
Finally, we utilized CpG sites that constitute the four signatures of differential methylation defined in this study to search for DMCs that are predictive of relapse-free survival of ALL patients. For this purpose, relapse-free survival in each ALL subtype further stratified into standard risk (SR), intermediate risk (IR), high risk (HR), and infant (I) treatment groups was analyzed against the β-values of the DMCs comprising the constitutive, subtype-specific, subtype-specific correlated with gene expression, and relapse signatures using nearest shrunken centroids classification (Additional file 2: Table S17; Additional file 3: Figure S8) . Four of the methylation signatures allowed for prediction of relapse-free survival with an area under the receiver operating characteristic (ROC) curve (AUC) >0.60 (Figure 6A). After permutation testing, subtype-specific DMCs in the group of ALL patients with the t(12;21) translocation that had been treated according to the standard risk (SR) protocol (n = 71) were found to be associated with relapse (P = 0.033). In addition, the subtype-specific sites in patients with the t(9;22) translocation treated on the high risk (HR) protocol (n = 18) and 11q23/MLL rearrangements treated on the infant protocol (n = 14) had indicative P-values of 0.062 and 0.098, respectively, despite the small number of samples in these groups (Additional file 2: Table S17). The relapse signature in all patients treated according to the infant protocol was not statistically significant (P = 0.22).
The effect of each DMC in the relapse-associated signatures was subsequently assessed using permutation testing (Additional file 4). To reduce spurious associations, we required a minimum of two significant CpG sites within the same gene or within 50 kb of each other. Genomic regions were analyzed individually for predictive classification of relapse-free survival. This resulted in the identification of six genomic regions in t(12;21), eight in 11q23/MLL, and one in t(9;22), whose methylation values were associated with relapse (Table 3). Strikingly, 11 of the top ranking DMCs for relapse-free survival in the t(12;21) subtype were annotated to a 2.2 kb region on chr6q12, which encodes an endogenous retroviral gene, ERVH-3 (Figure 6B). In addition, two CpG sites in the DMNBP gene distinguished a group of t(12;21) patients with promoter hypomethylation and high risk of relapse (Figure 6C). Two CpG sites in the first intron of the non-coding RNA gene LOC146880 (ENSG00000215769/hsa-mir-6080) in patients harboring t(9;22) translocations also distinguished a group of patients with hypomethylation and high risk of relapse (Figure 6D). The additional genes associated with increased risk of relapse are plotted in Additional file 3: Figure S9 to S11. These genes include PAG1 in t(12;21), which is known to harbor recurrent somatic mutations in pediatric ALL patients with the hypodiploid karyotype , and WT1 in MLL/11q23, which is commonly mutated in acute myeloid leukemia . Mutations in both these genes are associated with increased risk of relapse in pediatric leukemias. Five zinc finger genes (ZSCAN18, ZNF256, ZNF329, ZNF544, and ZNF681) on chromosome 19q13 were each independently associated with relapse in 11q23/MLL patients, with hypomethylation indicating increased relapse (Additional file 3: Figure S10). These findings indicate that DNA methylation levels of individual genes could be potentially useful as clinical biomarkers in addition to the currently used treatment stratification.
The 450k BeadChips for DNA methylation analysis are particularly suitable for analysis of large sample sets for which next generation bisulfite sequencing is not yet feasible. In the present study, we examined the methylation status of 435,941 CpG sites to determine the methylation patterns in a large set of samples from patients with childhood ALL at diagnosis (n = 764), relapse (n = 27), and in non-leukemic reference samples (n = 137). The quantitative methylation data from the 450k BeadChips in our large set of ALL samples at diagnosis revealed that the average absolute β-value difference between ALL cells and reference cells for the subtype-specific DMCs is approximately 0.50. Similarly, the β-value difference from pair-wise analysis of ALL cells at diagnosis and at relapse is close to 0.5. Based on these observations we speculate that differential methylation occurs in an allele-specific manner in ALL, analogously to what has been recently suggested by integrative analysis of single nucleotide polymorphisms and methylation using next-generation sequencing in prostate cancer . Our speculation on allele-specific DNA methylation is also substantiated by the quantitative correlation between DNA methylation and allele-specific gene expression that we observed in an earlier study of close to 200 of the diagnostic ALL samples analyzed here .
We analyzed multiple cytogenetic subtypes of ALL and found a core methylation signature shared by all the subtypes. This set of 'constitutive' DMCs, which comprised approximately 25% of all DMCs in each ALL subtype, were predominantly hypermethylated and associated with promoters repressed by the polycomb group proteins (PcG) in the context of bivalent chromatin. In stem cells, the repressive PcG complex cooperates with OCT4, SOX2 and NANOG to silence lineage-specific genes and to preserve the pluripotent state of the cells. Hypermethylation preferentially targets CpG islands of PcG-regulated genes in solid cancers [29–31] and in leukemias [13, 32, 33], which suggests a common signature of hypermethylation across cancer types by which cells lose their plasticity, giving them the ability to differentiate while retaining unlimited self-renewal capacity . Although the expression of the majority of the PcG-regulated genes did not appear to be down-regulated in our data set, other studies [29, 31, 34] have shown that these genes are usually expressed at very low levels in normal cells, and become fully silenced upon aberrant DNA methylation in cancer cells. In our digital gene expression (DGE) data, the low expression levels of these genes (<0.5 transcripts per million) inhibited accurate quantification of differential expression.
To our knowledge, our study is the first to observe a signature with higher DNA methylation levels of PcG target genes at relapse of ALL than at diagnosis. ALL cells at relapse are generally more resistant to chemotherapeutic treatment, which is consistent with the association between drug resistance and hypermethylation that is beginning to emerge in hematological neoplasms [13, 35–37]. Hypermethylation may be reversible by pretreatment with a histone deacetylase inhibitor (vorinostat) and DNA methyltransferase inhibitor (decitabine) before standard chemotherapy . In total, 74 of the genes in the constitutive and/or relapse DMC signatures that we identified in the current study have been experimentally shown to be targets for demethylation by decitabine (P < 3.96 × 10-9). As recent evidence suggests that cancer cells become dependent on DNA methylation acquired at specific positions , targeting the DNA methylation machinery may provide novel treatment options for cancers with hypermethylation phenotypes, especially for those patients who have relapsed .
In our study we established that additional hypermethylation in enhancers (marked by H3K4me3/H3K27ac) and in gene bodies are strongly associated with gene expression. Enhancers are distal elements that regulate gene expression and are influenced by aberrant DNA methylation in several cancer types [2, 40–42]. We show here that DNA methylation of enhancers is associated with differential gene expression in ALL. We also found that hypomethylation is prevalent outside CpG islands in gene bodies, and can be associated with either increased or decreased gene expression. This observation suggests a complex relationship between methylation in gene bodies in the regulation of gene expression, which may be acting via alternative promoter usage, splicing, and activity of other regulatory elements . Because the regions with histone marks to which DMCs in ALL cells were enriched originated from normal fractionated blood cells , our results warrant an investigation of histone marks in primary ALL cells, which like DNA methylation are potentially altered in ALL.
The DNA methylation status of individual candidate genes has been demonstrated to predict clinical outcome and allow refined subgrouping of ALL in a clinical setting [10, 43, 44]. We utilized the signatures of differentially methylated CpG sites identified in our study to screen for new markers of relapse in ALL, and found that subtype-specific DMCs may be useful as prognostic markers. We detected differential methylation of multiple CpG sites clustered in the ERHV-3, DMNBP, KCNA3, PAG1, and C11orf52 gene regions that were associated with increased risk of relapse in patients with the t(12;21) translocation treated according to standard risk (SR) therapy. In other patient subgroups we did not observe any significant association between DMCs and clinical outcome (P < 0.05). Patients with HeH and t(12;21) represent the two largest subgroups in pediatric BCP ALL (Table 1), and a majority of them are stratified to standard risk (SR) therapy. One possible explanation for the lack of DMCs with predictive power in patients with HeH is that this subtype group is less homogeneous than the t(12;21) group, and that various combinations of extra chromosomes in HeH cause differences in treatment response, something we will try to explore further. In all other BCP ALL subgroups, patient numbers were considerably smaller, which hinders analysis by repeated cross-validation. As in other contemporary ALL protocols, the current NOPHO ALL2008 protocol includes more intense treatment with asparaginase for all patients than the earlier treatment protocols that were used for the patients included in this study . When follow-up times are long enough, it will be interesting to see if the same genes continue to have prognostic significance for patients treated on the most recent NOPHO ALL2008 protocol. Several studies have reported cancer-associated hypomethylation, expression, and a link to poor outcome for some of the human endogenous retrovirus families . Although hypomethylation or expression of ERVH-3 has not previously been associated with outcome in t(12;21) BCP ALL, this gene was originally discovered in the REH ALL cell line bearing the t(12;21) translocation . A recent study in acute myeloid leukemia showed that decitabine treatment of acute myeloid leukemia cells causes hypomethylation and up-regulation of ERVH-3 expression . Our findings of hypomethylation in the ERVH-3 gene as a marker of relapse in t(12;21) warrant exploration of the side effects of decitabine treatment on abnormal hypomethylation of endogenous retroviral genes.
We generated a comprehensive view of the methylation landscape in pediatric ALL compared to non-leukemic reference cells. The analysis identified prevalent hypermethylation of CpG sites at diagnosis and relapse in all subtypes of pediatric ALL. We also detected discrete differences in methylation that drives differential gene expression in a subtype-specific pattern. Moreover, hypomethylation of several genes appeared to be predictive of relapse in a subset of patients with the common t(12;21)ETV6/RUNX1 translocation. Whether the de novo methylation detected here contributes actively to ALL, or is a passenger in the malignant transformation of blood progenitor cells into ALL cells remains to be elucidated. Our study implies that aberrant DNA methylation is a signature of leukemic development and progression, and for the heterogeneity between patients of similar cytogenetic backgrounds that contributes to relapse.
Materials and methods
DNA and RNA samples
BM aspirates or peripheral blood samples were collected from pediatric ALL patients enrolled in the NOPHO ALL92 or ALL2000 protocols . Clinical follow-up data were obtained from the NOPHO registry. The median follow-up time for patients in continuous complete remission was 9.1 years (range 4.6 to 18 years). Lymphocytes were isolated from ALL samples at diagnosis (n = 764), remission (n = 86), first relapse (n = 27), and second relapse (n = 5) by Ficoll-isopaque centrifugation (Pharmacia, Uppsala, Sweden; Table 1). All samples included in the study contained >80% leukemic blasts at diagnosis (average 91%) and relapse (average 90%), and <5% at remission. For validation, a sample set of DNA samples that were isolated at diagnosis, remission, and relapse from 10 children with pediatric BCP ALL from the QcALL cohort was used. Clinical information for QcALL and relapse samples is available in Additional file 2: Table S15. CD19+ B cells and CD3+ T cells were isolated from peripheral blood mononuclear cells of healthy Swedish blood donors using positive selection (CD19 Microbeads #120-050-301 and CD3 Microbeads #130-050-101) and MACS cell separation reagents (Miltenyi Biotec, Bergisch Gladbach, Germany). Pooled CD34+ cells isolated from five healthy blood donors were purchased from 3H Biomedical (Uppsala, Sweden) . DNA and RNA were extracted as previously described [19, 28]. The study was approved by the Regional Ethical Review Board in Uppsala, Sweden and was conducted according to the guidelines of the Declaration of Helsinki. The patients and/or their guardians provided informed consent.
DNA methylation assay
DNA was treated with sodium bisulfite (EZ DNA methylation Gold, Zymo Research, Irvine, CA, USA) and DNA methylation levels were measured using the Infinium HumanMethylation 450k BeadChip assay (Illumina, San Diego, CA, USA). The ALL samples and controls were randomly distributed across the arrays, all arrays were measured using the same HiScan instrument, and no evidence for batch effects was observed in the β-values (data not shown). The methylation β-value distribution between Infinium type I and II probes was normalized using peak-based correction (Additional file 3: Figure S12) . The data were filtered by removing the data from probes on the X and Y chromosomes and with genetic variation affecting probe hybridization (Additional file 3: Figure S13). After filtering, methylation data for 435,941 CpG sites remained for further analysis (Additional file 1). A subset of diagnostic ALL samples (n = 364) were previously analyzed on a custom GoldenGate DNA methylation array (Illumina) . DNA methylation values of 207 CpG sites interrogated by both arrays evaluate reproducibility of the β-value measurements (Additional file 3: Figure S14). Additional details about the methylation assay, probe filtering, and technical validation can be found in Additional file 4. The DNA methylation data are available at the Gene Expression Omnibus (GEO) with accession number GSE49031.
Annotation of CpG sites
CpG sites were annotated to RefSeq genes and CpG islands according to the Human Methylation 450k manifest file version 1.1. The distribution of probes that passed our stringent filtering is shown in relation to CpG islands, gene regions, and corresponding β-value distributions are shown in Additional file 3: Figures S15 and S16. When a CpG site had more than one gene-level annotation, that is, was present in both the transcription start site and the first exon, both annotations were used.
The following publicly available chromatin datasets from primary CD19+, CD3+, or CD34+ cells were obtained from the NIH Roadmaps Epigenomics Project: DHS regions, H3K27me3, H3K36me3, H3K4me3, H3K9me3, and H3K4me1 (Additional file 2: Table S18) . Peaks were called using the MACS software using default settings . H3K27ac peaks were downloaded from the UCSC table browser  derived from H1-hESC and GM12878 cell lines . CpG sites were annotated for the chromatin marks by overlapping genomic location with a peak in at least two of the replicates analyzed (Additional file 2: Table S1).
Analysis of differential DNA methylation
DMCs were determined using the non-parametric Wilcoxon rank-sum test. They were determined in T-ALL using remission BM, CD3+, and CD34+ cells as reference and in BCP ALL using remission BM, CD19+, and CD34+ cells. The Wilcoxon signed-rank test was used to identify methylation differences between paired samples at diagnosis and relapse. Minimal cut-off values for the mean absolute differences in DNA methylation (∆β) of 0.2 were applied to highlight CpG sites with large differences between groups. CpG sites with standard deviations >0.10 in the reference control group (n = 33,533 sites) were removed from DMC lists to minimize DMCs occurring due to cell type-specific variability (Additional file 3: Figure S2).
Correlation between DNA methylation and gene expression
Genome-wide digital mRNA gene expression (DGE) sequencing data from 28 ALL patient samples and five non-leukemic reference samples were generated as previously described (Additional file 2: Table S4) . RNA expression levels for 93 ALL patient samples were measured with Affymetrix U1333 Plus 2.0 arrays (Additional file 2: Table S6). Raw data were processed and normalized using the robust multichip average (RMA) algorithm [19, 52]. The expression datasets are publicly available at GEO under series GSE47051. Details on the gene expression assays can be found in Additional file 4. For each DMC signature, the correlation between β-value and log2 transformed gene expression was evaluated using the Pearson’s correlation coefficient. Statistical significance of each DMC was calculated by permuting the data 10,000 times and comparing the correlation coefficient in the unpermuted data to the permuted coefficients. In each dataset, the permuted P-values were adjusted for multiple testing using the Benjamini and Hochberg approach for controlling FDR .
Data analysis and visualization
Data analysis was carried out in the R environment . The R code for the analyses performed in this study is available at GitHub . One-sided Fisher’s exact tests were used to assess the significance of the enrichment of DMCs to functionally annotated regions, using the annotation of the 450k array as background. Pathway analysis and enrichment for upstream regulators was performed using software from Ingenuity Pathway Analysis (Ingenuity® Systems, Redwood City, CA, USA) and significance was evaluated with the Fisher’s exact test. All P-values were adjusted for multiple testing by FDR unless otherwise stated. Analysis of relapse-free survival for constitutive and relapse DMC signatures was performed on all patients. Relapse-free survival for the subtype-specific signatures was evaluated individually for T-ALL and BCP ALL separated into the cytogenetic subtypes 11q23/MLL, HeH, t(1;19), t(12;21), and t(9;22). Each subtype was further stratified according to standard, intermediate, high risk, or infant treatment protocols . The patients with dic(9;20) and iAMP21 were not analyzed for relapse-free survival due to the small number of patients in each treatment group. Nearest shrunken centroids classifiers were designed to discriminate between the classes and evaluated with repeated cross-validation . AUC was used to measure predictive performance and statistical significance was evaluated by permuting the data 1,000 times. Each CpG site was scored by its coefficient after shrinkage and the significance was evaluated by permutation testing, as described above. Further details on the relapse-free classification procedure can be found in Additional file 3: Figure S8 and Additional file 4.
acute lymphoblastic leukemia
area under the ROC curve
- BCP ALL:
B-cell precursor acute lymphoblastic leukemia
differentially methylated CpG site
Gene Expression Omnibus
Nordic Society of Pediatric Hematology and Oncology
principal component analysis
Quebec childhood ALL
receiver operating characteristic
T-cell acute lymphoblastic leukemia
Deaton AM, Bird A: CpG islands and the regulation of transcription. Genes Dev. 2011, 25: 1010-1022. 10.1101/gad.2037511.
Jones PA: Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet. 2012, 13: 484-492. 10.1038/nrg3230.
Portela A, Esteller M: Epigenetic modifications and human disease. Nat Biotechnol. 2010, 28: 1057-1068. 10.1038/nbt.1685.
You JS, Jones PA: Cancer genetics and epigenetics: two sides of the same coin?. Cancer Cell. 2012, 22: 9-20. 10.1016/j.ccr.2012.06.008.
Schmiegelow K, Forestier E, Hellebostad M, Heyman M, Kristinsson J, Soderhall S, Taskinen M: Long-term results of NOPHO ALL-92 and ALL-2000 studies of childhood acute lymphoblastic leukemia. Leukemia. 2010, 24: 345-354. 10.1038/leu.2009.251.
Pui CH, Carroll WL, Meshinchi S, Arceci RJ: Biology, risk stratification, and therapy of pediatric acute leukemias: an update. J Clin Oncol. 2011, 29: 551-565. 10.1200/JCO.2010.30.7405.
Nordlund J, Milani L, Lundmark A, Lonnerholm G, Syvanen AC: DNA methylation analysis of bone marrow cells at diagnosis of acute lymphoblastic leukemia and at remission. PLoS One. 2012, 7: e34513-10.1371/journal.pone.0034513.
Wong NC, Ashley D, Chatterton Z, Parkinson-Bates M, Ng HK, Halemba MS, Kowalczyk A, Bedo J, Wang Q, Bell K, Algar E, Craig JM, Saffery R: A distinct DNA methylation signature defines pediatric pre-B cell acute lymphoblastic leukemia. Epigenetics. 2012, 7: 535-541. 10.4161/epi.20193.
Geng H, Brennan S, Milne TA, Chen WY, Li Y, Hurtz C, Kweon SM, Zickl L, Shojaee S, Neuberg D, Huang C, Biswas D, Xin Y, Racevskis J, Ketterling RP, Luger SM, Lazarus H, Tallman MS, Rowe JM, Litzow MR, Guzman ML, Allis CD, Roeder RG, Müschen M, Paietta E, Elemento O, Melnick AM: Integrative epigenomic analysis identifies biomarkers and therapeutic targets in adult B-acute lymphoblastic leukemia. Cancer Discov. 2012, 2: 1004-1023. 10.1158/2159-8290.CD-12-0208.
Milani L, Lundmark A, Kiialainen A, Nordlund J, Flaegstad T, Forestier E, Heyman M, Jonmundsson G, Kanerva J, Schmiegelow K, Söderhäll S, Gustafsson MG, Lönnerholm G, Syvänen AC: DNA methylation for subtype classification and prediction of treatment outcome in patients with childhood acute lymphoblastic leukemia. Blood. 2010, 115: 1214-1225. 10.1182/blood-2009-04-214668.
Davidsson J, Lilljebjorn H, Andersson A, Veerla S, Heldrup J, Behrendtz M, Fioretos T, Johansson B: The DNA methylome of pediatric acute lymphoblastic leukemia. Hum Mol Genet. 2009, 18: 4054-4065. 10.1093/hmg/ddp354.
Busche S, Ge B, Vidal R, Spinella J-F, Saillour V, Richer C, Healy J, Chen S-H, Droit A, Sinnett D, Pastinen T: Integration of high-resolution methylome and transcriptome analyses to dissect epigenomic changes in childhood acute lymphoblastic leukemia. Cancer Res. 2013, 73: 4323-4336. 10.1158/0008-5472.CAN-12-4367.
Hogan LE, Meyer JA, Yang J, Wang J, Wong N, Yang W, Condos G, Hunger SP, Raetz E, Saffery R, Relling MV, Bhojwani D, Morrison DJ, Carroll WL: Integrated genomic analysis of relapsed childhood acute lymphoblastic leukemia reveals therapeutic strategies. Blood. 2011, 118: 5218-5226. 10.1182/blood-2011-04-345595.
Bhatla T, Wang J, Morrison DJ, Raetz EA, Burke MJ, Brown P, Carroll WL: Epigenetic reprogramming reverses the relapse-specific gene expression signature and restores chemosensitivity in childhood B-lymphoblastic leukemia. Blood. 2012, 119: 5201-5210. 10.1182/blood-2012-01-401687.
Pui CH, Mullighan CG, Evans WE, Relling MV: Pediatric acute lymphoblastic leukemia: where are we going and how do we get there?. Blood. 2012, 120: 1165-1174. 10.1182/blood-2012-05-378943.
Figueroa ME, Chen S-C, Andersson AK, Phillips LA, Li Y, Sotzen J, Kundu M, Downing JR, Melnick A, Mullighan CG: Integrated genetic and epigenetic analysis of childhood acute lymphoblastic leukemia. J Clin Invest. 2013, 123: 3099-3111. 10.1172/JCI66203.
Bernstein BE, Stamatoyannopoulos JA, Costello JF, Ren B, Milosavljevic A, Meissner A, Kellis M, Marra MA, Beaudet AL, Ecker JR, Farnham PJ, Hirst M, Lander ES, Mikkelsen TS, Thomson JA: The NIH roadmap epigenomics mapping consortium. Nat Biotechnol. 2010, 28: 1045-1048. 10.1038/nbt1010-1045.
ENCODE Project Consortium: A user’s guide to the encyclopedia of DNA elements (ENCODE). Plos Biol. 2011, 9: e1001046-10.1371/journal.pbio.1001046.
Nordlund J, Kiialainen A, Karlberg O, Berglund EC, Göransson-Kultima H, Sønderkær M, Nielsen KL, Gustafsson MG, Behrendtz M, Forestier E, Perkkiö M, Söderhäll S, Lönnerholm G, Syvänen AC: Digital gene expression profiling of primary acute lymphoblastic leukemia cells. Leukemia. 2012, 26: 1218-1227. 10.1038/leu.2011.358.
Bostick M, Kim JK, Esteve PO, Clark A, Pradhan S, Jacobsen SE: UHRF1 plays a role in maintaining DNA methylation in mammalian cells. Science. 2007, 317: 1760-1764. 10.1126/science.1147939.
Watson PJ, Fairall L, Schwabe JWR: Nuclear hormone receptor co-repressors: structure and function. Mol Cell Endocrinol. 2012, 348: 440-449. 10.1016/j.mce.2011.08.033.
Ross ME, Zhou X, Song G, Shurtleff SA, Girtman K, Williams WK, Liu HC, Mahfouz R, Raimondi SC, Lenny N, Patel A, Downing J: Classification of pediatric acute lymphoblastic leukemia by gene expression profiling. Blood. 2003, 102: 2951-2959. 10.1182/blood-2003-01-0338.
Tibshirani R, Hastie T, Narasimhan B, Chu G: Class prediction by nearest shrunken centroids, with applications to DNA microarrays. Stat Sci. 2003, 18: 104-117. 10.1214/ss/1056397488.
Patzke S, Lindeskog M, Munthe E, Aasheim HC: Characterization of a novel human endogenous retrovirus, HERV-H/F, expressed in human leukemia cell lines. Virology. 2002, 303: 164-173. 10.1006/viro.2002.1615.
Holmfeldt L, Wei L, Diaz-Flores E, Walsh M, Zhang J, Ding L, Payne-Turner D, Churchman M, Andersson A, Chen SC, McCastlain K, Becksfort J, Ma J, Wu G, Patel SN, Heatley SL, Phillips LA, Song G, Easton J, Parker M, Chen X, Rusch M, Boggs K, Vadodaria B, Hedlund E, Drenberg C, Baker S, Pei D, Cheng C, Huether R, et al: The genomic landscape of hypodiploid acute lymphoblastic leukemia. Nat Genet. 2013, 45: 242-252. 10.1038/ng.2532.
Staffas A, Kanduri M, Hovland R, Rosenquist R, Ommen HB, Abrahamsson J, Forestier E, Jahnukainen K, Jónsson ÓG, Zeller B, Palle J, Lönnerholm G, Hasle H, Palmqvist L, Ehrencrona H, Nordic Society of Pediatric Hematology and Oncology (NOPHO): Presence of FLT3-ITD and high BAALC expression are independent prognostic markers in childhood acute myeloid leukemia. Blood. 2011, 118: 5905-5913. 10.1182/blood-2011-05-353185.
Lin PC, Giannopoulou EG, Park K, Mosquera JM, Sboner A, Tewari AK, Garraway LA, Beltran H, Rubin MA, Elemento O: Epigenomic alterations in localized and advanced prostate cancer. Neoplasia. 2013, 15: 373-383.
Milani L, Lundmark A, Nordlund J, Kiialainen A, Flaegstad T, Jonmundsson G, Kanerva J, Schmiegelow K, Gunderson KL, Lonnerholm G, Syvanen AC: Allele-specific gene expression patterns in primary leukemic cells reveal regulation of gene expression by CpG site methylation. Genome Res. 2009, 19: 1-11.
Ohm JE, McGarvey KM, Yu X, Cheng L, Schuebel KE, Cope L, Mohammad HP, Chen W, Daniel VC, Yu W, Berman DM, Jenuwein T, Pruitt K, Sharkis SJ, Watkins DN, Herman JG, Baylin SB: A stem cell-like chromatin pattern may predispose tumor suppressor genes to DNA hypermethylation and heritable silencing. Nat Genet. 2007, 39: 237-242. 10.1038/ng1972.
Ernst J, Kheradpour P, Mikkelsen TS, Shoresh N, Ward LD, Epstein CB, Zhang X, Wang L, Issner R, Coyne M, Ku M, Durham T, Kellis M, Bernstein BE: Mapping and analysis of chromatin state dynamics in nine human cell types. Nature. 2011, 473: 43-49. 10.1038/nature09906.
Easwaran H, Johnstone SE, Van Neste L, Ohm J, Mosbruger T, Wang Q, Aryee MJ, Joyce P, Ahuja N, Weisenberger D, Collisson E, Zhu J, Yegnasubramanian S, Matsui W, Baylin SB: A DNA hypermethylation module for the stem/progenitor cell signature of cancer. Genome Res. 2012, 22: 837-849. 10.1101/gr.131169.111.
Ammerpohl O, Haake A, Pellissery S, Giefing M, Richter J, Balint B, Kulis M, Le J, Bibikova M, Drexler HG, Seifert M, Shaknovic R, Korn B, Küppers R, Martín-Subero JI, Siebert R: Array-based DNA methylation analysis in classical Hodgkin lymphoma reveals new insights into the mechanisms underlying silencing of B cell-specific genes. Leukemia. 2012, 26: 185-188. 10.1038/leu.2011.194.
Deneberg S, Guardiola P, Lennartsson A, Qu Y, Gaidzik V, Blanchet O, Karimi M, Bengtzén S, Nahi H, Uggla B, Tidefelt U, Höglund M, Paul C, Ekwall K, Döhner K, Lehmann S: Prognostic DNA methylation patterns in cytogenetically normal acute myeloid leukemia are predefined by stem cell chromatin marks. Blood. 2011, 118: 5573-5582. 10.1182/blood-2011-01-332353.
Gal-Yam EN, Egger G, Iniguez L, Holster H, Einarsson S, Zhang X, Lin JC, Liang G, Jones PA, Tanay A: Frequent switching of Polycomb repressive marks and DNA hypermethylation in the PC3 prostate cancer cell line. Proc Natl Acad Sci USA. 2008, 105: 12979-12984. 10.1073/pnas.0806437105.
Stumpel DJ, Schneider P, van Roon EH, Boer JM, de Lorenzo P, Valsecchi MG, de Menezes RX, Pieters R, Stam RW: Specific promoter methylation identifies different subgroups of MLL-rearranged infant acute lymphoblastic leukemia, influences clinical outcome, and provides therapeutic options. Blood. 2009, 114: 5490-5498. 10.1182/blood-2009-06-227660.
Schafer E, Irizarry R, Negi S, McIntyre E, Small D, Figueroa ME, Melnick A, Brown P: Promoter hypermethylation in MLL-r infant acute lymphoblastic leukemia: biology and therapeutic targeting. Blood. 2010, 115: 4798-4809. 10.1182/blood-2009-09-243634.
Jelinek J, Gharibyan V, Estecio MR, Kondo K, He R, Chung W, Lu Y, Zhang N, Liang S, Kantarjian HM, Cortes JE, Issa JP: Aberrant DNA methylation is associated with disease progression, resistance to imatinib and shortened survival in chronic myelogenous leukemia. PLoS One. 2011, 6: e22110-10.1371/journal.pone.0022110.
De Carvalho DD, Sharma S, You JS, Su SF, Taberlay PC, Kelly TK, Yang X, Liang G, Jones PA: DNA methylation screening identifies driver epigenetic events of cancer cell survival. Cancer Cell. 2012, 21: 655-667. 10.1016/j.ccr.2012.03.045.
Shen H, Laird PW: In epigenetic therapy, less is more. Cell Stem Cell. 2012, 10: 353-354. 10.1016/j.stem.2012.03.012.
Kulis M, Heath S, Bibikova M, Queirós AC, Navarro A, Clot G, Martínez-Trillos A, Castellano G, Brun-Heath I, Pinyol M, Barberán-Soler S, Papasaikas P, Jares P, Beà S, Rico D, Ecker S, Rubio M, Royo R, Ho V, Klotzle B, Hernández L, Conde L, López-Guerra M, Colomer D, Villamor N, Aymerich M, Rozman M, Bayes M, Gut M, Gelpí JL, et al: Epigenomic analysis detects widespread gene-body DNA hypomethylation in chronic lymphocytic leukemia. Nat Genet. 2012, 44: 1236-1242. 10.1038/ng.2443.
Schmidl C, Klug M, Boeld TJ, Andreesen R, Hoffmann P, Edinger M, Rehli M: Lineage-specific DNA methylation in T cells correlates with histone methylation and enhancer activity. Genome Res. 2009, 19: 1165-1174. 10.1101/gr.091470.109.
Aran D, Sabato S, Hellman A: DNA methylation of distal regulatory sites characterizes dysregulation of cancer genes. Genome Biol. 2013, 14: R21-10.1186/gb-2013-14-3-r21.
Esteller M: Inactivation of the DNA-repair gene MGMT and the clinical response of gliomas to alkylating agents (vol 343, pg 1350, 2000). New Engl J Med. 2000, 343: 1740-1740.
Tan AC, Jimeno A, Lin SH, Wheelhouse J, Chan F, Solomon A, Rajeshkumar NV, Rubio-Viqueira B, Hidalgo M: Characterizing DNA methylation patterns in pancreatic cancer genome. Mol Oncol. 2009, 3: 425-438. 10.1016/j.molonc.2009.03.004.
Toft N, Birgens H, Abrahamsson J, Bernell P, Griškevičius L, Hallböök H, Heyman M, Holm MS, Hulegårdh E, Klausen TW, Marquart HV, Jónsson OG, Nielsen OJ, Quist-Paulsen P, Taskinen M, Vaitkeviciene G, Vettenranta K, Åsberg A, Schmiegelow K: Risk group assignment differs for children and adults 1–45 yr with acute lymphoblastic leukemia treated by the NOPHO ALL-2008 protocol. Euro J Haematol. 2013, 90: 404-412. 10.1111/ejh.12097.
Romanish MT, Cohen CJ, Mager DL: Potential mechanisms of endogenous retroviral-mediated genomic instability in human cancer. Semin Cancer Biol. 2010, 20: 246-253. 10.1016/j.semcancer.2010.05.005.
Klco JM, Spencer DH, Lamprecht TL, Sarkaria SM, Wylie T, Magrini V, Hundal J, Walker J, Varghese N, Erdmann-Gilmore P, Lichti CF, Meyer MR, Townsend RR, Wilson RK, Mardis ER, Ley TJ: Genomic impact of transient low-dose decitabine treatment on primary AML cells. Blood. 2013, 121: 1633-1643. 10.1182/blood-2012-09-459313.
3H Biomedical [http://www.3hbiomedical.com/]
Dedeurwaerder S, Defrance M, Calonne E, Denis H, Sotiriou C, Fuks F: Evaluation of the Infinium Methylation 450K technology. Epigenomics. 2011, 3: 771-784. 10.2217/epi.11.105.
Feng J, Liu T, Zhang Y: Using MACS to identify peaks from ChIP-Seq data. Curr Protoc Bioinformatics. 2011, 34: 2.14.1-2.14.14.
UCSC Genome Browser [http://genome.ucsc.edu]
Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003, 4: 249-264. 10.1093/biostatistics/4.2.249.
Benjamini Y, Hochberg Y: Controlling the false discovery rate - a practical and powerful approach to multiple testing. J Roy Stat Soc B Met. 1995, 57: 289-300.
R Core Development Team: R: A Language and Environment for Statistical Computing. 2011, Vienna, Austria: R Foundation for Statistical Computing
This work was supported by grants from the Swedish Foundation for Strategic Research (RBc08-008; ACS, GL, MGG), the Swedish Cancer Society (CAN2010/592; ACS), the Swedish Childhood Cancer Foundation (11098; ACS), and the Swedish Research Council for Science and Technology (90559401; ACS), the Swedish Research Council FORMAS (ACS), and the Erik, Karin and Gösta Selanders Stiftelse (JN). Epigenotyping was performed at the SNP&SEQ platform in Uppsala with assistance from Ingvar Thorsteinsson. Affymetrix gene expression data were generated at the Uppsala Array Platform with assistance from Hanna Göransson and Anders Isaksson. Computational analysis was performed on resources provided by the Swedish National Infrastructure for Computing (SNIC) through the Uppsala Multidisciplinary Center for Advanced Computational Science (UPPMAX). We thank Anna-Karin Lannegård, Christina Leek, Anders Lundmark, Elin Övernäs, and Ingrid Thörn for excellent technical assistance, Lili Milani and Anna Kiialainen for help with sample procurement, and Eva Freyhult for advice on survival analysis. We especially thank our colleagues from NOPHO and the ALL patients who contributed samples to this study. This study has been approved by the NOPHO Scientific Committee as study #56.
The authors declare that they have no competing interests.
ACS, JN, and GL designed the study. GL coordinated clinical sample procurement. EF provided expertise on patient karyotypes. TF, EF, BMF, MH, AHS, RL, KS, SS, and GL provided samples and clinical information. SB, DS, and TP provided the validation cohort. MLE and LR provided control samples. PW and ECB provided expertise on genomic analyses. MGG supervised multivariate data analyses. JN and CLB performed the bioinformatics and statistical analyses. JN, ACS, CLB, PW, and GL wrote the paper. All authors read and approved the final manuscript.
Jessica Nordlund, Christofer L Bäcklin contributed equally to this work.