- Open Access
Aging of blood can be tracked by DNA methylation changes at just three CpG sites
Genome Biology volume 15, Article number: R24 (2014)
Human aging is associated with DNA methylation changes at specific sites in the genome. These epigenetic modifications may be used to track donor age for forensic analysis or to estimate biological age.
We perform a comprehensive analysis of methylation profiles to narrow down 102 age-related CpG sites in blood. We demonstrate that most of these age-associated methylation changes are reversed in induced pluripotent stem cells (iPSCs). Methylation levels at three age-related CpGs - located in the genes ITGA2B, ASPA and PDE4C - were subsequently analyzed by bisulfite pyrosequencing of 151 blood samples. This epigenetic aging signature facilitates age predictions with a mean absolute deviation from chronological age of less than 5 years. This precision is higher than age predictions based on telomere length. Variation of age predictions correlates moderately with clinical and lifestyle parameters supporting the notion that age-associated methylation changes are associated more with biological age than with chronological age. Furthermore, patients with acquired aplastic anemia or dyskeratosis congenita - two diseases associated with progressive bone marrow failure and severe telomere attrition - are predicted to be prematurely aged.
Our epigenetic aging signature provides a simple biomarker to estimate the state of aging in blood. Age-associated DNA methylation changes are counteracted in iPSCs. On the other hand, over-estimation of chronological age in bone marrow failure syndromes is indicative for exhaustion of the hematopoietic cell pool. Thus, epigenetic changes upon aging seem to reflect biological aging of blood.
Aging reflects accumulation of cellular changes, due to either stochastic defects or a regulated developmental process . This process is usually measured chronologically, although it does not perfectly correlate with time: 'biological age' is influenced by additional parameters such as genetic background, disease and lifestyle. Biomarkers for biological aging are relevant for geriatric assessment and may support the adaptation of habits to assist healthy aging . Leukocyte telomere length has been suggested as a marker for biological age . In fact, telomere attrition seems to be enhanced by various parameters, such as obesity and cigarette smoking . Several other molecular methods can be used to estimate human age, including analysis of age-dependent deletions of mitochondrial DNA  or T-cell DNA rearrangements , and protein alterations such as racemization of aspartic acid  and advanced glycation end products . However, all of these biomarkers have relatively low precision and practical limitations .
The epigenetic landscape provides new perspectives for biomarkers. In particular, DNA methylation (DNAm) is well known to change during aging . Various recent studies have demonstrated the presence of age-related CpG sites (AR-CpGs), which are either hypermethylated or hypomethylated [11–14]. These DNAm changes are significantly enriched in bivalent chromatin domain promoters  and Polycomb group protein target genes [16–18], indicating that they might be governed by a developmental process. It is still not clear how epigenetic modifications are regulated during aging or if they rather reflect an increased deviation of local DNAm levels due to loss of control at specific loci . Some CpG sites reveal almost linear DNAm changes during aging and can therefore be used for age prediction [19–21]. Bocklandt et al. described a predictor of age in saliva samples generated using DNAm profiles from 34 twin pairs, indicating that age prediction based on just a few CpG sites is feasible, although this has not been validated in an independent set of samples. More recently, Hannum et al. built a quantitative model of aging using DNAm values of 71 CpG sites that has been validated on various independent datasets. However, these studies were based on Illumina BeadChip technology, a method that requires complex bioinformatic analysis. In this study, we address the question of whether an assay based on bisulfite pyrosequencing of just a few CpG sites might be a less costly, faster, and user-friendly approach with similar accuracy to DNAm profiling approaches. To this end, we pooled publicly available DNAm profiles derived from blood samples to identify AR-CpG sites that, when combined, best predict donor age. Based on this analysis we developed an epigenetic aging signature that requires measurement of DNAm levels at only three CpG sites by bisulfite sequencing to facilitate reliable age predictions. Notably, these age predictions are influenced by clinical and lifestyle parameters, indicating that they are more indicative of biological age than chronological age.
Age-related DNAm changes in blood samples
We combined 575 DNAm profiles derived from blood cells from four different studies spanning an age range of 0 to 78 years (Table S1 in Additional file 1) [15, 16, 22, 23]. All of these DNAm profiles were generated with the HumanMethylation27 BeadChip platform, which covers 27,578 individual CpG sites . AR-CpGs with linear DNAm changes during aging were selected by Pearson correlation (either r > 0.85 or r < −0.85): 102 CpG sites passed these stringent parameters, including 58 hypomethylated and 44 hypermethylated CpGs (Figure 1a; Additional file 2).
In particular, the hypomethylated AR-CpGs were associated with genes involved in 'hematopoietic regulation', indicating that these age-related DNAm (AR-DNAm) changes reveal some tissue specificity (Table S3 in Additional file 1). Hypermethylated AR-CpGs were enriched within GC-rich sequences (Figure S1a in Additional file 1) . Predicted transcription factor binding sites within 1 kb up- and downstream of AR-CpG sites differed considerably for hypo- and hypermethylated loci (Figure S1b in Additional file 1). Furthermore, only hypermethylated CpGs were significantly enriched in regions with bivalent histone modifications in embryonic stem cells (ESCs) and the H3K27me3 (trimethylation of lysine 27 of histone 3) marker in monocytes and mononuclear cells (Figure S2 in Additional file 1) [25, 26], which has also been described before . Overall, age-associated hypermethylation occurred particularly in regions with low DNAm levels, whereas age-related hypomethylation occurred at highly methylated regions. This trend towards a moderate methylation level has recently been described by other authors and may support the notion that many AR-DNAm changes are due to epigenetic drift evoked by increasing entropy of CpG markers, which tends towards 50% (Figure S3 in Additional file 1) [19, 27]. Either way, the underlying mechanism resulting in AR-DNAm changes at specific genomic regions seems to differ for hypo- and hyper-methylated CpGs.
We trained a multivariate linear model to predict donor age based on the 102 AR-CpG sites selected by Pearson correlation. The results correlated well with chronological age with a mean absolute deviation (MAD) from chronological age of only 3.34 years (root mean square error (RMSE) = 4.26 years; R2 = 0.98; Figure 1b). These AR-CpG sites were further validated in three other publicly available datasets derived from blood samples [28–30] using the same multivariate linear model: the MAD from chronological age in these datasets was only slightly higher (GSE49904, 5.79 years; GSE41037, 5.52 years; GSE37008, 4.02 years; Figure S4a in Additional file 1). Furthermore, we considered the recently published dataset by Hannum and co-workers  of 656 DNAm profiles derived from blood samples (from donors aged 19 to 101 years). This dataset has been analyzed on the HumanMethylation450 BeadChip, which assays 485,577 CpG sites, including 99 of the 102 AR-CpG sites . When we applied our multivariate linear model to this dataset, there was a clear correlation between age prediction and chronological age (R2 = 0.71). However, the linear offset indicated a systematic bias that might be due to the three missing CpG sites or to the different assay design of the two microarray platforms . Therefore, we adjusted the multivariate regression model to facilitate age predictions based on these 99 AR-CpG sites, similar to the above-mentioned model (MAD, 4.12 years; RMSE, 5.34 years; R2 = 0.87; Figure S4b,c in Additional file 1).
Age-related DNAm changes are counteracted in pluripotent stem cells
We have recently demonstrated that senescence-associated DNAm changes - which accumulate during long-term culture of cells in vitro - can be reversed by reprogramming into induced pluripotent stem cells (iPSCs) [32, 33]. Here, we analyzed if AR-DNAm changes are also reversed in this dataset upon reprogramming: although our aging model had been trained on freshly isolated blood samples, it enabled moderate estimations of age in culture-expanded mesenchymal stromal cells as well. Notably, AR-DNAm changes were hardly affected by replicative senescence during culture expansion in vitro, indicating that AR-DNAm changes are not identical to DNAm changes acquired during in vitro culture (Figure 1c). Interestingly, CpG sites that are hypermethylated during aging are hypomethylated in pluripotent cells and vice versa. Using our multivariate model, the ESCs and iPSCs were even predicted to be of negative age (Figure 1d), which may reflect the reversal beyond the new-born state to the embryonic cell state. However, when we applied our multivariate model to a dataset with 19 undifferentiated human ESC lines and 5 iPSC lines (GSE34869) , they were predicted to have a mean donor age of −0.06 and 5.20 years, respectively (Figure S5 in Additional file 1). Thus, the deviation of ESCs and especially of iPSCs from zero might also be due to culture conditions or the comparison between different datasets. Either way, the data clearly indicate that AR-DNAm changes are, overall, reversed upon reprogramming. These findings fit nicely with other recent observations that iPSCs generated from senescent cells or centenarian donors reset telomere length, gene expression profiles, and other physiological features to those of young cells [35, 36]. In addition, our results indicate that iPSCs are rejuvenated also on the epigenetic level.
Selection of the epigenetic aging signature
Analysis of DNAm in a small subset of AR-CpGs might be sufficient for robust age predictions. Restriction to the most relevant CpGs would facilitate site-specific analysis of DNAm instead of profiling approaches. Therefore, we searched for subsets of AR-CpGs that, when combined, yield the best age predictions. AR-CpGs with the highest variation in DNAm levels were used for recursive feature elimination to select subsets of five CpG sites for testing in multivariate linear regression models. Predictions for each of these subsets were made by iterative divisions of the dataset into training and test sets (split ratios; Figure 2a). For further analysis we considered only those subsets of five CpGs that performed better than the average of models based on all 51 AR-CpG sites (Figure S6 in Additional file 1). Five specific CpGs occurred in more than 50% of the remaining subsets, indicating that they complement each other for age prediction (Figure 2b). These CpG sites are associated with EDAR-associated death domain (EDARADD), which has been associated with AR-DNAm changes before ; integrin, alpha 2b (ITGA2B); RAB36, a member of the RAS oncogene family (RAB36); phosphodiesterase 4C, cAMP specific (PDE4C); and aspartoacylase (ASPA) (Figure S7 in Additional file 1). Nevertheless, gene expression profiles indicated that expression of the five corresponding genes is hardly affected by aging (Figure S8 in Additional file 1) .
DNAm at these CpG sites was subsequently analyzed by pyrosequencing after bisulfite conversion in an independent training set derived from 82 blood samples. The sequences in the vicinity of cg09809672 (EDARADD) and cg15379633 (RAB36) were not ideal for the primer design and therefore we focused on the remaining three AR-CpGs. As expected, we observed a clear age-associated correlation for each of the three CpGs. In fact, the upstream located CpG site cg17861230 (PDE4C), which was covered by the same pyrosequencing assay, revealed an even better age-association and was therefore considered instead (Figure 2c,d). A multivariate linear regression model based on these pyrosequencing results facilitated age predictions with a MAD from chronological age of 5.4 years (RMSE, 7.2 years; Figure 2e). The corresponding equation is provided in the Material and methods section. To further simplify application of the epigenetic aging signature, we have compiled an online calculator that implements this equation .
Subsequently, we used this method on an independent validation set derived from 69 blood samples; this validation set was analyzed two months after the training set. DNAm levels at the three relevant CpGs were integrated into the above mentioned linear regression model established with the training set. Notably, the predictions for the validation set correlated even better with chronological age (MAD, 4.5 years; RMSE, 5.6 years; Figure 2f), indicating that pyrosequencing of these three CpG sites enables reliable age prediction.
AR-DNAm changes are not due to changes in the cellular composition of blood
Our epigenetic age predictions might also be influenced by differences in the cellular composition in blood that result from aging . Between the second and seventh decade a moderate decline in lymphocytes  and erythrocytes  has been described. When we analyzed DNAm of the three relevant CpG sites (cg02228185 in ASPA, cg25809905 in ITGA2B, and cg17861230 in PDE4C) in a publicly available dataset of cell type-specific DNAm profiles  the results indicated that the age predictions made using our epigenetic aging signature were not evoked by myeloid skewing (Figure S9a in Additional file 1). Furthermore, we made age predictions based on the three AR-CpGs using publicly available DNAm profiles of fractionated monocytes (CD14), T cells (CD4), granulocytes (CD16), and hematopoietic stem and progenitor cells (CD34) (GSE20242; E-MTAB-487) [15, 42]. The results demonstrated that age predictions were feasible in purified cell populations, even though the MAD from chronological age was higher (Figure S9b in Additional file 1). Alternatively, we used another dataset to determine if the percentage of monocytes, lymphocytes, neutrophils, basophils, or eosinophils correlates with predicted age and there were no clear associations (GSE37008; Figure S10 in Additional file 1) . These results indicate that AR-DNAm changes are due to intrinsic DNAm changes rather than to changes in cellular composition.
Clinical and lifestyle parameters
In an exploratory analysis to determine whether deviation of predicted age and chronological age correlated with other co-variables - such as clinical or lifestyle parameters - we focused on the 105 samples from the population-based prospective Heinz Nixdorf Recall (HNR) study . Generally, age was estimated to be higher in men and in obese people (body mass index >30). These trends are in line with previous studies demonstrating an association with telomere length [44–46], but our results were not significant, which may be explained by the relatively small sample size (Figure S11 in Additional file 1). High alcohol consumption was also associated with overestimation of age (0.091 ± 0.045 years deviation per gram alcohol consumed per day; P = 0.049). Notably, age was increasingly underestimated according to increasing number of children in women (categories, 0, 1, 2, ≥3 children; -2.52 ± 0.84 years deviation per category; P = 0.0043), which has been associated with a longer lifespan before (Figure 3) .
Age predictions correlate with telomere length
Telomere length is well known to decline during aging - an average of 39 bp per year in granulocytes  - and this approach can also be used to estimate donor age. We analyzed telomere length in 104 blood samples (from donors aged 18 to 84 years) by flow-FISH. Despite a clear inverse correlation between telomere length and chronological age, the precision of age predictions based on telomere length was relatively low (MAD, 18.2 years; RMSE, 23.1 years; Figure 4a; Figure S12a in Additional file 1). Extremely shortened telomeres have been reported in severe acquired aplastic anemia (AA) and patients with dyskeratosis congenita (DKC), both of which are associated with progressive bone marrow failure syndromes. Telomere attrition in these patients might result either from increased hematopoietic cell turnover due to autoimmune-mediated depletion of the hematopoietic stem cell pool, or, particularly in DKC, due to direct functional impairment of the telomerase complex by inactivating mutations [49, 50]. We analyzed blood from 15 AA and 5 DKC patients, which revealed significantly shorter telomeres than healthy controls (Figure S12b in Additional file 1). Notably, these samples were predicted to be significantly older than their chronological age using our epigenetic aging signature, which may reflect replicative exhaustion of the hematopoietic stem cell pool (Figure 4b,c) [18, 49].
We describe a method to predict donor age using blood samples based on DNAm at three specific CpG sites. The model is based on locus-specific pyrosequencing analysis of bisulfite-converted DNA, an approach that is relatively cost effective and does not require complicated bioinformatics. Most importantly, the precision of our method is much better than that of alternative non-epigenetic approaches such as measurement of telomere length, DNA rearrangements or protein alterations . This assay can be used for forensic analysis of blood samples - theoretically, it can even be scaled down for relatively small traces of blood. Furthermore, our method may enable estimation of biological age using blood. Such predictions may be useful for geriatric assessment and may help tailor lifestyle to improve the odds of staying healthy. We provide evidence that deviation of predicted age and chronological age might be associated with specific lifestyle parameters - for example, gender, body mass index, alcohol consumption and the number of children - but most of these effects were not significant, which might be attributed to the relatively small sample size. On the other hand, aging may vary between tissues and therefore risk factors for age-related diseases - for example, in the cardiovascular system - are not necessarily reflected in the hematopoietic system.
AR-DNAm changes were suggested to be a tissue-specific phenomenon [17, 51]. Yet, we and other authors have demonstrated that similar - but not identical - DNAm changes are acquired in different tissues [15, 16, 19, 21]. The epigenetic aging signature described here has been specifically designed for use with blood, which is advantageous for practical diagnostics. Isolation of specific subsets, such as B cells, T cells, or neutrophils, would be difficult to implement in daily routine, even if it would further increase the precision of the method. Another advantage is that DNA is relatively stable and can be shipped at room temperature. Notably, the method even enabled age predictions using samples that had been cryopreserved for more than 10 years.
It is still not clear if and how AR-DNAm changes are controlled. Our data support the notion that epigenetic drift - in terms of Shannon entropy - occurs in certain regions that trend towards a DNAm level of 50% . This may indicate that AR-DNAm changes are due to loss of DNAm maintenance at specific regions; hypermethylation happens in regions associated with repressive histone marks  and polycomb group protein targets , whereas hypomethylation may be enriched with other histone modifications, transcription factor binding sites or chromatin organizations. Whether AR-DNAm changes are the cause or consequence of aging also remains to be elucidated. Even though many of these AR-DNAm changes, including of the CpGs from our epigenetic aging signature, are not reflected in gene expression levels, it is well conceivable that the global epigenetic shift that occurs during aging entails loss of function in the elderly .
On the other hand, we provide evidence that AR-DNAm changes can be reversed by reprogramming into iPSCs. We have previously described that senescence-associated DNAm changes, which are acquired during long-term culture of mesenchymal stromal cells in vitro, are counteracted in iPSCs . We have also demonstrated that AR-DNAm changes and senescence-associated DNAm changes differ considerably despite significant overlap . This is in line with findings of this study as 102 AR-CpGs could not separate mesenchymal stromal cells into those of early and late passage. The observation that AR-DNAm changes - as well as senescence-associated DNAm changes - are reset to ground state supports the notion that pluripotency resets the aging clock: iPSCs generated from senescent cells and cells from centenarian donors have restored telomere length and their gene expression and cellular physiology appear to be indistinguishable from those of ESCs [35, 36]. Notably, these cells can give rise to new organisms that - at the beginning - do not reveal any signs of aging. It is still not clear whether this 'epigenetic rejuvenation' can be disengaged from developmental reprogramming or whether it is inherently associated with passing through the pluripotent state . The observation that AR-DNAm changes can be reversed indicates that, molecularly, their acquisition over a lifetime might be avoided and this is compatible with the evolutionary theories of aging . In this regard, aging would not resemble inevitable accumulation of genetic aberrations, but rather a continuous deviation from the epigenetic ground state - a loss of control that is not counteracted in somatic cells and might be species- and tissue-specific.
Bone marrow failure in acquired AA and DKC has been suggested to be associated with extensive proliferation of hematopoietic stem cells [49, 55]. DKC resembles an hereditary syndrome evoked by mutations in the telomerase complex or telomere-associated proteins whereas AA is acquired due to autoimmune processes, toxic compounds, or unknown factors. The degree of telomere shortening in AA is correlated with disease duration, stage and severity as well as with response to disease-modifying treatment strategies . In this regard, it is striking that those samples with stark telomere attrition in particular are also predicted to be prematurely aged by our epigenetic aging signature. This observation further substantiates our method, which may provide a diagnostic measure for the replicative exhaustion of the hematopoietic stem cell pool.
Our comprehensive analysis of DNAm profiles discerned AR-CpGs in blood. DNAm at three of these CpGs - determined, for example, by bisulfite pyrosequencing - can be used as a biomarker to enable age predictions with a MAD from chronological age of about 5 years. The notion that bone marrow failure syndromes, which are associated with telomere attrition, reveal premature aging also at the epigenetic level may indicate that AR-CpGs depict exhaustion of the hematopoietic stem cell pool. Although our exploratory analysis suggests some association with clinical or lifestyle parameters, it is still not known if this analysis reflects biological age of the organism or rather of the hematopoietic system. It is also not yet clear how AR-DNAm changes, which seem to occur in a coordinated and reversible manner, are governed and if they are functionally relevant. AR-DNAm changes may impact on chromatin structure or non-coding RNAs even without having an immediate impact on gene expression. Our epigenetic aging signature provides a simple approach that can be used to track the aging process, which may be useful when further trying to detail the underlying molecular mechanisms.
Materials and methods
We used blood samples from the HNR study, which is a prospective population-based cohort study (105 samples) [43, 56] from the Department of Obstetrics and Gynecology of the University Hospital Aachen (GYN; 27 samples), and from the Department of Oncology, Hematology and Stem Cell Transplantation of the University Hospital Aachen (HEM; 104 samples from healthy donors, 15 AA, 5 DKC). All samples were taken after written consent and according to the guidelines of the local ethics committees.
DNAm profiles and selection of AR-CpGs
We considered all DNAm profiles derived from blood samples that were generated with the HumanMethylation27 BeadChip platform and available at the time (Table S1 in Additional file 1). Beta values were combined and 102 AR-CpG sites were selected by Pearson correlation (r > 0.85 or r < −0.85). We trained a multivariate linear model for these AR-CpGs and applied leave-one-out cross-validation to estimate the model's performance. Age association of these CpGs was then tested using datasets GSE49904 , GSE41037 , GSE37008 , GSE34869 , and GSE20242 , and E-MTAB-487 . We also used dataset GSE40279 , although this was generated with the HumanMethylation450 BeadChip (different assay design with type II bead type and only 99 of 102 AR-CpGs)  and we therefore trained an alternative model for use with this platform.
Nucleotide sequences around each AR-CpG were retrieved from the human NCBI36/hg18 assembly. The frequency of nucleotides in the flanking regions was determined for 10 up- and downstream positions. FIMO (Find Individual Motif Occurrences) from the MEME Suite was utilized to scan for known transcription factor binding motifs within 1 kb flanking each CpG sites. The five most significantly enriched motifs in relation to all CpG sites on the array were depicted (P-values were estimated by Fisher’s exact test). The sequence logo plots were generated by the R seqLogo packages. Histone modifications at AR-CpG loci were analyzed using chromatin immunoprecipitation data for ESCs  (GSE8463 ; GSE29611, ENCODE project), CD14+ monocytes (GSE29611, ENCODE project), and MNCs (GSE31755, ENCODE project). Enrichment of H3K4me3, H3K27me3, the bivalent state or neither was estimated in relation to all CpG sites (Fisher’s exact test). Gene Ontology classification of genes associated with AR-CpGs was performed with GoMiner software  in relation to all CpGs on the BeadChip (Fisher’s exact test). For the five most important AR-CpGs, we analyzed gene expression data from the Leiden Longevity Study (150 samples; GSE16717) , which were generated with the 54 k CodeLink Human Whole Genome Bioarray. Methylation profiles of our top three AR-CpGs in subsets of different blood cells were analyzed using another dataset (GSE35069) .
Selection of CpGs for the epigenetic aging signature
To obtain a small set of AR-CpGs for age predictions based on locus-specific DNAm levels, we narrowed 102 AR-CpGs down to those with above median variations in beta values (above median level of interquartile ranges). The dataset was then randomly divided into training and validation sets by defined split ratios. Next we applied recursive feature elimination implemented in the R caret package  for a limited feature size of five CpGs using a linear model on the training set. Relevant CpG sites were then selected by their frequency in the best performing models (Figure S6 in Additional file 1). The five most relevant AR-CpGs were subsequently considered for locus-specific DNAm analysis by pyrosequencing. The sequences surrounding cg02228185 (ASPA), cg25809905 (ITGA2B), and cg17861230 (PDE4C) were best suited for this approach.
Pyrosequencing and age predictions
Genomic DNA was isolated from GYN samples with the QIAamp DNA Blood Midi Kit (QIAGEN, Hilden, Germany; GYN), from HEM samples with the DNeasy Mini Kit (QIAGEN), and from the HNR study samples using the Chemagic Magnetic Separation Module I (Chemagen, Baesweiler, Germany). Subsequently, 500 ng DNA were bisulfite-converted using the EpiTect Bisulfite Kit (QIAGEN). Converted DNA was amplified and 12 μl (ASPA, ITGA2B) or 20 to 25 μl (PDE4C) of PCR product was immobilized to 2 μl Streptavidin Sepharose™ HP beads (GE Healthcare, Piscataway, NJ, USA) followed by annealing to 1.0 μl sequencing primer (5 μM) for 2 minutes at 80°C. Primers for pyrosequencing analysis are listed in Table S4 in Additional file 1. Analysis was performed with PyroMark Q24 software. Initially, pyrosequencing was performed for 82 blood samples (27 GYN and 55 HNR) at Varionostic GmbH (Ulm, Germany); these samples were entirely independent from the DNAm profiles that were used to identify the three genomic locations. Based on the pyrosequencing results of the initial 82 blood samples, we generated the multivariate model. Beta values at the following three CpGs were used for age-prediction: (α) = cg02228185; (β) = cg25809905, and (γ) = a CpG site upstream of cg17861230 that revealed better correlation with age (Figure 2c).
This equation resembles also the underlying source code for the freely available online calculator . This simple model was subsequently validated with an independent set of 69 samples (19 HEM and 50 HNR).
Biostatistics on clinical parameters
To estimate the impact of co-variables (for example, lifestyle or clinical parameters), we calculated univariate linear regression models using the deviation of predicted age and chronological age as dependent variable and the respective co-variable as independent variable (SAS, version 9.2, Cary, New Jersey, USA). Similar models were computed separately for males and females to assess gender differences. Some 52 male and 53 female donors from different age categories (intervals of 6 years per age category) were selected randomly from the HNR study (4,814 participants) . Box-and-whisker plots (10 to 90 percentiles) were plotted with GraphPad Prism 5 (GraphPad Software, La Jolla, CA, USA). Statistical significance was assessed from statistical significance of parameter estimates of the linear regression model depicting the change in deviation per one unit increase in the corresponding co-variable.
Analysis of telomere length and age prediction
Telomere length of granulocytes and lymphocytes was analyzed in 104 samples from healthy donors aged 18 to 84 years, 15 patients with AA, and 5 with DKC (all HEM) by flow-FISH as described before [59, 60]. In brief, samples were analyzed in triplicates with and without FITC-(C3TA2)3 PNA or Alexa488-(C3TA2) PNA (for healthy controls or AA and DKC, respectively; Panagene, Daejeon, South Korea). Cow thymocytes with known telomere length were used as an internal control to calculate telomere length in kilobases. The cow thymocytes as well as granulocytes and lymphocytes from human samples were identified based on forward scatter and LDS 751 binding to double-stranded DNA. For flow-FISH, telomere length was determined in absolute values. Age-related telomere length was estimated by linear regression on the 104 blood samples from healthy donors.
age-related CpG site
age-related DNA methylation
embryonic stem cell
fluorescent in situ hybridization
Heinz Nixdorf Recall
induced pluripotent stem cell
mean absolute deviation
root mean square error.
Garinis GA, van der Horst GT, Vijg J, Hoeijmakers JH: DNA damage and ageing: new-age ideas for an age-old problem. Nat Cell Biol. 2008, 10: 1241-1247. 10.1038/ncb1108-1241.
Song Z, Von FG, Liu Y, Kraus JM, Torrice C, Dillon P, Rudolph-Watabe M, Ju Z, Kestler HA, Sanoff H, Lenhard RK: Lifestyle impacts on the aging-associated expression of biomarkers of DNA damage and telomere dysfunction in human blood. Aging Cell. 2010, 9: 607-615. 10.1111/j.1474-9726.2010.00583.x.
Blasco MA: Telomeres and human disease: ageing, cancer and beyond. Nat Rev Genet. 2005, 6: 611-622. 10.1038/nrg1656.
Valdes AM, Andrew T, Gardner JP, Kimura M, Oelsner E, Cherkas LF, Aviv A, Spector TD: Obesity, cigarette smoking, and telomere length in women. Lancet. 2005, 366: 662-664. 10.1016/S0140-6736(05)66630-5.
Cortopassi GA, Shibata D, Soong NW, Arnheim N: A pattern of accumulation of a somatic deletion of mitochondrial DNA in aging human tissues. Proc Natl Acad Sci U S A. 1992, 89: 7370-7374. 10.1073/pnas.89.16.7370.
Zubakov D, Liu F, van Zelm MC, Vermeulen J, Oostra BA, van Duijn CM, Driessen GJ, van Dongen JJ, Kayser M, Langerak AW: Estimating human age from T-cell DNA rearrangements. Curr Biol. 2010, 20: R970-R971. 10.1016/j.cub.2010.10.022.
Helfman PM, Bada JL: Aspartic acid racemisation in dentine as a measure of ageing. Nature. 1976, 262: 279-281.
Odetti P, Rossi S, Monacelli F, Poggi A, Cirnigliaro M, Federici M, Federici A: Advanced glycation end products and bone loss during aging. Ann N Y Acad Sci. 2005, 1043: 710-717. 10.1196/annals.1333.082.
Meissner C, Ritz-Timme S: Molecular pathology and age estimation. Forensic Sci Int. 2010, 203: 34-43. 10.1016/j.forsciint.2010.07.010.
Fraga MF, Esteller M: Epigenetics and aging: the targets and the marks. Trends Genet. 2007, 23: 413-418. 10.1016/j.tig.2007.05.008.
Fraga MF, Ballestar E, Paz MF, Ropero S, Setien F, Ballestar ML, Heine-Suner D, Cigudosa JC, Urioste M, Benitez J, Boix-Chornet M, Sanchez-Aguilera A, Ling C, Carlsson E, Poulsen P, Vaag A, Stephan Z, Spector TD, Wu YZ, Plass C, Esteller M: Epigenetic differences arise during the lifetime of monozygotic twins. Proc Natl Acad Sci U S A. 2005, 102: 10604-10609. 10.1073/pnas.0500398102.
Christensen BC, Houseman EA, Marsit CJ, Zheng S, Wrensch MR, Wiemels JL, Nelson HH, Karagas MR, Padbury JF, Bueno R, Sugarbaker DJ, Yeh RF, Wiencke JK, Kelsey KT: Aging and environmental exposures alter tissue-specific DNA methylation dependent upon CpG island context. PLoS Genet. 2009, 5: e1000602-10.1371/journal.pgen.1000602.
Alisch RS, Barwick BG, Chopra P, Myrick LK, Satten GA, Conneely KN, Warren ST: Age-associated DNA methylation in pediatric populations. Genome Res. 2012, 22: 623-632. 10.1101/gr.125187.111.
Bork S, Pfister S, Witt H, Horn P, Korn B, Ho AD, Wagner W: DNA methylation pattern changes upon long-term culture and aging of human mesenchymal stromal cells. Aging Cell. 2010, 9: 54-63. 10.1111/j.1474-9726.2009.00535.x.
Rakyan VK, Down TA, Maslau S, Andrew T, Yang TP, Beyan H, Whittaker P, McCann OT, Finer S, Valdes AM, Leslie RD, Deloukas P, Spector TD: Human aging-associated DNA hypermethylation occurs preferentially at bivalent chromatin domains. Genome Res. 2010, 20: 434-439. 10.1101/gr.103101.109.
Teschendorff AE, Menon U, Gentry-Maharaj A, Ramus SJ, Weisenberger DJ, Shen H, Campan M, Noushmehr H, Bell CG, Maxwell AP, Savage DA, Mueller-Holzner E, Marth C, Kocjan G, Gayther SA, Jones A, Beck S, Wagner W, Laird PW, Jacobs IJ, Widschwendter M: Age-dependent DNA methylation of genes that are suppressed in stem cells is a hallmark of cancer. Genome Res. 2010, 20: 440-446. 10.1101/gr.103606.109.
Maegawa S, Hinkal G, Kim HS, Shen L, Zhang L, Zhang J, Zhang N, Liang S, Donehower LA, Issa JP: Widespread and tissue specific age-related DNA methylation changes in mice. Genome Res. 2010, 20: 332-340. 10.1101/gr.096826.109.
Beerman I, Bock C, Garrison BS, Smith ZD, Gu H, Meissner A, Rossi DJ: Proliferation-dependent alterations of the DNA methylation landscape underlie hematopoietic stem cell aging. Cell Stem Cell. 2013, 12: 413-425. 10.1016/j.stem.2013.01.017.
Hannum G, Guinney J, Zhao L, Zhang L, Hughes G, Sadda S, Klotzle B, Bibikova M, Fan JB, Gao Y, Deconde R, Chen M, Rajapakse I, Friend S, Ideker T, Zhang K: Genome-wide methylation profiles reveal quantitative views of human aging rates. Mol Cell. 2013, 49: 359-367. 10.1016/j.molcel.2012.10.016.
Bocklandt S, Lin W, Sehl ME, Sanchez FJ, Sinsheimer JS, Horvath S, Vilain E: Epigenetic predictor of age. PLoS ONE. 2011, 6: e14821-10.1371/journal.pone.0014821.
Koch CM, Wagner W: Epigenetic-aging-signature to determine age in different tissues. Aging (Albany NY). 2011, 3: 1018-1027.
Chen YA, Choufani S, Ferreira JC, Grafodatskaya D, Butcher DT, Weksberg R: Sequence overlap between autosomal and sex-linked probes on the Illumina HumanMethylation27 microarray. Genomics. 2011, 97: 214-222. 10.1016/j.ygeno.2010.12.004.
Adkins RM, Thomas F, Tylavsky FA, Krushkal J: Parental ages and levels of DNA methylation in the newborn are correlated. BMC Med Genet. 2011, 12: 47-
Bibikova M, Le J, Barnes R, Saedinia-Melnyk S, Shou L, Zhen R, Gunderson KL: Genome-wide DNA methylation profiling using Infinium assay. Epigenomics. 2009, 1: 177-200. 10.2217/epi.09.14.
Ku M, Koche RP, Rheinbay E, Mendenhall EM, Endoh M, Mikkelsen TS, Presser A, Nusbaum C, Xie X, Chi AS, Adli M, Kasif S, Ptaszek LM, Cowan CA, Lander ES, Koseki H, Bernstein BE: Genomewide analysis of PRC1 and PRC2 occupancy identifies two classes of bivalent domains. PLoS Genet. 2008, 4: e1000242-10.1371/journal.pgen.1000242.
Pan G, Tian S, Nie J, Yang C, Ruotti V, Wei H, Jonsdottir GA, Stewart R, Thomson JA: Whole-genome analysis of histone H3 lysine 4 and lysine 27 methylation in human embryonic stem cells. Cell Stem Cell. 2007, 1: 299-312. 10.1016/j.stem.2007.08.003.
Teschendorff AE, West J, Beck S: Age-associated epigenetic drift: implications, and a case of epigenetic thrift?. Hum Mol Genet. 2013, 22: R7-R15. 10.1093/hmg/ddt375.
Day K, Waite LL, Thalacker-Mercer A, West A, Bamman MM, Brooks JD, Myers RM, Absher D: Differential DNA methylation with age displays both common and dynamic features across human tissues that are influenced by CpG landscape. Genome Biol. 2013, 14: R102-10.1186/gb-2013-14-9-r102.
Horvath S, Zhang Y, Langfelder P, Kahn RS, Boks MP, Van EK, van den Berg LH, Ophoff RA: Aging effects on DNA methylation modules in human brain and blood tissue. Genome Biol. 2012, 13: R97-10.1186/gb-2012-13-10-r97.
Lam LL, Emberly E, Fraser HB, Neumann SM, Chen E, Miller GE, Kobor MS: Factors underlying variable DNA methylation in a human community cohort. Proc Natl Acad Sci U S A. 2012, 109: 17253-17260. 10.1073/pnas.1121249109.
Bibikova M, Barnes B, Tsan C, Ho V, Klotzle B, Le JM, Delano D, Zhang L, Schroth GP, Gunderson KL, Fan JB, Shen R: High density DNA methylation array with single CpG site resolution. Genomics. 2011, 98: 288-295. 10.1016/j.ygeno.2011.07.007.
Koch CM, Joussen S, Schellenberg A, Lin Q, Zenke M, Wagner W: Monitoring of cellular senescence by DNA-methylation at specific CpG sites. Aging Cell. 2012, 11: 366-369. 10.1111/j.1474-9726.2011.00784.x.
Koch C, Reck K, Shao K, Lin Q, Joussen S, Ziegler P, Walenda G, Drescher W, Opalka B, May T, Brummendorf T, Zenke M, Saric T, Wagner W: Pluripotent stem cells escape from senescence-associated DNA methylation changes. Genome Res. 2013, 23: 248-259. 10.1101/gr.141945.112.
Mallon BS, Chenoweth JG, Johnson KR, Hamilton RS, Tesar PJ, Yavatkar AS, Tyson LJ, Park K, Chen KG, Fann YC, McKay RD: StemCellDB: the human pluripotent stem cell database at the National Institutes of Health. Stem Cell Res. 2013, 10: 57-66. 10.1016/j.scr.2012.09.002.
Marion RM, Strati K, Li H, Tejera A, Schoeftner S, Ortega S, Serrano M, Blasco MA: Telomeres acquire embryonic stem cell characteristics in induced pluripotent stem cells. Cell Stem Cell. 2009, 4: 141-154. 10.1016/j.stem.2008.12.010.
Lapasset L, Milhavet O, Prieur A, Besnard E, Babled A, Ait-Hamou N, Leschik J, Pellestor F, Ramirez JM, De VJ, Lehmann S, Lemaitre JM: Rejuvenating senescent and centenarian human cells by reprogramming through the pluripotent state. Genes Dev. 2011, 25: 2248-2253. 10.1101/gad.173922.111.
Passtoors WM, Boer JM, Goeman JJ, Akker EB, Deelen J, Zwaan BJ, Scarborough A, Breggen R, Vossen RH, Houwing-Duistermaat JJ, Ommen GJ, Westendorp RG, Heemst D, Craen AJ, White AJ, Gunn DA, Beekman M, Slagboom PE: Transcriptional profiling of human familial longevity indicates a role for ASF1A and IL7R. PLoS ONE. 2012, 7: e27759-10.1371/journal.pone.0027759.
Online Calculator for the epigenetic aging signature. [http://www.molcell.rwth-aachen.de/epigenetic-aging-signature/]
MacKinney AA: Effect of aging on the peripheral blood lymphocyte count. J Gerontol. 1978, 33: 213-216. 10.1093/geronj/33.2.213.
Mahlknecht U, Kaiser S: Age-related changes in peripheral blood counts in humans. Exp Ther Med. 2010, 1: 1019-1025.
Reinius LE, Acevedo N, Joerink M, Pershagen G, Dahlen SE, Greco D, Soderhall C, Scheynius A, Kere J: Differential DNA methylation in purified human blood cells: implications for cell lineage and studies on disease susceptibility. PLoS ONE. 2012, 7: e41361-10.1371/journal.pone.0041361.
Bocker MT, Hellwig I, Breiling A, Eckstein V, Ho AD, Lyko F: Genome-wide promoter DNA methylation dynamics of human hematopoietic progenitor cells during differentiation and aging. Blood. 2011, 117: e182-e189. 10.1182/blood-2011-01-331926.
Schmermund A, Möhlenkamp S, Stang A, Grönemeyer D, Seibel R, Hirche H, Mann K, Siffert W, Lauterbach K, Siegrist J, Jöckel KH, Erbel R: Assessment of clinically silent atherosclerotic disease and established and novel risk factors for predicting myocardial infarction and cardiac death in healthy middle-aged subjects: rationale and design of the Heinz Nixdorf Recall Study. Risk Factors, Evaluation of Coronary Calcium and Lifestyle. Am Heart J. 2002, 144: 212-218. 10.1067/mhj.2002.123579.
Flegal KM, Kit BK, Orpana H, Graubard BI: Association of all-cause mortality with overweight and obesity using standard body mass index categories: a systematic review and meta-analysis. JAMA. 2013, 309: 71-82. 10.1001/jama.2012.113905.
Pavanello S, Hoxha M, Dioni L, Bertazzi PA, Snenghi R, Nalesso A, Ferrara SD, Montisci M, Baccarelli A: Shortened telomeres in individuals with abuse in alcohol consumption. Int J Cancer. 2011, 129: 983-992. 10.1002/ijc.25999.
Lee M, Martin H, Firpo MA, Demerath EW: Inverse association between adiposity and telomere length: The Fels Longitudinal Study. Am J Hum Biol. 2011, 23: 100-106. 10.1002/ajhb.21109.
Kuningas M, Altmae S, Uitterlinden AG, Hofman A, van Duijn CM, Tiemeier H: The relationship between fertility and lifespan in humans. Age (Dordr). 2011, 33: 615-622. 10.1007/s11357-010-9202-4.
Rufer N, Brummendorf TH, Kolvraa S, Bischoff C, Christensen K, Wadsworth L, Schulzer M, Lansdorp PM: Telomere fluorescence measurements in granulocytes and T lymphocyte subsets point to a high turnover of hematopoietic stem cells and memory T cells in early childhood. J Exp Med. 1999, 190: 157-167. 10.1084/jem.190.2.157.
Brummendorf TH, Balabanov S: Telomere length dynamics in normal hematopoiesis and in disease states characterized by increased stem cell turnover. Leukemia. 2006, 20: 1706-1716. 10.1038/sj.leu.2404339.
Calado RT, Young NS: Telomere diseases. N Engl J Med. 2009, 361: 2353-2365. 10.1056/NEJMra0903373.
Thompson RF, Atzmon G, Gheorghe C, Liang HQ, Lowes C, Greally JM, Barzilai N: Tissue-specific dysregulation of DNA methylation in aging. Aging Cell. 2010, 9: 506-518. 10.1111/j.1474-9726.2010.00577.x.
Liu L, Rando TA: Manifestations and mechanisms of stem cell aging. J Cell Biol. 2011, 193: 257-266. 10.1083/jcb.201010131.
Rando TA, Chang HY: Aging, rejuvenation, and epigenetic reprogramming: resetting the aging clock. Cell. 2012, 148: 46-57. 10.1016/j.cell.2012.01.003.
Kirkwood TB: Understanding the odd science of aging. Cell. 2005, 120: 437-447. 10.1016/j.cell.2005.01.027.
Brummendorf TH, Maciejewski JP, Mak J, Young NS, Lansdorp PM: Telomere length in leukocyte subpopulations of patients with aplastic anemia. Blood. 2001, 97: 895-900. 10.1182/blood.V97.4.895.
Erbel R, Möhlenkamp S, Möbus S, Schmermund A, Lehmann N, Stang A, Dragano N, Grönemeyer D, Seibel R, Kälsch H, Bröcker-Preuss M, Mann K, Siegrist J, Jöckel KH: Coronary risk stratification, discrimination, and reclassification improvement based on quantification of subclinical coronary atherosclerosis: the Heinz Nixdorf Recall study. J Am Coll Cardiol. 2010, 56: 1397-1406. 10.1016/j.jacc.2010.06.030.
Zeeberg BR, Feng W, Wang G, Wang MD, Fojo AT, Sunshine M, Narasimhan S, Kane DW, Reinhold WC, Lababidi S, Bussey KJ, Riss J, Barrett JC, Weinstein JN: GoMiner: a resource for biological interpretation of genomic and proteomic data. Genome Biol. 2003, 4: R28-10.1186/gb-2003-4-4-r28.
Kuhn M: Building predictive models in R using the caret package. J Statist Software. 2008, 28: 1-26.
Baerlocher GM, Vulto I, De JG, Lansdorp PM: Flow cytometry and FISH to measure the average length of telomeres (flow FISH). Nat Protoc. 2006, 1: 2365-2376. 10.1038/nprot.2006.263.
Beier F, Balabanov S, Buckley T, Dietz K, Hartmann U, Rojewski M, Kanz L, Schrezenmeier H, Brummendorf TH: Accelerated telomere shortening in glycosylphosphatidylinositol (GPI)-negative compared with GPI-positive granulocytes from patients with paroxysmal nocturnal hemoglobinuria (PNH) detected by proaerolysin flow-FISH. Blood. 2005, 106: 531-533. 10.1182/blood-2004-10-3996.
We thank those researchers who made microarray data used in this study available to the scientific community. We thank the Heinz Nixdorf Foundation, Germany (Chairman Martin Nixdorf, Past Chairman Dr Jur. G Schmidt (deceased)), for their generous support of this study. The authors are indebted to all the study participants and to the dedicated personnel of the study centre of the Heinz Nixdorf Recall Study. This work was supported by the Stem Cell Network North Rhine Westphalia (WW) and by the Else-Kröner Fresenius Stiftung (WW and CK).
The authors declare that they have no competing interests. RWTH Aachen University has applied for a patent for the epigenetic aging signature.
CIW, QL, CMK, LE, FB, PZ, and WW designed research, and analyzed and interpreted data; CIW, QL, CMK, LE, FB, and PZ performed experiments; DOB, KJ, RE, TWM, THB, and MZ contributed vital reagents; CIW and WW wrote the manuscript, and all authors provided input to the manuscript. All authors read and approved the final manuscript.
Carola Ingrid Weidner, Qiong Lin contributed equally to this work.
Electronic supplementary material
Additional file 1: Tables S1: S3 and S4 and Figures S1 to S12. Table S1. DNAm profiles for selection of AR-CpGs. Table S3. Gene Ontology analysis of the 102 AR-CpG sites. Table S4. primers used for pyrosequencing. Figure S1. nucleotides and motifs near AR-CpGs. Figure S2. enrichment of histone modifications near AR-CpGs. Figure S3. DNAm level in age-related hypo- or hypermethylation. Figure S4. analysis of AR-CpG sites in an independent dataset. Figure S5. age prediction in ESCs and iPSCs. Figure S6. flowchart for selection of the epigenetic aging signature. Figure S7. DNAm level at CpGs in the neighborhood of five AR-CpG sites. Figure S8. gene expression of selected genes with age-related CpG sites. Figure S9. DNAm level in different blood subsets. Figure S10. influence of blood cell composition on age prediction. Figure S11. effect of clinical and lifestyle parameters on age predictions. Figure S12. age predictions based on telomere length. (PDF 4 MB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
About this article
Cite this article
Weidner, C.I., Lin, Q., Koch, C.M. et al. Aging of blood can be tracked by DNA methylation changes at just three CpG sites. Genome Biol 15, R24 (2014). https://doi.org/10.1186/gb-2014-15-2-r24
- Root Mean Square Error
- Telomere Length
- Aplastic Anemia
- Mesenchymal Stromal Cell
- Mean Absolute Deviation
View archived comments (2)