4D nucleomes in single cells: what can computational modeling reveal about spatial chromatin conformation?

Sekelja, Monika; Paulsen, Jonas; Collas, Philippe

doi:10.1186/s13059-016-0923-2

Review
Open access
Published: 07 April 2016

4D nucleomes in single cells: what can computational modeling reveal about spatial chromatin conformation?

Monika Sekelja¹,
Jonas Paulsen¹ &
Philippe Collas¹

Genome Biology volume 17, Article number: 54 (2016) Cite this article

6863 Accesses
15 Citations
14 Altmetric
Metrics details

Abstract

Genome-wide sequencing technologies enable investigations of the structural properties of the genome in various spatial dimensions. Here, we review computational techniques developed to model the three-dimensional genome in single cells versus ensembles of cells and assess their underlying assumptions. We further address approaches to study the spatio-temporal aspects of genome organization from single-cell data.

Background

Increasing evidence indicates that the spatial, three-dimensional (3D) organization of chromatin influences gene expression and cell fate [1–8]. Chromosome conformation capture (3C) techniques coupled with high-throughput sequencing enable interrogations of short-range and long-range chromosomal interactions on a genome-wide scale [8–11]. One such technique, Hi-C [8], involves crosslinking (or ‘freezing’) interacting chromosome regions, fragmentation of chromatin, ligation of the crosslinked fragments, paired-end sequencing of the ligation products, and mapping of the sequence reads to a reference genome. A matrix is constructed to map read pairs that reflect contact between two chromosome regions by binning the genome and ascribing each read pair into the corresponding bin. The frequency of read pairs in each bin reflects contact frequencies between loci. These are optionally transformed into pairwise distances and used to estimate the position of these loci in a 3D space. In order to reconstitute 3D models of chromatin, interaction frequencies can directly or indirectly be used as constraints so that genomic regions with high contact frequencies are drawn to each other in the nuclear space. To improve the accuracy of 3D models of chromatin, other constraints can potentially be incorporated into structural models based on association of chromatin with known anchors in the nucleus, such as the nuclear envelope [4, 12], nuclear pore complexes [13, 14], or nucleoli [15, 16].

Most 3D genome reconstructions are performed on cell population-averaged Hi-C contact matrices [6, 8, 17–23]. The results consistently provide a hierarchical view of folding of the genome, with chromatin divided into supra-megabase compartments of transcriptionally active or inactive chromatin (the so-called A and B compartments) [6, 8] and, within these compartments, megabase-scale topologically associated domains (TADs) [7, 24, 25]. TADs show distinct boundaries, within which loci interact more frequently with one another than with loci of adjacent TADs. Unlike compartments, which can differ between cell types, TADs are more conserved [6, 8], although chromosome topology within TADs can vary [26].

The 3D conformation of chromatin is also variable between cells in a population [27, 28], presumably as a result of asynchronous gene expression patterns, epigenetic variation, and stochastic chromatin movements [29–33]. Further complicating the issue of structural variability of genomes between cells is increasing evidence suggesting that even two copies of the same chromosome in diploid cells vary in structure [26, 34]. This problem is obviously amplified for polyploid cells, such as some cancer cell types, or if one were to investigate genome structure in polyploid organisms. As discussed in this review, computational methods have been developed to address the structural variability of genomes between subpopulations of cells. Cell-to-cell heterogeneity has also been directly captured in a pioneering study by applying Hi-C to multiple single cells [35]. Other emerging single-cell, high-throughput, sequencing-based technologies provide additional evidence for cell-to-cell heterogeneity in associations of chromatin with the nuclear envelope [36], chromatin accessibility [37–39], epigenetic states [40–44], and gene expression patterns [45, 46] (Table 1).

Table 1 Overview of genome-wide high-throughput sequencing-based single-cell technologies

Full size table

The main purpose of single-cell genome conformation studies is to assess the heterogeneity in 3D chromatin structures between cells and, therefore, characterize the subpopulations of structures. In this review, we first address computational approaches that interrogate 3D chromatin structure from population-based studies; we evaluate their underlying assumptions and focus on how these methods tackle the cell-to-cell variability in 3D chromatin structures. We further examine challenges associated with inference of chromosome structures from single-cell interrogations. We address computational techniques enabling modeling the 3D genome over time and highlight how single-cell data might benefit this exercise. Finally, we summarize implications from applications of computational modeling to study the spatio-temporal (so-called ‘4D’) and functional aspects of genome organization.

Assessing genome conformation in cell populations

Virtually all 3D chromosome-conformation studies are based on the analysis of millions of cells, with no obvious way to discern conformations between cells in the population. As discussed in this section, however, computational methods are very helpful in resolving this issue. Although single-cell chromosome conformation can capture cell-to-cell chromosome structural heterogeneity [35], this approach comes with its own challenges. Before discussing these challenges, we describe two main methods to infer chromatin 3D structure from Hi-C data, namely consensus methods and deconvolution methods. We further evaluate issues in addressing the heterogeneity of chromosome structures from Hi-C data averaged from ensembles of cells.

Consensus chromosome structures inferred from population-average data

Constraint-based 3D chromosome modeling strategies primarily rely on consensus methods that aim to find a unique 3D structure averaged over many representative structures [19–23, 47] (see also recent reviews [11, 48, 49]). Consensus methods typically use a matrix of pairwise distances between genomic elements obtained by transforming Hi-C contact frequencies to visualize these elements as points in a 3D space. Many of these methods exploit the property that the distance between any two points must be smaller than or equal to the sum of distances of these two points to a third point (triangle inequality principle). However, in data from a cell population with any degree of heterogeneity, this property is not met as any given distance between two points is based on an average of multiple structures in that population [50, 51]. Thus, a structure inferred from the average of millions of cells will differ from structures derived from subpopulations of cells and will typically not represent any of the structures in individual cells [51, 52].

As a result, consensus 3D structure reconstruction methods have been devised to assess chromosomal structural heterogeneity, without estimating the population of structures per se. Semi-definite programming has been applied to identify the best consensus structure fitting Hi-C data [20]. The approach includes a test of population homogeneity that examines whether the triangle inequality assumption is met in the input distance matrix and how well the distance matrix is represented by three dimensions as opposed to a higher number of dimensions [20]. A second approach assumes that structures within TADs vary across cell subpopulations and estimates this within-TAD variation using a mixture component model [19]. In this model, each component represents a unique spatial structure of these sub-TADs, with the weight of each component defining the proportion of a given structure in the population.

The ability to evaluate heterogeneity of chromosome structures is a significant advance in genome modeling from consensus methods. However, consensus methods produce a unique structure and do not escape the (incorrect) underlying assumption that structures are homogeneous. Moreover, uncertainty remains on the source of the heterogeneity estimated: for example, a population from one cell type might be truly more heterogeneous than another or data for that cell type might simply be noisier. Thus, consensus modeling techniques do not fully capture the 3D structural heterogeneity within a cell population. They might nevertheless constitute a promising approach to unveiling 3D structures in single cells.

Deconvolution methods identify hidden substructures

Deconvolution methods assume that Hi-C and other 3C-based data arise from many chromatin substructures present in a cell population and seek to identify these substructures [53–57] (Fig. 1). These methods demultiplex the data to identify structurally plausible, unobserved substructures. Two different deconvolution strategies have been applied to date: structural deconvolution and matrix deconvolution.

Structural deconvolution methods occur at the 3D structure reconstruction level, applying constraints such as enforced intrachromosomal and interchromosomal interactions, nuclear boundary or volume exclusion [53, 54, 58, 59]. The constraints are applied such that the population as a whole should satisfy the constraints [54] or such that individual structures within the population should satisfy as many of the constraints as possible [58, 59]. In both approaches, the resulting structural ensemble can be clustered to study the underlying structural variability and sub-population constituents. By contrast, matrix deconvolution can be applied directly on contact frequency matrices using information from, for example, TADs [57, 60] or similar topological domains [61], to search for a set of contact frequency matrices that optimally reflects the proportions of each predicted substructure in the cell population. This set of matrices should, in aggregate, reflect the input data [57]. Illustrating this point, deconvolution of individual single-cell Hi-C contact matrices [35] pooled into one mixed matrix has been shown to successfully recover the single-cell Hi-C matrices [57]. Although matrix deconvolution is usually faster than structural deconvolution, the substructures recovered might not be physically plausible. A major drawback of all deconvolution methods is the requirement of extensive computational resources. It is also currently not clear how accurate current deconvolution methods really are and whether the current data allow for well-determined estimation of structural sub-populations. We refer to recent reviews addressing these methods in detail [11, 49, 61].

Although computationally more demanding, deconvolution techniques seeking an ensemble of 3D structural solutions are better suited than consensus methods to capture the inherent heterogeneity of chromosome structures in a cell population. This in turn raises the question of whether one can evaluate through deconvolution the extent of 3D structural heterogeneity in a given experimental system and use this information design for more-rationalized interrogations of 3D chromosome conformations: for instance, how futile is it to analyze high-throughput 3C data knowing that chromosome conformation is estimated to be heterogeneous, and should one rather consider single-cell approaches?

Inference of 3D chromatin structure from single-cell Hi-C contact maps

A breakthrough in the field of single-cell genomics came with the report of a protocol enabling the extraction of Hi-C contact maps from single cells by allowing several steps in the protocol to occur inside intact nuclei [35]. From each extracted single-cell contact map, 3D structures can be inferred using a simulated annealing approach [35] or other reconstruction approaches [23, 34]. Interestingly, large structural differences are noted between individual cells, the greatest variance residing in inter-TAD and interchromosomal contacts, whereas TAD structures are conserved. In addition, because male cells were purposely examined in this study (male cells only bear one X chromosome), repeated 3D reconstruction of chromosome X from a given cell should result in one unique structure. This view is, however, challenged by data sparsity and noise inherent to single-cell Hi-C experiments. As discussed below, this challenge becomes greater in the reconstruction of autosomes, present in two copies in diploid cells, owing to structural differences between these two copies.

Sparseness of single-cell Hi-C contact maps can hamper high-confidence 3D structure reconstruction

A crucial issue with single-cell Hi-C chromosome contact information is the inherent sparsity of the contact frequency maps. Approximately 2.5 % of the theoretical total possible number of ligations has been shown to be recovered in the only single-cell Hi-C study reported to date [35].

One approach to alleviate the sparsity of single-cell Hi-C data is to computationally impute the “missing data”. This can be achieved by constructing a graph based on the observed contacts (edges in the graph) and computing the shortest possible path between the missing edges (where no contact is found; Fig. 2). The imputed missing distances have, in fact, the neat property that they satisfy the triangle inequality principle [23]. While this helps circumvent the missing value problem, it might, however, introduce additional noise as the imputed values are only rudimentary estimates of the true distances.

We recently addressed the challenge of missing values and proposed a method to down-weight contacts relative to the degree of confidence in their estimates [34]. To reconstruct 3D structures from single-cell Hi-C data, a manifold-based optimization method was used that enables incorporation of such weights. To assess the implication of data sparsity on the reconstruction of 3D structures, single-cell Hi-C contact matrices were constructed in silico where 80–98 % of the entries in the matrices were randomly deleted and the ability to reconstruct the original structure from the sparsely sampled data was examined [34]. This showed that, even with 90 % missing contacts, the reconstructed structure was essentially no different from the original structure. However, from more sparse data (>95 % missing), the similarity between the original and reconstructed structures decreases dramatically [34] (Fig. 3). Comparison of these theoretical values with existing single-cell Hi-C data shows that Hi-C data are, at the current bin size resolution (e.g., 50 kb), too sparse for high-confidence chromosome structure reconstruction, even under noise-free conditions [34]. However, by enabling increased bin sizes, the robustness of structure reconstruction can be increased, although at the cost of reduced structural resolution. Increasing coverage will, therefore, either allow for higher-resolution Hi-C maps (smaller bin sizes) or enable a greater tolerance of missing data without loss of confidence in the reconstructed 3D models.

Another important limitation to Hi-C map resolution is imposed by the genomic distribution of the digestion sites of the restriction enzyme used. To overcome this, Ma and colleagues [62] have proposed using DNase I, which cleaves the DNA into fragments smaller and more frequent than those from restriction enzymes used in conventional Hi-C protocols. The increased number of fragments leads to a higher number of possible proximity ligations and might increase Hi-C map resolution with sufficient sequencing coverage. Additionally, capture arrays can be used to increase resolution for selected loci [63, 64] at the expense of compromising full-genome reconstruction.

Autosomal chromosomes further complicate the picture

The human genome consists of two copies of autosomal chromosomes, yet this has often been overlooked in Hi-C experiments because it is difficult to distinguish the two copies. Manifold-based optimization can successfully reconstruct 3D structures of a single X chromosome in male diploid cells from single-cell Hi-C data despite their relative noisiness and sparsity [34]. For autosomal chromosomes, however, reconstruction is less successful [34]. Simulation of a diploid situation by summing Hi-C contact maps of the X chromosome from two different individual cells reveals that 3D reconstruction is compromised. This provides important clues on specific challenges with genome 3D reconstruction in mixed Hi-C maps as even two a priori indistinguishable autosomes in single cells can hamper 3D reconstruction.

A possible solution might be to recover diploid maps based on haplotype information [6, 26, 65, 66]. This, however, requires information on allele-specific sequence variations (polymorphisms), which is typically not available for most cell lines. Haploid cell lines might also be suitable for modeling the spatial genome conformation in single cells, such as those used to map genome-wide chromatin nuclear lamina interactions [36] or multi-locus chromatin contacts [67].

Distinguishing biological variation from technical noise

An intriguing characteristic of genome-wide 3C-based data is that the data can be used in various kinds of analyses in addition to reconstruction of 3D structures. Importantly, not all types of analyses are prone to the effects of averaging subpopulations. For instance, when one is interested in studying 3D contacts between individual loci (e.g., contacts between promoters and enhancers), the goal is to identify the statistically enriched contacts above an average background [68–71]. To do so, a model considering both the decreased chance of contacts with increasing linear genomic distance between loci and the variance of the contact frequencies is required.

A corresponding type of analysis in single-cell Hi-C would be to consider any ligation event between two restriction fragments as a contact or rely on multiple contacts in near proximity [35]. Yet, in such an analysis, how to reliably estimate the variance of contact frequencies for a given pair of loci is not clear, so it can be more appropriate to use ensemble Hi-C to study individual contacts. One way to estimate the variance in single-cell Hi-C studies could be to base the analysis on a large aggregate of multiplexed single-cell Hi-C datasets by, for example, using DNA barcoding [40]. However, this raises the question of how many single-cell datasets are required to obtain biologically relevant insights and how much information is gained from aggregated single-cell data as opposed to ensemble-cell data.

Another type of investigation is to pre-select a set of loci and consider their mutual 3D colocalization [72, 73]. In this situation, both genomic distance and variance need to be considered in order to identify statistically significant colocalizations. As multiple loci are considered, however, the effect of averaging over subpopulations again becomes a limiting factor. In single-cell Hi-C, where the effect can be avoided, the detection of multiple colocalized loci is instead hampered by the fact that only two pairs of restriction fragments can be observed for any given interaction.

Towards the 4D nucleome

Inferring pseudotime 3D conformational trajectories from cross-sectional data

Increasing experimental evidence supports a view of local and global alterations in spatial genome conformation as cellular states change during development and differentiation [2, 26, 74] or in disease [75–77]. However, there are to date no truly longitudinal (developmental) studies of 3D chromatin conformation in single cells because 3C techniques are destructive to cells. Single-cell interrogations could prove useful to a posteriori recapitulate pseudo-developmental changes, or ‘trajectories’, in 3D chromosome conformation and thereby infer a pseudo-4D view of chromatin dynamics (Fig. 4). Support for this approach comes from developmental gene expression studies using single-cell RNA sequencing [78–80]. As expected from cell-to-cell heterogeneity within populations, single cells analyzed at any time-point in a time-series show variations in transcript levels [81–84]. At consecutive time-points, individual cells will be expected to show both similar and distinct transcript levels across cells. This information can be used to computationally reorder cells a posteriori in order to find a pseudotime trajectory; see especially the perspective by Trapnell [52] for an excellent assessment of this approach.

A similar strategy could conceivably be used to reorder cells in a pseudotime trajectory based on single-cell Hi-C data (Fig. 4a, b). Instead of gene expression data for each cell, information can be extracted from single-cell Hi-C contact maps to construct a matrix consisting of distances between pairs of bins in each single-cell genome. This would in principle allow an a posteriori reconstruction of a path of changes in chromatin structures between two consecutive time-points at which Hi-C data are actually collected (Fig. 4b). Such pseudotime reordering of chromatin structures from single-cell Hi-C contact maps might provide more accurate information on the specific trajectories that genome topology takes during development or differentiation.

Locus tracking in living cells by real-time imaging

Even though high-throughput sequencing-based methods do not currently enable true 4D studies of chromatin organization in single cells, emerging microscopy-based cell-imaging techniques provide opportunities. For example, locus tracking, relying on modifications of CRISPR/Cas9-mediated genome-editing approaches to tag fluorescent molecules to specific loci, enables the visualization of movements of loci in the nuclear space in living cells [85, 86]. Targeting of a nuclease-deficient dCas9 mutant fused to a fluorophore-encoding protein [e.g. enhanced green fluorescent protein (EGFP)] to a given locus with sequence-specific guide RNAs can be achieved for intergenic repeat regions (e.g., telomeres) or genes. This approach is non-disruptive and, pending that low-intensity fluorescence emission is used to avoid photobleaching, it can be applied to track changes in locus positioning, for example, in response to a stimulus. A current limitation of this approach, however, is the relatively low fluorescence intensity of the tags, making their detection above background at single loci difficult. This often requires the use of several guide RNAs to target sufficient dCas9–EGFP molecules (or other fusions of dCas9 with fluorescent proteins) to the locus of interest [85, 86]. Continuous development of increasingly more-potent fluorophores will probably remedy this issue. Multi-color tagging of several loci simultaneously [87] in combination with super-resolution live-cell microscopy could also enable visualization of interactions between genomic regions in real time. Dynamic interactions can in turn be correlated with gene expression patterns [88]. Furthermore, new strategies for real-time simultaneous observation of gene localization and expression are emerging [89], providing some functionality to spatial locus positioning and chromosomal interactions. In addition, analysis of epigenetic states over time has been reported with the advent of live-cell imaging techniques for monitoring DNA methylation changes using a reporter-based system [90]. Although currently low throughput, these live-cell techniques do enable real-time investigations of chromatin conformation.

Conclusions

Advancements in wet-lab high-throughput genomics and computational methods in the past 15 years have taken our understanding of the genome to a whole new level by allowing genome-wide assessments of chromatin conformation in the 3D space. Single-cell high-throughput genomics is still in its infancy and most computational techniques are developed for ensemble-cell Hi-C data. Variation is, however, an inherent property of genome structures in a cell population. We have highlighted in this review challenges in the interpretation of Hi-C data arising from this variation. As a result, a number of computational methods have been proposed to take on board this heterogeneity. Consensus methods of modeling chromatin in 3D poorly address structural heterogeneity because they produce a single chromatin structure based on data obtained from millions of cells. By contrast, deconvolution techniques, despite being computationally more demanding, enable inference of the main substructures that exist in an ensemble of cells.

To disentangle the heterogeneity in cell populations, it will be essential to seek improvements in the efficiency of laboratory methods and in the streamlining of computational techniques to explore chromatin dynamics from down-sized cell populations or from single cells. When studying chromatin dynamics in subpopulations, structures from single-cell Hi-C datasets provide more information on structural variance than deconvoluted structures from ensembles of cells, pending that hundreds of single-cell structures are determined. In time-course studies, laboratory and financial resources might rather be used to examine more time-points, albeit from down-sized cell populations. This, however, requires the development of appropriate deconvolution methods to assess the main subpopulations and their chromatin dynamics.

Arguably, the most significant weakness in single-cell Hi-C analyses to date is sparsity of the values in the distance matrix. We have shown that this data sparsity does not necessarily prevent structure modeling [65]. Yet the question remains of how to distinguish significant interactions from mere random (albeit cross-linkable) contacts in single cells. Deep sequencing of single-cell Hi-C ligation products will be necessary to enable the reconstruction of 3D chromatin conformations with high confidence.

By analogy to single-cell gene expression and pseudo-developmental trajectory reconstitutions in developmental studies, another advantage of single-cell high-throughput chromosome conformation queries is the foreseen ability to re-order single-cell structures to infer a developmental path of changes in chromatin conformation—for example, as cells go through the cell cycle or differentiate. This is expected to constitute an important step in our understanding of the spatial dynamics of the 4D nucleome. Furthermore, with the emergence of additional single-cell technologies that allow interrogations of the epigenome [40, 41, 43], chromatin accessibility [38, 39] or associations of loci with the nuclear periphery [36] (Table 1), we foresee the emergence of more-sophisticated (and arguably more accurate) models of genome architecture. Finally, fluorescent tagging of multiple loci simultaneously in single cells, even though this approach is currently not high-throughput in terms of the number of identifiable loci that can be examined, unveils possibilities to interrogate the dynamics of relative positioning of loci in real time.

We can look forward to exciting developments in the combination of high-throughput sequencing-based techniques and imaging methodologies to interrogate the functional significance of chromatin folding in space and real-time in single cells. Efficient methods to estimate heterogeneity within a given cell population and to enable integration of several types of single-cell ‘omics’ data will aid in developing improved models of genome conformation at various scales.

Abbreviations

3C:: chromosome conformation capture
3D:: three-dimensional
ATAC:: assay for transposase-accessible chromatin
ChIP:: chromatin immunoprecipitation
DamID:: Dam identification
EGFP:: enhanced green fluorescent protein
TAD:: topologically associated domain

References

Pombo A, Dillon N. Three-dimensional genome architecture: players and mechanisms. Nat Rev Mol Cell Biol. 2015;16:245–57.
Article CAS PubMed Google Scholar
Ay F, Bunnik EM, Varoquaux N, Bol SM, Prudhomme J, Vert J-P, et al. Three-dimensional modeling of the P. falciparum genome during the erythrocytic cycle reveals a strong connection between genome architecture and gene expression. Genome Res. 2014;24:974–88.
Article CAS PubMed PubMed Central Google Scholar
Gorkin DU, Leung D, Ren B. The 3D genome in transcriptional regulation and pluripotency. Cell Stem Cell. 2014;14:762–75.
Article CAS PubMed PubMed Central Google Scholar
Bickmore WA, van Steensel B. Genome architecture: domain organization of interphase chromosomes. Cell. 2013;152:1270–84.
Article CAS PubMed Google Scholar
Bouwman BAM, de Laat W. Getting the genome in shape: the formation of loops, domains and compartments. Genome Biol. 2015;16:154.
Article PubMed PubMed Central Google Scholar
Rao SSP, Huntley MH, Durand NC, Stamenova EK, Bochkov ID, Robinson JT, et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014;159:1665–80.
Article CAS PubMed Google Scholar
Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012;485:376–80.
Article CAS PubMed PubMed Central Google Scholar
Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science. 2009;326:289–93.
Article CAS PubMed PubMed Central Google Scholar
Belton J-M, McCord RP, Gibcus JH, Naumova N, Zhan Y, Dekker J. Hi-C: a comprehensive technique to capture the conformation of genomes. Methods. 2012;58:268–76.
Article CAS PubMed Google Scholar
Lajoie BR, Dekker J, Kaplan N. The Hitchhiker’s guide to Hi-C analysis: practical guidelines. Methods. 2015;72:65–75.
Article CAS PubMed PubMed Central Google Scholar
Ay F, Noble WS. Analysis methods for studying the 3D architecture of the genome. Genome Biol. 2015;16:183.
Article PubMed PubMed Central Google Scholar
Guelen L, Pagie L, Brasset E, Meuleman W, Faza MB, Talhout W, et al. Domain organization of human chromosomes revealed by mapping of nuclear lamina interactions. Nature. 2008;453:948–51.
Article CAS PubMed Google Scholar
Casolari JM, Brown CR, Komili S, West J, Hieronymus H, Silver PA. Genome-wide localization of the nuclear transport machinery couples transcriptional status and nuclear organization. Cell. 2004;117:427–39.
Article CAS PubMed Google Scholar
Pascual-Garcia P, Capelson M. Nuclear pores as versatile platforms for gene regulation. Curr Opin Genet Dev. 2014;25:110–7.
Article CAS PubMed Google Scholar
Németh A, Längst G. Genome organization in and around the nucleolus. Trends Genet. 2011;27:149–56.
Article PubMed Google Scholar
Németh A, Conesa A, Santoyo-Lopez J, Medina I, Montaner D, Péterfia B, et al. Initial genomics of the human nucleolus. PLoS Genet. 2010;6:e1000889.
Article PubMed PubMed Central Google Scholar
Duan Z, Andronescu M, Schutz K, McIlwain S, Kim YJ, Lee C, et al. A three-dimensional model of the yeast genome. Nature. 2010;465:363–7.
Article CAS PubMed PubMed Central Google Scholar
Gehlen LR, Gruenert G, Jones MB, Rodley CD, Langowski J, O’Sullivan JM. Chromosome positioning and the clustering of functionally related loci in yeast is driven by chromosomal interactions. Nucleus. 2012;3:370–83.
Article PubMed Google Scholar
Hu M, Deng K, Qin Z, Dixon J, Selvaraj S, Fang J, et al. Bayesian inference of spatial organizations of chromosomes. PLoS Comput Biol. 2013;9:e1002893.
Article CAS PubMed PubMed Central Google Scholar
Zhang Z, Li G, Toh K-C, Sung W-K. 3D chromosome modeling with semi-definite programming and Hi-C data. J Comput Biol. 2013;20:831–46.
Article CAS PubMed Google Scholar
Peng C, Fu L-Y, Dong P-F, Deng Z-L, Li J-X, Wang X-T, et al. The sequencing bias relaxed characteristics of Hi-C derived data and implications for chromatin 3D modeling. Nucleic Acids Res. 2013;41:e183.
Article CAS PubMed PubMed Central Google Scholar
Varoquaux N, Ay F, Noble WS, Vert J-P. A statistical approach for inferring the 3D structure of the genome. Bioinformatics. 2014;30:26–33.
Article Google Scholar
Lesne A, Riposo J, Roger P, Cournac A, Mozziconacci J. 3D genome reconstruction from chromosomal contacts. Nat Methods. 2014;11:1141–3.
Article CAS PubMed Google Scholar
Nora EP, Lajoie BR, Schulz EG, Giorgetti L, Okamoto I, Servant N, et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature. 2012;485:381–5.
Article CAS PubMed PubMed Central Google Scholar
Sexton T, Yaffe E, Kenigsberg E, Bantignies F, Leblanc B, Hoichman M, et al. Three-dimensional folding and functional organization principles of the Drosophila genome. Cell. 2012;148:458–72.
Article CAS PubMed Google Scholar
Dixon JR, Jung I, Selvaraj S, Shen Y, Antosiewicz-Bourget JE, Lee AY, et al. Chromatin architecture reorganization during stem cell differentiation. Nature. 2015;518:331–6.
Article CAS PubMed PubMed Central Google Scholar
Gasser SM. Visualizing chromatin dynamics in interphase nuclei. Science. 2002;296:1412–6.
Article CAS PubMed Google Scholar
Voss TC, Hager GL. Visualizing chromatin dynamics in intact cells. Biochim Biophys Acta. 2008;1783:2044–51.
Article CAS PubMed Google Scholar
Noordermeer D, de Wit E, Klous P, van de Werken H, Simonis M, Lopez-Jones M, et al. Variegated gene expression caused by cell-specific long-range DNA interactions. Nat Cell Biol. 2011;13:944–51.
Article CAS PubMed PubMed Central Google Scholar
Wang X-T, Dong P-F, Zhang H-Y, Peng C. Structural heterogeneity and functional diversity of topologically associating domains in mammalian genomes. Nucleic Acids Res. 2015;43:7237–46.
Article CAS PubMed PubMed Central Google Scholar
Noordermeer D, Leleu M, Schorderet P, Joye E, Chabaud F, Duboule D. Temporal dynamics and developmental memory of 3D chromatin architecture at Hox gene loci. Elife. 2014;3:e02557.
Article PubMed PubMed Central Google Scholar
Brown CR, Boeger H. Nucleosomal promoter variation generates gene expression noise. Proc Natl Acad Sci U S A. 2014;111:17893–8.
Article CAS PubMed PubMed Central Google Scholar
Landan G, Cohen NM, Mukamel Z, Bar A, Molchadsky A, Brosh R, et al. Epigenetic polymorphism and the stochastic formation of differentially methylated regions in normal and cancerous tissues. Nat Genet. 2012;44:1207–14.
Article CAS PubMed Google Scholar
Paulsen J, Gramstad O, Collas P. Manifold based optimization for single-cell 3D genome reconstruction. PLoS Comput Biol. 2015;11:e1004396.
Article PubMed PubMed Central Google Scholar
Nagano T, Lubling Y, Stevens TJ, Schoenfelder S, Yaffe E, Dean W, et al. Single-cell Hi-C reveals cell-to-cell variability in chromosome structure. Nature. 2013;502:59–64.
Article CAS PubMed Google Scholar
Kind J, Pagie L, de Vries SS, Nahidiazar L, Dey SS, Bienko M, et al. Genome-wide maps of nuclear lamina interactions in single human cells. Cell. 2015;163:134–47.
Article CAS PubMed Google Scholar
Pott S, Lieb JD. Single-cell ATAC-seq: strength in numbers. Genome Biol. 2015;16:172.
Article PubMed PubMed Central Google Scholar
Buenrostro JD, Wu B, Litzenburger UM, Ruff D, Gonzales ML, Snyder MP, et al. Single-cell chromatin accessibility reveals principles of regulatory variation. Nature. 2015;523:486–90.
Article CAS PubMed PubMed Central Google Scholar
Cusanovich DA, Daza R, Adey A, Pliner HA, Christiansen L, Gunderson KL, et al. Multiplex single-cell profiling of chromatin accessibility by combinatorial cellular indexing. Science. 2015;348:910–4.
Article CAS PubMed Google Scholar
Rotem A, Ram O, Shoresh N, Sperling RA, Goren A, Weitz DA, et al. Single-cell ChIP-seq reveals cell subpopulations defined by chromatin state. Nat Biotechnol. 2015;33:1165–72.
Article CAS PubMed Google Scholar
Farlik M, Sheffield NC, Nuzzo A, Datlinger P, Schönegger A, Klughammer J, et al. Single-cell DNA methylome sequencing and bioinformatic inference of epigenomic cell-state dynamics. Cell Rep. 2015;10:1386–97.
Article CAS PubMed PubMed Central Google Scholar
Guo H, Zhu P, Wu X, Li X, Wen L, Tang F. Single-cell methylome landscapes of mouse embryonic stem cells and early embryos analyzed using reduced representation bisulfite sequencing. Genome Res. 2013;23:2126–35.
Article CAS PubMed PubMed Central Google Scholar
Smallwood SA, Lee HJ, Angermueller C, Krueger F, Saadeh H, Peat J, et al. Single-cell genome-wide bisulfite sequencing for assessing epigenetic heterogeneity. Nat Methods. 2014;11:817–20.
Article CAS PubMed PubMed Central Google Scholar
Guo H, Zhu P, Guo F, Li X, Wu X, Fan X, et al. Profiling DNA methylome landscapes of mammalian cells with single-cell reduced-representation bisulfite sequencing. Nat Protoc. 2015;10:645–59.
Article CAS PubMed Google Scholar
Macosko EZ, Basu A, Satija R, Nemesh J, Shekhar K, Goldman M, et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell. 2015;161:1202–14.
Article CAS PubMed Google Scholar
Tang F, Barbacioru C, Wang Y, Nordman E, Lee C, Xu N, et al. mRNA-Seq whole-transcriptome analysis of a single cell. Nat Methods. 2009;6:377–82.
Article CAS PubMed Google Scholar
Caudai C, Salerno E, Zoppè M, Tonazzini A. Inferring 3D chromatin structure using a multiscale approach based on quaternions. BMC Bioinformatics. 2015;16:234.
Article PubMed PubMed Central Google Scholar
Shavit Y, Merelli I, Milanesi L, Lio’ P. How computer science can help in understanding the 3D genome architecture. Brief Bioinform. 2015; Oct 3 pii:bbv085.
Rosa A, Zimmer C. Computational models of large-scale genome architecture. Int Rev Cell Mol Biol. 2014;307:275–349.
Article CAS PubMed Google Scholar
Hu M, Deng K, Qin Z, Liu JS. Understanding spatial organizations of chromosomes via statistical analysis of Hi-C data. Quant Biol. 2013;1:156–74.
Article PubMed PubMed Central Google Scholar
O’Sullivan JM, Hendy MD, Pichugina T, Wake GC, Langowski J. The statistical-mechanics of chromosome conformation capture. Nucleus. 2013;4:390–8.
Article PubMed PubMed Central Google Scholar
Trapnell C. Defining cell types and states with single-cell genomics. Genome Res. 2015;25:1491–8.
Article CAS PubMed PubMed Central Google Scholar
Giorgetti L, Galupa R, Nora EP, Piolot T, Lam F, Dekker J, et al. Predictive polymer modeling reveals coupled fluctuations in chromosome conformation and transcription. Cell. 2014;157:950–63.
Article CAS PubMed PubMed Central Google Scholar
Kalhor R, Tjong H, Jayathilaka N, Alber F, Chen L. Genome architectures revealed by tethered chromosome conformation capture and population-based modeling. Nat Biotechnol. 2012;30:90–8.
Article CAS Google Scholar
Baù D, Marti-Renom MA. Genome structure determination via 3C-based data integration by the Integrative Modeling Platform. Methods. 2012;58:300–6.
Article PubMed Google Scholar
Meluzzi D, Arya G. Recovering ensembles of chromatin conformations from contact probabilities. Nucleic Acids Res. 2013;41:63–75.
Article CAS PubMed PubMed Central Google Scholar
Sefer E, Duggal G, Kingsford C. Deconvolution of ensemble chromatin interaction data reveals the latent mixing structures in cell subpopulations. In: Research in Computational Molecular Biology. Switzerland: Springer International Publishing; 2015. p. 293–308.
Marti-Renom MA, Mirny LA. Bridging the resolution gap in structural modeling of 3D genome organization. PLoS Comput Biol. 2011;7:e1002125.
Article CAS PubMed PubMed Central Google Scholar
Baù D, Sanyal A, Lajoie BR, Capriotti E, Byron M, Lawrence JB, et al. The three-dimensional folding of the α-globin gene domain reveals formation of chromatin globules. Nat Struct Mol Biol. 2011;18:107–14.
Article PubMed PubMed Central Google Scholar
Naumova N, Imakaev M, Fudenberg G, Zhan Y, Lajoie BR, Mirny LA, et al. Organization of the mitotic chromosome. Science. 2013;342:948–53.
Article CAS PubMed PubMed Central Google Scholar
Junier I, Spill YG, Marti-Renom MA, Beato M, le Dily F. On the demultiplexing of chromosome capture conformation data. FEBS Lett. 2015;589:3009–13.
Article Google Scholar
Ma W, Ay F, Lee C, Gulsoy G, Deng X, Cook S, et al. Fine-scale chromatin interaction maps reveal the cis-regulatory landscape of human lincRNA genes. Nat Methods. 2015;12:71–8.
Article PubMed PubMed Central Google Scholar
Gnirke A, Melnikov A, Maguire J, Rogov P, LeProust EM, Brockman W, et al. Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing. Nat Biotechnol. 2009;27:182–9.
Article CAS PubMed PubMed Central Google Scholar
Dryden NH, Broome LR, Dudbridge F, Johnson N, Orr N, Schoenfelder S, et al. Unbiased analysis of potential targets of breast cancer susceptibility loci by Capture Hi-C. Genome Res. 2014;24:1854–68.
Article CAS PubMed PubMed Central Google Scholar
Selvaraj S, R Dixon J, Bansal V, Ren B. Whole-genome haplotype reconstruction using proximity-ligation and shotgun sequencing. Nat Biotechnol. 2013;31:1111–8.
Article CAS PubMed PubMed Central Google Scholar
Tang Z, Luo OJ, Li X, Zheng M, Zhu JJ, Szalaj P, Trzaskoma P et al. CTCF-mediated human 3D genome architecture reveals chromatin topology for transcription. Cell. 2015;163:1611–27.
Ay F, Vu TH, Zeitz MJ, Varoquaux N, Carette JE, Vert J-P, et al. Identifying multi-locus chromatin contacts in human cells using tethered multiple 3C. BMC Genomics. 2015;16:121.
Article PubMed PubMed Central Google Scholar
Cairns J, Freire-Pritchett P, Wingett SW, Dimond A, Plagnol V, Zerbino D, et al. CHiCAGO. Robust detection of DNA looping interactions in capture Hi-C data. bioRxiv. Cold Spring Harb Labs J doi. 2015;10:028068.
Google Scholar
Ay F, Bailey TL, Noble WS. Statistical confidence estimation for Hi-C data reveals regulatory chromatin contacts. Genome Res. 2014;24:999–1011.
Article CAS PubMed PubMed Central Google Scholar
Paulsen J, Rødland EA, Holden L, Holden M, Hovig E. A statistical model of ChIA-PET data for accurate detection of chromatin 3D interactions. Nucleic Acids Res. 2014;42:e143.
Article PubMed PubMed Central Google Scholar
Phanstiel DH, Boyle AP, Heidari N, Snyder MP. Mango: a bias-correcting ChIA-PET analysis pipeline. Bioinformatics. 2015;31:3092–8.
Article PubMed Google Scholar
Witten DM, Noble WS. On the assessment of statistical significance of three-dimensional colocalization of sets of genomic elements. Nucleic Acids Res. 2012;40:3849–55.
Article CAS PubMed PubMed Central Google Scholar
Paulsen J, Lien TG, Sandve GK, Holden L, Borgan O, Glad IK, et al. Handling realistic assumptions in hypothesis testing of 3D co-localization of genomic elements. Nucleic Acids Res. 2013;41:5164–74.
Article CAS PubMed PubMed Central Google Scholar
Chen H, Chen J, Muir LA, Ronquist S, Meixner W, Ljungman M, et al. Functional organization of the human 4D Nucleome. Proc Natl Acad Sci U S A. 2015;112:8002–7.
Article CAS PubMed PubMed Central Google Scholar
Sulli G, Di Micco R, d’Adda di Fagagna F. Crosstalk between chromatin state and DNA damage response in cellular senescence and cancer. Nat Rev Cancer. 2012;12:709–20.
Article CAS PubMed Google Scholar
Babu D, Fullwood MJ. 3D genome organization in health and disease: emerging opportunities in cancer translational medicine. Nucleus. 2015;6:382–93.
Article PubMed Google Scholar
Misteli T. Higher-order genome organization in human disease. Cold Spring Harb Perspect Biol. 2010;2:a000794.
Article PubMed PubMed Central Google Scholar
Buettner F, Natarajan KN, Casale FP, Proserpio V, Scialdone A, Theis FJ, et al. Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells. Nat Biotechnol. 2015;33:155–60.
Article CAS PubMed Google Scholar
Leng N, Chu L-F, Barry C, Li Y, Choi J, Li X, et al. Oscope identifies oscillatory genes in unsynchronized single-cell RNA-seq experiments. Nat Methods. 2015;12:947–50.
Article CAS PubMed Google Scholar
Trapnell C, Cacchiarelli D, Grimsby J, Pokharel P, Li S, Morse M, Lennon NJ, Livak KJ et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat Biotechnol. 2014;32:381–6.
Hager GL, McNally JG, Misteli T. Transcription dynamics. Mol Cell. 2009;35:741–53.
Article CAS PubMed Google Scholar
Voss TC, Schiltz RL, Sung M-H, Yen PM, Stamatoyannopoulos JA, Biddie SC, et al. Dynamic exchange at regulatory elements during chromatin remodeling underlies assisted loading mechanism. Cell. 2011;146:544–54.
Article CAS PubMed PubMed Central Google Scholar
Lee M-CW, Lopez-Diaz FJ, Khan SY, Tariq MA, Dayn Y, Vaske CJ, et al. Single-cell analyses of transcriptional heterogeneity during drug tolerance transition in cancer cells by RNA sequencing. Proc Natl Acad Sci U S A. 2014;111:4726–35.
Article Google Scholar
Pollen AA, Nowakowski TJ, Shuga J, Wang X, Leyrat AA, Lui JH, et al. Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex. Nat Biotechnol. 2014;32:1053–8.
Article CAS PubMed PubMed Central Google Scholar
Chen B, Gilbert LA, Cimini BA, Schnitzbauer J, Zhang W, Li G-W, et al. Dynamic imaging of genomic loci in living human cells by an optimized CRISPR/Cas system. Cell. 2013;155:1479–91.
Article CAS PubMed PubMed Central Google Scholar
Anton T, Bultmann S, Leonhardt H, Markaki Y. Visualization of specific DNA sequences in living mouse embryonic stem cells with a programmable fluorescent CRISPR/Cas system. Nucleus. 2014;5:163–72.
Article PubMed PubMed Central Google Scholar
Ma H, Naseri A, Reyes-Gutierrez P, Wolfe SA, Zhang S, Pederson T. Multicolor CRISPR labeling of chromosomal loci in human cells. Proc Natl Acad Sci U S A. 2015;112:3002–7.
Article CAS PubMed PubMed Central Google Scholar
Fabre PJ, Benke A, Joye E, Nguyen Huynh TH, Manley S, Duboule D. Nanoscale spatial organization of the HoxD gene cluster in distinct transcriptional states. Proc Natl Acad Sci U S A. 2015;112:13964–9.
Article CAS PubMed PubMed Central Google Scholar
Ochiai H, Sugawara T, Yamamoto T. Simultaneous live imaging of the transcription and nuclear position of specific genes. Nucleic Acids Res. 2015;43:e127.
Article PubMed PubMed Central Google Scholar
Stelzer Y, Jaenisch R. Monitoring dynamics of DNA methylation at single-cell resolution during development and disease. Cold Spring Harb Symp Quant Biol. 2015; Oct 2 pii:027334.
Lee JH, Daugharthy ER, Scheiman J, Kalhor R, Yang JL, Ferrante TC, et al. Highly multiplexed subcellular RNA sequencing in situ. Science. 2014;343:1360–3.
Article CAS PubMed PubMed Central Google Scholar
Lee JH, Daugharthy ER, Scheiman J, Kalhor R, Ferrante TC, Terry R, et al. Fluorescent in situ sequencing (FISSEQ) of RNA for gene expression profiling in intact cells and tissues. Nat Protoc. 2015;10:442–58.
Article CAS PubMed PubMed Central Google Scholar
Imakaev M, Fudenberg G, McCord RP, Naumova N, Goloborodko A, Lajoie BR, et al. Iterative correction of Hi-C data reveals hallmarks of chromosome organization. Nat Methods. 2012;9:999–1003.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgments

The authors thank members of the Collas lab for fruitful discussions. The authors are supported by the Research Council of Norway (JP, PC), and by the Molecular Life Science Program of the University of Oslo (MS) and the Medical Faculty of the University of Oslo (PC).

Author information

Authors and Affiliations

Department of Molecular Medicine, Faculty of Medicine, University of Oslo, PO Box 1112, Blindern, 0317, Oslo, Norway
Monika Sekelja, Jonas Paulsen & Philippe Collas

Authors

Monika Sekelja
View author publications
You can also search for this author in PubMed Google Scholar
Jonas Paulsen
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Collas
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Philippe Collas.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

MS, JP and PC conceptualized and wrote the article. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Sekelja, M., Paulsen, J. & Collas, P. 4D nucleomes in single cells: what can computational modeling reveal about spatial chromatin conformation?. Genome Biol 17, 54 (2016). https://doi.org/10.1186/s13059-016-0923-2

Download citation

Published: 07 April 2016
DOI: https://doi.org/10.1186/s13059-016-0923-2

4D nucleomes in single cells: what can computational modeling reveal about spatial chromatin conformation?

Abstract

Background

Assessing genome conformation in cell populations

Consensus chromosome structures inferred from population-average data

Deconvolution methods identify hidden substructures

Inference of 3D chromatin structure from single-cell Hi-C contact maps

Sparseness of single-cell Hi-C contact maps can hamper high-confidence 3D structure reconstruction

Autosomal chromosomes further complicate the picture

Distinguishing biological variation from technical noise

Towards the 4D nucleome

Inferring pseudotime 3D conformational trajectories from cross-sectional data

Locus tracking in living cells by real-time imaging

Conclusions

Abbreviations

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Rights and permissions

About this article

Cite this article

Keywords

Genome Biology

Contact us

4D nucleomes in single cells: what can computational modeling reveal about spatial chromatin conformation?

Abstract

Background

Assessing genome conformation in cell populations

Consensus chromosome structures inferred from population-average data

Deconvolution methods identify hidden substructures

Inference of 3D chromatin structure from single-cell Hi-C contact maps

Sparseness of single-cell Hi-C contact maps can hamper high-confidence 3D structure reconstruction

Autosomal chromosomes further complicate the picture

Distinguishing biological variation from technical noise

Towards the 4D nucleome

Inferring pseudotime 3D conformational trajectories from cross-sectional data

Locus tracking in living cells by real-time imaging

Conclusions

Abbreviations

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Genome Biology

Contact us