- Open Access
Understanding tumor ecosystems by single-cell sequencing: promises and limitations
Genome Biologyvolume 19, Article number: 211 (2018)
Cellular heterogeneity within and across tumors has been a major obstacle in understanding and treating cancer, and the complex heterogeneity is masked if bulk tumor tissues are used for analysis. The advent of rapidly developing single-cell sequencing technologies, which include methods related to single-cell genome, epigenome, transcriptome, and multi-omics sequencing, have been applied to cancer research and led to exciting new findings in the fields of cancer evolution, metastasis, resistance to therapy, and tumor microenvironment. In this review, we discuss recent advances and limitations of these new technologies and their potential applications in cancer studies.
A single cell is the ultimate unit of life activity, in which genetic mechanisms and the cellular environment interplay with each other and shape the formation and function of such complex structures as tissues and organs. Dissecting the composition and characterizing the interaction, dynamics, and function at the single-cell resolution are crucial for fully understanding the biology of almost all life phenomena, under both normal and diseased conditions. Cancer, a disease caused by somatic mutations conferring uncontrolled proliferation and invasiveness, could in particular benefit from advances in single-cell analysis. During oncogenesis, different populations of cancer cells that are genetically heterogeneous emerge, evolve, and interact with cells in the tumor microenvironment, which leads to host metabolism hijacking, immune evasion, metastasis to other body parts, and eventual mortality. Cancer cells can also manifest resistance to various therapeutic drugs through cellular heterogeneity and plasticity. Cancer is increasingly viewed as a ‘tumor ecosystem’, a community in which tumor cells cooperate with other tumor cells and host cells in their microenvironment, and can also adapt and evolve to changing conditions [1,2,3,4,5].
Detailed understanding of tumor ecosystems at single-cell resolution has been limited for technological reasons. Conventional genomic, transcriptomic, and epigenomic sequencing protocols require microgram-level input materials, and so cancer-related genomic studies were largely limited to bulk tumor sequencing, which does not address intratumor heterogeneity and complexity. The advent of single-cell sequencing technologies [6,7,8] has shifted cancer research to a new paradigm and revolutionized our understanding of cancer evolution [7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22], tumor heterogeneity [23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46], and the tumor microenvironment [47,48,49,50,51,52,53,54,55,56,57,58,59]. Development of single-cell sequencing technologies and the applications in cancer research have been astonishing in the past decade, but many challenges still exist and much remains to be explored. Single-cell cancer genomic studies have been reviewed previously [60,61,62,63]. In this review, we summarize recent progress and limitations in cancer sample single-cell sequencing with a focus on the dissection of tumor ecosystems.
Overview of single-cell sequencing and analysis
Single-cell sequencing technologies have improved considerably from the initial proof-of-principle studies [6,7,8]. Modification of the underlying molecular biology and chemistry of single-cell library preparation has provided diverse approaches to obtain and amplify single-cell nucleic acids for subsequent high-throughput sequencing [64,65,66,67,68,69,70,71,72] (Fig. 1). Because an individual cancer cell typically contains only ∼6–12 pg of DNA and 10–50 pg of total RNA (depending on the cell types and status) , amplification is essential for single-cell library preparation to fulfill the sequencing input requirements, although both false positive and false negative errors may arise in the process . Single-cell DNA and RNA sequencing, epigenomic sequencing [68, 70, 72, 75], and simultaneous sequencing of the genome, transcriptome, epigenome, and epitopes of the same single cell [32, 35, 76,77,78,79,80] are all now possible, and can facilitate exploration of the connection between cellular genotypes to phenotypes. Furthermore, the throughput of single-cell sequencing technologies has improved vastly, with some methods allowing simultaneous sequencing of tens of thousands of single cells in one run [81,82,83,84,85,86]. Methods that couple additional experimental techniques with single-cell sequencing technologies are also gaining traction [21, 87,88,89,90,91], to provide a more integrated analysis of single cells.
Accompanying the tremendous progress of experimental single-cell sequencing technologies, specialized bioinformatics and algorithmic approaches have also been developed to best interpret the single-cell data while reducing their technological noise. Examples of these approaches include the imputation of dropout events [92,93,94,95], normalization and correction of batch effects [96,97,98,99,100], clustering for identification of cell types [98, 101,102,103,104,105,106,107,108], pseudo-temporal trajectory inference [109,110,111,112], spatial position inference [87, 88, 90], and data visualization [102, 113,114,115]. Progress in this area requires the application of statistics, probability theory, and computing technologies, which lead to new algorithms, software packages, databases, and web servers. Detailed information of specific single-cell technologies and the underlying principles of the algorithms have been elegantly discussed in other reviews [61, 64,65,66,67,68,69,70, 72, 116,117,118,119,120,121,122,123]. This myriad of experimental and computational methods is becoming the new foundation for uncovering the mystery of cancer complexity at the single-cell resolution.
Despite the dramatic advances, substantial limitations and challenges still exist in single-cell sequencing. The first challenge lies in the technological noise introduced during the amplification step. Notable allelic dropouts (i.e., amplification and sequencing of only one allele of a particular gene in a diploid/multiploid cell) and non-uniform genome coverage hinder the accurate detection of single nucleotide variants (SNVs) at the genome or exome level. These problems can be partially alleviated by the LIANTI (linear amplification via transposon insertion) method , which implements a linear genomic amplification by bacterial transposons and reportedly reaches improvements in genome coverage (~ 97%), allelic dropout rates (< 0.19) and false negative rates (< 0.47). Similarly, in single-cell RNA sequencing (scRNA-seq), lowly expressed genes are prone to dropout and susceptible to technological noise even when detected, although they often encode proteins with important regulatory or signaling functions. These technological issues are more profound for scRNA-seq technologies designed to offer higher throughput [81, 84]. Although many computational methods are available to model or impute dropout events [92, 94, 95], their performances vary and may introduce artificial biases. Much effort is needed to fully address this challenge.
The second challenge is that only a small fraction of cells from bulk tissues can be sequenced. Bulk tissues consist of millions of cells, but present studies can often only sequence hundreds to thousands of single cells because of technological and economic limitations [9,10,11, 20, 25, 124,125,126]. To what extent the sequenced cells represent the distribution of cells in the entire tissue of interest is not clear. A plausible solution to address this challenge would be to further improve the throughput of cellular captures, e.g., MARS-seq  and SPLiT-seq , or alternatively to combine bulk and single-cell sequencing together and then conduct deconvolution analysis . Deconvolution analysis for bulk RNA-seq data uses cell-type signature genes as inputs [128,129,130], which can be substituted by single-cell sequencing results, although critical computational challenges still exist, such as collinearity among single cells. If marker genes for known cell types are orthogonal to each other, the proportions of each cell type in a bulk sample can be reliably estimated. However, collinearity of gene expression exists widely among single cells, which complicates the deconvolution process. At present, successful deconvolution of bulk RNA-seq data based on scRNA-seq-defined signatures has been reported only in cases where orthogonal molecular signatures and fine cluster structures are well balanced . The wide usage of scRNA-seq based deconvolution will hinge upon the availability of comprehensive single-cell clusters and the development of general methods for selecting orthogonal signatures for each cell type.
Spatial information of single cells in the tissue is often lost during the isolation step and thus single-cell sequencing data typically do not show how cells are organized to implement the concerted function within a tissue of interest. Many new techniques have been developed to keep or restore the spatial information of sequenced single cells such as fluorescence in situ hybridization (FISH), single-molecule fluorescence in situ hybridization (smFISH), laser capture microdissection, laser scanning microscopy, including two-photon laser scanning microscopy, and fluorescence in situ sequencing [21, 30, 87,88,89,90,91, 132,133,134,135,136,137,138,139,140,141,142,143]. However, at present all of these techniques have inherent limitations and only apply to specific spatial architecture. For example, while FISH-based technologies can map the spatial distribution of a set of selected genes upon which the spatial information of single cells subject to RNA-seq can be reconstructed via probabilistic inference, the methods are limited to two dimensions and the inference is primarily dependent on the availability of marker genes that can properly discriminate the spatial characteristics with sufficient resolutions. Other conditions for valid marker genes include accurate and robust estimation of their expression levels, but this requirement can be greatly compromised by inherent dropout in scRNA-seq protocols. Accurate restoration of single cell spatial positions via FISH-based inference also requires replicable tissues for parallel FISH and scRNA-seq, which can be only approximately fulfilled on model organisms. For human cancers, however, such requirements usually cannot be met and spatial-recording methods have thus been proposed. With laser capture microdissection, single cells are obtained simultaneously when their spatial information is recorded. However, the cellular throughput of such methods is extremely limited due to operation difficulties, and the biological interpretation of the recorded spatial positions are confined because adjacent cells cannot be properly dissected for scRNA-seq, whereas sequenced cells are often distantly distributed. Low molecular throughput is also problematic with these recently developed in situ sequencing methods. Typically, only tens or hundreds of known genes can be in situ labeled or sequenced, far from the requirement of fully understanding the molecular landscapes of single cells of interest. Furthermore, the replicability of such complicated experiments also imposes barriers for their practical applications to human samples.
Because single-cell sequencing captures individual cells at a particular time point, other factors such as cell cycle and functional state must be considered. By contrast, these factors are often ignored in bulk sequencing due to the average effect. Cell cycle phases can be discerned by phase-specific expression analysis [144,145,146], but cell types and cell states can be hard to distinguish. Sometimes even cancer cells cannot be easily distinguished from normal cells, although inferred DNA copy numbers are often used for this purpose [22, 47, 51]. More robust methods are needed for cell type determination in silico.
Compared to traditional bulk sequencing technologies, which characterize samples via a gene-by-sample matrix, single-cell sequencing adds a cellular layer between genes and samples, which results in a gene-by-cell-by-sample data structure. Addition of the cellular dimension allows simultaneous characterization of samples at both the molecular and cellular level. However, bioinformatics and algorithmic methods for single-cell sequencing data analysis are generally developed for gene-by-cell data, which essentially have the same structure with the gene-by-sample matrices. Although methods exploiting the cellular dimension for phenotype classification have been proposed , tools sufficiently employing all the molecular, cellular, and sample information of the new data structure are still needed.
Given the maturation of single-cell sequencing technologies, especially scRNA-seq, the scale of datasets of one study soon increases from hundreds to tens of thousands and even millions of cells. For large programs, e.g., the Human Cell Atlas project , the volume of data demands more robust computer hardware and software. Although a few down-sampling or convolution-based methods have been proposed to manage large-scale scRNA-seq data for clustering and differential expression analysis [149,150,151], efficient and effective algorithms are of pressing need to circumvent these difficulties.
Complexity of tumor ecosystems
Cancer is known for its heterogeneity, at the inter- and intra-tumor levels . Within a tumor, different spatial sites have different composition of cancer cell clones (Fig. 2), which results in spatial heterogeneity . As cancer cells evolve, temporal variations also arise during the course of cancer genesis and progression, causing temporal heterogeneity . In addition to cancer cells, tumors are also infiltrated with stromal, immune, and other cell types. The diversity of these cells forms the basis of the heterogeneity of the tumor microenvironments [1, 4, 153]. The complex and dynamic nature of cancer heterogeneity within tumors is analogous to ecosystems. Thorough understanding of the composition, interactions, dynamics, and operating principles of tumor ecosystems is key to understanding cancer evolution and the emergence of drug resistance. Multi-region sampling coupled with bulk sequencing is a plausible approach to investigating intra-tumor heterogeneity on the genome scale [36, 154, 155]. However, although this approach reveals intra-tumor heterogeneity, it cannot directly dissect the cellular composition of tumors. Computational deconvolution techniques could help infer the cellular composition of tumors, but such analyses are limited to a few known cell types [128,129,130]. Single-cell sequencing represents a quantum technological leap, as it allows the most precise dissection of the complex architecture of tumors while capturing rare cell types. Here, we review recent progress on understanding tumor ecosystems using single-cell sequencing technologies (Table 1).
Decomposition of clonal and sub-clonal tumor structure
Early success of single-cell sequencing applications in cancer research came from the studies of clonal and sub-clonal structure of primary tumors. DNA-based single-cell sequencing has been applied to breast [7, 20, 21, 26, 156, 157], kidney , bladder , and colon tumors [39, 160, 161], glioblastoma , and hematological malignancies such as acute myeloid leukemia and acute lymphoblastic leukemia [11, 33, 163,164,165]. These studies demonstrated the existence of common mutations among different cancer cell clones in individual cancer patients, which provided evidence for the origin of common cancerous cells and subsequent clonal evolution. Meanwhile, the application of scRNA-seq in glioma [22, 51, 166] demonstrated that cell differentiation of neural stem cells also contributes to tumor heterogeneity, thus supporting a cancer stem cell model. Notably, a recent study of intra-tumor diversification of colorectal cancers  integrated single-cell technologies and tumor organoid culture to show that cancer cells had several times more somatic mutations than normal cells. The authors of this study also observed that most of the mutations occurred during the final dominant clonal expansion, contributed by mutational processes absent from normal controls. In addition to canonical mutations, transcriptomic alterations and DNA methylation were cell-autonomous, stable, and followed the phylogenetic tree of each cancer. The study by Roerink et al.  provided a paradigm of cancer evolution by characterizing clonal and sub-clonal tumor structures, and indicated the potential dynamics of cancer progression. These findings exemplify the unique power of single-cell sequencing to characterize the diversity of cancer cells, resulting in different evolutionary models between cancers. In particular, single-cell data challenged the cancer stem cell model by showing that continued proliferation and clonal expansion formed the majority of tumor cells. Furthermore, scRNA-seq data supported the cancer stem cell model by demonstrating the contribution of cell differentiation to tumor heterogeneity. Copy number alternations (CNAs) and point mutations of cancer cells were subject to different evolutionary modes, with the former preferring punctuated evolution and the latter preferring gradual accumulation. Outstanding disparities need to be resolved before consistent models of cancer genesis and evolution can be applied to a wide range of cancers. Studies with larger sample size and higher molecular and cellular resolution are needed to reconcile various cancer evolution models. Sequencing analysis of single-cell-derived organoids could provide a template for investigating cancer evolution, but this should be extended to larger samples and other cancer types.
Monitoring cancer progress through characterization of circulating tumor cells
Circulating tumor cells (CTCs) are extremely rare in blood (1 in 106), with only tens of cells captured from a typical blood draw . The application of bulk sequencing to such limited input material for genomic exploration is difficult, hindering the analysis of cancer cell migration via blood. Single-cell sequencing has transformed the ability to characterize CTCs and has been used to identify metastatic potential of CTCs in cancer metastasis models, to monitor abnormal signaling pathways for drug-resistance prediction. By characterizing mutation profiles of CTCs, their tissue sources can be matched to the positions of primary and metastatic tumors [13, 16, 24, 167, 168]. This type of analysis holds great potential in early cancer detection and real-time monitoring of disease progression with or without treatment. Furthermore, the origin and destination of CTCs could be further explored to reveal the dissemination conditions of specific tumors. The application of DNA-based single-cell sequencing to CTCs in colon cancer , melanoma , lung cancer , and prostate cancer [171, 172] revealed that the copy number profiles of CTCs are highly similar to primary and metastatic tumors but point mutation profiles show much greater variations, consistent with punctuated evolution of CNAs and gradual evolution of point mutations observed within tumors. A recent integrative analysis of colon, breast, gastric, and prostate cancers by single-cell DNA sequencing compared the mutation profiles between primary tumor cells and CTCs, and revealed convergent evolution of CNAs from primary cancer tissues to CTCs . Remarkably, CNAs affecting the oncogene MYC and the tumor suppressor gene PTEN were observed only in a minor proportion of primary tumor cells but were present in all CTCs spanning multiple cancer types. These observations suggest that the potential of primary tumor cells to transit into CTCs are quite uneven, or otherwise strong selection pressure exists upon CTCs during the metastasis process. To resolve the detailed molecular mechanisms involved in the generation of CTCs in primary tumors to colonization in metastasis sites, it will be important to temporally trace the variations of CTCs during cancer progression from primary tumors to metastasis in both a research and clinical setting. Furthermore, scRNA-seq has been used in the study of CTCs in melanoma , breast , pancreatic [126, 174], and prostate cancers , revealing specific transcriptional signatures of CTCs relative to their primary and metastatic tumors. Extracellular matrix proteins were specifically expressed by CTCs, and plakoglobin appeared to be a key regulator of CTC clusters with survival advantages distinct from individual CTCs. Furthermore, abnormal signaling pathways for drug resistance prediction can be monitored using scRNA-seq of CTCs, as illustrated by the Miyamoto et al. study , in which scRNA-Seq profiling of 77 CTCs from 13 prostate cancer patients revealed extensive heterogeneity of the androgen receptor gene at both expression and splicing levels. Activation of non-canonical Wnt signaling was observed in the retrospective study of CTCs from patients treated with an androgen receptor inhibitor, indicating the potential resistance to therapy. Despite enviable progress, CTC studies remain limited by difficulties in the detection and enrichment of CTCs from blood. How to effectively obtain insight into the generation, progress, metastasis, and response to therapies of the entire tumor through the characterization of CTCs is still an elusive question.
Interrogating the genesis and evolution of therapy resistance
Chemotherapy and targeted therapies have been important weapons to combat cancers, but drug resistance is common for most tumors. Due to the complexity of cancer drug resistance, the underlying mechanisms remain poorly understood for most human cancers, which hampers the development of new approaches to overcome drug resistance. An important question to address is whether drug resistance arises from rare pre-existing subclones with drug-resistant phenotypes prior to treatment (intrinsic resistance) or, alternatively, is acquired through induction of new mutations conferring drug-resistance (acquired resistance). Acquired versus intrinsic resistance has been studied for decades in bacteria, which are single-cell systems , but remains elusive in most human cancers. Single-cell sequencing can be used to resolve tumor heterogeneity, reconstruct the evolutionary trajectories of cancer cells, and identify rare subclones, and has therefore been a promising method to address drug resistance [19, 25, 29, 47, 165]. The recent study by Kim et al.  of triple-negative breast cancers treated with neoadjuvant chemotherapy employed both single-cell DNA- and RNA-sequencing to resolve the genesis and evolution of drug-resistant clones. Using DNA data from 900 cells and RNA data from 6862 cells, CNAs in drug-resistant subclones were found to be pre-existing and adaptively selected while their expression profiles were acquired through transcriptional reprogramming in response to chemotherapy. These results suggest a model of drug-resistance acquisition involving both intrinsic and acquired modes of evolution. According to the newly proposed model, drug resistance-associated CNAs are acquired in rare tumor clones during several short evolutionary bursts at the earliest stages of tumor progression and then subject to gradual evolution. Following anti-tumor therapies, the selective pressure will result in two fates for tumor cells: clonal extinction and persistence, during which the pre-existing rare drug-resistant tumor clones will persist and become the major clones. The transcriptional programs of the persisting clones will converge on a few common pathways associated with the therapy-resistance phenotypes. Both genomic mutations and transcriptional reprogramming could be relevant in understanding therapy resistance as they might exert different modes of evolution for changes at individual levels. It remains unclear how different mechanisms coordinate with one another; therefore, more powerful technologies, such as single-cell multi-omics, are needed to address these questions.
Dissecting the tumor microenvironment to understand cancer immune evasion and metastasis
The tumor microenvironment represents all components of a solid tumor that are not cancer cells. Besides the genetic and non-genetic heterogeneity among tumor clones, heterogeneity among tumor-infiltrating stromal and immune cells in the microenvironment also plays vital roles in tumor growth, angiogenesis, immune evasion, metastasis, and responses to various therapies. With bulk DNA sequencing, the genomes of these cells in the microenvironment are indistinguishable from those of normal tissues and thus often interfere with the detection of tumor CNAs and point mutations by altering tumor purity. With bulk RNA sequencing, the mRNAs of these cells are intermingled with those of tumor cells, which makes it difficult to untangle the expression signals by tumor cells from those by microenvironment cells. The variable compositions of tumor microenvironment often become ‘dark matter’ that confounds subsequent analyses. Although pathway analysis may indicate major types of infiltrated cells, the results are not sufficiently detailed to provide insights into the underlying mechanisms of tumor phenotypes. Computational deconvolution analysis can infer tumor-infiltrating cell types based on tumor bulk RNA-seq data [128,129,130]. However, these algorithms are limited by the availability of gene signatures specific to individual cell types and the collinearity among gene signature profiles.
The majority of these limitations are overcome by single-cell sequencing. With scRNA-seq, the immune landscapes of melanoma , glioblastoma , breast [52, 55, 56], head and neck , colorectal , liver , kidney, [54, 58] and lung [53, 57, 59] cancers have been depicted at unprecedented resolution. New immune cell subtypes with distinct functions or states have been identified, and genes specifically expressed in rare immune cells have been linked to tumor immune evasion. For example, results from a recent single cell study of lung cancers by 10X Genomics  revealed that tumor-enriched B cells can be further grouped into six clusters, of which two follicular B cell clusters are characterized by the high expression of CD20, CXCR4, and HLA-DRs. By contrast, two plasma B-cell clusters express immunoglobulin gamma and the remaining two mucosa-associated lymphoid tissue-derived B-cell clusters have immunoglobulins A and M and JCHAIN as signature molecules. Subtypes of macrophages were also depicted by mass cytometry . In particular, T cells, which specifically recognize tumor neoantigens and kill cancer cells in a targeted way, have been in the spotlight of single cell interrogation of several cancer types [49, 55, 57]. Tissue-resident T-cell subsets are found in liver, lung, and breast tumors, with lower T-cell exhaustion levels associated with better prognosis [49, 55, 57]. Immunotherapies that reinvigorate cytotoxic T cells via immune checkpoint blockade or adoptively transfer neoantigen-specific T cells are therapeutically effective in multiple cancer types . Specific T-cell clusters with suppressive functions in treatment-naïve tumors and T-cell clusters that respond to immunotherapies have been identified [47, 49, 178, 179]. Signature genes of these T-cell clusters, e.g., LAYN identified in exhausted CD8+ T cells and regulatory T cells of liver cancer, can provide attractive biomarkers to predict patient responses to cancer immunotherapies and potentially serve as new candidate targets for further investigation. Nevertheless, accompanying these great achievements, single-cell studies of tumor microenvironment are limited in their depictions of spatial, temporal, and interactive characteristics among cancer and immune cells.
Besides the immune cells themselves, cancer-associated fibroblasts (CAFs) also play crucial roles in cancer immune evasion and metastasis. Heterogeneity of CAFs in various cancer types via scRNA-seq has been shown in several studies [47, 48, 50, 59]. In lung cancer studies by 10X Genomics , five distinct types of tumor-resident fibroblasts were identified that expressed unique repertoires of collagens and other extracellular matrix molecules. In colorectal cancers profiled by SMART-seq2 , two distinct subtypes of CAFs were identified, one of which was enriched for epithelial–mesenchymal transition (EMT)-related genes, which is consistent with results from the lung cancer study . The heterogeneity of CAFs of these cancer types was consistent with results from earlier studies in metastatic melanoma and head and neck cancer, in which the potential functions of CAF subclusters were indicated [47, 48]. Interestingly, a specific subcluster of CAFs that exclusively expressed multiple complement factors, including C1S, C1R, C3, C4A, CFB, and C1NH (SERPING1), correlated with T-cell infiltration based on data analysis from the Cancer Genome Atlas project . Although the correlation cannot imply causality, the cellular and molecular mechanisms of T-cell recruitment by CAFs should be studied. Furthermore, certain CAFs observed in a head and neck cancer single-cell study were found to co-localize with malignant cells highly expressing a p-EMT (partial EMT) gene program that is correlated with metastasis . The co-localization was supported by numerous ligand–receptor interactions between CAFs and the corresponding malignant cells, thus providing new clues for the underlying mechanisms of tumor invasion. The dynamic nature of CAF gene expression certainly deserves further exploration.
Outlook of single-cell sequencing in cancer research
Single-cell epigenomic technologies are maturing and steadily making their way to cancer research [15, 68, 72, 180,181,182,183,184,185,186,187,188,189,190] (Fig. 3). These technologies provide various means to explore DNA methylation status, chromosome accessibility, protein binding, and high-order chromosome conformations. As single-cell epigenomic technologies depict the molecular layers connecting the genome and its functional outputs, the adaptation of single-cell epigenomic technologies to cancer research would greatly advance the understanding of regulatory mechanisms of cancer cell phenotypes and provide new therapeutic targets to combat cancers . New insights may also include mechanisms of cancer cell mutagenesis as epigenomics plays key roles in chromosome stability and dynamics . Single-cell epigenomic technologies may also help investigate the regulatory mechanisms that shape tumor-infiltrating cells, and thus help in advancing the development of therapies that target the tumor microenvironment.
Despite its exciting prospects, single-cell sequencing still faces notable technical challenges that limit the release of its full power in cancer research and clinical applications. For example, the single layer–omics technology generally only gives a snapshot of the state of tested cells. Thorough understanding of the functions of individual cells often requires comprehensive molecular information that covers all layers from the nucleus to extracellular matrix, and includes genomes, epigenomes, chromosome confirmation, transcriptomes, proteomes, metabolomes, and interactomes (Fig. 3). Comprehensive information is important for cancer studies because of the great genomic and phonemic heterogeneity of cancer cells. Single-cell multi-omics technologies [32, 76,77,78,79, 124, 187, 193] have proved feasible but these methods are still in the infant phase of development, limited by low coverage, throughput, and automation levels. Wide application of such technologies in cancer research and clinics requires more effort to conquer the aforementioned challenges. CITE-seq has been used to simultaneously profile mRNA levels and the abundance of a set of selected proteins of cancer samples . Furthermore, SUPeR-seq allows simultaneous measuring of linear and circular RNA levels within the same single cancer cell and associated cells , and G&T-seq provides both genomic and transcriptomic information of a given cell . scTrio-seq has been used to obtain epigenomic, genomic, and transcriptomic information of the same cancer cell .
Future challenges will include circumventing the loss of spatial information of tested single cells during the dissociation step. Tumor ecosystems are highly organized and dynamic; therefore, the spatial positions of various cancer cells and the tumor microenvironment cells and their interactions may play pivotal roles during cancer progression, metastasis, immune evasion, and the development of therapeutic resistance (Fig. 3). Integration of imaging techniques with single-cell sequencing have made meaningful progress in this area. By recording the spatial information of single cells or important ‘anchor genes’ via FISH, smFISH, immunohistochemistry, laser capture microdissection, laser scanning microscopy, or in situ sequencing, the spatial structure of single cells can be experimentally recorded or computationally reconstructed [21, 87,88,89,90,91, 132, 138, 143, 149], thereby shedding light on the spatial heterogeneity of tumor ecosystems. The recently developed NICHE-seq technology  allows isolation of immune cells in a specifically prescribed niche of model animals for single-cell sequencing, which provides a powerful tool to explore tumor immunology in animal models. However, the wider application of NICHE-seq to clinical samples will take time, because two-photon laser scanning microscopy requires the targeted cells to be optically labeled, which at present is only possible on model animals. ProximID maps the cellular interaction network of tissues and could be used for spatial position mapping to cellular physical networks  to show how cancer cells interplay with the tumor microenvironment (Fig. 3). ProximID dissects tissues into doublets or triplets to capture the physical interactions among cells and determines cellular identities via scRNA-seq. ProximID provides great promise for cellular interaction and spatial position mapping, as shown by the recently proposed paired-cell sequencing method that adopts a similar strategy ; however, cellular throughput is still modest at present. A newer version of ProximID parallels the microdissection of doublets and triplets with single cell identity determination, and improves the throughput at the expense of accuracy of cell identity assignment. Overall, creative technological advances in the basic research field have recently emerged in quick succession. Despite obvious pros and cons, they provide exciting new tools to interrogate human cancers at the single-cell level.
Furthermore, the development of new computational and analytical tools is often lagging behind corresponding experimental methods. The new single-cell sequencing data, with added new dimensions or features, often violate the analytical assumptions of bulk sequencing studies, which makes existing analytical tools obsolete or underpowered. For example, the data structure of single-cell sequencing of cancers requires the application of tensors to depict the gene-by-cell-by-sample relationships, whereas the bulk sequencing data can be sufficiently encapsulated by gene-by-sample matrices. Analytical tools currently available are generally designed for matrix-based data structure. Reduction of dimensionality from tensors to matrices is currently needed to use the available bioinformatics tools to analyze either gene-by-cell, gene-by-sample, or cell-by-sample relationships. Tools for simultaneous analysis of gene–cell–sample relationships are urgently needed. The ever-increasing data size of single-cell sequencing studies also requires more robust computational powers. Down-sampling is often applied to reduce data size so that the dataset can be analyzed. Computational algorithms that can handle large single-cell sequencing datasets while simultaneously maintaining similar analytical performance are needed. The spatial single-cell RNA sequencing technique also generates unprecedented data type, for which two new algorithms have been proposed recently [196, 197], allowing analysis of the spatial variance of cancers. Computational development specifically for single-cell data will likely be the field to watch in the next few years, because there are many unresolved yet important issues. It is hoped that bioinformatics of single-cell analysis will catch up with the rapid technology development and the ever-expanding appetite for new data in the cancer research field.
Potential applications of single-cell sequencing in the clinic
Single-cell technologies use limited input materials to resolve tumor heterogeneity and so have great potential in the cancer clinic for diagnosis, prognosis, early detection, risk assessment, progress monitoring, and therapy response prediction. Single cancerous cells can be isolated from blood samples in early stages of cancer genesis [161, 170, 172], which enables early detection and assessment of cancers [198, 199]. If a set of known driver mutations are observed independently in multiple single cancer cells, clonal expansion of cancerous cells is inferred. Additional diagnostic tests are then combined to validate the inference, and further monitoring or treatments may be needed. For diagnosed cancer patients, single-cell sequencing can reveal clonal and subclonal information of their tumor lesions with respect to their genomic and transcriptomic characteristics, upon which clinicians can determine the most suitable therapies . With longitudinal sampling of CTCs or DTCs (disseminated tumor cells), single-cell sequencing also allows the monitoring of patient responses to the prescribed therapies [31, 171, 201]. The resulting genomic and transcriptomic information can be used to examine the selective pressure of drugs to various cancer clones and alert the emergence or expansion of drug-resistance cancer clones . The non-invasive nature of CTC or DTC isolation also greatly reduces the inherent risks of core biopsy directly at the tumor site. Single-cell sequencing data potentially provide metrics beyond conventional genomic mutation data or gene expression data for prognosis analysis. For example, various indices for tumor heterogeneity could be designed to predict responses to therapies, probability of metastasis, disease-free periods, and overall survival [147, 202,203,204,205].
Since its inception, single-cell sequencing has revolutionized cancer research. The pioneering studies have covered the development and applications of single-cell DNA and RNA sequencing to address a wide range of topics such as intra-tumor heterogeneity of primary tumors, roles of CTCs and DTCs during metastasis, evolution of therapy resistance, and the characteristics of tumor microenvironments. Many novel biological insights have been obtained, and the revolution is just starting. Improvement of existing single-cell sequencing technologies, emergence of new techniques, and the integration of single-cell sequencing with other experimental protocols have provided powerful toolsets to understand many of the remaining mysteries of cancers. Single-cell epigenomics, multi-omics, and spatial single-cell sequencing technologies are some of the major directions of single-cell sequencing technologies that will bring the second wave of revolutions of cancer research.
Copy number alteration
Circulating tumor cells
Disseminated tumor cells
Fluorescence in situ hybridization
Single-cell RNA sequencing
Single molecule fluorescence in situ hybridization
Single nucleotide variant
Valkenburg KC, de Groot AE, Pienta KJ. Targeting the tumour stroma to improve cancer therapy. Nat Rev Clin Oncol. 2018;15:366–81.
Amend SR, Roy S, Brown JS, Pienta KJ. Ecological paradigms to understand the dynamics of metastasis. Cancer Lett. 2016;380:237–42.
Merlo LMF, Pepper JW, Reid BJ, Maley CC. Cancer as an evolutionary and ecological process. Nat Rev Cancer. 2006;6:924–35.
Maley CC, Aktipis A, Graham TA, Sottoriva A, Boddy AM, Janiszewska M, et al. Classifying the evolutionary and ecological features of neoplasms. Nat Rev Cancer. 2017;17:605–19.
Greaves M, Maley CC. Clonal evolution in cancer. Nature. 2012;481:306–13.
Tang F, Barbacioru C, Wang Y, Nordman E, Lee C, Xu N, et al. mRNA-Seq whole-transcriptome analysis of a single cell. Nat Methods. 2009;6:377–82.
Navin N, Kendall J, Troge J, Andrews P, Rodgers L, McIndoo J, et al. Tumour evolution inferred by single-cell sequencing. Nature. 2011;472:90–4.
Baslan T, Kendall J, Rodgers L, Cox H, Riggs M, Stepansky A, et al. Genome-wide copy number analysis of single cells. Nat Protocols. 2012;7:1024–41.
Lu S, Zong C, Fan W, Yang M, Li J, Chapman AR, et al. Probing meiotic recombination and aneuploidy of single sperm cells by whole-genome sequencing. Science. 2012;338:1627–30.
Hou Y, Song L, Zhu P, Zhang B, Tao Y, Xu X, et al. Single-cell exome sequencing and monoclonal evolution of a JAK2-negative myeloproliferative neoplasm. Cell. 2012;148:873–85.
Hughes AE, Magrini V, Demeter R, Miller CA, Fulton R, Fulton LL, et al. Clonal architecture of secondary acute myeloid leukemia defined by single-cell sequencing. PLoS Genet. 2014;10:e1004462.
Landau DA, Clement K, Ziller MJ, Boyle P, Fan J, Gu H, et al. Locally disordered methylation forms the basis of intratumor methylome variation in chronic lymphocytic leukemia. Cancer Cell. 2014;26:813–25.
Demeulemeester J, Kumar P, Moller EK, Nord S, Wedge DC, Peterson A, et al. Tracing the origin of disseminated tumor cells in breast cancer using single-cell sequencing. Genome Biol. 2016;17:250.
Jahn K, Kuipers J, Beerenwinkel N. Tree inference for single-cell data. Genome Biol. 2016;17:86.
Corces MR, Buenrostro JD, Wu B, Greenside PG, Chan SM, Koenig JL, et al. Lineage-specific and single-cell chromatin accessibility charts human hematopoiesis and leukemia evolution. Nat Genetics. 2016;48:1193–203.
Gao Y, Ni X, Guo H, Su Z, Ba Y, Tong Z, et al. Single-cell sequencing deciphers a convergent evolution of copy number alterations from primary to circulating tumor cells. Genome Res. 2017;27:1312–22.
Kuipers J, Jahn K, Raphael BJ, Beerenwinkel N. Single-cell sequencing data reveal widespread recurrence and loss of mutational hits in the life histories of tumors. Genome Res. 2017;27:1885–94.
Wu H, Zhang XY, Hu Z, Hou Q, Zhang H, Li Y, et al. Evolution and heterogeneity of non-hereditary colorectal cancer revealed by single-cell exome sequencing. Oncogene. 2017;36:2857–67.
Brady SW, McQuerry JA, Qiao Y, Piccolo SR, Shrestha G, Jenkins DF, et al. Combating subclonal evolution of resistant cancer phenotypes. Nat Commun. 2017;8:1231.
Kim C, Gao R, Sei E, Brandt R, Hartman J, Hatschek T, et al. Chemoresistance evolution in triple-negative breast cancer delineated by single-cell sequencing. Cell. 2018;173:879–93 e13.
Casasent AK, Schalck A, Gao R, Sei E, Long A, Pangburn W, et al. Multiclonal invasion in breast tumors identified by topographic single cell sequencing. Cell. 2018;172:205–17 e212.
Tirosh I, Venteicher AS, Hebert C, Escalante LE, Patel AP, Yizhak K, et al. Single-cell RNA-seq supports a developmental hierarchy in human oligodendroglioma. Nature. 2016;539:309–13.
Dalerba P, Kalisky T, Sahoo D, Rajendran PS, Rothenberg ME, Leyrat AA, et al. Single-cell dissection of transcriptional heterogeneity in human colon tumors. Nat Biotechnol. 2011;29:1120–7.
Powell AA, Talasaz AH, Zhang H, Coram MA, Reddy A, Deng G, et al. Single cell profiling of circulating tumor cells: transcriptional heterogeneity and diversity from breast cancer cell lines. Plos One. 2012;7:e33788.
Lee MC, Lopez-Diaz FJ, Khan SY, Tariq MA, Dayn Y, Vaske CJ, et al. Single-cell analyses of transcriptional heterogeneity during drug tolerance transition in cancer cells by RNA sequencing. Proc Natl Acad Sci U S A. 2014;111:E4726–35.
Cleary AS, Leonard TL, Gestl SA, Gunther EJ. Tumour cell heterogeneity maintained by cooperating subclones in Wnt-driven mammary cancers. Nature. 2014;508:113–7.
Patel AP, Tirosh I, Trombetta JJ, Shalek AK, Gillespie SM, Wakimoto H, et al. Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma. Science. 2014;344:1396–401.
Pollen AA, Nowakowski TJ, Shuga J, Wang X, Leyrat AA, Lui JH, et al. Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex. Nat Biotechnol. 2014;32:1053–8.
Kim KT, Lee HW, Lee HO, Kim SC, Seo YJ, Chung W, et al. Single-cell mRNA sequencing identifies subclonal heterogeneity in anti-cancer drug responses of lung adenocarcinoma cells. Genome Biol. 2015;16:127.
Janiszewska M, Liu L, Almendro V, Kuang Y, Paweletz C, Sakr RA, et al. In situ single-cell analysis identifies heterogeneity for PIK3CA mutation and HER2 amplification in HER2-positive breast cancer. Nat Genet. 2015;47:1212–9.
Miyamoto DT, Zheng Y, Wittner BS, Lee RJ, Zhu H, Broderick KT, et al. RNA-Seq of single prostate CTCs implicates noncanonical Wnt signaling in antiandrogen resistance. Science. 2015;349:1351–6.
Hou Y, Guo H, Cao C, Li X, Hu B, Zhu P, et al. Single-cell triple omics sequencing reveals genetic, epigenetic, and transcriptomic heterogeneity in hepatocellular carcinomas. Cell Res. 2016;26:304–19.
Bakker B, Taudt A, Belderbos ME, Porubsky D, Spierings DC, de Jong TV, et al. Single-cell sequencing reveals karyotype heterogeneity in murine and human malignancies. Genome Biol. 2016;17:115.
Mann KM, Newberg JY, Black MA, Jones DJ, Amaya-Manzanares F, Guzman-Rojas L, et al. Analyzing tumor heterogeneity and driver genes in single myeloid leukemia cells with SBCapSeq. Nat Biotechnol. 2016;34:962–72.
Angermueller C, Clark SJ, Lee HJ, Macaulay IC, Teng MJ, Hu TX, et al. Parallel single-cell sequencing links transcriptional and epigenetic heterogeneity. Nat Methods. 2016;13:229–32.
Liu M, Liu Y, Di J, Su Z, Yang H, Jiang B, et al. Multi-region and single-cell sequencing reveal variable genomic heterogeneity in rectal cancer. BMC Cancer. 2017;17:787.
Gao R, Kim C, Sei E, Foukakis T, Crosetto N, Chan LK, et al. Nanogrid single-nucleus RNA sequencing reveals phenotypic diversity in breast cancer. Nat Commun. 2017;8:228.
Giustacchini A, Thongjuea S, Barkas N, Woll PS, Povinelli BJ, Booth CAG, et al. Single-cell transcriptomics uncovers distinct molecular signatures of stem cells in chronic myeloid leukemia. Nat Med. 2017;23:692–702.
Zong C, Lu S, Chapman AR, Xie XS. Genome-wide detection of single-nucleotide and copy-number variations of a single human cell. Science. 2012;338:1622–6.
Chen C, Xing D, Tan L, Li H, Zhou G, Huang L, et al. Single-cell whole-genome analyses by linear amplification via transposon insertion (LIANTI). Science. 2017;356:189–94.
Pastushenko I, Brisebarre A, Sifrim A, Fioramonti M, Revenco T, Boumahdi S, et al. Identification of the tumour transition states occurring during EMT. Nature. 2018;556:463–8.
Roerink SF, Sasaki N, Lee-Six H, Young MD, Alexandrov LB, Behjati S, et al. Intra-tumour diversification in colorectal cancer at the single-cell level. Nature. 2018;556:457–62.
Carter L, Rothwell DG, Mesquita B, Smowton C, Leong HS, Fernandez-Gutierrez F, et al. Molecular analysis of circulating tumor cells identifies distinct copy-number profiles in patients with chemosensitive and chemorefractory small-cell lung cancer. Nat Med. 2017;23:114–9.
Lawson DA, Bhakta NR, Kessenbrock K, Prummel KD, Yu Y, Takai K, et al. Single-cell analysis reveals a stem-cell program in human metastatic breast cancer cells. Nature. 2015;526:131–5.
Martelotto LG, Baslan T, Kendall J, Geyer FC, Burke KA, Spraggon L, et al. Whole-genome single-cell copy number profiling from formalin-fixed paraffin-embedded samples. Nat Med. 2017;23:376–85.
Suzuki A, Matsushima K, Makinoshima H, Sugano S, Kohno T, Tsuchihara K, et al. Single-cell analysis of lung adenocarcinoma cell lines reveals diverse expression patterns of individual cells invoked by a molecular target drug treatment. Genome Biol. 2015;16:66.
Tirosh I, Izar B, Prakadan SM, Wadsworth MH 2nd, Treacy D, Trombetta JJ, et al. Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq. Science. 2016;352:189–96.
Puram SV, Tirosh I, Parikh AS, Patel AP, Yizhak K, Gillespie S, et al. Single-cell transcriptomic analysis of primary and metastatic tumor ecosystems in head and neck cancer. Cell. 2017;171:1611–24 e1624.
Zheng C, Zheng L, Yoo JK, Guo H, Zhang Y, Guo X, et al. Landscape of infiltrating t cells in liver cancer revealed by single-cell sequencing. Cell. 2017;169:1342–56 e1316.
Li H, Courtois ET, Sengupta D, Tan Y, Chen KH, Goh JJL, et al. Reference component analysis of single-cell transcriptomes elucidates cellular heterogeneity in human colorectal tumors. Nat Genet. 2017;49:708–18.
Venteicher AS, Tirosh I, Hebert C, Yizhak K, Neftel C, Filbin MG, et al. Decoupling genetics, lineages, and microenvironment in IDH-mutant gliomas by single-cell RNA-seq. Science. 2017;355. https://doi.org/10.1126/science.aai8478.
Chung W, Eum HH, Lee H-O, Lee K-M, Lee H-B, Kim K-T, et al. Single-cell RNA-seq enables comprehensive tumour and immune cell profiling in primary breast cancer. Nat Commun. 2017;8:15081.
Lavin Y, Kobayashi S, Leader A, Amir El-Ad D, Elefant N, Bigenwald C, et al. Innate immune landscape in early lung adenocarcinoma by paired single-cell analyses. Cell. 2017;169:750–65.
Chevrier S, Levine JH, Zanotelli VRT, Silina K, Schulz D, Bacac M, et al. An immune atlas of clear cell renal cell carcinoma. Cell. 2017;169:736–49 e718.
Savas P, Virassamy B, Ye C, Salim A, Mintoff CP, Caramia F, et al. Single-cell profiling of breast cancer T cells reveals a tissue-resident memory subset associated with improved prognosis. Nat Med. 2018;24:986–93.
Azizi E, Carr AJ, Plitas G, Cornish AE, Konopacki C, Prabhakaran S, et al. Single-cell map of diverse immune phenotypes in the breast tumor microenvironment. Cell. 2018;173:1293–308.
Guo X, Zhang Y, Zheng L, Zheng C, Song J, Zhang Q, Kang B, Liu Z, Jin L, Xing R, et al. Global characterization of T cells in non-small-cell lung cancer by single-cell sequencing. Nat Med. 2018;174:1293–308.
Young MD, Mitchell TJ, Vieira Braga FA, Tran MGB, Stewart BJ, Ferdinand JR, et al. Single-cell transcriptomes from human kidneys reveal the cellular identity of renal tumors. Science. 2018;361:594–9.
Lambrechts D, Wauters E, Boeckx B, Aibar S, Nittner D, Burton O, et al. Phenotype molding of stromal cells in the lung tumor microenvironment. Nat Med. 2018;24:1277–89.
Navin NE. The first five years of single-cell cancer genomics and beyond. Genome Res. 2015;25:1499–507.
Saadatpour A, Lai S, Guo G, Yuan GC. Single-cell analysis in cancer genomics. Trends Genet. 2015;31:576–86.
Tsoucas D, Yuan GC. Recent progress in single-cell cancer genomics. Curr Opin Genet Dev. 2017;42:22–32.
Müller S, Diaz A. Single-cell mRNA sequencing in cancer research: integrating the genomic fingerprint. Front Genet. 2017;8:73.
Shapiro E, Biezuner T, Linnarsson S. Single-cell sequencing-based technologies will revolutionize whole-organism science. Nat Rev Genet. 2013;14:618–30.
Gawad C, Koh W, Quake SR. Single-cell genome sequencing: current state of the science. Nat Rev Genet. 2016;17:175–88.
Wu AR, Wang J, Streets AM, Huang Y. Single-cell transcriptional analysis. Annu Rev Anal Chem (Palo Alto Calif). 2017;10:439–62.
Stubbington MJT, Rozenblatt-Rosen O, Regev A, Teichmann SA. Single-cell transcriptomics to explore the immune system in health and disease. Science. 2017;358:58–63.
Wen L, Tang F. Single cell epigenome sequencing technologies. Mol Aspects Med. 2018;59:62–9.
Potter SS. Single-cell RNA sequencing for the study of development, physiology and disease. Nat Rev Nephrol. 2018;14:479–92.
Schwartzman O, Tanay A. Single-cell epigenomics: techniques and emerging applications. Nat Rev Genet. 2015;16:716–26.
Svensson V, Natarajan KN, Ly L-H, Miragaia RJ, Labalette C, Macaulay IC, et al. Power analysis of single-cell RNA-sequencing experiments. Nat Methods. 2017;14:381–7.
Kelsey G, Stegle O, Reik W. Single-cell epigenomics: recording the past and predicting the future. Science. 2017;358:69–75.
Livesey FJ. Strategies for microarray analysis of limiting amounts of RNA. Brief Funct Genomic Proteomic. 2003;2:31–6.
Navin NE. Cancer genomics: one cell at a time. Genome Biol. 2014;15:452.
Clark SJ, Lee HJ, Smallwood SA, Kelsey G, Reik W. Single-cell epigenomics: powerful new methods for understanding gene regulation and cell identity. Genome Biol. 2016;17:72.
Macaulay IC, Haerty W, Kumar P, Li YI, Hu TX, Teng MJ, et al. G&T-seq: parallel sequencing of single-cell genomes and transcriptomes. Nat Methods. 2015;12:519–22.
Guo F, Li L, Li J, Wu X, Hu B, Zhu P, et al. Single-cell multi-omics sequencing of mouse early embryos and embryonic stem cells. Cell Res. 2017;27:967–88.
Clark SJ, Argelaguet R, Kapourani C-A, Stubbs TM, Lee HJ, Alda-Catalinas C, et al. scNMT-seq enables joint profiling of chromatin accessibility DNA methylation and transcription in single cells. Nat Commun. 2018;9:781.
Hu Y, Huang K, An Q, Du G, Hu G, Xue J, et al. Simultaneous profiling of transcriptome and DNA methylome from a single cell. Genome Biol. 2016;17:88.
Stoeckius M, Hafemeister C, Stephenson W, Houck-Loomis B, Chattopadhyay PK, Swerdlow H, et al. Simultaneous epitope and transcriptome measurement in single cells. Nat Methods. 2017;14:865–8.
Zheng GXY, Terry JM, Belgrader P, Ryvkin P, Bent ZW, Wilson R, et al. Massively parallel digital transcriptional profiling of single cells. Nat Commun. 2017;8:14049.
Jaitin DA, Kenigsberg E, Keren-Shaul H, Elefant N, Paul F, Zaretsky I, et al. Massively parallel single-cell RNA-seq for marker-free decomposition of tissues into cell types. Science. 2014;343:776–9.
Gole J, Gore A, Richards A, Chiu Y-J, Fung H-L, Bushman D, et al. Massively parallel polymerase cloning and genome sequencing of single cells using nanoliter microwells. Nat Biotechnol. 2013;31:1126–32.
Macosko EZ, Basu A, Satija R, Nemesh J, Shekhar K, Goldman M, et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell. 2015;161:1202–14.
Rotem A, Ram O, Shoresh N, Sperling RA, Schnall-Levin M, Zhang H, et al. High-throughput single-cell labeling (Hi-SCL) for RNA-seq using drop-based microfluidics. PLoS One. 2015;10:e0116328.
Rosenberg AB, Roco CM, Muscat RA, Kuchina A, Sample P, Yao Z, et al. Single-cell profiling of the developing mouse brain and spinal cord with split-pool barcoding. Science. 2018;360:176–82.
Achim K, Pettit JB, Saraiva LR, Gavriouchkina D, Larsson T, Arendt D, et al. High-throughput spatial mapping of single-cell RNA-seq data to tissue of origin. Nat Biotechnol. 2015;33:503–9.
Halpern KB, Shenhav R, Matcovitch-Natan O, Tóth B, Lemze D, Golan M, et al. Single-cell spatial reconstruction reveals global division of labour in the mammalian liver. Nature. 2017;542:352–6.
Medaglia C, Giladi A, Stoler-Barak L, De Giovanni M, Salame TM, Biram A, et al. Spatial reconstruction of immune niches by combining photoactivatable reporters and scRNA-seq. Science. 2017;358:1622–6.
Satija R, Farrell JA, Gennert D, Schier AF, Regev A. Spatial reconstruction of single-cell gene expression data. Nat Biotechnol. 2015;33:495–502.
Chen J, Suo S, Tam PP, Han JJ, Peng G, Jing N. Spatial transcriptomic analysis of cryosectioned tissue samples with Geo-seq. Nat Protoc. 2017;12:566–80.
Lin P, Troup M, Ho JWK. CIDR: ultrafast and accurate clustering through imputation for single-cell RNA-seq data. Genome Biol. 2017;18:59.
Pierson E, Yau C. ZIFA: dimensionality reduction for zero-inflated single-cell gene expression analysis. Genome Biol. 2015;16:241.
Li WV, Li JJ. An accurate and robust imputation method scImpute for single-cell RNA-seq data. Nat Commun. 2018;9:997.
Gong W, Kwak IY, Pota P, Koyano-Nakagawa N, Garry DJ. DrImpute: imputing dropout events in single cell RNA sequencing data. BMC Bioinformatics. 2018;19:220.
Bacher R, Chu L-F, Leng N, Gasch AP, Thomson JA, Stewart RM, et al. SCnorm: robust normalization of single-cell RNA-seq data. Nat Methods. 2017;14:584–6.
McCarthy DJ, Campbell KR, Lun ATL, Wills QF. Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R. Bioinformatics. 2017;33:1179–86.
Butler A, Hoffman P, Smibert P, Papalexi E, Satija R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat Biotechnol. 2018;36:411–20.
Haghverdi L, Lun ATL, Morgan MD, Marioni JC. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors. Nat Biotechnol. 2018;36:421–7.
Vallejos CA, Risso D, Scialdone A, Dudoit S, Marioni JC. Normalizing single-cell RNA sequencing data: challenges and opportunities. Nat Methods. 2017;14:565–71.
Kiselev VY, Kirschner K, Schaub MT, Andrews T, Yiu A, Chandra T, et al. SC3: consensus clustering of single-cell RNA-seq data. Nat Methods. 2017;14:483–6.
Wang B, Zhu J, Pierson E, Ramazzotti D, Batzoglou S. Visualization and analysis of single-cell RNA-seq data by kernel-based similarity learning. Nat Methods. 2017;14:414–6.
Žurauskienė J, Yau C. pcaReduce: hierarchical clustering of single cell transcriptional profiles. BMC Bioinformatics. 2016;17:140.
Grun D, Lyubimova A, Kester L, Wiebrands K, Basak O, Sasaki N, et al. Single-cell messenger RNA sequencing reveals rare intestinal cell types. Nature. 2015;525:251–5.
Tsoucas D, Yuan G-C. GiniClust2: a cluster-aware, weighted ensemble clustering method for cell-type detection. Genome Biol. 2018;19:58.
Jiang L, Chen H, Pinello L, Yuan GC. GiniClust: detecting rare cell types from single-cell gene expression data with Gini index. Genome Biol. 2016;17:144.
Xu C, Su Z. Identification of cell types from single-cell transcriptomes using a novel clustering method. Bioinformatics. 2015;31:1974–80.
Jiang H, Sohn LL, Huang H, Chen L. Single cell clustering based on cell-pair differentiability correlation and variance analysis. Bioinformatics. 2018;34:3684–94.
Setty M, Tadmor MD, Reich-Zeliger S, Angel O, Salame TM, Kathail P, et al. Wishbone identifies bifurcating developmental trajectories from single-cell data. Nat Biotechnol. 2016;34:637–45.
Trapnell C, Cacchiarelli D, Grimsby J, Pokharel P, Li S, Morse M, et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat Biotechnol. 2014;32:381–6.
Teschendorff AE, Enver T. Single-cell entropy for accurate estimation of differentiation potency from a cell's transcriptome. Nat Commun. 2017;8:15599.
Ji Z, Ji H. TSCAN: pseudo-time reconstruction and evaluation in single-cell RNA-seq analysis. Nucleic Acids Res. 2016;44:e117.
El-ad DA, Davis KL, Tadmor MD, Simonds EF, Levine JH, Bendall SC, et al. viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia. Nat Biotechnol. 2013;31:545–52.
Weinreb C, Wolock S, Klein AM. SPRING: a kinetic interface for visualizing high dimensional single-cell expression data. Bioinformatics. 2018;34:1246–8.
Ding J, Condon A, Shah SP. Interpretable dimensionality reduction of single cell transcriptome data with deep generative models. Nat Commun. 2018;9:2002.
Rostom R, Svensson V, Teichmann SA, Kar G. Computational approaches for interpreting scRNA-seq data. FEBS Lett. 2017;591:2213–25.
Wen L, Tang F. Single-cell sequencing in stem cell biology. Genome Biol. 2016;17:71.
Poirion OB, Zhu X, Ching T, Garmire L. Single-cell transcriptomics bioinformatics and computational challenges. Front Genet. 2016;7:163.
Wagner A, Regev A, Yosef N. Revealing the vectors of cellular identity with single-cell genomics. Nat Biotechnol. 2016;34:1145–60.
Huang L, Ma F, Chapman A, Lu S, Xie XS. Single-cell whole-genome amplification and sequencing: methodology and applications. Annu Rev Genomics Hum Genet. 2015;16:79–102.
Stegle O, Teichmann SA, Marioni JC. Computational and analytical challenges in single-cell transcriptomics. Nat Rev Genet. 2015;16:133–45.
de Vargas RL, Claassen M. Computational and experimental single cell biology techniques for the definition of cell type heterogeneity, interplay and intracellular dynamics. Curr Opin Biotechnol. 2015;34:9–15.
Grun D, van Oudenaarden A. Design and analysis of single-cell sequencing experiments. Cell. 2015;163:799–810.
Fan X, Zhang X, Wu X, Guo H, Hu Y, Tang F, et al. Single-cell RNA-seq transcriptome analysis of linear and circular RNAs in mouse preimplantation embryos. Genome Biol. 2015;16:148.
Bose S, Wan Z, Carr A, Rizvi AH, Vieira G, Pe'er D, et al. Scalable microfluidics for single-cell RNA printing and sequencing. Genome Biol. 2015;16:120.
Yu M, Ting DT, Stott SL, Wittner BS, Ozsolak F, Paul S, et al. RNA sequencing of pancreatic circulating tumour cells implicates WNT signalling in metastasis. Nature. 2012;487:510–3.
Salehi S, Steif A, Roth A, Aparicio S, Bouchard-Cote A, Shah SP. ddClone: joint statistical inference of clonal populations from single cell and bulk tumour sequencing data. Genome Biol. 2017;18:44.
Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods. 2015;12:453–7.
Li B, Severson E, Pignon J-C, Zhao H, Li T, Novak J, et al. Comprehensive analyses of tumor immunity: implications for cancer immunotherapy. Genome Biol. 2016;17:174.
Racle J, de Jonge K, Baumgaertner P, Speiser DE, Gfeller D. Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data. eLife. 2017;6:e26476.
Schelker M, Feau S, Du J, Ranu N, Klipp E, MacBeath G, et al. Estimation of immune cell content in tumour tissue using single-cell RNA-seq data. Nat Commun. 2017;8:2032.
Shah S, Lubeck E, Zhou W, Cai L. In situ transcription profiling of single cells reveals spatial organization of cells in the mouse hippocampus. Neuron. 2016;92:342–57.
Stahl PL, Salmen F, Vickovic S, Lundmark A, Navarro JF, Magnusson J, et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science. 2016;353:78–82.
Wahlby C. The quest for multiplexed spatially resolved transcriptional profiling. Nat Methods. 2016;13:623–4.
Chen KH, Boettiger AN, Moffitt JR, Wang S, Zhuang X. Spatially resolved, highly multiplexed RNA profiling in single cells. Science. 2015;348:aaa6090.
Crosetto N, Bienko M, van Oudenaarden A. Spatially resolved transcriptomics and beyond. Nat Rev Genet. 2015;16:57–66.
Lee JH, Daugharthy ER, Scheiman J, Kalhor R, Yang JL, Ferrante TC, et al. Highly multiplexed subcellular RNA sequencing in situ. Science. 2014;343:1360–3.
Lovatt D, Ruble BK, Lee J, Dueck H, Kim TK, Fisher S, et al. Transcriptome in vivo analysis (TIVA) of spatially defined single cells in live tissue. Nat Methods. 2014;11:190–6.
Lee JH, Daugharthy ER, Scheiman J, Kalhor R, Ferrante TC, Terry R, et al. Fluorescent in situ sequencing (FISSEQ) of RNA for gene expression profiling in intact cells and tissues. Nat Protoc. 2015;10:442–58.
Lubeck E, Coskun AF, Zhiyentayev T, Ahmad M, Cai L. Single-cell in situ RNA profiling by sequential hybridization. Nat Methods. 2014;11:360–1.
Ke R, Mignardi M, Pacureanu A, Svedlund J, Botling J, Wahlby C, et al. In situ sequencing for RNA analysis in preserved tissue and cells. Nat Methods. 2013;10:857–60.
Larsson C, Grundberg I, Soderberg O, Nilsson M. In situ detection and genotyping of individual mRNA molecules. Nat Methods. 2010;7:395–7.
Wen L, Tang F. Reconstructing complex tissues from single-cell analyses. Cell. 2014;157:771–3.
Liu Z, Lou H, Xie K, Wang H, Chen N, Aparicio OM, et al. Reconstructing cell cycle pseudo time-series via single-cell transcriptome data. Nat Commun. 2017;8:22.
Scialdone A, Natarajan KN, Saraiva LR, Proserpio V, Teichmann SA, Stegle O, et al. Computational assignment of cell-cycle stage from single-cell transcriptome data. Methods. 2015;85:54–61.
Leng N, Chu L-F, Barry C, Li Y, Choi J, Li X, et al. Oscope identifies oscillatory genes in unsynchronized single-cell RNA-seq experiments. Nat Methods. 2015;12:947–50.
Sima C, Hua J, Bittner ML, Kim S, Dougherty ER. Phenotype classification using moment features of single-cell data. Cancer Inform. 2018;17:1176935118771701.
Rozenblatt-Rosen O, Stubbington MJT, Regev A, Teichmann SA. The human cell atlas: from vision to reality. Nature. 2017;550:451–3.
Iacono G, Mereu E, Guillaumet-Adkins A, Corominas R, Cusco I, Rodriguez-Esteban G, et al. bigSCale: an analytical framework for big-scale single-cell data. Genome Res. 2018;28:878–90.
Wolf FA, Angerer P, Theis FJ. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018;19:15.
Sinha D, Kumar A, Kumar H. Bandyopadhyay S. Sengupta D. dropClust: efficient clustering of ultra-large scRNA-seq data. Nucleic Acids Res. 2018;46:e36.
Andor N, Graham TA, Jansen M, Xia LC, Aktipis CA, Petritsch C, et al. Pan-cancer analysis of the extent and consequences of intratumor heterogeneity. Nat Med. 2016;22:105–13.
Maman S, Witz IP. A history of exploring cancer in context. Nat Rev Cancer. 2018;18:359–76.
Gerlinger M, Rowan AJ, Horswell S, Larkin J, Endesfelder D, Gronroos E, et al. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N Engl J Med. 2012;366:883–92.
Zhang J, Fujimoto J, Zhang J, Wedge DC, Song X, Zhang J, et al. Intratumor heterogeneity in localized lung adenocarcinomas delineated by multiregion sequencing. Science. 2014;346:256–9.
Wang Y, Waters J, Leung ML, Unruh A, Roh W, Shi X, et al. Clonal evolution in breast cancer revealed by single nucleus genome sequencing. Nature. 2014;512:155–60.
Eirew P, Steif A, Khattra J, Ha G, Yap D, Farahani H, et al. Dynamics of genomic clones in breast cancer patient xenografts at single-cell resolution. Nature. 2015;518:422–6.
Xu X, Hou Y, Yin X, Bao L, Tang A, Song L, et al. Single-cell exome sequencing reveals single-nucleotide mutation characteristics of a kidney tumor. Cell. 2012;148:886–95.
Li Y, Xu X, Song L, Hou Y, Li Z, Tsang S, et al. Single-cell sequencing analysis characterizes common and cell-lineage-specific mutations in a muscle-invasive bladder cancer. Gigascience. 2012;1:12.
Yu C, Yu J, Yao X, Wu WKK, Lu Y, Tang S, et al. Discovery of biclonal origin and a novel oncogene SLC12A5 in colon cancer by single-cell sequencing. Cell Res. 2014;24:701–12.
Heitzer E, Auer M, Gasch C, Pichler M, Ulz P, Hoffmann EM, et al. Complex tumor genomes inferred from single circulating tumor cells by array-CGH and next-generation sequencing. Cancer Res. 2013;73:2965–75.
Francis JM, Zhang C-Z, Maire CL, Jung J, Manzo VE, Adalsteinsson VA, et al. EGFR variant heterogeneity in glioblastoma resolved through single-nucleus sequencing. Cancer Discov. 2014;4:956–71.
Jan M, Snyder TM, Corces-Zimmerman MR, Vyas P, Weissman IL, Quake SR, et al. Clonal evolution of preleukemic hematopoietic stem cells precedes human acute myeloid leukemia. Sci Transl Med. 2012;4:149ra118.
Gawad C, Koh W, Quake SR. Dissecting the clonal origins of childhood acute lymphoblastic leukemia by single-cell genomics. Proc Natl Acad Sci U S A. 2014;111:17947–52.
Ebinger S, Ozdemir EZ, Ziegenhain C, Tiedt S, Castro Alves C, Grunert M, et al. Characterization of rare, dormant, and therapy-resistant cells in acute lymphoblastic leukemia. Cancer Cell. 2016;30:849–62.
Müller S, Liu SJ, Di Lullo E, Malatesta M, Pollen AA, Nowakowski TJ, et al. Single-cell sequencing maps gene expression to mutational phylogenies in PDGF- and EGF-driven gliomas. Mol Syst Biol. 2016;12:889.
Aceto N, Bardia A, Miyamoto DT, Donaldson MC, Wittner BS, Spencer JA, et al. Circulating tumor cell clusters are oligoclonal precursors of breast cancer metastasis. Cell. 2014;158:1110–22.
Li Y, Wu S, Bai F. Molecular characterization of circulating tumor cells-from bench to bedside. Semin Cell Dev Biol. 2018;75:88–97.
Ruiz C, Li J, Luttgen MS, Kolatkar A, Kendall JT, Flores E, et al. Limited genomic heterogeneity of circulating melanoma cells in advanced stage patients. Phys Biol. 2015;12:016008.
Ni X, Zhuo M, Su Z, Duan J, Gao Y, Wang Z, et al. Reproducible copy number variation patterns among single circulating tumor cells of lung cancer patients. Proc Natl Acad Sci U S A. 2013;110:21083–8.
Dago AE, Stepansky A, Carlsson A, Luttgen M, Kendall J, Baslan T, et al. Rapid phenotypic and genomic change in response to therapeutic pressure in prostate cancer inferred by high content analysis of single circulating tumor cells. PLoS One. 2014;9:e101777.
Lohr JG, Adalsteinsson VA, Cibulskis K, Choudhury AD, Rosenberg M, Cruz-Gordillo P, et al. Whole-exome sequencing of circulating tumor cells provides a window into metastatic prostate cancer. Nat Biotechnol. 2014;32:479–84.
Ramskold D, Luo SJ, Wang YC, Li R, Deng QL, Faridani OR, et al. Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells. Nat Biotechnol. 2012;30:777–82.
Ting DT, Wittner BS, Ligorio M, Jordan NV, Shah AM, Miyamoto DT, et al. Single-cell RNA sequencing identifies extracellular matrix gene expression by pancreatic circulating tumor cells. Cell Rep. 2014;8:1905–18.
Luria SE, Delbruck M. Mutations of bacteria from virus sensitivity to virus resistance. Genetics. 1943;28:491–511.
Darmanis S, Sloan SA, Croote D, Mignardi M, Chernikova S, Samghababi P, et al. Single-cell RNA-seq analysis of infiltrating neoplastic cells at the migrating front of human glioblastoma. Cell Rep. 2017;21:1399–410.
Sharma P, Hu-Lieskovan S, Wargo JA, Ribas A. Primary, adaptive, and acquired resistance to cancer immunotherapy. Cell. 2017;168:707–23.
Wei SC, Levine JH, Cogdill AP, Zhao Y, Anang N-AAS, Andrews MC, et al. Distinct cellular mechanisms underlie anti-CTLA-4 and anti-PD-1 checkpoint blockade. Cell. 2017;170:1120–33 e1117.
Zappasodi R, Budhu S, Hellmann MD, Postow MA, Senbabaoglu Y, Manne S, et al. Non-conventional inhibitory CD4+Foxp3–PD-1hi T Cells as a biomarker of immune checkpoint blockade activity. Cancer Cell. 2018;33:1017–32 e7.
Stelzer Y, Shivalila CS, Soldner F, Markoulaki S, Jaenisch R. Tracing dynamic changes of DNA methylation at single-cell resolution. Cell. 2015;163:218–29.
Zhu C, Gao Y, Guo H, Xia B, Song J, Wu X, et al. Single-cell 5-formylcytosine landscapes of mammalian early embryos and ESCs at single-base resolution. Cell Stem Cell. 2017;20:720–31 e5.
Guo H, Zhu P, Guo F, Li X, Wu X, Fan X, et al. Profiling DNA methylome landscapes of mammalian cells with single-cell reduced-representation bisulfite sequencing. Nat Protoc. 2015;10:645–59.
Stevens TJ, Lando D, Basu S, Atkinson LP, Cao Y, Lee SF, et al. 3D structures of individual mammalian genomes studied by single-cell Hi-C. Nature. 2017;544:59–64.
Flyamer IM, Gassler J, Imakaev M, Brandao HB, Ulianov SV, Abdennur N, et al. Single-nucleus Hi-C reveals unique chromatin reorganization at oocyte-to-zygote transition. Nature. 2017;544:110–4.
Ramani V, Deng X, Qiu R, Gunderson KL, Steemers FJ, Disteche CM, et al. Massively multiplex single-cell Hi-C. Nat Methods. 2017;14:263–6.
Nagano T, Lubling Y, Stevens TJ, Schoenfelder S, Yaffe E, Dean W, et al. Single-cell Hi-C reveals cell-to-cell variability in chromosome structure. Nature. 2013;502:59–64.
Satpathy AT, Saligrama N, Buenrostro JD, Wei Y, Wu B, Rubin AJ, et al. Transcript-indexed ATAC-seq for precision immune profiling. Nat Med. 2018;24:580–90.
Cusanovich DA, Daza R, Adey A, Pliner HA, Christiansen L, Gunderson KL, et al. Multiplex single cell profiling of chromatin accessibility by combinatorial cellular indexing. Science. 2015;348:910–4.
Buenrostro JD, Wu B, Litzenburger UM, Ruff D, Gonzales ML, Snyder MP, et al. Single-cell chromatin accessibility reveals principles of regulatory variation. Nature. 2015;523:486–90.
Rotem A, Ram O, Shoresh N, Sperling RA, Goren A, Weitz DA, et al. Single-cell ChIP-seq reveals cell subpopulations defined by chromatin state. Nat Biotechnol. 2015;33:1165–72.
Wang L, Leite de Oliveira R, Huijberts S, Bosdriesz E, Pencheva N, Brunen D, et al. An acquired vulnerability of drug-resistant melanoma with therapeutic potential. Cell. 2018;173:1413–25.
Friedman N, Rando OJ. Epigenomics and the structure of the living genome. Genome Res. 2015;25:1482–90.
Dey SS, Kester L, Spanjaard B, Bienko M, van Oudenaarden A. Integrated genome and transcriptome sequencing of the same cell. Nat Biotechnol. 2015;33:285–9.
Boisset J-C, Vivié J, Grün D, Muraro MJ, Lyubimova A, van Oudenaarden A. Mapping the physical network of cellular interactions. Nat Methods. 2018;15:547–53.
Halpern KB, Shenhav R, Massalha H, Toth B, Egozi A, Massasa EE, et al. Paired-cell sequencing enables spatial gene expression mapping of liver endothelial cells. Nat Biotechnol. 2018;36:962–70.
Edsgärd D, Johnsson P, Sandberg R. Identification of spatial expression trends in single-cell gene expression data. Nat Methods. 2018;15:339–42.
Svensson V, Teichmann SA, Stegle O. SpatialDE: identification of spatially variable genes. Nat Methods. 2018;15:343–6.
Russell MR, D'Amato A, Graham C, Crosbie EJ, Gentry-Maharaj A, Ryan A, et al. Novel risk models for early detection and screening of ovarian cancer. Oncotarget. 2017;8:785–97.
Bowtell DD, Boehm S, Ahmed AA, Aspuria P-J, Bast RC Jr, Beral V, et al. Rethinking ovarian cancer II: reducing mortality from high-grade serous ovarian cancer. Nat Rev Cancer. 2015;15:668–79.
Baudino TA. Targeted cancer therapy: the next generation of cancer treatment. Curr Drug Discov Technol. 2015;12:3–20.
Scher HI, Lu D, Schreiber NA, Louw J, Graf RP, Vargas HA, et al. Association of AR-V7 on circulating tumor cells as a treatment-specific biomarker with outcomes and survival in castration-resistant prostate cancer. JAMA Oncol. 2016;2:1441–9.
Burrell RA, McGranahan N, Bartek J, Swanton C. The causes and consequences of genetic heterogeneity in cancer evolution. Nature. 2013;501:338–45.
Murugaesu N, Chew SK, Swanton C. Adapting clinical paradigms to the challenges of cancer clonal evolution. Am J Pathol. 2013;182:1962–71.
Almendro V, Cheng Y-K, Randles A, Itzkovitz S, Marusyk A, Ametller E, et al. Inference of tumor evolution during chemotherapy by computational modeling and in situ analysis of genetic and phenotypic cellular diversity. Cell Rep. 2014;6:514–27.
Sadanandam A, Lyssiotis CA, Homicsko K, Collisson EA, Gibb WJ, Wullschleger S, et al. A colorectal cancer classification system that associates cellular phenotype and responses to therapy. Nat Med. 2013;19:619–25.
Baslan T, Kendall J, Ward B, Cox H, Leotta A, Rodgers L, et al. Optimizing sparse sequencing of single cells for highly multiplex copy number profiling. Genome Res. 2015;25:714–24.
Dean FB, Hosono S, Fang LH, Wu XH, Faruqi AF, Bray-Ward P, et al. Comprehensive human genome amplification using multiple displacement amplification. Proc Natl Acad Sci U S A. 2002;99:5261–6.
Leung K, Klaus A, Lin BK, Laks E, Biele J, Lai D, et al. Robust high-performance nanoliter-volume single-cell multiple displacement amplification on planar substrates. Proc Natl Acad Sci U S A. 2016;113:8484–9.
This project was supported by grants from the Beijing Advanced Innovation Centre for Genomics at Peking University, Key Technologies R&D Program (2016YFC0900100), and the National Natural Science Foundation of China (81573022, 31530036, 91742203).
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.