Mapping human pluripotent stem cell differentiation pathways using high throughput single-cell RNA-sequencing
Genome Biology volume 19, Article number: 47 (2018)
Human pluripotent stem cells (hPSCs) provide powerful models for studying cellular differentiations and unlimited sources of cells for regenerative medicine. However, a comprehensive single-cell level differentiation roadmap for hPSCs has not been achieved.
We use high throughput single-cell RNA-sequencing (scRNA-seq), based on optimized microfluidic circuits, to profile early differentiation lineages in the human embryoid body system. We present a cellular-state landscape for hPSC early differentiation that covers multiple cellular lineages, including neural, muscle, endothelial, stromal, liver, and epithelial cells. Through pseudotime analysis, we construct the developmental trajectories of these progenitor cells and reveal the gene expression dynamics in the process of cell differentiation. We further reprogram primed H9 cells into naïve-like H9 cells to study the cellular-state transition process. We find that genes related to hemogenic endothelium development are enriched in naïve-like H9. Functionally, naïve-like H9 show higher potency for differentiation into hematopoietic lineages than primed cells.
Our single-cell analysis reveals the cellular-state landscape of hPSC early differentiation, offering new insights that can be harnessed for optimization of differentiation protocols.
Thomson et al. derived human pluripotent stem cells (hPSCs) from human blastocysts for the first time in 1998 . hPSCs have the capacity of self-renewal and multilineage differentiation both in vitro and in vivo. These features of hPSCs have provided remarkable promise in developmental biology and regenerative medicine . hPSCs can be used to generate diverse cell-types from all three germ layers using different differentiation protocols [3,4,5,6,7]. However, most existing protocols suffer from low efficiency and functional deficiency.
In vivo, fertilized mammalian eggs undergo multiple cleavage divisions and form blastocysts (Fig. 1a). The pre-implantation mouse epiblasts obtained from blastocysts have the ground-state naïve pluripotency that can be recapitulated in vitro in the form of embryonic stem cells (ESCs) [8, 9]. Soon after implantation, epiblasts become primed for lineage specification. In vitro, the counterparts of primed epiblasts are termed epiblast stem cells (EpiSCs), which are functionally and morphologically distinct from ESCs. These two states of pluripotent stem cells (i.e. ESCs and EpiSCs) are interchangeable under specific conditions . The study of this cellular-state transition process will contribute to the understanding of early development from pre-implantation epiblasts to post-implantation epiblasts. Conventional hPSCs are considered as the primed state with the molecular and functional identity of post-implantation lineage-primed epiblasts. Several groups have established the naïve hPSCs, which share several molecular features and functional characteristics with naïve mPSCs and pre-implantation epiblasts [10,11,12,13,14,15,16]. Following lineage specification, primed epiblasts develop into embryonic ectoderm and primitive streak, which further develop into embryonic mesoderm and endoderm. These three embryonic germ layers develop into all embryonic tissues. Under proper in vitro culture conditions, hPSCs can undergo spontaneous differentiation and form a three-dimensional (3D) structure called embryoid body (EB), which contains cells from all three germ layers [17, 18]. The EB differentiation system is a widely used model to study the early differentiation of various lineage-specific progenitors, including cardiac muscle , blood , liver , and neuron , etc. hPSCs (both primed and naïve) and EBs are powerful models to simulate early developmental process in vitro, from pre-implantation epiblasts to lineage-committed progenitors.
hPSC differentiation is a complex process. Flow cytometry and immunostaining have been used to define cell types in hPSC differentiation cultures. However, these methods are limited by the number of fluorescent probes that can be used at the same time; the heterogeneity of the hPSC differentiation process cannot be fully resolved. Single-cell RNA-sequencing (scRNA-seq), first released in 2009 , has provided a promising alternative. During the past few years, the technology has been vastly improved by the development of numerous innovative approaches [24, 25], including C1 (SMARTer) , SMART-seq2 , CEL-seq , Drop-seq , InDrop , 10X Genomics , etc. To date, single-cell technology has been used to study cellular heterogeneity in a wide range of systems , including the hierarchy of tumor cells [32, 33], tissue and organs [34,35,36,37], developing embryos [38, 39] and in vitro differentiation systems [40,41,42].
We use the upgraded Fluidigm C1 system with optimized high-throughput integrated fluidics circuits (HT IFCs) to construct the early differentiation trajectories of various lineage-specific progenitors derived from hPSCs and to reveal the interaction between these precursor cells in EB differentiation system. We find key TFs and signaling pathways that direct the differentiation process. We show that liver may be involved in regulating the differentiation of other tissue cells through cell–cell interactions. We also use the C1 scRNA-seq platform to study the primed-to-naïve transition process and to reveal the differences in gene expression profiles between Primed and Naïve-like H9. Combined with the analysis of EB differentiation, genes related to hemogenic endothelium development and MAPK-ERK1/2 signaling pathway are enriched in Naïve-like H9 but not in Primed H9. Functionally, Naïve-like H9 show the differentiation bias to endothelial-hematopoietic lineages. Taken together, we construct a comprehensive single-cell level differentiation roadmap for hPSCs and offer new insights into early embryonic lineages that can guide the establishment and optimization of more sophisticated differentiation system.
scRNA-seq analysis of hPSCs and EBs
In order to systematically map hPSCs early differentiation pathways, Naïve-like H9, Primed H9, and EBs were prepared as single-cell samples for sequencing using Fluidigm C1 system with HT IFCs (Fig. 1a). This system can be used to analyze up to 800 cells at a time and detect an average of 5000 genes per cell. A major advantage of this technology is the balance of throughput and resolution. After sequencing and data processing, we got high-quality transcriptomic data from 4822 single cells, including 2636 EB samples (683 day 4 EBs and 1953 day 8 EBs), 1491 Naïve-like H9 samples, and 695 Primed H9 samples. The scRNA-seq data had high read depth, which can map to 5000 genes for most of the single-cell samples (Fig. 1b and Additional file 1: Figure S1a); and Naïve-like H9 datasets show weak batch effect of Fluidigm C1 system (Additional file 1: Figure S1b). The random differentiation of EBs causes the batch effect (Additional file 1: Figure S1c). We used Seurat to perform principal component analysis (PCA) and t-distributed stochastic neighbor embedding (t-SNE) analysis . Seurat divided our samples into four main clusters, including two EB clusters (EB-ectodetm and EB-mesendoderm), one Primed H9 cluster, and one Naïve-like H9 cluster (Fig. 1c). hPSCs (i.e. Naïve-like and Primed H9) have relative homogeneity. EB cells show significant heterogeneity, which indicated well spontaneous differentiation of hPSCs and provided a variety of samples for Monocle pseudotime analysis . To reveal the gene expression dynamics and key regulators of hPSC early differentiation, we used Seurat and Monocle to analyze these data.
Mapping cellular landscape for early embryonic lineages
Spontaneous differentiation of EBs exhibit heterogeneous patterns of differentiated cell types (Fig. 1c). We extracted single cells from EBs and Primed H9 for further analysis (Fig. 2a). According to the expression of differential genes, day 4 EBs were divided into three clusters, including progenitor cell-2, progenitor cell-10, and progenitor cell-11. Day 4 EBs have weak heterogeneity. Progenitor cell-2 does not highly express lineage-related genes; progenitor cell-10 may be related with neural cell differentiation; progenitor cell-11 may be related with mesendoderm differentiation (Fig. 2b and c). We defined 11 clusters as different progenitor cells in day 8 EBs. We identified six major types of progenitor cells with distinct gene expression patterns, including muscle cells (cluster 3, 4, 12, 13), stromal cells (cluster 8), endothelial cells (cluster 15), neural cells (cluster 6, 7, 9), epithelial cells (cluster 14), and liver cells (cluster 5) (Fig. 2a and b). Clusters 3, 4, 12, and 13 are associated with high expression of muscle progenitor cell markers such as HAND1, APLNR, and ACTC1 , and therefore these clusters are annotated as muscle cells (Fig. 2). Cluster 8 is annotated as stromal cells for the expression of LUM, KLF6, and COL5A1 . Though muscle cell and stromal cell clusters exhibit shared gene expression profiles, collagen genes (e.g. COL3A1, COL5A1, COL5A2, COL1A1, and COL1A2) are enriched in stromal cell cluster (Fig. 2b and c) . Cluster 15 is annotated as endothelial cells for the high expression of KDR, GNG11, and ECSCR (Fig. 2b and c) . Clusters 6, 7, and 9 are annotated as neural cells for the high expression of OTX2, PTN, and FZD3 (Fig. 2b and c), which are important for the development of neural system [48,49,50]. Cluster 14 is annotated as epithelial cells for the high expression of PDPN, TFAP2C, and DMD [36, 51]. Cluster 5 is annotated as liver cells for the high expression of AFP, TTR, and FGB [52,53,54]. We also found specific surface markers to separate these progenitor cells from EBs, such as CD34 and PROCR (CD201), which can enrich endothelial cells from EBs (Additional file 1: Figure S2a and S2b). As a further validation, we found that genes specifically expressed in each cell type were enriched for the expected Gene Ontology (GO) terms (Additional file 1: Figure S2c) . For example, genes that are specifically expressed in muscle cell cluster are significantly enriched for skeletal system development (p = 7.06E-07); specific genes of neural cell cluster are significantly enriched for positive regulation of neuroblast proliferation (p = 1.86E-04) and neuronal stem cell population maintenance (p = 2.17E-04); specific genes of liver cell cluster are significantly enriched for very-low-density lipoprotein particle (p = 4.29E-08) and lipoprotein metabolic process (p = 5.15E-08); specific genes of endothelial cell cluster are significantly enriched for angiogenesis (p = 1.90E-12), positive regulation of endothelial cell proliferation (p = 4.40E-05), and hemopoiesis (p = 0.0045). These analyses strongly indicate that our cell-type assignments are accurate.
Both cluster neural cell and muscle cell consist of several sub-clusters. According to the specific gene expression pattern, neural cell cluster was further divided into three subsets, including neural progenitor-PODXL-6, neural progenitor-OTX2–7, and neural progenitor-ERBB3–9 (Fig. 3a and b and Additional file 1: Figure S3a–d). Sub-clusters of neural cells are different types of cells. Neural progenitor-PODXL-6 may be related with cerebral cortex development, because of the enriched expression of PODXL , DRAXIN [57, 58], and TUBB2A . Neural progenitor-OTX2–7 is enriched for RAX, SIX3, and OTX2, which are highly expressed in retinal-pigmented epithelium (RPE) . Genes related with forebrain development are enriched in neural progenitor-OTX2–7 (Additional file 1: Figure S4a). RPE is derived from forebrain , so neural progenitor-OTX2–7 may be related with the RPE development. Neural progenitor-ERBB3–9 exhibits specific expression of known neural crest (NC) cell markers (ERBB3, SOX9, and EDNRA) [62,63,64]. So neural progenitor-ERBB3–9 may be related with the NC cell development (Additional file 1: Figure S4a).
Cluster muscle cell consists of four sub-clusters, including muscle-LDHA-3, muscle-FBN2–4, muscle-CRYAB-13, and muscle-GABRP-12. These sub-clusters have specific gene expression pattern and gene expression distribution (Fig. 3c and d and Additional file 1: Figure S3e–h). Muscle-LDHA-3 is enriched for LDHA, POSTN, and IGF2 ; muscle-FBN2–4 is enriched for FBN2, NCAM1, and SERPINE2 ; muscle-CRYAB-13 is enriched for CRYAB, COL1A1, and LGALS1 [67, 68]; muscle-GABRP-12 is enriched for GABRP, CXCL12, and TRIML2 . Combined with the differentiation trajectory, we think these muscle sub-clusters were divided for both different cell types and different differentiation stages (Additional file 1: Figure S4c). Skeletal muscle cell differentiation related genes are enriched in muscle-FBN2–4 and muscle-CRYAB-13; angiogenesis related genes are enriched in muscle-GABRP-12; glycolytic process and insulin receptor signaling pathway related genes are enriched in muscle-LDHA-3 (Additional file 1: Figure S4b). The sub-clusters analysis indicated the diversity of differentiation direction in neural and muscle cell clusters.
Construction of hPSC early differentiation trajectory
We used Monocle  to order single cells through EB differentiation and construct the whole lineage differentiation trajectory with a tree-like structure (Fig. 4a). We found two branches following EB differentiation, including an ectoderm branch and a mesendoderm (mesoderm and endoderm) branch. The ectoderm branch only consists of cells from neural cell cluster. The mesendoderm branch consists of cells from the muscle cell, endothelial cell, stromal cell, liver cell, and epithelial cell clusters. This differentiation trend is similar to the development in vivo that primed epiblasts develop into embryonic ectoderm and primitive streak (embryonic mesoderm and endoderm). It indicates that the differentiation trajectory of whole EBs can simulate the development in vivo. We used a specific heatmap to show the gene expression dynamics of these two branches (Fig. 4b and Additional file 1: Figure S5a). From pre-branch (Primed H9) to cell fate 1 (ectoderm), we found some gene clusters with specific expression pattern, including II (cluster 2), III (cluster 3), and V (cluster 5). We defined the genes cluster VI (cluster 6) and I (cluster 1), which are highly expressed in Primed H9 and cell fate 2 (mesendoderm), respectively. We performed GO enrichment analysis to reveal the different functions of these gene clusters (Additional file 1: Figure S5b). Nervous system development related GO terms are significantly enriched in cluster II, III, and V. In neural differentiation, cells get neural characteristics at the early stage of differentiation. Cluster I is related with kidney development, heart development, skeletal system development, angiogenesis, and lung cell differentiation.
The differentiation trend of EBs is similar to the development in vivo, because 3D EBs have complex cell adhesions and paracrine signaling system, which can establish various interactions among different cell types . Based on the expression of ligands or complementary receptors on every cell, we calculated the number of interactions among different cell types and showed potential cell-cell interactions in a network (Additional file 1: Figure S6) . These ligand-receptor pairings suggest extensive crosstalk among six types of progenitor cells (Fig. 4c). The one-to-many and many-to-one relationships exist between receptors and ligands. For example, liver cell receptor CLEC2B can bind ligand CLEC3A from muscle cells, stromal cells, endothelial cells, and neural cells; liver ligand SHBG can bind to receptor CLDN4 on all types of cells, which indicate the important roles of liver cells in the differentiation of other cell types. These ligand-receptor pairings may reveal the cell–cell interactions during the development in vivo.
The whole lineage differentiation trajectory cannot reveal the single-cell gene expression dynamics of each progenitor cell, so we constructed differentiation trajectory for each cell type with day 8 EBs (Fig. 5a). Transcription factors (TFs) play a key role in the regulation of development and differentiation. Based on the dynamics of their expression patterns, the TFs associated with each differentiation trajectory were divided into three clusters (I, II, III) (Fig. 5b). Cluster I TFs are highly expressed at the initial stage of differentiation. Primed H9, the common starting point of differentiation, highly express cluster I TFs (e.g. SOX2, PRDM14, and ZIC2) . Cluster II TFs are highly expressed at the terminal stage of differentiation, so these TFs indicate the characteristics of each progenitor cell, including cluster endothelial cell (e.g. GATA2 and TAL1) , cluster muscle cell (e.g. HAND1 and ZFHX3) , cluster stromal cell (e.g. KLF6 and MAF) , cluster liver cell (e.g. EOMES and SOX7) , cluster epithelial cell (e.g. SOX9 and FOXP1) , and cluster neural cell (e.g. PAX3 and TFAP2B) . Cluster I and II TFs indicate that the start points and end points of our differentiation trajectories are correct. Cluster III TFs are highly expressed at the middle stage of differentiation, including cluster endothelial cell (e.g. HOXA1 and TFAP2A), cluster muscle cell (e.g. CDX1 and MEF2C), cluster stromal cell (e.g. PAX3 and SALL1), cluster liver cell (e.g. MEIS2 and PRRX1), cluster epithelial cell (e.g. ARID5B and CASZ1), and cluster neural cell (e.g. PKNOX2 and TBX3). These TFs are the candidate genes to control the differentiation of each progenitor cell. We also performed Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis to reveal the major differential signaling pathways involved in the differentiation, including cluster endothelial cell (e.g. MAPK and Hippo), cluster muscle cell (e.g. Prolactin and Estrogen), cluster stromal cell (e.g. Wnt and Hippo), cluster liver cell (e.g. Prolactin and cGMP-PKG), cluster epithelial cell (e.g. Hippo and AMPK), and cluster neural cell (e.g. Hippo and cGMP-PKG) (Additional file 1: Figure S7). These differentiation trajectories show us the key TFs and signaling pathways that related to the differentiation process and may provide evidences for the optimization of the differentiation system in vitro.
Construction of naïve hPSC reset trajectory
We reset Primed H9 into Naïve-like H9 by RSeT, a commercial medium based on NHSM formula . Through bulk RNA-seq, we found the state of Naïve-like H9 was stable after 15–20 days domestication in RSeT media (Fig. 6a). We confirmed the naïve state via morphology, immunofluorescence of surface markers and pluripotent transcription factors, quantitative PCR (qPCR) of naïve and primed genes [79, 80], and flow cytometry of surface markers  (Additional file 1: Figure S8).
We performed pseudotime analysis to study cell state transition process (from day 0 to day 20) using scRNA-seq data (Fig. 6b). Day 10 RSeT samples are at the intermediate state followed Primed H9. Day 20 RSeT samples are divided into two branches (cell fate 1 and cell fate 2). The RSeT culture process causes the heterogeneity of the Naïve-like H9. The expression pattern shows that only cell fate 2 branch directs to the naïve state with high expression of pluripotent TFs (e.g. POU5F1, NANOG, and PRDM14) (Fig. 6c and d). Cell fate 1 branch directs to a differentiation state with gradual downregulation of pluripotent TFs (e.g. POU5F1 and NANOG) and upregulation of lineage specifier genes (e.g. HAND1, SNAI2, and PAX6). Though these lineage specifier genes are upregulated at the middle stage in both branches, they are downregulated at the terminal stage of cell fate 2. The differential expression dynamics of lineage specifier genes may help us to understand the reset process.
We extracted cell fate 2 branch as Naïve-like H9 to compare gene expression pattern between Naïve-like and Primed H9 at single-cell level (Fig. 7a). Though expression distributions of pluripotent TFs (e.g. POU5F1, NANOG, and SOX2) are similar (Fig. 7b and Additional file 1: Figure S9a), there are significant differences in gene expression signatures between Naïve-like and Primed H9 (Fig. 7c). Primed genes (e.g. ZIC2, ZIC5, DNMT3B, DUSP6, THY1, and CD24) are enriched in Primed H9 (Fig. 7d). NODAL, LEFTY2, GDF3, KLF4, DNMT3L, IL6ST, PRDM14, DPPA2, and TDGF1 are highly expressed in Naïve-like H9 as previously reported (Fig. 7e) [79, 80]. These results confirmed the “naïve” state of our Naïve-like H9. These gene expression characteristics also exist in Naïve-like H1 (Additional file 1: Figure S10a–c and S10f). We performed surface marker analysis to show the candidate surface genes, which can distinguish naïve hPSCs from primed hPSCs (Additional file 1: Figure S9b). The surface markers of Primed H9 include CUZD1, KCNQ2, CLDN10, PODXL, etc.; the surface markers of Naïve-like H9 include CNTNAP2, FZD5, etc.
We performed GO enrichment analysis and established GO term diagram of Naïve-like and Primed H9 (Additional file 1: Figure S9c). Primed H9 have high expression of major histocompatibility complex I related genes as previously reported . And genes related with nervous system (ectoderm) development are also enriched in Primed H9. In Naïve-like H9, genes related with gastrulation and endoderm development and MAPK signaling pathway are enriched. GO term diagram of Naïve-like H1 also shows MAPK signaling pathway enrichment (Additional file 1: Figure S10 g). Interestingly, MAPK signaling pathway is also enriched in the differentiation trajectory of endothelial cell cluster, which derives from mesendoderm (Additional file 1: Figure S7). We checked the expression distribution of germ layer genes in both H9 and H1. Genes related with mesendoderm development (T, FGF4, MIXL, LEFTY1, EOMES, etc. ) are highly expressed in Naïve-like hPSCs (Fig. 8a and Additional file 1: Figure S10e); genes related with nervous development (ALCAM , OLFM1 , SIGMAR1 , etc.) are highly expressed in Primed hPSCs (Fig. 8b and Additional file 1: Figure S10d). We suspected that Naïve-like hPSCs have the differentiation bias to tissue cells related with endothelial-hematopoietic lineages [85, 86]. We used hematopoietic differentiation system to compare the differentiation ability between Naïve-like and Primed H9. The percentage of CD34+CD45+ cells and CD34+CD43+ cells are higher in Naïve-like H9, which generates more colony-forming units (CFUs) than Primed H9 as well (Fig. 8c and d). It suggests that Naïve-like H9 has better potency for hematopoietic differentiation. We checked the protein level of MAPK (p38, JNK, and ERK1/2) through western blot, and only the ERK1/2 is highly detected in Naïve-like H9 (Fig. 8e), which is consistent with the single cell transcriptome analysis. The high expression of FOS (the substrates of MAPK-ERK1/2)  and ID1 (the downstream target of MAPK-ERK1/2) are also detected in Naïve-like H9 (Fig. 7f) , which do not affect the pluripotency of Naïve-like H9 (Fig. 7b and Additional file 1: Figure S8). FOS can induce the expression of hematopoietic genes . Furthermore, ID1 is a helix-loop-helix inhibitor and may promote the hematopoietic differentiation  like the helix-loop-helix inhibitor TAL1 does .
In this study, we performed the analysis of scRNA-seq data from a total of 4822 single cells generated from EBs and hPSCs (Fig. 9). Through constructing the early differentiation trajectories of various progenitors identified in EBs, we revealed the key TFs and signaling pathways that direct the differentiation of distinct cell types. Moreover, we constructed the cell–cell interaction network of these cell types and indicated the key roles of liver cells in the differentiation of other cell types. We further reprogramed Primed H9 into Naïve-like H9 to study the cellular-state transition process. We found that genes related with MAPK-ERK1/2 signaling pathway are enriched in endothelial-hematopoietic development and Naïve-like H9. Functionally, Naïve-like H9 show higher potency for differentiation into the hematopoietic lineages. These results provide valuable information for the optimization of differentiation protocols.
The scRNA-seq platform we used is Fluidigm C1 system with the HT IFCs . The old version of IFCs can only capture 96 cells at most [40, 92], so cell sorting or other enrichment strategies are usually performed before scRNA-seq for the recovery of rare cell types. However, HT IFCs can efficiently analyze thousands of single cells without prior enrichment from heterogeneous systems such as EB differentiation (Fig. 2a).
In contrast to monolayer differentiation, EB differentiation system provides 3D structure to establish complex cell adhesions and paracrine signaling, which promote the differentiation and morphogenesis similar to the native tissue development . The interactions between different cell types are important for the development and differentiation. Liver is the essential site for the early hematopoietic development in the embryo stage . Cardiomyocytes and endothelial cell were reported important for the differentiation of liver in EB differentiation . We used the random EB differentiation system to generate various tissue cells of three germ lineages for our hPSC early differentiation trajectory construction (Fig. 9). We found complex interactions among different cell types (Fig. 4c and Additional file 1: Figure S6). Interestingly, liver cells build specific interactions with other cell types using specific ligands and receptors in EBs. The functions of these interactions should be verified in further studies. It may indicate the important role of liver cells in the differentiation of other cell types.
We used the scRNA-seq to study the reset trajectory of Naïve-like H9. In the tree-like trajectory, we found two branches, one directs to success and the other directs to failure (Fig. 6b). After comparing gene expression dynamics, we revealed the cell state transition process, from Primed to Naïve-like H9. We found that some lineage specifier genes (PAX6, HAND1, et al.) are upregulated at the middle stage (Fig. 6d). In the success branch, those lineage specifier genes are downregulated before the terminal stage. However, in the failure branch, the upregulation is persistent, which lead to differentiation but not the naïve state. The balance of lineage specifier genes can keep the pluripotency of stem cell . We therefore suspected that in the cell state transition process Primed H9 is reprogramed to a pluripotent intermediate state with the balance of lineage specifiers (Fig. 9). When this balance is broken, the intermediate state cells lose their pluripotency and differentiate. Understanding the mechanisms that control the balances of these lineage specifier genes may help us to regulate the pluripotency of hPSCs and optimize differentiation protocols.
Differentiation bias of different hPSCs might be harnessed for better lineage differentiation protocols. Here, we found better potency of hematopoietic differentiation in Naïve-like H9. MAPK-ERK1/2 related genes are highly expressed in Naïve-like H9 but not in Primed H9 (Fig. 7f and Fig. 8e and Additional file 1: Figure S9c). We therefore suspect that MAPK-ERK1/2 contributes to the hematopoietic differentiation bias of Naïve-like H9 (Fig. 9). Though LIF is the key cytokine to keep the “naïve” state of hPSCs , it can also activate the MAPK-ERK1/2 signaling pathway , which is involved in the differentiation of hPSCs towards endothelial lineages (Additional file 1: Figure S7), and hematopoietic development . The commercial naïve medium RSeT contain the MAPK-ERK1/2 inhibitor (such as PD0325901 ). The inhibitor may lead to perturbation of MAPK-ERK1/2 pathway. When the inhibitor is removed and differentiation media are added, hemogenic fate is enhanced for Naïve-like hPSC culture.
In this study, we used scRNA-seq to map the early differentiation of hPSCs. We identified various lineage-specific progenitor cells and constructed the differentiation trajectories by pseudotime analysis. The gene expression dynamics offer new insights into molecular pathways of early embryonic lineages that can be harnessed for optimization of differentiation protocols.
Cell culture and differentiation
H9 and H1 human ES cells were maintained in mTeSR™1 media (STEMCELL Technologies) on tissue culture plates coated with Matrigel (BD Bioscience) routinely . H9 and H1 were reset into a naïve-like state by RSeT™ media (STEMCELL Technologies) following the instruction [81, 96]. We generated EBs by clone suspension. EBs were differentiated in DMEM/F12 (GIBCO) supplemented with 20% FBS (GIBCO), 50 U/mL penicillin/streptomycin (GIBCO), 2 mM L-Glutamine (GIBCO), 1 × non-essential amino acids, and 100 μM ß-mercaptoethanol (Sigma). In brief, H9 was digested using 0.5 mg/mL Dispase (Invitrogen) for 30 min. Then cell clumps suspended in differentiation media were seeded into an Ultra-Low attachment 6 well plate (Corning). After four days or eight days of culture, EBs were harvested and digested into single-cell suspension in 3 × 105 cells/mL using TrypLE (GIBCO). We did not use cell sorting or other enrichment strategies before single-cell capture. The hematopoietic differentiation of hPSCs was performed using STEMdiff Hematopoietic Kit (STEMCELL Technologies) following the instructions. At day 12, cells were analyzed with flow cytometry. CD34+ cells were enriched with EasySep™ CD34 positive selection kit (StemCell Technologies) for CFU assays.
Colony-forming unit (CFU) assays
CFU assays were performed with MethoCult™ H4034 Optimum methylcellulose-based media (StemCell Technologies) following manufacturer’s instructions. In brief, 3 mL MethoCult™ media with 1 × 104/mL CD34+ cells and penicillin-streptomycin were added into each 35 mm low adherent plastic dish. Colonies were counted and identified after 10–14 days of incubation.
Flow cytometry analysis of cell phenotype
Cells suspended in 100 μL of PBS were incubated with antibodies at 4 °C for 30 min. The samples were measured on BD Fortessa and analyzed by FlowJo software (Tree Star). Antibodies used in our study were listed: anti-Human CD34 (BioLegend, Pacific Blue, clone 581), anti-human CD34 (BioLegend, PE, clone 581), anti-human CD201 (BioLegend, APC, clone RCR-401), anti-Human CD43 (BioLegend, APC, clone 10G7), anti-Human CD45 (BioLegend, FITC, clone HI30), anti-Human CD90 (BD Pharmingen, APC, clone 5E10), and CD24 (BioLegend, PE, clone ML5).
Immunofluorescence staining and confocal image analysis
Cells were seeded into glass-bottom culture dishes (NEST, 35/15 mm) coated with Matrigel. Cultured cells were fixed in 4% paraformaldehyde at room temperature for 30 min. Then permeabilized treatment was performed at room temperature for 30 min with PBS + 0.2% TritonX-100. Cells were blocked with PBS + 1% BSA + 4% FBS + 0.4% TritonX-100 at room temperature for 1 h. Then cells were incubated with primary antibodies, diluted in PBS + 0.2% BSA + 0.1% TritonX-100, at 4 °C overnight. Cells were incubated with AlexaFluor secondary antibodies (Invitrogen) for 1 h at room temperature. Then cells were incubated with DAPI for 5 min at room temperature. After the second round of fixation, cells were ready for imaging. Olympus IX81-FV1000 was used to collect immunofluorescence images and FV10-ASW 2.1 Viewer was used to process images. The primary antibodies used in our study were listed in Additional file 2: Table S1.
Western blot analysis
Whole-cell protein were isolated from Primed H9 and Naïve-like H9. Protein samples were incubated with the following primary antibodies in 5%BSA: anti-ERK (Servicebio, Wuhan, China, GB13003–1), anti-JNK (Epitomics, 3496-s), anti-P38 (ABCAM, ab31828), and anti-β-actin (Servicebio, Wuhan, China, GB13001–1). Secondary antibodies were HRP-linked goat anti-mouse, goat anti-rabbit (Servicebio, Wuhan, China, GB23303). Blots were developed using ECL (Servicebio, Wuhan, China, G2014). The primary antibodies used in our study were listed in Additional file 2: Table S1.
Reverse transcription (RT) and qPCR analysis
Total RNA prepared with EasyPure RNA Kit (Transgen) was reverse transcribed into complementary DNA (cDNA) by TransScript All-in-One First-Strand cDNA Synthesis SuperMix for qPCR kit (Transgen). The diluted cDNA was used as temples in qPCR (ChamQ SYBR qPCR Master Mix-Q311 (Vazyme)). The qPCR platform we used was LightCycler 480 (Roche) and data were analyzed by the ∆∆Ct method. The primers used in our study were listed in Additional file 3: Table S2, including the reference gene (ACTB).
Single-cell capture and scRNA-seq library preparation
We used Fluidigm C1 system and C1 high-throughput integrated fluidics circuits (HT IFCs) to perform the single-cells capture and library construction as instruction described. A total of 4000–8000 cells were loaded onto a medium-sized (10–17 μm) HT IFCs. The efficiency of capture was measured under the microscope. The capture sites without cell or with more than one cell were marked and excluded from further analysis. C1 system captured and converted all polyadenylated messenger RNA (mRNA) into cDNA with the cell-specific barcodes. After reverse transcription and preamplification, cDNA was prepared as samples for next-generation sequencing using library tagmentation and 3’end enrichment. Samples harvested from HT IFCs were used to create libraries for Illumina sequencing with Illumina Nextera XT DNA Library kit.
Bulk RNA-seq library construction
We used mRNA Capture Beads (VAHTS mRNA-seq v2 Library Prep Kit for Illumina, Vazyme) to extract mRNA from total RNA. PrimeScript™ Double Strand cDNA Synthesis Kit (TaKaRa) was used to synthesize double-stranded cDNA from purified polyadenylated mRNA templates. We used TruePrep DNA Library Prep Kit V2 for Illumina (TaKaRa) to prepare cDNA libraries for Illumina sequencing.
Sequencing data analysis
The sequenced reads were mapped against the reference GRCh38 using STAR v2.5.2a . scRNA-seq expression data, quantified by counts via featureCounts v1.5.1 , were analyzed with Seurat v2.0.1 (PCA, Cluster, t-SNE and cluster) . In brief, the Seurat object was generated from digital gene expression matrices. The parameter of “Filtercells” is nGene (2000 to 8800) and transcripts (-Inf to 6e + 05). In the standard pre-processing workflow of Seurat, we selected 8706 variable genes for following PCA. Then we performed cell cluster and t-SNE. Fifteen principal components were used in cell cluster with the resolution parameter set at 1.5. Marker genes of each cell cluster were outputted for GO and KEGG analysis, which were used to define the cell types. Cell clusters were annotated with the information of cell types and germ layers. Digital gene expression matrices with annotations from Seurat were analyzed by Monocle v2.3.6 (pseudotime analysis) . TFs from AnimalTFDB  and surface genes  were used to filter the gene lists. The cell–cell interactions were constructed by igraph v1.12 as previously reported . The count of cell–cell interactions was based on the ligands-receptors pairings . We used DAVID  to perform GO and KEGG analysis. GO terms were visualized by REVIGO  and Cytoscape . Bulk RNA-seq data, quantified by FPKM via RSEM v0.4.6 , were analyzed with DEseq2 v1.14.1 .
Thomson JA, Itskovitz-Eldor J, Shapiro SS, Waknitz MA, Swiergiel JJ, Marshall VS, et al. Embryonic stem cell lines derived from human blastocysts. Science. 1998;282:1145–7.
Tabar V, Studer L. Pluripotent stem cells in regenerative medicine: challenges and recent progress. Nat Rev Genet. 2014;15:82–92. https://doi.org/10.1038/nrg3563
Vodyanik MA, Bork JA, Thomson JA, Slukvin II. Human embryonic stem cell-derived CD34+ cells: efficient production in the coculture with OP9 stromal cells and analysis of lymphohematopoietic potential. Blood. 2005;105:617–26. https://doi.org/10.1182/blood-2004-04-1649
Doulatov S, Vo LT, Chou SS, Kim PG, Arora N, Li H, et al. Induction of multipotential hematopoietic progenitors from human pluripotent stem cells via respecification of lineage-restricted precursors. Cell Stem Cell. 2013;13:459–470. https://doi.org/10.1016/j.stem.2013.09.002.
Kroon E, Martinson LA, Kadoya K, Bang AG, Kelly OG, Eliazer S, et al. Pancreatic endoderm derived from human embryonic stem cells generates glucose-responsive insulin-secreting cells in vivo. Nat Biotechnol. 2008;26:443–452. https://doi.org/10.1038/nbt1393.
Chambers SM, Fasano CA, Papapetrou EP, Tomishima M, Sadelain M, Studer L. Highly efficient neural conversion of human ES and iPS cells by dual inhibition of SMAD signaling. Nat Biotechnol. 2009;27:275–80. https://doi.org/10.1038/nbt.1529
Kriks S, Shim JW, Piao J, Ganat YM, Wakeman DR, Xie Z, et al. Dopamine neurons derived from human ES cells efficiently engraft in animal models of Parkinson’s disease. Nature. 2011;480:547–551. https://doi.org/10.1038/nature10648.
Nichols J, Smith A. Pluripotency in the embryo and in culture. Cold Spring Harb Perspect Biol. 2012;4:a008128. https://doi.org/10.1101/cshperspect.a008128
Nichols J, Smith A. Naive and primed pluripotent states. Cell Stem Cell. 2009;4:487–92. https://doi.org/10.1016/j.stem.2009.05.015
Gafni O, Weinberger L, Mansour AA, Manor YS, Chomsky E, Ben-Yosef D, et al. Derivation of novel human ground state naive pluripotent stem cells. Nature. 2013;504:282–286. https://doi.org/10.1038/nature12745.
Theunissen TW, Friedli M, He Y, Planet E, O’Neil RC, Markoulaki S, et al. Molecular Criteria for Defining the Naive Human Pluripotent State. Cell Stem Cell. 2016;19:502–515. https://doi.org/10.1016/j.stem.2016.06.011.
Theunissen TW, Powell BE, Wang H, Mitalipova M, Faddah DA, Reddy J, et al. Systematic identification of culture conditions for induction and maintenance of naive human pluripotency. Cell Stem Cell. 2014;15:471–487. https://doi.org/10.1016/j.stem.2014.07.002.
Ware CB, Nelson AM, Mecham B, Hesson J, Zhou W, Jonlin EC, et al. Derivation of naive human embryonic stem cells. Proc Natl Acad Sci U S A. 2014;111:4484–4489. https://doi.org/10.1073/pnas.1319738111.
Yang Y, Liu B, Xu J, Wang J, Wu J, Shi C, et al. Derivation of pluripotent stem cells with in vivo embryonic and extraembryonic potency. Cell. 2017;169:243–257.e225. https://doi.org/10.1016/j.cell.2017.02.005.
Zimmerlin L, Park TS, Huo JS, Verma K, Pather SR, Talbot CC Jr, et al. Tankyrase inhibition promotes a stable human naive pluripotent state with improved functionality. Development. 2016;143:4368–4380. https://doi.org/10.1242/dev.138982.
Liu X, Nefzger CM, Rossello FJ, Chen J, Knaupp AS, Firas J, et al. Comprehensive characterization of distinct states of human naive pluripotency generated by reprogramming. Nat Methods. 2017;14(11):1055–1106. https://doi.org/10.1038/nmeth.4436.
Itskovitz-Eldor J, Schuldiner M, Karsenti D, Eden A, Yanuka O, Amit M, et al. Differentiation of human embryonic stem cells into embryoid bodies compromising the three embryonic germ layers. Mol Med. 2000;6:88–95.
Boxman J, Sagy N, Achanta S, Vadigepalli R, Nachman I. Integrated live imaging and molecular profiling of embryoid bodies reveals a synchronized progression of early differentiation. Sci Rep. 2016;6:31623. https://doi.org/10.1038/srep31623
Magli A, Schnettler E, Swanson SA, Borges L, Hoffman K, Stewart R, et al. Pax3 and Tbx5 specify whether PDGFRalpha+ cells assume skeletal or cardiac muscle fate in differentiating embryonic stem cells. Stem Cells. 2014;32:2072–2083. https://doi.org/10.1002/stem.1713.
Fehling HJ, Lacaud G, Kubo A, Kennedy M, Robertson S, Keller G, et al. Tracking mesoderm induction and its specification to the hemangioblast during embryonic stem cell differentiation. Development. 2003;130:4217–27.
Ogawa S, Tagawa Y, Kamiyoshi A, Suzuki A, Nakayama J, Hashikura Y, et al. Crucial roles of mesodermal cell lineages in a murine embryonic stem cell-derived in vitro liver organogenesis system. Stem Cells. 2005;23:903–13. https://doi.org/10.1634/stemcells.2004-0295
Lee MS, Jun DH, Hwang CI, Park SS, Kang JJ, Park HS, et al. Selection of neural differentiation-specific genes by comparing profiles of random differentiation. Stem Cells. 2006;24:1946–1955. https://doi.org/10.1634/stemcells.2005-0325.
Tang F, Barbacioru C, Wang Y, Nordman E, Lee C, Xu N, et al. mRNA-Seq whole-transcriptome analysis of a single cell. Nat Methods. 2009;6:377–382. https://doi.org/10.1038/nmeth.1315.
Haque A, Engel J, Teichmann SA, Lonnberg T. A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications. Genome Med. 2017;9:75. https://doi.org/10.1186/s13073-017-0467-4
Kolodziejczyk AA, Kim JK, Svensson V, Marioni JC, Teichmann SA. The technology and biology of single-cell RNA sequencing. Mol Cell. 2015;58:610–20. https://doi.org/10.1016/j.molcel.2015.04.005
Pollen AA, Nowakowski TJ, Shuga J, Wang X, Leyrat AA, Lui JH, et al. Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex. Nat Biotechnol. 2014;32:1053–1058. https://doi.org/10.1038/nbt.2967.
Picelli S, Bjorklund AK, Faridani OR, Sagasser S, Winberg G, Sandberg R. Smart-seq2 for sensitive full-length transcriptome profiling in single cells. Nat Methods. 2013;10:1096–8. https://doi.org/10.1038/nmeth.2639
Hashimshony T, Wagner F, Sher N, Yanai I. CEL-Seq: single-cell RNA-Seq by multiplexed linear amplification. Cell Rep. 2012;2:666–73. https://doi.org/10.1016/j.celrep.2012.08.003
Macosko EZ, Basu A, Satija R, Nemesh J, Shekhar K, Goldman M, et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell. 2015;161:1202–1214. https://doi.org/10.1016/j.cell.2015.05.002.
Klein AM, Mazutis L, Akartuna I, Tallapragada N, Veres A, Li V, et al. Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells. Cell. 2015;161:1187–1201. https://doi.org/10.1016/j.cell.2015.04.044.
Zheng GX, Terry JM, Belgrader P, Ryvkin P, Bent ZW, Wilson R, et al. Massively parallel digital transcriptional profiling of single cells. Nat Commun. 2017;8:14049. https://doi.org/10.1038/ncomms14049.
Ramskold D, Luo S, Wang YC, Li R, Deng Q, Faridani OR, et al. Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells. Nat Biotechnol. 2012;30:777–782. https://doi.org/10.1038/nbt.2282.
Chung W, Eum HH, Lee HO, Lee KM, Lee HB, Kim KT, et al. Single-cell RNA-seq enables comprehensive tumour and immune cell profiling in primary breast cancer. Nat Commun. 2017;8:15081. https://doi.org/10.1038/ncomms15081.
Papalexi E, Satija R. Single-cell RNA sequencing to explore immune cell heterogeneity. Nat Rev Immunol. 2018;18:35–45. https://doi.org/10.1038/nri.2017.76
Bjorklund AK, Forkel M, Picelli S, Konya V, Theorell J, Friberg D, et al. The heterogeneity of human CD127(+) innate lymphoid cells revealed by single-cell RNA sequencing. Nat Immunol. 2016;17:451–460. https://doi.org/10.1038/ni.3368.
Treutlein B, Brownfield DG, Wu AR, Neff NF, Mantalas GL, Espinoza FH, et al. Reconstructing lineage hierarchies of the distal lung epithelium using single-cell RNA-seq. Nature. 2014;509:371–375. https://doi.org/10.1038/nature13173.
Gaublomme JT, Yosef N, Lee Y, Gertner RS, Yang LV, Wu C, et al. Single-cell genomics unveils critical regulators of Th17 cell pathogenicity. Cell. 2015;163:1400–1412. https://doi.org/10.1016/j.cell.2015.11.009.
Petropoulos S, Edsgard D, Reinius B, Deng Q, Panula SP, Codeluppi S, et al. Single-cell RNA-seq reveals lineage and X chromosome dynamics in human preimplantation embryos. Cell. 2016;165:1012–1026. https://doi.org/10.1016/j.cell.2016.03.023.
Blakeley P, Fogarty NM, del Valle I, Wamaitha SE, Hu TX, Elder K, et al. Defining the three cell lineages of the human blastocyst by single-cell RNA-seq. Development. 2015;142:3151–3165. https://doi.org/10.1242/dev.123547.
Chu LF, Leng N, Zhang J, Hou Z, Mamott D, Vereide DT, et al. Single-cell RNA-seq reveals novel regulators of human embryonic stem cell differentiation to definitive endoderm. Genome Biol. 2016;17:173. https://doi.org/10.1186/s13059-016-1033-x.
Semrau S, Goldmann JE, Soumillon M, Mikkelsen TS, Jaenisch R, van Oudenaarden A. Dynamics of lineage commitment revealed by single-cell transcriptomics of differentiating embryonic stem cells. Nat Commun. 2017;8:1096. https://doi.org/10.1038/s41467-017-01076-4
Han X, Yu H, Huang D, Xu Y, Saadatpour A, Li X, et al. A molecular roadmap for induced multi-lineage trans-differentiation of fibroblasts by chemical combinations. Cell Res. 2017;27:842. https://doi.org/10.1038/cr.2017.77.
Satija R, Farrell JA, Gennert D, Schier AF, Regev A. Spatial reconstruction of single-cell gene expression data. Nat Biotechnol. 2015;33:495–502. https://doi.org/10.1038/nbt.3192
Trapnell C, Cacchiarelli D, Grimsby J, Pokharel P, Li S, Morse M, et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat Biotechnol. 2014;32:381–386. https://doi.org/10.1038/nbt.2859.
Boutilier JK, Taylor RL, Ram R, McNamara E, Nguyen Q, Goullee H, et al. Variable cardiac alpha-actin (Actc1) expression in early adult skeletal muscle correlates with promoter methylation. Biochim Biophys Acta. 2017;1860:1025–1036. https://doi.org/10.1016/j.bbagrm.2017.08.004.
Despars G, Periasamy P, Tan J, Abbey J, O’Neill TJ, O’Neill HC. Gene signature of stromal cells which support dendritic cell development. Stem Cells Dev. 2008;17:917–27. https://doi.org/10.1089/scd.2007.0170
Kilari S, Remadevi I, Zhao B, Pan J, Miao R, Ramchandran R, et al. Endothelial cell-specific chemotaxis receptor (ECSCR) enhances vascular endothelial growth factor (VEGF) receptor-2/kinase insert domain receptor (KDR) activation and promotes proteolysis of internalized KDR. J Biol Chem. 2013;288:10265–10274. https://doi.org/10.1074/jbc.M112.413542.
Kimura C, Takeda N, Suzuki M, Oshimura M, Aizawa S, Matsuo I. Cis-acting elements conserved between mouse and pufferfish Otx2 genes govern the expression in mesencephalic neural crest cells. Development. 1997;124:3929–41.
Silos-Santiago I, Yeh HJ, Gurrieri MA, Guillerman RP, Li YS, Wolf J, et al. Localization of pleiotrophin and its mRNA in subpopulations of neurons and their corresponding axonal tracts suggests important roles in neural-glial interactions during development and in maturity. J Neurobiol. 1996;31:283–296.
Qu Y, Huang Y, Feng J, Alvarez-Bolado G, Grove EA, Yang Y, et al. Genetic evidence that Celsr3 and Celsr2, together with Fzd3, regulate forebrain wiring in a Vangl-independent manner. Proc Natl Acad Sci U S A. 2014;111:E2996–E3004. https://doi.org/10.1073/pnas.1402105111.
Cyr AR, Kulak MV, Park JM, Bogachek MV, Spanheimer PM, Woodfield GW, et al. TFAP2C governs the luminal epithelial phenotype in mammary development and carcinogenesis. Oncogene. 2015;34:436–444. https://doi.org/10.1038/onc.2013.569.
Mirkovitch J, Darnell JE Jr. Rapid in vivo footprinting technique identifies proteins bound to the TTR gene in the mouse liver. Genes Dev. 1991;5:83–93.
Lee CS, Friedman JR, Fulmer JT, Kaestner KH. The initiation of liver development is dependent on Foxa transcription factors. Nature. 2005;435:944–7. https://doi.org/10.1038/nature03649
Fish RJ, Vorjohann S, Bena F, Fort A, Neerman-Arbez M. Developmental expression and organisation of fibrinogen genes in the zebrafish. Thromb Haemost. 2012;107:158–66. https://doi.org/10.1160/TH11-04-0221
Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44–57. https://doi.org/10.1038/nprot.2008.211
Vitureira N, McNagny K, Soriano E, Burgaya F. Pattern of expression of the podocalyxin gene in the mouse brain during development. Gene Expr Patterns. 2005;5:349–54. https://doi.org/10.1016/j.modgep.2004.10.002
Islam SM, Shinmyo Y, Okafuji T, Su Y, Naser IB, Ahmed G, et al. Draxin, a repulsive guidance protein for spinal cord and forebrain commissures. Science. 2009;323:388–393. https://doi.org/10.1126/science.1165187.
Shinmyo Y, Asrafuzzaman Riyadh M, Ahmed G, Bin Naser I, Hossain M, Takebayashi H, et al. Draxin from neocortical neurons controls the guidance of thalamocortical projections into the neocortex. Nat Commun. 2015;6:10232. https://doi.org/10.1038/ncomms10232.
Cushion TD, Paciorkowski AR, Pilz DT, Mullins JG, Seltzer LE, Marion RW, et al. De novo mutations in the beta-tubulin gene TUBB2A cause simplified gyral patterning and infantile-onset epilepsy. Am J Hum Genet. 2014;94:634–641. https://doi.org/10.1016/j.ajhg.2014.03.009.
Li P, Sun X, Ma Z, Liu Y, Jin Y, Ge R, et al. Transcriptional reactivation of OTX2, RX1 and SIX3 during reprogramming contributes to the generation of RPE cells from human iPSCs. Int J Biol Sci. 2016;12:505–517. https://doi.org/10.7150/ijbs.14212.
Murisier F, Guichard S, Beermann F. Distinct distal regulatory elements control tyrosinase expression in melanocytes and the retinal pigment epithelium. Dev Biol. 2007;303:838–47. https://doi.org/10.1016/j.ydbio.2006.11.038
Abe M, Ruest LB, Clouthier DE. Fate of cranial neural crest cells during craniofacial development in endothelin-A receptor-deficient mice. Int J Dev Biol. 2007;51:97–105. https://doi.org/10.1387/ijdb.062237ma
Prasad MK, Reed X, Gorkin DU, Cronin JC, McAdow AR, Chain K, et al. SOX10 directly modulates ERBB3 transcription via an intronic neural crest enhancer. BMC Dev Biol. 2011;11:40. https://doi.org/10.1186/1471-213X-11-40.
Simoes-Costa M, Bronner ME. Establishing neural crest identity: a gene regulatory recipe. Development. 2015;142:242–57. https://doi.org/10.1242/dev.105445
Alzhanov DT, McInerney SF, Rotwein P. Long range interactions regulate Igf2 gene transcription during skeletal muscle differentiation. J Biol Chem. 2010;285:38969–77. https://doi.org/10.1074/jbc.M110.160986
Chern SR, Li SH, Lu CH, Chen EI. Spatiotemporal expression of the serine protease inhibitor, SERPINE2, in the mouse placenta and uterus during the estrous cycle, pregnancy, and lactation. Reprod Biol Endocrinol. 2010;8:127. https://doi.org/10.1186/1477-7827-8-127
Chan J, O’Donoghue K, Gavina M, Torrente Y, Kennea N, Mehmet H, et al. Galectin-1 induces skeletal muscle differentiation in human fetal mesenchymal stem cells and increases muscle regeneration. Stem Cells. 2006;24:1879–1891. https://doi.org/10.1634/stemcells.2005-0564.
Wojtowicz I, Jablonska J, Zmojdzian M, Taghli-Lamallem O, Renaud Y, Junion G, et al. Drosophila small heat shock protein CryAB ensures structural integrity of developing muscles, and proper muscle and heart performance. Development. 2015;142:994–1005. https://doi.org/10.1242/dev.115352.
Subramanian P, Karshovska E, Reinhard P, Megens RT, Zhou Z, Akhtar S, et a. Lysophosphatidic acid receptors LPA1 and LPA3 promote CXCL12-mediated smooth muscle progenitor cell recruitment in neointima formation. Circ Res. 2010;107:96–105. https://doi.org/10.1161/CIRCRESAHA.109.212647.
Murry CE, Keller G. Differentiation of embryonic stem cells to clinically relevant populations: lessons from embryonic development. Cell. 2008;132:661–80. https://doi.org/10.1016/j.cell.2008.02.008
Camp JG, Sekine K, Gerber T, Loeffler-Wirth H, Binder H, Gac M, et al. Multilineage communication regulates human liver bud development from pluripotency. Nature. 2017;546:533–538. https://doi.org/10.1038/nature22796.
Weinberger L, Ayyash M, Novershtern N, Hanna JH. Dynamic stem cell states: naive to primed pluripotency in rodents and humans. Nat Rev Mol Cell Biol. 2016;17:155–69. https://doi.org/10.1038/nrm.2015.28
Zambidis ET, Peault B, Park TS, Bunz F, Civin CI. Hematopoietic differentiation of human embryonic stem cells progresses through sequential hematoendothelial, primitive, and definitive stages resembling human yolk sac development. Blood. 2005;106:860–70. https://doi.org/10.1182/blood-2004-11-4522
Sun S, Zhang W, Chen X, Peng Y, Chen Q. A complex insertion/deletion polymorphism in the compositionally biased region of the ZFHX3 gene in patients with coronary heart disease in a Chinese population. Int J Clin Exp Med. 2015;8:7890–7.
Nakamura H, Edward DP, Sugar J, Yue BY. Expression of Sp1 and KLF6 in the developing human cornea. Mol Vis. 2007;13:1451–7.
Teo AK, Arnold SJ, Trotter MW, Brown S, Ang LT, Chng Z, et al. Pluripotency factors regulate definitive endoderm specification through eomesodermin. Genes Dev. 2011;25:238–250. https://doi.org/10.1101/gad.607311.
Bastide P, Darido C, Pannequin J, Kist R, Robine S, Marty-Double C, et al. Sox9 regulates cell proliferation and is required for Paneth cell differentiation in the intestinal epithelium. J Cell Biol. 2007;178:635–648. https://doi.org/10.1083/jcb.200704152.
Mansouri A, Pla P, Larue L, Gruss P. Pax3 acts cell autonomously in the neural tube and somites by controlling cell surface properties. Development. 2001;128:1995–2005.
Warrier S, Van der Jeught M, Duggal G, Tilleman L, Sutherland E, Taelman J, et al. Direct comparison of distinct naive pluripotent states in human embryonic stem cells. Nat Commun. 2017;8:15055. https://doi.org/10.1038/ncomms15055.
Wang J, Singh M, Sun C, Besser D, Prigione A, Ivics Z, et al. Isolation and cultivation of naive-like human pluripotent stem cells based on HERVH expression. Nat Protoc. 2016;11:327–346. https://doi.org/10.1038/nprot.2016.016.
Collier AJ, Panula SP, Schell JP, Chovanec P, Plaza Reyes A, Petropoulos S, et al. Comprehensive cell surface protein profiling identifies specific markers of human naive and primed pluripotent states. Cell Stem Cell. 2017;20(6):874-890.e7. https://doi.org/10.1016/j.stem.2017.02.014.
Buhusi M, Demyanenko GP, Jannie KM, Dalal J, Darnell EP, Weiner JA, et al. ALCAM regulates mediolateral retinotopic mapping in the superior colliculus. J Neurosci. 2009;29:15630–41. https://doi.org/10.1523/JNEUROSCI.2215-09.2009
Nakaya N, Sultana A, Lee HS, Tomarev SI. Olfactomedin 1 interacts with the Nogo A receptor complex to regulate axon growth. J Biol Chem. 2012;287:37171–84. https://doi.org/10.1074/jbc.M112.389916
Gregianin E, Pallafacchina G, Zanin S, Crippa V, Rusmini P, Poletti A, et al. Loss-of-function mutations in the SIGMAR1 gene cause distal hereditary motor neuropathy by impairing ER-mitochondria tethering and Ca2+ signalling. Hum Mol Genet. 2016;25:3741–3753. https://doi.org/10.1093/hmg/ddw220.
Ditadi A, Sturgeon CM, Keller G. A view of human haematopoietic development from the Petri dish. Nat Rev Mol Cell Biol. 2017;18:56–67. https://doi.org/10.1038/nrm.2016.127
Zovein AC, Hofmann JJ, Lynch M, French WJ, Turlo KA, Yang Y, et al. Fate tracing reveals the endothelial origin of hematopoietic stem cells. Cell Stem Cell. 2008;3:625–636. https://doi.org/10.1016/j.stem.2008.09.018.
Geest CR, Coffer PJ. MAPK signaling pathways in the regulation of hematopoiesis. J Leukoc Biol. 2009;86:237–50. https://doi.org/10.1189/jlb.0209097
Fiori JL, Billings PC, de la Pena LS, Kaplan FS, Shore EM. Dysregulation of the BMP-p38 MAPK signaling pathway in cells from patients with fibrodysplasia ossificans progressiva (FOP). J Bone Miner Res. 2006;21:902–9. https://doi.org/10.1359/jbmr.060215
Lee SY, Yoon J, Lee MH, Jung SK, Kim DJ, Bode AM, et al. The role of heterodimeric AP-1 protein comprised of JunD and c-Fos proteins in hematopoiesis. J Biol Chem. 2012;287:31342–31348. https://doi.org/10.1074/jbc.M112.387266.
Perry SS, Zhao Y, Nie L, Cochrane SW, Huang Z, Sun XH. Id1, but not Id3, directs long-term repopulating hematopoietic stem-cell maintenance. Blood. 2007;110:2351–60. https://doi.org/10.1182/blood-2007-01-069914
Begley CG, Green AR. The SCL gene: from case report to critical hematopoietic regulator. Blood. 1999;93:2760–70.
Guo G, Pinello L, Han X, Lai S, Shen L, Lin TW, et al. Serum-based culture conditions provoke gene expression variability in mouse embryonic stem cells as revealed by single-cell analysis. Cell Rep. 2016;14:956–965. https://doi.org/10.1016/j.celrep.2015.12.089.
Shu J, Wu C, Wu Y, Li Z, Shao S, Zhao W, et al. Induction of pluripotency in mouse somatic cells with lineage specifiers. Cell. 2013;153:963–975. https://doi.org/10.1016/j.cell.2013.05.001.
Niwa H, Ogawa K, Shimosato D, Adachi K. A parallel circuit of LIF signalling pathways maintains pluripotency of mouse ES cells. Nature. 2009;460:118–22. https://doi.org/10.1038/nature08113
Ludwig T, A Thomson J. Defined, feeder-independent medium for human embryonic stem cell culture. Curr Protoc Stem Cell Biol. 2007;Chapter 1:Unit 1C 2. https://doi.org/10.1002/9780470151808.sc01c02s2.
Vallot C, Patrat C, Collier AJ, Huret C, Casanova M, Liyakat Ali TM, et al. XACT noncoding RNA competes with XIST in the control of X chromosome activity during human early development. Cell Stem Cell. 2017;20:102–111. https://doi.org/10.1016/j.stem.2016.10.014.
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. https://doi.org/10.1093/bioinformatics/bts635.
Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–30. https://doi.org/10.1093/bioinformatics/btt656
Zhang HM, Liu T, Liu CJ, Song S, Zhang X, Liu W, et al. AnimalTFDB 2.0: a resource for expression, prediction and functional study of animal transcription factors. Nucleic Acids Res. 2015;43:D76–D81. https://doi.org/10.1093/nar/gku887.
da Cunha JP, Galante PA, de Souza JE, de Souza RF, Carvalho PM, Ohara DT, et al. Bioinformatics construction of the human cell surfaceome. Proc Natl Acad Sci U S A. 2009;106:16752–16757. https://doi.org/10.1073/pnas.0907939106.
Ramilowski JA, Goldberg T, Harshbarger J, Kloppmann E, Lizio M, Satagopam VP, et al. A draft network of ligand-receptor-mediated multicellular signalling in human. Nat Commun. 2015;6:7866. https://doi.org/10.1038/ncomms8866.
Supek F, Bosnjak M, Skunca N, Smuc T. REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS One. 2011;6:e21800. https://doi.org/10.1371/journal.pone.0021800
Kohl M, Wiese S, Warscheid B. Cytoscape: software for visualization and analysis of biological networks. Methods Mol Biol. 2011;696:291–303. https://doi.org/10.1007/978-1-60761-987-1_18
Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323. https://doi.org/10.1186/1471-2105-12-323
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. https://doi.org/10.1186/s13059-014-0550-8
Han X, Chen H, Huang D. Mapping human pluripotent stem cell differentiation pathways via high throughput single-cell RNA-sequencing. Gene Expression Omnibus. 2018, GSE107552. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE107552. Accessed 6 Mar 2018.
We thank Junfeng Ji at Zhejiang University School of Medicine for his assistance in this study. We thank Mengmeng Jiang for critical reading of this manuscript. We thank Huiyu Sun and Yang Xu for their assistance in RNA-seq data analysis. We thank G-BIO, Annoroad, VeritasGenetics, and Novogene for deep sequencing experiments, LongGene for supplying Gradient Thermal Cycler and Vazyme for supplying customized enzymes for the study.
This work was supported by grants from National Natural Science Foundation of China (31722027, 81770188, and 31701290), Fundamental Research Funds for the Central Universities (2016XZZX002–04), Zhejiang Provincial Natural Science Foundation of China (R17H080001), National Key Program on Stem Cell and Translational Research (2017YFA0103401) and 973 Program (2015CB964900).
Availability of data and materials
The RNA-seq data used in our study have been deposited in NCBI’s Gene Expression Omnibus and are accessible through GEO accession number GSE107552 .
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. Quality control of the dataset. Figure S2. Surface marker analysis and GO enrichment analysis of lineage progenitors. Figure S3. FeaturePlot of specific genes from neural and muscle sub-clusters. Figure S4. Differentiation trajectories and GO analysis of neural and muscle sub-clusters. Figure S5. GO analysis and expression dynamics of gene clusters I–VI. Figure S6. Network of potential cell–cell interactions in EBs. Figure S7. Signaling pathways involved in differentiation of various progenitor cells. Figure S8. The identification of Naïve-like H9. Figure S9. Surface marker analysis and GO analysis of Primed and Naïve-like H9. Figure S10. The identification and GO analysis of Primed and Naïve-like H1 (Additional file 11: Table S10, Additional file 12: Table S11, Additional file 13: Table S12, Additional file 14: Table S13, Additional file 15: Table S14). (DOCX 19207 kb)
Table S1. Immunofluorescence antibody. (XLSX 33 kb)
Table S2. qPCR primer. (XLSX 47 kb)
Table S6. List of ligand-receptor pairs and cell–cell pairs used in Fig. 4c for heatmap. (XLSX 12 kb)
Table S13. List of signaling pathways used in Additional file 1: Figure S7a. (XLSX 20 kb)
About this article
Cite this article
Han, X., Chen, H., Huang, D. et al. Mapping human pluripotent stem cell differentiation pathways using high throughput single-cell RNA-sequencing. Genome Biol 19, 47 (2018). https://doi.org/10.1186/s13059-018-1426-0
- Single-cell RNA-sequencing
- Primed human pluripotent stem cell
- Embryoid body
- Naïve human pluripotent stem cell