TraSig: inferring cell-cell interactions from pseudotime ordering of scRNA-Seq data

Li, Dongshunyi; Velazquez, Jeremy J.; Ding, Jun; Hislop, Joshua; Ebrahimkhani, Mo R.; Bar-Joseph, Ziv

doi:10.1186/s13059-022-02629-7

Method
Open access
Published: 07 March 2022

Genome Biology volume 23, Article number: 73 (2022)

Abstract

A major advantage of single cell RNA-sequencing (scRNA-Seq) data is the ability to reconstruct continuous ordering and trajectories for cells. Here we present TraSig, a computational method for improving the inference of cell-cell interactions in scRNA-Seq studies that utilizes the dynamic information to identify significant ligand-receptor pairs with similar trajectories, which in turn are used to score interacting cell clusters. We applied TraSig to several scRNA-Seq datasets and obtained unique predictions that improve upon those identified by prior methods. Functional experiments validate the ability of TraSig to identify novel signaling interactions that impact vascular development in liver organoids.

Software

https://github.com/doraadong/TraSig.

Background

The ability to profile cells at the single cell level enabled the identification of new cell types and additional markers for known cell types as well as the reconstruction of cell type-specific regulatory networks [1, 2]. Several methods have been developed to group or cluster cells in scRNA-Seq data [3] and to reconstruct trajectories and pseudotime for time series scRNA-Seq data [4]. Such methods have mainly focused on the expression similarity between cells in the same cluster or at consecutive time points and on the differences in transcriptional regulation between cell types and over time [5].

More recently, a number of methods have been developed to infer another type of interaction from scRNA-Seq data: signaling between cell clusters or cell types [6]. These methods attempt to identify ligands in one of the clusters or cell types and corresponding receptors in another cluster and then infer interactions based on the average expression of these ligand-receptor pairs. For example, CellPhoneDB [7] scores ligand-receptor pairs using their mean expression values in two clusters and assigns significance levels using permutation tests. SingleCellSignalR [8] designs a score based on the product of the ligand's and receptor's mean expression values in two clusters and selects ligand-receptor pairs scoring above a predefined threshold.

While successful, most current methods for inferring cell-cell interactions from scRNA-Seq data only use the average expression levels of ligands and receptors in the two clusters or cell types they test [6]. While this may be fine for steady state populations (for example, different cell types in adult tissues), for studies that focus on development or response modeling, such averages do not take full advantage of the available data in scRNA-Seq studies. Indeed, even cells on the same branch are often ordered in such studies using various pseudotime ordering methods [9]. In such cases, cells on the same branch (or cluster) cannot be assumed to be homogeneous with respect to the expression of key genes. Using average analysis for such clusters may lead to inaccurate predictions about the relationship between ligands and receptors in two different (though parallel in terms of timing) branches. Specifically, Fig. 1 presents four cases of pseudotime orderings for a ligand and its corresponding receptor in two different branches. While the average expression of the ligand and of the receptor in the two branches is the same in all four cases, the first two cases are unlikely to strongly support an interaction between these two cell types, while the third and fourth, where both are either increasing or decreasing in their respective ordering, are much more likely to hint at real interactions between the groups. In other words, if two groups of cells are interacting, then we expect to see the genes encoding signaling molecules in these groups co-express at a similar pace along the pseudotime.

To enable the use of pseudotime ordering for predicting cell type interactions between dynamically changing cell populations, we developed TraSig. TraSig can use several of the most popular pseudotime ordering and trajectory inference methods to extract expression patterns for ligands and receptors in different edges of the trajectory using a sliding window approach. It then uses these profiles to score temporal interactions between ligands and their known receptors in different edges corresponding to the same time. Permutation testing is used to assign significance levels to specific pairwise interactions and scores are combined to identify significant cluster-cluster interactions.

We applied TraSig to a number of scRNA-Seq datasets and compared its performance to a number of popular methods for inferring signaling interactions from scRNA-Seq data. As we show, the ability to utilize the temporal information in the analysis improves the accuracy of predicted relevant pairs and leads to distinct predictions that are not identified by other methods that rely on average expression. We experimentally validated a number of interaction predictions from TraSig for liver organoid differentiation data.

Results

We developed a computational method, TraSig, for inferring cell-cell interactions from pseudotime ordered data. Figure 2 presents an overview of the method. We start by using a trajectory inference method to obtain grouping and pseudotime ordering for cells in the dataset. Here we use the Continuous-State Hidden Markov Model (CSHMM) [10] for this, though as discussed below, TraSig can be applied to results from other pseudotime ordering methods. We then reconstruct expression profiles for genes along each of the edges using sliding windows summaries. Next we compute dot product scores for pairs of genes in edges (clusters) sampled at the same time or those representing the same pseudotime. Finally, we use permutation analysis to assign significance levels to the scores we computed. See the “Methods” section for details on each of the steps of TraSig.

Reconstructing dynamic liver development model using CSHMM

We first applied TraSig to a liver organoid differentiation scRNA-Seq dataset. This dataset is composed of 11,083 cells sampled at three time points: day 5, day 11 (see Additional file 1: Figure S4 for details), and day 17 [11]. The data was preprocessed using a standard Seurat V3 [12] pipeline and cell types were assigned as previously discussed [11]. These were used to initialize trajectory inference using CSHMM [10]. Following filtering to remove genes not expressed in any of the cells, 26,955 genes were used to learn the CSHMM model. Figure 3a presents the resulting model learned for this data. As can be seen, the method identifies 12 clusters (edges) for these data. These agree very well with the clustering assignments from the Seurat single cell analysis. Specifically, CSHMM assigns separate edges for hepatocyte- (edges 3, 5, 9, and 10), endothelial- (edges 7 and 11), stellate- (edges 2 and 8), and ductal/cholangiocyte-like (edges 4 and 6) cells. In addition, the model also presents informative pseudotime ordering of cells, as we discuss below based on the reconstructed expression profiles for key marker genes. See http://www.cs.cmu.edu/~trasig/ for an interactive Web user interface to visualize the trajectory inference results.

Inferring cell type interactions for liver development

We next applied TraSig to the model reconstructed by CSHMM in order to gain insight into developmental signaling of co-differentiating liver cells from multiple germ layers. Such data is severely lacking for humans and so the use of the trajectory learned for liver organoid differentiation can provide valuable information on interactions regulating liver development. We thus tested all pairs of edges for which the assigned cells were from the same time point (Additional file 1: Supplementary Notes). Figure 3d presents the results for scoring interactions between edges representing the same time (Methods). For the day 11 clusters (edges 1, 2, 3, 4, 5, and 7), we find strong interactions between stellate-like 1 cells (edge 2) and endothelial-like cells (edge 7) and between ductal/cholangiocyte-like cells (edge 4) and endothelial-like cells (edge 7). For the day 17 clusters (edges 6, 8, 9, 10, and 11), we find that the strongest interactions are between the ductal/cholangiocyte-like cells (edge 6) and stellate-like cells (edge 8). We also find high scoring interactions between stellate-like cells (edge 8) and endothelial-like cells (edge 11) and between ductal/cholangiocyte-like cells (edge 6) and endothelial-like cells (edge 11) for the day 17 clusters. The detection of significant interactions between the endothelial, stellate, and cholangiocyte cell types is further supported by their proximity in the liver. The stellate cells wrap around the endothelial cells and are bordered by the cholangiocyte-comprised bile ducts [14].

TraSig identifies ligand-receptor interactions important to vascular development

We evaluated the significant ligand-receptor pairs that were ranked highly by TraSig for the high scoring cluster pairs. We found that many agree with known functions and signaling pathways activated during liver development. Figure 3e presents a few examples of identified ligand-receptor pairs. We next studied the top scoring edges predicted to interact with endothelial-like cells. Endothelial cells play a major role in vascular development in the liver [15]. To study the interactions of such cells, we looked for cluster pairs for which the receiver (receptor) cluster is the day 17 endothelial-like cell cluster (edge 11). GO term analysis of the identified ligands and receptors for these cluster pairs identifies several relevant functional terms related to vascular development including “blood vessel development” (minimum p-value among cluster pairs 5.72128e-65), “regulation of endothelial cell proliferation” (p-value 3.34715e-27), and “vascular process in circulatory system” (p-value 8.38655e-12).

Many of the ligand-receptor pairs identified for interactions involving the endothelial-like cells are known to play a role in endothelial cell specification, migration, and angiogenesis further supporting the results of TraSig. Of note, we identified pairs including VEGFA/VEGFB/VEGFC with FLT1/KDR, which is required for proper liver zonation, sinusoid endothelial cell specification, and endothelial lipoprotein uptake [16, 17]; DLL4 with NOTCH1/NOTCH4, which is essential for endothelial tip and stalk cell crosstalk and liver sinusoidal endothelial cell capillarization [18, 19]; CXCL12 with CXCR4, which has been shown to promote endothelial cell migration and lumen formation independent of VEGF [20]; MDK with PTPRB, which is of great interest for its known impact on cancer angiogenesis [21, 22]; and CYR61 with ITGAV, which represents one of the many integrin interactions identified by TraSig which activate PI3K/AKT downstream signaling and is known to regulate tip cell activity and angiogenesis (Fig. 4a–d) [23].

Experimental validation for predicted TraSig pairs

Given the success in identifying known interactions, we next experimentally validated additional TraSig predictions. We first assessed if there was a correlation between the signal level of CXCL12 or VEGF and vascularity via immunofluorescent staining of liver organoid cultures. As shown in Fig. 5a–c, we found that loci with high relative expression of CXCL12 and VEGF co-localized with regions of increased vessel area percentage and vessel junction density when compared to loci with relatively low expression of CXCL12 and VEGF, as measured by AngioTool analysis of the immunofluorescent staining (see also Additional file 1: Figures S5a and S5b).

This motivated further investigation into the significance of predicted signaling interactions in the liver organoid cultures as they pertain to vascular development. We therefore performed prolonged (5 days from D9-14) inhibition of several predicted signaling proteins: VEGF, NOTCH, CXCR4, MDK, and PI3K (downstream of MDK and multiple integrin interactions). These experiments validated several of the predictions. Specifically, significant decreases in percent vessel area, junction density, and average vessel length were detected in the VEGF, MDK, and PI3K inhibition conditions, while NOTCH inhibition revealed an opposite effect (Fig. 5d, e). In contrast, the local correlation of increased vascular network formation with high CXCL12 expression did not carry over to a negative global effect via CXCR4 inhibition, indicating opportunity for further investigation, perhaps involving alternative inhibitors or assessment of the alternative CXCL12 receptor CXCR7, which also plays important roles in angiogenesis and liver regeneration [24, 25].

Comparing TraSig with prior methods

We compared interactions predicted by TraSig to two popular methods for inferring cell type interactions from scRNA-Seq data: CellPhoneDB [7] and SingleCellSignalR [8]. Both methods use the overall expression of genes in clusters and, unlike TraSig, do not use any ordering information. For both methods, we tested the same cluster pairs as we did for TraSig and used the same ligand-receptor database (Additional file 1: Supplementary Notes). To make the comparisons more consistent, we combined the paracrine and autocrine predicted interactions for SingleCellSignalR since this is what other methods do. Figure 6a presents scores for all cluster pairs for TraSig, SingleCellSignalR, and CellPhoneDB. As can be seen, while some pairs score high for all methods, others are only identified by one or two of the methods. Specifically, SingleCellSignalR seems to assign similar scores for most pairs whereas both TraSig and CellPhoneDB assign more variable scores. Figure 6c presents the Venn diagrams for the overlap between ligand-receptor pairs identified by the three methods for four example cell cluster pairs. In all cases, the receiver (receptor) cluster is the day 17 endothelial edge (edge 11). While SingleCellSignalR and TraSig overlap in roughly 50% of the identified ligand-receptor pairs, the overlap with CellPhoneDB is much lower.

To evaluate the predicted pairs from these methods, we performed validation experiments, as mentioned above, and also compared enrichment p-values for relevant GO terms using ligands and receptors for several high scoring cluster pairs from each of the methods (see Additional file 1: Supplementary Notes on how we perform GO analysis [26] and how we select relevant GO terms). Among the significant ligand-receptors we successfully validated based on TraSig predictions, many were completely missed by CellPhoneDB even though they are included in the database it is using. These include DLL4-NOTCH1/4, JAG1-NOTCH1, VEGFB-FLT1, and VEGFC-KDR. As for SingleCellSignalR, for the DLL4-NOTCH1/4 predicted interaction, SingleCellSignalR only identifies these as interactions within a single cell type and therefore does not identify the paracrine signaling between cell types. In contrast, TraSig identified these interactions as significant between day 17 endothelial-like cells (edge 11) and ductal/cholangiocyte-like cells (edge 6) and hepatocyte-like cells (edges 9 and 10). GO analysis further supports the advantages of TraSig. Figure 6b shows that TraSig leads to more significant relevant categories when compared to the two other methods. For example, TraSig obtains a minimum p-value among cluster pairs of 7.81570e-60 for “blood vessel morphogenesis” whereas the minimum p-values for this category are higher for the other two methods (3.22968e-57 and 6.02315e-52 for SingleCellSignalR and CellPhoneDB, respectively). For “endothelial cell migration,” TraSig has a minimum p-value of 6.28812e-25, again lower than the minimum p-values for SingleCellSignalR (7.70322e-20) and CellPhoneDB (2.06128e-20). While all three methods result in significant relevant GO terms in some cluster pairs, as indicated by the overall low minimum p-values, TraSig finds more cluster pairs significant in these relevant GO terms, implying that endothelial-like cells (edge 11) are receiving signals from multiple different cell types. We obtained similar results when using another ligand-receptor database for all methods [27]. See Additional file 1: Figure S15 for details.

TraSig identifies interactions in neocortical development

To further evaluate TraSig’s performance, we applied TraSig to a mouse neocortical development scRNA-Seq dataset [28]. After preprocessing (Additional file 1: Supplementary Notes), we obtained 18,545 cells sampled at two time points: E14.5 and P0. We used the top 5000 dispersed genes to reconstruct CSHMM trajectories. The CSHMM model was initialized using the cell labels from [28]. Next the model was refined to improve both trajectory learning and cell assignment. The final trajectory learned for this data is presented in Additional file 1: Figure S8. The model is composed of 44 clusters (edges) of which 23 contain cells from the first time point and 21 from the second. Next we applied TraSig to infer ligand-receptor pairs and interacting cluster pairs based on the sampling time.

Additional file 1: Figure S7a presents scores for all cluster pairs. As can be seen, the method identified strongly interacting cluster pairs for both time points. The highest scoring interactions identified involve either endothelial cells (edge 18 from E14.5 and edge 39 from P0), radial glial cells (edge 1 from E14.5), interneurons (edge 24 from P0), or astrocytes (edge 26 from P0). We performed GO analysis using the significant ligands and receptors identified for radial glial cells in E14.5 or interneurons in P0. Additional file 1: Figure S7b shows the −log10 p-value of enriched GO terms for interactions involving either RG2 [14-E] cluster for the radial glial cells in E14.5 (edge 1) or Int2 [14-P] cluster for the interneurons in P0 (edge 24). Radial glial cells were identified as progenitor cells for neocortical development [29] and determined to function as “scaffolds” for neuronal migration [30]. GO analysis shows that the signaling proteins identified by TraSig for interactions involving this cluster are indeed related to such functions and include “cell migration” (p-value 1.69780e-60), “cell motility” (p-value 1.01291e-56), and “regulation of cell migration” (p-value 9.23644e-42). Terms related to neuron development are also highly enriched in the set of ligand and receptor proteins identified for the interneuron cell cluster and include “neurogenesis” (p-value 1.39908e-64) and “neuron projection development” (p-value 5.39174e-64).

Applying TraSig to trajectories obtained by Slingshot

To test the ability of TraSig to generalize to pseudotime inferred by additional methods, we used it to post-process trajectories inferred by Slingshot [9]. Slingshot is a trajectory inference method that first infers a global lineage structure using a cluster-based minimum spanning tree (MST) and then infers the cell-level pseudotimes for each lineage. We applied Slingshot and TraSig to an oligodendrocyte differentiation dataset composed of 3685 cells [4, 31]. Additional file 1: Figure S9a presents the trajectory learned by Slingshot for this data. Additional file 1: Figure S9b presents the interactions predicted by TraSig for the inferred trajectory. Cells assigned to edges 2 and 3 are more mature cells while those assigned to edges 0 and 1 are precursor cells (Additional file 1: Figure S9a). Our results suggest that the more mature oligodendrocytes are signaling to the precursors during development. As before, we performed GO analysis on the set of ligands and receptors predicted for strongly interacting clusters. We found several relevant GO terms including “neuron projection development” (p-value 2.50804e-24) and “neuron development” (p-value 7.129894e-23) (Additional file 1: Figure S9c). Ligands in top ranking ligand-receptor TraSig pairs include PDGFA, BMP4, and PTN, all of which are known to be involved in regulating oligodendrocyte development [32–34].

Discussion

Initial methods for the analysis of scRNA-Seq data mainly focused on within cluster or trajectory interactions. Recently, a number of methods have been developed to use these data to infer interactions between different cell types or clusters [6]. These methods focus on the average expression of ligands and their corresponding receptors in a pair of cell types to score and identify interacting cell type pairs.

While the exact way in which scores are computed differs between methods developed to predict such interactions, to date, most methods looked at the average or sum of the expression values for ligands and receptors in the two clusters or cell types. Such analysis works well when studying processes that are in a steady state (for example, adult tissues) but may be less appropriate for dynamic processes. For real interactions, when time or pseudotime information is available, we expect to see not just average expression levels match but also trajectory matches in their expression profiles. Since many methods have been developed to infer pseudotime from scRNA-Seq data, such information is readily available for many studies.

To fully utilize information in scRNA-Seq data, we developed TraSig, a new computational method for inferring signaling interactions. TraSig first orders cells along a trajectory and then extracts expression profiles for genes in different clusters using a sliding window approach. Matches between profiles for ligands and their corresponding receptors in different clusters are then scored and their significance is assessed using permutation tests. Finally, scores for individual pairs are combined to obtain a cluster interaction score. Since we use pseudotime ordering as input, we assume that the cells in the datasets we analyze are dynamically changing and that the input pseudotime ordering provides a good representation of the real time changes. We have experimentally tested that this is indeed the case for the liver organoid data we analyzed in this paper (Additional file 1: Figure S11, Additional file 2). We leave it up to users to decide if they would like to use the method for all cells profiled or for a subset of the cells (for example, those expected to change dynamically during the process being studied). Alternatively, we also provide an implementation of TraSig that, following pseudotime ordering, aligns the expression of cells in two edges (clusters) based on the expression of ligands and receptors. The aligned profiles are then used to score and identify interacting ligand-receptor and cluster pairs. See Additional file 1: Supplementary Notes for details. See also Additional file 1: Figures S12–S14 for the comparisons between aligned and unaligned options.

We applied TraSig to several different scRNA-Seq datasets and have also compared its predictions to predictions by prior methods developed for this task. As we have shown, for liver organoid development, TraSig was able to identify several known and novel interactions related to the regulation of vascular network formation. These interactions involve endothelial, stellate, and cholangiocyte cell types that have been known to reside in close proximity [14] and several ligand-receptor pairs known to be involved in vascular development. While many interactions were predicted by all methods we tested, there are also several interactions uniquely predicted by TraSig. We validated a number of these interactions including DLL4-NOTCH1/4, which are missed by CellPhoneDB and only identified by SingleCellSignalR as interactions within a single cell type. TraSig also uniquely identified WNT2/3/4/7a/7b interactions with the FZD family and LRP6, supported by the known role of WNT in angiogenesis [35]. It also uniquely found BMP10-ACVRL1/ACVR2A and SHH, interacting with multiple different receptors, both of which have also been shown to be related to angiogenesis [36, 37].

Our experiments showed that the VEGF inhibitor Axitinib completely ablated the vascular network formation as shown previously [11, 38] and appeared to completely remove CD34 expressing cells. PI3K inhibition showed similar disruption of network formation; however, in contrast to Axitinib treatment, rounded CD34 expressing cells remained present and evenly spaced yet completely disconnected (Additional file 1: Figure S5b). MDK inhibition appeared to decrease branching and connectivity of CD34 expressing cells significantly; however, these cells still maintained a spread morphology. MDK is a pleiotropic growth factor that can induce cell proliferation, migration, and angiogenesis [39–41]. It has been suggested that MDK from mesothelial cells can participate in liver organogenesis [42]. While its role was suggested in cancer-related angiogenesis [22, 43], less is known about its function in liver development. Our combined computational and experimental analysis suggests such a role for MDK in vascular development in human livers.

Interestingly, inhibition of NOTCH resulted in increased endothelial cell numbers and vascular formation. Vascularization can enable better engraftment in vivo. Hence, modulation of NOTCH signaling might be a possible target to improve liver organoid implantation in vivo that warrants further investigation. The mechanisms of these findings can be further investigated via cell type-specific genetic circuits to determine dose, timing, and cell types involved. Combined, our data confirm that significant signaling pathways in the liver organoids could be predicted using TraSig and functionally validated.

The INHBE-ENG interaction measured in the liver organoids (Additional file 1: Figure S11b) was also found by TraSig. INHBE is uniquely highly expressed in primary liver as well as the liver organoids and has been far less studied than its INHBA and INHBB counterparts [44]. Thus far, INHBE has been proposed as a hepatokine responsible for controlling energy homeostasis of white and brown adipose cells [45] and is potentially associated with insulin resistance [46], but to our knowledge it has not been studied in the developing human liver. This poses a potentially interesting avenue of further study that could help reveal the function of INHBE in the liver, specifically as a regulator of angiogenesis during liver development.

Among the inhibitors we use, small molecules may have potential unintended off-target effects with limited spatial control. WZ811 and axitinib are relatively specific for inhibition of CXCR4 and VEGFR signaling respectively, while molecules like LY294002 can have broad effects due to the effects of PI3K signaling beyond its role downstream of integrin interactions. Likewise, DAPT is a gamma secretase inhibitor that will prevent all NOTCH receptors from relaying downstream signals. Therefore, we view this as more of a proof of principle to test if TraSig is able to successfully determine natural key players important for angiogenesis in organoids.

We note that for this liver organoid data, the trajectory inferred by CSHMM put both edge 7 (mainly day 11 endothelial-like cells) and edge 8 (mainly day 17 stellate-like cells) downstream of edge 2, which mainly consists of day 11 stellate-like cells. This suggests that edge 2 likely contains common progenitor cells, which can further differentiate into the endothelial lineage and pericyte (stellate) cells in liver organoids. In fact, co-development of pericytes in endothelial differentiation cultures has been observed recently [47], which may further suggest the presence of common mesodermal progenitors [48].

We have also tested TraSig on neocortical development and oligodendrocyte differentiation datasets. As we have shown, TraSig was able to correctly identify known and novel interacting cell type pairs for these datasets as well. In addition to CSHMM, we also tested and validated TraSig using Slingshot [9] and Monocle 3 [49] (Additional file 1: Figures S9-S10). These results demonstrate the generalizability of TraSig which can be applied to output data from any pseudotime ordering method. As we have shown, the ability to identify significant interactions is independent of the ordering method itself enabling the use of TraSig in post-processing of any pseudotime ordered scRNA-Seq data.

Similar to many other inference methods, TraSig uses scRNA-Seq data to infer cell-cell interactions. While RNA levels do not fully correspond to protein activity, we use these levels as a proxy for the dynamic activation of ligands and receptors. Cases in which either of them is only post-transcriptionally or post-translationally activated would thus be missed by TraSig. We expect that we can further improve TraSig when single cell proteomics data become more abundant.

Methods

To identify interacting cell type pairs, we developed TraSig (Trajectory based Signaling gene inference), which infers key genes involved in cell-cell interactions. We primarily focus on genes encoding ligands and receptors at this stage but our method can accommodate other proteins likely to interact. For any two groups of cells that are expected to overlap in time, TraSig takes the pseudotime ordering for each group and the expression of genes along the trajectory as input and then outputs an interaction score and p-value for each possible ligand-receptor pair.

Learning trajectories for time series scRNA-Seq data

There have been several methods developed to infer trajectories from time series scRNA-Seq data [4]. Several of these methods first reduce the dimension of the data and then infer trajectory structures by using minimum spanning trees in the reduced dimension space [4]. While such methods work well for obtaining global ordering and for grouping cells, they may not be as accurate for the exact ordering of cells in the same edge (cluster), especially for clusters with a small number of cells. Since the ordering is only based on the low dimension representation, genes that are only active in a small number of cells may have little impact on the representation of the cell in the lower dimension [10]. Since such ordering is critical for the ability to infer the activation or repression of individual genes along the pseudotime, we instead use another method for trajectory inference which works in the original gene space. This method, termed CSHMM, uses probabilistic graphical models to learn trajectories and to assign cells to specific points along the trajectories. CSHMM (Continuous-state Hidden Markov Model) [10] learns a generative model on the expression data using transition states and emission probabilities. CSHMM assumes a tree structure for the trajectory and assigns cells to specific locations on its edges. This enables both the inference of the gene expression trajectories for each edge and the determination of edges that overlap in time, which are potential interacting groups. In CSHMM, the expression of a gene j in cell i assigned to state $s_{p,t}$ is modeled as

$$x^{i}_{j} \sim \mathcal{N}\left(\mu_{s_{p,t}}, \sigma^{2}_{j}\right) $$

where $s_{p,t}$ is determined by both the edge p and the specific location t on the edge the cell is assigned to, and

$$\mu_{s_{p,t}} = g_{aj}\exp\left(-K_{p,j}t\right) + g_{bj}\left(1 - \exp\left(-K_{p,j}t\right)\right).$$

$g_{aj}$ and $g_{bj}$ are the mean expressions for gene j at branching nodes a and b (the beginning and the end of edge p, respectively) and $K_{p,j}$ is the rate of change for gene j on edge p. $\sigma^{2}_{j}$ is the variance of gene j. CSHMM is learned by using an initial assignment based on clustering single cells and then iteratively refining the model and assignment using an EM algorithm [10].
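
As a small illustration of the mean model above, the following Python sketch evaluates the expected expression of one gene along one edge. It is not taken from the CSHMM code base: the function name `cshmm_mean`, the parameter values, and the assumption that the location t runs from 0 (node a) to 1 (node b) are ours and are chosen only to show how the exponential term moves the mean from the value at node a toward the value at node b.

```python
import numpy as np

def cshmm_mean(g_a, g_b, k, t):
    """Mean expression of one gene at location t on an edge from node a to node b.

    g_a, g_b: mean expression at the start and end branching nodes.
    k: non-negative rate of change for this gene on this edge.
    t: location(s) on the edge (assumed here to lie in [0, 1]).
    """
    decay = np.exp(-k * np.asarray(t, dtype=float))
    return g_a * decay + g_b * (1.0 - decay)

# Hypothetical parameters for a gene that is induced along the edge.
t = np.linspace(0.0, 1.0, 5)
mu = cshmm_mean(g_a=2.0, g_b=6.0, k=3.0, t=t)

# Observed expression is modeled as Gaussian noise around the mean;
# sigma is a hypothetical per-gene standard deviation.
rng = np.random.default_rng(0)
x = rng.normal(loc=mu, scale=0.5)
```

At t = 0 the mean equals the value at node a, and a larger rate moves the mean toward the value at node b earlier along the edge.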

Selecting paired clusters

While most current methods consider all possible cluster pairs when searching for interactions, time series data allow us to constrain the search space and reduce false positives. Specifically, cells can only interact if both are active at the same time. For example, predicted interactions between clusters representing cells from day 1 and day 30 in a developmental study are unlikely to reflect real signaling interactions. To apply this constraint, TraSig can use either the time at which cells were profiled or the tree structure provided by CSHMM, matching edges based on their predicted pseudotime. Interactions are only predicted for pairs of edges (clusters) representing overlapping time.
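The pair selection step can be sketched as an interval overlap check. This is a minimal illustration under stated assumptions, not TraSig's code: the helper names `overlaps` and `candidate_pairs`, the edge labels, and the representation of each edge as a (start, end) time interval are hypothetical, and pairs of an edge with itself are omitted here purely for simplicity.

```python
from itertools import permutations

def overlaps(a, b):
    """True if intervals a=(start, end) and b=(start, end) share more than a single point."""
    return max(a[0], b[0]) < min(a[1], b[1])

def candidate_pairs(edge_times):
    """Ordered (sender, receiver) edge pairs whose time intervals overlap."""
    return [
        (sender, receiver)
        for sender, receiver in permutations(edge_times, 2)
        if overlaps(edge_times[sender], edge_times[receiver])
    ]

# Hypothetical edges with (start, end) times in days.
edge_times = {"e0": (0, 5), "e1": (5, 14), "e2": (5, 14), "e3": (14, 30)}
pairs = candidate_pairs(edge_times)
```

With these intervals only `e1` and `e2` overlap, so only the two ordered pairs between them are kept for scoring.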

Ordering cells and inferring expression profiles

Given two groups of cells (cells assigned to two edges in the model) selected as discussed above, we first obtain a smooth expression profile for each gene along each of the edges. For this, we first divide each edge into 101 equal-size bins. We then use a sliding window approach that summarizes expression levels for genes along overlapping windows of equal size. We tested window sizes of L ∈ {5, 10, 20, 30} bins and found that a window size of 20 works best (Additional file 1: Supplementary Notes). Windows overlap by L−1 bins, so the first L−1 bins of a window are the last L−1 bins of its predecessor. Since most cells are usually assigned to locations near the branching nodes (the start and end of the edges, Fig. 3a), we use L/2 as the length of the first sliding window and then increase it to L when we reach the first L bins (Fig. 2). We next generate an expression profile for each gene using its mean expression within each window. Using overlapping intervals allows us to overcome issues related to dropout and noise while still obtaining an accurate profile of the expression of the gene along the edge.
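The windowing described above can be sketched as follows. This is one possible reading of the procedure rather than TraSig's implementation: the function name `window_profile`, the way the first window grows one bin at a time from L/2 to L, and the use of 0 for windows that contain no cells are all assumptions made for this example.

```python
import numpy as np

N_BINS = 101  # number of equal-size bins per edge, as stated in the text

def window_profile(pseudotime, expression, window=20, n_bins=N_BINS):
    """Mean expression of one gene over overlapping windows of bins along an edge.

    pseudotime: cell locations in [0, 1]; expression: one value per cell.
    """
    pseudotime = np.asarray(pseudotime, dtype=float)
    expression = np.asarray(expression, dtype=float)
    # Map each cell to a bin index in 0..n_bins-1.
    bins = np.minimum((pseudotime * n_bins).astype(int), n_bins - 1)
    sums = np.bincount(bins, weights=expression, minlength=n_bins)
    counts = np.bincount(bins, minlength=n_bins)
    profile = []
    # The window ends at bin `end` (exclusive). It starts with window // 2 bins,
    # grows to `window` bins, then slides forward one bin at a time.
    for end in range(window // 2, n_bins + 1):
        start = max(0, end - window)
        n_cells = counts[start:end].sum()
        total = sums[start:end].sum()
        profile.append(total / n_cells if n_cells > 0 else 0.0)
    return np.array(profile)

# Toy input: 500 evenly spaced cells with constant expression.
t = np.linspace(0.0, 1.0, 500)
profile = window_profile(t, np.ones_like(t))
```

Under these assumptions a window size of 20 yields 92 overlapping intervals per edge, and every cell contributes to up to 20 consecutive intervals, which is what smooths out dropout.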

Computing interaction scores for ligands and receptors

We used genes determined to be ligands or receptors from Ramilowski et al. [50]. This database consists of 708 ligands and 691 receptors with 2557 known ligand-receptor interactions. To calculate an interaction score between a ligand in group A (sender) and its corresponding receptor in group B (receiver), we use the expression profile for each edge calculated as discussed above. Denote the expression values of the ligand in group A as x=(x₁,x₂,...,x_M) and those of the receptor in group B as y=(y₁,y₂,...,y_M), where M is the total number of overlapping intervals. We compute the score as the dot product $\mathbf{x}^{T}\mathbf{y} = \sum_{i=1}^{M} x_{i}y_{i}$. The advantage of using the dot product for such analysis is that it takes into account both the magnitude of expression and the similarity of its change over time when ranking the top pairs.
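A small worked example shows why the dot product rewards profiles that change together. The sketch below is illustrative only; the function name `interaction_score` and the toy profiles are assumptions, not part of TraSig.

```python
import numpy as np

def interaction_score(ligand_profile, receptor_profile):
    """Dot product of a ligand profile (sender) and a receptor profile (receiver)."""
    x = np.asarray(ligand_profile, dtype=float)
    y = np.asarray(receptor_profile, dtype=float)
    if x.shape != y.shape:
        raise ValueError("profiles must cover the same M intervals")
    return float(x @ y)

# Two toy profiles with identical magnitudes but opposite trends.
rising = np.array([0.0, 1.0, 2.0, 3.0])
falling = rising[::-1]

same_trend = interaction_score(rising, rising)       # 0 + 1 + 4 + 9 = 14
opposite_trend = interaction_score(rising, falling)  # 0 + 2 + 2 + 0 = 4
```

Both pairs have the same overall expression levels, but the pair whose profiles rise together scores 14 while the pair with opposite trends scores 4.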

To compute a p-value for the score, we use randomization analysis. Specifically, we permute the assignment of cells to edges and pseudotime in the model and re-compute the score, as discussed above, for the same pair of genes along the two clusters. Such permutation allows the method to identify interactions that are both cluster (or cell type) specific and time dependent, since genes that are active in most of the clusters will likely also be ranked high when assignments are permuted between the clusters. We perform 100,000 permutations, leading to a minimum p-value of 0.00001. We use the Benjamini-Hochberg procedure to control the false discovery rate (FDR) at 0.05 for multiple testing correction. For each pair of clusters, we also provide a summary score over all ligand-receptor pairs by counting how many ligand-receptor pairs are significant for this cluster pair.
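The randomization test and the multiple testing correction can be sketched as below. This is a simplified illustration under several assumptions, not TraSig's implementation: a plain per-bin mean stands in for the sliding-window profile, far fewer permutations are used, the p-value is floored at 1/n_perm to mirror the stated minimum, and all function and variable names are hypothetical.

```python
import numpy as np

def binned_profile(values, bins, n_bins):
    """Mean of values per bin (0 for empty bins); a stand-in for the windowed profile."""
    sums = np.bincount(bins, weights=values, minlength=n_bins)
    counts = np.bincount(bins, minlength=n_bins)
    return np.divide(sums, counts, out=np.zeros(n_bins), where=counts > 0)

def score(ligand, receptor, edge, bins, sender, receiver, n_bins):
    """Dot product of the sender's ligand profile and the receiver's receptor profile."""
    x = binned_profile(ligand[edge == sender], bins[edge == sender], n_bins)
    y = binned_profile(receptor[edge == receiver], bins[edge == receiver], n_bins)
    return float(x @ y)

def permutation_pvalue(ligand, receptor, edge, bins, sender, receiver,
                       n_bins=10, n_perm=1000, seed=0):
    """Fraction of permutations scoring at least as high as the observed score."""
    rng = np.random.default_rng(seed)
    observed = score(ligand, receptor, edge, bins, sender, receiver, n_bins)
    hits = 0
    for _ in range(n_perm):
        # Shuffle the (edge, pseudotime bin) assignment jointly across cells.
        idx = rng.permutation(len(edge))
        null = score(ligand, receptor, edge[idx], bins[idx], sender, receiver, n_bins)
        hits += null >= observed
    return max(hits, 1) / n_perm  # assumed floor of 1 / n_perm

def benjamini_hochberg(pvalues, alpha=0.05):
    """Boolean mask of hypotheses rejected by the Benjamini-Hochberg step-up rule."""
    p = np.asarray(pvalues, dtype=float)
    m = p.size
    order = np.argsort(p)
    passed = p[order] <= alpha * np.arange(1, m + 1) / m
    reject = np.zeros(m, dtype=bool)
    if passed.any():
        k = np.max(np.nonzero(passed)[0])  # largest rank meeting its threshold
        reject[order[:k + 1]] = True
    return reject

# Toy data: the ligand is expressed only on edge A, the receptor only on edge B.
edge = np.repeat(np.array(["A", "B", "C"]), 100)
bins = np.tile(np.arange(10), 30)
ligand = np.where(edge == "A", 5.0, 0.0)
receptor = np.where(edge == "B", 5.0, 0.0)

p = permutation_pvalue(ligand, receptor, edge, bins, "A", "B", n_perm=200)
reject = benjamini_hochberg([p, 0.2, 0.6])
n_significant = int(reject.sum())  # summary score for this cluster pair
```

Because the toy ligand and receptor are specific to their edges, shuffling the assignments dilutes both profiles and no permutation matches the observed score, so the p-value sits at the floor.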

Alignment between paired clusters

The interaction score calculated as described above assumes that the cell clusters (edges) fully overlap in terms of their real time trajectory. While this assumption holds for many studies, including for the data we analyze in this paper (Additional file 1: Figure S11), there could be cases where the pseudotime represents different real time for different clusters or edges. To enable the use of TraSig in such cases, we also implemented another way of calculating the interaction score. This option starts by obtaining the optimally aligned expression profiles for each pair of clusters (edges). By aligning clusters, we match the real time dynamics, rather than the pseudotime dynamics, of the two clusters. Next, we compute the dot product using the aligned profiles. The alignment method we used is adapted from methods developed for bulk data [51, 52] and is based on B-spline interpolation using SciPy [53] and dynamic time warping (DTW). See Additional file 1: Supplementary Notes for details.
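To illustrate the idea of scoring aligned profiles, the sketch below uses a textbook dynamic time warping recursion with squared differences. It is not the adapted alignment method used by TraSig: the B-spline interpolation step is omitted, and the function names `dtw_path` and `aligned_score` as well as the toy profiles are assumptions made for this example.

```python
import numpy as np

def dtw_path(x, y):
    """Classic DTW: return the minimum-cost warping path as (i, j) index pairs."""
    n, m = len(x), len(y)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = (x[i - 1] - y[j - 1]) ** 2
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    # Backtrack from the end of both profiles to their start.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        steps = [(i - 1, j - 1), (i - 1, j), (i, j - 1)]
        i, j = min(steps, key=lambda s: cost[s])
        path.append((i - 1, j - 1))
    return path[::-1]

def aligned_score(x, y):
    """Dot product of two profiles after warping them onto a common path."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    xi, yi = zip(*dtw_path(x, y))
    return float(x[list(xi)] @ y[list(yi)])

# The receptor profile peaks one interval earlier than the ligand profile.
ligand = np.array([0.0, 0.0, 1.0, 2.0, 1.0, 0.0])
receptor = np.array([0.0, 1.0, 2.0, 1.0, 0.0, 0.0])

unaligned = float(ligand @ receptor)       # 4.0
aligned = aligned_score(ligand, receptor)  # 6.0
```

After warping, the two peaks are matched and the score increases from 4 to 6. Note that the warped profiles can be longer than the original M intervals, so aligned and unaligned scores are not directly comparable in this simple sketch.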

Using trajectories inferred by other methods

While we mainly discuss the use of TraSig with CSHMM, as we show in the “Results” section, it can be used with the output of any other trajectory inference tool. For this, TraSig uses dynverse [4], which provides an R package that transforms the output of several popular trajectory inference and pseudotime ordering methods to a common output. Specifically, TraSig uses the “milestone_progression” output from dynverse, which represents the location of a cell on an edge. This is a value in [0,1], which we use to determine the pseudotime assignment for each cell on an edge. All other steps are the same as when using CSHMM’s trajectory output. TraSig can also directly use pseudotime and edge (cluster) assignment inputs from users if they prefer not to use the dynverse package.

For the trajectory inference results presented in Additional file 1: Figures S9-S10, we used the Slingshot [9] and Monocle 3 [49] software together with dynverse [4] to obtain the estimated trajectories and transform the outputs.

Assessment of cell-cell interaction to probe vascular formation in liver organoids

For evaluation of whole culture vascular network formation, liver organoids were cultured on 8-mm glass coverslips in a 48-well plate [11]. Starting on day 9 of culture, the indicated inhibitors (50 ng/mL VEGFR inhibitor, Axitinib (Sigma, Cat PZ0193-5MG); 15 μM CXCR4 inhibitor, WZ811 (Cayman, Cat 13639); 10 μM NOTCH inhibitor, DAPT (Stem Cell Technologies, Cat 082); 10 μM PI3K inhibitor, LY294002 (Stem Cell Technologies, Cat 72152); 1 μM MDK inhibitor, iMDK (Millipore, Cat 5.08052.0001)) or vehicle control (DMSO, Sigma, Cat D2650-100mL) were added to the culture medium daily for 5 days. After fixation with 4% PFA for 20 min at room temperature on day 14, the cultures were washed 3 × in PBS and stained as explained previously [11] with CD34 antibody (Abcam, Cat ab81289), and the whole coverslip was imaged using an EVOS M7000. Raw images were exported to ImageJ and thresholded to generate binary images of the CD34+ vasculature networks. Four 1200 pixel (2–3 mm) diameter circular areas were selected per coverslip for assessment in AngioTool (https://ccrod.cancer.gov/confluence/display/ROB2) [54]. For evaluation of CXCL12 and VEGF localized vascular network formation, liver organoid cultures were fixed on day 14 and stained for CD34 along with either CXCL12 or VEGF. Loci, which we define here as 300 pixel diameter areas with high or low relative CXCL12 or VEGF expression as determined by relative fluorescence, were identified in ImageJ, and the vascular network was analyzed using AngioTool.

Availability of data and materials

TraSig is implemented in Python and is available at GitHub (https://github.com/doraadong/TraSig) [55] and Zenodo (https://doi.org/10.5281/zenodo.5949000) [56]. TraSig is licensed under the MIT license.

Single cell data for the liver organoid are available from the Gene Expression Omnibus (GEO) under accession number GSE159491 [11]. Single cell data for neocortical development [28] are available from GEO under accession number GSE123335. Single cell data for oligodendrocyte differentiation and for hepatoblast differentiation [4, 31, 57] were downloaded from https://doi.org/10.5281/zenodo.1443566.

References

Lin C, Ding J, Bar-Joseph Z. Inferring TF activation order in time series scRNA-Seq studies. PLoS Comput Biol. 2020; 16(2):1007644.
Hurley K, Ding J, Villacorta-Martin C, Herriges MJ, Jacob A, Vedaie M, Alysandratos KD, Sun YL, Lin C, Werder RB, et al. Reconstructed single-cell fate trajectories define lineage plasticity windows during differentiation of human PSC-derived distal lung progenitors. Cell Stem Cell. 2020; 26(4):593–608.
Abdelaal T, Michielsen L, Cats D, Hoogduin D, Mei H, Reinders MJ, Mahfouz A. A comparison of automatic cell identification methods for single-cell RNA sequencing data. Genome Biol. 2019; 20(1):1–19.
Saelens W, Cannoodt R, Todorov H, Saeys Y. A comparison of single-cell trajectory inference methods. Nat Biotechnol. 2019; 37(5):547–54.
Pratapa A, Jalihal AP, Law JN, Bharadwaj A, Murali T. Benchmarking algorithms for gene regulatory network inference from single-cell transcriptomic data. Nat Methods. 2020; 17(2):147–54.
Armingol E, Officer A, Harismendy O, Lewis NE. Deciphering cell–cell interactions and communication from gene expression. Nat Rev Genet. 2021; 22(2):71–88.
Efremova M, Vento-Tormo M, Teichmann SA, Vento-Tormo R. Cellphonedb: inferring cell–cell communication from combined expression of multi-subunit ligand–receptor complexes. Nat Protoc. 2020; 15(4):1484–506.
Cabello-Aguilar S, Alame M, Kon-Sun-Tack F, Fau C, Lacroix M, Colinge J. Singlecellsignalr: inference of intercellular networks from single-cell transcriptomics. Nucleic Acids Res. 2020; 48(10):55.
Street K, Risso D, Fletcher RB, Das D, Ngai J, Yosef N, Purdom E, Dudoit S. Slingshot: cell lineage and pseudotime inference for single-cell transcriptomics. BMC Genomics. 2018; 19(1):477.
Lin C, Bar-Joseph Z. Continuous-state hmms for modeling time-series single-cell RNA-Seq data. Bioinformatics. 2019; 35(22):4707–15.
Velazquez JJ, LeGraw R, Moghadam F, Tan Y, Kilbourne J, Maggiore JC, Hislop J, Liu S, Cats D, de Sousa Lopes SMC, et al. Gene regulatory network analysis and engineering directs development and vascularization of multilineage human liver organoids. Cell Syst. 2021; 12(1):41–55.
Stuart T, Butler A, Hoffman P, Hafemeister C, Papalexi E, Mauck III WM, Hao Y, Stoeckius M, Smibert P, Satija R. Comprehensive integration of single-cell data. Cell. 2019; 177(7):1888–902.
Becht E, McInnes L, Healy J, Dutertre C-A, Kwok IW, Ng LG, Ginhoux F, Newell EW. Dimensionality reduction for visualizing single-cell data using UMAP. Nat Biotechnol. 2019; 37(1):38–44.
Si-Tayeb K, Lemaigre FP, Duncan SA. Organogenesis and development of the liver. Dev Cell. 2010; 18(2):175–89.
Gouysse G, Couvelard A, Frachon S, Bouvier R, Nejjari M, Dauge M-C, Feldmann G, Hénin D, Scoazec J-Y. Relationship between vascular development and vascular differentiation during liver organogenesis in humans. J Hepatol. 2002; 37(6):730–40.
Walter TJ, Cast AE, Huppert KA, Huppert SS. Epithelial VEGF signaling is required in the mouse liver for proper sinusoid endothelial cell identity and hepatocyte zonation in vivo. Am J Physiol Gastrointest Liver Physiol. 2014; 306(10):849–62.
Carpenter B, Lin Y, Stoll S, Raffai RL, McCuskey R, Wang R. VEGF is crucial for the hepatic vascular development required for lipoprotein uptake. Development. 2005; 132(14):3293–303.
Blanco R, Gerhardt H. VEGF and Notch in tip and stalk cell selection. Cold Spring Harb Perspect Med. 2013; 3(1):006569.
Chen L, Gu T, Li B, Li F, Ma Z, Zhang Q, Cai X, Lu L. Delta-like ligand 4/DLL4 regulates the capillarization of liver sinusoidal endothelial cell and liver fibrogenesis. Biochim Biophys Acta, Mol Cell Res. 2019; 1866(10):1663–75.
Kanda S, Mochizuki Y, Kanetake H. Stromal cell-derived factor-1alpha induces tube-like structure formation of endothelial cells through phosphoinositide 3-kinase. J Biol Chem. 2003; 278(1):257–62. https://doi.org/10.1074/jbc.m204771200.
Maeda N, Ichihara-Tanaka K, Kimura T, Kadomatsu K, Muramatsu T, Noda M. A receptor-like protein-tyrosine phosphatase PTPzeta/RPTPbeta binds a heparin-binding growth factor midkine. Involvement of arginine 78 of midkine in the high affinity binding to PTPzeta. J Biol Chem. 1999; 274(18):12474–79. https://doi.org/10.1074/jbc.274.18.12474.
Filippou PS, Karagiannis GS, Constantinidou A. Midkine (MDK) growth factor: a key player in cancer progression and a promising therapeutic target. Oncogene. 2020; 39(10):2040–54. https://doi.org/10.1038/s41388-019-1124-8.
Park M-H, Kim AK, Manandhar S, Oh S-Y, Jang G-H, Kang L, Lee D-W, Hyeon DY, Lee S-H, Lee HE, Huh T-L, Suh SH, Hwang D, Byun K, Park H-C, Lee YM. CCN1 interlinks integrin and hippo pathway to autoregulate tip cell activity. eLife. 2019; 8. https://doi.org/10.7554/elife.46012.
Zhang M, Qiu L, Zhang Y, Xu D, Zheng JC, Jiang L. CXCL12 enhances angiogenesis through CXCR7 activation in human umbilical vein endothelial cells. Sci Rep. 2017; 7(1):8289. https://doi.org/10.1038/s41598-017-08840-y.
Ding B-S, Cao Z, Lis R, Nolan DJ, Guo P, Simons M, Penfold ME, Shido K, Rabbany SY, Rafii S. Divergent angiocrine signals from vascular niche balance liver regeneration and fibrosis. Nature. 2014; 505(7481):97–102. https://doi.org/10.1038/nature12681.
Raudvere U, Kolberg L, Kuzmin I, Arak T, Adler P, Peterson H, Vilo J. g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res. 2019; 47(W1):191–98.
Hou R, Denisenko E, Ong HT, Ramilowski JA, Forrest AR. Predicting cell-to-cell communication networks using NATMI. Nat Commun. 2020; 11(1):1–11.
Loo L, Simon JM, Xing L, McCoy ES, Niehaus JK, Guo J, Anton E, Zylka MJ. Single-cell transcriptomic analysis of mouse neocortical development. Nat Commun. 2019; 10(1):1–11.
Barry DS, Pakan JM, McDermott KW. Radial glial cells: key organisers in CNS development. Int J Biochem Cell Biol. 2014; 46:76–79.
Sild M, Ruthazer ES. Radial glia: progenitor, pathway, and partner. Neuroscientist. 2011; 17(3):288–302. https://doi.org/10.1177/1073858410385870. PMID: 21558559.
Marques S, Zeisel A, Codeluppi S, van Bruggen D, Falcão AM, Xiao L, Li H, Häring M, Hochgerner H, Romanov RA, et al. Oligodendrocyte heterogeneity in the mouse juvenile and adult central nervous system. Science. 2016; 352(6291):1326–29.
Fruttiger M, Karlsson L, Hall AC, Abramsson A, Calver AR, Bostrom H, Willetts K, Bertold C-H, Heath JK, Betsholtz C, et al. Defective oligodendrocyte development and severe hypomyelination in PDGF-A knockout mice. Development. 1999; 126(3):457–67.
See J, Zhang X, Eraydin N, Mun S-B, Mamontov P, Golden JA, Grinspan JB. Oligodendrocyte maturation is inhibited by bone morphogenetic protein. Mol Cell Neurosci. 2004; 26(4):481–92.
Tanga N, Kuboyama K, Kishimoto A, Kiyonari H, Shiraishi A, Suzuki R, Watanabe T, Fujikawa A, Noda M. The PTN-PTPRZ signal activates the AFAP1l2-dependent PI3K-AKT pathway for oligodendrocyte differentiation: targeted inactivation of PTPRZ activity in mice. Glia. 2019; 67(5):967–84.
Olsen JJ, Pohl SÖ-G, Deshmukh A, Visweswaran M, Ward NC, Arfuso F, Agostino M, Dharmarajan A. The role of wnt signalling in angiogenesis. Clin Biochem Rev. 2017; 38(3):131.
Capasso TL, Li B, Volek HJ, Khalid W, Rochon ER, Anbalagan A, Herdman C, Yost HJ, Villanueva FS, Kim K, et al. BMP10-mediated ALK1 signaling is continuously required for vascular development and maintenance. Angiogenesis. 2020; 23(2):203–20.
Renault M-A, Roncalli J, Tongers J, Thorne T, Klyachko E, Misener S, Volpert OV, Mehta S, Burg A, Luedemann C, et al. Sonic hedgehog induces angiogenesis via Rho kinase-dependent signaling in endothelial cells. J Mol Cell Cardiol. 2010; 49(3):490–98.
Guye P, Ebrahimkhani MR, Kipniss N, Velazquez JJ, Schoenfeld E, Kiani S, Griffith LG, Weiss R. Genetically engineering self-organization of human pluripotent stem cells into a liver bud-like tissue using Gata6. Nat Commun. 2016; 7:10243. https://doi.org/10.1038/ncomms10243.
Ang NB, Saera-Vila A, Walsh C, Hitchcock PF, Kahana A, Thummel R, Nagashima M. Midkine-a functions as a universal regulator of proliferation during epimorphic regeneration in adult zebrafish. PloS ONE. 2020; 15(6):0232308. https://doi.org/10.1371/journal.pone.0232308.
Qi M, Ikematsu S, Maeda N, Ichihara-Tanaka K, Sakuma S, Noda M, Muramatsu T, Kadomatsu K. Haptotactic migration induced by midkine: involvement of protein-tyrosine phosphatase zeta, mitogen-activated protein kinase, and phosphatidylinositol 3-kinase. J Biol Chem. 2001; 276(19):15868–75. https://doi.org/10.1074/jbc.m005911200.
Weckbach LT, Groesser L, Borgolte J, Pagel J-I, Pogoda F, Schymeinsky J, Müller-Höcker J, Shakibaei M, Muramatsu T, Deindl E, Walzog B. Midkine acts as proangiogenic cytokine in hypoxia-induced angiogenesis. Am J Physiol Heart Circ Physiol. 2012; 303(4):429–38. https://doi.org/10.1152/ajpheart.00934.2011.
Onitsuka I, Tanaka M, Miyajima A. Characterization and functional analyses of hepatic mesothelial cells in mouse liver development. Gastroenterology. 2010; 138(4):1525–35153516. https://doi.org/10.1053/j.gastro.2009.12.059.
Shin DH, Jo JY, Kim SH, Choi M, Han C, Choi BK, Kim SS. Midkine is a potential therapeutic target of tumorigenesis, angiogenesis, and metastasis in non-small cell lung cancer. Cancers. 2020; 12(9):2402. https://doi.org/10.3390/cancers12092402.
Kreidl E, Öztürk D, Metzner T, Berger W, Grusch M. Activins and follistatins: emerging roles in liver physiology and cancer. World J Hepatol. 2009; 1(1):17.
Hashimoto O, Funaba M, Sekiyama K, Doi S, Shindo D, Satoh R, Itoi H, Oiwa H, Morita M, Suzuki C, et al. Activin E controls energy homeostasis in both brown and white adipose tissues as a hepatokine. Cell Rep. 2018; 25(5):1193–203.
Sugiyama M, Kikuchi A, Misu H, Igawa H, Ashihara M, Kushima Y, Honda K, Suzuki Y, Kawabe Y, Kaneko S, et al. Inhibin βe (INHBE) is a possible insulin resistance-associated hepatokine identified by comprehensive gene expression analysis in human liver biopsy samples. PLoS ONE. 2018; 13(3):0194798.
Wimmer RA, Leopoldi A, Aichinger M, Wick N, Hantusch B, Novatchkova M, Taubenschmid J, Hämmerle M, Esk C, Bagley JA, et al. Human blood vessel organoids as a model of diabetic vasculopathy. Nature. 2019; 565(7740):505–10.
Bagley RG, Weber W, Rouleau C, Teicher BA. Pericytes and endothelial precursor cells: cellular interactions and contributions to malignancy. Cancer Res. 2005; 65(21):9741–50.
Cao J, Spielmann M, Qiu X, Huang X, Ibrahim DM, Hill AJ, Zhang F, Mundlos S, Christiansen L, Steemers FJ, et al. The single-cell transcriptional landscape of mammalian organogenesis. Nature. 2019; 566(7745):496–502.
Ramilowski JA, Goldberg T, Harshbarger J, Kloppmann E, Lizio M, Satagopam VP, Itoh M, Kawaji H, Carninci P, Rost B, et al. A draft network of ligand–receptor-mediated multicellular signalling in human. Nat Commun. 2015; 6(1):1–12.
Lugo-Martinez J, Ruiz-Perez D, Narasimhan G, Bar-Joseph Z. Dynamic interaction network inference from longitudinal microbiome data. Microbiome. 2019; 7(1):1–14.
Ruiz-Perez D, Lugo-Martinez J, Bourguignon N, Mathee K, Lerner B, Bar-Joseph Z, Narasimhan G. Dynamic Bayesian networks for integrating multi-omics time-series microbiome data. bioRxiv. 2020:835124.
Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, Burovski E, Peterson P, Weckesser W, Bright J, van der Walt SJ, Brett M, Wilson J, Millman KJ, Mayorov N, Nelson ARJ, Jones E, Kern R, Larson E, Carey CJ, Polat İ, Feng Y, Moore EW, VanderPlas J, Laxalde D, Perktold J, Cimrman R, Henriksen I, Quintero EA, Harris CR, Archibald AM, Ribeiro AH, Pedregosa F, van Mulbregt P, SciPy 1.0 Contributors. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods. 2020; 17:261–72. https://doi.org/10.1038/s41592-019-0686-2.
Zudaire E, Gambardella L, Kurcz C, Vermeren S. A computational tool for quantitative analysis of vascular networks. PloS ONE. 2011; 6(11):27385.
Li D, Velazquez JJ, Ding J, Hislop J, Ebrahimkhani MR, Bar-Joseph Z. Python package TraSig. Github. 2022. https://github.com/doraadong/TraSig.
Li D, Velazquez JJ, Ding J, Hislop J, Ebrahimkhani MR, Bar-Joseph Z. Python package TraSig. Zenodo. 2022. https://doi.org/10.5281/zenodo.5949000.
Yang L, Wang W-H, Qiu W-L, Guo Z, Bi E, Xu C-R. A single-cell transcriptomic analysis reveals precise pathways and regulatory mechanisms underlying hepatoblast differentiation. Hepatology. 2017; 66(5):1387–401.


Acknowledgements

Figure 4a was created with Biorender.com. Additional file 1: Figures S9a-S10a were created using dynverse [4]. We also thank Haotian Teng for the fruitful discussion.

Peer review information

Stephanie McClelland and Barbara Cheifet were the primary editors of this article and managed its editorial process and peer review in collaboration with the rest of the editorial team.

Review history

This manuscript was previously reviewed at another journal and no review history is available.

Funding

Work was partially supported by NIH grants 1R01GM122096 and OT2OD026682 and by a C3.ai DTI Research Award to ZB-J. M.R.E. is supported by NIH grants EB028532, HL141805, and P30DK120531. J.H. is supported by the CATER Predoctoral Fellowship (NIBIB T32 EB001026).

Author information

Dongshunyi Li and Jeremy J. Velazquez contributed equally to this work.

Authors and Affiliations

Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, 15213, PA, USA
Dongshunyi Li & Ziv Bar-Joseph
Department of Pathology, School of Medicine, University of Pittsburgh, Pittsburgh, 15213, PA, USA
Jeremy J. Velazquez, Joshua Hislop & Mo R. Ebrahimkhani
Pittsburgh Liver Research Center, University of Pittsburgh, Pittsburgh, 15261, PA, USA
Jeremy J. Velazquez, Joshua Hislop & Mo R. Ebrahimkhani
Meakins-Christie Laboratories, Department of Medicine, McGill University Health Centre, Montreal, H4A 3J1, Quebec, Canada
Jun Ding
Department of Bioengineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, 15261, PA, USA
Joshua Hislop & Mo R. Ebrahimkhani
McGowan Institute for Regenerative Medicine, University of Pittsburgh, Pittsburgh, 15219, PA, USA
Mo R. Ebrahimkhani
Machine Learning Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, 15213, PA, USA
Ziv Bar-Joseph

Contributions

D.L., J.D., and Z.B.-J. designed the research; D.L., J.D., and Z.B.-J. developed the method; D.L. implemented the software. All authors analyzed the method outputs to select validation experiments. J.J.V., J.H., and M.R.E. designed and performed the validation experiments; D.L. and J.J.V. performed the analysis of validation data. All authors wrote the manuscript. The authors read and approved the final manuscript.

Corresponding author

Correspondence to Mo R. Ebrahimkhani.

Ethics declarations

Ethics approval and consent to participate

Human induced pluripotent stem cell work performed in this study was approved by the University of Pittsburgh Human Stem Cell Research Oversight (hSCRO) committee.

Consent for publication

Not applicable.

Competing interests

M.R.E. and J.J.V. have a patent (WO2019237124) for the organoid technology used in this publication.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

Supplementary Notes and Figures.

Additional file 2

Table S1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Li, D., Velazquez, J.J., Ding, J. et al. TraSig: inferring cell-cell interactions from pseudotime ordering of scRNA-Seq data. Genome Biol 23, 73 (2022). https://doi.org/10.1186/s13059-022-02629-7

Received: 20 December 2021
Accepted: 09 February 2022
Published: 07 March 2022
DOI: https://doi.org/10.1186/s13059-022-02629-7
