- Open Access
Alternative polyadenylation factors link cell cycle to migration
Genome Biology volume 19, Article number: 176 (2018)
In response to a wound, fibroblasts are activated to migrate toward the wound, to proliferate and to contribute to the wound healing process. We hypothesize that changes in pre-mRNA processing occurring as fibroblasts enter the proliferative cell cycle are also important for promoting their migration.
RNA sequencing of fibroblasts induced into quiescence by contact inhibition reveals downregulation of genes involved in mRNA processing, including splicing and cleavage and polyadenylation factors. These genes also show differential exon use, especially increased intron retention in quiescent fibroblasts compared to proliferating fibroblasts. Mapping the 3′ ends of transcripts reveals that longer transcripts from distal polyadenylation sites are more prevalent in quiescent fibroblasts and are associated with increased expression and transcript stabilization based on genome-wide transcript decay analysis. Analysis of dermal excisional wounds in mice reveals that proliferating cells adjacent to wounds express higher levels of cleavage and polyadenylation factors than quiescent fibroblasts in unwounded skin. Quiescent fibroblasts contain reduced levels of the cleavage and polyadenylation factor CstF-64. CstF-64 knockdown recapitulates changes in isoform selection and gene expression associated with quiescence, and results in slower migration.
Our findings support cleavage and polyadenylation factors as a link between cellular proliferation state and migration.
Fibroblasts within the dermis bear much of the responsibility for the secretion and maintenance of extracellular matrix proteins . Fibroblasts in unwounded skin are mostly in a state of quiescence in which they have reversibly exited the proliferative cell cycle [1,2,3]. In the initial response to a wound, mitogens and chemokines such as platelet-derived growth factor and fibroblast growth factor released by platelets and keratinocytes stimulate fibroblasts to migrate to the wound-healing environment and proliferate [1,2,3,4]. In the wounded tissue, fibroblasts secrete collagen and other extracellular matrix molecules that remodel the extracellular environment and promote the formation of a scar . While fibroblasts are recognized to play an important role in normal skin and in the wound-healing environment, we do not yet have a full appreciation of the molecular mechanisms that control the changes in fibroblast behavior in the context of a wound.
We have been studying the transition between proliferation and quiescence in a model system in primary human dermal fibroblasts [5,6,7,8,9]. Using microarrays, we and others have shown that a shift between proliferation and quiescence is associated with a major reprogramming of gene expression patterns, and that these gene expression changes are important for the functional attributes of quiescent cells, such as their ability to re-enter the cell cycle [9,10,11,12]. Based on our previous studies showing changes in the levels of splicing factors as fibroblasts transition between proliferation and quiescence , and earlier studies showing that proliferating cells, stem cells, activated cells, and cancer cells rely heavily on alternative polyadenylation (APA) by preferential use of proximal polyadenylation sites [13,14,15,16,17,18,19,20,21], we sought to understand whether alternative isoform use [16, 22, 23] could represent a link between proliferation and migration.
To address this question, we defined the changes in isoform use and polyadenylation site selection that occur as cells transition from proliferation to quiescence. We found that APA factors are expressed at lower levels as fibroblasts become quiescent, and that knockdown of these factors results in APA and gene expression changes that overlap with the changes that occur with quiescence. Longer transcripts that end at distal polyadenylation sites tend to be more stable than shorter transcripts generated from proximal polyadenylation site use in proliferating cells. We also discovered that APA factors are functionally important for the transition to a more migratory state in proliferating versus quiescent fibroblasts and affect migration in cancer cells as well. Our data, taken as a whole, provide a deeper understanding of the role of mRNA processing in the close association between proliferation and migration.
Entry into quiescence results in downregulation of genes involved in the cell cycle, mRNA processing, and motility
Primary human dermal fibroblasts were isolated from human skin samples as previously described . Fibroblasts isolated from two different donors were collected in proliferating conditions or after being induced into quiescence by 7 days of contact inhibition (7dCI) of proliferation . RNA-Seq and microarray analyses were performed to determine changes in gene expression between three samples of proliferating and matched 7dCI cells (Fig. 1a and Additional file 1: Table S1) . Among the 19,673 genes monitored, transcripts from 1993 genes (10.1%) changed in expression twofold or more, demonstrating widespread changes in gene expression with contact inhibition-induced quiescence (Fig. 1b). Expression levels for 52% of these genes were upregulated in 7dCI compared with proliferating fibroblasts, and 48% were downregulated in 7dCI fibroblasts. Correlation between biological replicates analyzed by RNA-Seq was high (R2 values greater than or equal to 0.83) (Additional file 1: Figure S1A). When the same samples were analyzed with microarrays, the differential gene expression detected by microarray was largely in agreement with that detected by RNA-Seq (r2 = 0.785, p < 0.001) (Additional file 1: Figure S1B). Further, gene expression changes detected by RNA-Seq correlated well with the previously published “quiescence program” of gene expression changes identified in fibroblasts induced into quiescence by multiple independent conditions  (Additional file 1: Figure S1C). The findings support previous studies showing that quiescence is associated with regulation of a significant fraction of the genome [9, 10, 26].
Gene set enrichment analysis (GSEA) [27, 28] revealed that expression of genes involved in DNA replication and cell cycle regulation was downregulated in 7dCI compared with proliferating fibroblasts (Fig. 1c), consistent with cell cycle exit in contact-inhibited conditions. Expression of genes associated with extracellular matrix remodeling and collagen metabolism was upregulated with quiescence (Fig. 1c, d), consistent with our previous findings [6, 7]. Indeed, COL21A1, a collagen found associated with collagen I, is among the genes most strongly induced in quiescent compared with proliferating fibroblasts (Additional file 1: Table S2). Expression of genes in the categories of muscle filament sliding, regulation of muscle contraction, movement, and muscle contraction was downregulated in contact-inhibited compared with proliferating fibroblasts (Fig. 1c, d). Four genes involved in cell motility were among the most strongly downregulated genes with quiescence (KISS1, ACTC1, PODXL, and RLTPR) (Table 1 and Additional file 1: Table S2). Thus, we found that proliferating fibroblasts express higher levels of transcripts associated with motility and cytoskeletal remodeling.
Transcripts associated with splicing and polyadenylation were mostly downregulated in 7dCI compared with proliferating fibroblasts (Fig. 1c, d), consistent with previous reports [9, 21]. Transcripts encoding many of the proteins that are considered core components of the spliceosome were slightly downregulated in contact-inhibited compared with proliferating fibroblasts (Additional file 1: Table S3), with three genes reaching statistical significance (U1C (2.26-fold reduction), PRPF4 (2.77-fold reduction), and PPIH (2.89-fold reduction)). Expression levels of cleavage and polyadenylation factors were also reduced with quiescence (Additional file 2). We hypothesized that in addition to changes in gene expression, alterations in mRNA processing events between proliferating and quiescent fibroblasts could also contribute to functional changes in quiescent and proliferating states.
Quiescent fibroblasts retain more exons and introns than proliferating fibroblasts
To better understand changes in mRNA processing associated with proliferation, we investigated our RNA-Seq data further to identify examples of alternative start site, alternative splicing, or alternative polyadenylation. Applying the DEXSeq algorithm , we discovered 1975 exons, encoded within 1218 genes, with differential expression between proliferating and 7dCI fibroblasts (Additional file 3). Using g:Profiler , we found that genes that undergo alternative isoform expression in proliferating versus quiescent cells are enriched in categories of RNA binding, RNA processing, translational elongation, and RNA splicing (Table 2, Additional file 4). Thus, genes involved in RNA processing are themselves particularly likely to be alternatively processed during the transition between proliferation and quiescence.
To better understand the frequency of specific types of splicing events that occurred differentially in proliferating and quiescent fibroblasts, we applied the rMATS computational algorithm [31,32,33] (Fig. 2a, Additional file 5). Skipped exons (exons that are present in proliferating, but not quiescent, cells or vice versa) were the most common type of event detected (319 events, 53% of events). Of the splicing events detected by rMATS, 39% were also detected by DEXSeq. More exons were preferentially included in quiescent compared with proliferating conditions, than proliferating compared with quiescent conditions (1.5-fold, Fisher’s exact test, two-tailed p value = 0.013) (Fig. 2a). These exon-switching events provide opportunities for regulation of protein function based on the inclusion or exclusion of individual exons. Introns were significantly more frequently retained in quiescent than proliferating fibroblasts (3.7-fold, Fisher’s exact test, two-tailed p value < 0.0001) (Fig. 2a). 8.2% of the transcripts associated with retained intron events are annotated as nonsense-mediated decay (NMD) candidates (18 unique NMD transcripts/220 total unique intron retention transcripts in the Ensembl database). Gene ontology (GO) analysis of the differentially spliced genes revealed that genes that undergo alternative splicing with quiescence are enriched for the categories of RNA binding, RNA processing, and RNA splicing (Table 2 and Additional file 6), consistent with a growing literature demonstrating that genes involved in mRNA splicing are themselves regulated by splicing events [30, 34,35,36,37].
Some auxiliary splicing factors are downregulated in quiescent fibroblasts
To understand the changes in splicing in quiescent compared with proliferating fibroblasts, we investigated changes in the expression of splicing factors. Our RNA-Seq data revealed that expression from RNA splicing genes is modestly downregulated in contact-inhibited fibroblasts (Fig. 1c, d and Additional file 1: Table S3). We monitored protein levels of splicing factors with immunoblotting in fibroblasts that were proliferating or induced into quiescence by 7 days of contact inhibition (7dCI) or by serum starvation (7dSS). Levels of essential splicing factor U2AF65 were similar in proliferating and quiescent fibroblasts. Levels of core factor U1-70K and auxiliary factors TRA2β and FUS were downregulated in quiescent compared with contact-inhibited fibroblasts (Fig. 2b). Lower levels of some splicing factors in quiescent fibroblasts may contribute to the increased intron retention in quiescent conditions [38, 39].
Weaker splice sites for retained introns
In addition to lower levels of splicing factors, intron retention has been associated with weak splice sites [40, 41]. To better understand why some introns are retained in proliferating or quiescent cells, we analyzed the extent to which 5′ splice sites (9-nt length) and 3′ splice sites (23 nt) of differentially retained introns match consensus splice sites . We determined the probability of observing each sequence given the position weight matrix for consensus splice sites. Sequences at splice sites for introns differentially retained in proliferating or quiescent states matched the consensus sequence less well than the sequences near constitutively spliced exons, with a strong effect at the 3′ splice site (Fig. 2c). These findings are consistent with previous studies that also showed that 3′ splice sites are enriched for C’s compared with T’s in the polypyrimidine tracts of introns that are retained . Thus, in proliferating fibroblasts that have higher levels of most splicing factors, intron retention may be especially sensitive to the 3′ splice sequence.
A shift toward the use of more distal polyadenylation sites in quiescence
A shift toward the use of distal polyadenylation sites has been observed in previous studies that showed that non-dividing cells  and differentiated cells [18, 20, 44, 45] predominantly use distal polyadenylation sites, while proliferating cells [18, 21] and cancer cell lines [20, 45, 46] tend to use proximal polyadenylation sites. Our DEXSeq analysis revealed that many of the changes in isoform expression detected between proliferating and 7dCI fibroblasts involve the last exon of the analyzed transcript and would result in a change in polyadenylation site. For example, Inverted Formin, FH2 and WH2 domain (INF2), and brother of CDO (BOC) (Fig. 3a) exhibit alternative use of terminal exons in proliferating and 7dCI fibroblasts. Real-time PCR with isoform-specific primers confirmed that for both INF2 and BOC, the transition to quiescence in response to either 7dCI or 7dSS resulted in a change in polyadenylation site selection (Fig. 3b). For INF2, the strongest effect was a decrease in the use of the proximal polyadenylation site. For BOC, the strongest effect was an increase in the use of the distal polyadenylation site in quiescent fibroblasts. Restimulation of 7dCI fibroblasts to a proliferative state resulted in a reversal back toward a polyadenylation site selection profile more similar to that in proliferating cells for both INF2 and BOC.
To generate a large-scale dataset that would clearly define the 3′ ends of transcripts in proliferating and quiescent (7dCI) fibroblasts, we applied polyadenylation site-enriched RNA-Seq . With polyadenylation site-enriched RNA-Seq, ~ 64% of all mapped sequencing reads matched a polyadenylation site (Additional file 1: Table S4). Polyadenylation site-enriched RNA-Seq data were used to determine the relative use of the distal (RUD) (reads mapping to the distal polyadenylation site/total reads from proximal and distal polyadenylation sites) for each gene in proliferating and 7dCI conditions for detected genes with two polyadenylation sites (Additional file 7). For genes with greater than two polyadenylation sites (Additional file 8), a more general parameter called relative site usage (reads mapping to a polyadenylation site/total reads from all polyadenylation sites) was used. Data were highly reproducible when different biological replicates of proliferating and 7dCI samples were compared (Additional file 1: Figure S2A). Using polyadenylation site-enriched RNA-Seq, we confirmed the previous finding  of a shift toward the use of more distal polyadenylation sites upon entry into the quiescent state through contact inhibition (Fig. 3c, Additional file 7). Eighty-eight percent (628 out of 714) of genes with two polyadenylation sites, and with significant changes (|RUD| > 0.05) in alternative polyadenylation (APA) between the two cell states, were longer (greater use of distal pA sites compared to proximal pA sites) in the quiescent compared with the proliferating fibroblasts. For 572 of these 628 genes (91%), the proximal polyadenylation site localizes to the 3′ untranslated region (UTR; termed as UTR APA) (Fig. 3c), while for the remaining 9% of genes, the proximal polyadenylation site is found in the region upstream of the 3´ UTR (upstream region (UR) APA) including introns and exons. Genes with two polyadenylation sites that undergo APA with quiescence were enriched in genes involved in RNA splicing and processing (Table 2 and Additional file 9). Genes that undergo APA with quiescence also included genes involved in cell migration (Table 1).
Reduced levels of mRNA processing factors in quiescent fibroblasts
To better understand the regulation of polyadenylation site use with quiescence, we monitored the levels of APA factors in proliferating and quiescent fibroblasts. Cleavage and polyadenylation of pre-mRNA transcripts are mediated by the coordinated activity of three core protein complexes . The cleavage and polyadenylation specificity factor (CPSF) complex recognizes a hexameric sequence (AAUAAA or a similar sequence) in a 50-nt region upstream of the cleavage site [48, 49]; the 3′ pre-RNA, subunit 2, 64 kDa (CSTF2 or CstF-64) subunit of the CstF complex recognizes a U-rich or G/U-rich region about 20–40 nucleotides downstream of the cleavage site [19, 50,51,52,53]; and Nudix (nucleoside diphosphate linked moiety X)-type motif 21 (NUDT21 or CFIm25) recognizes UGUA sequences upstream of the cleavage and polyadenylation sites . CPSF73, a component of the CPSF complex, is the endonuclease that performs the cleavage event at the hexameric sequence . Increased levels of CSTF complex proteins have been associated with the use of proximal polyadenylation sites [19, 56, 57], while the CFIm complex has been reported to repress the use of proximal polyadenylation sites [45, 57, 58]. Our RNA-Seq data revealed that most of the core polyadenylation factors and auxiliary factors associated with cleavage and polyadenylation are modestly downregulated at the transcript level in quiescent compared with proliferating fibroblasts (Additional file 2). Among the core factors, CstF-64/CSTF2 is strongly and significantly (3.1-fold) downregulated at the transcript level. Using immunoblotting, we found that the protein levels of CstF-64, CPSF73, and CFIm25 are lower in 7dCI or 7dSS than in proliferating fibroblasts (Fig. 3d). By monitoring the extent of Serine 5 phosphorylation of RNA pol II carboxyterminal domain (CTD) as an indication of transcription initiation rate  with immunoblotting, we found that CstF-64 downregulation at the protein level with quiescence was stronger than the reduction in transcription initiation (Fig. 3d).
Knockdown of cleavage and polyadenylation factors replicates polyadenylation site selection with quiescence
To better understand the role of cleavage and polyadenylation factors in polyadenylation site selection with quiescence, we introduced siRNAs that target CstF-64, CPSF73 or CFIm25, or a control siRNA, into fibroblasts. Strong knockdown of the targeted gene was confirmed with real-time PCR (Additional file 1: Figure S3). In comparison to control cells, knockdown of these polyadenylation factors did not significantly affect cell viability (Additional file 1: Figure S4A and B). We tested whether knocking down the expression of cleavage and polyadenylation factors results in changes in the levels of shorter and longer isoforms of genes that undergo APA with quiescence using real-time PCR primers designed to recognize the short or long isoforms of INF2 or BOC (Fig. 3a). For INF2, knockdown of CstF-64 or CPSF73, but not CFIm25, resulted in reduced levels of the short isoform of INF2 and an increase in the long isoform of INF2 (Fig. 4a). For BOC, knockdown of CstF-64 or CPSF73, but not CFIm25, resulted in lower levels of the short BOC isoform (Fig. 4a). Knockdown of CstF-64 resulted in an increase in the long isoform of BOC (Fig. 4a).
To monitor global APA changes, we performed polyadenylation site-enriched RNA-Seq of fibroblasts transfected with a control siRNA or an siRNA that targets a polyadenylation factor (CstF-64, CPSF73, or CFIm25) . Knockdown in two different strains of fibroblasts resulted in highly reproducible results (Additional file 1: Figure S2B). Each knockdown resulted in significant changes (|RUD| > 0.05) in polyadenylation site selection, with CFIm25 knockdown resulting in a clear shift toward use of more proximal polyadenylation sites (Additional file 1: Figure S4C and Additional file 10), consistent with previous reports [60, 61]. We compared the genes that shift polyadenylation site use with quiescence with the results of knockdown of each cleavage and polyadenylation factor (Fig. 4b and Additional file 1: Figure S5A and B). Among the three polyadenylation factors, knockdown of CFIm25 resulted in the largest number of genes that shift to greater use of the proximal polyadenylation site (shorter isoforms), and the most genes that overlap with shifts to more proximal polyadenylation sites with quiescence (Fig. 4b and Additional file 1: Figure S5A). We observed significant overlap among the genes that use more distal polyadenylation sites (shift to longer isoforms) with quiescence and genes that use more distal polyadenylation sites with knockdown of each factor, with larger numbers of genes affected for CstF-64 or CPSF73 knockdown (Fig. 4b and Additional file 1: Figure S5A). Some of these changes in polyadenylation site use were specific for one factor, while some were regulated by more than one or even all three factors (Additional file 1: Figure S5B). For 626 unique genes that shift to distal polyadenylation site use with quiescence, 226 genes (36%) also shift to distal polyadenylation site use with knockdown of one or more polyadenylation factors. For 86 genes that shift to proximal polyadenylation site use with quiescence, 38 (44%) also shift to proximal polyadenylation site use with knockdown of one or more polyadenylation factors (Additional file 1: Figure S5B).
Knockdown of CstF-64 resulted in changes in gene expression that significantly overlap with gene expression changes with quiescence (Fig. 4c and Additional file 11). Gene expression changes upon knockdown of CPSF73 and CFIm25 overlapped with gene expression changes during quiescence as well, but fewer genes were involved (Additional file 1: Figure S5C).
Some of the genes that were regulated (APA changes or gene expression changes) with knockdown of CstF-64 was found to be associated with GO terms related to cell movement (Table 3). Several of these migration genes that undergo changes in APA upon CstF64 knockdown also did so with quiescence, such as Arp2/3 complex protein ACTR2 and CDC42 and RAC1-binding protein IQGAP1.
Cleavage and polyadenylation factor recognition sites are more prevalent in genes that undergo alternative isoform use with quiescence
To further understand the importance of different cleavage and polyadenylation site factors in the alternative use of polyadenylation sites with quiescence, we monitored the presence of their recognition motifs (Fig. 5a). For genes that undergo UR APA and shift to greater use of more distal polyadenylation sites during quiescence, their proximal polyadenylation site is more likely to have a strong hexamer (AAUAAA or AUUAAA), and less likely to have no hexamer, than for control genes (Fig. 5b). Similarly, when CPSF73 is knocked down, genes that shift to greater use of distal polyadenylation sites are less likely to have no hexamer than genes that do not lengthen with quiescence (Additional file 1: Figure S6). The findings support a role for reduced CPSF73 levels contributing to the use of more distal polyadenylation sites in genes undergoing UR APA in quiescent cells.
Extending the analysis to UGUA motifs recognized by CFIm25, among genes that use UR APA to shift to more distal polyadenylation site use in quiescent than proliferating cells, there was a significantly higher chance of a UGUA motif being present at the proximal site than for a control set of genes (Fig. 5c). With CFIm25 knockdown, the strongest effect was increased use of proximal polyadenylation sites, and the genes affected were more likely to have a UGUA motif at their distal polyadenylation site (Additional file 1: Figure S7).
To monitor the presence of binding sites for CstF-64, we determined the fraction of polyadenylation sites that contain a string of four or more uracils in the region 20–40 base pairs downstream of the polaydenylation site. With this analysis, there were more UUUU motifs at proximal polyadenylation sites among genes that shift to the use of more distal sites with quiescence, but the difference was not statistically significant (0.098) (Fig. 5d). We also monitored the fraction of U’s (U-rich) and the fraction of U’s or G’s (UG-rich) in the same 20–40 base pair region. Proximal polyadenylation sites were enriched in U-rich and UG-rich sequences for genes that shifted to greater use of longer isoforms with quiescence (Fig. 5e and Additional file 1: Figure S8). This result is consistent with downregulation of CstF-64 playing a role in the shift to more distal polyadenylation sites with quiescence. Thus, in proliferating conditions, CstF-64 levels are more available for binding to U-rich proximal sites, which supports the generation of shorter isoforms.
Shifting to more distal polyadenylation sites stabilizes transcripts in quiescent but not proliferating fibroblasts
Changes in the levels of transcripts that terminate at different polyadenylation sites could reflect changes in the rates that these isoforms are generated based on the levels of polyadenylation factors, or changes in the rates at which they decay. To understand the relationship between polyadenylation site selection and transcript fate, we first determined whether APA with quiescence was associated with a change in gene expression. Relative expression in quiescent compared with proliferating fibroblasts was slightly higher on average for genes that undergo a shift to greater use of distal polyadenylation sites with quiescence than for genes that do not undergo APA or use the proximal polyadenylation site preferentially in quiescence (Fig. 6a, p < 0.001, Wilcoxon signed-rank test). This finding would be consistent with longer transcripts being more stable.
To better understand the relationship between polyadenylation site selection and transcript decay rate, we added actinomycin D to inhibit new transcription in proliferating or 7dCI fibroblasts, collected RNA over a timecourse, and performed polyadenylation site-enriched RNA-Seq to monitor the rate that different gene isoforms decayed . The results extend our previous studies of genome-wide transcript decay rates in proliferating and 7dCI fibroblasts using microarrays . In two different fibroblast strains (12–1 and 12–3), we found that isoforms terminating at distal polyadenylation sites were more stable than isoforms terminating at proximal polyadenylation sites in quiescent, but not proliferating, fibroblasts (Additional file 12 and Fig. 6b, c).
We identified motifs enriched in the interpolyadenylation site regions in genes that shift to a longer isoform with quiescence. Among the RNA-binding proteins that bind to these motifs, some are induced in quiescent compared with proliferating cells and would be candidates for stabilizing longer transcripts in quiescent cells (Additional file 1: Table S5). Our findings indicate that the shift to the use of longer isoforms in quiescent cells results in an overall stabilization of transcripts and a modest increase in expression levels. Therefore, the higher levels of longer isoforms in quiescent than proliferating fibroblasts could reflect both a difference in polyadenylation site selection (influenced by levels of polyadenylation factors) and a difference in the rate at which the shorter and longer transcripts decay in the two proliferative states.
Cleavage and polyadenylation factors are expressed at higher levels in wound-healing than quiescent skin in vivo
Wound healing is a situation in which cells are activated to both proliferate and migrate. We investigated the levels of cleavage and polyadenylation factors in normal skin and in dermal excisional wounds in mice. We introduced punch biopsies into the backs of mice and collected wounded tissue and unwounded control skin approximately 2 cm from the wound. Immunohistochemistry for the proliferation marker Ki-67 revealed higher levels of proliferation of a migrating mass of cells that includes fibroblasts, myofibroblasts, and immune cells in the skin proximal to the wound compared with cells in the dermis of control, unwounded skin (Fig. 7) . Immunostaining for histone H4 as a control revealed similar staining in wounded and control skin as expected. Immunohistochemistry for CstF-64, CPSF73, or CFIm25 revealed a higher fraction of cells with positive nuclei in the region surrounding the wounded skin for all three factors than in control, unwounded skin (Fig. 7). This analysis revealed that the shift toward higher levels of cleavage and polyadenylation factors in proliferating fibroblasts in culture also occurs in the migratory, proliferating cells that heal wounds in vivo.
CstF-64 knockdown reduces fibroblast migration
Based on the consistency with which we observed changes in the mRNA processing and expression of genes important for cell motility in proliferating versus quiescent fibroblasts (Table 1), we hypothesized that changes in mRNA processing associated with the transition between proliferation and quiescence are also important for the closely linked process of cell migration. First we tested the association between proliferation and migration. We generated fibroblasts that were proliferating, induced into quiescence by 7dSS, or restimulated after 7dSS by re-addition of medium with serum. We monitored the rate at which fibroblasts in each condition migrated into a denuded area on a tissue culture plate with real-time imaging (Fig. 8a). Migration was quantified as the ratio of cell concentration in the denuded area compared to the cell concentration in the non-denuded area, thus normalizing for possible differences in proliferation rate. We discovered that the proliferating and restimulated fibroblasts migrated into the denuded area more rapidly than the serum-starved fibroblasts (Fig. 8b).
We observed changes in the transcript and protein levels of cleavage and polyadenylation factors as fibroblasts transition between proliferation and quiescence. To test whether levels of cleavage and polyadenylation factors change in fibroblasts induced to migrate into a denuded area, we introduced denuded areas into cultures of fibroblasts and performed immunofluorescence to monitor the levels of cleavage and polyadenylation factors. CstF-64 and CPSF73 levels were significantly higher in the cells that had migrated into the denuded area than cells that had not migrated, while no significant change was observed for CFIm25 (Additional file 1: Figure S9). We then tested the importance of alternative polyadenylation factors for fibroblast motility. We generated knockdown fibroblasts with control siRNAs or siRNAs against cleavage and polyadenylation factors, and monitored the rate of migration. Knockdown of CstF-64 with any of three different siRNAs (Fig. 8c) resulted in reduced migration into the denuded area (Fig. 8d). CstF-64 siRNA #1 had the strongest effect on CstF-64 levels and resulted in the most significant reduction in migration. Knockdown of CPSF73 (Fig. 8c) resulted in slower migration, but the difference was not statistically significant (Fig. 8d). Knockdown of CFIm25 (Fig. 8c) did not affect migration rate (Fig. 8d). Thus, CstF-64 is induced in migrating cells, and knockdown of CstF-64 resulted in APA changes and downregulation of genes that overlap with those that occur with quiescence, including genes associated with cell migration (Table 3). These findings are consistent with our observation here that knockdown of CstF-64 simulates the reduced migration observed for quiescent fibroblasts.
Knockdown of cleavage and polyadenylation factors reduces migration of triple negative breast cancer cells
To determine the generality of our findings for different types of cells, we tested the effects of siRNAs targeting CstF-64, CPSF73 or CFIm25 on the migration of triple negative breast cancer cells (Additional file 1: Figure S3). Triple negative breast cancer is a highly aggressive breast cancer subtype characterized by a lack of hormonal receptors and an absence of HER2 amplification . Knockdown of CstF-64 or CPSF73 resulted in significantly reduced migration of triple negative breast cancer cells (Fig. 8e). The triple negative breast cancer cells were even more sensitive to altered polyadenylation site selection than the fibroblasts, which may reflect the increased reliance of cancer cells on proximal polyadenylation sites [20, 45, 46, 66]. Our results demonstrate that the selection of polyadenylation sites can affect the migratory capacity of cancer cells as well as fibroblasts in wound healing (Fig. 8f).
While we and others have shown that the transition to quiescence is associated with widespread changes in gene expression [9,10,11], and others have previously shown changes in the selection of polyadenylation sites with quiescence , we sought here to better understand the relationship between quiescence and alternative polyadenylation. Gene expression analysis of RNA-Seq data revealed that genes involved in mRNA processing (splicing and polyadenylation) are downregulated as fibroblasts enter quiescence (Fig. 1c, d). These findings suggested to us that processing of pre-mRNA transcripts may be different in quiescent compared with proliferating cells, and that these changes may contribute to changes in transcript abundance and the functional attributes of proliferating versus quiescent fibroblasts. We further discovered through differential exon analysis of RNA-Seq data that hundreds of genes exhibit changes in isoform expression during the transition to quiescence. Quiescent fibroblasts expressed lower levels of some auxiliary splicing factors (Fig. 2b) and were more likely to include exons and retain introns than proliferating fibroblasts (Fig. 2a), demonstrating cell-cycle state-dependent changes in splicing and intron retention . Introns that were retained tended to have splicing motifs that varied from the consensus sequence, especially for the polypyrimidine tract adjacent to 3′ splice sites in the proliferating state (Fig. 2c), potentially reducing the effectiveness of splicing factors or associated RNA binding proteins. Our results are consistent with a model in which quiescence is associated not with a complete shut-down of mRNA processing events, but rather with a shift in the processing of specific transcripts such that, in addition to changes in gene expression, an alternative set of exons and isoforms are present in fibroblasts that are proliferating versus quiescent. Genes involved in cell motility were among those demonstrating consistent changes in splicing in proliferating versus quiescent cells (Table 1).
Among the changes in isoform use that we observed, the most prominent effect was a change in the selection of polyadenylation sites in proliferating versus quiescent fibroblasts. In response to quiescence induced by contact inhibition, 714 genes exhibited a change in polyadenylation site selection, and in 88% of instances, alternative polyadenylation site use resulted in a lengthening of transcripts in quiescent compared with proliferating cells (Fig. 3c). These findings are consistent with previous studies that revealed that 3′ UTRs are shorter in more rapidly proliferating cells [18, 21], stem cells , and cells and tissues derived from tumors [20, 46, 68], and longer in cells that divide less frequently such as differentiated tissues [13, 15, 67]. We found that 3′ UTR lengthening reverses when quiescent cells re-enter the cell cycle (Fig. 3b), demonstrating that these changes can be reversed based on proliferative state.
To better understand the basis for the changes in polyadenylation site selection in proliferating versus quiescent fibroblasts, we monitored the levels of polyadenylation factors in proliferating and quiescent cells. Transition to quiescence was associated with lower levels of cleavage and polyadenylation factors CstF-64, CFIm25, and CPSF73 (Fig. 3d). Knockdown of each these three factors resulted in changes in polyadenylation site use that overlapped significantly with the changes that occurred with quiescence (Fig. 4b and Additional file 1: Figure S5A and B). There were also changes in gene expression as a result of knockdown of specific factors, especially CstF-64. These gene expression changes overlapped with changes in gene expression that occur with quiescence (Fig. 4c and Additional file 1: Figure S5C).
To further understand the contribution of different cleavage and polyadenylation complexes to the shift in polyadenylation site selection with quiescence, we monitored the presence of their recognition sites. For genes that use more distal upstream region polyadenylation sites with quiescence, the proximal hexamer was much more likely to match the canonical hexamer, and very unlikely to be absent (Fig. 5b). A similar shift was observed with CPSF73 knockdown (Additional file 1: Figure S6A). This is consistent with reduced expression of CPSF73, and reduced use of upstream region proximal polyadenylation sites, as a factor contributing to the lengthening of transcripts with quiescence. A role for reduced CstF-64 levels in quiescent cells promoting the shift to more distal polyadenylation sites is supported by the finding that the sequence between 20 and 40 bps downstream of the proximal polyadenylation site included more Us on average and more Gs and Us on average, for genes that use more distal polyadenylation sites with quiescence (Fig. 5e). Taken together, the results support the importance of reduced levels of cleavage and polyadenylation factors with quiescence, with the polyadenylation pattern for specific sequences determined in part by the presence or absence of binding factors for the reduced factors.
Some previous studies have reported that shorter transcripts generated by alternative polyadenylation tend to be expressed at higher levels than the corresponding longer isoform [20, 46, 69, 70], while other studies have found little effect of alternative polyadenylation on transcript levels, transcript stability or protein abundance [71, 72]. Additional studies have found that shorter transcripts can be more or less stable [71, 73], and two detailed analyses in yeast showed clear examples of stability elements in 3′ UTRs that make longer isoforms more stable than shorter isoforms [74, 75]. In our study, we observed that genes with longer 3′ UTRs during quiescence, on average, exhibited a small but significant increase in expression level during quiescence compared to proliferating cells (Fig. 6a). Further, isoforms are more stable when distal rather than proximal polyadenylation sites are used in the quiescent state, but decay rates are similar when proximal or distal sites are used in the proliferating state (Fig. 6b, c). The findings are consistent with induction of an RNA-binding proteins in quiescent cells that bind to motifs present in the region between the polyadenylation sites and limit transcript degradation when the cells are quiescent. There are multiple motifs recognized by RNA-binding proteins in this inter-polyadenylation site region, and some of the factors that recognize these motifs are expressed at higher levels in quiescent than proliferating fibroblasts (Additional file 3). The findings are also consistent with the retention of longer transcripts in ribonucleoprotein storage granules or other structures in quiescent cells . These changes could contribute to the higher gene expression levels of transcripts undergoing transcript lengthening in quiescence (Fig. 6a).
In many , but not all , studies, cancerous tissue and cancer cell lines were found to be more likely to express transcripts that terminate at proximal than distal polyadenylation sites, consistent with our observations in proliferating fibroblasts. Different polyadenylation factors have been found to have distinct effects on APA. Downregulation of CFIm25 repressed proximal polyadenylation site use (Additional file 1: Figure S4C) consistent with previous reports [45, 54]. Depletion of CFIm25 has been found to enhance the tumorigenic properties of glioblastoma cells , while overexpression of CFIm25 reduced tumor growth . Shortening of 3′ UTRs has been associated with poor prognosis in breast and lung cancer . Further, in an analysis of multiple tumor datasets deposited in The Cancer Genome Atlas, expression of CstF-64 correlated most closely with shortening of transcripts, with CPSF73 showing the next best correlation among the factors investigated . Expression of shorter 3′ UTRs was an important predictor of patient outcome even beyond established clinical attributes . In another study, CstF-64 expression was found to be associated with poor prognosis in lung cancer and its overexpression increased lung cancer cell proliferation and invasion . In our dataset, cyclin D1 was the most strongly downregulated gene when CstF-64 was knocked down (Additional file 11), raising the possibility that CstF-64 levels modulate polyadenylation site selection and cyclin levels. Taken together with our data demonstrating that downregulation of CstF-64 in triple negative breast cancer cells reduces their migration (Fig. 8e), the data as a whole suggest that CstF-64-mediated APA may play an important role in regulating polyadenylation site selection, gene expression, cancer cell migration, metastasis, and patient outcome.
Fibroblasts transition from quiescence to proliferation and become more migratory in the context of wound healing. Some previous studies have supported a role for mRNA processing in wound healing [80,81,82,83]. By investigating the wound healing response in mice, we found that the levels of polyadenylation factors CstF-64, CFIm25, and CPSF73 were significantly higher in the area adjacent to the wound than distal to the wound (Fig. 7), similar to our finding that these factors are expressed at higher levels in proliferating than quiescent fibroblasts in culture (Fig. 3d). The results support a possible role for alternative polyadenylation in the proliferative and migratory changes that occur in the wound healing process.
Previous studies have identified mechanistic links between fibroblast proliferation and migration. Mitogen binding to receptor tyrosine kinases can activate focal adhesion kinase (FAK) and thereby stabilize focal adhesions [84, 85]. Activation of receptor tyrosine kinases can also recruit WASp , which promotes the formation of branched actin filaments that promote cell migration. The anti-proliferative cyclin-dependent kinase inhibitor p27Kip1 binds to and inhibits the activity of RhoA GTPase , an important regulator of actin dynamics and adhesion, spreading and migration . Our findings that downregulation of APA factors, as occurs in response to antiproliferative signals via E2F transcription factors , reduces the capacity of fibroblasts to migrate into a denuded area, represents another mechanism linking fibroblast proliferation to migration through APA. We found that CstF-64 is induced in migrating cells, and knockdown of CstF-64 resulted in changes in polyadenylation site selection, altered expression of several migration genes (Table 3), and reduced cell migration (Fig. 8d). Among the genes expressed at lower levels with CstF-64 knockdown are beta actin, α-actinin, and myosin 1b. Our findings support a model in which changes in the selection of polyadenylation sites or changes in gene expression mediated by the levels of alternative polyadenylation factors play an important role in critical cell functions including migration. In a separate manuscript, we investigate in more detail the effects of isoform changes in one particular gene, RECK (included in Table 1 under UR-APA), on migration . Taken together, our data and the data emerging from other laboratories, underscore the importance of CstF-64 as an important regulator of cellular functions, including migration, in multiple cellular contexts.
Our work demonstrates that, in addition to changes in gene expression, the shift from a proliferating to a quiescent state is associated with changes in intron and exon inclusion and with the selection of polyadenylation sites. Overall, quiescent cells tend to retain introns and express longer transcripts that are present at higher levels and are more stable. Cleavage and polyadenylation factor CstF-64 is more abundant in proliferating fibroblasts in culture and in fibroblasts near a denuded area or a wound in mice. Knockdown of CstF-64 recapitulates changes in isoform use and gene expression in quiescent cells, and results in reduced cell migration in fibroblasts and cancer cells. Fibroblasts are often induced to proliferate and migrate in similar situations, and our data indicate that changes in the levels of CstF-64 can serve as a link between proliferative cues and migratory capacity.
Human foreskin fibroblasts were isolated from human skin obtained from the National Disease Research Interchange (NDRI) as described previously [24, 90]. Cells were seeded at 5 × 105 cells per 10 cm dish for each cell cycle state and grown in Dulbecco’s modified Eagle medium (DMEM) (Life Technologies, Grand Island, NY) supplemented with 10% fetal bovine serum (FBS) (Atlanta Biologicals, Flowery Branch, GA and Corning, Thermo Fisher Scientific, Waltham, MA) at 37 °C in a 5% CO2 incubator. Detailed procedures for culturing proliferating and quiescent fibroblasts are described in . Briefly, proliferating fibroblasts were collected for analysis 2 days after plating (60–80% confluent). 7dCI fibroblasts were collected 7 days after plating, or at an equivalent density, while 7dSS fibroblasts were seeded in full serum medium (10% FBS in DMEM), changed to reduced serum medium (0.1% FBS in DMEM), and collected 7 days after adding the reduced serum medium. Medium was changed every 2 days for both 7dCI and 7dSS fibroblasts. Restimulated samples were prepared by first performing the relevant quiescence arrest and readding the limiting factor. Restimulated fibroblasts were monitored with Incucyte migration assays or collected 24 or 48 h later for real-time PCR analysis. The triple negative breast cancer cell line MDA-MB-231 cell line (generous gift of the Banerjee and Christofk laboratories) was grown in 10% FBS in DMEM.
RNA isolation for RNA-Seq and microarray analysis
RNA-Seq was performed on three biological replicates of fibroblasts isolated from two different donors, 12–1 and 10–5. Medium was aspirated from tissue culture plates of fibroblasts, and the attached cells were washed with 5 ml of PBS. Attached fibroblasts were lysed into 1 mL of Trizol reagent (Life Technologies, Carlsbad, CA) per 10 cm plate for 5 min. RNA was isolated from Trizol lysates as previously described [92, 93]. RNA concentrations were determined using a Nanodrop Spectrophotometer (Thermo Fisher Scientific Inc., Waltham, MA). RNA quality was verified on a Bioanalyzer 2100 (Agilent Technologies, Santa Clara, CA) using reagents from the RNA Nano 6000 kit (Agilent Technologies).
cDNA libraries were constructed using the Illumina TruSeq mRNA sample preparation kit (Illumina Inc., San Diego, CA) according to the manufacturer’s instructions for revision A of the protocol (Illumina Part #15008136). The low-input protocol was followed for all samples, and 1 to 10 μg of total RNA input was used per library (unstranded). Single-end 140 bp reads were generated on an Illumina HiSeq 2000 Instrument. Reads with Illumina (PHRED-based) quality scores above 10 (90% accuracy) were mapped to the hg19/GRCh37 build of the human genome using the TopHat (version 2.0.9) genome alignment algorithm [94, 95]. The bowtie indices for human were obtained from the bowtie website: http://bowtie-bio.sourceforge.net/tutorial.shtml. The standard workflow for Tophat alignment was followed as described here: https://ccb.jhu.edu/software/tophat/manual.shtml.
The default parameters for alignment as described in the Tophat manual were used. Standard DESeq (version 1.22.0) workflow  (https://bioconductor.org/packages/release/bioc/html/DESeq.html) was used to convert the output of TopHat (BAM files) to a file format with gene identifiers (UCSC gene annotation, GRCh37/hg19 assembly, date of access June, 2013) and read counts normalized for sequencing depth across the different biological samples and cell cycle conditions. Information about biological replicates was provided as input for variance calculations to determine differential expression among proliferating and 7dCI conditions in DESeq. To identify differentially expressed genes, the log2 (7dCI read count/proliferating read count) was used to compare expression differences between the two states. Genes with differences in read counts between conditions (proliferation versus 7dCI), and low variance in expression within the three biological replicates of each condition, were called significant by DESeq after multiple hypothesis correction (FDR < 5%) . Heat maps were generated using the heatmap2 function of gplots package (2.12.1) (https://cran.r-project.org/web/packages/gplots/index.html) implemented in the R programming language [98, 99].
Gene set enrichment analysis
For RNA-Seq data, gene sets with significantly different expression between proliferating and quiescent fibroblasts were identified using a Wilcoxon rank-sum test comparing the log fold-change estimates of genes within each set to genes not within the set . Graphics were created using the GSEMA package implemented in R .
Differential isoform analysis
To determine differential isoform use between proliferating and quiescent fibroblasts, the standard DEXSeq (version 1.14.2) workflow (https://bioconductor.org/packages/release/bioc/html/DEXSeq.html)  was followed. BAM files generated by aligning RNA-Seq reads to the human genome (hg19/GRCh37 build) were converted to gene-normalized read count files using exons as the identifiers. The Ensembl gene annotation (GRCh37 assembly) file was obtained from https://ccb.jhu.edu/software/tophat/igenomes.shtml. Differential exon expression was determined across the three biological replicates. Genes with significant differences in expression for specific exons (adjusted p value < 0.05) between proliferating and 7dCI conditions were used for further analysis.
Microarray gene expression analysis
An aliquot of the same total RNA that was analyzed by RNA-Seq was also analyzed by microarray. Total RNA was reverse-transcribed into cDNA and fluorescently labeled with Cyanine 3-CTP (7dCI samples) or Cyanine 5-CTP (proliferating samples) with the Quick Amp Labeling Kit for Microarray Analysis (Agilent Technologies, Santa Clara, CA) following the manufacturer’s protocol. cRNA samples that passed yield and labeling standards were fragmented, and proliferating and quiescent samples were hybridized to two-color Human gene expression 4 × 44 K microarrays (Agilent Technologies) for 17 h at 65 °C in an oven rotating the arrays at 10 rotations per minute. Fluorescence intensities were detected using the Genepix scanner (Agilent Technologies) and probe identities were determined using Agilent’s feature extractor version 11.5. Probes detected over background fluorescence thresholds were used in subsequent gene expression analyses to calculate log2 (7dCIintensity/Pintensity).
Differential splicing analysis
RNA-Seq reads (fastq files) from three replicates of proliferating fibroblasts and three replicates of 7dCI fibroblasts were analyzed with the rMATS algorithm release 3.2.1 (http://rnaseq-mats.sourceforge.net/rmats3.2.1.beta/) [31,32,33] using Ensembl gene annotation (GRCh37 assembly). Reads were trimmed to a length of 100 bps for analysis using the Trim Fastq tool provided as part of rMATS package. Standard workflow for rMATS (default parameters as described in: http://rnaseq-mats.sourceforge.net/rmats3.2.1.beta/user_guide.htm) was used for the splicing analysis using the reads that cover the splicing junctions and target regions. Alternative splicing events with an FDR of < 0.05 were considered statistically significant.
Polyadenylation site-enriched RNA-Seq
We performed polyadenylation site-enriched RNA-Seq with two methodologies (Gnomegen  and Nextera). Here we describe the second approach, Nextera. For polyadenylation site-enriched RNA-Seq, two different primary dermal fibroblasts, 12–1 and 12–3, were used as biological replicates. Proliferating, 7dCI, and siRNA-treated fibroblasts were lysed by adding 1 ml of Trizol per 10 cm plate and incubating the plate for 5 min at room temperature. RNA was isolated from the cell lysates using the Direct-zol™ RNA MiniPrep Plus kit (Zymo Research, Irvine CA) by following the manufacturer’s instructions. The concentration of RNA was measured using Nanodrop 2000c (Thermo Fisher Scientific). cDNA libraries containing fragments enriched for 3’UTR ends were created with the Nextera kit using the Smart-seq2 cDNA amplification method as described in . Common forward primers were used for all samples; reverse primers with a unique barcode sequence (i5 indices) were specific for each sample. The size distribution of the cDNA library was confirmed using a High Sensitivity DNA chip (Agilent Technologies) on a Bioanalyzer 2100 Instrument (Agilent Technologies). Libraries with a uniform size distribution between 150 and 1000 bp were subjected to gel size selection to enrich for 180–280 bp sized fragments. The concentration of the final library was measured on a qubit fluorometer (Thermo Fisher Scientific). Single-end 150 bp reads were generated on an Illumina HiSeq 2500 Instrument. The sequencing reaction was run for 150 cycles.
Polyadenylation site-enriched RNA-Seq analysis
Reads from polyadenylation site-enriched cDNA libraries were demultiplexed followed by removal of adapter and polyA tail sequences. Trimmed reads were aligned to the human genome (hg19/GRCh37 build) using TopHat (version 2.0.14)  using default parameters. Aligned reads were assigned to a polyadenylation site based on annotations in the Poly(A)site atlas (version:r1.0(hg19) by Gruber et al.  using the Perl script provided (http://www.polyasite.unibas.ch/). Only the polyadenylation sites annotated as TE (terminal exon), EX (any other exon except the terminal one), or IN (any intron), and with at least 10 counts across all the samples, were included for analysis. For genes containing two polyadenylation sites, the relative use of the distal polyadenylation site (RUD) [13, 18] was determined as distal polyadenylation counts/total read counts (distal plus proximal counts). The RUD values for two biological replicates were averaged to determine the RUD value of a gene. Changes in alternative polyadenylation between the two conditions were significant if the RUD difference between them was greater than 0.05. For genes with more than two polyadenylation sites, a parameter called relative site usage (counts for a polyadenylation site divided by total counts for all the polyadenylation sites) was calculated for all the polyadenylation sites of a gene. To perform differential expression analysis, counts from all the polyadenylation sites of a gene were combined and the combined counts for all the genes for two different conditions were subjected to DESeq2 (version 1.18) analysis [96, 104] using standard parameters (Ensembl annotation, GRCh37 assembly).
Transcript decay rate measurements
Detailed protocols for cell culture and actinomycin D treatment are described in [63, 105]. Briefly, to monitor transcript decay rates, proliferating and 7dCI fibroblasts were treated with 15 μg/ml actinomycin D (Sigma-Aldrich, Inc., St. Louis, MO). Cells were washed with PBS and cell lysates were collected using Trizol reagent (Life Technologies) at 0, 120, 240, and 480 min after addition of actinomycin D. RNA was isolated from Trizol lysates using the Direct-zol™ RNA MiniPrep Plus kit (Zymo Research). cDNA library preparation, sequencing, and processing of reads were performed as described for polyadenylation-site enriched RNA-Seq.
Decay rate calculations
For comparisons of decay rates under different conditions, only the genes with two polyadenylation sites (proximal and distal) in the 3′ UTR were used for analysis. Further, only transcripts with a minimum of 10 counts at t = 0 were used. For each polyadenylation site, the counts at four time points (0, 2, 4, and 8 h) were log-transformed and fit to a linear decay model ([63, 105]) using the least squares method to determine a fitting parameter (R2) and to obtain decay constants. Only the polyadenylation sites with R2 value greater than 0.6 were used. The decay constants (k) were converted to half-lives (ln2/k) for isoform-specific analysis.
For all of the transcripts that undergo APA with quiescence and had two detectable polyadenylation sites, sequences (in FASTA format) were obtained from the UCSC Genome Browser (Table browser tool, hg19/GRCh37 build, accessed on March 2018) that include the polyadenylation site itself, 100 nts upstream (for UGUA motif analysis), and the region 20 to 40 nt downstream (for U-rich and UG-rich motif analysis) of the polyadenylation site. For hexamer analysis, the hexamer associated with each of the polyadenylation sites was obtained from Poly(A)site atlas annotations (Homo sapiens-version:r1.0(hg19)) by Gruber et al. (http://www.polyasite.unibas.ch/) . For sites associated with more than one hexamer, we chose the hexamer with the highest signal strength as determined by Gruber et al. For UGUA analysis, FIMO (v4.12.0)  motif analysis tool of the MEME suite was used with p value set to 1 to return matches to all of the UGUA motifs. Post-processing of the FIMO results was used to check for exact matches. For RBP motif analysis, primary sequences (in FASTA format) from the alternate region (region between proximal and distal sites in the 3′ UTR) for genes that become longer (distal polyadenylation site use) with quiescence were extracted using the Table browser tool of the UCSC Genome Browser (hg19/GRCh37 build, accessed on March 2018). To generate a background dataset, all the sequences from alternate regions of genes that use more proximal sites with quiescence and genes with no change in polyadenylation site use with quiescence were used. RBP motifs enriched in primary sequences in comparison with background sequences were obtained using the analysis of motif enrichment (AME, v4.12.0) motif enrichment tool  of the MEME suite. The RNA motifs from Ray2013 Homo sapiens motif database  were used for enrichment testing. Only the RBP motifs enriched in both 12–1 and 12–3 biological replicates were considered. For U-rich and UG-rich analysis, the sequences of the regions encompassing 20 to 40 nt downstream of the polyadenylation site for each gene were extracted for all genes with two polyadenylation sites using the Table browser tool of the UCSC genome browser (hg19/GRCh37 build, accessed on March 2018). The U-rich sequences in this region have been shown to be the preferred binding sites of CstF64 using crosslinking immunoprecipitation (CLIP)-Seq analysis . Percent U was calculated by determining the fraction of Us present in this region. Percent UG was calculated by determining the sum of the fractions of Us and Gs present in this region. For analysis of 4-mer UUUU sequence , the presence or absence of a UUUU motif was determined in this region.
Splicing site analysis
Nucleotide sequences were extracted for the 5′ and 3′ splice sites for 139,180 constitutive exons from HEXEvent online database  and for the introns called differentially retained (FDR < 0.05) by rMATS in proliferating or quiescent fibroblasts (Additional file 4). For analyzing 5′ and 3′ splice sites, motifs of 9 bases (3 bases in the exon and 6 bases in the intron) and 23 bases (20 bases in the intron and 3 bases in the exon), respectively, were used. A position weight matrix was generated from constitutive exon 5′ and 3′ sequences using scripts written in the R programming language [112, 113]. Based on this position weight matrix, the probability of each sequence was determined for each sequence in the list of constitutive exons, introns retained in proliferating conditions and introns retained in quiescent conditions. Statistical significances of the groups of probabilities were determined with ANOVA with Tukey’s multiple comparison test. Sequence logos were generated from the position weight matrix using the R programming language (seqLogo package, https://bioconductor.org/packages/release/bioc/html/seqLogo.html) .
Antibodies for immunoblotting
Antibodies against tubulin (T6074) and CFIm25 (AV40695-100UG, 1:800 dilution) were obtained from Sigma-Aldrich, Inc. (Saint Louis, MO). An antibody against CstF-64 (sc-28201, 1:200) was purchased from Santa Cruz Biotechnology, Inc. (Dallas, TX). An antibody against U1-70K (06-1297, 1:2000) was purchased from EMD Millipore (Billerica, MA). Antibodies against CPSF73 (A301-090A-T), U2AF65 (A303-665A-T), FUS (A300-292A-T), and RNA Polymerase II Phospho S5 (A304-208A-T) were purchased from Bethyl Laboratories (Montgomery, TX) and used at 1:1000 dilution.
Immunoblotting was performed using a standard protocol similar to that described previously . Briefly, cells were lysed using mammalian protein extraction reagent (MPER) (Thermo Fisher Scientific Inc., Waltham, MA) containing protease and phosphatase inhibitors (Roche Applied Science, Indianapolis, IN) according to the manufacturer’s instructions (Thermo Fisher Scientific Inc.). Total protein concentrations in collected lysates were measured using Pierce™ BCA protein assay kit (Thermo Fisher Scientific Inc.). Samples were run on SDS PAGE gels and transferred to polyvinylidene difluoride Immobilon-P membranes (EMD Millipore, Billerica, MA). Membranes were blocked with 5% BSA in phosphate-buffered saline-Tween. Immunodetection was performed using primary and HRP-conjugated secondary antibodies based on standard protocols.
Mouse wounding assays
All experiments were approved by the UCLA Office for Animal Research, protocol number 2015–033. C57/BL6 mice were provided housing and husbandry in accordance with Institutional Animal Care and Use Committee approved protocols. Mice that were approximately 8–10 weeks of age were anesthetized, shaved, and provided with analgesia. We introduced one full thickness dermal punch biopsy of 3.5 mm on each mouse’s upper back. On day 5 after wounding, the mouse was 83.6% healed. Mice were euthanized with CO2 followed by cervical dislocation. We excised the wound bed en bloc with the surrounding soft tissue and at least 0.5 cm of normal tissue surrounding the incision. We also collected normal skin from the same mice for comparison. Skin and wounds were fixed in formalin and paraffin-embedded. Slides were cut from paraffin blocks for immunohistochemistry.
Tissue slices (4 μm) from paraffin-embedded blocks containing wounds were de-paraffinized and rehydrated with a graded series of alcohols. Slides were subjected to heat-induced antigen retrieval with pH 6.0 citrate buffer. Slides were treated with primary antibodies against Ki-67 (Abcam, catalog no. ab16667, dilution 1:150), histone H4 (EMD Millipore, 05-858, 1:2000), CstF-64 (Bethyl Laboratories, IHC-00221, 1:1000), CPSF73 (Bethyl, A301-090A, 1:200) or CFIm25 (Sigma, AV40695, 1:200), followed by EnVision+ HRP-conjugated secondary antibody (Dako) and DAB chromogen (Roche) visualization. Slides were counterstained with hematoxylin and imaged with a Zeiss AXIO Imager.D2 microscope.
A monolayer of contact-inhibited fibroblasts in a 35-mm dish with a glass bottom (MatTek Corporation, Ashland, MA) was scratched (crosswise) using a sterile 1 ml pipette tip to create a region free of cells (wound area). The cells were then gently washed two times using complete medium to remove the non-adherent cells generated during scratching. After 24 h, the cells were fixed with 4% paraformaldehyde (Santa Cruz Biotechnology Inc., Dallas, TX) in PBS for 15 min at room temperature and then washed three times with ice-cold PBS. The cell permeabilization was performed using 0.25% Triton X-100 (Thermo Fisher Scientific, NJ) followed by washing the cells three times with PBS. The cells were blocked using blocking solution (1% bovine serum album (BSA) in PBS containing 0.2% Tween (Thermo Fisher Scientific) at room temperature for 30 min. After blocking, the cells were incubated with primary antibodies (CstF64, CPSF73, or CFIm25) in blocking solution (1:100 dilution) at 4 °C in a humidified chamber overnight. The cells were then washed three times with PBS followed by incubation with Alexa-488 labeled secondary antibody (Thermo Fisher Scientific) at 1:250 dilution for 1 h at room temperature. After washing the cells three times with PBS, the cells were stained with DAPI using the VECTASHIELD hardset antifade mounting medium with DAPI (Vector Laboratories, Inc., Burlingame, CA). The images were taken at 10X magnification on a Zeiss confocal microscope (LSM 710, Carl Zeiss). Images were analyzed using ImageJ (v1.52a).
siRNAs against CFIm25 and CPSF73 were purchased from Sigma-Aldrich. siRNAs against CstF-64 were purchased from Sigma-Aldrich (CstF64.1) and Origene Technologies Inc., Rockville, MD (CstF64.2 and CstF64.3). siRNAs were transfected into fibroblasts or cancer cells using GeneMute transfection reagent from SignaGen Laboratories (Rockville, MD) according to the manufacturer’s instructions.
For real-time PCR, DNA primers were designed with Primer3 for UBC primers or NCBI Primer-BLAST for all other primers, and synthesized by Integrated DNA Technologies (Coralville, IA). RNA was isolated using the PureLink RNA Kit (Thermo Fisher Scientific). cDNA was treated with TURBO DNA-free™ Kit (Thermo Fisher Scientific) to eliminate the remaining DNA. Real-time PCR was performed with SYBR® Green One-Step Real-Time RT PCR Kit (Thermo Fisher Scientific). Samples were cycled on a BioRad CFX96 Real Time PCR instrument driving a Biorad C1000 Thermal Cycler for 40 cycles. The ΔΔCt method was used to determine the abundance of different PCR products . Values for each gene of interest were normalized to UBC for the same sample. Primer sequences were as follows: CstF64, 5’-GCAAGCTTCTATGCAGGGTG-3′ and 5′-TTGCATCGGCACTTGAACTC-3′; CPSF73, 5′-GAAGTCGAGGGGAGGAGTCT-3′ and 5′-AGCTCCAAGGGGTCGGAT-3′; CFIm25, 5′-GCACCATCAACCTGTACCCTC-3′ and 5′-AGTAACACATGGGGTAGCCG-3′; long INF2, 5′-GGAGGAGGTGTGTGTCATCG-3′ and 5′-CTCCTGCAGGGTTACTGGTG-3′; short INF2, 5′-GCTGCGGAACGAGTTTATCG-3′ and 5′-GGAGGTGCTGCTTAGGTGAG-3′; long BOC, 5′-TCAGCAACGTGATGATCTGTGA-3′ and 5′-CCGCTCTATGGTTTCAGGAAGG-3′; short BOC 5′-CCTCATCTCTCCCACCCTGAA- 3′ and 5′-TGAGGTTTTCCAAGGGCACAA-3′, UBC, 5′-TCTTGTTTGTGGATCGCTGTGA-3′ and 5′-CAGGAGGGATGCCTTCCTTATC-3′.
Incucyte in vitro wound healing assays
For wound healing assays, fibroblasts were plated in the wells of an Incucyte™ ImageLock™ 96-well plate (Essen BioScience) and the WoundMaker™ tool was used to create a denuded area in each well on the plate. The IncuCyte™ ZOOM live-cell analysis system (Essen BioScience) was used to automatically collect time-lapse images (phase-contrast) and to quantify cell migration over time as the density of cells in the denuded area relative to the density of cells out of the denuded area (relative wound density). Plots were determined to be statistically significantly different based on repeated measures two-way ANOVA with Dunnett’s multiple comparison test.
Statistical analyses and plots
Statistical significance determinations were performed with two-tailed tests for all analyses. For DESeq/DESeq2, splicing, and DEXSeq, the software included multiple hypothesis testing correction. All errors bars represent standard deviations. For the Wilcoxon test, we checked whether the data were normally distributed. We used Fisher’s exact tests when sample sizes were low. Statistical significance for t-tests was determined using Prism (6.0f, GraphPad Software, La Jolla, CA). Statistical significance for correlations were performed using the cor() function in R. The hypergeometric test was performed with dhyper() function in R. The Wilcoxon test was performed with the Wilcox.test() function in R. Time series analysis for migration assays was performed with Prism. All bar graphs for RT-PCR and plots for migration assays were performed in Prism. All box plots and density plots were generated with ggplot2 package . Plots for motif frequencies were generated in Prism.
7 days of contact inhibition
Binary version of a SAM file
Bicinchoninic acid assay
Brother of CDO
Nudix (nucleoside diphosphate linked moiety X)-type motif 21
Clusterin associated protein 1
Cleavage and polyadenylation specificity factor
Cleavage stimulation factor
Carboxy terminal domain
Dulbecco’s modified Eagle medium
Focal adhesion kinase
Fetal bovine serum
False discovery rate
Fused in sarcoma
Gene set enrichment analysis
Gene Set Enrichment Made Awesome
Human epidermal growth factor receptor 2
Horse radish peroxidase
Integrated Genome Viewer
- INF2 Inverted Formin:
FH2 and WH2 domain containing
Multiple Em for Motif Elicitation
Mammalian protein extraction reagent
Polyacrylamide gel electrophoresis
Peptidylprolyl isomerase H
Pre-MRNA Processing Factor 4
Replicate Multivariate Analysis of Transcript Splicing
Relative use of the distal polyadenylation site
Sodium dodecyl sulfate
Transformer-2 protein homolog beta
U1 small nuclear ribonucleoprotein 70K
U2 Small Nuclear RNA Auxiliary Factor 2
- UR APA:
Upstream region APA or alternative polyadenylation affecting at least one polyadenylation site in the coding sequence
- UTR APA:
Alternative polyadenylation affecting polyadenylation sites in the UTR
Wiscott-Aldrich Syndrome protein
Tschumperlin DJ. Fibroblasts and the ground they walk on. Physiology (Bethesda). 2013;28:380–90.
Hinz B, Phan SH, Thannickal VJ, Galli A, Bochaton-Piallat ML, Gabbiani G. The myofibroblast: one function, multiple origins. Am J Pathol. 2007;170:1807–16.
Werner S, Grose R. Regulation of wound healing by growth factors and cytokines. Physiol Rev. 2003;83:835–70.
De Donatis A, Ranaldi F, Cirri P. Reciprocal control of cell proliferation and migration. Cell Commun Signal. 2010;8:20.
Evertts AG, Manning AL, Wang X, Dyson NJ, Garcia BA, Coller HA. H4K20 methylation regulates quiescence and chromatin compaction. Mol Biol Cell. 2013;24:3025–37.
Suh EJ, Remillard MY, Legesse-Miller A, Johnson EL, Lemons JM, Chapman TR, Forman JJ, Kojima M, Silberman ES, Coller HA. A microRNA network regulates proliferative timing and extracellular matrix synthesis during cellular quiescence in fibroblasts. Genome Biol. 2012;13:R121.
Lemons JM, Feng XJ, Bennett BD, Legesse-Miller A, Johnson EL, Raitman I, Pollina EA, Rabitz HA, Rabinowitz JD, Coller HA. Quiescent fibroblasts exhibit high metabolic activity. PLoS Biol. 2010;8:e1000514.
Legesse-Miller A, Raitman I, Haley EM, Liao A, Sun LL, Wang DJ, Krishnan N, Lemons JM, Suh EJ, Johnson EL, et al. Quiescent fibroblasts are protected from proteasome inhibition-mediated toxicity. Mol Biol Cell. 2012;23:3566–81.
Coller HA, Sang L, Roberts JM. A new description of cellular quiescence. PLoS Biol. 2006;4:e83.
Iyer VR, Eisen MB, Ross DT, Schuler G, Moore T, Lee JC, Trent JM, Staudt LM, Hudson J Jr, Boguski MS, et al. The transcriptional program in the response of human fibroblasts to serum. Science. 1999;283:83–7.
Liu H, Adler AS, Segal E, Chang HY. A transcriptional program mediating entry into cellular quiescence. PLoS Genet. 2007;3:e91.
Sang L, Coller HA, Roberts JM. Control of the reversibility of cellular quiescence by the transcriptional repressor HES1. Science. 2008;321:1095–100.
Ji Z, Lee JY, Pan Z, Jiang B, Tian B. Progressive lengthening of 3′ untranslated regions of mRNAs by alternative polyadenylation during mouse embryonic development. Proc Natl Acad Sci U S A. 2009;106:7028–33.
Ji Z, Tian B. Reprogramming of 3′ untranslated regions of mRNAs by alternative polyadenylation in generation of pluripotent stem cells from different cell types. PLoS One. 2009;4:e8419.
Hoque M, Ji Z, Zheng D, Luo W, Li W, You B, Park JY, Yehia G, Tian B. Analysis of alternative cleavage and polyadenylation by 3′ region extraction and deep sequencing. Nat Methods. 2013;10:133–9.
Tian B, Manley JL. Alternative polyadenylation of mRNA precursors. Nat Rev Mol Cell Biol. 2017;18(1):18–30. https://doi.org/10.1038/nrm.2016.116. Epub 2016 Sep 28.
Akman HB, Erson-Bensan AE. Alternative polyadenylation and its impact on cellular processes. Microrna. 2014;3:2–9.
Sandberg R, Neilson JR, Sarma A, Sharp PA, Burge CB. Proliferating cells express mRNAs with shortened 3′ untranslated regions and fewer microRNA target sites. Science. 2008;320:1643–7.
Takagaki Y, Seipelt RL, Peterson ML, Manley JL. The polyadenylation factor CstF-64 regulates alternative processing of IgM heavy chain pre-mRNA during B cell differentiation. Cell. 1996;87:941–52.
Mayr C, Bartel DP. Widespread shortening of 3'UTRs by alternative cleavage and polyadenylation activates oncogenes in cancer cells. Cell. 2009;138:673–84.
Elkon R, Drost J, van Haaften G, Jenal M, Schrier M, Vrielink JA, Agami R. E2F mediates enhanced alternative polyadenylation in proliferation. Genome Biol. 2012;13:R59.
Tian B, Manley JL. Alternative polyadenylation of mRNA precursors. Nat Rev Mol Cell Biol. 2017;18:18–30.
Shi Y, Manley JL. The end of the message: multiple protein-RNA interactions define the mRNA polyadenylation site. Genes Dev. 2015;29:889–97.
Legesse-Miller A, Elemento O, Pfau SJ, Forman JJ, Tavazoie S, Coller HA. Let-7 overexpression leads to an increased fraction of cells in G2/M, direct down-regulation of Cdc34, and stabilization of Wee1 kinase in primary fibroblasts. J Biol Chem. 2009;284:6605–9.
Johnson EL, Wang W, Buckles J, Mitra M, Coller HA: Differential gene expression analysis between proliferating and quiescent human dermal fibroblasts. Data sets. GEO GSE117444. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE117444.
Venezia T, Merchant A, Ramos C, Whitehouse N, Young A, Shaw C, Goodell M. Molecular signatures of proliferation and quiescence in hematopoietic stem cells. PLoS Biol. 2004;2:e301.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102:15545–50.
Mootha VK, Lindgren CM, Eriksson KF, Subramanian A, Sihag S, Lehar J, Puigserver P, Carlsson E, Ridderstrale M, Laurila E, et al. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003;34:267–73.
Anders S, Reyes A, Huber W. Detecting differential usage of exons from RNA-seq data. Genome Res. 2012;22:2008–17.
Reimand J, Kull M, Peterson H, Hansen J, Vilo J. g: Profiler--a web-based toolset for functional profiling of gene lists from large-scale experiments. Nucleic Acids Res. 2007;35:W193–200.
Shen S, Park JW, Huang J, Dittmar KA, Lu ZX, Zhou Q, Carstens RP, Xing Y. MATS: a Bayesian framework for flexible detection of differential alternative splicing from RNA-Seq data. Nucleic Acids Res. 2012;40:e61.
Shen S, Park JW, Lu ZX, Lin L, Henry MD, Wu YN, Zhou Q, Xing Y. rMATS: robust and flexible detection of differential alternative splicing from replicate RNA-Seq data. Proc Natl Acad Sci U S A. 2014;111:E5593–601.
Park JW, Tokheim C, Shen S, Xing Y. Identifying differential alternative splicing events from RNA sequencing data using RNASeq-MATS. Methods Mol Biol. 2013;1038:171–9.
Lareau LF, Brenner SE. Regulation of splicing factors by alternative splicing and NMD is conserved between kingdoms yet evolutionarily flexible. Mol Biol Evol. 2015;32:1072–9.
Stoilov P, Daoud R, Nayler O, Stamm S. Human tra2-beta1 autoregulates its protein concentration by influencing alternative splicing of its pre-mRNA. Hum Mol Genet. 2004;13:509–24.
Anko ML, Muller-McNicoll M, Brandl H, Curk T, Gorup C, Henry I, Ule J, Neugebauer KM. The RNA-binding landscapes of two SR proteins reveal unique functions and binding to diverse RNA classes. Genome Biol. 2012;13:R17.
Jumaa H, Nielsen PJ. The splicing factor SRp20 modifies splicing of its own mRNA and ASF/SF2 antagonizes this regulation. EMBO J. 1997;16:5077–85.
Middleton R, Gao D, Thomas A, Singh B, Au A, Wong JJ, Bomane A, Cosson B, Eyras E, Rasko JE, Ritchie W. IRFinder: assessing the impact of intron retention on mammalian gene expression. Genome Biol. 2017;18:51.
Dichmann DS, Walentek P, Harland RM. The alternative splicing regulator Tra2b is required for somitogenesis and regulates splicing of an inhibitory Wnt11b isoform. Cell Rep. 2015;10:527–36.
Sibley CR. Regulation of gene expression through production of unstable mRNA isoforms. Biochem Soc Trans. 2014;42:1196–205.
Wong JJ, Ritchie W, Ebner OA, Selbach M, Wong JW, Huang Y, Gao D, Pinello N, Gonzalez M, Baidya K, et al. Orchestrated intron retention regulates normal granulocyte differentiation. Cell. 2013;154:583–95.
Sibley CR, Emmett W, Blazquez L, Faro A, Haberman N, Briese M, Trabzuni D, Ryten M, Weale ME, Hardy J, et al. Recursive splicing in long vertebrate genes. Nature. 2015;521:371–5.
Sakabe NJ, de Souza SJ. Sequence features responsible for intron retention in human. BMC Genomics. 2007;8:59.
Lianoglou S, Garg V, Yang JL, Leslie CS, Mayr C. Ubiquitously transcribed genes use alternative polyadenylation to achieve tissue-specific expression. Genes Dev. 2013;27:2380–96.
Masamha CP, Xia Z, Yang J, Albrecht TR, Li M, Shyu AB, Li W, Wagner EJ. CFIm25 links alternative polyadenylation to glioblastoma tumour suppression. Nature. 2014;510:412–6.
Xia Z, Donehower LA, Cooper TA, Neilson JR, Wheeler DA, Wagner EJ, Li W. Dynamic analyses of alternative polyadenylation from RNA-seq reveal a 3'-UTR landscape across seven tumour types. Nat Commun. 2014;5:5274.
Mitra M, Swamy VS, Wang W, Buckles J, Coller HA: Genome wide mapping of polyadenylation sites in proliferating and contact-inhibited cells and cells with knockdown of cleavage and polyadenylation factors. Data sets. GEO GSE117121. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE117121.
Elkon R, Ugalde AP, Agami R. Alternative cleavage and polyadenylation: extent, regulation and function. Nat Rev Genet. 2013;14:496–506.
Beaudoing E, Freier S, Wyatt JR, Claverie JM, Gautheret D. Patterns of variant polyadenylation signal usage in human genes. Genome Res. 2000;10:1001–10.
Legendre M, Gautheret D. Sequence determinants in human polyadenylation site selection. BMC Genomics. 2003;4:7.
Millevoi S, Vagner S. Molecular mechanisms of eukaryotic pre-mRNA 3′ end processing regulation. Nucleic Acids Res. 2010;38:2757–74.
Shi Y, Di Giammartino DC, Taylor D, Sarkeshik A, Rice WJ, Yates JR 3rd, Frank J, Manley JL. Molecular architecture of the human pre-mRNA 3′ processing complex. Mol Cell. 2009;33:365–76.
Tian B, Graber JH. Signals for pre-mRNA cleavage and polyadenylation. Wiley Interdiscip Rev RNA. 2012;3:385–96.
Brown KM, Gilmartin GM. A mechanism for the regulation of pre-mRNA 3′ processing by human cleavage factor Im. Mol Cell. 2003;12:1467–76.
Mandel CR, Kaneko S, Zhang H, Gebauer D, Vethantham V, Manley JL, Tong L. Polyadenylation factor CPSF-73 is the pre-mRNA 3′-end-processing endonuclease. Nature. 2006;444:953–6.
Shell SA, Hesse C, Morris SM Jr, Milcarek C. Elevated levels of the 64-kDa cleavage stimulatory factor (CstF-64) in lipopolysaccharide-stimulated macrophages influence gene expression and induce alternative poly(A) site selection. J Biol Chem. 2005;280:39950–61.
Hwang HW, Park CY, Goodarzi H, Fak JJ, Mele A, Moore MJ, Saito Y, Darnell RB. PAPERCLIP identifies microRNA targets and a role of CstF64/64tau in promoting non-canonical poly(A) site usage. Cell Rep. 2016;15:423–35.
Gruber AR, Martin G, Keller W, Zavolan M. Cleavage factor Im is a key regulator of 3' UTR length. RNA Biol. 2012;9:1405–12.
Phatnani HP, Greenleaf AL. Phosphorylation and functions of the RNA polymerase II CTD. Genes Dev. 2006;20:2922–36.
Yao C, Choi EA, Weng L, Xie X, Wan J, Xing Y, Moresco JJ, Tu PG, Yates JR 3rd, Shi Y. Overlapping and distinct functions of CstF64 and CstF64tau in mammalian mRNA 3′ processing. RNA. 2013;19:1781–90.
Li W, You B, Hoque M, Zheng D, Luo W, Ji Z, Park JY, Gunderson SI, Kalsotra A, Manley JL, Tian B. Systematic profiling of poly(a)+ transcripts modulated by core 3′ end processing and splicing factors reveals regulatory rules of alternative cleavage and polyadenylation. PLoS Genet. 2015;11:e1005166.
Mitra M, Nersesian LE, Wang W, Buckles J, Coller HA: To investigate the decay constants (half-lives) of transcript isoforms generated by alternative polyadenylation in proliferating and quiescent cells. Data sets. GEO GSE117121. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE117121
Johnson EL, Robinson DG, Coller HA. Widespread changes in mRNA stability contribute to quiescence-specific gene expression patterns in a fibroblast model of quiescence. BMC Genomics. 2017;18:123.
Werner S, Krieg T, Smola H. Keratinocyte-fibroblast interactions in wound healing. J Invest Dermatol. 2007;127:998–1008.
Rakha EA, Ellis IO. Triple-negative/basal-like breast cancer: review. Pathology. 2009;41:40–7.
Erson-Bensan AE, Can T. Alternative polyadenylation: another foe in Cancer. Mol Cancer Res. 2016;14:507–17.
Shepard PJ, Choi EA, Lu J, Flanagan LA, Hertel KJ, Shi Y. Complex and dynamic landscape of RNA polyadenylation revealed by PAS-Seq. RNA. 2011;17:761–72.
Singh P, Alley TL, Wright SM, Kamdar S, Schott W, Wilpan RY, Mills KD, Graber JH. Global changes in processing of mRNA 3′ untranslated regions characterize clinically distinct cancer subtypes. Cancer Res. 2009;69:9422–30.
Graham RR, Kyogoku C, Sigurdsson S, Vlasova IA, Davies LR, Baechler EC, Plenge RM, Koeuth T, Ortmann WA, Hom G, et al. Three functional variants of IFN regulatory factor 5 (IRF5) define risk and protective haplotypes for human lupus. Proc Natl Acad Sci U S A. 2007;104:6758–63.
Kreth S, Limbeck E, Hinske LC, Schutz SV, Thon N, Hoefig K, Egensperger R, Kreth FW. In human glioblastomas transcript elongation by alternative polyadenylation and miRNA targeting is a potent mechanism of MGMT silencing. Acta Neuropathol. 2013;125:671–81.
Spies N, Burge CB, Bartel DP. 3' UTR-isoform choice has limited influence on the stability and translational efficiency of most mRNAs in mouse fibroblasts. Genome Res. 2013;23:2078–90.
Gruber AR, Martin G, Muller P, Schmidt A, Gruber AJ, Gumienny R, Mittal N, Jayachandran R, Pieters J, Keller W, et al. Global 3' UTR shortening has a limited effect on protein abundance in proliferating T cells. Nat Commun. 2014;5:5465.
de Klerk E, Venema A, Anvar SY, Goeman JJ, Hu O, Trollet C, Dickson G, den Dunnen JT, van der Maarel SM, Raz V, t Hoen PA. Poly(a) binding protein nuclear 1 levels affect alternative polyadenylation. Nucleic Acids Res. 2012;40:9089–101.
Gupta I, Clauder-Munster S, Klaus B, Jarvelin AI, Aiyar RS, Benes V, Wilkening S, Huber W, Pelechano V, Steinmetz LM. Alternative polyadenylation diversifies post-transcriptional regulation by selective RNA-protein interactions. Mol Syst Biol. 2014;10:719.
Geisberg JV, Moqtaderi Z, Fan X, Ozsolak F, Struhl K. Global analysis of mRNA isoform half-lives reveals stabilizing and destabilizing elements in yeast. Cell. 2014;156:812–24.
Protter DSW, Rao BS, Van Treeck B, Lin Y, Mizoue L, Rosen MK, Parker R. Intrinsically disordered regions can contribute promiscuous interactions to RNP granule assembly. Cell Rep. 2018;22:1401–12.
Fu Y, Sun Y, Li Y, Li J, Rao X, Chen C, Xu A. Differential genome-wide profiling of tandem 3' UTRs among human breast cancer and normal cells by high-throughput sequencing. Genome Res. 2011;21:741–7.
Lembo A, Di Cunto F, Provero P. Shortening of 3'UTRs correlates with poor prognosis in breast and lung cancer. PLoS One. 2012;7:e31129.
Aragaki M, Takahashi K, Akiyama H, Tsuchiya E, Kondo S, Nakamura Y, Daigo Y. Characterization of a cleavage stimulation factor, 3′ pre-RNA, subunit 2, 64 kDa (CSTF2) as a therapeutic target for lung cancer. Clin Cancer Res. 2011;17:5889–900.
Jensen MA, Wilkinson JE, Krainer AR. Splicing factor SRSF6 promotes hyperplasia of sensitized skin. Nat Struct Mol Biol. 2014;21:189–97.
Wang ET, Cody NA, Jog S, Biancolella M, Wang TT, Treacy DJ, Luo S, Schroth GP, Housman DE, Reddy S, et al. Transcriptome-wide regulation of pre-mRNA splicing and mRNA localization by muscleblind proteins. Cell. 2012;150:710–24.
Sundaram GM, Common JE, Gopal FE, Srikanta S, Lakshman K, Lunny DP, Lim TC, Tanavde V, Lane EB, Sampath P. ‘See-saw’ expression of microRNA-198 and FSTL1 from a single transcript in wound healing. Nature. 2013;495:103–6.
Davis J, Salomonis N, Ghearing N, Lin SC, Kwong JQ, Mohan A, Swanson MS, Molkentin JD. MBNL1-mediated regulation of differentiation RNAs promotes myofibroblast transformation and the fibrotic response. Nat Commun. 2015;6:10084.
Long W, Yi P, Amazit L, LaMarca HL, Ashcroft F, Kumar R, Mancini MA, Tsai SY, Tsai MJ, O'Malley BW. SRC-3Delta4 mediates the interaction of EGFR with FAK to promote cell migration. Mol Cell. 2010;37:321–32.
Sieg DJ, Hauck CR, Ilic D, Klingbeil CK, Schaefer E, Damsky CH, Schlaepfer DD. FAK integrates growth-factor and integrin signals to promote cell migration. Nat Cell Biol. 2000;2:249–56.
Thrasher AJ. WASp in immune-system organization and function. Nat Rev Immunol. 2002;2:635–46.
Besson A, Gurian-West M, Schmidt A, Hall A, Roberts JM. p27Kip1 modulates cell migration through the regulation of RhoA activation. Genes Dev. 2004;18:862–76.
Lawson CD, Burridge K. The on-off relationship of rho and Rac during integrin-mediated adhesion and cell migration. Small GTPases. 2014;5:e27958.
Lee HN, Mitra M, Bosompra O, Corney DC, Johnson EL, Rashed N, Ho LD, Coller HA. RECK isoforms have opposing effects on cell migration. Mol Biol Cell. 2018:mbcE17120708.
Pollina EA, Legesse-Miller A, Haley EM, Goodpaster T, Randolph-Habecker J, Coller HA. Regulating the angiogenic balance in tissues. Cell Cycle. 2008;7:2056–70.
Mitra M, Ho LD, Coller HA. An in vitro model of cellular quiescence in primary human dermal fibroblasts. Methods Mol Biol. 2018;1686:27–47.
Chomczynski P, Sacchi N. The single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction: twenty-something years on. Nat Protoc. 2006;1:581–5.
Chomczynski P, Sacchi N. Single-step method of RNA isolation by acid guanidinium thiocyanate phenol chloroform extraction. Anal Biochem. 1987;162:156–9.
Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25:1105–11.
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14:R36.
Anders S, Huber W. Differential expression analysis for sequence count data. Genome Biol. 2010;11:R106.
Storey JD, Tibshirani R. Statistical significance for genomewide studies. Proc Natl Acad Sci U S A. 2003;100:9440–5.
Warnes GR, Bolker B, Bonebakker L, Gentleman R, Huber W, Liaw A, Lumley T, Maechler M, Magnusson A, Moeller S, et al: Gplots: various R programming tools for plotting data. R package version 2.14.2, 2014 http://CRAN.R-project.org/package=gplots.
Team RC. R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2014.
Irizarry RA, Wang C, Zhou Y, Speed TP. Gene set enrichment analysis made simple. Stat Methods Med Res. 2009;18:565–75.
Robinson D: GSEAMA: gene set enrichment analysis made awesome. R package version 0.99.0. 2014.http://github.com/dgrtwo/GSEAMA.
Picelli S, Faridani OR, Bjorklund AK, Winberg G, Sagasser S, Sandberg R. Full-length RNA-seq from single cells using smart-seq2. Nat Protoc. 2014;9:171–81.
Gruber AJ, Schmidt R, Gruber AR, Martin G, Ghosh S, Belmadani M, Keller W, Zavolan M. A comprehensive analysis of 3′ end sequencing data sets reveals novel polyadenylation signals and the repressive role of heterogeneous ribonucleoprotein C on cleavage and polyadenylation. Genome Res. Genome Res. 2016;26(8):1145–59. https://doi.org/10.1101/gr.202432.115. Epub 2016 Jul 5.
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550.
Mitra M, Lee HN, Coller HA. Determining genome-wide transcript decay rates in proliferating and quiescent human fibroblasts. J Vis Exp. 2018(131). https://doi.org/10.3791/56423.
Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res. 2009;37:W202–8.
McLeay RC, Bailey TL. Motif enrichment analysis: a unified framework and an evaluation on ChIP data. BMC Bioinf. 2010;11:165.
Ray D, Kazan H, Cook KB, Weirauch MT, Najafabadi HS, Li X, Gueroussov S, Albu M, Zheng H, Yang A, et al. A compendium of RNA-binding motifs for decoding gene regulation. Nature. 2013;499:172–7.
Yao C, Biesinger J, Wan J, Weng L, Xing Y, Xie X, Shi Y. Transcriptome-wide analyses of CstF64-RNA interactions in global regulation of mRNA alternative polyadenylation. Proc Natl Acad Sci U S A. 2012;109:18773–8.
MacDonald CC, Wilusz J, Shenk T. The 64-kilodalton subunit of the CstF polyadenylation factor binds to pre-mRNAs downstream of the cleavage site and influences cleavage site location. Mol Cell Biol. 1994;14:6647–54.
Busch A, Hertel KJ. HEXEvent: a database of human EXon splicing events. Nucleic Acids Res. 2013;41:D118–24.
Pages H, Aoyoun P, Gentleman R, DebRoy S. Biostrings: string objects representing biological sequences, and matching algorithms. In: R package, version 2.40.2 edition; 2016.
Stojnic R, Diez D. PWMEnrich: PWM enrichment analysis. In: R package, version 4.8.2 edition; 2015.
Bembom O: seqLogo: sequence logos for DNA sequence alignments. Vol. R package, version 1.38.0 edition; 2016.
Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(−Delta Delta C(T)) method. Methods. 2001;25:402–8.
Wickam H. ggplot2: elegant graphics for data analysis. New York: Springer-Verlag ed; 2009.
Mitra M, Johnson EL, Swamy VS, Nersesian LE, Corney DC, Robinson DG, Taylor DG, Ambrus AM, Jelinek D, Wang W, Batista SL, Coller HA. Alternative polyadenylation factors link cell cycle to migration. Gene expression omnibus.2018 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE117444. .
Mitra M, Johnson EL, Swamy VS, Nersesian LE, Corney DC, Robinson DG, Taylor DG, Ambrus AM, Jelinek D, Wang W, Batista SL, Coller HA. Alternative polyadenylation factors link cell cycle to migration. Gene expression omnibus. 2018 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE117121. .
Mitra M, Johnson EL, Swamy VS, Nersesian LE, Corney DC, Robinson DG, Taylor DG, Ambrus AM, Jelinek D, Wang W, Batista SL, Coller HA. Alternative polyadenylation factors link cell cycle to migration. Gene expression omnibus 2018. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE117033.
Schneider TD, Stephens RM. Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 1990;18:6097–100.
Schneider TD, Stormo GD, Gold L, Ehrenfeucht A. Information content of binding sites on nucleotide sequences. J Mol Biol. 1986;188:415–31.
Tsurusawa M, Fujimoto T. Cell cycle progression and phenotypic modification of Ki67 antigen-negative G1- and G2-phase cells in phorbol ester-treated Molt-4 human leukemia cells. Cytometry. 1995;20:146–53.
The authors acknowledge all of the members of the Coller laboratory, the hoffmann2 cluster, Jessica Buckles, Suhua Feng, and Marco Morselli for assistance.
HAC was the Milton E. Cassel scholar of the Rita Allen Foundation (http://www.ritaallenfoundation.org). ELJ was supported in part by a National Science Foundation Graduate Research Fellowship DGE-0646086. This work was funded by Institute of General Medical Sciences Center of Excellence grant P50 GM071508, NIH R01 AR070245, PhRMA Foundation grant 2007RSGl9572, National Science Foundation Grant OCI-1047879 to David August, National Institute of General Medical Sciences R01 GM081686, National Institute of General Medical Sciences R01 GM0866465, the Eli & Edythe Broad Center for Regenerative Medicine & Stem Cell Research (Rose Hills and Hal Gaba awards), the Iris Cantor Women’s Health Center/UCLA, the UCLA Clinical and Translational Science Institute Grant UL1TR000124, the Leukemia Lymphoma Society, the Melanoma Research Alliance 564714, NIH 1 R01-CA221296-01A1. AA was supported by a Broad Stem Cell Center fellowship, the Tumor Cell Biology Training Grant T32 CA009056 and a Dermatology Training Grant T32AR071307. HAC is a member of the Eli & Edythe Broad Center of Regenerative Medicine & Stem Cell Research, the Jonsson Comprehensive Cancer Center, the UCLA Molecular Biology Institute, and the UCLA Bioinformatics Interdepartmental Program. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Availability of data and materials
The data that support this study are provided in supplementary tables. All the sequencing data are available at Gene Expression Omnibus data repository under the following accession numbers: GSE117444 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE117444) , GSE117121 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE117121) , and GSE117033 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE117033) .
Ethics approval and consent to participate
Human skin collection was approved by the Princeton University Institutional Review Board protocol number #3134. Informed consent was obtained by the National Disease Research Interchange. Animal experimentation was approved by the UCLA Office for Animal Research, protocol number 2015-033. All experimental methods comply with the Helsinki Declaration.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary figures and supplementary Tables S1-S5. (PDF 6052 kb)
Expression of polyadenylation factors with quiescence. (XLSX 14 kb)
Alternative isoform use with quiescence. (XLS 1401 kb)
Gene Ontology for alternative isoform use. (XLSX 932 kb)
Alternative splicing with quiescence. (XLSX 136 kb)
Gene Ontology for alternative splicing. (XLSX 520 kb)
Polaydenylation site use with quiescence. (XLS 194 kb)
Polyadenylation site use with quiescence for genes with more than two polyadenylation sites. (XLSX 537 kb)
Gene Ontology for alternative polyadenylation. (XLSX 618 kb)
Alternative polaydenylation in knockdown cells. (XLS 309 kb)
Differential expression with quiescence and knockdown. (XLSX 160 kb)
Isoform-specific half-lives with quiescence. (XLSX 40 kb)
About this article
Cite this article
Mitra, M., Johnson, E.L., Swamy, V.S. et al. Alternative polyadenylation factors link cell cycle to migration. Genome Biol 19, 176 (2018). https://doi.org/10.1186/s13059-018-1551-9
- mRNA processing
- Wound healing