Chipping away at 'stemness'
© BioMed Central Ltd 2004
Published: 29 July 2004
Skip to main content
© BioMed Central Ltd 2004
Published: 29 July 2004
Global gene-expression analyses of human embryonic stem cells confirm the involvement of some known genes in stem-cell function and identify some new candidate regulators of stem-cell growth. Support remains elusive, however, for the concept of 'stemness' - a pattern of expression of genes that is common to all stem cells.
Embryonic stem (ES) cells are truly remarkable because of their ability both to renew themselves and to give rise to all the cells in the body. The ability to generate a wide range of differentiated cell types defines them as pluripotent. Human ES cells could be an incredible resource for the treatment of human disease. Differentiated cells derived from them could potentially be used to treat a wide variety of human conditions, including (but not limited to) heart disease, diabetes, stroke, Alzheimer's disease and Parkinson's disease. Several major hurdles remain to be overcome if such cells are to be used clinically, however. Stated simply, we know very little about their basic physiology or their true potential. What factors are required for their survival and proliferation? What factors can maintain them in a pluripotent state? What growth conditions affect their differentiation?
One way to address these questions is to analyze gene expression in ES cells. The goal is straightforward: if we can identify the full panoply of genes expressed in human ES cells and compare this with data from other stem-cell and non-stem-cell populations, it might be possible to define what makes ES cells unique. Such genes might be the ones that maintain ES cells in a self-renewing, pluripotent state. Knowledge of the genes expressed in ES cells could also have some very practical uses. For example, knowing that human ES cells express certain growth factor receptors could help in devising strategies to improve the growth of the cells in culture. Analyzing gene expression in human ES cells could provide critical insights into the cell-surface receptors involved in growth control, cell-substrate adhesion and cell-cell adhesion in these and other cell types, and into intracellular signaling pathways involved in their basic physiology.
Comparison of different methods and approaches for the analysis of human ES-cell gene expression
Sperger et al. 
Sato et al. 
Richards et al. 
Abeyta et al. 
Zeng et al. 
Human ES-cell lines used
H1, H7, H9, H13, H14
H9, HSF-1, HSF-6
Method of ES-cell isolation
Treatment with collagenase until colonies lifted off the MEFs
Treatment with dispase until cells were free of MEFs
Microdissection to free colonies of MEFs
Mechanical dissection of colonies from MEFs, then collagenase treatment
Affymetrix arrays (hU133A and mouse U74Av2)
Affymetrix arrays (hU133A and hU133B)
Custom 16,659-spot 70-bp oligonucleotide array
hEC, hES and seminoma
hES and published mES 
hES and hES and additional SAGE libraries 
hES and hES versus published mES 
hES and hES versus mES 
Primary subtraction method
Somatic and cancer cell lines
Differentiated hES cells
Pooled human RNA
Significance analysis of microarrays (SAM) 
dChip and MAS 4.0 (Affymetrix)
Comparison of two SAGE resources with SAGE 2000 [26,27]
MAS 5.0 (Affymetrix)
Gene Pix (Axon Instruments)
Number of genes enriched in human ES cells
Candidate pluripotency genes*
Confirmation of gene expression using
The data generated by the five groups [1–5] provide important confirmation of old ideas as well as providing support for some newly emerging concepts. For example, compelling evidence from studies carried out in mice  had previously demonstrated a critical role for the POU domain transcription factor Oct4 (also called Pou5F1) in controlling the stability of the pluripotent state . Encouragingly, all the studies [1–5] found high levels of expression of this factor in human ES cells. Also included in the list of genes enriched in human ES cells is the human homolog of the mouse Nanog gene. Nanog was recently found to be important for maintaining pluripotency in mouse ES cells [7, 8]; deciphering the role of NANOG in human ES cells will therefore be of great interest. Interestingly, components of the leukemia inhibitory factor (LIF) pathway do not appear to be enriched in human ES cells, consistent with the observed lack of response by human ES cells to LIF, despite the fact that activation of the LIF pathway allows self renewal of mouse cells. Two additional enriched genes encode members of the transforming growth factor superfamily, LeftyA and LeftyB. These factors have been generally overlooked in stem-cell studies but now clearly warrant more attention. New insights into ES cell biology will emerge from analysis and functional testing of the candidates.
Comparison of the datasets from different human ES-cell lines suggests that each line has a unique gene-expression profile. For example, the data from Abeyta et al.  show differences between lines: more than 20% of the approximately 14,000 genes found were expressed in only one of the three lines examined (see Figure 1d). What accounts for these differences in gene expression among human ES-cell lines? One possible explanation is genetic diversity. The genetic diversity present in the human population, represented to some degree in the available human ES cells, could affect gene-expression profiles among the cell lines. The expression of a gene can be affected by genetic polymorphisms in the gene itself or in modifier loci. The effects of modifier loci are well known to mouse geneticists and certainly act in human populations as well . The growing list of available human ES-cell lines could be used to address the role that genetic diversity plays in regard to differential gene expression .
Genetic diversity may be part of the answer but is not likely to be the whole story. Differences in the purity of the cell populations used as the source of mRNA in the different studies could also account for some of the variation observed. The optimal conditions for maintaining human ES cells in an undifferentiated state are still not fully defined. For this reason, it seems possible that some of the variation in gene expression among human ES-cell lines may arise because the mRNA pools that were used contained not only ES-cell mRNA but also mRNA from differentiated derivatives. It is also worth noting that the behavior of the various cell lines can be quite different. Some lines are amenable to bulk culture, while others are not; some lines can be grown in feeder-independent culture, while others cannot; and some lines spontaneously differentiate, while others do not [11, 12]. Culture conditions can affect gene imprinting, which in turn could have dramatic effects on gene expression. Variation in gene-expression patterns among human ES-cell lines could also reflect differences in the developmental stage represented by each line. If the cell lines are developmentally different - even slightly - then different gene-expression profiles could be observed. Indeed, REX1, an Oct4 target gene, and FOXD3, a gene required for development of pluripotent ES cells in mice, were enriched in some ES-cell lines but not others.
One of the most challenging - yet crucial - unanswered questions regarding human ES cells is how they can be maintained in a pluripotent state. By comparing the gene-expression profiles of pluripotent stem cells with those of other cell types, several of the groups [1–5] attempted to find genes whose expression could be unique to pluripotent stem cells; some have termed these the MOESMstem-cell orchestra" . One notable inclusion in the list of genes expressed by all human ES-cell lines is that encoding the transcription factor Oct4 mentioned above, consistent with its known role in maintaining pluripotency. Several other genes that have been associated with pluripotent cells are found in many of the datasets, including genes encoding Sox2, Nanog, Foxd3 and members of extracellular signaling pathway families including fibroblast (FGF), Wnt and bone morphogenetic protein (BMP).
All the studies [1–5] included lists of candidate genes that may play an important role in self-renewal and/or pluripotency. One perplexing observation is the lack of overlap among the gene lists. In part, this could result from the different subtraction methods used to removed genes expressed in non-ES cells from the datasets (see Table 1). Although there is unlikely to be one 'correct' method of subtraction, the differences in approach undoubtedly make comparison of results complicated. Analysis is further complicated by the need to set a level of gene expression that is considered enriched or not enriched. In most of the studies [1, 2, 4, 5], the authors chose to make the cut-off at two-fold to 3.5-fold; genes that show expression differences above these cut-offs were defined as enriched. It seems likely, however, that some of the most important players in the stem-cell orchestra may play a very soft tune. Indeed, studies have shown that changes in Oct4 expression of less than two-fold can have profound effects on mouse ES-cell fate . The arbitrary cut-off levels set by the authors [1, 2, 4, 5] could exclude some of the key genes. Microarray analysis is also hampered by the 'lamp-post effect' - that is, only being able to see things where the light shines (microarray analysis excludes genes not present on the array). Thus, the studies that used microarray analysis [1, 2, 4, 5] were able to identify only the genes with enriched expression in human ES cells that are present on the arrays. Importantly, one of the studies  used serial analysis of gene expression (SAGE). This technique avoids both the need for an arbitrary cut-off to define transcript enrichment and the lamp-post effect. It enables both quantitative characterization of gene expression and identification of novel splice variants, novel exons and novel genes [3, 15].
In the last few years, the concept of 'stemness' has been introduced - the idea that different stem-cell populations, whether pluripotent or not, will share gene-expression signatures [16, 17]. Can global gene-expression analysis provide new insights into this concept? It is difficult to answer this from the existing reports [1–5]. One idea that needs to be considered is that some of the genes that may be required to establish stemness may not be required to maintain that state. It also seems quite plausible that stem cells use pathways and molecules that are used elsewhere in non-stem-cell populations. A case in point is the STAT3 gene, whose activity is sufficient for the maintenance of murine ES cells as stem cells. Activation of STAT3 in mouse ES cells is brought about by LIF acting through gp130. This pathway is certainly not unique to stem cells, however, and plays important roles in many other aspects of development and adult homeostasis.
The recent gene-expression studies [1–5], nevertheless, provide a rich resource to gain important new insights into the physiology of human ES cells and provide an enormous amount of information that deserves our attention. But therein lies the difficulty: using microarray and SAGE approaches to examine gene expression is relatively easy, but deciphering gene function can be very difficult. How can gene function in human ES cells be determined? Altering gene expression by transfection or homologous recombination in human ES cells is technically difficult. Homologous recombination has been achieved in human ES cells but, to date, there is only one published report of its use . Moreover, some studies on gene function may require targeting of both alleles of a gene, a task that is wrought with difficulties even in mouse ES cells. Introducing genes into human ES cells with lentiviral vectors has been demonstrated and promises to be a workable technique [19–21]. RNA interference does work in human ES cells, but what is not yet clear is whether the effects are stable and whether true nulls (with no expression) are produced in addition to hypomorphs (with reduced expression) [2, 23]. Nonetheless, such an approach may be useful for an initial examination of gene function.
Incremental advances in growing human ES cells in an undifferentiated state are likely to follow from the recent gene-expression studies [1–5]. The technology for manipulating the cells will improve, and this in turn will aid our ability to carry out analysis of gene function in human ES cells. Gradually we will begin to understand what gives these cells their unique properties, gene by gene. It may seem tedious, but consider the alternatives. If human ES cells had been discovered a decade earlier, some of the approaches now used would not have been available. The sequencing of the human genome has allowed us to examine the complexity of gene expression in a way that was inconceivable ten years ago. Discoveries made using additional genome-wide approaches may finally allow us to see beyond the light cast by microarrays and deeper into the mysteries of stemness.
We thank Robert W. Clouse III for help in preparing the figure and gratefully acknowledge support from the National Institutes of Health and Johns Hopkins University.