Cancer genomics identifies determinants of tumor biology
© BioMed Central Ltd. 2010
Published: 5 May 2010
Skip to main content
© BioMed Central Ltd. 2010
Published: 5 May 2010
Unbiased sequencing and analysis of human tumors is revealing unsuspected somatic changes that, upon further study, are elucidating aspects of tumor biology and identifying new biomarkers.
Whole-genome sequencing and analysis of tumor and matched normal genomes with next-generation sequencing platforms has begun to illuminate commonly mutated genes and transcript-level events that contribute to the underlying tumor biology. To elucidate the role of frequent somatic mutations, the mutant proteins have been biochemically characterized and the results interpreted in terms of the selective advantages these variants may confer on the tumor. Certain somatic alterations have demonstrable prognostic value for specific tumor types in which they commonly occur, although their downstream metabolic signatures may obviate genotyping to identify their mutational status. The metabolic signature is a direct result of the mutation's impact on a given protein/enzyme; therefore, rather than performing sequencing to detect whether a mutation is present, metabolic profiling may be more straightforward, cheaper, and have a lower error rate, for example. New insights into the relationship between a primary tumor and its fatal metastatic disease are also beginning to emerge from genomic comparisons, with the fine detail afforded by next-generation sequencing enabling these comparisons.
The transcriptomes of cancer cells also have their own unique somatic complexities, which often result from structural perturbations to the genome, but can be due to transcription-level events such as alternative splicing, RNA editing or transcript fusion. These types of alterations may explain certain aspects of tumor biology and may also be corroborated by cytogenetic phenomena. In this review, I will describe some tumor-specific alterations that were discovered as a result of analyses of unbiased genome or transcriptome sequencing data (unbiased sequencing does not select for portions of the genome or transcriptome in advance, and the entire genome or transcriptome is therefore surveyed) and then illustrate how these discoveries were pursued further to reveal insights into tumor biology that have enhanced our clinical diagnosis of cancer and our concepts of how best to treat it.
In a 1956 paper , Otto Warburg observed that the predominant mode of energy production in cancer cells was by aerobic glycolysis rather than by oxidation of pyruvate in mitochondria, as in normal cells. This observation led Warburg to postulate that this change in metabolism was a fundamental cause of cancer. Years later, in 1986, Renato Dulbecco opined that studying the cellular genome should be pursued to learn more about cancer, either by taking a 'piecemeal' approach of looking gene by gene, or by sequencing the whole genome .
What are the metabolic consequences of IDH1 mutations? Using the U-87 MG human glioblastoma cell line, Zhao et al.  demonstrated a concomitant reduction in cellular α-KG levels after knocking down endogenous IDH1, and because α-KG is required by prolylhydroxylases, enzymes that hydroxylate and promote the degradation of hypoxia-inducible factor 1α (HIF-1α), the intracellular levels of HIF-1α were also characterized. Zhao et al. showed that when wild-type IDH1 is knocked down by RNA interference using short hairpin RNA, HIF-1α is elevated, and when IDH1 is overexpressed, HIF-1α levels are reduced. HIF-1α is a component of HIF-1, a transcription factor that regulates the expression of genes related to glucose metabolism, angiogenesis, and other signaling pathways by sensing low cellular oxygen levels. By performing quantitative PCR to measure the transcripts of three known HIF-1 target genes - glucose transporter 1 (Glut1), vascular endothelial growth factor (VEGF), and phosphoglycerate kinase (PGK1) - Zhao et al. demonstrated induced expression of these genes as a consequence of either the knockdown of wild-type IDH1 or the expression of the IDH1 R132H mutant. On staining glioma samples for HIF-1α, those tumors with previously identified R132H mutations showed a statistically stronger staining signal than those without mutations. Thus, this body of evidence has demonstrated that when IDH1 is mutated, its function is reduced and the downstream impact of that reduced function (the consequential upregulation of HIF-1α) contributes to the cell's progression to cancer, thereby indicating that a likely function of IDH1 is that of a tumor suppressor gene. Further experimentation is needed to support the claim that IDH2 may be a tumor suppressor gene also.
Building on the initial characterizations of IDH1 mutations in gliomas, Dang et al.  took a metabolomics-based approach to identify further changes in associated metabolite levels when an IDH1 mutation is present . They found 2-hydroxyglutarate (2-HG) to be the only metabolite with significantly increased abundance in cells expressing R132H mutant IDH1. In a clever series of experiments, the increase in 2-HG was shown to result from the NADPH-dependent reduction of α-KG by mutant IDH1, a new function that is enabled by the mutation at R132. The authors demonstrated a similar gain of function for the R132C, R132L and R132S mutations. Their X-ray crystallographic studies showed that the R132H mutation in IDH1 results in the formation of an active site distinct from that of the wild-type enzyme. With the aim of improving diagnostic efficacy, Dang et al. examined 12 GBM tumors with various R132 mutations in IDH1, and found 2-HG levels 100-fold greater or more than in tumors with wild-type IDH1; the measured decrease in α-KG was, however, not statistically different in mutant versus wild-type IDH1 tumors. This finding indicates that in the clinic, detecting patients with increased 2-HG levels would identify GBMs with IDH1 mutations, predicting an overall longer survival time. Indeed, since secondary GBMs develop from lower-grade gliomas, therapeutic inhibition of 2-HG production might slow the transition time to GBM development, offering an improved survival benefit as a result.
In our laboratory we have been using a whole-genome shotgun approach to sequence tumor genomes. In the second case of acute myeloid leukemia (AML) we sequenced, we discovered an IDH1 R132 mutation that was subsequently found in about 8% of our 187 banked AML patient samples, showing that this mutation is not restricted to gliomas . In these tumor genomes, we detected R132C, R132H and R132S substitutions, with R132C being most common (8 of 16). PCR assays designed to detect IDH2 R172 mutations in the 188 AML samples did not reveal any. Correlation analyses of clinical data and mutational status for AML patients with IDH1 mutations indicated that the presence of IDH1 mutations, in cases with normal cytogenetics and with wild-type nucleophosmin-1 (NPM1), a commonly mutated gene in AML that has been known for some time, predicted a worse prognosis, although we did not reach statistical significance with our cohort. Soon after, Schnittger et al.  reported an analysis of a cohort of 999 AML patients, finding that IDH1 mutations were frequent (in 9.3% of samples), that the R132C mutant was the most common, and that in the presence of wild-type NPM1 and intermediate cytogenetics (a cytogenetic evaluation of the leukemia cells revealing no clues as to the patient's prognosis), patients with IDH1 mutations had a significantly unfavorable prognosis (P = 0.038).
A subsequent study by Gross et al.  examined an additional 145 AML biopsies, identifying 11 IDH1 R132 mutant samples . Four IDH1-mutant primary samples had relapse samples that also carried the IDH1 mutation. AML cells carrying the R132 mutant of IDH1 were found by gas chromatography-mass spectrometry to have 2-HG levels around 50-fold greater than in samples with wild-type IDH1. Similarly, higher 2-HG levels were detected in sera from patients positive for the IDH1 R132 mutation. Two wild-type IDH1 samples had elevated 2-HG levels and were found to be carrying IDH2 R172 mutations, the first report of these in AML. Importantly, this paper reinforced the fact that metabolite screening rather than mutational screening can be an important diagnostic approach for the detection of elevated 2-HG levels and, by inference, IDH1/2 mutations. Because of the apparent predominance in AML of the IDH1 R132C mutation over R132H (which is more predominant in gliomas), Gross et al.  looked at the kinetics of the R132C mutant enzyme. The R132C enzyme showed a dramatic loss of affinity for isocitrate (resulting in a reduction in KM) and a drop of more than six orders of magnitude in net efficiency (Kcat/KM) of isocitrate metabolism.
In another recent study, our understanding of IDH mutations and their detection has been extended. Ward et al.  have determined that the gain of function seen in the IDH1 R132 mutants (that is, the ability to reduce α-KG) is also found in the IDH2 R172K mutant. Metabolic profiling of cells expressing IDH2 R172K revealed an approximately 100-fold increase in intracellular 2-HG compared with cells overexpressing wild-type IDH2, and this finding was extended to leukemia cells carrying the IDH2 R172K mutation. Ward et al.  also screened AML samples with normal cytogenetics but unknown IDH mutational status for increased levels of 2-HG, and then evaluated the mutational status based on the result of the screening assay. In this test, 2-HG measurement was found to predict mutational status with high accuracy. In addition, a new IDH2 mutation, R140Q, was identified in five samples. In a second evaluation of 78 AML samples, IDH2 R140Q mutations were found to be more frequent than either IDH1 R132 mutations or IDH2 R172K .
Despite some differences among sample sets, this body of work, aiming to characterize the impact of IDH mutations on tumor cell biology has led to the conclusion that all mutations discovered so far enable a gain of function in α-KG reduction with a concomitant increase in the tumor-specific metabolite, or oncometabolite, 2-HG. Although the contribution of 2-HG to tumor cell biology remains speculative, Ward et al. noted that all IDH mutation-containing tumor types identified so far (leukemias and gliomas) are distinguished by proliferation of a relatively undifferentiated cell population, and in this context the effect of 2-HG in the tumor and its microenvironment is to block cellular differentiation .
One intriguing aspect of cancer genomics, for which published examples are few, involves comparing genome-wide alterations between the matched primary and metastatic cancer genomes from the same patient as a way of elucidating both their inter-relationships and the metastatic process. The first such study, by Shah et al. , focused whole-genome and whole-transcriptome sequencing on a metastatic tumor genome from an estrogen-receptor-positive, invasive lobular breast cancer that occurred 9 years after the patient's initial diagnosis and treatment. After a combined analysis to identify somatic mutations in both the genomic and transcriptomic data, the primary tumor taken 9 years earlier and the matched blood normal genomes were queried in the light of these findings. The aim was to search the primary tumor genome for the 30 mutations that had been found and validated as tumor-associated in the metastatic genome, and to establish the somatic or germline nature of the variants by comparing their occurrence in blood cells. Variants that were found in the primary tumor data were deeply sampled by sequencing to provide an estimate of the allele frequency for the somatic mutation in the primary tumor. Because of the 9-year interval between diagnosis of the primary tumor and metastasis, significant differences in mutational load were present; only 3 of the 28 tested mutations were prevalent in the primary cancer cells, 6 had an allele frequency of 1 to 13%, and 19 were not detected and were therefore metastasis-specific. Genes found in this analysis to have somatic point mutations were tested for frequency of mutation in 192 breast cancers, revealing that both the gene for the receptor kinase HER2 (3 of 192) and for the HAUS augmin-like complex, subunit 3 (HAUS3); 2 of 192) were mutated . A second interesting finding from this study was the detection of two nonsynonymous variants that were introduced by RNA editing, perhaps the first description of this phenomenon from next-generation sequencing data and a strong testament to the importance of transcriptomic data in broadening the range of variant discovery from cancer genomics.
A recent study by our laboratory used next-generation whole-genome resequencing, analysis and comparison of the genomes of a matched primary breast tumor, metastatic brain tumor, and blood normal from an African-American patient with basal subtype breast cancer  This estrogen-receptor-negative tumor represents one the most aggressive types of breast cancer, and in this patient only 8 months elapsed between diagnosis of the primary tumor and a diagnosis of metastatic disease. We also sequenced the genome of a human-in-mouse xenograft  passage of the patient's primary tumor, taken by core biopsy procedure before adjuvant chemotherapy and placed into the fat pad of a NOD/SCID female mouse. Briefly, we found 48 mutations shared across all 3 tumors, and only 2 metastasis-specific point mutations. Two mutations, identified in this analysis, in the kinase JAK2 (affecting the JAK-STAT signaling pathway) and in the gene CSMD1 (CUB and Sushi multiple domains 1; loss of CSMD1 expression is associated with poor survival in invasive ductal breast carcinoma ) were also found mutated in other types of breast cancer samples. Interestingly, we established that the frequency of 16 mutations in the primary tumor cell population was lower than 10% but rose to very high frequency in both the metastatic and xenograft samples.
At the interface between variant discovery and the transition from primary to metastatic disease lies the possibility of identifying personalized biomarkers that might enable an oncologist to monitor the progression or remission of a cancer. This approach has been elegantly developed by Leary et al. , who utilized long-distance mate-pair sequencing (a variation of paired end sequencing involving circularization of long DNA fragments (>1 kb) and ligation to an internal adapter DNA of known sequence) of an approximately 1.4 kbp region on the SOLiD platform to detect and characterize tumor-specific structural variations in two colon and two breast tumors . Each sample was found to carry at least four validated somatic rearrangements that were then used to design PCR assays by which they could be detected. Patients' sera were assayed by PCR and this revealed that the amount of DNA circulating in plasma was sufficient to detect the tumor DNA rearrangements. In one case, serum samples were taken before and after tumor resection, and the levels of tumor-specific biomarker DNA in the plasma mirrored these procedures. This remarkable demonstration may transform our clinical approach to monitoring the course of cancer with minimally invasive methods.
Several groups now have pioneered efforts at unbiased transcriptome discovery using next-generation sequencing of mRNA (RNA-seq) from the tumor cell population and a variety of approaches to analyze the data. Although algorithmically complex to detect, there are a number of unique transcription-level processes that modify the encoded genome, including alternative splicing, RNA editing, and the formation of chimeric transcripts. Maher et al. have published two studies [23, 24] that illustrate the development and implementation of RNA-seq analytical approaches to discover novel fusion transcripts in prostate tumors, which are often regulated by androgen levels. Initially, a dual-platform strategy combined longer read-length RNA-seq data from the Roche/454 platform with shorter RNA-seq fragment reads from the Illumina platform, and resulted in the discovery of a novel chimeric transcript, SLC45A3-ELK4, in prostate tumor samples . The second approach  took advantage of Illumina paired-end RNA-seq data and a different algorithmic filtering of mapped paired ends to identify putative chimeric transcripts. When combining these read pairs with unmapped reads that span the fusion boundary, fusion transcripts previously identified in prostate cancer cells, such as TMPRSS2-ERG, were detected, as were novel fusion transcripts such as HERPUD1-ERG. These discoveries not only enhance our understanding of fusion transcripts in cancer, but have led to experiments to interrogate the role of hormonal signaling by androgens in inducing chromosomal movements that bring two genes that participate in a detected fusion event into close proximity .
In a recent study, Shah et al.  evaluated the transcriptomes of four adult granulosa-cell tumors (GCTs) of the ovary, identifying putative variants involved in tumorigenesis by shared analysis of all four tumors. A single missense point mutation in FOXL2 was identified (C134W), and was subsequently found in an additional 86 out of 89 cases of adult GCTs. The gene was not mutated in other ovarian tumors of different types, nor in breast cancers that were tested. FOXL2 is a transcription factor in the forkhead-winged-helix family and is required for the normal development of granulosa cells. Although loss-of-function mutations in FOXL2 have been described for germline genomes, this was the first description of FOXL2 somatic mutations in ovarian tumors. We await the results of downstream functional studies, which will be required to reveal the impact of the cysteine-to-tryptophan amino acid change on the activity of this transcription factor, as well as altered transcription of genes bound by FOXL2.
In conclusion, cancer genomics, largely due to the unbiased and comprehensive nature of data that can be produced by next-generation sequencing platforms, is being applied to unravel the DNA- and RNA-level somatic alterations that determine tumor development and progression. It has been remarkable to see the pursuit of enzymatic, biochemical, functional and diagnostic implications of the earliest discoveries afforded by these methods. Hopefully, these important efforts will scale to accommodate the wave of next-generation-based discovery that is imminent, and the ultimate benefactors of our enhanced knowledge will be the patients and families whose lives are touched by this disease.