Skip to main content

Combined image and genomic analysis of high-grade serous ovarian cancer reveals PTEN loss as a common driver event and prognostic classifier



TP53 and BRCA1/2 mutations are the main drivers in high-grade serous ovarian carcinoma (HGSOC). We hypothesise that combining tissue phenotypes from image analysis of tumour sections with genomic profiles could reveal other significant driver events.


Automatic estimates of stromal content combined with genomic analysis of TCGA HGSOC tumours show that stroma strongly biases estimates of PTEN expression. Tumour-specific PTEN expression was tested in two independent cohorts using tissue microarrays containing 521 cases of HGSOC. PTEN loss or downregulation occurred in 77% of the first cohort by immunofluorescence and 52% of the validation group by immunohistochemistry, and is associated with worse survival in a multivariate Cox-regression model adjusted for study site, age, stage and grade. Reanalysis of TCGA data shows that hemizygous loss of PTEN is common (36%) and expression of PTEN and expression of androgen receptor are positively associated. Low androgen receptor expression was associated with reduced survival in data from TCGA and immunohistochemical analysis of the first cohort.


PTEN loss is a common event in HGSOC and defines a subgroup with significantly worse prognosis, suggesting the rational use of drugs to target PI3K and androgen receptor pathways for HGSOC. This work shows that integrative approaches combining tissue phenotypes from images with genomic analysis can resolve confounding effects of tissue heterogeneity and should be used to identify new drivers in other cancers.


High-grade serous ovarian carcinoma (HGSOC) is the most common type of ovarian cancer and accounts for the majority of mortality from the disease. However, overall survival has been virtually unchanged since the introduction of platinum-based treatments [1]. HGSOC is characterised by ubiquitous mutation of TP53 [2] and high prevalence of BRCA1 and BRCA2 germ-line mutations. With the exception of these genes, little is known about other prevalent driver events, and BRCA1/2 and PR are the only robustly validated prognostic markers [3],[4]. HGSOC has genomic similarities with basal-like breast tumours, which are also characterised by TP53 and BRCA1 alterations but additionally have PTEN loss [5]–[7]. Since PTEN loss is an important early initiating event in BRCA1-associated basal-like breast tumours [8], we hypothesised that it could also be a driver event in HGSOC.

PTEN is a phosphatase that inhibits cell proliferation induced by the PI3K pathway and acts as a tumour suppressor gene [9]. Targeted deletion of PTEN has been used to modulate the initiation of HGSOC and endometrioid ovarian cancer (EOC) in mouse models [10]–[13], but it is unknown whether PTEN loss could initiate or drive the progression of HGSOC in humans. The Cancer Genome Atlas (TCGA) study on genetic and epigenetic alterations in 489 cases of HGSOC confirmed TP53 mutation and BRCA1 downregulation as the main driver events and identified PTEN alterations in only 7% of tumours [4]. However, other immunohistochemistry-based studies in smaller cohorts found much higher frequencies of PTEN alterations, with loss of PTEN expression in 15% and partial loss in 50% to 60% of cases [14]–[16].

HGSOC has previously been stratified into distinct molecular subgroups based on gene-expression profiles: proliferative, differentiated, immunoreactive and mesenchymal [4],[17],[18]. However, the clinical utility of these classifiers is unclear, particularly as individual HGSOC samples may express multiple subtype signatures and the signatures show strong effects from stromal factors [18]. These signatures are likely to be driven by cell-autonomous effects such as BRCA1 mutation (immunoreactive subtype) and the Let-7 pathway (mesenchymal subtype) [19],[20]. Identification of other dominant cell-autonomous drivers therefore requires deconvolution of stromal signatures from those of carcinoma cells. Joint analysis of tissue images and genomic profiles has only recently been introduced to study these effects, and reveals information that cannot be attained from genomic data alone [21].

We hypothesised that PTEN loss might be more frequent than observed in the TCGA data set owing to confounding by samples with high stromal content. Here, we have developed bioinformatic and image analysis methods to correct gene expression signatures in the TCGA HGSOC data and tested these predictions in two independent cohorts of HGSOC cases.


Estimation of PTENexpression in high-grade serous ovarian carcinoma is strongly influenced by stromal content

We evaluated the stromal content of 216 HGSOC samples from TCGA in a total of 302 images using a computational framework validated through scoring by an independent observer (Jonckheere–Terpstra test for trend P=0.001) (Figure 1A and Additional file 1: Figure S1). The automated stromal scores were highly correlated with the expression of genes from a published stromal gene signature (Figure 1B) [22]. ACTA2 ranked 17 in the top correlated stromal genes and was therefore selected for subsequent analysis on the basis of its known stromal-specific expression (Figure 1C) [23].

Figure 1
figure 1

PTEN expression in TCGA samples correlates with ACTA2 expression, and thus stromal content. (A) Example of H&E stained sections from TCGA samples having low and high stromal content. The stromal content detected using the segmentation algorithm is shown in green. (B) Average expression of combined stromal signature correlated well with automated quantification in (A) (r=0.397, N=216 patients). (C) Univariate correlation testing of stromal genes from Yoshihara with image analysis showing that ACTA2 is highly correlated with stromal content (r=0.306) [22]. (D) Scatter plot showing the distribution of PTEN vs ACTA2 mRNA expression for each sample (Pearson correlation, r=0.152; P<0.001) with corresponding box plots showing distribution of PTEN expression in different ACTA2 expression quartiles. There is higher PTEN expression with high ACTA2 expression (Jonchkeere Terpsa test, P=0.001). (E) Heat maps of top 50 differentially expressed genes between PTENhigh and PTENlow tumours in the TCGA data set before and after selecting for ACTA2 low samples. In each case, the top quartile of PTEN expression was labelled PTENhigh, and the lowest quartile PTENlow. Correction for ACTA2 content reveals AR as one of the top differentially expressed genes. (F) Stromal gene set enrichment plots after differential expression analysis between high and low PTEN. Stromal-related genes from the Yoshihara signature (141 genes, highlighted in red) are redistributed [22]. There is less enrichment for stromal-related genes after correcting for stroma content (enrichment score 0.5 to 0.1). Dotted lines indicate adjusted P=0.05.

High ACTA2 expression in the TCGA samples was directly correlated with PTEN expression and was never associated with low PTEN values, suggesting that in the majority of samples it was stromal PTEN expression that was being measured (Figure 1D).

Differential gene analysis comparing the upper and the lower quartiles of PTEN expression showed enrichment for stromal genes in tumours with high PTEN (Gene Set Enrichment Analysis (GSEA) Enrichment Score = 0.5). However, performing the analysis on samples with low ACTA2 content (the first quartile) showed a more random distribution of stromal genes (GSEA ES=0.1), suggesting this subset is less influenced by stromal content (Figure 1E,F). The wider distribution of PTEN expression in quartile one of ACTA2 expression also supports the hypothesis of tumour PTEN loss being more prevalent than previously estimated (Figure 1D).

PTENloss is prevalent and has prognostic value in high-grade serous ovarian carcinoma

To test the predictions that reduced PTEN expression could be a frequent event, we developed methods to quantify tumour-specific PTEN expression using a semi-quantitative immunofluorescence (IF) procedure. We applied this to tissue microarrays constructed from the population-based Study of Epidemiology and Risk Factors in Cancer Heredity (SEARCH) cohort (N=245 HGSOCs from 516 ovarian cancer samples; Table 1) [3]. For HGSOC, PTEN expression was variable and showed a range of intensities, from negative (23%;N=49), weak positive (32%;N=68) to strongly positive fluorescence (23%;N=49). Heterogeneous fluorescence was also observed in 48 samples, 22% of the cases (Figure 2A).

Figure 2
figure 2

PTEN loss is highly prevalent in HGSOC and is associated with poor prognosis. (A) Examples of immunofluorescence (IF) and immunohistochemistry (IHC) staining in ovarian cancer samples for scoring of PTEN expression. (B) Kaplan–Meier survival curves for PTEN positive vs PTEN negative, weak or heterogeneous staining for HGSOC from the SEARCH study (using IF) (multivariate hazard ratio 1.8, 95% CI 1.0 to 3.0, P=0.03) and (C) Kaplan–Meier survival curves for PTEN positive vs PTEN negative, weak or heterogeneous stainings for HGSOC from the Nottingham Ovarian Cancer Study (using IHC) (multivariate hazard ratio 1.8, 95% CI 1.2 to 2.6, N=228, P=0.002). CI, confidence interval; IF, immunofluorescence; IHC, immunohistochemistry.

Table 1 Summary of SEARCH cohort

Reduced PTEN fluorescence (including negative, weak or heterogeneous expression) was associated with significantly worse survival for HGSOC compared with positive fluorescence, independent of study site, age, stage and grade (hazard ratio 1.8, 95% confidence interval (CI) 1.0 to 3.0, P=0.03; Figure 2B, Additional file 2: Table S1).

Comparison of immunohistochemistry (IHC) for PTEN to the immunofluorescent assay showed strong correlation (P0.001; Additional file 3: Figure S2A). We therefore used IHC to extend the initial analysis in an independent validation cohort of incident ovarian cancer cases (N=276 HGSOC cases from 507 ovarian cancer samples; Table 2). Reduced PTEN expression was associated with significantly worse survival for HGSOC compared with positive expression, independent of study site, age, stage and grade (hazard ratio 1.8, 95% CI 1.2 to 2.6, N=228, P=0.002) (Figure 2C, Additional file 2: Table S2). Combined analysis of both data sets was associated with a multivariate hazard ratio 1.5 (95% CI 1.1 to 2.0, N=439, P=0.006; Additional file 3: Figure S2B) for reduced PTEN expression.

Table 2 Summary of Nottingham cohort

We examined for interactions between BRCA1 and PTEN loss of expression as PTEN loss is a frequent initiating event in BRCA1-associated breast tumours and is associated with basal-like breast cancer [8]. All patients with a deleterious germ-line BRCA1 mutation had HGSOC tumours and were marginally more likely to have negative or weak PTEN staining (Fisher’s exact test P=0.06, N=9/10 and Additional file 2: Table S1).

PTENis frequently deleted in high-grade serous ovarian carcinoma

TP53 is mutated in more than 95% of HGSOC cases [2],[4] and has been previously implicated in controlling PTEN transcription [24]. We tested for TP53 effects on PTEN expression by introducing a bacterial artificial chromosome transgene containing the entire human TP53 locus into the TP53-null cell line SKOV3. PTEN expression was independent of TP53 complementation for both wild-type and mutated (R175H, R273H) transgenes (Figure 3A). Comparison of promoter methylation and expression of PTEN in the TCGA data set showed infrequent methylation of the PTEN promoter region in low-PTEN expressing cases (Figure 3B).

Figure 3
figure 3

PTEN loss in HGSOC mainly occurs through copy number alteration (CNA). (A) Western blot showing the quantification of PTEN, TP53 and GAPDH (loading control) from extracts of the ovarian cancer SKOV3 cell line and complemented with wild-type and mutated TP53. (B) Box plot showing distribution of methylation levels according to PTEN expression levels in TCGA samples. A small increase in DNA methylation is observed in samples with lower PTEN expression (Wilcoxon test P0.001). (C) Pie chart showing distribution of PTEN ploidy within the TCGA data set. (D) Box plot showing distribution of PTEN mRNA expression according to PTEN ploidy, suggesting that CNA influences mRNA expression (t-test, diploid vs hetloss P0.001). (E) Example of IHC staining for PTEN scored as positive in [14], but reclassified here as weak staining, as staining in tumour cells is markedly reduced in comparison to stromal cells. (F) Table showing differences between scoring for PTEN IHC staining used [14] and reclassified score. Of the original 36 positive samples, 21 have been reclassified as heterogeneous or weakly positive. (G) Box plot showing distribution of PTEN mRNA expression within each scoring group for PTEN IHC staining, according to the Hanrahan score [14] and the reclassified score. There is a significant difference between PTEN expression in tumours classified as weak positives and positives (t-test, P=0.002). (H) Contingency table showing significant correlation between scores for PTEN IHC staining and PTEN CNA (Fisher’s exact test, P0.001). CNA, copy number alteration; IF, immunofluorescence; GAPDH, glyceraldehyde 3-phosphate dehydrogenase; hetero, heterogeneous; IHC, immunohistochemistry; TCGA, The Cancer Genome Atlas.

By contrast, loss of a single PTEN allele was common in HGSOC (36%; N=174) in addition to the previously described homozygous deletion occurring in 6% of tumours [4] (Figure 3C). PTEN gene expression was significantly lower when there was loss of at least one allele (Figure 3D; t-test, P0.001). To assess whether protein expression of PTEN was associated with copy number, we applied our IHC staining classification to a subgroup of 51 samples from the TCGA cohort that has been recently stained for PTEN [14]. A large number of positive samples were reclassified as weak positive (Figure 3E,F and Additional file 2: Table S3). These tumours had similar levels of PTEN mRNA to those with heterogeneous staining, and significantly lower levels than tumours with positive staining (Wilcoxon test, P=0.002; Figure 3G), emphasising the importance of differentiating intensities in PTEN staining. Negative staining was strongly correlated with homozygous deletion, and weak or heterogeneous staining with hemizygous loss; positive staining was associated with no chromosomal loss (Fisher’s exact test, P=0.001; Figure 3H).

Androgen receptor expression is associated with PTENexpression

We analysed PTEN differentially expressed genes for TCGA samples with low ACTA2 content to mitigate the effect of stromal contamination. AR was one of the top differentially expressed genes (Figure 1E), and this was confirmed using an orthogonal method, csSAM, which takes into account stromal content from H&E images (Additional file 4).

Protein–protein interaction data from TCGA available through cBioPortal [25] showed that the highly ranked phosphorylated proteins in the lower quartile of PTEN expression (defined as PTEN RNA-sequencing expression, z score<−0.5) included AKT1, AKT2 and AKT3, which reflects activation of the PI3K pathway. Additionally, another highly ranked protein expressed in this subgroup was AR. A direct link between AR and PTEN was suggested by overlaying genomic information, including copy number and gene expression, on a known protein interaction network (Figure 4A). Moreover, AR expression has also been associated with PTEN expression in prostate cancer [26]. Therefore, we hypothesised that AR expression was prognostically significant. Using the TCGA RNA-sequencing data, we found that low AR expression was associated with shorter overall survival (hazard ratio 1.5, 95% CI 1.1 to 2.1, P=0.02; Figure 4B).

Figure 4
figure 4

Androgen receptor is co-regulated with PTEN and is associated with good prognosis. (A) Protein–protein interaction network of proteins associated with high PTEN expression shows co-expression of AR and PTEN (figure obtained from cBioPortal website using the provisional TCGA data for ovarian serous cystadenocarcinoma). (B) Kaplan–Meier survival curves for AR expression show prognostic value, after dichotomising AR RNA-sequencing expression levels at the median level (N high=126, N low=129; hazard ratio 1.5, 95% CI 1.1 to 2.1, P=0.02). (C) Examples of immunohistochemistry (IHC) AR staining of HGSOC samples. (D) Heat map shows no statistical association between AR and PTEN IHC expression in HGSOC samples from the SEARCH cohort (chi-squared test, P=0.63). (E) Kaplan–Meier survival curves for AR positive vs AR negative for HGSOC from the SEARCH study (using IHC) (multivariate hazard ratio 1.7, 95% CI 1.1 to 2.4, P=0.01). CI, confidence interval; HGSOC, high-grade serous ovarian carcinoma; HR, hazard ratio; IF, immunofluorescence; IHC, immunohistochemistry; SEARCH, Study of Epidemiology and Risk Factors in Cancer Heredity.

These results were validated experimentally in 216 samples from the SEARCH cohort. It was found that 43%, 25% and 32% of HGSOC expressed high levels of AR (≥50% of tumour cells), low levels of AR (<50% of tumour cells) or no AR, respectively (Figure 4C). In a multivariate analysis, we found a similar prognostic effect for expression in these samples (hazard ratio 1.7, 95% CI 1.1 to 2.4, P=0.01), despite there being no strong association between AR and PTEN (chi-squared test P=0.63; Figure 4D,E).

Differentiated and proliferative expression subtypes are associated with high and low PTENexpression

TCGA subdivided HGSOC into four subgroups based on gene expression profiles, differentiated, proliferative, mesenchymal and immunoreactive, in which the latter two were enriched for stromal cells and leukocytes (Figure 5A) [4],[18].

Figure 5
figure 5

PTEN highAR highand PTEN lowAR lowtumours correlate with the differentiated and proliferative subgroups. (A) Heat maps representing the association between HGSOC TCGA subtypes and expression of CD3, ACTA2, AR and PTEN in the TCGA data set. Mesenchymal and immunoreactive subtypes show good correlation with ACTA2 and CD3 expression, respectively (left). The heat map on the right shows only the proliferative and differentiated subtypes and suggests that high AR and PTEN expression is associated with the differentiated subtype. (B) Categorisation of patients into high, medium and low expression for PTEN and AR shows an association between PTEN expression and the differentiated and proliferative subtypes (chi-squared test, PTEN P=0.02 and AR P=0.38). DIF, differentiated; HGSOC, high-grade serous ovarian carcinoma; med, medium; PRO, proliferative; TCGA, The Cancer Genome Atlas.

PTEN loss is associated with activation of the PI3K pathway and consequently with proliferation. Therefore, we hypothesised that PTEN expression could be associated with the remaining differentiated and proliferative subgroups. By focusing on these two subgroups and categorising the raw AR and PTEN expression into tertiles, higher PTEN expression was associated with the differentiated subgroup (Figure 5B, chi-squared test P=0.02), whilst lower PTEN expression was associated with the proliferative subgroup.


In this work, we have developed and validated image analysis methods to score stromal components in tumour images automatically. We used these methods to identify tissue samples with low stromal content from TCGA, which allowed us to examine the range of PTEN RNA expression accurately, confirming that PTEN expression in HGSOC was highly variable. These observations supported our hypothesis that PTEN loss could be common in HGSOC and was confirmed using a validated PTEN antibody on tissue microarrays from two independent clinical cohorts.

Extensive genomic analysis of HGSOC has revealed common involvement of tumour suppressor genes, including TP53, BRCA1 and BRCA2, but only rare involvement of ‘actionable’ oncogenic mutations. The importance of identifying driver mutations in cancer and using them for therapeutic targets has been extensively demonstrated and a robust molecular stratification of HGSOC is required to predict prognosis accurately and to support rational drug development for this disease. Previous genomic analysis of HGSOC has identified four molecular subtypes of HGSOC based on gene expression profiles: differentiated, proliferative, mesenchymal and immunoreactive [4],[17],[18]. However, these profiles have only weak prognostic value, are highly influenced by stromal contribution and do not have direct therapeutic implications. Cellular heterogeneity in tumour samples is a common confounder for genomic analysis and it is important to note that the spatial distribution and number of normal and tumour cells can provide important phenotypes that are not represented in genomic profiles [21],[27].

In this work, by using image analysis to focus on tumour-cell-specific PTEN expression, we have shown that PTEN loss is a common event in HGSOC, supporting previous IHC-based studies [14]–[16]. Although there is conflicting data on the prognostic value of PTEN [16],[28], in our analysis of 442 HGSOC cases from two separate cohorts, we have demonstrated that PTEN loss or downregulation is prognostic and is strongly associated with worse overall survival. Positive PTEN staining has comparable survival effects to those described for BRCA1/2 carriers [4]. TCGA previously demonstrated homozygous deletion of PTEN in 6% of HGSOC cases [4], and our reanalysis of this data set additionally shows that heterozygous loss is common in tumour cells (36%) and is associated with reduced expression of PTEN RNA and protein.

As methylation of PTEN was only infrequently observed in the TCGA data, other post-transcriptional modifications may play an important role in regulating PTEN [29]. PTEN is regulated by a complex network of miRNAs and mRNAs that share the same miRNA binding site – these have been termed competing endogeneous RNA (ceRNA). Also, the PTEN pseudogene PTENP1 can indirectly regulate PTEN expression [30]. Further studies will be needed to clarify whether the heterogeneous expression patterns observed can be explained by these post-transcriptional modifications, tumour heterogeneity or upstream genetic changes.

PTEN gene dosage is finely regulated and small expression changes may have important phenotypic effects [31]. In our analysis, weak expression of PTEN had similar detrimental effects on survival as total loss of expression. These survival effects are consistent with models suggesting haploinsufficiency phenotypes for PTEN [8],[32]. Previous analyses of early serous tubal carcinomas showed that 33% (N=4/12) of cases had complete loss of PTEN expression and a further 33% had heterogeneous loss [15]. Together with our observations, these data strongly support the contention that PTEN is a prevalent early driver event in HGSOC. This is further supported by data from mouse models where the addition of PTEN deletion to alteration of DICER, or TP53 and BRCA1, was critical for initiation and progression of HGSOC [10],[11].

In breast cancer, PTEN loss is more prevalent in association with BRCA1 germ-line mutations, and tumour analysis at the single-cell level has shown that it is the most common initiating event in BRCA1-associated breast tumours [7],[8]. Owing to our small sample size, our results suggest, but do not confirm, that PTEN loss is more prevalent in BRCA1-associated HGSOC.

We also show that the PTEN high subgroup with improved outcome was associated with the differentiated expression signature, whereas the poor prognosis PTEN low subgroup was associated with the proliferative signature. The stratification of HGSOC patients into PTEN high and PTEN low subgroups may have important therapeutic implications. Firstly, PTEN loss activates the PI3K pathway and tumours that present reduced expression of PTEN may respond to PI3K inhibitors. Additionally, PTEN loss has also been associated with response to PARP inhibitors in endometrial cell lines [33] and a mechanistic basis for this has been suggested by recent findings that implicate nuclear PTEN in the regulation of homologous recombination [34]. Recent data suggest strong activity for compound PARP-PI3K inhibition in prostate and breast cancers and these drugs may be effective for the PTENlow HGSOC subgroup [35],[36].

The important association between AR and PTEN has previously been demonstrated for prostate cancer but not tested for ovarian cancer [26]. By correcting gene signatures from TCGA for stromal content, we showed that AR is co-expressed with PTEN and higher levels of AR expression were associated with longer overall survival. This is consistent with a recent meta-analysis that also suggested a better overall survival for patients with breast tumours with higher androgen receptor (AR) expression, independent of Estrogen Receptor (ER), and new clinical trials targeting AR in breast cancer have already been put in place [37]. However, our tissue microarray data showed a weaker correlation between PTEN and AR protein expression than observed in the genomic data. This may reflect the possible role of post-transcriptional modifications on protein expression, and the large degree of intra-tumour heterogeneity observed in IHC staining. Clinical trials with AR antagonists in ovarian cancer performed over 20 years ago suggested that only a small subset of tumours may respond to these drugs [38]. Our results suggest that stratification based on AR expression may allow for the identification of a potentially responsive high AR subset, which is associated with better prognosis.


PTEN loss, together with TP53 and BRCA1 alterations, is a common event in HGSOC and, in combination with AR, allows for a prognostic stratification of HGSOC subgroups, which may be amenable to targeted therapies. We have demonstrated that important genomic events that are confounded by stromal contribution in tumour samples, such as PTEN loss, can be resolved by integrating image analysis with protein and gene expression. Such bioinformatic approaches are broadly applicable and could lead to important discoveries in other diseases where heterogeneity of the tissue may be a confounding issue in genomic analyses.

Materials and methods

Overview of data sets used

Analyses were performed on three data sets:

  1. 1.

    The publicly available TCGA, from which H&E images, mRNA expression, genomic data including copy number variation and DNA methylation data, and corresponding clinical information for 489 HGSOC patient samples were obtained. This was downloaded from the TCGA data portal [4] and cBioPortal [25]. A total of 51 IHC samples for PTEN from this data set were obtained from Hanrahan et al. [14] and used to correlate genomic information with protein expression [14].

  2. 2.

    SEARCH data set with tissue samples and corresponding clinical information for 245 HGSOC samples (out of 516 ovarian cancer cases; Table 1). Patients were recruited after a diagnosis of ovarian cancer and if they were able to consent for participation in the study [3]. Key demographical and clinical data on the patients, including BRCA1 germ-line mutation status, were presented anonymously.

  3. 3.

    Nottingham Ovarian Cancer Study (NOT) data set with tissue samples and corresponding clinical data from 276 HGSOC samples (out of 507 ovarian cancer cases; Table 2). This is a retrospective study of ovarian cancer cases diagnosed between 1991 and 2011 [3]. For this study, the institutional research ethics boards (East Of England Cambridgeshire REC (for SEARCH) and Derbyshire REC (for NOT)) waived the need to obtain consent. Both local human research investigation committees approved each study.

Correcting gene expression profiles using image analysis

Scoring by eye for overall image quality (based on discolouration, folding over of the mounted section and completeness of the section) was performed on 312 slides, of which 46 were discarded owing to poor quality, leaving 266 slides from 194 patients. Based on quality, 302 H&E slides from 216 patient samples in the TCGA database were selected, of which an observer (FCM) manually scored 266 for stromal content. Images were segmented by first applying an entropy filter to remove the background from an image, followed by colour deconvolution according to Ruifroks’ method (Additional file 5) [39]. The haematoxylin channel was subtracted from the eosin channel, leaving a raw stromal signal. Otsu’s thresholding and smoothing was then performed to estimate a stromal fraction [40]. These values along with stromal gene scores were used to predict the stromal content in the remaining TCGA samples. Correlation between automated and manual scoring was performed using the Jonckheere–Terpstra test for trend. Gene-expression-based validation of the method was performed by generating a stromal gene list [22] and performing univariate Pearson’s correlation testing between stromal quantification and expression.

Differential gene expression analysis

PTEN expression was categorised into quartiles and the top and bottom quartiles were used for differential expression analysis, which was performed using either the limma package [41] or an orthogonal method, csSAM [42], on the complete TCGA set (N=489) and in the subset of low ACTA2 tumours (N=122). Differentially expressed genes were selected after correction for the false discovery rate for multiple testing and using a cut-off P<0.05. The top 50 differentially expressed genes were selected for further unsupervised hierarchical clustering and visualisation using the made4 made4 library [43]. Unsupervised hierarchical clustering was performed using the Euclidean distance metric and Ward’s method.

Statistical tests and survival analysis

All statistical analysis were performed using R [44]. All the R code necessary to replicate the data analysis is available in Additional file 6. Survival analysis was performed using a Cox proportional hazards model. Since some patients died just after diagnosis and were not included in the SEARCH study, left truncation was included in the analysis of this cohort, which means that we took into account both the time from diagnosis to entry into the study and the time from entry into the study to censoring or death.

PTEN immunostaining, scanning and scoring of SEARCH and NOT samples

IHC was carried out using tissue microarrays obtained from Formalin Fixed, Paraffin-Embedded (FFPE) tissues and primary antibodies for PTEN and AR (Cell Signaling, Danvers, MA, USA; PTEN – Clone 138G6; AR – Clone D6F11). The staining protocol for PTEN clone 138G6 was previously described by FCM [8] and validated for specificity and sensitivity using Pten knockout mice and xenograft tissue from PTEN-positive and -negative human cancer cell lines (see Supplementary Figure 1 in [8]). Heat-induced antigen retrieval was carried out in 10 mmol −1 citric acid (pH 6.0) in a pressure cooker at 120°C for 10 min. Sections were incubated with PTEN or AR antibodies overnight, diluted 1:100 in 5% goat serum, followed by incubation with anti-rabbit biotinylated secondary antibody for 1 h and peroxidase-conjugated avidin-biotin complexes (Elite ABC; Vector Laboratories, Burlingame, CA, USA). Formed immunocomplexes were visualised using diaminobenzidine (DAKO, Glostrup, Denmark, EU) and slides were counterstained with haematoxylin. Sections were rinsed in PBS between each step.

IF was carried out by adding a tyramide signal amplification step (Perkin-Elmer) after the secondary antibody, followed by incubation with Alexa Fluor 647-conjugated streptavidin (Invitrogen). Nuclei were counterstained with 4,6-diamidino-2-phenylindole (DAPI) (Invitrogen).

IF and IHC samples were stored at −20°C and room temperature (RT), respectively, for at least 48 h before image analysis. For IHC, an Ariol scanning system (Leica, Wetzlar, Germany, EU) was used to obtain the digital images. For IF, images were acquired with a SP5 Leica Confocal Microscope, ×40 plan objective, and analysed by Leica software (Leica Application Suite, Advanced Fluorescence 2.2.0). Scoring was performed according to intensity (using stromal cells as internal positive controls) and percentage of stained cells by two independent observers (FCM and MJL). We subdivided tumours as negative (no staining in any tumour cell), weak positive (all tumour cells weakly stained compared to stromal cells), positive (all tumour and stromal cells equally stained) or heterogeneous (combination of positive and negative/weak staining) staining.

Western-blot PTEN quantification in relation to TP53status

Western blot analysis was performed for extracts from SKOV3 ovarian cancer cell lines (ATCC; HTB-77) and derivatives obtained by Bacterial artificial chromosome (BAC) lipofectamine transfection. A BAC modification kit (Gene Bridges, Heidelberg, Germany, EU; catalogue number K002) was used to obtain the BAC clones (empty control and with R175H or R273H mutations) from the original BAC containing wild-type TP53 (CTD-3049A20; Invitrogen, catalogue number 96012). Whole-cell extracts were collected after scraping cells in protein lysis buffer containing 50 mmoll −1 Tris-HCl (pH 7.4), 150 mmoll −1 NaCl, 5 mmoll −1 ethylenediaminetetraacetic acid (EDTA), 50 mmoll −1 NaF, 0.5% NP40, one Complete™ Mini and EDTA free tablet per 50 ml. Protein concentrations were determined using Bio-Rad protein assay kit (Hercules, CA, USA). Equal concentrations were resolved by sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS-PAGE), transferred to Immobilon-fluorescence PVDF membrane (Millipore, Bedford, MA, USA), blocked and probed with anti-human TP53 (clone DO-1, Santa Cruz, Dallas, Texas, USA; 1:2,000; overnight incubation at 4°C), anti-human PTEN (clone 138G6, Cell Signaling; 1:1,000; overnight incubation at 4°C) and anti GAPDH (Cell Signaling; 1:5,000; overnight incubation at 4°C) antibodies. IRDye800-conjugated anti-rabbit immunoglobulin G (IgG) and IRDye700-conjugated anti-mouse IgG (Li-Cor, Lincoln, NE, USA; 1:5,000 and 1:10,000 dilutions; incubated at RT for 1 h) were used as secondary antibodies. Signal intensities were analysed by using the Odyssey infrared image system (Li-Cor, Lincoln, NE, USA). Two replicates of the experiment were performed and similar results were obtained.

Authors’ information

FM and JDB are joint corresponding authors.

Additional files



androgen receptor


confidence interval


copy number alteration


endometrioid ovarian cancer


glyceraldehyde 3-phosphate dehydrogenase


Gene Set Enrichment Analysis


hematoxylin and eosin


high-grade serous ovarian carcinoma








Nottingham Ovarian Cancer Study


phosphate-buffered saline


Study of Epidemiology and Risk Factors in Cancer Heredity


The Cancer Genome Atlas


  1. Vaughan S, Coward JI, Bast RC, Berchuck A, Berek JS, Brenton JD, Coukos G, Crum CC, Drapkin R, Etemadmoghadam D, Friedlander M, Gabra H, Kaye SB, Lord CJ, Lengyel E, Levine DA, McNeish IA, Menon U, Mills GB, Nephew KP, Oza AM, Sood AK, Stronach EA, Walczak H, Bowtell DD, Balkwill FR: Rethinking ovarian cancer: recommendations for improving outcomes . Nat Rev Cancer. 2011, 11: 719-725. 10.1038/nrc3144.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  2. Ahmed AA, Etemadmoghadam D, Temple J, Lynch AG, Riad M, Sharma R, Stewart C, Fereday S, Caldas C, Defazio A, Bowtell D, Brenton JD: Driver mutations in TP53 are ubiquitous in high grade serous carcinoma of the ovary . J Pathol. 2010, 221: 49-56. 10.1002/path.2696.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  3. Sieh W, Köbel M, Longacre TA, Bowtell DD, deFazio A, Goodman MT, Høgdall E, Deen S, Wentzensen N, Moysich KB, Brenton JD, Clarke BA, Menon U, Gilks CB, Kim A, Madore J, Fereday S, George J, Galletta L, Lurie G, Wilkens LR, Carney ME, Thompson PJ, Matsuno RK, Kjær SK, Jensen A, Høgdall C, Kalli KR, Fridley BL, Keeney GL, et al: Hormone-receptor expression and ovarian cancer survival: an ovarian tumor tissue analysis consortium study . Lancet Oncol. 2013, 14: 853-862. 10.1016/S1470-2045(13)70253-5.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  4. The Cancer Genome Atlas Network: Integrated genomic analyses of ovarian carcinoma . Nature. 2011, 474: 609-615. 10.1038/nature10166.

    Article  Google Scholar 

  5. The Cancer Genome Atlas Network: Comprehensive molecular portraits of human breast tumours . Nature. 2012, 490: 61-70. 10.1038/nature11412.

    Article  PubMed Central  Google Scholar 

  6. Saal LH, Holm K, Maurer M, Memeo L, Su T, Wang X, Yu JS, Malmström P-O, Mansukhani M, Enoksson J, Hibshoosh H, Borg A, Parsons R: PIK3CA mutations correlate with hormone receptors, node metastasis, and ERBB2, and are mutually exclusive with PTEN loss in human breast carcinoma . Cancer Res. 2005, 65: 2554-2559. 10.1158/0008-5472-CAN-04-3913.

    Article  PubMed  CAS  Google Scholar 

  7. Saal LH, Gruvberger-Saal SK, Persson C, Lövgren K, Jumppanen M, Staaf J, Jönsson G, Pires MM, Maurer M, Holm K, Koujak S, Subramaniyam S, Vallon-Christersson J, Olsson H, Su T, Memeo L, Ludwig T, Ethier SP, Krogh M, Szabolcs M, Murty VVVS, Isola J, Hibshoosh H, Parsons R, Borg A: Recurrent gross mutations of the PTEN tumor suppressor gene in breast cancers with deficient DSB repair . Nat Genet. 2008, 40: 102-107. 10.1038/ng.2007.39.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  8. Martins FC, De S, Almendro V, Gönen M, Park SY, Blum JL, Herlihy W, Ethington G, Schnitt SJ, Tung N, Garber JE, Fetten K, Michor F, Polyak K: Evolutionary pathways in BRCA1-associated breast tumors . Cancer Discov. 2012, 2: 503-511. 10.1158/2159-8290.CD-11-0325.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  9. Cully M, You H, Levine AJ, Mak TW: Beyond PTEN mutations: the PI3K pathway as an integrator of multiple inputs during tumorigenesis . Nat Rev Cancer. 2006, 6: 184-192. 10.1038/nrc1819.

    Article  PubMed  CAS  Google Scholar 

  10. Kim J, Coffey DM, Creighton CJ, Yu Z, Hawkins SM, Matzuk MM: High-grade serous ovarian cancer arises from Fallopian tube in a mouse model . Proc Natl Acad Sci USA. 2012, 109: 3921-3926. 10.1073/pnas.1117135109.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  11. Perets R, Wyant GA, Muto KW, Bijron JG, Poole BB, Chin KT, Chen JYH, Ohman AW, Stepule CD, Kwak S, Karst AM, Hirsch MS, Setlur SR, Crum CP, Dinulescu DM, Drapkin R: Transformation of the Fallopian tube secretory epithelium leads to high-grade serous ovarian cancer in Brca;Tp53;Pten models . Cancer Cell. 2013, 24: 751-765. 10.1016/j.ccr.2013.10.013.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  12. Dinulescu DM, Ince TA, Quade BJ, Shafer SA, Crowley D, Jacks T: Role of K-ras and Pten in the development of mouse models of endometriosis and endometrioid ovarian cancer . Nat Med. 2005, 11: 63-70. 10.1038/nm1173.

    Article  PubMed  CAS  Google Scholar 

  13. Wu R, Hendrix-Lucas N, Kuick R, Zhai Y, Schwartz DR, Akyol A, Hanash S, Misek DE, Katabuchi H, Williams BO, Fearon ER, Cho KR: Mouse model of human ovarian endometrioid adenocarcinoma based on somatic defects in the Wnt/ β -catenin and PI3K/Pten signaling pathways . Cancer Cell. 2007, 11: 321-333. 10.1016/j.ccr.2007.02.016.

    Article  PubMed  CAS  Google Scholar 

  14. Hanrahan AJ, Schultz N, Westfal ML, Sakr RA, Giri DD, Scarperi S, Janakiraman M, Janikariman M, Olvera N, Stevens EV, She Q-B, Aghajanian C, King TA, de Stanchina E, Spriggs DR, Heguy A, Taylor BS, Sander C, Rosen N, Levine DA, Solit DB: Genomic complexity and AKT dependence in serous ovarian cancer . Cancer Discov. 2012, 2: 56-67. 10.1158/2159-8290.CD-11-0170.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  15. Roh MH, Yassin Y, Miron A, Mehra KK, Mehrad M, Monte NM, Mutter GL, Nucci MR, Ning G, Mckeon FD, Hirsch MS, Wa X, Crum CP: High-grade fimbrial-ovarian carcinomas are unified by altered p53, PTEN and PAX2 expression . Mod Pathol. 2010, 23: 1316-1324. 10.1038/modpathol.2010.119.

    Article  PubMed  CAS  Google Scholar 

  16. Madore J, Ren F, Filali-Mouhim A, Sanchez L, Köbel M, Tonin PN, Huntsman D, Provencher DM, Mes-Masson A-M: Characterization of the molecular differences between ovarian endometrioid carcinoma and ovarian serous carcinoma . J Pathol. 2010, 220: 392-400.

    PubMed  Google Scholar 

  17. Tothill RW, Tinker AV, George J, Brown R, Fox SB, Lade S, Johnson DS, Trivett MK, Etemadmoghadam D, Locandro B, Traficante N, Fereday S, Hung JA, Chiew Y-E, Haviv I, Gertig D, DeFazio A, Bowtell DDL, AOCSG: Novel molecular subtypes of serous and endometrioid ovarian cancer linked to clinical outcome . Clin Cancer Res. 2008, 14: 5198-5208. 10.1158/1078-0432.CCR-08-0196.

    Article  PubMed  CAS  Google Scholar 

  18. Verhaak RGW, Tamayo P, Yang J-Y, Hubbard D, Zhang H, Creighton CJ, Fereday S, Lawrence M, Carter SL, Mermel CH, Kostic AD, Etemadmoghadam D, Saksena G, Cibulskis K, Duraisamy S, Levanon K, Sougnez C, Tsherniak A, Gomez S, Onofrio R, Gabriel S, Chin L, Zhang N, Spellman PT, Zhang Y, Akbani R, Hoadley KA, Kahn A, Köbel M, Huntsman D, et al: Prognostically relevant gene signatures of high-grade serous ovarian carcinoma . J Clin Invest. 2013, 123: 517-525.

    PubMed  CAS  PubMed Central  Google Scholar 

  19. Press JZ, De Luca A, Boyd N, Young S, Troussard A, Ridge Y, Kaurah P, Kalloger SE, Blood KA, Smith M, Spellman PT, Wang Y, Miller DM, Horsman D, Faham M, Gilks CB, Gray J, Huntsman DG: Ovarian carcinomas with genetic and epigenetic BRCA1 loss have distinct molecular abnormalities . BMC Cancer. 2008, 8: 17-10.1186/1471-2407-8-17.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Helland Å, Anglesio MS, George J, Cowin PA, Johnstone CN, House CM, Sheppard KE, Etemadmoghadam D, Melnyk N, Rustgi AK, Phillips WA, Johnsen H, Holm R, Kristensen GB, Birrer MJ, Pearson RB, Børresen-Dale A-L, Huntsman DG, deFazio A, Creighton CJ, Smyth GK, Bowtell DDL, AOCSG: Deregulation of MYCN, LIN28B and LET7 in a molecular subtype of aggressive high-grade serous ovarian cancers . PLoS One. 2011, 6: 18064-10.1371/journal.pone.0018064.

    Article  Google Scholar 

  21. Yuan Y, Failmezger H, Rueda OM, Ali HR, Gräf S, Chin S-F, Schwarz RF, Curtis C, Dunning MJ, Bardwell H, Johnson N, Doyle S, Turashvili G, Provenzano E, Aparicio S, Caldas C, Markowetz F: Quantitative image analysis of cellular heterogeneity in breast tumors complements genomic profiling . Sci Transl Med. 2012, 4: 157-143. 10.1126/scitranslmed.3004330.

    Google Scholar 

  22. Yoshihara K, Shahmoradgoli M, Martínez E, Vegesna R, Kim H, Torres-Garcia W, Treviño V, Shen H, Laird PW, Levine DA, Carter SL, Getz G, Stemke-Hale K, Mills GB, Verhaak RGW: Inferring tumour purity and stromal and immune cell admixture from expression data . Nat Commun. 2013, 4: 2612-10.1038/ncomms3612.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Leinster DA, Kulbe H, Everitt G, Thompson R, Perretti M, Gavins FNE, Cooper D, Gould D, Ennis DP, Lockley M, McNeish IA, Nourshargh S, Balkwill FR: The peritoneal tumour microenvironment of high-grade serous ovarian cancer . J Pathol. 2012, 227: 136-145. 10.1002/path.4002.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  24. Stambolic V, MacPherson D, Sas D, Lin Y, Snow B, Jang Y, Benchimol S, Mak TW: Regulation of PTEN transcription by p53 . Mol Cell. 2001, 8: 317-325. 10.1016/S1097-2765(01)00323-9.

    Article  PubMed  CAS  Google Scholar 

  25. Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA, Jacobsen A, Byrne CJ, Heuer ML, Larsson E, Antipin Y, Reva B, Goldberg AP, Sander C, Schultz N: The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data . Cancer Discov. 2012, 2: 401-404. 10.1158/2159-8290.CD-12-0095.

    Article  PubMed  Google Scholar 

  26. Carver BS, Chapinski C, Wongvipat J, Hieronymus H, Chen Y, Chandarlapaty S, Arora VK, Le C, Koutcher J, Scher H, Scardino PT, Rosen N, Sawyers CL: Reciprocal feedback regulation of PI3K and androgen receptor signaling in PTEN-deficient prostate cancer . Cancer Cell. 2011, 19: 575-586. 10.1016/j.ccr.2011.04.008.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  27. Beck AH, Sangoi AR, Leung S, Marinelli RJ, Nielsen TO, van de Vijver MJ, West RB, van de Rijn M, Koller D: Systematic analysis of breast cancer morphology uncovers stromal features associated with survival . Sci Transl Med. 2011, 3: 108-113. 10.1126/scitranslmed.3002564.

    Google Scholar 

  28. Kolasa IK, Rembiszewska A, Janiec-Jankowska A, Dansonka-Mieszkowska A, Lewandowska AM, Konopka B, Kupryjańczyk J: PTEN mutation, expression and LOH at its locus in ovarian carcinomas. Relation to TP53, K-RAS and BRCA1 mutations . Gynecol Oncol. 2006, 103: 692-697. 10.1016/j.ygyno.2006.05.007.

    Article  PubMed  CAS  Google Scholar 

  29. Tay Y, Kats L, Salmena L, Weiss D, Tan SM, Ala U, Karreth F, Poliseno L, Provero P, Di Cunto F, Lieberman J, Rigoutsos I, Pandolfi PP: Coding-independent regulation of the tumor suppressor PTEN by competing endogenous mRNAs . Cell. 2011, 147: 344-357. 10.1016/j.cell.2011.09.029.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  30. Poliseno L, Salmena L, Zhang J, Carver B, Haveman WJ, Pandolfi PP: A coding-independent function of gene and pseudogene mRNAs regulates tumour biology . Nature. 2010, 465: 1033-1038. 10.1038/nature09144.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  31. Alimonti A, Carracedo A, Clohessy JG, Trotman LC, Nardella C, Egia A, Salmena L, Sampieri K, Haveman WJ, Brogi E, Richardson AL, Zhang J, Pandolfi PP: Subtle variations in PTEN dose determine cancer susceptibility . Nat Genet. 2010, 42: 454-458. 10.1038/ng.556.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  32. Kwabi-Addo B, Giri D, Schmidt K, Podsypanina K, Parsons R, Greenberg N, Ittmann M: Haploinsufficiency of the PTEN tumor suppressor gene promotes prostate cancer progression . Proc Natl Acad Sci USA. 2001, 98: 11563-11568. 10.1073/pnas.201167798.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  33. Mendes-Pereira AM, Martin SA, Brough R, McCarthy A, Taylor JR, Kim J-S, Waldman T, Lord CJ, Ashworth A: Synthetic lethal targeting of PTEN mutant cells with PARP inhibitors . EMBO Mol Med. 2009, 1: 315-322. 10.1002/emmm.200900041.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  34. Bassi C, Ho J, Srikumar T, Dowling RJO, Gorrini C, Miller SJ, Mak TW, Neel BG, Raught B, Stambolic V: Nuclear PTEN controls DNA repair and sensitivity to genotoxic stress . Science. 2013, 341: 395-399. 10.1126/science.1236188.

    Article  PubMed  CAS  Google Scholar 

  35. Juvekar A, Burga LN, Hu H, Lunsford EP, Ibrahim YH, Balmañà J, Rajendran A, Papa A, Spencer K, Lyssiotis CA, Nardella C, Pandolfi PP, Baselga J, Scully R, Asara JM, Cantley LC, Wulf GM: Combining a PI3K inhibitor with a PARP inhibitor provides an effective therapy for BRCA1-related breast cancer . Cancer Discov. 2012, 2: 1048-1063. 10.1158/2159-8290.CD-11-0336.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  36. González-Billalabeitia E, Seitzer N, Song SJ, Song MS, Patnaik A, Liu X-S, Epping MT, Papa A, Hobbs RM, Chen M, Lunardi A, Ng C, Webster KA, Signoretti S, Loda M, Asara JM, Nardella C, Clohessy JG, Cantley LC, Pandolfi PP: Vulnerabilities of PTEN-TP53-deficient prostate cancers to compound PARP-PI3K inhibition . Cancer Discov. 2014, 4: 896-904. 10.1158/2159-8290.CD-13-0230.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Vera-Badillo FE, Templeton AJ, de Gouveia P, Diaz-Padilla I, Bedard PL, Al-Mubarak M, Seruga B, Tannock IF, Ocana A, Amir E: Androgen receptor expression and outcomes in early breast cancer: a systematic review and meta-analysis . J Natl Cancer Inst. 2014, 106: 319-10.1093/jnci/djt319.

    Article  Google Scholar 

  38. Tumolo S, Rao BR, van der Burg ME, Guastalla JP, Renard J, Vermorken JB: Phase II trial of flutamide in advanced ovarian cancer: an EORTC Gynaecological Cancer Cooperative Group study . Eur J Cancer. 1994, 30A: 911-914. 10.1016/0959-8049(94)90112-0.

    Article  PubMed  CAS  Google Scholar 

  39. Ruifrok AC, Johnston DA: Quantification of histochemical staining by color deconvolution . Anal Quant Cytol Histol. 2001, 23: 291-299.

    PubMed  CAS  Google Scholar 

  40. Otsu N: Threshold selection method from gray-level histograms . IEEE Trans Syst Man Cybern. 1979, 9: 62-66. 10.1109/TSMC.1979.4310076.

    Article  Google Scholar 

  41. Smyth GK, Michaud J, Scott HS: Use of within-array replicate spots for assessing differential expression in microarray experiments . Bioinformatics. 2005, 21: 2067-2075. 10.1093/bioinformatics/bti270.

    Article  PubMed  CAS  Google Scholar 

  42. Shen-Orr SS, Tibshirani R, Khatri P, Bodian DL, Staedtler F, Perry NM, Hastie T, Sarwal MM, Davis MM, Butte AJ: Cell type-specific gene expression differences in complex tissues . Nat Methods. 2010, 7: 287-289. 10.1038/nmeth.1439.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  43. Culhane AC, Thioulouse J, Perrière G, Higgins DG: Made4: an R package for multivariate analysis of gene expression data . Bioinformatics. 2005, 21: 2789-2790. 10.1093/bioinformatics/bti394.

    Article  PubMed  CAS  Google Scholar 

  44. R Core Team: R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria: 2013 []

Download references


This work was supported by Cancer Research UK [grant numbers A15601, A17197,A16561, A10124]; the University of Cambridge; National Institute for Health Research Cambridge Biomedical Research Centre and Academic Clinical Fellowship scheme (FCM); Cambridge Experimental Cancer Medicine Centre and Hutchison Whampoa Limited. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

Authors and Affiliations


Corresponding author

Correspondence to James D Brenton.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

FCM, IS, AT, FM and JDB designed the study, interpreted the data and wrote the manuscript. FCM, JX, AG, KS and MJL performed the lab experiments and pathology analysis. IS, AT, PDP and FM performed the bioinformatic and survival analyses. KD, MM, JA and PDP provided SEARCH samples and clinical data. SD provided Nottingham samples and clinical data. All authors read and approved the final manuscript.

Electronic supplementary material


Additional file 1: Figure S1. Automated quantification of stroma correlates well with manual scoring. Good correlation was observed between automated stromal scoring and by eye scoring (r=0.593, N=266 images; Jonckheere–Terpstra test for trend P=0.001). (PDF 79 KB)


Additional file 2: Supplementary Tables. Pathological and clinical data from SEARCH, NOT and Hanrahan cohorts. NOT, Nottingham Ovarian Cancer Study; SEARCH, Study of Epidemiology and Risk Factors in Cancer Heredity. (XLSX 151 KB)


Additional file 3: Figure S2. PTEN IHC staining correlates with PTEN IF staining and shows prognostic value. (A) Contingency table showing strong correlation between scores obtained from PTEN IF and IHC stainings (chi-squared test, P0.001). (B) Combined SEARCH and NOT studies (using IHC) (multivariate hazard ratio 1.5, 95% CI 1.1 to 2.0), P=0.006. CI, confidence interval; IF, immunofluorescence; IHC, immunohistochemistry; NOT, Nottingham Ovarian Cancer Study; SEARCH, Study of Epidemiology and Risk Factors in Cancer Heredity. (PDF 69 KB)


Additional file 4: Supplementary Information. Supplementary information includes all code for the generation of plots and statistical analyses in R. (PDF 2 MB)

Additional file 5: Code for image Analysis. Code used to estimate stromal content in TCGA histopathological images. (ZIP 7 MB)

Additional file 6: R code used for data analysis and to obtain all the results and figures in this publication. R code. (ZIP 18 KB)

Authors’ original submitted files for images

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Martins, F.C., Santiago, I.d., Trinh, A. et al. Combined image and genomic analysis of high-grade serous ovarian cancer reveals PTEN loss as a common driver event and prognostic classifier. Genome Biol 15, 526 (2014).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: