Comparative meta-analysis of prognostic gene signatures for late-stage ovarian cancer - PubMed (original) (raw)

Review

. 2014 Apr 3;106(5):dju049.

doi: 10.1093/jnci/dju049.

Benjamin Haibe-Kains 1, Aedín C Culhane 1, Markus Riester 1, Jie Ding 1, Xin Victoria Wang 1, Mahnaz Ahmadifar 1, Svitlana Tyekucheva 1, Christoph Bernau 1, Thomas Risch 1, Benjamin Frederick Ganzfried 1, Curtis Huttenhower 1, Michael Birrer 1, Giovanni Parmigiani 2

Affiliations

Review

Comparative meta-analysis of prognostic gene signatures for late-stage ovarian cancer

Levi Waldron et al. J Natl Cancer Inst. 2014.

Abstract

Background: Ovarian cancer is the fifth most common cause of cancer deaths in women in the United States. Numerous gene signatures of patient prognosis have been proposed, but diverse data and methods make these difficult to compare or use in a clinically meaningful way. We sought to identify successful published prognostic gene signatures through systematic validation using public data.

Methods: A systematic review identified 14 prognostic models for late-stage ovarian cancer. For each, we evaluated its 1) reimplementation as described by the original study, 2) performance for prognosis of overall survival in independent data, and 3) performance compared with random gene signatures. We compared and ranked models by validation in 10 published datasets comprising 1251 primarily high-grade, late-stage serous ovarian cancer patients. All tests of statistical significance were two-sided.

Results: Twelve published models had 95% confidence intervals of the C-index that did not include the null value of 0.5; eight outperformed 97.5% of signatures including the same number of randomly selected genes and trained on the same data. The four top-ranked models achieved overall validation C-indices of 0.56 to 0.60 and shared anticorrelation with expression of immune response pathways. Most models demonstrated lower accuracy in new datasets than in validation sets presented in their publication.

Conclusions: This analysis provides definitive support for a handful of prognostic models but also confirms that these require improvement to be of clinical value. This work addresses outstanding controversies in the ovarian cancer literature and provides a reproducible framework for meta-analytic evaluation of gene signatures.

© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

PubMed Disclaimer

Figures

Figure 1.

Figure 1.

Methodology for the systematic meta-analysis of ovarian cancer prognosis models. This outlines methodology for comparative evaluation of published genomic risk scores using a database of publicly available expression data.

Figure 2.

Figure 2.

Performance assessment of published risk scores. Citations for models and expression datasets are provided in Tables 1 and 2, respectively. A) Concordance statistic (C-index) for prediction of overall survival by each of the 14 models in each of the 10 microarray datasets. Datasets used for training a model are shown in black; datasets used by the authors of a model for testing are bordered in gray. Darker shades of orange correspond to better predictions. C-index is expected to be 0.5 for a random risk score, and 1.0 corresponds to a model that predicts the exact order of deaths correctly. Models are ordered from top to bottom by best to worst summary C-index, and datasets are ordered from left to right by average C-index for all models not trained on that dataset. That means prediction models in general validated well in Dressman et al. dataset (26) and models that validated in multiple other datasets did not validate in The Cancer Genome Atlas (TCGA, (12)) or Crijns dataset (20). B) Summary C-index for each model with training datasets excluded (orange boxes) and with test sets presented by the authors also excluded (vertical bars). 95% confidence intervals (CI; gray lines) were obtained from resampling of cases. The top-ranked model is that proposed by the TCGA Consortium, and this dataset is conversely one of the most difficult for prediction by other models not using it for training.

Figure 3.

Figure 3.

Similarity of risk predictions, models, and gene signatures. Citations for models are provided in Table 1. A) Quantile normalized risk predictions from each model for all 1251 patients in the database. Yellow indicates high predicted risk, and blue indicates low predicted risk. Models and patients are clustered by Spearman correlation of predicted risk. Patients who died within 4 years are labeled in black along the top. B) Spearman correlation heatmap of the risk scores produced by the 14 models, along with similarity of genes represented in each model, as calculated by Jaccard index (intersection divided by union of genes). Although the highest overlap between gene signatures is just greater than 2%, some of these models produce highly correlated risk predictions (ρ > 0.5). Gene overlap and correlation between risk scores are associated (ρ = 0.40; 95% confidence interval = 0.21 to 0.56).

Figure 4.

Figure 4.

Publication bias toward prognostic models with favorable independent validation. Citations for models are provided in Table 1. We calculated the meta-analysis concordance statistic (C-index) for each model whose publication presented independent validation using 1) only test datasets presented in the original publication of the model and 2) all available data not used in the original publication. Error bars indicate 95% confidence intervals for the C-index. Of 10 models that presented validation in test data, eight performed better in these test datasets than in datasets not used in the original publications (P = .06, two-sided Wilcoxon signed-rank test).

Figure 5.

Figure 5.

Prognostic (improvement over random signatures (IOR) score of gene signatures relative to random gene signatures, equalizing the influences of authors’ algorithms for generating risk scores, quality of the original training data, and gene signature size. A) Methodology for comparing prognostic quality of gene sets to random gene sets. A simple risk score, defined as the sum of expressions of bad-prognosis genes minus the sum of expressions of good-prognosis genes, was trained and evaluated using all allowable combinations of training and independent validation sets. The IOR score is the fraction of training/test set combinations in which the gene signature achieves a higher concordance index (C-index) than random gene signatures of the same size. It is expected to be 0.5 for a random gene signature and 1 for a gene signature that is better than random signatures in all available training/test set combinations. *Author training sets are excluded. †Cross-validation statistics are not used. B) Gene set improvement over random signatures. Citations for gene signatures are provided in Table 1. Average C-index for all training/test set combinations is plotted against the number of genes in the signature. For visualization, these averages are compared with the equivalent procedure repeated for 100 random gene sets (gray dots). Solid line is the quadratic best-fit line to C-index vs number of genes for random signatures; dashed line is the 95% confidence interval for the best-fit line; and dotted lines are the 95% prediction interval for expected average C-index of individual random gene signatures.

Comment in

Similar articles

Cited by

References

    1. Siegel R, Naishadham D, Jemal A. Cancer statistics, 2012. CA Cancer J Clin. 2012;62(1):10–29. - PubMed
    1. Cannistra SA, Bast RC, Jr, Berek JS, et al. Progress in the management of gynecologic cancer: consensus summary statement. J Clin Oncol. 2003;21(10 Suppl):129s–132s. - PubMed
    1. Na YJ, Farley J, Zeh A, et al. Ovarian cancer: markers of response. Int J Gynecol Cancer. 2009;19(Suppl 2):S21–S29. - PubMed
    1. Chon HS, Lancaster JM. Microarray-based gene expression studies in ovarian cancer. Cancer Control. 2011;18(1):8–15. - PubMed
    1. Sabatier R, Finetti P, Cervera N, et al. Gene expression profiling and prediction of clinical outcome in ovarian cancer. Crit Rev Oncol Hematol. 2009;72(2):98–109. - PubMed

Publication types

MeSH terms

LinkOut - more resources