In silico estimates of tissue components in surgical samples based on expression profiling data - PubMed (original) (raw)

In silico estimates of tissue components in surgical samples based on expression profiling data

Yipeng Wang et al. Cancer Res. 2010.

Abstract

Tissue samples from many diseases have been used for gene expression profiling studies, but these samples often vary widely in the cell types they contain. Such variation could confound efforts to correlate expression with clinical parameters. In principle, the proportion of each major tissue component can be estimated from the profiling data and used to triage samples before studying correlations with disease parameters. Four large gene expression microarray data sets from prostate cancer, whose tissue components were estimated by pathologists, were used to test the performance of multivariate linear regression models for in silico prediction of major tissue components. Ten-fold cross-validation within each data set yielded average differences between the pathologists' predictions and the in silico predictions of 8% to 14% for the tumor component and 13% to 17% for the stroma component. Across independent data sets that used similar platforms and fresh frozen samples, the average differences were 11% to 12% for tumor and 12% to 17% for stroma. When the models were applied to 219 arrays of "tumor-enriched" samples in the literature, almost one quarter were predicted to have 30% or less tumor cells. Furthermore, there was a 10.5% difference in the average predicted tumor content between 37 recurrent and 42 nonrecurrent cancer patients. As a result, genes that correlated with tissue percentage generally also correlated with recurrence. If such a correlation is not desired, then some samples might be removed to rebalance the data set or tissue percentages might be incorporated into the prediction algorithm. A web service, "CellPred," has been designed for the in silico prediction of sample tissue components based on expression data.

(c)2010 AACR.

PubMed Disclaimer

Conflict of interest statement

Disclosure of Potential Conflicts of Interest: M. McClelland and D. Mercola are cofounders of Proveri, Inc., which is engaged in translational development of aspects of the subject matter.

Figures

Figure 1

Figure 1

In silico tissue component prediction discrepancies compared with pathologists' estimates using 10-fold cross-validation. ●, data set 1; ○, data set 2; □, data set 3; ◊, data set 4. X axis, number of genes used in the prediction model. Y axis, average prediction error rates. A, prediction error rates for tumor components. B, prediction error rates for stroma components.

Figure 2

Figure 2

Tissue component predications on publicly available data sets. A, histogram of the in silico predicted tumor components of 219 arrays that were generated from samples prepared as tumor-enriched prostate cancer samples. X axis, in silico predicted tumor cell percentages. Y axis, frequency of samples. B, box plot shows the differences in tumor tissue components between the nonrecurrence and the recurrence groups of prostate cancer samples in data set 5. X axis, sample groups; NR, nonrecurrence group; REC, recurrence group. Y axis, tumor cell percentages.

Similar articles

Cited by

References

    1. Sorlie T, Tibshirani R, Parker J, et al. Repeated observation of breast tumor subtypes in independent gene expression data sets. Proc Natl Acad Sci U S A. 2003;100:8418–23. - PMC - PubMed
    1. Stuart RO, Wachsman W, Berry CC, et al. In silico dissection of cell-type-associated patterns of gene expression in prostate cancer. Proc Natl Acad Sci U S A. 2004;101:615–20. - PMC - PubMed
    1. Wang Y, Klijn JG, Zhang Y, et al. Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer. Lancet. 2005;365:671–9. - PubMed
    1. Paweletz CP, Liotta LA, Petricoin EF., III New technologies for biomarker analysis of prostate cancer progression: Laser capture microdissection and tissue proteomics. Urology. 2001;57:160–3. - PubMed
    1. Sgroi DC, Teng S, Robinson G, LeVangie R, Hudson JR, Jr, Elkahloun AG. In vivo gene expression profile analysis of human breast cancer progression. Cancer Res. 1999;59:5656–61. - PubMed

Publication types

MeSH terms

Grants and funding

LinkOut - more resources