Partial Cox regression analysis for high-dimensional microarray gene expression data - PubMed (original) (raw)
Partial Cox regression analysis for high-dimensional microarray gene expression data
Hongzhe Li et al. Bioinformatics. 2004.
Abstract
Motivation: An important application of microarray technology is to predict various clinical phenotypes based on the gene expression profile. Success has been demonstrated in molecular classification of cancer in which different types of cancer serve as categorical outcome variable. However, there has been less research in linking gene expression profile to censored survival outcome such as patients' overall survival time or time to cancer relapse. In this paper, we develop a partial Cox regression method for constructing mutually uncorrelated components based on microarray gene expression data for predicting the survival of future patients.
Results: The proposed partial Cox regression method involves constructing predictive components by repeated least square fitting of residuals and Cox regression fitting. The key difference from the standard principal components of Cox regression analysis is that in constructing the predictive components, our method utilizes the observed survival/censoring information. We also propose to apply the time-dependent receiver operating characteristic curve analysis to evaluate the results. We applied our methods to a publicly available dataset of diffuse large B-cell lymphoma. The outcomes indicated that combining the partial Cox regression method with principal components analysis results in parsimonious model with fewer components and better predictive performance. We conclude that the proposed partial Cox regression method can be very useful in building a parsimonious predictive model that can accurately predict the survival of future patients based on the gene expression profile and survival times of previous patients.
Availability: R codes are available upon request.
Similar articles
- Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data.
Gui J, Li H. Gui J, et al. Bioinformatics. 2005 Jul 1;21(13):3001-8. doi: 10.1093/bioinformatics/bti422. Epub 2005 Apr 6. Bioinformatics. 2005. PMID: 15814556 - Dimension reduction methods for microarrays with application to censored survival data.
Li L, Li H. Li L, et al. Bioinformatics. 2004 Dec 12;20(18):3406-12. doi: 10.1093/bioinformatics/bth415. Epub 2004 Jul 15. Bioinformatics. 2004. PMID: 15256406 - Boosting proportional hazards models using smoothing splines, with applications to high-dimensional microarray data.
Li H, Luan Y. Li H, et al. Bioinformatics. 2005 May 15;21(10):2403-9. doi: 10.1093/bioinformatics/bti324. Epub 2005 Feb 15. Bioinformatics. 2005. PMID: 15713732 - Cross-study analysis of gene expression data for intermediate neuroblastoma identifies two biological subtypes.
Warnat P, Oberthuer A, Fischer M, Westermann F, Eils R, Brors B. Warnat P, et al. BMC Cancer. 2007 May 25;7:89. doi: 10.1186/1471-2407-7-89. BMC Cancer. 2007. PMID: 17531100 Free PMC article. Review. - Time-dependent covariates in the Cox proportional-hazards regression model.
Fisher LD, Lin DY. Fisher LD, et al. Annu Rev Public Health. 1999;20:145-57. doi: 10.1146/annurev.publhealth.20.1.145. Annu Rev Public Health. 1999. PMID: 10352854 Review.
Cited by
- CuPCA: a web server for pan-cancer association analysis of large-scale cuproptosis-related genes.
Xu Y, Ma Z, Wang Y, Zhang L, Ye J, Chen Y, Yuan Z. Xu Y, et al. Database (Oxford). 2024 Sep 3;2024:baae075. doi: 10.1093/database/baae075. Database (Oxford). 2024. PMID: 39231258 Free PMC article. - Clinically impactful metabolic subtypes of pancreatic ductal adenocarcinoma (PDAC).
Pervin J, Asad M, Cao S, Jang GH, Feizi N, Haibe-Kains B, Karasinska JM, O'Kane GM, Gallinger S, Schaeffer DF, Renouf DJ, Zogopoulos G, Bathe OF. Pervin J, et al. Front Genet. 2023 Oct 30;14:1282824. doi: 10.3389/fgene.2023.1282824. eCollection 2023. Front Genet. 2023. PMID: 38028629 Free PMC article. - A clinically useful and biologically informative genomic classifier for papillary thyroid cancer.
Craig S, Stretch C, Farshidfar F, Sheka D, Alabi N, Siddiqui A, Kopciuk K, Park YJ, Khalil M, Khan F, Harvey A, Bathe OF. Craig S, et al. Front Endocrinol (Lausanne). 2023 Sep 12;14:1220617. doi: 10.3389/fendo.2023.1220617. eCollection 2023. Front Endocrinol (Lausanne). 2023. PMID: 37772080 Free PMC article. - Five crucial prognostic-related autophagy genes stratified female breast cancer patients aged 40-60 years.
Li X, Zhang H, Liu J, Li P, Sun Y. Li X, et al. BMC Bioinformatics. 2021 Dec 7;22(1):580. doi: 10.1186/s12859-021-04503-y. BMC Bioinformatics. 2021. PMID: 34876005 Free PMC article. - Supervised two-dimensional functional principal component analysis with time-to-event outcomes and mammogram imaging data.
Jiang S, Cao J, Rosner B, Colditz GA. Jiang S, et al. Biometrics. 2023 Jun;79(2):1359-1369. doi: 10.1111/biom.13611. Epub 2022 Mar 15. Biometrics. 2023. PMID: 34854477 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources