Integrative Data Analysis of Multi-Platform Cancer Data with a Multimodal Deep Learning Approach
Muxuan Liang et al. IEEE/ACM Trans Comput Biol Bioinform. 2015 Jul-Aug.
Abstract
Identifying cancer subtypes plays an important role in revealing insights into disease pathogenesis and advancing personalized therapy. Recent high-throughput sequencing technologies have enabled the rapid collection of multi-platform genomic data (e.g., gene expression, miRNA expression, and DNA methylation) for the same set of tumor samples. Although numerous integrative clustering approaches have been developed to analyze cancer data, few are specifically designed to exploit both the deep intrinsic statistical properties of each input modality and the complex cross-modality correlations among multi-platform input data. In this paper, we propose a new machine learning model, called a multimodal deep belief network (DBN), to cluster cancer patients from multi-platform observation data. In our integrative clustering framework, relationships among the inherent features of each single modality are first encoded into multiple layers of hidden variables, and a joint latent model is then employed to fuse the common features derived from the multiple input modalities. A practical learning algorithm, called contrastive divergence (CD), is applied to infer the parameters of our multimodal DBN model in an unsupervised manner. Tests on two available cancer datasets show that our integrative data analysis approach can effectively extract a unified representation of latent features that captures both intra- and cross-modality correlations, and can identify meaningful disease subtypes from multi-platform cancer data. In addition, our approach can identify key genes and miRNAs that may play distinct roles in the pathogenesis of different cancer subtypes. Among those key miRNAs, we found that the expression level of miR-29a is highly correlated with survival time in ovarian cancer patients. These results indicate that our multimodal DBN-based data analysis approach may have practical applications in cancer pathogenesis studies and may provide useful guidelines for personalized cancer therapy.
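The architecture described in the abstract (per-modality hidden layers, a joint latent layer, and training by contrastive divergence) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes binary (Bernoulli) units, a single hidden layer per modality instead of several, one-step CD (CD-1) on full batches, and random toy data standing in for gene expression, miRNA expression, and DNA methylation. The names `RBM`, `cd1_step`, and `multimodal_features` are hypothetical.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


class RBM:
    """Bernoulli restricted Boltzmann machine trained with CD-1 (illustrative only)."""

    def __init__(self, n_visible, n_hidden, rng):
        self.rng = rng
        self.W = 0.01 * rng.standard_normal((n_visible, n_hidden))
        self.b_v = np.zeros(n_visible)
        self.b_h = np.zeros(n_hidden)

    def hidden_probs(self, v):
        return sigmoid(v @ self.W + self.b_h)

    def visible_probs(self, h):
        return sigmoid(h @ self.W.T + self.b_v)

    def cd1_step(self, v0, lr=0.05):
        # Positive phase: hidden activations driven by the data.
        h0 = self.hidden_probs(v0)
        h0_sample = (self.rng.random(h0.shape) < h0).astype(float)
        # Negative phase: one Gibbs step back to the visibles and up again.
        v1 = self.visible_probs(h0_sample)
        h1 = self.hidden_probs(v1)
        # CD-1 update: data statistics minus reconstruction statistics.
        n = v0.shape[0]
        self.W += lr * (v0.T @ h0 - v1.T @ h1) / n
        self.b_v += lr * (v0 - v1).mean(axis=0)
        self.b_h += lr * (h0 - h1).mean(axis=0)

    def fit(self, data, epochs=20, lr=0.05):
        for _ in range(epochs):
            self.cd1_step(data, lr)
        return self


def multimodal_features(modalities, n_hidden=8, n_joint=4, seed=0):
    """Train one RBM per modality, then a joint RBM on the concatenated hidden layers."""
    rng = np.random.default_rng(seed)
    hidden = []
    for x in modalities:
        rbm = RBM(x.shape[1], n_hidden, rng).fit(x)
        hidden.append(rbm.hidden_probs(x))
    joint_input = np.concatenate(hidden, axis=1)
    joint = RBM(joint_input.shape[1], n_joint, rng).fit(joint_input)
    return joint.hidden_probs(joint_input)


# Toy binary data: 30 samples measured on three hypothetical platforms.
toy_rng = np.random.default_rng(1)
expr = (toy_rng.random((30, 20)) > 0.5).astype(float)
mirna = (toy_rng.random((30, 12)) > 0.5).astype(float)
meth = (toy_rng.random((30, 16)) > 0.5).astype(float)

features = multimodal_features([expr, mirna, meth])
print(features.shape)
```

The resulting `features` matrix holds one joint latent vector per sample. In a pipeline like the one the abstract describes, such vectors would be the input to a clustering step that assigns patients to subtypes. Real-valued omics data would typically need Gaussian visible units or prior binarization, which this sketch leaves out.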