A stability based method for discovering structure in clustered data
- PMID: 11928511
Free article
Asa Ben-Hur et al. Pac Symp Biocomput. 2002.
Abstract
We present a method for visually and quantitatively assessing the presence of structure in clustered data. The method exploits measurements of the stability of clustering solutions obtained by perturbing the data set. Stability is characterized by the distribution of pairwise similarities between clusterings obtained from subsamples of the data. High pairwise similarities indicate a stable clustering pattern. The method can be used with any clustering algorithm; it provides a means of rationally defining an optimum number of clusters, and can also detect the lack of structure in data. We show results on artificial and microarray data using a hierarchical clustering algorithm.
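The subsampling procedure summarized in the abstract can be illustrated with a minimal sketch. Several details here are assumptions rather than statements from the abstract: the Jaccard coefficient as the pairwise similarity measure, a subsampling fraction of 0.8, ten subsample pairs per value of k, and a naive average-linkage routine (on squared Euclidean distance) standing in for the hierarchical clustering algorithm. All function names are hypothetical.

```python
import random
from itertools import combinations


def dist2(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def average_linkage(points, k):
    """Naive agglomerative clustering; returns one cluster label per point."""
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > k:
        best = None
        for a, b in combinations(range(len(clusters)), 2):
            # Mean squared distance between the members of the two clusters.
            d = sum(dist2(points[i], points[j])
                    for i in clusters[a] for j in clusters[b])
            d /= len(clusters[a]) * len(clusters[b])
            if best is None or d < best[0]:
                best = (d, a, b)
        _, a, b = best
        clusters[a].extend(clusters[b])
        del clusters[b]  # b > a, so index a is unaffected
    labels = [0] * len(points)
    for label, members in enumerate(clusters):
        for i in members:
            labels[i] = label
    return labels


def cluster_subsample(points, k, frac, rng):
    """Cluster a random subsample; returns {original index: label}."""
    idx = rng.sample(range(len(points)), int(frac * len(points)))
    labels = average_linkage([points[i] for i in idx], k)
    return dict(zip(idx, labels))


def jaccard(lab_a, lab_b):
    """Jaccard similarity of two clusterings on the points they share."""
    common = sorted(set(lab_a) & set(lab_b))
    both = either = 0
    for i, j in combinations(common, 2):
        same_a = lab_a[i] == lab_a[j]
        same_b = lab_b[i] == lab_b[j]
        both += same_a and same_b
        either += same_a or same_b
    return both / either if either else 1.0


def stability(points, k, n_pairs=10, frac=0.8, seed=0):
    """Similarities between clusterings of n_pairs pairs of subsamples."""
    rng = random.Random(seed)
    sims = []
    for _ in range(n_pairs):
        a = cluster_subsample(points, k, frac, rng)
        b = cluster_subsample(points, k, frac, rng)
        sims.append(jaccard(a, b))
    return sims


# Artificial data (an assumption for illustration): two well-separated blobs.
data_rng = random.Random(1)
blobs = [(data_rng.gauss(cx, 0.3), data_rng.gauss(0.0, 0.3))
         for cx in (0.0, 10.0) for _ in range(15)]

for k in (2, 3, 4):
    sims = stability(blobs, k)
    print(k, round(sum(sims) / len(sims), 3))
```

In this sketch, a distribution of similarities concentrated near 1 for some k suggests a stable clustering at that k, while similarities that are spread out for every k would be consistent with a lack of structure. Inspecting the full list returned by `stability`, rather than only its mean, is closer to the distribution-based view the abstract describes.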
Similar articles
- Randomized maps for assessing the reliability of patients clusters in DNA microarray data analyses.
Bertoni A, Valentini G. Artif Intell Med. 2006 Jun;37(2):85-109. doi: 10.1016/j.artmed.2006.03.005. Epub 2006 May 23. PMID: 16720093
- Evaluation of stability of k-means cluster ensembles with respect to random initialization.
Kuncheva LI, Vetrov DP. IEEE Trans Pattern Anal Mach Intell. 2006 Nov;28(11):1798-808. doi: 10.1109/TPAMI.2006.226. PMID: 17063684
- Combining multiple clusterings using evidence accumulation.
Fred AL, Jain AK. IEEE Trans Pattern Anal Mach Intell. 2005 Jun;27(6):835-50. doi: 10.1109/TPAMI.2005.113. PMID: 15943417
- Penalized probabilistic clustering.
Lu Z, Leen TK. Neural Comput. 2007 Jun;19(6):1528-67. doi: 10.1162/neco.2007.19.6.1528. PMID: 17444759
- Knowledge based cluster ensemble for cancer discovery from biomolecular data.
Yu Z, Wongb HS, You J, Yang Q, Liao H. IEEE Trans Nanobioscience. 2011 Jun;10(2):76-85. doi: 10.1109/TNB.2011.2144997. Epub 2011 Jul 7. PMID: 21742574
Cited by
- ESCHR: a hyperparameter-randomized ensemble approach for robust clustering across diverse datasets.
Goggin SM, Zunder ER. Genome Biol. 2024 Sep 16;25(1):242. doi: 10.1186/s13059-024-03386-5. PMID: 39285487 Free PMC article.
- COPS: A novel platform for multi-omic disease subtype discovery via robust multi-objective evaluation of clustering algorithms.
Rintala TJ, Fortino V. PLoS Comput Biol. 2024 Aug 5;20(8):e1012275. doi: 10.1371/journal.pcbi.1012275. eCollection 2024 Aug. PMID: 39102448 Free PMC article.
- Beyond humor styles: the nature of humor types and differences in basic personality traits from Zuckerman's Alternative Five-Factor Model.
Čekrlija Đ, Schermer JA, Mrđa P. Curr Issues Personal Psychol. 2023 Apr 5;12(1):1-10. doi: 10.5114/cipp/159941. eCollection 2024. PMID: 38756195 Free PMC article.
- Multi-Dimensional Validation of the Integration of Syntactic and Semantic Distance Measures for Clustering Fibromyalgia Patients in the Rheumatic Monitor Big Data Study.
Goldstein A, Shahar Y, Weisman Raymond M, Peleg H, Ben-Chetrit E, Ben-Yehuda A, Shalom E, Goldstein C, Shiloh SS, Almoznino G. Bioengineering (Basel). 2024 Jan 19;11(1):97. doi: 10.3390/bioengineering11010097. PMID: 38275577 Free PMC article.
- Starling: Introducing a mesoscopic scale with Confluence for Graph Clustering.
Gaume B. PLoS One. 2023 Aug 24;18(8):e0290090. doi: 10.1371/journal.pone.0290090. eCollection 2023. PMID: 37619240 Free PMC article.