HDBStat!: a platform-independent software suite for statistical analysis of high dimensional biology data - PubMed (original) (raw)
doi: 10.1186/1471-2105-6-86.
Jode W Edwards, Jelai Wang, Gary L Gadbury, Vinodh Srinivasasainagendra, Stanislav O Zakharkin, Kyoungmi Kim, Tapan Mehta, Jacob P L Brand, Amit Patki, Grier P Page, David B Allison
Affiliations
- PMID: 15813968
- PMCID: PMC1087834
- DOI: 10.1186/1471-2105-6-86
HDBStat!: a platform-independent software suite for statistical analysis of high dimensional biology data
Prinal Trivedi et al. BMC Bioinformatics. 2005.
Abstract
Background: Many efforts in microarray data analysis are focused on providing tools and methods for the qualitative analysis of microarray data. HDBStat! (High-Dimensional Biology-Statistics) is a software package designed for analysis of high dimensional biology data such as microarray data. It was initially developed for the analysis of microarray gene expression data, but it can also be used for some applications in proteomics and other aspects of genomics. HDBStat! provides statisticians and biologists a flexible and easy-to-use interface to analyze complex microarray data using a variety of methods for data preprocessing, quality control analysis and hypothesis testing.
Results: Results generated from data preprocessing methods, quality control analysis and hypothesis testing methods are output in the form of Excel CSV tables, graphs and an Html report summarizing data analysis.
Conclusion: HDBStat! is a platform-independent software that is freely available to academic institutions and non-profit organizations. It can be downloaded from our website http://www.soph.uab.edu/ssg\_content.asp?id=1164.
Figures
Figure 1
Data analysis in HDBStat! is divided into four steps – data import, data preprocessing, quality control and hypotheses testing. At each step, user input is required and in return, the results are displayed in the interface and/or output to a file.
Figure 2
Screenshot of gene expression data file in Excel format
Figure 3
Screenshot of chip level information file in Excel format
Figure 4
Deleted residuals graph
Figure 5
Mix-o-matic graph
Figure 6
Power analysis graph
Similar articles
- MARS: microarray analysis, retrieval, and storage system.
Maurer M, Molidor R, Sturn A, Hartler J, Hackl H, Stocker G, Prokesch A, Scheideler M, Trajanoski Z. Maurer M, et al. BMC Bioinformatics. 2005 Apr 18;6:101. doi: 10.1186/1471-2105-6-101. BMC Bioinformatics. 2005. PMID: 15836795 Free PMC article. - WebArray: an online platform for microarray data analysis.
Xia X, McClelland M, Wang Y. Xia X, et al. BMC Bioinformatics. 2005 Dec 21;6:306. doi: 10.1186/1471-2105-6-306. BMC Bioinformatics. 2005. PMID: 16371165 Free PMC article. - MBEToolbox: a MATLAB toolbox for sequence data analysis in molecular biology and evolution.
Cai JJ, Smith DK, Xia X, Yuen KY. Cai JJ, et al. BMC Bioinformatics. 2005 Mar 22;6:64. doi: 10.1186/1471-2105-6-64. BMC Bioinformatics. 2005. PMID: 15780146 Free PMC article. - Using GenePattern for gene expression analysis.
Kuehn H, Liberzon A, Reich M, Mesirov JP. Kuehn H, et al. Curr Protoc Bioinformatics. 2008 Jun;Chapter 7:7.12.1-7.12.39. doi: 10.1002/0471250953.bi0712s22. Curr Protoc Bioinformatics. 2008. PMID: 18551415 Free PMC article. Review. - Unix survival guide.
Stein LD. Stein LD. Curr Protoc Bioinformatics. 2007 Jan;Appendix 1:Appendix 1C. doi: 10.1002/0471250953.bia01cs16. Curr Protoc Bioinformatics. 2007. PMID: 18428775 Review.
Cited by
- Illustrations on Using the Distribution of a P-value in High Dimensional Data Analyses.
Hu X, Gadbury GL, Xiang Q, Allison DB. Hu X, et al. Adv Appl Stat Sci. 2010 Feb;1(2):191-213. Adv Appl Stat Sci. 2010. PMID: 26719631 Free PMC article. - Global transcriptomic profiling of aspen trees under elevated [CO2] to identify potential molecular mechanisms responsible for enhanced radial growth.
Wei H, Gou J, Yordanov Y, Zhang H, Thakur R, Jones W, Burton A. Wei H, et al. J Plant Res. 2013 Mar;126(2):305-20. doi: 10.1007/s10265-012-0524-4. Epub 2012 Oct 13. J Plant Res. 2013. PMID: 23065025 - Exploring tomato gene functions based on coexpression modules using graph clustering and differential coexpression approaches.
Fukushima A, Nishizawa T, Hayakumo M, Hikosaka S, Saito K, Goto E, Kusano M. Fukushima A, et al. Plant Physiol. 2012 Apr;158(4):1487-502. doi: 10.1104/pp.111.188367. Epub 2012 Feb 3. Plant Physiol. 2012. PMID: 22307966 Free PMC article. - A comparison of imputation procedures and statistical tests for the analysis of two-dimensional electrophoresis data.
Miecznikowski JC, Damodaran S, Sellers KF, Rabin RA. Miecznikowski JC, et al. Proteome Sci. 2010 Dec 15;8:66. doi: 10.1186/1477-5956-8-66. Proteome Sci. 2010. PMID: 21159180 Free PMC article. - Gene Expression Browser: large-scale and cross-experiment microarray data integration, management, search & visualization.
Zhang M, Zhang Y, Liu L, Yu L, Tsang S, Tan J, Yao W, Kang MS, An Y, Fan X. Zhang M, et al. BMC Bioinformatics. 2010 Aug 20;11:433. doi: 10.1186/1471-2105-11-433. BMC Bioinformatics. 2010. PMID: 20727159 Free PMC article.
References
- Allison DB, Gadbury GL, Moonseong H, Fernandez JR, Lee C, Prolla TA, Weindruch R. A mixture model approach for the analysis of microarray gene expression data. Comp Statist & Data Anal. 2002;39:1–20. doi: 10.1016/S0167-9473(01)00046-9. - DOI
- Beasley TM, Page GP, Brand JPL, Gadbury GL, Mountz JD, Allison DB. Chebyshev's inequality for non-parametric testing with small N and a in microarray research. J R Statist Soc C. 2004;53:95–108. doi: 10.1111/j.1467-9876.2004.00428.x. - DOI
- Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Statist Soc B. 1995;57:289–300.
- Benjamini Y, Yekutieli D. The control of the false discovery rate in multiple testing under dependency. Ann Statist. 2001;29:1165–1188. doi: 10.1214/aos/1013699998. - DOI
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources