Identifying differentially expressed genes using false discovery rate controlling procedures - PubMed (original) (raw)
Comparative Study
Identifying differentially expressed genes using false discovery rate controlling procedures
Anat Reiner et al. Bioinformatics. 2003.
Abstract
Motivation: DNA microarrays have recently been used for the purpose of monitoring expression levels of thousands of genes simultaneously and identifying those genes that are differentially expressed. The probability that a false identification (type I error) is committed can increase sharply when the number of tested genes gets large. Correlation between the test statistics attributed to gene co-regulation and dependency in the measurement errors of the gene expression levels further complicates the problem. In this paper we address this very large multiplicity problem by adopting the false discovery rate (FDR) controlling approach. In order to address the dependency problem, we present three resampling-based FDR controlling procedures, that account for the test statistics distribution, and compare their performance to that of the naïve application of the linear step-up procedure in Benjamini and Hochberg (1995). The procedures are studied using simulated microarray data, and their performance is examined relative to their ease of implementation.
Results: Comparative simulation analysis shows that all four FDR controlling procedures control the FDR at the desired level, and retain substantially more power then the family-wise error rate controlling procedures. In terms of power, using resampling of the marginal distribution of each test statistics substantially improves the performance over the naïve one. The highest power is achieved, at the expense of a more sophisticated algorithm, by the resampling-based procedures that resample the joint distribution of the test statistics and estimate the level of FDR control.
Availability: An R program that adjusts p-values using FDR controlling procedures is freely available over the Internet at www.math.tau.ac.il/\~ybenja.
Similar articles
- A mixture model for estimating the local false discovery rate in DNA microarray analysis.
Liao JG, Lin Y, Selvanayagam ZE, Shih WJ. Liao JG, et al. Bioinformatics. 2004 Nov 1;20(16):2694-701. doi: 10.1093/bioinformatics/bth310. Epub 2004 May 14. Bioinformatics. 2004. PMID: 15145810 Clinical Trial. - Multidimensional local false discovery rate for microarray studies.
Ploner A, Calza S, Gusnanto A, Pawitan Y. Ploner A, et al. Bioinformatics. 2006 Mar 1;22(5):556-65. doi: 10.1093/bioinformatics/btk013. Epub 2005 Dec 20. Bioinformatics. 2006. PMID: 16368770 - False discovery rate, sensitivity and sample size for microarray studies.
Pawitan Y, Michiels S, Koscielny S, Gusnanto A, Ploner A. Pawitan Y, et al. Bioinformatics. 2005 Jul 1;21(13):3017-24. doi: 10.1093/bioinformatics/bti448. Epub 2005 Apr 19. Bioinformatics. 2005. PMID: 15840707 - Controlling for false discoveries subsequently to large scale one-way ANOVA testing in proteomics: Practical considerations.
Burger T. Burger T. Proteomics. 2023 Sep;23(18):e2200406. doi: 10.1002/pmic.202200406. Epub 2023 Jun 25. Proteomics. 2023. PMID: 37357151 Review. - On dependence assumption in _p_-value based multiple test procedures.
Gou J. Gou J. J Biopharm Stat. 2023 Sep 3;33(5):596-610. doi: 10.1080/10543406.2022.2162066. Epub 2023 Jan 6. J Biopharm Stat. 2023. PMID: 36607042 Review.
Cited by
- Classification prediction of early pulmonary nodes based on weighted gene correlation network analysis and machine learning.
Li G, Yang M, Ran L, Jin F. Li G, et al. J Cancer Res Clin Oncol. 2023 Jul;149(7):3915-3924. doi: 10.1007/s00432-022-04312-7. Epub 2022 Aug 26. J Cancer Res Clin Oncol. 2023. PMID: 36018512 - Which measures of adiposity predict subsequent left ventricular geometry? Evidence from the Bogalusa Heart Study.
Hu T, Yao L, Gustat J, Chen W, Webber L, Bazzano L. Hu T, et al. Nutr Metab Cardiovasc Dis. 2015 Mar;25(3):319-26. doi: 10.1016/j.numecd.2014.11.001. Epub 2014 Nov 17. Nutr Metab Cardiovasc Dis. 2015. PMID: 25534865 Free PMC article. - Design of a tobacco exon array with application to investigate the differential cadmium accumulation property in two tobacco varieties.
Martin F, Bovet L, Cordier A, Stanke M, Gunduz I, Peitsch MC, Ivanov NV. Martin F, et al. BMC Genomics. 2012 Nov 28;13:674. doi: 10.1186/1471-2164-13-674. BMC Genomics. 2012. PMID: 23190529 Free PMC article. - SMAD4-dependent polysome RNA recruitment in human pancreatic cancer cells.
Thornley JA, Trask HW, Ringelberg CS, Ridley CJ, Wang S, Sal-Lari RC, Moore JH, Korc M, Tomlinson CR. Thornley JA, et al. Mol Carcinog. 2012 Oct;51(10):771-82. doi: 10.1002/mc.20845. Epub 2011 Aug 30. Mol Carcinog. 2012. PMID: 22965423 Free PMC article. - Site-directed mutagenesis of the quorum-sensing transcriptional regulator SinR affects the biosynthesis of menaquinone in Bacillus subtilis.
Wu J, Li W, Zhao SG, Qian SH, Wang Z, Zhou MJ, Hu WS, Wang J, Hu LX, Liu Y, Xue ZL. Wu J, et al. Microb Cell Fact. 2021 Jun 7;20(1):113. doi: 10.1186/s12934-021-01603-5. Microb Cell Fact. 2021. PMID: 34098969 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources