Independent surrogate variable analysis to deconvolve confounding factors in large-scale microarray profiling studies - PubMed (original) (raw)
. 2011 Jun 1;27(11):1496-505.
doi: 10.1093/bioinformatics/btr171. Epub 2011 Apr 6.
Affiliations
- PMID: 21471010
- DOI: 10.1093/bioinformatics/btr171
Independent surrogate variable analysis to deconvolve confounding factors in large-scale microarray profiling studies
Andrew E Teschendorff et al. Bioinformatics. 2011.
Abstract
Motivation: A common difficulty in large-scale microarray studies is the presence of confounding factors, which may significantly skew estimates of statistical significance, cause unreliable feature selection and high false negative rates. To deal with these difficulties, an algorithmic framework known as Surrogate Variable Analysis (SVA) was recently proposed.
Results: Based on the notion that data can be viewed as an interference pattern, reflecting the superposition of independent effects and random noise, we present a modified SVA, called Independent Surrogate Variable Analysis (ISVA), to identify features correlating with a phenotype of interest in the presence of potential confounding factors. Using simulated data, we show that ISVA performs well in identifying confounders as well as outperforming methods which do not adjust for confounding. Using four large-scale Illumina Infinium DNA methylation datasets subject to low signal to noise ratios and substantial confounding by beadchip effects and variable bisulfite conversion efficiency, we show that ISVA improves the identifiability of confounders and that this enables a framework for feature selection that is more robust to model misspecification and heterogeneous phenotypes. Finally, we demonstrate similar improvements of ISVA across four mRNA expression datasets. Thus, ISVA should be useful as a feature selection tool in studies that are subject to confounding.
Availability: An R-package isva is available from www.cran.r-project.org.
Similar articles
- Interactively optimizing signal-to-noise ratios in expression profiling: project-specific algorithm selection and detection p-value weighting in Affymetrix microarrays.
Seo J, Bakay M, Chen YW, Hilmer S, Shneiderman B, Hoffman EP. Seo J, et al. Bioinformatics. 2004 Nov 1;20(16):2534-44. doi: 10.1093/bioinformatics/bth280. Epub 2004 Apr 29. Bioinformatics. 2004. PMID: 15117752 - A comparison of feature selection and classification methods in DNA methylation studies using the Illumina Infinium platform.
Zhuang J, Widschwendter M, Teschendorff AE. Zhuang J, et al. BMC Bioinformatics. 2012 Apr 24;13:59. doi: 10.1186/1471-2105-13-59. BMC Bioinformatics. 2012. PMID: 22524302 Free PMC article. - Compensating for unknown confounders in microarray data analysis using filtered permutations.
Scheid S, Spang R. Scheid S, et al. J Comput Biol. 2007 Jun;14(5):669-81. doi: 10.1089/cmb.2007.R009. J Comput Biol. 2007. PMID: 17683267 - PACK: Profile Analysis using Clustering and Kurtosis to find molecular classifiers in cancer.
Teschendorff AE, Naderi A, Barbosa-Morais NL, Caldas C. Teschendorff AE, et al. Bioinformatics. 2006 Sep 15;22(18):2269-75. doi: 10.1093/bioinformatics/btl174. Epub 2006 May 8. Bioinformatics. 2006. PMID: 16682424 - A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets.
Brägelmann J, Lorenzo Bermejo J. Brägelmann J, et al. Brief Bioinform. 2019 Nov 27;20(6):2055-2065. doi: 10.1093/bib/bby068. Brief Bioinform. 2019. PMID: 30099476 Free PMC article. Review.
Cited by
- Exploratory analysis of ERCC2 DNA methylation in survival among pediatric medulloblastoma patients.
Banfield E, Brown AL, Peckham EC, Rednam SP, Murray J, Okcu MF, Mitchell LE, Chintagumpala MM, Lau CC, Scheurer ME, Lupo PJ. Banfield E, et al. Cancer Epidemiol. 2016 Oct;44:161-166. doi: 10.1016/j.canep.2016.08.020. Epub 2016 Sep 5. Cancer Epidemiol. 2016. PMID: 27607585 Free PMC article. - An Immune Signature for Risk Stratification and Therapeutic Prediction in Helicobacter pylori-Infected Gastric Cancer.
Geng H, Dong Z, Zhang L, Yang C, Li T, Lin Y, Ke S, Xia X, Zhang Z, Zhao G, Zhu C. Geng H, et al. Cancers (Basel). 2022 Jul 4;14(13):3276. doi: 10.3390/cancers14133276. Cancers (Basel). 2022. PMID: 35805047 Free PMC article. - Methylation of a CpG site near the ALDH1A2 gene is associated with loss of control over drinking and related phenotypes.
Harlaar N, Bryan AD, Thayer RE, Karoly HC, Oien N, Hutchison KE. Harlaar N, et al. Alcohol Clin Exp Res. 2014 Mar;38(3):713-21. doi: 10.1111/acer.12312. Epub 2013 Nov 15. Alcohol Clin Exp Res. 2014. PMID: 24236815 Free PMC article. - A meta-analysis of immune-cell fractions at high resolution reveals novel associations with common phenotypes and health outcomes.
Luo Q, Dwaraka VB, Chen Q, Tong H, Zhu T, Seale K, Raffaele JM, Zheng SC, Mendez TL, Chen Y, Carreras N, Begum S, Mendez K, Voisin S, Eynon N, Lasky-Su JA, Smith R, Teschendorff AE. Luo Q, et al. Genome Med. 2023 Jul 31;15(1):59. doi: 10.1186/s13073-023-01211-5. Genome Med. 2023. PMID: 37525279 Free PMC article. - Differential DNA Methylation by Hispanic Ethnicity Among Firefighters in the United States.
Goodrich JM, Furlong MA, Caban-Martinez AJ, Jung AM, Batai K, Jenkins T, Beitel S, Littau S, Gulotta J, Wallentine D, Hughes J, Popp C, Calkins MM, Burgess JL. Goodrich JM, et al. Epigenet Insights. 2021 Mar 26;14:25168657211006159. doi: 10.1177/25168657211006159. eCollection 2021. Epigenet Insights. 2021. PMID: 35036834 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources