Detecting disease-causing genes by LASSO-Patternsearch algorithm - PubMed (original) (raw)
Detecting disease-causing genes by LASSO-Patternsearch algorithm
Weiliang Shi et al. BMC Proc. 2007.
Abstract
The Genetic Analysis Workshop 15 Problem 3 simulated rheumatoid arthritis data set provided 100 replicates of simulated single-nucleotide polymorphism (SNP) and covariate data sets for 1500 families with an affected sib pair and 2000 controls, modeled after real rheumatoid arthritis data. The data generation model included nine unobserved trait loci, most of which have one or more of the generated SNPs associated with them. These data sets provide an ideal experimental test bed for evaluating new and old algorithms for selecting SNPs and covariates that can separate cases from controls, because the cases and controls are known as well as the identities of the trait loci. LASSO-Patternsearch is a new multi-step algorithm with a LASSO-type penalized likelihood method at its core specifically designed to detect and model interactions between important predictor variables. In this article the original LASSO-Patternsearch algorithm is modified to handle the large number of SNPs plus covariates. We start with a screen step within the framework of parametric logistic regression. The patterns that survived the screen step were further selected by a penalized logistic regression with the LASSO penalty. And finally, a parametric logistic regression model were built on the patterns that survived the LASSO step. In our analysis of Genetic Analysis Workshop 15 Problem 3 data we have identified most of the associated SNPs and relevant covariates. Upon using the model as a classifier, very competitive error rates were obtained.
Similar articles
- LASSO-Patternsearch algorithm with application to ophthalmology and genomic data.
Shi W, Wahba G, Wright S, Lee K, Klein R, Klein B. Shi W, et al. Stat Interface. 2008;1(1):137-153. doi: 10.4310/sii.2008.v1.n1.a12. Stat Interface. 2008. PMID: 18852828 Free PMC article. - The partitioned LASSO-patternsearch algorithm with application to gene expression data.
Shi W, Wahba G, Irizarry RA, Bravo HC, Wright SJ. Shi W, et al. BMC Bioinformatics. 2012 May 15;13:98. doi: 10.1186/1471-2105-13-98. BMC Bioinformatics. 2012. PMID: 22587526 Free PMC article. - Genome-wide association analysis by lasso penalized logistic regression.
Wu TT, Chen YF, Hastie T, Sobel E, Lange K. Wu TT, et al. Bioinformatics. 2009 Mar 15;25(6):714-21. doi: 10.1093/bioinformatics/btp041. Epub 2009 Jan 28. Bioinformatics. 2009. PMID: 19176549 Free PMC article. - Summary of contributions to GAW15 Group 13: candidate gene association studies.
de Andrade M, Allen AS, Brinza D, Cheng R, Da Y, de Vries AR, Ewhida A, Feng Z, Jung H, Hsieh HJ, Köhler K, Liu Y, Liu-Mares W, Luan J, Marquard V, Nolte IM, Oh S, Platt A, Qin X, Yoo YJ, Yuan A, Tian X, Won S. de Andrade M, et al. Genet Epidemiol. 2007;31 Suppl 1:S110-7. doi: 10.1002/gepi.20287. Genet Epidemiol. 2007. PMID: 18046754 Review.
Cited by
- Exploiting genome structure in association analysis.
Kim S, Xing EP. Kim S, et al. J Comput Biol. 2014 Apr;21(4):345-60. doi: 10.1089/cmb.2009.0224. Epub 2011 May 6. J Comput Biol. 2014. PMID: 21548809 Free PMC article. - A review of feature reduction techniques in neuroimaging.
Mwangi B, Tian TS, Soares JC. Mwangi B, et al. Neuroinformatics. 2014 Apr;12(2):229-44. doi: 10.1007/s12021-013-9204-3. Neuroinformatics. 2014. PMID: 24013948 Free PMC article. Review. - LASSO-Patternsearch algorithm with application to ophthalmology and genomic data.
Shi W, Wahba G, Wright S, Lee K, Klein R, Klein B. Shi W, et al. Stat Interface. 2008;1(1):137-153. doi: 10.4310/sii.2008.v1.n1.a12. Stat Interface. 2008. PMID: 18852828 Free PMC article. - The partitioned LASSO-patternsearch algorithm with application to gene expression data.
Shi W, Wahba G, Irizarry RA, Bravo HC, Wright SJ. Shi W, et al. BMC Bioinformatics. 2012 May 15;13:98. doi: 10.1186/1471-2105-13-98. BMC Bioinformatics. 2012. PMID: 22587526 Free PMC article. - Selecting Genetic Variants and Interactions Associated with Amyotrophic Lateral Sclerosis: A Group LASSO Approach.
Feronato SG, Silva MLM, Izbicki R, Farias TDJ, Shigunov P, Dallagiovanna B, Passetti F, Dos Santos HG. Feronato SG, et al. J Pers Med. 2022 Aug 19;12(8):1330. doi: 10.3390/jpm12081330. J Pers Med. 2022. PMID: 36013279 Free PMC article.
References
- Breiman L, Friedman J, Olshen R, Stone C. Classification and Regression Trees. New York: Chapman & Hall; 1984.
- Ruczinski I, Kooperberg C, Leblanc M. Logic regression. J Comput Graph Stat. 2003;12:475–511. doi: 10.1198/1061860032238. - DOI
- Breiman L. Random forests. Mach Learn. 2001;45:5–32. doi: 10.1023/A:1010933404324. - DOI
- Park M, Hastie T. Penalized Logistic Regression for Detecting Gene Interactions Tech Rep 00-25. Palo Alto: Department of Statistics, Stanford University; 2006. - PubMed
- Tibshirani R. Regression shrinkage and selection via the lasso. J Roy Stat Soc B. 1996;58:267–288.
LinkOut - more resources
Full Text Sources