Prediction of S-sulfenylation sites using mRMR feature selection and fuzzy support vector machine algorithm - PubMed (original) (raw)
Prediction of S-sulfenylation sites using mRMR feature selection and fuzzy support vector machine algorithm
Zhe Ju et al. J Theor Biol. 2018.
Abstract
Cysteine S-sulfenylation is an important protein post-translational modification, which plays a crucial role in transcriptional regulation, cell signaling, and protein functions. To better elucidate the molecular mechanism of S-sulfenylation, it is important to identify S-sulfenylated substrates and their corresponding S-sulfenylation sites accurately. In this study, a novel bioinformatics tool named Sulf_FSVM is proposed to predict S-sulfenylation sites by using multiple feature extraction and fuzzy support vector machine algorithm. On the one hand, amino acid factors, binary encoding, and the composition of k-spaced amino acid pairs features are incorporated to encode S-sulfenylation sites. And the maximum relevance minimum redundancy method are adopted to remove the redundant features. On the other hand, a fuzzy support vector machine algorithm is used to handle the class imbalance and noise problem in S-sulfenylation sites training dataset. As illustrated by 10-fold cross-validation, the performance of Sulf_FSVM achieves a satisfactory performance with a Sensitivity of 73.26%, a Specificity of 70.78%, an Accuracy of 71.07% and a Matthew's correlation coefficient of 0.2971. Independent tests also show that Sulf_FSVM significantly outperforms existing S-sulfenylation sites predictors. Therefore, Sulf_FSVM can be a useful tool for accurate prediction of protein S-sulfenylation sites.
Keywords: Cross-validation; Feature extraction; K-spaced amino acid pairs; Post-translational modification; SVM.
Copyright © 2018 Elsevier Ltd. All rights reserved.
Similar articles
- Prediction of lysine glutarylation sites by maximum relevance minimum redundancy feature selection.
Ju Z, He JJ. Ju Z, et al. Anal Biochem. 2018 Jun 1;550:1-7. doi: 10.1016/j.ab.2018.04.005. Epub 2018 Apr 8. Anal Biochem. 2018. PMID: 29641975 - Prediction of lysine propionylation sites using biased SVM and incorporating four different sequence features into Chou's PseAAC.
Ju Z, He JJ. Ju Z, et al. J Mol Graph Model. 2017 Sep;76:356-363. doi: 10.1016/j.jmgm.2017.07.022. Epub 2017 Jul 25. J Mol Graph Model. 2017. PMID: 28763688 - SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites.
Al-Barakati HJ, McConnell EW, Hicks LM, Poole LB, Newman RH, Kc DB. Al-Barakati HJ, et al. Sci Rep. 2018 Jul 26;8(1):11288. doi: 10.1038/s41598-018-29126-x. Sci Rep. 2018. PMID: 30050050 Free PMC article. - A Comprehensive Review of In silico Analysis for Protein S-sulfenylation Sites.
Hasan MM, Khatun MS, Kurata H. Hasan MM, et al. Protein Pept Lett. 2018;25(9):815-821. doi: 10.2174/0929866525666180905110619. Protein Pept Lett. 2018. PMID: 30182830 Review. - Proteome-Wide Analysis of Cysteine S-Sulfenylation Using a Benzothiazine-Based Probe.
Fu L, Liu K, Ferreira RB, Carroll KS, Yang J. Fu L, et al. Curr Protoc Protein Sci. 2019 Feb;95(1):e76. doi: 10.1002/cpps.76. Epub 2018 Oct 12. Curr Protoc Protein Sci. 2019. PMID: 30312022 Free PMC article. Review.
Cited by
- pCysMod: Prediction of Multiple Cysteine Modifications Based on Deep Learning Framework.
Li S, Yu K, Wu G, Zhang Q, Wang P, Zheng J, Liu ZX, Wang J, Gao X, Cheng H. Li S, et al. Front Cell Dev Biol. 2021 Feb 23;9:617366. doi: 10.3389/fcell.2021.617366. eCollection 2021. Front Cell Dev Biol. 2021. PMID: 33732693 Free PMC article. - DeepCSO: A Deep-Learning Network Approach to Predicting Cysteine S-Sulphenylation Sites.
Lyu X, Li S, Jiang C, He N, Chen Z, Zou Y, Li L. Lyu X, et al. Front Cell Dev Biol. 2020 Dec 1;8:594587. doi: 10.3389/fcell.2020.594587. eCollection 2020. Front Cell Dev Biol. 2020. PMID: 33335901 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials