Reduced amino acid alphabet is sufficient to accurately recognize intrinsically disordered protein - PubMed (original) (raw)
Reduced amino acid alphabet is sufficient to accurately recognize intrinsically disordered protein
Edward A Weathers et al. FEBS Lett. 2004.
Free article
Abstract
Intrinsically disordered proteins are an important class of proteins with unique functions and properties. Here, we have applied a support vector machine (SVM) trained on naturally occurring disordered and ordered proteins to examine the contribution of various parameters (vectors) to recognizing proteins that contain disordered regions. We find that a SVM that incorporates only amino acid composition has a recognition accuracy of 87+/-2%. This result suggests that composition alone is sufficient to accurately recognize disorder. Interestingly, SVMs using reduced sets of amino acids based on chemical similarity preserve high recognition accuracy. A set as small as four retains an accuracy of 84+/-2%; this suggests that general physicochemical properties rather than specific amino acids are important factors contributing to protein disorder.
Similar articles
- Correlation and prediction of gene expression level from amino acid and dipeptide composition of its protein.
Raghava GP, Han JH. Raghava GP, et al. BMC Bioinformatics. 2005 Mar 17;6:59. doi: 10.1186/1471-2105-6-59. BMC Bioinformatics. 2005. PMID: 15773999 Free PMC article. - Prediction of unfolded segments in a protein sequence based on amino acid composition.
Coeytaux K, Poupon A. Coeytaux K, et al. Bioinformatics. 2005 May 1;21(9):1891-900. doi: 10.1093/bioinformatics/bti266. Epub 2005 Jan 18. Bioinformatics. 2005. PMID: 15657106 - The intrinsic disorder alphabet. III. Dual personality of serine.
Uversky VN. Uversky VN. Intrinsically Disord Proteins. 2015 Mar 17;3(1):e1027032. doi: 10.1080/21690707.2015.1027032. eCollection 2015. Intrinsically Disord Proteins. 2015. PMID: 28232888 Free PMC article. Review. - Chemical approaches for the detection and synthesis of acetylated proteins.
Yang YY, Hang HC. Yang YY, et al. Chembiochem. 2011 Jan 24;12(2):314-22. doi: 10.1002/cbic.201000558. Epub 2011 Jan 11. Chembiochem. 2011. PMID: 21243719 Review. No abstract available.
Cited by
- Studies on titin PEVK peptides and their interaction.
Duan Y, DeKeyser JG, Damodaran S, Greaser ML. Duan Y, et al. Arch Biochem Biophys. 2006 Oct 1;454(1):16-25. doi: 10.1016/j.abb.2006.07.017. Epub 2006 Aug 15. Arch Biochem Biophys. 2006. PMID: 16949547 Free PMC article. - Research progress of reduced amino acid alphabets in protein analysis and prediction.
Liang Y, Yang S, Zheng L, Wang H, Zhou J, Huang S, Yang L, Zuo Y. Liang Y, et al. Comput Struct Biotechnol J. 2022 Jul 4;20:3503-3510. doi: 10.1016/j.csbj.2022.07.001. eCollection 2022. Comput Struct Biotechnol J. 2022. PMID: 35860409 Free PMC article. Review. - Evidence for a shared nuclear pore complex architecture that is conserved from the last common eukaryotic ancestor.
DeGrasse JA, DuBois KN, Devos D, Siegel TN, Sali A, Field MC, Rout MP, Chait BT. DeGrasse JA, et al. Mol Cell Proteomics. 2009 Sep;8(9):2119-30. doi: 10.1074/mcp.M900038-MCP200. Epub 2009 Jun 13. Mol Cell Proteomics. 2009. PMID: 19525551 Free PMC article. - Prediction of Metal Ion Binding Sites in Proteins from Amino Acid Sequences by Using Simplified Amino Acid Alphabets and Random Forest Model.
Kumar S. Kumar S. Genomics Inform. 2017 Dec;15(4):162-169. doi: 10.5808/GI.2017.15.4.162. Epub 2017 Dec 29. Genomics Inform. 2017. PMID: 29307143 Free PMC article. - Hsp70 chaperones and type I PRMTs are sequestered at intranuclear inclusions caused by polyalanine expansions in PABPN1.
Tavanez JP, Bengoechea R, Berciano MT, Lafarga M, Carmo-Fonseca M, Enguita FJ. Tavanez JP, et al. PLoS One. 2009 Jul 29;4(7):e6418. doi: 10.1371/journal.pone.0006418. PLoS One. 2009. PMID: 19641605 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources