Semi-supervised learning for peptide identification from shotgun proteomics datasets
doi: 10.1038/nmeth1113. Epub 2007 Oct 21.
- PMID: 17952086
- DOI: 10.1038/nmeth1113
Lukas Käll et al. Nat Methods. 2007 Nov.
Abstract
Shotgun proteomics uses liquid chromatography-tandem mass spectrometry to identify proteins in complex biological samples. We describe an algorithm, called Percolator, for improving the rate of confident peptide identifications from a collection of tandem mass spectra. Percolator uses semi-supervised machine learning to discriminate between correct and decoy spectrum identifications, correctly assigning peptides to 17% more spectra from a tryptic Saccharomyces cerevisiae dataset, and up to 77% more spectra from non-tryptic digests, relative to a fully supervised approach.
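The semi-supervised idea in the abstract can be illustrated with a minimal sketch. It is not Percolator's implementation. It assumes each spectrum identification is a feature vector with a target/decoy label, that decoys serve as negative examples, and that the false discovery rate is estimated as decoys divided by targets above a score threshold. The function names (`confident_targets`, `train_logistic`, `semi_supervised_rescore`), the parameters, and the use of logistic regression as a self-contained stand-in classifier are all assumptions made for illustration.

```python
import math

def confident_targets(scores, is_decoy, fdr=0.01):
    """Return (threshold, n_targets): the lowest target score at which the
    decoy-estimated FDR (decoys / targets at or above it) is still <= fdr."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    targets = decoys = 0
    best = (float("inf"), 0)  # no target accepted yet
    for i in order:
        if is_decoy[i]:
            decoys += 1
            continue
        targets += 1
        if decoys / targets <= fdr:
            best = (scores[i], targets)
    return best

def train_logistic(X, y, epochs=100, lr=0.05):
    """Stochastic-gradient logistic regression (illustrative stand-in classifier)."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for x, t in zip(X, y):
            z = sum(wi * xi for wi, xi in zip(w, x)) + b
            z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
            g = 1.0 / (1.0 + math.exp(-z)) - t
            w = [wi - lr * g * xi for wi, xi in zip(w, x)]
            b -= lr * g
    return w, b

def semi_supervised_rescore(X, is_decoy, start_feature=0, rounds=3, fdr=0.01):
    """Iteratively relabel: confident targets are positives, all decoys are
    negatives; retrain and rescore every identification each round."""
    scores = [x[start_feature] for x in X]  # begin from a single raw feature
    for _ in range(rounds):
        threshold, _ = confident_targets(scores, is_decoy, fdr)
        positives = [x for x, s, d in zip(X, scores, is_decoy)
                     if not d and s >= threshold]
        negatives = [x for x, d in zip(X, is_decoy) if d]
        if not positives or not negatives:
            break  # nothing to learn from in this round
        labels = [1] * len(positives) + [0] * len(negatives)
        w, b = train_logistic(positives + negatives, labels)
        scores = [sum(wi * xi for wi, xi in zip(w, x)) + b for x in X]
    return scores
```

Calling `semi_supervised_rescore(X, is_decoy)` returns one new score per identification. Comparing `confident_targets` on the starting feature and on the returned scores shows whether the learned combination accepts more targets at the same estimated error rate.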