pFind 2.0: a software package for peptide and protein identification via tandem mass spectrometry - PubMed (original) (raw)
pFind 2.0: a software package for peptide and protein identification via tandem mass spectrometry
Le-Heng Wang et al. Rapid Commun Mass Spectrom. 2007.
Abstract
This paper describes the pFind 2.0 software package for peptide and protein identification via tandem mass spectrometry. Firstly, the most important feature of pFind 2.0 is that it offers a modularized and customized platform for third parties to test and compare their algorithms. The developers can create their own modules following the open application programming interface (API) standards and then add it into workflows in place of the default modules. In addition, to accommodate different requirements, the package provides four automated workflows adopting different algorithm modules, executing processes and result reports. Based on this design, pFind 2.0 provides an automated target-decoy database search strategy: The user can just specify a certain false positive rate (FPR) and start searching. Then the system will return the protein identification results automatically filtered by such an estimated FPR. Secondly, pFind 2.0 is also of high accuracy and high speed. Many pragmatic preprocessing, peptide-scoring, validation, and protein inference algorithms have been incorporated. To speed up the searching process, a toolbox for indexing protein databases is developed for high-throughput applications and all modules are implemented under a new architecture designed for large-scale parallel and distributed searching. An experiment on a public dataset shows that pFind 2.0 can identify more peptides than SEQUEST and Mascot at the 1% FPR. It is also demonstrated that this version of pFind 2.0 has better usability and higher speed than its previous versions. The software and more detailed supplementary information can both be accessed at http://pfind.ict.ac.cn/.
Copyright (c) 2007 John Wiley & Sons, Ltd.
Similar articles
- pFind: a novel database-searching software system for automated peptide and protein identification via tandem mass spectrometry.
Li D, Fu Y, Sun R, Ling CX, Wei Y, Zhou H, Zeng R, Yang Q, He S, Gao W. Li D, et al. Bioinformatics. 2005 Jul 1;21(13):3049-50. doi: 10.1093/bioinformatics/bti439. Epub 2005 Apr 7. Bioinformatics. 2005. PMID: 15817687 - swissPIT: a novel approach for pipelined analysis of mass spectrometry data.
Quandt A, Hernandez P, Masselot A, Hernandez C, Maffioletti S, Pautasso C, Appel RD, Lisacek F. Quandt A, et al. Bioinformatics. 2008 Jun 1;24(11):1416-7. doi: 10.1093/bioinformatics/btn139. Epub 2008 Apr 23. Bioinformatics. 2008. PMID: 18436540 - Probability-based pattern recognition and statistical framework for randomization: modeling tandem mass spectrum/peptide sequence false match frequencies.
Feng J, Naiman DQ, Cooper B. Feng J, et al. Bioinformatics. 2007 Sep 1;23(17):2210-7. doi: 10.1093/bioinformatics/btm267. Epub 2007 May 17. Bioinformatics. 2007. PMID: 17510167 - Large-scale database searching using tandem mass spectra: looking up the answer in the back of the book.
Sadygov RG, Cociorva D, Yates JR 3rd. Sadygov RG, et al. Nat Methods. 2004 Dec;1(3):195-202. doi: 10.1038/nmeth725. Nat Methods. 2004. PMID: 15789030 Review. - Software for computational peptide identification from MS-MS data.
Xu C, Ma B. Xu C, et al. Drug Discov Today. 2006 Jul;11(13-14):595-600. doi: 10.1016/j.drudis.2006.05.011. Drug Discov Today. 2006. PMID: 16793527 Review.
Cited by
- Protein analysis by shotgun/bottom-up proteomics.
Zhang Y, Fonslow BR, Shan B, Baek MC, Yates JR 3rd. Zhang Y, et al. Chem Rev. 2013 Apr 10;113(4):2343-94. doi: 10.1021/cr3003533. Epub 2013 Feb 26. Chem Rev. 2013. PMID: 23438204 Free PMC article. Review. No abstract available. - Genome Mining and Biological Engineering of Type III Borosins from Bacteria.
Xu K, Guo S, Zhang W, Deng Z, Zhang Q, Ding W. Xu K, et al. Int J Mol Sci. 2024 Aug 29;25(17):9350. doi: 10.3390/ijms25179350. Int J Mol Sci. 2024. PMID: 39273298 Free PMC article. - quantms: a cloud-based pipeline for quantitative proteomics enables the reanalysis of public proteomics data.
Dai C, Pfeuffer J, Wang H, Zheng P, Käll L, Sachsenberg T, Demichev V, Bai M, Kohlbacher O, Perez-Riverol Y. Dai C, et al. Nat Methods. 2024 Sep;21(9):1603-1607. doi: 10.1038/s41592-024-02343-1. Epub 2024 Jul 4. Nat Methods. 2024. PMID: 38965444 Free PMC article. - Characterization of protein unfolding by fast cross-linking mass spectrometry using di-ortho-phthalaldehyde cross-linkers.
Wang JH, Tang YL, Gong Z, Jain R, Xiao F, Zhou Y, Tan D, Li Q, Huang N, Liu SQ, Ye K, Tang C, Dong MQ, Lei X. Wang JH, et al. Nat Commun. 2022 Mar 18;13(1):1468. doi: 10.1038/s41467-022-28879-4. Nat Commun. 2022. PMID: 35304446 Free PMC article. - Transferred subgroup false discovery rate for rare post-translational modifications detected by mass spectrometry.
Fu Y, Qian X. Fu Y, et al. Mol Cell Proteomics. 2014 May;13(5):1359-68. doi: 10.1074/mcp.O113.030189. Epub 2013 Nov 7. Mol Cell Proteomics. 2014. PMID: 24200586 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources