Prediction of high-responding peptides for targeted protein assays by mass spectrometry (original) (raw)

References

  1. Rifai, N., Gillette, M.A. & Carr, S.A. Protein biomarker discovery and validation: the long and uncertain path to clinical utility. Nat. Biotechnol. 24, 971–983 (2006).
    Article CAS Google Scholar
  2. Uhlen, M. & Hober, S. Generation and validation of affinity reagents on a proteome-wide level. J. Mol. Recognit. (2008).
  3. Anderson, L. & Hunter, C.L. Quantitative mass spectrometric multiple reaction monitoring assays for major plasma proteins. Mol. Cell. Proteomics 5, 573–588 (2006).
    Article CAS Google Scholar
  4. Gerber, S.A., Rush, J., Stemman, O., Kirschner, M.W. & Gygi, S.P. Absolute quantification of proteins and phosphoproteins from cell lysates by tandem MS. Proc. Natl. Acad. Sci. USA 100, 6940–6945 (2003).
    Article CAS Google Scholar
  5. Keshishian, H., Addona, T., Burgess, M., Kuhn, E. & Carr, S.A. Quantitative, multiplexed assays for low abundance proteins in plasma by targeted mass spectrometry and stable isotope dilution. Mol. Cell. Proteomics 6, 2212–2229 (2007).
    Article CAS Google Scholar
  6. Stahl-Zeng, J. et al. High sensitivity detection of plasma proteins by multiple reaction monitoring of N-glycosites. Mol. Cell. Proteomics 6, 1809–1817 (2007).
    Article CAS Google Scholar
  7. Kuster, B., Schirle, M., Mallick, P. & Aebersold, R. Scoring proteomes with proteotypic peptide probes. Nat. Rev. Mol. Cell Biol. 6, 577–583 (2005).
    Article CAS Google Scholar
  8. Craig, R., Cortens, J.P. & Beavis, R.C. Open source system for analyzing, validating, and storing protein identification data. J. Proteome Res. 3, 1234–1242 (2004).
    Article CAS Google Scholar
  9. Deutsch, E.W., Lam, H. & Aebersold, R. PeptideAtlas: a resource for target selection for emerging targeted proteomics workflows. EBMO reports 9, 429–434 (2008).
    Article CAS Google Scholar
  10. Mallick, P. et al. Computational prediction of proteotypic peptides for quantitative proteomics. Nat. Biotechnol. 25, 125–131 (2007).
    Article CAS Google Scholar
  11. Sanders, W.S., Bridges, S.M., McCarthy, F.M., Nanduri, B. & Burgess, S.C. Prediction of peptides observable by mass spectrometry applied at the experimental set level. BMC Bioinformatics 8 Suppl 7, S23 (2007).
    Article Google Scholar
  12. Tang, H. et al. A computational approach toward label-free protein quantification using predicted peptide detectability. Bioinformatics 22, e481–e488 (2006).
    Article CAS Google Scholar
  13. Webb-Robertson, B.J. et al. A support vector machine model for the prediction of proteotypic peptides for accurate mass and time proteomics. Bioinformatics 24, 1503–1509 (2008).
    Article CAS Google Scholar
  14. Jaffe, J.D. et al. Accurate inclusion mass screening: a bridge from unbiased discovery to targeted assay development for biomarker verification. Mol. Cell. Proteomics 7, 1952–1962 (2008).
    Article CAS Google Scholar
  15. Malmstrom, J., Lee, H. & Aebersold, R. Advances in proteomic workflows for systems biology. Curr. Opin. Biotechnol. 18, 378–384 (2007).
    Article Google Scholar
  16. Kawashima, S. & Kanehisa, M. AAindex: amino acid index database. Nucleic Acids Res. 28, 374 (2000).
    Article CAS Google Scholar
  17. Zhang, Z. Prediction of low-energy collision-induced dissociation spectra of peptides. Anal. Chem. 76, 3908–3922 (2004).
    Article CAS Google Scholar
  18. Breiman, L. Random forest. Mach. Learn. 45, 5–32 (2001).
    Article Google Scholar
  19. Liaw, A. & Wiener, M. ClassificatIon and Regression by randomForest. R News 2, 18–22 (2002).
    Google Scholar
  20. Diaz-Uriarte, R. & Alvarez de Andres, S. Gene selection and classification of microarray data using random forest. BMC Bioinformatics 7, 3 (2006).
    Article Google Scholar
  21. Enot, D.P., Beckmann, M., Overy, D. & Draper, J. Predicting interpretability of metabolome models based on behavior, putative identity, and biological relevance of explanatory signals. Proc. Natl. Acad. Sci. USA 103, 14865–14870 (2006).
    Article CAS Google Scholar
  22. Vapnik, V. The Nature of Statistical Learning Theory (Springer, New York, 1995).
    Book Google Scholar
  23. Bishop, C. Neural Networks for Pattern Recognition (Oxford University Press, Oxford, 1995).
    Google Scholar
  24. Fawcett, T. ROC Graphs: Notes and Practical Considerations for Researchers (Technical report, HP Laboratories, Palo Alto, CA, USA, 2004).
    Google Scholar
  25. Lange, V., Picotti, P., Domon, B. & Aebersold, R. Selected reaction monitoring for quantitative proteomics: a tutorial. Mol. Syst. Biol. 4, 222 (2008).
    Article Google Scholar
  26. Svetnik, V. et al. Random forest: a classification and regression tool for compound classification and QSAR modeling. J. Chem. Inf. Comput. Sci. 43, 1947–1958 (2003).
    Article CAS Google Scholar
  27. Cech, N.B. & Enke, C.G. Relating electrospray ionization response to nonpolar character of small peptides. Anal. Chem. 72, 2717–2723 (2000).
    Article CAS Google Scholar
  28. Cech, N.B. & Enke, C.G. Practical implications of some recent studies in electrospray ionization fundamentals. Mass Spectrom. Rev. 20, 362–387 (2001).
    Article CAS Google Scholar
  29. Cowan, R. & Whittaker, R.G. Hydrophobicity indices for amino acid residues as determined by high-performance liquid chromatography. Pept. Res. 3, 75–80 (1990).
    CAS PubMed Google Scholar
  30. Parker, J.M., Guo, D. & Hodges, R.S. New hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites. Biochemistry 25, 5425–5432 (1986).
    Article CAS Google Scholar
  31. Whiteaker, J.R. et al. Integrated pipeline for mass spectrometry-based discovery and confirmation of biomarkers demonstrated in a mouse model of breast cancer. J. Proteome Res. 6, 3962–3975 (2007).
    Article CAS Google Scholar
  32. Zolg, J.W. & Langen, H. How industry is approaching the search for new diagnostic markers and biomarkers. Mol. Cell. Proteomics 3, 345–354 (2004).
    Article CAS Google Scholar
  33. Sokal, R.R. & Rohlf, F.J. Biometry the Principles and Practice of Statistics in Biological Research, edn. 3 (W.H. Freeman and Company, 1995).
    Google Scholar
  34. Thomson, R., Hodgman, T.C., Yang, Z.R. & Doyle, A.K. Characterizing proteolytic cleavage site activity using bio-basis function neural networks. Bioinformatics 19, 1741–1747 (2003).
    Article CAS Google Scholar
  35. Yen, C.Y. et al. Improving sensitivity in shotgun proteomics using a peptide-centric database with reduced complexity: protease cleavage and SCX elution rules from data mining of MS/MS spectra. Anal. Chem. 78, 1071–1084 (2006).
    Article CAS Google Scholar
  36. Chen, C., Liaw, A & Breiman, L. Using Random Forest to Learn Imbalanced Data (Technical Report 666. Statistics Department of University of California at Berkeley, Berkeley, 2004).
    Google Scholar
  37. Sing, T., Sander, O., Beerenwinkel, N. & Lengauer, T. ROCR: visualizing classifier performance in R. Bioinformatics 21, 3940–3941 (2005).
    Article CAS Google Scholar
  38. Klimek, J. et al. The standard protein mix database: a diverse data set to assist in the production of improved peptide and protein identification software tools. J. Proteome Res. 7, 96–103 (2008).
    Article CAS Google Scholar
  39. Wang, H. et al. Development and evaluation of a micro- and nanoscale proteomic sample preparation method. J. Proteome Res. 4, 2397–2403 (2005).
    Article CAS Google Scholar

Download references