The PeptideAtlas project - PubMed (original) (raw)

The PeptideAtlas project

Frank Desiere et al. Nucleic Acids Res. 2006.

Abstract

The completion of the sequencing of the human genome and the concurrent, rapid development of high-throughput proteomic methods have resulted in an increasing need for automated approaches to archive proteomic data in a repository that enables the exchange of data among researchers and also accurate integration with genomic data. PeptideAtlas (http://www.peptideatlas.org/) addresses these needs by identifying peptides by tandem mass spectrometry (MS/MS), statistically validating those identifications and then mapping identified sequences to the genomes of eukaryotic organisms. A meaningful comparison of data across different experiments generated by different groups using different types of instruments is enabled by the implementation of a uniform analytic process. This uniform statistical validation ensures a consistent and high-quality set of peptide and protein identifications. The raw data from many diverse proteomic experiments are made available in the associated PeptideAtlas repository in several formats. Here we present a summary of our process and details about the Human, Drosophila and Yeast PeptideAtlas builds.

PubMed Disclaimer

Figures

Figure 1

Figure 1

PeptideAtlas analysis pipeline for the annotation of genomes with high-quality peptide sequences derived from high-throughput 2D-LC-MS/MS analysis of biological samples.

Figure 2

Figure 2

Plots: cumulative number of MS/MS spectra versus cumulative number of distinct peptides (all spectra and peptides correspond to peptide identifications with probabilities ≥0.9). The figure will show saturation when new spectra do not yield new peptide identifications. Tables: statistics for atlas builds (number of experiments, number of spectra that yielded a peptide identification with probability P ≥ 0.9, number of distinct peptides identified, number of proteins identified and percentage of all genes to which the proteins map).

Figure 3

Figure 3

Peptide view in the PeptideAtlas web-based database interface. For the specified peptide (by accession number or sequence) a summary is displayed indicating individual attributes of the peptide, protein and genome mapping information, and the samples in which it has been observed. Different information is available in other views. Additionally there are two-way links to the Ensembl genome browser that allow users to see PeptideAtlas peptides as tracks in the genome browser and link out for more information. Additional information is available at our website.

References

    1. Eng J., McCormack A.L., Yates J.R. An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom. 1994;5:976–989. - PubMed
    1. Keller A., Nesvizhskii A.I., Kolker E., Aebersold R. Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. Anal. Chem. 2002;74:5383–5392. - PubMed
    1. Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. - PubMed
    1. Dowell R.D., Jokerst R.M., Day A., Eddy S.R., Stein L. The distributed annotation system. BMC Bioinformatics. 2001;2:7. - PMC - PubMed
    1. Hubbard T., Andrews D., Caccamo M., Cameron G., Chen Y., Clamp M., Clarke L., Coates G., Cox T., Cunningham F., et al. Ensembl 2005. Nucleic Acids Res. 2005;33:D447–D453. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources