A complete mass-spectrometric map of the yeast proteome applied to quantitative trait analysis - PubMed (original) (raw)

. 2013 Feb 14;494(7436):266-70.

doi: 10.1038/nature11835. Epub 2013 Jan 20.

Mathieu Clément-Ziza, Henry Lam, David S Campbell, Alexander Schmidt, Eric W Deutsch, Hannes Röst, Zhi Sun, Oliver Rinner, Lukas Reiter, Qin Shen, Jacob J Michaelson, Andreas Frei, Simon Alberti, Ulrike Kusebauch, Bernd Wollscheid, Robert L Moritz, Andreas Beyer, Ruedi Aebersold

Affiliations

A complete mass-spectrometric map of the yeast proteome applied to quantitative trait analysis

Paola Picotti et al. Nature. 2013.

Abstract

Experience from different fields of life sciences suggests that accessible, complete reference maps of the components of the system under study are highly beneficial research tools. Examples of such maps include libraries of the spectroscopic properties of molecules, or databases of drug structures in analytical or forensic chemistry. Such maps, and methods to navigate them, constitute reliable assays to probe any sample for the presence and amount of molecules contained in the map. So far, attempts to generate such maps for any proteome have failed to reach complete proteome coverage. Here we use a strategy based on high-throughput peptide synthesis and mass spectrometry to generate an almost complete reference map (97% of the genome-predicted proteins) of the Saccharomyces cerevisiae proteome. We generated two versions of this mass-spectrometric map, one supporting discovery-driven (shotgun) and the other supporting hypothesis-driven (targeted) proteomic measurements. Together, the two versions of the map constitute a complete set of proteomic assays to support most studies performed with contemporary proteomic technologies. To show the utility of the maps, we applied them to a protein quantitative trait locus (QTL) analysis, which requires precise measurement of the same set of peptides over a large number of samples. Protein measurements over 78 S. cerevisiae strains revealed a complex relationship between independent genetic loci, influencing the levels of related proteins. Our results suggest that selective pressure favours the acquisition of sets of polymorphisms that adapt protein levels but also maintain the stoichiometry of functionally related pathway members.

PubMed Disclaimer

Figures

Figure 1

Figure 1. Yeast proteome statistics and peptide selection criteria

(a) Detectability of the yeast proteome by large-scale proteomic and antibody-based approaches, as derived from the PeptideAtlas database and from Ghaemmaghammi et al (5). (b) Selection of PTPs for the yeast proteome. For each protein we selected up to eight most frequently observed PTPs in the PeptideAtlas (PA) that were also conforming to defined length constraints (see Methods). For proteins with zero to five PTPs in PA which had been detected via the antibody-based approach[5] we selected up to five additional PTPs predicted with PeptideSieve[26]. For proteins undetected by both proteomic and antibody-based methods, we predicted five PTPs, forcing the selection of at least two of these to a pI below 4.5, where available. For the remaining unmatched ORFs we relaxed the peptide length constraints (see Methods), and repeated the previous steps. The pie chart represents proteins associated to all S. cerevisiae ORFs. Proteins for which peptides were selected based only on PA evidence, (green), on PA evidence or PeptideSieve prediction (blue); on PeptideSieve prediction, including two peptides with pI<4.5; Proteins for which longer peptides were allowed (pink); proteins not covered by our peptide selection criteria (red). (c) Example of protein sequences that lack suitable proteotypic peptides. Trypsin cleavage sites are indicated in blue; RP/KP bonds, which prevent trypsin cleavage, are in green.

Figure 2

Figure 2. Generation of a reference mass spectrometric map for the yeast proteome

The schema shows the sequential steps of map generation: peptides were selected based on the known ORFeome and the criteria listed in the text; peptides were synthesized on a small scale using the SPOT-synthesis technology; the peptide library was analyzed on a QQQ MS and an ion trap-type mass spectrometer to generate the corresponding consensus spectral libraries; existing, quality filtered datasets were added to the synthetic spectral libraries; the QQQ library was used to extract the best coordinates for SRM measurements of yeast proteins; the consensus ion trap library can be used for spectral searches of any LC-MS/MS dataset from S. cerevisiae.

Figure 3

Figure 3. Composition and usage of the spectral libraries

(a) Number of peptides per protein in the SRM assay (green) or ion trap (yellow) library. The percentage is relative to the total of 6,607 theoretical S. cerevisiae proteins. (b) Hydrophobicity distribution for all selected peptides (green) and for those that could not be detected with our QQQ-workflow (yellow). Hydrophobicity is expressed as SSRCalc[64] score bins. % values are relative to the total number of peptides synthesized in each bin. (c) Number of protein identifications from an unfractionated whole yeast digest by various methods, as a function of decoy-estimated protein false discovery rates. The instrument used was an Orbitrap Velos (Thermo Scientific). DDA, conventional data-dependent acquisition; IL, inclusion list-based acquisition; SeqS, sequence database searching against the yeast theoretical proteome, performed with the engine X!Tandem[36]; SpS, spectral searching against the IT library, with the tool SpectraST[62]. (d) Minimal number of transitions needed to uniquely identify precursor ions in the QQQ library. A cumulative plot of the minimal number of transitions forming a unique ion signature (UIS) is shown for all the precursors in the QQQ SRM library. Transitions were selected with decreasing intensity from the SRMAtlas and uniqueness for a set of transitions was established if no other precursor within a Q1/Q3 window ± 0.7 m/z (and SSRCalc-predicted elution time window, for time-scheduled acquisition) produced transitions that contained all of the query transitions. The analysis is shown for two different backgrounds: all yeast peptides potentially detectable by MS, as derived from the yeast PA (purple) and all theoretical tryptic peptides in the yeast proteome (green). RT/no RT indicate whether time-scheduled SRM acquisition is considered.

Figure 4

Figure 4. Quantitative trait analysis from the targeted proteomic dataset

(a) Two-way cluster analysis of summarized protein abundances measured by SRM in 82 samples (78 strains) of the RMxBY cross. Columns are clustered according to the samples, and rows according to the proteomic traits. Abundance levels are color coded in a blue-red gradient, red corresponding to high abundance; white indicates missing data. The completeness of the dataset reaches 99.50% even though the summarization procedure was conservative and generated additional missing values. (b) Network representation of protein abundance QTL for the 48 targeted proteins. Metabolic interactions were manually reconstructed from BioCyc (

http://biocyc.org/

). Epistatic pQTL (red edges) always connect 2 loci that are in espistasis with respect to a protein abundance trait. (c) Abundances of the proteins of the B1B2 protein module in the parental strains and segregants. For each protein, abundances are shown for groups of strains separated according to their genotype at the respective pQTL. A pair of interacting loci was linked to the variations of Hsp60p. The epistatic interaction is clearly visible when the RM allele is inherited at both loci. In contrast, the effects of the two loci linked to Bat2p are additive. Interestingly, the directionality of regulation is shared by all components of the module but Bat2p (over expression when the RM alleles are inherited). Bat1p and Bat2p are paralogs catalyzing the same metabolic reaction in opposite directions (see Supplementary Discussion).

Comment in

Similar articles

Cited by

References

    1. Fleischmann RD, et al. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science. 1995;269(5223):496–512. - PubMed
    1. Venter JC, et al. The sequence of the human genome. Science. 2001;291(5507):1304–51. - PubMed
    1. Uhlen M, Graslund S, Sundstrom M. A pilot project to generate affinity reagents to human proteins. Nat Methods. 2008;5(10):854–5. - PubMed
    1. Taussig MJ, et al. ProteomeBinders: planning a European resource of affinity reagents for analysis of the human proteome. Nat Methods. 2007;4(1):13–7. - PubMed
    1. Berglund L, et al. A genecentric Human Protein Atlas for expression profiles based on antibodies. Mol Cell Proteomics. 2008;7(10):2019–27. - PubMed

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources