Large-scale phosphorylation analysis of mouse liver - PubMed (original) (raw)

Large-scale phosphorylation analysis of mouse liver

Judit Villén et al. Proc Natl Acad Sci U S A. 2007.

Abstract

Protein phosphorylation is a complex network of signaling and regulatory events that affects virtually every cellular process. Our understanding of the nature of this network as a whole remains limited, largely because of an array of technical challenges in the isolation and high-throughput sequencing of phosphorylated species. In the present work, we demonstrate that a combination of tandem phosphopeptide enrichment methods, high performance MS, and optimized database search/data filtering strategies is a powerful tool for surveying the phosphoproteome. Using our integrated analytical platform, we report the identification of 5,635 nonredundant phosphorylation sites from 2,328 proteins from mouse liver. From this list of sites, we extracted both novel and known motifs for specific Ser/Thr kinases including a "dipolar" motif. We also found that C-terminal phosphorylation was more frequent than at any other location and that the distribution of potential kinases for these sites was unique. Finally, we identified double phosphorylation motifs that may be involved in ordered phosphorylation.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.

Fig. 1.

Strategy used for the large-scale identification and characterization of phosphorylation sites from mouse liver. (A) Sample preparation. Tissue homogenization and lysis was followed by trypsin digestion. Tryptic peptides (10 mg) were subjected to a two-step phosphopeptide enrichment. SCX chromatography provided a substantial enrichment in early eluting fractions. Subsequent IMAC of each fraction provided additional selective capture. In addition, 80 mg of tryptic peptides were enriched for pTyr-containing peptides by immunoaffinity purification. (B) Data processing for the SCX/IMAC experiment. MS/MS spectra from 15 analyses were searched with Sequest against the mouse protein database (DB) containing both forward (target) and reversed (decoy) sequences for FP calculations. Importantly, the target/decoy database search strategy also provided a means to establish appropriate orthogonal filtering criteria (mass deviation, enzyme specificity, solution charge state at pH 2.65, and Sequest scoring). Only two reversed-sequence peptides were found in the final filtered list of 8,529 phosphopeptides (0.02% FP rate). In total, 5,250 nonredundant sites were identified. The Ascore algorithm (26) was used to determine a probability of correct localization for each individual site. Finally, phosphorylation motifs were extracted from the data set with the Motif-X algorithm (34).

Fig. 2.

Fig. 2.

Distribution of phosphopeptides and their properties across 15 SCX fractions. (A) Phosphopeptide distribution. Shown are data for phosphopeptides identified per fraction. (B) Phosphorylation sites per peptide. Shown are percentages of phosphopeptides in each fraction containing one, two, or three phosphorylation sites. (C) Net solution charge state (pH 2.65). Shown are percentages of phosphopeptides in each fraction with calculated solution charge states between −1 and +6. SCX chromatography separates primarily based on net solution charge state, and each phosphate group subtracts one net charge from a peptide.

Fig. 3.

Fig. 3.

Positional distribution of phosphorylation sites. (A) Frequency of site detection with respect to protein sequence position for this study. Protein sequences were divided into 1% bins and plotted by frequency. The dashed line shows the median value. A strong trend for C-terminal (98–100% of the protein length) phosphorylation was observed. (B) This same trend was observed for the distribution of sites from the Phospho.ELM database, a curated resource containing sites from the literature. (C) Classification of phosphorylation sites into the three most general motif classes based on their position within the protein. All localized sites were classified into one of three general kinase motifs or as “other” (see Methods), and their distributions at protein ends were determined. N-terminal, within first 10 aa; C-terminal, within last 10 aa; central, within all remaining residues. C-terminal sites had different distributions than N-terminal or central sites.

Fig. 4.

Fig. 4.

Classification of localized sites (P < 0.01) into the most general kinase recognition sequence categories: acidic, basic, Pro-directed, and “others.” (A) Comparison of sequence category distributions for the sites detected in this study with the control set of all Ser and Thr residues from the same set of proteins. (B) Diverse phosphorylation patterns were observed for proteins belonging to different GO annotation cellular localization categories. (C) Examples of general phosphorylation kinase classification patterns for three functional GO categories.

Fig. 5.

Fig. 5.

Phosphorylation-specific motifs using the Motif-X algorithm (34). The data set included all sites with Ascore values of ≥19 (n = 3,439). The complete set of motifs is shown in

SI Table 4

. (A_–_E) Sequence logos for some examples of single-phosphorylation motifs where the phosphorylated residue (S or T) is centered. (A) Pro-directed motifs. (B) Basic motifs representative of CaM kinase and Akt kinase substrates, respectively. (C) Acidic motif with 56 occurrences in our data set of 630 in the entire mouse database. (D) Pro-directed motif centered on Thr with a strong preference for additional Pro residues C-terminal to the phosphate. (E) pTyr motif. (F_–_I) Examples of double-phosphorylation motifs. Secondary phosphorylated sites away from the central residue position also are phosphorylated. (F) Pro-directed with additional pSer at +4. (G) Basic upstream and acidic downstream. (H) Acidic motifs with two pSer residues. This category was the most highly represented for doubly phosphorylated peptides. (I) pThr-directed motif.

Similar articles

Cited by

References

    1. Hunter T. Cell. 2000;100:113–127. - PubMed
    1. Pawson T, Scott JD. Trends Biochem Sci. 2005;30:286–290. - PubMed
    1. Manning G, Whyte DB, Martinez R, Hunter T, Sudarsanam S. Science. 2002;298:1912–1934. - PubMed
    1. Songyang Z, Blechner S, Hoagland N, Hoekstra MF, Piwnica-Worms H, Cantley LC. Curr Biol. 1994;4:973–982. - PubMed
    1. Hutti JE, Jarrell ET, Chang JD, Abbott DW, Storz P, Toker A, Cantley LC, Turk BE. Nat Methods. 2004;1:27–29. - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources