O-Acetylated Chemical Reporters of Glycosylation Can Display Metabolism-Dependent Background Labeling of Proteins but Are Generally Reliable Tools for the Identification of Glycoproteins (original) (raw)
Materials and Methods
General Information
Reagents and solvents were obtained from various commercial suppliers and were used without further purification. Thin-layer chromatography (TLC) was performed on EMD Silica Gel 60 F254 plates and visualized by ceric ammonium molybdate or UV. Flash chromatography was perfumed on 60 Ã silica gel (EMD. 1H spectra were obtained on a 400 MHz on a Varian spectrometer Mercury 400 and chemical shifts are recorded in ppm (δ) relative to solvent and coupling constants (J) are reported in Hz.
Synthesis of Ac44AzGal
1,2,3,6-Tetra-_O_-acetyl-D-glucopyranose (250 mg, 0.72 mmol) was dissolved in 7 mL CH2Cl2 and 1 mL of pyridine was added and cooled to 0°C. Triflic anhydride (0.49 mL, 2.87 mmol) was then added dropwise and the reaction was allowed to stir at 0°C for 1 h. The mixture was then diluted with CH2Cl2 and washed twice with 1 M hydrogen chloride, once with saturated sodium bicarbonate, and once with brine. The organic layer was dried over sodium sulfate, filtered, and concentrated. The resulting oil was dissolved in 5 mL DMF, NaN3 (234 mg, 3.6 mmol) was added, and the mixture was stirred for 1.5 h. The reaction was then diluted with ethyl acetate and the organic layer was washed twice with water and once with brine. The organic layer was dried over sodium sulfate, filtered, and concentrated. The product was then purified by column chromatography (10–30% acetone in hexanes, 3 steps) to afford 205 mg of the product. 1H NMR (400 MHz, Chloroform-d) δ 6.31 (t, J = 1.6 Hz, 1H), 5.41–5.39 (m, 2H), 4.24–4.16 (m, 4H), 2.15 (s, 3H), 2.14 (s, 3H), 2.09 (s, 3H), 2.03 (s, 3H). These data are consistent with previously published characterization (Chen and Withers, 2018).
Cell Culture
H1299 cells were grown in RPMI media (GenClone), while HeLa cells were grown in DMEM high glucose media (GenClone). In both cases, media was supplemented with 10% Fetal Bovine Serum (Altanta Biologicals). NIH3T3 cells were grown in DMEM high glucose (GenClone) supplemented with 10% Fetal Calf Serum (Altanta Biologicals). All Cell lines were grown at 37°C and 5.0% CO2.
Methanol/Chloroform/H2O Precipitation
Proteins were recovered through first addition of a 3 × volume of methanol, a 0.75 × volume of chloroform, and a 2 × volume of H2O. The resulting mixtures were then subjected to mixing by vortexing and centrifugation (5 min, 5,000 × g). The aqueous phase separates at the top of the mixture and was removed and discarded without disturbing the interface layer. An additional 2.5 × volume of methanol was then added, followed by mixing by vortexing, and pelleting of protein by centrifugation (10 min, 5,000 × g).
Background Lysate-Labeling
HeLa cells were collected by trypsinization and washed two times with PBS (2 min, 2,000 × g, 4°C).
Native Lysis Conditions
The resulting cell-pellets were resuspended in 400 μL of PBS with 5 mg/mL Protease Inhibitor and then tip sonicated in the ice for 45 s (15 s on, 10 s off). Protein concentration was normalized using a BCA assay (Pierce, ThermoScientific) and diluted to 2 mg mL−1. To 100 μL (200 μg) of this protein solution, was added either Ac42AzMan or Ac44AzGal from either a 2 or 10 mM stock solution in DMSO to give a final concentration of 200 μM or 2 mM, respectively. After incubation of this mixture at 37°C for 2 h, the lysates were precipitated by the addition of 800 μL of cold MeOH and incubation at −80° for 1 h. The precipitates were collected by centrifugation at 8,000 × g, 5 min at 4°C and washed twice with cold MeOH. The supernatant was removed, and the pellet was allowed to air-dry, and then 188 μL 1% SDS buffer (1% SDS, 150 mM NaCl, 50 mM triethanolamine pH 7.4) was added to each sample. The mixture was sonicated in a bath sonicator to ensure complete dissolution. The resulting protein mixture was then subjected to the CuAAC conditions described below.
Denaturing Lysis Conditions
The resulting cell-pellets were resuspended in 100 μL 1% SDS with 5 mg/mL Protease Inhibitor and then tip sonicated for 15 s. Protein concentration was normalized using a BCA assay (Pierce, ThermoScientific) and diluted to 2 mg mL−1. To 50 uL (100 μg) of this protein solution, was added either Ac42AzMan or Ac44AzGal from either a 2 or 10 mM stock solution in DMSO to give a final concentration of 200 μM or 2 mM, respectively. After incubation of this mixture at 37°C for 2 h, the reaction was diluted with 50 μL of 4% SDS (4% SDS, 50 mM triethanolamine pH 7.4, 150 mM NaCl) and subjected to methanol/chloroform/H2O precipitation. The supernatant was removed, and the pellet was allowed to air-dry, and then 94 μL 1% SDS buffer (1% SDS, 150 mM NaCl, 50 mM 50 mM triethanolamine pH 7.4) was added to each sample. The mixture was sonicated in a bath sonicator to ensure complete dissolution. The resulting mixture was then subjected to the CuAAC conditions described below.
CuAAC
To 100 μg of the protein mixtures above was added fresh click chemistry cocktail (6 μL)[alkyne-TAMRA tag (Click Chemistry Tools, 100 μM, 10 mM stock solution in DMSO), tris(2-carboxyethyl)phosphine hydrochloride (TCEP) (1 mM, 50 mM freshly prepared stock solution in water), tris[(1-benzyl-1-_H_-1,2,3-triazol-4-yl)methyl]amine (TBTA) (Click Chemistry Tools, 100 μM, 10 mM stock solution in DMSO), CuSO4·5H2O (1 mM, 50 mM freshly prepared stock solution in water)]. After 1 h, the CuAAC reaction was subjected to methanol/chloroform/H2O precipitation. The resulting protein-pellet was air-dried (5–10 min), and then 25 μL of 4% SDS buffer (4% SDS, 150 mM NaCl, 50 mM 50 mM triethanolamine pH 7.4) was added to each sample. The mixture was sonicated in a bath sonicator until all the protein was dissolved, and 25 μL of 2 × SDS-free loading buffer (20% glycerol, 0.2% bromophenol blue, 1.4% β-mercaptoethanol, pH 6.8) was then added. The samples were boiled for 5 min at 97°C, and 40 μg of protein was then separated by SDS-PAGE. The resulting gels were then visualized using a Typhoon 9400 Variable Mode Imager (GE Healthcare) with a 532 nm laser for excitation and a 30 nm bandpass filter centered at 610 nm for detection.
Metabolic Labeling
To cells at 80–85% confluency, media was exchanged for fresh media containing Ac42AzMan, Ac44AzGal, Ac36AzGlcNAc, Ac4ManNAz, Ac45SGlcNAc (1,000 × stock in DMSO), or DMSO vehicle as indicated. Cells were removed from the plate by trypsinization, collected by centrifugation (2 min, 2,500 × g), and gently washed with PBS (1 mL, 2 ×). The resulting pellets were lysed in 100 μL of 1% NP-40 lysis buffer [1% NP-40, 150 mM NaCl, 50 mM triethanolamine (TEA) pH 7.4] with complete, Mini, EDTA-free Protease Inhibitor Cocktail Tablets (Sigma Aldrich) for 20 min. Any remaining cell debris was then removed by centrifugation (10 min, 10, 000 × g, 4°C). The soluble fraction, or soluble cell lysate, was collected and a BCA assay (Pierce, ThermoScientific) was used to determine protein concentration, which normalized to 1 μg μL−1 using more lysis buffer. CuAAC was then performed on 200 μg of protein by adding 12 μL of fresh click chemistry cocktail [alkyne-TAMRA tag (Click Chemistry Tools, 100 μM, 10 mM stock solution in DMSO), tris(2-carboxyethyl)phosphine hydrochloride (TCEP) (1 mM, 50 mM freshly prepared stock solution in water), tris[(1-benzyl-1-H-1,2,3-triazol-4-yl)methyl]amine (TBTA) (Click Chemistry Tools, 100 μM, 10 mM stock solution in DMSO), CuSO4·5H2O (1 mM, 50 mM freshly prepared stock solution in water)]. The reaction was gently vortexed before proceeding for 1 h at room temperature. After this time, ice cold methanol (1 mL) was then added to the reaction, and the samples were incubated at −20°C for 2 h. The samples were then centrifuged (10 min, 10,000 × g, 4°C), resulting in a pellet of the proteins. This protein-pellet was air-dried (5–10 min), and 25 μL of 4% SDS buffer (4% SDS, 150 mM NaCl, 50 mM 50 mM triethanolamine pH 7.4) was then added. The mixture was completely dissolved using a bath sonicator, followed by addition of 25 μL of 2 × SDS-free loading buffer (20% glycerol, 0.2% bromophenol blue, 1.4% β-mercaptoethanol, pH 6.8). The samples were fully denatured by incubation at 97°C for 5 min, and 40 μg of protein was then separated by SDS-PAGE. The resulting gels were then visualized using a Typhoon 9400 Variable Mode Imager (GE Healthcare) with a 532 nm laser for excitation and a 30 nm bandpass filter centered at 610 nm for detection.
β-Elimination
β-Elimination was performed as previously described (Darabedian et al., 2018).
PNGase F Treatment
PNGase F was obtained from New England Biolabs and treatment was performed according to the manufacturer's protocol with some changes as previously described (Darabedian et al., 2018).
Flow Cytometry of Cell-Surface Labeling With DBCO-Biotin
NIH3T3 cells were grown in 10 cm plates at 80–85% confluency and treated with 200 μM MCRs or Ac4GlcNAc in triplicate for 16 h. The media was then removed and cells were washed with DPBS before being detached from the plate by incubation in 10 mL of 10 mM EDTA in DPBS at 37°C for 10 min. Cells were collected by centrifugation (5 min, 800 × g, 4°C) and were washed three times with DPBS (5 min, 800 × g, 4°C). Cells were then resuspended in 200 μL PBS containing DBCO-biotin (Click Chemistry Tools, 60 μM, 10 mM stock in DMSO) for 1 h at RT, after which time they were washed three times with DPBS (5 min, 800 g at 4°C) before being resuspended in ice-cold PBS containing fluorescein isothiocynate (FITC) conjugated avidin (Sigma, 5 μg μL−1, 1 mg/mL stock) for 30 min at 4 °C. Cells were then washed three times in DPBS (5 min, 800 × g, 4°C) and then resuspended in 400 μL PBS containing propidium iodide (2.5 μg mL−1 in DPBS, 1 mg/mL stock in DPBS] for 30 min. A total of 50,000 cells were analyzed on a BD Accuri C6 Flow Cyometer, and dead cells (propidium iodide positive) were excluded.
GlcNAc-2-Epimerase Cloning and Expression
Condon optimized GlcNAc-2-Epimerase (11–427) was purchased from Integrated Device Technology and cloned into pET-28b vector between NdeI and HindIII using standard molecular cloning techniques. Plasmid is available upon request. Pet28b-6XHis-GlcNAc-2-Epimerase (11–427) was subsequently transformed into BL21 E. coli (Novagen). Terrific broth (1.8 L) containing kanamycin (50 μg mL−1) was inoculated with 24 mL of starter culture grown overnight at 37°C. The culture was grown at 37°C until an OD (A600) of 0.80 was obtained, at which time expression was induced with addition of 1 mM isopropyl β-D-1-thiogalacto-pyranoside (IPTG) for 5 h at 37°C. Cells were harvested by centrifugation (10 min, 6,000 × g, 4°C). The pooled pellets were suspended in 20 mL Buffer A (250 mM NaH2PO4, 300 mM NaCl, 20 mM imidazole, pH 7.4) and tip sonicated for 12 min (30 s on, 15 s off). The resulting lysate was then centrifuged (15 min, 30,000 × g, 4°C), and the supernatant was transferred into a new tube and centrifuged again (30 min, 30,000 × g, 4°C). The supernatant was transferred into a new tube and 2 mL of pre-washed Cobalt Metal Affinity Resin (Genesee Scientific) was added and placed onto a rotator for 1 h at 4°C. The solution was then transferred to a gravity-flow column, allowed to drain, and the beads were washed with an additional 100 mL of Buffer A. 6XHis-GlcNAc-2-Epimerase (11–427) was eluted with 5 mL fractions of Buffer B (25 mM NaH2PO4, 300 mM NaCl, 250 mM imidazole, pH 7.4). Fractions containing 6XHis-GlcNAc-2-Epimerase (11–427) were concentrated to 2 mL using a 10 kDa cutoff Amicon Ultra-15 Centrifugal Filter. The concentrate was dialyzed overnight into 2 L of dialysis buffer (20 mM NaH2PO4, 1 mM EDTA, 5% glycerol, 0.5% 2-Mercaptoethanol, pH 7.0). The dialyzed solution was then concentrated to 0.5 mL using a 10 kDa cutoff Amicon Ultra-15 Centrifugal Filter and stored at −80°C in single use aliquots.
HPLC Analysis for GlcNAc-2-Epimerase Conversion
To a 500 μL microcentrifuge tube was added 10 μL of phosphate buffer (1 M, pH 7.5), 10 μL of ATP (50 mM, dissolved in water), 10 μL of MgCl2 (100 mM, dissolved in water), 20 μL of ManNAc or 2AzMan (500 mM, dissolved in water), and 5 μL of GlcNAc-2-Epimerase (or water). Water, 45 μL for samples containing GlcNAc-2-Epimerase or 50 μL for null samples, was then added to the reaction mixtures. The samples were then incubated at 37°C for 12 h and then lyophilized. The resulting solids were suspended in 75 μL of pyridine and then 25 μL of acetic anhydride was added and allow to rotate for 16 h. Then 5 μL of each solution was diluted to 100 μL using water and 20 μL was injected onto an Agilent Eclipse XDB-C18 (5 μm, 4.6 × 150 mm) running at 1 ml/min and PDA set to a wavelength 200 nm. Buffer A was H2O containing 0.1% TFA, buffer B was 90% ACN, 10% H2O containing 0.1% TFA. HPLC conditions for 2AzMan were 10 min at 25% B and then a ramp to 70% over 20 min. HPLC conditions for ManNAc were 10 min at 10% B and then a ramp to 50% over 20 min.
Biotin Enrichment for Proteomics
H1299 cells were treated as indicated and cells were collected by trypsinization and pelleted by centrifugation for (2 min, 2,000 × g), followed by washing 2 × with PBS. The resulting cell-pellets were resuspended in 4% SDS buffer (4% SDS, 10 mM TEA pH 7.4, 150 mM NaCl) containing c0mplete, Mini, EDTA-free Protease Inhibitor Cocktail Tablets (Roche), tip sonicated for 15 s, and cleared by centrifugation (10 min, 10,000 × g, 15°C). Soluble protein concentration was normalized by BCA assay (Pierce, ThermoScientific) to 1 mg mL−1, and 1 mg of total protein was subjected to the appropriate amount of click chemistry cocktail containing alkyne-biotin (Click Chemistry Tools) for 1 h, then 10 μL of 0.5 M EDTA was added. Then proteins were precipitated by adding a 7.5 mL of methanol, 1.9 mL of CHCl3, and 5 mL of H2O followed by vortexing and centrifugation (5 min, 5,000 × g). The aqueous phase was discarded without disturbing the interface layer after which 3.7 mL of methanol was added, vortexed, and centrifuged (10 min, 10,000 × g, 4°C). The supernatant was removed and the pellet was allowed to air-dry for 5 min and then a 100 μL of 4% SDS was added. The mixture was sonicated in a bath sonicator to ensure complete dissolution, and 1.9 mL of 10 mM TEA, pH 7.4, 150 mM NaCl was added followed by 125 μL of high-capacity NeutrAvidin beads (ThermoScientific, pre-washed three times with 0.2% SDS, 150 mM NaCl, 50 mM TEA pH 7.4) and incubated on a rotator for 1.5 h. Afterwards, the beads were washed with 6 × 1% SDS in PBS, 3 × 4M urea in PBS, and 8 × 50 mM NH4HCO3. The beads were then resuspended in 1 mL of 50 mM NH4HCO3, 10 mM TCEP (pH 8) and incubated for 30 min with gentle shaking. Afterwards, the resin was washed with 50 mM NH4HCO3 and the beads were resuspended in 1 mL of 10 mM iodoacetamide (pH 8) in 50 mM NH4HCO3 and incubated in the dark for 30 min. The beads were then washed 3 × 50 mM NH4HCO3, and resuspended in 100 μL 50 mM NH4HCO3. Then 2 μL of CaCl2 (200 mM in H2O) and 2 μL of trypsin (Sequencing Grade, Promega, 0.1 μg/μl) were added and incubated for 18 h at 37°C. The beads were centrifuged, the supernatants were transferred into clean tubes, and the beads were washed with an additional 100 μL 1% formic acid, 100 μL 15 % acetonitrile in H2O and 100 μL 1% FA in H2O. The combined elution and wash were desalted on C18 Spin Columns (Pierce, ThermoScientific) according to the manufacturer's protocol and lyophilized to dryness.
Proteomics
A nanoElute was attached in line to a timsTOF Pro equipped with a CaptiveSpray Source (Bruker). Chromatography was conducted at 40°C through a 25 cm reversed-phase Aurora Series C18 column (IonOpticks) at a constant flow-rate of 0.4 μL/min. Mobile phase A was 98/2/0.1% Water/acetonitrile/formic acid (v/v/v) and phase B was acetonitrile with 0.1% Formic Acid (v/v). During a 120 min method, peptides were separated by a 4-step linear gradient (0% to 15% B over 60 min, 15% to 23% B over 30 min, 23% to 35% B over 10 min, 35% to 80% over 10 min) followed by a 10 min isocratic flush at 80% for 10 min before washing and a return to low organic conditions. Experiments were run as data-dependent acquisitions with ion mobility activated in parallel accumulation serial fragmentation (PASEF) mode. MS and MS/MS spectra were collected with m/z 400–1,500 and ions with z = +1 were excluded. Raw data files were processed with Peaks Studio. Fixed modifications included +57.02146 C. Variable modifications included Acetyl +42.010565 N-term, pyro-Glu −17.026549 N-term Q, pyro-Glu −18.010565 N-term E. Precursor tolerance 30.0 ppm. False discovery rate was set to 0.01 with significance calculated using ANOVA.
Chemical Enrichment of Glycoproteins and Sample Preparation for IsoTag
H1299 cells were treated as indicated and cells were collected by trypsinization and pelleted by centrifugation for (2 min, 2,000 × g), followed by washing 2 × with PBS. The resulting cell-pellets were lysed on ice by probe tip sonication in 1 × PBS + 1% SDS (1 mL), containing EDTA-free Pierce HaltTM protease inhibitor cocktail. Debris were removed from the cellular lysate by centrifugation (20,000 × g) for 20 min at 4°C and the supernatant transferred to a new Eppendorf tube. A BCA protein assay (Pierce) was performed and protein concentration was adjusted to 3.5 μg/μL with lysis buffer. Protein lysate (1.4 mg, 400 μL) was treated with a pre-mixed solution of the click chemistry reagents [100 μL; final concentration of 200 μM IsoTaG silane probe (3:1 heavy:light mixture), 500 μM CuSO4, 100 μM THPTA, 2.5 mM sodium ascorbate] and the reaction was incubated for 3.5 h at 24°C. The click reaction was quenched by a methanol-chloroform protein precipitation [aqueous phase/methanol/chloroform = 4:4:1 (v/v/v)]. The protein pellet was allowed to air dry for 5 min at 24°C. The dried pellet was resuspended in 1 × PBS + 1% SDS (400 μL) by probe tip sonication and then diluted in PBS (1.6 mL) to a final concentration of 0.2% SDS. Streptavidin-agarose resin [400 μL, washed with PBS (3 × 1 mL)] were added to the protein solution and the resulting mixture was incubated for 12 h at 24°C with rotation. The beads were washed using spin columns with 8 M urea (5 × 1 mL), and PBS (5 × 1 mL). The washed beads were resuspended in 500 μL PBS containing 10 mM DTT and incubated at 37°C for 30 min, followed by addition of 20 mM iodoacetamide for 30 min at 37°C in the dark. The reduced and alkylated beads were collected by centrifugation (1,500 × g) and resuspended in 520 μL PBS. Urea (8 M, 32 μL) and trypsin (1.5 μg) was added to the resuspended beads and digestion was performed for 16 h at 37°C with rotation. The beads were washed three times with PBS (200 μL) and distilled water (200 μL). The IsoTaG silane probe was cleaved with 2% formic acid/water (2 × 200 μL) for 30 min at 24°C with rotation and the eluent was collected. The beads were washed with 50% acetonitrile-water + 1% formic acid (2 × 500 μL), and the washes were combined with the eluent to form the cleavage fraction. The cleavage fractions were dried in a vacuum centrifuge and stored at −20°C until analysis.
Mass Spectrometry Parameters Used for Glycoproteomics
A Waters nanoAcquity system was coupled to a ThermoScientific Orbitrap Fusion Tribrid with a nano-electrospray ion source. Half of the sample was reconstituted in 10 μL of 5% acetonitrile and 0.1% formic acid in water, loaded onto a C18 trap column (WATERS Cat # 186008821 nanoEase MZ Symmetry C18 Trap Column, 100 Å, 5 μm × 180 μm × 20 mm), and separated on an analytical column (WATERS Cat # 186008795 nanoEase MZ Peptide BEH C18 Column, 130 Å, 1.7 μm × 75 μm × 250 mm). Mobile phases A and B were water with 0.1% formic acid (v/v) and acetonitrile with 0.1% formic acid (v/v), respectively. Peptides were separated with a linear gradient from 5 to 30% B within 95 min, followed by an increase to 50% B within 15 min and further to 98% B within 10 min, and re-equilibration. The instrument parameters were set as previously described (Ramirez et al., 2020) with minor modifications. Briefly, MS1 spectra were recorded from m/z 400–2,000 Da. If oxonium product ions (HexAz0Si +288.1190 Da; HexAz2Si +290.1316 Da) were observed in the HCD spectra, ETD with supplemental activation (35%) was performed in a subsequent scan on the same precursor ion selected for HCD. The raw data was processed using Proteome Discoverer 2.4 (Thermo Fisher Scientific). Both HCD and EThcD spectra were searched against a database containing the Swissprot 2018 annotated human proteome (20,355 proteins, downloaded on Feb. 21, 2019) and contaminant proteins using Sequest HT and Byonic algorithms. The searches were performed with the following guidelines: trypsin as enzyme, 2 missed cleavages allowed; 10 ppm mass error tolerance on precursor ions; 0.02 Da mass error tolerance on fragment ions; variable modifications (methionine oxidation, +15.995 Da; carbamidomethyl cysteine, +57.021 Da; and others as described below). Intact glycopeptide searches allowed for the tagged hexose, or for the mono-acetylated, di-acetylated or tri-acetylated form (HexAz0Si, +287.112 Da; HexAz2Si, +289.124 Da; mono-acetylated HexAz0Si, +329.122 Da; mono-acetylated HexAz2Si, + 331.135 Da; di-acetylated HexAz0Si, +371.133 Da; di-acetylated HexAz2Si, +373.145 Da; tri-acetylated HexAz0Si, +413.143 Da; and tri-acetylated HexAz2Si, +415.156 Da) on asparagine, cysteine, serine, and threonine. Glycopeptide spectral assignments passing a false discovery rate of 1% at the spectrum level based on a target decoy database were manually validated for an isotope precursor pattern.
UDP-GalNAz Chemoenzymatic Labeling and Cu(I)-Catalyzed [3 + 2] Azide–Alkyne Cycloaddition (CuAAC) for Western Blotting
H1299 cells were grown to 80% confluency, collected by trypsinization, and washed two times with PBS with collection by centrifugation (2 min, 2,000 × g, 4°C). The cell pellets were then resuspended in 2.5 mL 4% SDS (4% SDS, 50 mM TEA, 150 mM NaCl, pH 7.4) containing 12.5 mg of c0mplete Mini Protease Inhibitor Cocktail (Roche). The suspended cells were subjected to tip sonication (3 × 10 s on, 10 s off) followed by centrifugation (10 min, 10,000, 15°C). The soluble protein was collected and the concentration was normalized by BCA assay (Pierce) to 1 mg mL−1 using 1% SDS (1% SDS, 50 mM TEA, 150 mM NaCl, pH 7.4). Proteins were subjected to methanol/chloroform/H2O precipitation. The resulting protein pellet was allowed to air-dry for 5–10 min before being resuspended in 10% of the original volume using 1% SDS chemoenzymatic buffer (1% SDS, 20 mM HEPES, pH 7.9). Protein concentration was normalized using the BCA Assay and diluted to 2.5 mg mL−1 in 1% SDS chemoenzymatic buffer. To 2,400 μL of resuspended protein (6 mg) was added 2,940 μL of H2O, 4,800 μL of labeling buffer (2.5 ×; 5% IGEPAL CA-630, 125 mM NaCl, 50 mM HEPES, pH 7.9), 660 μL of MnCl2 (100 mM in H2O), and 900 μL of UDP-GalNAz (0.5 mM in 10 mM HEPES, pH 7.9) and vortexed. Finally, 300 μL of purified GalT Y289L or H2O was added, and the reaction mixture was incubated for 20 h at 4°C. The unreacted UDP-GalNAz was then removed by methanol/chloroform/H2O precipitation. Air-dried protein pellets were resuspended in 1,500 μL of 4% SDS. To generate inputs, 40 μL was removed and combined with 40 μL of 2 × loading buffer (20% glycerol, 0.2% bromophenol blue, pH 6.8, and 14 μL/mL β-mercaptoethanol). The remaining labeled lysate was diluted to 6 mL using SDS-free buffer (150 mM NaCl, 50 mM TEA pH 7.4) and newly made click chemistry cocktail (420 μL) was added to each sample [alkyne-azo-biotin (Click Chemistry Tools, 100 μM, 5 mM stock solution in DMSO); tris(2-carboxyethyl)phosphine hydrochloride (TCEP) (1 mM, 50 mM freshly prepared stock solution in water); tris[(1-benzyl-1-H-1,2,3- triazol-4-yl)methyl]amine (TBTA) (Click Chemistry Tools, 100 μM, 10 mM stock solution in DMSO); CuSO4·5H2O (1 mM, 50 mM freshly prepared stock solution in water). After 1 h, 60 μL of 0.5 M EDTA was added, and then the proteins were subjected to methanol/chloroform/H2O precipitation. The resulting proteins were then suspended in 600 μL of 4% SDS buffer.
Metabolic Chemical Labeling and Cu(I)-Catalyzed [3 + 2] Azide–Alkyne Cycloaddition (CuAAC) for Western Blotting
H1299 cells were grown to 80% confluency, collected by trypsinization, and washed two times with PBS with collection by centrifugation (2 min, 2,000 × g, 4°C). The cell pellets were then resuspended in 1 mL 4% SDS containing 5 mg of c0mplete Mini Protease Inhibitor Cocktail (Roche). The suspended cells were subjected to tip sonication (3 × 10 s on, 10 s off) and followed by centrifugation (10 min, 10,000, 15°C). The soluble protein was collected and the concentration was normalized by BCA assay (Pierce) to 4 μg μL−1 in 4% SDS buffer. To generate inputs, 40 μL was removed and combined with 40 μL of 2 × loading buffer. To 1,500 μL of lysate (6 mg of protein) was added 780 μL of 1.25 × SDS-buffer (1.25% SDS, 50 mM TEA, 150 mM NaCl, pH 7.4), 3.3 ml of SDS-free buffer, and 420 μL of freshly made click chemistry cocktail was added [alkyne-azo-biotin (Click Chemistry Tools, 100 μM, 5 mM stock solution in DMSO); tris(2-carboxyethyl)phosphine hydrochloride (TCEP) (1 mM, 50 mM freshly prepared stock solution in water); tris[(1-benzyl-1-H-1,2,3- triazol-4-yl)methyl]amine (TBTA) (Click Chemistry Tools, 100 μM, 10 mM stock solution in DMSO); CuSO4·5H2O (1 mM, 50 mM freshly prepared stock solution in water). After 1 h, the proteins were subjected to methanol/chloroform/H2O precipitation. The resulting protein pellets were then suspended in 600 uL of 4% SDS buffer.
Biotin Enrichment and Western Blotting
To the labeled proteins (600 μL, 6 mg) was added 11.4 mL of SDS-free buffer and 300 μL of high-capacity NeutrAvidin beads (ThermoScientific, pre-washed three times with 0.2% SDS, 150 mM NaCl, 50 mM TEA pH 7.4), then the mixture was then incubated for 1.5 h. The resulting mixture was transferred into a gravity flow chromatography column and drained. The beads were washed 10 × with 3 mL of 1% SDS in PBS and then transferred into a dolphin nose tube. Each sample was then incubated with 300 μL of 25 mM sodium hydrosulfite for 30 min, beads were then collected by centrifugation (2 min, 2,500 × g) and the supernatant was collected. The procedure was repeated two more times. The supernatant was pooled, 4 × volume of ice-cold methanol was added, and was placed at −20°C for 2 h. Precipitated proteins were then collected by centrifugation (10 min, 10,000 × g, 4°C). The supernatant was removed and the pellet was allowed to air-dry for 10 min and then 37.5 μL of 4% SDS buffer was added to each sample. The mixture was sonicated in a bath sonicator to ensure complete dissolution and then 37.5 μL of 2X SDS-free loading buffer was added. The samples were boiled for 5 min at 97°C and 20 μL of input or 25 μL of enriched sample was loaded per lane for SDS-PAGE separation.
Results
In order to further explore the potential of structural diverse monosaccharide analogs as potential MCRs, we purchased Ac42AzMan and synthesized Ac44AzGal in a two-step, one-pot synthesis from commercially available 1,2,3,6-_O_-acetyl-glucose (Supporting Information). We then incubated these compounds (200 μM or 2 mM) at 37°C for 2 h with HeLa cell lysates, under the same conditions previously reported to result in chemical modification of cysteine residues (Qin et al., 2018), as well as chemically-denatured (1% SDS) cell lysates. In parallel, we treated HeLa cells in culture with the same compounds (200 μM) for 16 h, our standard MCR labeling protocol. The samples were then subjected to copper(I)-catalyzed azide-alkyne cycloaddition (CuAAC) with alkyne-TAMRA and then analyzed by in-gel fluorescence (Figure 2). Incubation of the reporters with cell lysates resulted in a small amount of labeling over background that was essentially identical for the two compounds. However, we observed noticeably higher live-cell labeling for Ac42AzMan compared to Ac44AzGal. Notably, the pattern of both reporter-modified proteins in lysates largely matches those proteins that non-selectively react with alkyne-TAMRA during CuAAC, presumably abundant proteins in the lysate. This pattern is conserved in the cells labeled with Ac44AzGal, indicating that these proteins are indeed the result of background chemical modification. In contrast, cells treated with Ac42AzMan resulted in the visualization of several unique protein bands. These results confirm that high concentrations of per-_O_-acetylated MCRs can result in at least low levels of background protein labeling, but also suggest that cellular metabolism is critical for the robust labeling observed with Ac42AzMan treatment.
We next set out to characterize the labeling of living cells by Ac42AzMan. First, we explored the possibility that the reporter is incorporated into cell-surface glycosylation. Accordingly, H1299 cells were treated with Ac42AzMan (200 μM) or vehicle for 16 h. We chose this concentration, as in our experience acetylated MCRs are often toxic to cells at higher concentrations. The corresponding total cell lysates were then split and treated with water or PNGase-F for 5 h at 37°C to enzymatically remove N-linked glycosylation. The samples were then subjected to CuAAC with alkyne-TAMRA and analyzed by in-gel fluorescence and lectin blotting (Figure 3A). The fluorescent gel scanning showed no loss of signaling upon PNGase-F treatment but dramatic removal of N-linked glycans as visualized by Concanavalin A (ConA), demonstrating that the MCR is not incorporated into this type of glycosylation to any significant extent. Next, we used flow cytometry to more broadly examine the potential incorporation of Ac42AzMan into cell-surface glycoconjugates. More specifically, NIH3T3 cells were treated with Ac42AzMan (200 μM) or vehicle for 16 h. Simultaneously, the same cell-line was treated separately with either Ac4ManNAz (200 μM) or Ac36AzGlcNAc (200 μM). Ac4ManNAz treatment serves as a positive control for cell surface labeling (Saxon and Bertozzi, 2000), while we have previously shown that Ac36AzGlcNAc treatment results in the exclusive modification of intracellular proteins (Chuh et al., 2014). After 16 h, the cells were released, reacted with DBCO-Biotin, and incubated with FITC-Avidin before analysis by flow-cytometry (Figure 3B). As expected, we observed high levels of labeling after Ac4ManNAz treatment but essentially no signal over background from Ac36AzGlcNAc treated cells. Consistent with our PNGase-F experiment, we also found no cell-surface labeling with Ac42AzMan. Next, we took advantage of β-elimination chemistry to test whether the observed signal was due to base labile modifications on residues such as serine, threonine, or cysteine. H1299 cells were first treated with Ac42AzMan (200 μM) or vehicle for 16 h before the corresponding cell lysates were subjected to CuAAC with alkyne-biotin. The samples were then ran in duplicate on SDS-PAGE and transferred to a nitrocellulose membrane. The membranes were then incubated at 40°C for 24 h in either water or 55 mM NaOH and then visualized using streptavidin blotting (Figure 3C). We found that the β-elimination conditions removed essentially all of the Ac42AzMan labeling. As a control for the chemistry, we also visualized the loss of intracellular O-GlcNAc modifications by Western blotting under the same conditions (Figure 3C).
The fact that Ac42AzMan results in intracellular protein modification raised the possibility that it was entering the O-GlcNAc modification pathway. More specifically, we hypothesized that 2AzMan might be enzymatically converted to 2-azido-2-deoxy-glucose (2AzGlc) by the enzyme _N_-acetylglucosamine 2-epimerase, as we and the Vocadlo lab demonstrated that 2AzGlc is an MCR for O-GlcNAcylation (Shen et al., 2017; Zaro et al., 2017). To directly test this possibility, we first incubated an anomeric mixture of α- and β-ManNAc with either buffer or recombinantly expressed _N_-acylglucosamine 2-epimerase (Uniprot P51606) for 12 h before analysis by HPLC (Figure 4A). As expected, _N_-acetylmannosamine (ManNAc) was enzymatically converted to two new peaks consisting of the α- and β-anomers of GlcNAc. In contrast, we observed no conversion of 2AzMan to 2AzGlc, rejecting our conversion hypothesis (Figure 4A). Simultaneously, we also co-treated H1299 cells with Ac42AzMan (200 μM) and either the OGT inhibitor Ac45SGlcNAc (150 μM) or DMSO for 16 h (Gloster et al., 2011). The lysates were then subjected to CuAAC with alkyne-TAMRA and labeling visualized by in-gel fluorescence (Figure 4B). In support of our in vitro experiment, we observed very little loss of Ac42AzMan labeling upon OGT inhibition. In contrast, a similar OGT inhibition experiment with Ac42AzGlc previously resulted in loss of over half of the labeling (Shen et al., 2017). Together, these experiments argue against the conversion of 2AzMan to 2AzGlc and subsequent incorporation into O-GlcNAc modifications.
Next, we set out to identify the 2AzMan-modified proteins. Accordingly, we treated H1299 cells in triplicate with either Ac42AzMan (200 μM) or DMSO vehicle for 16 h, followed by CuAAC with alkyne-biotin and enrichment of any modified proteins with neutravidin beads. We then performed on-bead trypsinolysis and identification of the resulting peptides by Label Free Quantitative (LFQ) proteomics (Cox et al., 2014) (n = 3 biological replicates with a false discovery rate of 0.01). This allowed us to identify over 1,000 2AzMan-labeled proteins based on several criteria (Figure 5 and Table S1): the protein must have been identified in at least 2 out of the 3 biological replicates, the enrichment ratio (LFQ-based) must have been at least 5 linear-fold greater in the treated samples vs. vehicle, and the statistical significance (p-value) of this difference must have been <0.01 (Student's _t_-test) This cutoff was chosen arbitrarily and is fairly stringent; however, a full list of proteins enriched at lower ratios can be found in Table S1. Consistent with our other biochemical analysis, the enriched proteins represented a wide-range of intracellular proteins. We next employed the IsoTaG platform (Woo et al., 2015, 2017) to determine if 2AzMan or 4AzGal could be directly identified on proteins and the specific sites of those modifications. We again treated H1299 cells with either Ac42AzMan (200 μM), Ac44AzGal (200 μM), or DMSO vehicle in duplicate for 16 h. We then subjected the corresponding lysates to CuAAC with a mixture of isotopically-labeled, cleavable biotin tags. After enrichment of the labeled proteins on streptavidin beads and on-bead trypsinolysis, we eluted the directly modified, and therefore isotopically encoded, peptides using weak acid. Subsequent LC-MS analysis using the IsoStamp v2.0 software was then used to look for un-, mono-, di-, or tri-acetylated azido-hexose modification of any Asn, Ser, Thr, or Cys residue. Using IsoTaG, we were able to localize 2AzMan on 33 peptides, with all of the modifications on Cys (Table 1 and Table S2), while we found no peptides modified by 4AzGal. Overall our results are consistent with the background MCR labeling of proteins on cysteine residues seen by Chen and Wang (Qin et al., 2018; Hao et al., 2019) but also indicate that cellular metabolism of Ac42AzMan plays an important role that distinguishes it from Ac44AzGal.
| Entry | Peptide (modification site bolded) | Protein name | Uniprot accession |
|---|---|---|---|
| 1 | [K].AGQCVIGLQMGTNK.[C] | Calponin-2 | Q99439 |
| 2 | [K].AINCATSGVVGLVNCLR.[R] | Fatty acid synthase | P49327 |
| 3 | [K].AMSYKPICPTCQTSYGIQK.[G] | DTX3L | Q8TDB6 |
| 4 | [K].APPPSLTDCIGTVDSR.[A] | Charged multivesicular body protein 5 | Q9NZZ3 |
| 5 | [K].AVQDLCGWR.[I] | Protein RCC2 | Q9P258 |
| 6 | [K].ETTNIFSNCGCVR.[A] | Ataxin-10 | Q9UBB4 |
| 7 | [K].EINYVKNCER.[A] | Nucleophosmin | P06748 |
| 8 | [K].HELQANCYEEVKDRCTLAEK.[L] | Cofilin-1 | P23528 |
| 9 | [K].HFLSDTGMACR.[S] | 5′-nucleotidase domain-containing protein 1 | Q5TFE4 |
| 10 | [K].NCGCLGASPNLEQLQEENLK.[L] | Arginine-tRNA ligase | P54136 |
| 11 | [K].NSNVDSSYLESLYQSCPR.[G] | Zinc finger CCCH-type antiviral protein | Q7Z2W4 |
| 12 | [K].NVECAR.[V] | Charge multivesicular body protein 1a | Q9HD42 |
| 13 | [K].SGIQPLCPER.[S] | Lamina-associated polypeptide 2 | P42166 |
| 14 | [K].SSGCDVNLPGVNVK.[L] | Neuroblast differentiation-associated protein | Q09666 |
| 15 | [K].STACQMLVCYAK.[L] | Importin-5 | O00410 |
| 16 | [K].STLTDSLVCK.[A] | Elongation Factor 2 | P13639 |
| 17 | [K].STTTSDPPNICK.[V] | Protein PRRC2C | Q9Y520 |
| 18 | [K].TVCDAAEK.[G] | Perilipin-3 | O60664 |
| 19 | [K].TVGVQGDCR.[S] | GMP synthase | P49915 |
| 20 | [K].VACAEEWQESR.[T] | TIP41-liked protein | O75663 |
| 21 | [K].VVNEINIEDLCLTK.[A] | CDGSH iron-sulfur domain-containing protein | Q8N5K1 |
| 22 | [M].ACARPLISVYSEK.[G] | 60S ribosomal protein L4 | P36578 |
| 23 | [M].ACGLVASNLNLKPGECLR.[V] | Galectin-1 | P09382 |
| 24 | [R].EVGGGHGCTSPPFPQEAR.[A] | Portein lin-28 homolog B | Q6ZN17 |
| 25 | [R].FCEMCCDCR.[M] | Serine/threonine-protein phosphatase 4 regulatory subunit 1 | Q8TF05 |
| 26 | [R].IFGSIPMQACQQK.[D] | Zinc finger CCCH domain-containing protein 8 | Q6NZY4 |
| 27 | [R].INISEGNBCPER.[I] | Poly(rC)-binding protein 2 | Q15366 |
| 28 | [R].LQGINCGPDFTPSFANLGR.[T] | Eukaryotic translation initiation factor 4 | Q04637 |
| 29 | [R].LVVPATQCGSLIGK.[G] | Poly(rC)-binding protein 1 | Q15365 |
| 30 | [R].LYYFQYPCYQEGLR.[S] | APOBEC-3C | Q9NRW3 |
| 31 | [R].NLFISCK.[S] | Lamina-associated polypeptide 2, isoform alpha | P42166 |
| 32 | [R].SILSPGGSCGPIK.[V] | General transcription factor II | P78347 |
| 33 | [R].VMTIPYQPMPASSPVICAGGQDR.[C] | Poly(rC)-binding protein 1 | Q15365 |
2AzMan-modified peptides.
The chemical modification of intracellular proteins upon treatment of living cells with certain MCRs could detrimentally affect the discovery of legitimately O-GlcNAcylated proteins. To explore this question, we performed a meta-analysis of potential O-GlcNAcylated proteins identified using common MCRs for O-GlcNAc (Ac4GlcNAz, Ac4GalNAz, Ac36AzGlcNAc, etc.) (Worth et al., 2017) and with other methods that detect endogenous O-GlcNAcylation (lectin chromatography, anti-O-GlcNAc antibodies, or chemoenzymatic modification). A complete list of the proteomics studies used in this analysis is available in the Supporting Information. More specifically, we used python scripts (Scripts S1–S3). The first script produces one comprehensive file containing all of the identified proteins and their associated proteomic studies, ordered by the occurrences for each protein from most to least, which should be of general interest to the field and allow for the easy identification of potentially O-GlcNAcylated proteins across datasets. The second and third scripts first return a list a proteins identified by a particular method (e.g., MCR treatment) and then generate the numbers of exclusive and overlapping proteins for each identification technique, allowing us to generate a large Venn diagram (Figure 6A). Consistent with a significant amount of published work using MCRs to discover new O-GlcNAc modified proteins, essentially half of the MCR-identified proteins were also found to be O-GlcNAcylated by at least one of the other techniques. However, the MCRs also yielded the highest number of exclusively identified proteins. We reasoned that these proteins could arise from the chemical modification of proteins, the induction of O-GlcNAcylation by MCR treatment, or reporting on endogenous glycosylation that was missed during the proteomic analysis using other techniques. To estimate how many of the proteins exclusive to MCRs fall into this last category of bonafide O-GlcNAcylated proteins, we selected four such proteins at random: TRADD (Uniprot Q15628), calreticulin (Uniprot P27797), USP10 (Uniprot Q14694), and CYLD (Uniprot Q9NQC7). We then treated H1299 or HeLa cells with Ac4GlcNAz (200 μM) for 16 h, followed by CuAAC with a cleavable biotin-linker (Darabedian and Pratt, 2019). The modified proteins were then enriched on streptavidin beads, extensively washed, and eluted before visualization by Western blotting (Figure 6B). As expected from the proteomic data, we found all four of these proteins to be enriched, as well as the known O-GlcNAcylated proteins Nup62 and CREB. Simultaneously, we subjected H1299 and HeLa cell lysates to chemoenzymatic modification followed by the same CuAAC and enrichment procedure. Analysis by Western blotting showed enrichment over background of TRADD and calreticulin in both cells lines, confirming their O-GlcNAcylation status, while CYLD was not enriched in either cell line (Figure 6C). These data suggest that the commonly used MCRs for O-GlcNAc probably do result in the enrichment and false identification of some proteins that are not endogenously modified. However, they also indicate that this number is not overwhelmingly large, with a crude estimation that of the proteins only identified using an MCR ~50% are real O-GlcNAcylated proteins that can be confirmed using another technique. Combining this estimation with the documented overlap with other techniques in the Venn diagram suggests that around 75% of the proteins found by MCRs are indeed modified.