Collective dynamics differentiates functional divergence in protein evolution - PubMed (original) (raw)

Collective dynamics differentiates functional divergence in protein evolution

Tyler J Glembo et al. PLoS Comput Biol. 2012.

Abstract

Protein evolution is most commonly studied by analyzing related protein sequences and generating ancestral sequences through Bayesian and Maximum Likelihood methods, and/or by resurrecting ancestral proteins in the lab and performing ligand binding studies to determine function. Structural and dynamic evolution have largely been left out of molecular evolution studies. Here we incorporate both structure and dynamics to elucidate the molecular principles behind the divergence in the evolutionary path of the steroid receptor proteins. We determine the likely structure of three evolutionarily diverged ancestral steroid receptor proteins using the Zipping and Assembly Method with FRODA (ZAMF). Our predictions are within ~2.7 Å all-atom RMSD of the respective crystal structures of the ancestral steroid receptors. Beyond static structure prediction, a particular feature of ZAMF is that it generates protein dynamics information. We investigate the differences in conformational dynamics of diverged proteins by obtaining the most collective motion through essential dynamics. Strikingly, our analysis shows that evolutionarily diverged proteins of the same family do not share the same dynamic subspace, while those sharing the same function are simultaneously clustered together and distant from those, that have functionally diverged. Dynamic analysis also enables those mutations that most affect dynamics to be identified. It correctly predicts all mutations (functional and permissive) necessary to evolve new function and ~60% of permissive mutations necessary to recover ancestral function.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1

Figure 1. 3D structures of AncCR, AncGR1 and AncGR2.

AncCR was within 2.5 Å all-atom RMSD from the experimentally determined AncCR. AncGR1 was within 2.9 Å all-atom RMSD from the experimentally determined crystal structure. AncGR2 was within 2.8 Å all-atom RMSD from the experimentally determined AncGR2. Included for reference is a cartoon figure with helices labeled for reference and the ligand is bound, represented in blue spheres.

Figure 2

Figure 2. Plot and ribbon diagram of the dynamics of the three ancestral proteins characterized by slowest collective mode.

(A) The first two principal components of AncCR, AncGR1 and AncGR2 plotted against each other. The principal components were found via a Singular Value Decomposition of the G matrix (See Methods). Higher order modes are mostly orthogonal or mixed and therefore not represented here. (B) 3D structures of AncCR, AncGR1 and AncGR2 colored by residue fluctuation. The critical mutations in AncCR and AncGR1 have greater flexibility and thus, higher binding promiscuity. AncGR2 has much lower flexibility in general amongst these residues and therefore more selective binding. The S212Δ mutation also rigidifies the lower loop at the bottom end of h10 by shortening the loop and removing degrees of freedom. This also alters the packing of h10 (the frontmost helix) and decreases flexibility.

Figure 3

Figure 3. The change in fluctuation along the most collective mode between AncCR, AncGR1 and AncGR2.

The X, Y, Z, and Y27R mutation groups necessary to alter function toward cortisol binding specificity are noted in red, and those permissive W mutations necessary to reverse function and recover promiscuous binding are noted in purple. A cutoff of ±0.002 Å2 is applied to differentiate mutations critical to altering dynamics as also used in Fig. 4. The upper left region of the graph indicates mutations that most alter dynamics when comparing the function-altering mutation from AncGR1 (binding promiscuity) to AncGR2 (binding specificity to cortisol) whereas the lower right region of the plot indicates mutations that most alter dynamics when comparing AncCR and AncGR1, which do not diverge functionally.

Figure 4

Figure 4. The change in net fluctuations and correlations of the mutated residues for successive evolution of MR to GR proteins.

(A) The change in net fluctuation between successive ancestral proteins, AncCR, AncGR1 and AncGR2 for mutated residues. Those residues identified as critical to alter-function are noted in red. The activation-function (AF) helix contains mutations 224 and 229. A cutoff (solid line) results in all critical mutations identified except for Y91C and L197M. Y27R is noted as critical to function but sites 65, 117, and 158 are false positives. (B) The cross correlation map with AncGR2 on the upper left and AncGR1 on the lower right. Circled in black are changes in the cross correlation associated with critical residues near the binding pocket. Squared in black are the changes in cross correlation due to critical mutation N26T forming a hydrogen bond with the AF-helix. Circles in white are additional changes in cross correlation not associated with critical mutations. (C) The cross correlations between the X and W mutations. The correlation between X and W mutations is higher for AncGR2, whereas AncGR1 X functional mutations are uncorrelated, increasing the flexibility in the binding pocket and allowing for promiscuous binding.

Figure 5

Figure 5. The secondary structure is predicted through multiple sequence alignment with modern day homologs.

These secondary structural elements are then connected with loops in extended conformation to generate hundreds of conformations with high flexibility. Only a few are shown here. These structures all undergo a FRODA simulation which collapses them by adding attractive perturbations between all hydrophobic contact pairs (represented by arrows) into tightly packed structures with hydrophobic cores. A subset of hydrophobic residues are shown as spheres. After scoring, the collapsed structures they are ran in a restrained r-REMD simulation for 5 ns and then an unrestrained REMD simulation for 5 ns or until converged. The 3 ancestral structures are prediction to within 2.7 Å all atom RMSD of a similar experimentally determined structure. The final ensemble of restraint free generated structures are analyzed for dynamics using PCA.

Similar articles

Cited by

References

    1. James LC, Tawfik DS. Conformational diversity and protein evolution - a 60-year-old hypothesis revisited. Trends Biochem Sci. 2003;28:361–368. - PubMed
    1. Ma B, Shatsky M, Woflson HJ, Nussinov R. Multiple diverse ligands binding at a single protein site: A matter of preexisting populations. Protein Sci. 2009;11:184–197. - PMC - PubMed
    1. O'Brien PJ, Herschlag D. Catalytic promiscuity and the evolution of new enzymatic activities. Chem Biol. 1999;6:R97–R105. - PubMed
    1. Jimenez R, Salazar G, Yin J, Joo T, Romesberg FE, et al. Protein dynamics and the immunological evolution of molecular recognition. Proc Nat Acad Sci U S A. 2003;101:3803–3808. - PMC - PubMed
    1. Zimmerman J, Oakman EL, Thorpe IF, Shi X, Abbyad P, et al. Antibody Evolution Constains Conformational Heterogeneity by Tailoring Protein Dynamics. Proc Nat Acad Sci U S A. 2006;103:13722–13727. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources