Signal propagation in proteins and relation to equilibrium fluctuations - PubMed (original) (raw)
Signal propagation in proteins and relation to equilibrium fluctuations
Chakra Chennubhotla et al. PLoS Comput Biol. 2007 Sep.
Erratum in
- PLoS Comput Biol. 2007 Oct 12;3(10):2043
Abstract
Elastic network (EN) models have been widely used in recent years for describing protein dynamics, based on the premise that the motions naturally accessible to native structures are relevant to biological function. We posit that equilibrium motions also determine communication mechanisms inherent to the network architecture. To this end, we explore the stochastics of a discrete-time, discrete-state Markov process of information transfer across the network of residues. We measure the communication abilities of residue pairs in terms of hit and commute times, i.e., the number of steps it takes on an average to send and receive signals. Functionally active residues are found to possess enhanced communication propensities, evidenced by their short hit times. Furthermore, secondary structural elements emerge as efficient mediators of communication. The present findings provide us with insights on the topological basis of communication in proteins and design principles for efficient signal transduction. While hit/commute times are information-theoretic concepts, a central contribution of this work is to rigorously show that they have physical origins directly relevant to the equilibrium fluctuations of residues predicted by EN models.
Conflict of interest statement
Competing interests. The authors have declared that no competing interests exist.
Figures
Figure 1. Distribution of Hitting Times and Commute Times for Phospholipase A2 (1bk9 [20])
(A) Hitting time H(j,i) distribution shows more variation between rows than between columns, indicating that residues differ in their ability to receive signals, while their broadcasting properties are more uniform. (B) Shows commute time C(i,j) distribution. (C) Displays the average hitting times evaluated from (A). All three catalytic residues (blue dots) exhibit short hitting times.
Figure 2. Average Hit Times < _Hr_(_j_) > versus Their Standard Deviations σ(Hr(j))
Catalytic residues (red crosses) are fast and precise, being located at the lower left end of the plot. Ligand-binding residues are indicated by black +.
Figure 3. Results from Hitting Time Analysis for Four Enzymes
(A) HIV-1 protease (1a30, [21]), (B) Ricin (1br6, [22]), (C) Human rhinovirus 3C protease (1cqq, [23]), and (D) Endo-1,4-xylanase (1bvv, [24]). The plots reveal the tendency of catalytic residues (D25 and D30 in (A), Y80, V81, G121, Y123, E177, and R180 in (B), H40, E71, G145, and C147 in (C), and Y69, E78, and E172 in (D); red dots) to exhibit fast and precise communication, in accord with the results for phospholipase A2 (Figure 2). Ligand-binding residues are shown by blue dots. The catalytic residues with the highest communication propensity are labeled.
Figure 4. Physical Meaning of Hitting Times
Decomposing the hitting time H(j,i) matrix from Figure 1A into (A) one-body, (B) two-body, and (C) three-body terms, such that summation of these three matrices will reproduce the matrix in Figure 1A. The one-body term involves the fluctuations of only the destination node, apparent by the horizontal stripes seen (A). From the scale of this plot, it is easy to infer that the one-body term dominates the overall computation of the hitting time. However, the source node can modulate the hitting time to the destination node depending on the cross-correlations between the fluctuations of the two nodes (B). (C) reveals the contribution from the three-body terms to be negligibly small.
Figure 5. Correlation of Effective Communication Distances with Physical Distances
(A) Comparison of efficient communication distances (ordinate) and physical distances (abscissa) for all residue pairs in phospholipase A2. The points colored red refer to pairs involving the catalytic residue His48. (B) and (C) illustrate the differences in communication times, for residue pairs separated by similar distances, and the opposite situation of comparable communication times despite significant differences in inter-residue distances, (D) and (E). See text for more details.
Figure 6. Importance of Secondary Structure in Defining Effective Means of Communication
Probability distribution of hitting times H(j,i) for the cases where residue j is located on (A) α-helices, (B) _β_-strands, and (C) loops or disordered regions. A total set of 49,929, 64,732, and 79,444 pairs contribute to the three respective curves, derived from the examined five enzymes. The abscissa represents the hitting time divided by the number of residues, which permits a normalization of the data collected for different proteins. The histograms are based on bins of size 0.2 in the interval [0, 10].
Figure 7. Effect of Secondary Structure on the Hitting Time Distributions for Individual Proteins, Each Shown in a Separate Panel
The ribbon diagrams are colored by the secondary structure, namely helices (red), strands (blue), and coils/disordered regions (white). For each enzyme, the probability distribution of hitting times H(j,i), where j is located on (A) α-helices, (B) _β_-strands, and (C) loops or disordered regions is shown in blue, red, and green respectively. The distributions from α-helices and loops/disordered regions have roughly the same shape as the ones shown in Figure 6. However, the distributions for the _β_-strands exhibit significant variations, pointing to a dependency on their spatial location in the 3-D structure of the proteins.
Figure 8. Comparison of Effective (Commute) Distances and Physical Distances between Residue Pairs in E. coli Adenylate Kinase (PDB: 4ake)
Effective (ordinate) and physical (abscissa) distances between residues in the CORE, LID, and AMPbd domains (see inset), grouped as intradomain and interdomain distances and shown in different colors for each group. Note that communication between residues in the same domain is more efficient than that between residues in two different domains. This is evidenced by the longer commute distance corresponding to interdomain pairing for a given physical distance, compared with that of intradomain pairs. The inset gives a schematic overview of the distance distributions for intradomain and interdomain pairings.
Similar articles
- Deciphering protein-protein interactions. Part I. Experimental techniques and databases.
Shoemaker BA, Panchenko AR. Shoemaker BA, et al. PLoS Comput Biol. 2007 Mar 30;3(3):e42. doi: 10.1371/journal.pcbi.0030042. PLoS Comput Biol. 2007. PMID: 17397251 Free PMC article. Review. No abstract available. - A hybrid graph-theoretic method for mining overlapping functional modules in large sparse protein interaction networks.
Zhang S, Liu HW, Ning XM, Zhang XS. Zhang S, et al. Int J Data Min Bioinform. 2009;3(1):68-84. doi: 10.1504/ijdmb.2009.023885. Int J Data Min Bioinform. 2009. PMID: 19432377 - Predicting protein functions with message passing algorithms.
Leone M, Pagnani A. Leone M, et al. Bioinformatics. 2005 Jan 15;21(2):239-47. doi: 10.1093/bioinformatics/bth491. Epub 2004 Sep 17. Bioinformatics. 2005. PMID: 15377508 - Visualisation and navigation methods for typed protein-protein interaction networks.
Friedrich C, Schreiber F. Friedrich C, et al. Appl Bioinformatics. 2003;2(3 Suppl):S19-24. Appl Bioinformatics. 2003. PMID: 15130812 - Searching the MINT database for protein interaction information.
Cesareni G, Chatr-Aryamontri A, Licata L, Ceol A. Cesareni G, et al. Curr Protoc Bioinformatics. 2008 Jun;Chapter 8:8.5.1-8.5.13. doi: 10.1002/0471250953.bi0805s22. Curr Protoc Bioinformatics. 2008. PMID: 18551417 Review.
Cited by
- Signal transduction pathways in the pentameric ligand-gated ion channels.
Mowrey D, Chen Q, Liang Y, Liang J, Xu Y, Tang P. Mowrey D, et al. PLoS One. 2013 May 8;8(5):e64326. doi: 10.1371/journal.pone.0064326. Print 2013. PLoS One. 2013. PMID: 23667707 Free PMC article. - Mapping allosteric communications within individual proteins.
Wang J, Jain A, McDonald LR, Gambogi C, Lee AL, Dokholyan NV. Wang J, et al. Nat Commun. 2020 Jul 31;11(1):3862. doi: 10.1038/s41467-020-17618-2. Nat Commun. 2020. PMID: 32737291 Free PMC article. - Deep2Full: Evaluating strategies for selecting the minimal mutational experiments for optimal computational predictions of deep mutational scan outcomes.
Sruthi CK, Prakash M. Sruthi CK, et al. PLoS One. 2020 Jan 10;15(1):e0227621. doi: 10.1371/journal.pone.0227621. eCollection 2020. PLoS One. 2020. PMID: 31923916 Free PMC article. - Integrated Computational Approaches and Tools forAllosteric Drug Discovery.
Sheik Amamuddy O, Veldman W, Manyumwa C, Khairallah A, Agajanian S, Oluyemi O, Verkhivker G, Tastan Bishop O. Sheik Amamuddy O, et al. Int J Mol Sci. 2020 Jan 28;21(3):847. doi: 10.3390/ijms21030847. Int J Mol Sci. 2020. PMID: 32013012 Free PMC article. Review. - Modulation of Human Hsp90α Conformational Dynamics by Allosteric Ligand Interaction at the C-Terminal Domain.
Penkler DL, Tastan Bishop Ö. Penkler DL, et al. Sci Rep. 2019 Feb 7;9(1):1600. doi: 10.1038/s41598-018-35835-0. Sci Rep. 2019. PMID: 30733455 Free PMC article.
References
- Bahar I, Atilgan A, Erman B. Direct evaluation of thermal fluctuations in protein using a single parameter harmonic potential. Folding Design. 1997;2:173–181. - PubMed
- Haliloglu T, Bahar I, Erman B. Gaussian dynamics of folded proteins. Phys Rev Lett. 1997;79:3090–3093.
- Bahar I, Atilgan AR, Demirel MC, Erman B. Vibrational dynamics of proteins: Significance of slow and fast modes in relation to function and stability. Phys Rev Lett. 1998;80:2733–2736.
- Cui Q, Bahar I, editors. Normal Mode Analysis: Theory and applications to biological and chemical systems. Boca Raton (Florida): CRC Press; 2006.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials