Correlated mutation analyses on very large sequence families - PubMed (original) (raw)
Correlated mutation analyses on very large sequence families
L Oliveira et al. Chembiochem. 2002.
Abstract
The 'omics era' (the era of genomics, proteomics, and so forth) is marked by a flood of data that need to be interpreted to become useful information. Thanks to genome sequencing projects, large numbers of sequence families with more than a thousand members each are now available. Novel analytical techniques are needed to deal with this avalanche of sequence data. Sequence entropy is a measure of the information present in an alignment, whereas sequence variability represents the mutational flexibility at a particular position. Entropy versus variability plots can reveal the roles of groups of residues in the overall function of a protein. Such roles can be as part of the main active site, part of a modulator binding site, or transduction of a signal between those sites. Residues that are involved in a common function tend to stay conserved as a group, but when they mutate, they tend to mutate together. Correlated mutation analysis can detect groups of residue positions that show this behaviour. The combination of entropy, variability and correlation is a powerful tool to convert sequence data into useful information. This analysis can, for example, detect the key residues involved in cooperativity in globins, the switch regions in ras-like proteins and the calcium binding and signalling residues in serine proteases. We have extrapolated from these three classes of structurally and functionally well-described proteins to G-protein-coupled receptors (GPCRs). We can detect the residues in the main functional site in GPCRs that are responsible for G-protein coupling, the residues in the endogenous agonist binding site, and the residues in between that transduce the signal to and fro between these sites. The results are discussed in the light of a simple two-step evolutionary model for the development of functional proteins.
Similar articles
- Identification of functionally conserved residues with the use of entropy-variability plots.
Oliveira L, Paiva PB, Paiva AC, Vriend G. Oliveira L, et al. Proteins. 2003 Sep 1;52(4):544-52. doi: 10.1002/prot.10490. Proteins. 2003. PMID: 12910454 - A family-based approach reveals the function of residues in the nuclear receptor ligand-binding domain.
Folkertsma S, van Noort P, Van Durme J, Joosten HJ, Bettler E, Fleuren W, Oliveira L, Horn F, de Vlieg J, Vriend G. Folkertsma S, et al. J Mol Biol. 2004 Aug 6;341(2):321-35. doi: 10.1016/j.jmb.2004.05.075. J Mol Biol. 2004. PMID: 15276826 Review. - Sequence analysis reveals how G protein-coupled receptors transduce the signal to the G protein.
Oliveira L, Paiva PB, Paiva AC, Vriend G. Oliveira L, et al. Proteins. 2003 Sep 1;52(4):553-60. doi: 10.1002/prot.10489. Proteins. 2003. PMID: 12910455 - Expanding the nitrogen regulatory protein superfamily: Homology detection at below random sequence identity.
Kinch LN, Grishin NV. Kinch LN, et al. Proteins. 2002 Jul 1;48(1):75-84. doi: 10.1002/prot.10110. Proteins. 2002. PMID: 12012339 - Structural and functional restraints in the evolution of protein families and superfamilies.
Gong S, Worth CL, Bickerton GR, Lee S, Tanramluk D, Blundell TL. Gong S, et al. Biochem Soc Trans. 2009 Aug;37(Pt 4):727-33. doi: 10.1042/BST0370727. Biochem Soc Trans. 2009. PMID: 19614584 Review.
Cited by
- Computing highly correlated positions using mutual information and graph theory for G protein-coupled receptors.
Fatakia SN, Costanzi S, Chow CC. Fatakia SN, et al. PLoS One. 2009;4(3):e4681. doi: 10.1371/journal.pone.0004681. Epub 2009 Mar 5. PLoS One. 2009. PMID: 19262747 Free PMC article. - UET: a database of evolutionarily-predicted functional determinants of protein sequences that cluster as functional sites in protein structures.
Lua RC, Wilson SJ, Konecki DM, Wilkins AD, Venner E, Morgan DH, Lichtarge O. Lua RC, et al. Nucleic Acids Res. 2016 Jan 4;44(D1):D308-12. doi: 10.1093/nar/gkv1279. Epub 2015 Nov 20. Nucleic Acids Res. 2016. PMID: 26590254 Free PMC article. - Emerging methods in protein co-evolution.
de Juan D, Pazos F, Valencia A. de Juan D, et al. Nat Rev Genet. 2013 Apr;14(4):249-61. doi: 10.1038/nrg3414. Epub 2013 Mar 5. Nat Rev Genet. 2013. PMID: 23458856 Review. - New vistas in GPCR 3D structure prediction.
Rayan A. Rayan A. J Mol Model. 2010 Feb;16(2):183-91. doi: 10.1007/s00894-009-0533-y. Epub 2009 Jun 24. J Mol Model. 2010. PMID: 19551412 - Comparison of Algorithms for Prediction of Protein Structural Features from Evolutionary Data.
Bywater RP. Bywater RP. PLoS One. 2016 Mar 10;11(3):e0150769. doi: 10.1371/journal.pone.0150769. eCollection 2016. PLoS One. 2016. PMID: 26963911 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources