Correlated mutation analyses on very large sequence families - PubMed (original) (raw)
Correlated mutation analyses on very large sequence families
L Oliveira et al. Chembiochem. 2002.
Abstract
The 'omics era' (the era of genomics, proteomics, and so forth) is marked by a flood of data that need to be interpreted to become useful information. Thanks to genome sequencing projects, large numbers of sequence families with more than a thousand members each are now available. Novel analytical techniques are needed to deal with this avalanche of sequence data. Sequence entropy is a measure of the information present in an alignment, whereas sequence variability represents the mutational flexibility at a particular position. Entropy versus variability plots can reveal the roles of groups of residues in the overall function of a protein. Such roles can be as part of the main active site, part of a modulator binding site, or transduction of a signal between those sites. Residues that are involved in a common function tend to stay conserved as a group, but when they mutate, they tend to mutate together. Correlated mutation analysis can detect groups of residue positions that show this behaviour. The combination of entropy, variability and correlation is a powerful tool to convert sequence data into useful information. This analysis can, for example, detect the key residues involved in cooperativity in globins, the switch regions in ras-like proteins and the calcium binding and signalling residues in serine proteases. We have extrapolated from these three classes of structurally and functionally well-described proteins to G-protein-coupled receptors (GPCRs). We can detect the residues in the main functional site in GPCRs that are responsible for G-protein coupling, the residues in the endogenous agonist binding site, and the residues in between that transduce the signal to and fro between these sites. The results are discussed in the light of a simple two-step evolutionary model for the development of functional proteins.
Similar articles
- Identification of functionally conserved residues with the use of entropy-variability plots.
Oliveira L, Paiva PB, Paiva AC, Vriend G. Oliveira L, et al. Proteins. 2003 Sep 1;52(4):544-52. doi: 10.1002/prot.10490. Proteins. 2003. PMID: 12910454 - A family-based approach reveals the function of residues in the nuclear receptor ligand-binding domain.
Folkertsma S, van Noort P, Van Durme J, Joosten HJ, Bettler E, Fleuren W, Oliveira L, Horn F, de Vlieg J, Vriend G. Folkertsma S, et al. J Mol Biol. 2004 Aug 6;341(2):321-35. doi: 10.1016/j.jmb.2004.05.075. J Mol Biol. 2004. PMID: 15276826 Review. - Sequence analysis reveals how G protein-coupled receptors transduce the signal to the G protein.
Oliveira L, Paiva PB, Paiva AC, Vriend G. Oliveira L, et al. Proteins. 2003 Sep 1;52(4):553-60. doi: 10.1002/prot.10489. Proteins. 2003. PMID: 12910455 - Expanding the nitrogen regulatory protein superfamily: Homology detection at below random sequence identity.
Kinch LN, Grishin NV. Kinch LN, et al. Proteins. 2002 Jul 1;48(1):75-84. doi: 10.1002/prot.10110. Proteins. 2002. PMID: 12012339 - Structural and functional restraints in the evolution of protein families and superfamilies.
Gong S, Worth CL, Bickerton GR, Lee S, Tanramluk D, Blundell TL. Gong S, et al. Biochem Soc Trans. 2009 Aug;37(Pt 4):727-33. doi: 10.1042/BST0370727. Biochem Soc Trans. 2009. PMID: 19614584 Review.
Cited by
- Computing highly correlated positions using mutual information and graph theory for G protein-coupled receptors.
Fatakia SN, Costanzi S, Chow CC. Fatakia SN, et al. PLoS One. 2009;4(3):e4681. doi: 10.1371/journal.pone.0004681. Epub 2009 Mar 5. PLoS One. 2009. PMID: 19262747 Free PMC article. - UET: a database of evolutionarily-predicted functional determinants of protein sequences that cluster as functional sites in protein structures.
Lua RC, Wilson SJ, Konecki DM, Wilkins AD, Venner E, Morgan DH, Lichtarge O. Lua RC, et al. Nucleic Acids Res. 2016 Jan 4;44(D1):D308-12. doi: 10.1093/nar/gkv1279. Epub 2015 Nov 20. Nucleic Acids Res. 2016. PMID: 26590254 Free PMC article. - Why should we care about molecular coevolution?
Codoñer FM, Fares MA. Codoñer FM, et al. Evol Bioinform Online. 2008 Feb 14;4:29-38. Evol Bioinform Online. 2008. PMID: 19204805 Free PMC article. - New methods to measure residues coevolution in proteins.
Gao H, Dou Y, Yang J, Wang J. Gao H, et al. BMC Bioinformatics. 2011 May 26;12:206. doi: 10.1186/1471-2105-12-206. BMC Bioinformatics. 2011. PMID: 21612664 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources