Supra-domains: evolutionary units larger than single protein domains
Christine Vogel et al. J Mol Biol. 2004.
Abstract
Domains are the evolutionary units that comprise proteins, and most proteins are built from more than one domain. Domains can be shuffled by recombination to create proteins with new arrangements of domains. Using structural domain assignments, we examined the combinations of domains in the proteins of 131 completely sequenced organisms. We found two-domain and three-domain combinations that recur in different protein contexts with different partner domains. The domains within these combinations have a particular functional and spatial relationship. These units are larger than individual domains and we term them "supra-domains". Amongst the supra-domains, we identified some 1400 (1203 two-domain and 166 three-domain) combinations that are statistically significantly over-represented relative to the occurrence and versatility of the individual component domains. Over one-third of all structurally assigned multi-domain proteins contain these over-represented supra-domains. This means that investigation of the structural and functional relationships of the domains forming these popular combinations would be particularly useful for an understanding of multi-domain protein function and evolution as well as for genome annotation. These and other supra-domains were analysed for their versatility, duplication, their distribution across the three kingdoms of life and their functional classes. By examining the three-dimensional structures of several examples of supra-domains in different biological processes, we identify two basic types of spatial relationships between the component domains: the combined function of the two domains is such that either the geometry of the two domains is crucial and there is a tight constraint on the interface, or the precise orientation of the domains is less important and they are spatially separate. Frequently, the role of the supra-domain becomes clear only once the three-dimensional structure is known. Since this is the case for only a quarter of the supra-domains, we provide a list of the most important unknown supra-domains as potential targets for structural genomics projects.
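As a rough illustration of what "recur in different protein contexts with different partner domains" can mean for two-domain combinations, the sketch below scans toy domain architectures and keeps the adjacent pairs that are flanked by at least two distinct partner domains. The domain labels, the function name `recurring_pairs` and the threshold `min_partners` are hypothetical choices for this example. It does not reproduce the statistical test for over-representation mentioned in the abstract, which is not specified here.

```python
from collections import defaultdict


def recurring_pairs(architectures, min_partners=2):
    """Return adjacent domain pairs flanked by >= min_partners distinct partners.

    `architectures` is an iterable of domain tuples in N- to C-terminal order.
    A partner is recorded as (side, domain), where side is "before" or "after".
    """
    partners = defaultdict(set)
    for arch in architectures:
        for i in range(len(arch) - 1):
            pair = (arch[i], arch[i + 1])
            seen = partners[pair]  # creates the entry even if the pair has no partner
            if i > 0:
                seen.add(("before", arch[i - 1]))
            if i + 2 < len(arch):
                seen.add(("after", arch[i + 2]))
    return {pair: seen for pair, seen in partners.items() if len(seen) >= min_partners}


# Toy input with hypothetical domain labels (not real superfamily names).
architectures = [
    ("A", "B", "C"),
    ("D", "A", "B"),
    ("A", "B"),
    ("C", "D"),
]

result = recurring_pairs(architectures)
for pair, seen in result.items():
    print(pair, sorted(seen))
```

In this toy input only the pair A-B is kept, because it occurs once followed by C and once preceded by D. A real analysis would additionally have to weigh such counts against how common and how versatile each individual domain is.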