TIGRFAMs: a protein family resource for the functional identification of proteins - PubMed (original) (raw)
TIGRFAMs: a protein family resource for the functional identification of proteins
D H Haft et al. Nucleic Acids Res. 2001.
Abstract
TIGRFAMs is a collection of protein families featuring curated multiple sequence alignments, hidden Markov models and associated information designed to support the automated functional identification of proteins by sequence homology. We introduce the term 'equivalog' to describe members of a set of homologous proteins that are conserved with respect to function since their last common ancestor. Related proteins are grouped into equivalog families where possible, and otherwise into protein families with other hierarchically defined homology types. TIGRFAMs currently contains over 800 protein families, available for searching or downloading at www.tigr.org/TIGRFAMs. Classification by equivalog family, where achievable, complements classification by orthology, superfamily, domain or motif. It provides the information best suited for automatic assignment of specific functions to proteins from large-scale genome sequencing projects.
Figures
Figure 1
Homology relationships can be classified by evolutionary history, as shown in this model phylogenetic tree. The ancestral node, or root, is at the top. Duplication creates paralogs A and B with distinct function. Speciation creates an orthologous set A1, A2 and A3 from A, and B1, B2 and B3 from B. If B1, B2 and B3 share the same function, they are equivalogs as well as orthologs. Dashed lines indicate a possible pattern of gene loss that leaves only A1, B2 and B3. The resulting protein subfamily should exhibit bi-directional best hits across species but is not orthologous and does not show conserved function.
Similar articles
- The TIGRFAMs database of protein families.
Haft DH, Selengut JD, White O. Haft DH, et al. Nucleic Acids Res. 2003 Jan 1;31(1):371-3. doi: 10.1093/nar/gkg128. Nucleic Acids Res. 2003. PMID: 12520025 Free PMC article. - TIGRFAMs and Genome Properties: tools for the assignment of molecular function and biological process in prokaryotic genomes.
Selengut JD, Haft DH, Davidsen T, Ganapathy A, Gwinn-Giglio M, Nelson WC, Richter AR, White O. Selengut JD, et al. Nucleic Acids Res. 2007 Jan;35(Database issue):D260-4. doi: 10.1093/nar/gkl1043. Epub 2006 Dec 6. Nucleic Acids Res. 2007. PMID: 17151080 Free PMC article. - TIGRFAMs and Genome Properties in 2013.
Haft DH, Selengut JD, Richter RA, Harkins D, Basu MK, Beck E. Haft DH, et al. Nucleic Acids Res. 2013 Jan;41(Database issue):D387-95. doi: 10.1093/nar/gks1234. Epub 2012 Nov 28. Nucleic Acids Res. 2013. PMID: 23197656 Free PMC article. - Database analysis and gene discovery in pharmacogenetics.
Board P, Tetlow N, Blackburn A, Chelvanayagam G. Board P, et al. Clin Chem Lab Med. 2000 Sep;38(9):863-7. doi: 10.1515/CCLM.2000.125. Clin Chem Lab Med. 2000. PMID: 11097341 Review. - Clustering and analysis of protein families.
Kriventseva EV, Biswas M, Apweiler R. Kriventseva EV, et al. Curr Opin Struct Biol. 2001 Jun;11(3):334-9. doi: 10.1016/s0959-440x(00)00211-6. Curr Opin Struct Biol. 2001. PMID: 11406384 Review.
Cited by
- Genomic Comparison and Spatial Distribution of Different Synechococcus Phylotypes in the Black Sea.
Di Cesare A, Dzhembekova N, Cabello-Yeves PJ, Eckert EM, Slabakova V, Slabakova N, Peneva E, Bertoni R, Corno G, Salcher MM, Kamburska L, Bertoni F, Rodriguez-Valera F, Moncheva S, Callieri C. Di Cesare A, et al. Front Microbiol. 2020 Aug 12;11:1979. doi: 10.3389/fmicb.2020.01979. eCollection 2020. Front Microbiol. 2020. PMID: 32903389 Free PMC article. - Genome and Transcriptome of Clostridium phytofermentans, Catalyst for the Direct Conversion of Plant Feedstocks to Fuels.
Petit E, Coppi MV, Hayes JC, Tolonen AC, Warnick T, Latouf WG, Amisano D, Biddle A, Mukherjee S, Ivanova N, Lykidis A, Land M, Hauser L, Kyrpides N, Henrissat B, Lau J, Schnell DJ, Church GM, Leschine SB, Blanchard JL. Petit E, et al. PLoS One. 2015 Jun 2;10(6):e0118285. doi: 10.1371/journal.pone.0118285. eCollection 2015. PLoS One. 2015. PMID: 26035711 Free PMC article. - Draft Genome Sequence of the Tomato Stem Endophyte Bacillus safensis TS3.
Chebotar VK, Gancheva MS, Voshol GP, Malfanova NV, Karasev ES, Chizhevskaya EP, Zaplatkin AN, Khiutti AV, Lazarev AM, Gadjiev NM, Lebedeva VA, Balakina SV. Chebotar VK, et al. Microbiol Resour Announc. 2022 Nov 17;11(11):e0081622. doi: 10.1128/mra.00816-22. Epub 2022 Oct 27. Microbiol Resour Announc. 2022. PMID: 36301120 Free PMC article. - Microbial succession during the transition from active to inactive stages of deep-sea hydrothermal vent sulfide chimneys.
Hou J, Sievert SM, Wang Y, Seewald JS, Natarajan VP, Wang F, Xiao X. Hou J, et al. Microbiome. 2020 Jun 30;8(1):102. doi: 10.1186/s40168-020-00851-8. Microbiome. 2020. PMID: 32605604 Free PMC article. - Evolution of sensory complexity recorded in a myxobacterial genome.
Goldman BS, Nierman WC, Kaiser D, Slater SC, Durkin AS, Eisen JA, Ronning CM, Barbazuk WB, Blanchard M, Field C, Halling C, Hinkle G, Iartchuk O, Kim HS, Mackenzie C, Madupu R, Miller N, Shvartsbeyn A, Sullivan SA, Vaudin M, Wiegand R, Kaplan HB. Goldman BS, et al. Proc Natl Acad Sci U S A. 2006 Oct 10;103(41):15200-5. doi: 10.1073/pnas.0607335103. Epub 2006 Oct 2. Proc Natl Acad Sci U S A. 2006. PMID: 17015832 Free PMC article.
References
- Sonnhammer E.L., Eddy,S.R. and Durbin,R. (1997) Pfam: a comprehensive database of protein domain families based on seed alignments. Proteins, 28, 405–420. - PubMed
- Srinivasarao G.Y., Yeh,L.S., Marzec,C.R., Orcutt,B.C. and Barker,W.C. (1999) PIR-ALN: a database of protein sequence alignments. Bioinformatics, 15, 382–390. - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources