The TIGRFAMs database of protein families
Daniel H Haft et al. Nucleic Acids Res. 2003.
Abstract
TIGRFAMs is a collection of manually curated protein families consisting of hidden Markov models (HMMs), multiple sequence alignments, commentary, Gene Ontology (GO) assignments, literature references and pointers to related TIGRFAMs, Pfam and InterPro models. These models are designed to support both automated and manually curated annotation of genomes. TIGRFAMs contains models of full-length proteins and shorter regions at the levels of superfamilies, subfamilies and equivalogs, where equivalogs are sets of homologous proteins conserved with respect to function since their last common ancestor. The scope of each model is set by raising or lowering cutoff scores and choosing members of the seed alignment to group proteins sharing specific function (equivalog) or more general properties. The overall goal is to provide information with maximum utility for the annotation process. TIGRFAMs is thus complementary to Pfam, whose models typically achieve broad coverage across distant homologs but end at the boundaries of conserved structural domains. The database currently contains over 1600 protein families. TIGRFAMs is available for searching or downloading at www.tigr.org/TIGRFAMs.
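The way cutoff scores set the scope of a model can be illustrated with a small sketch. This is not the TIGRFAMs implementation: the `Model` class, the `best_assignment` helper, the accessions `MODEL_A` and `MODEL_B`, and all score and cutoff values are hypothetical and chosen only for illustration. The sketch assumes that each protein already has one score per model (for example from an HMM search) and that, among the models whose cutoff is met, the most specific level is preferred for annotation.

```python
from dataclasses import dataclass

# Specificity ranking for the three model levels named in the abstract.
# Preferring the most specific passing level is an assumption of this sketch.
LEVEL_RANK = {"superfamily": 0, "subfamily": 1, "equivalog": 2}


@dataclass(frozen=True)
class Model:
    accession: str  # hypothetical identifier, not a real TIGRFAMs accession
    level: str      # "superfamily", "subfamily" or "equivalog"
    cutoff: float   # hypothetical score threshold that sets the model's scope


def best_assignment(scores, models):
    """Return the most specific model whose cutoff the score meets, or None."""
    passing = [
        m for m in models
        if scores.get(m.accession, float("-inf")) >= m.cutoff
    ]
    if not passing:
        return None
    return max(passing, key=lambda m: LEVEL_RANK[m.level])


models = [
    Model("MODEL_A", "superfamily", 50.0),
    Model("MODEL_B", "equivalog", 300.0),
]

# A protein scoring above both cutoffs receives the specific (equivalog) call.
strong = best_assignment({"MODEL_A": 410.0, "MODEL_B": 355.0}, models)

# A more distant homolog clears only the permissive superfamily cutoff.
weak = best_assignment({"MODEL_A": 120.0, "MODEL_B": 90.0}, models)

print(strong.accession, weak.accession)
```

Raising the cutoff of `MODEL_B` in this sketch would exclude more distant homologs from the equivalog call, while lowering it would broaden the set of proteins given the specific annotation.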
Figures
Figure 1
Neighbor-joining phylogenetic tree of aromatic amino acid hydroxylases. The nodes of a neighbor-joining tree based on aligned sequences are labeled to show assigned function. The tree is shown rooted at the left such that bacterial phenylalanine-4-hydroxylases (Phe-4) represented by TIGR01267, a monomeric form, comprise the outgroup. Three other HMMs represent tetrameric eukaryotic forms of aromatic amino acid hydroxylases (Tyr-3: tyrosine-3-monooxygenase, Trp-5: tryptophan-5-monooxygenase). The four equivalog models are children of the Pfam model PF00351. Note that the three closely related sets of eukaryotic proteins could have been represented by an additional subfamily HMM.
Figure 2
HMM hit regions for pyruvate carboxylase. The thin line represents the polypeptide sequence. Bars represent hit regions for various HMMs. Numbers in square brackets show the current size of each family. The number for each domain is larger than the number for the equivalog model because each domain is distributed more broadly than solely among pyruvate carboxylases.
Similar articles
- TIGRFAMs: a protein family resource for the functional identification of proteins.
Haft DH, Loftus BJ, Richardson DL, Yang F, Eisen JA, Paulsen IT, White O. Nucleic Acids Res. 2001 Jan 1;29(1):41-3. doi: 10.1093/nar/29.1.41. PMID: 11125044 Free PMC article.
- TIGRFAMs and Genome Properties in 2013.
Haft DH, Selengut JD, Richter RA, Harkins D, Basu MK, Beck E. Nucleic Acids Res. 2013 Jan;41(Database issue):D387-95. doi: 10.1093/nar/gks1234. Epub 2012 Nov 28. PMID: 23197656 Free PMC article.
- TIGRFAMs and Genome Properties: tools for the assignment of molecular function and biological process in prokaryotic genomes.
Selengut JD, Haft DH, Davidsen T, Ganapathy A, Gwinn-Giglio M, Nelson WC, Richter AR, White O. Nucleic Acids Res. 2007 Jan;35(Database issue):D260-4. doi: 10.1093/nar/gkl1043. Epub 2006 Dec 6. PMID: 17151080 Free PMC article.
- HMM-based databases in InterPro.
Bateman A, Haft DH. Brief Bioinform. 2002 Sep;3(3):236-45. doi: 10.1093/bib/3.3.236. PMID: 12230032
- PASS2: an automated database of protein alignments organised as structural superfamilies.
Bhaduri A, Pugalenthi G, Sowdhamini R. BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35. PMID: 15059245 Free PMC article.
Cited by
- Evolutionary origin and population diversity of a cryptic hybrid pathogen.
Steenwyk JL, Knowles S, Bastos RW, Balamurugan C, Rinker D, Mead ME, Roberts CD, Raja HA, Li Y, Colabardini AC, de Castro PA, Dos Reis TF, Gumilang A, Almagro-Molto M, Alanio A, Garcia-Hermoso D, Delbaje E, Pontes L, Pinzan CF, Schreiber AZ, Canóvas D, Sanchez Luperini R, Lagrou K, Torrado E, Rodrigues F, Oberlies NH, Zhou X, Goldman GH, Rokas A. Nat Commun. 2024 Sep 28;15(1):8412. doi: 10.1038/s41467-024-52639-1. PMID: 39333551 Free PMC article.
- Fused radical SAM and αKG-HExxH domain proteins contain a distinct structural fold and catalyse cyclophane formation and β-hydroxylation.
Morishita Y, Ma S, De La Mora E, Li H, Chen H, Ji X, Usclat A, Amara P, Sugiyama R, Tooh YW, Gunawan G, Pérard J, Nicolet Y, Zhang Q, Morinaka BI. Nat Chem. 2024 Sep 18. doi: 10.1038/s41557-024-01596-9. Online ahead of print. PMID: 39294420
- Seqrutinator: scrutiny of large protein superfamily sequence datasets for the identification and elimination of non-functional homologues.
Amalfitano A, Stocchi N, Atencio HM, Villarreal F, Ten Have A. Genome Biol. 2024 Aug 26;25(1):230. doi: 10.1186/s13059-024-03371-y. PMID: 39187866 Free PMC article.
- Birth of protein folds and functions in the virome.
Nomburg J, Doherty EE, Price N, Bellieny-Rabelo D, Zhu YK, Doudna JA. Nature. 2024 Sep;633(8030):710-717. doi: 10.1038/s41586-024-07809-y. Epub 2024 Aug 26. PMID: 39187718 Free PMC article.
- Climate-driven succession in marine microbiome biodiversity and biogeochemical function.
Larkin AA, Brock ML, Fagan AJ, Moreno AR, Gerace SD, Lees LE, Suarez SA, Eloe-Fadrosh EA, Martiny A. Res Sq [Preprint]. 2024 Aug 16:rs.3.rs-4682733. doi: 10.21203/rs.3.rs-4682733/v1. PMID: 39184082 Free PMC article. Preprint.