The TIGRFAMs database of protein families
Daniel H Haft et al. Nucleic Acids Res. 2003.
Abstract
TIGRFAMs is a collection of manually curated protein families consisting of hidden Markov models (HMMs), multiple sequence alignments, commentary, Gene Ontology (GO) assignments, literature references and pointers to related TIGRFAMs, Pfam and InterPro models. These models are designed to support both automated and manually curated annotation of genomes. TIGRFAMs contains models of full-length proteins and shorter regions at the levels of superfamilies, subfamilies and equivalogs, where equivalogs are sets of homologous proteins conserved with respect to function since their last common ancestor. The scope of each model is set by raising or lowering cutoff scores and choosing members of the seed alignment to group proteins sharing specific function (equivalog) or more general properties. The overall goal is to provide information with maximum utility for the annotation process. TIGRFAMs is thus complementary to Pfam, whose models typically achieve broad coverage across distant homologs but end at the boundaries of conserved structural domains. The database currently contains over 1600 protein families. TIGRFAMs is available for searching or downloading at www.tigr.org/TIGRFAMs.
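The way cutoff scores set the scope of a model can be illustrated with a minimal sketch. It assumes each model carries two thresholds, here called `trusted` and `noise`; the abstract says only "cutoff scores", so the two-threshold scheme, the class and function names, and all scores below are hypothetical and do not come from the TIGRFAMs release.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelCutoffs:
    """Hypothetical per-model score thresholds, in bits."""
    trusted: float  # scores at or above this are treated as family members
    noise: float    # scores below this are treated as non-members


def classify_hit(score: float, cutoffs: ModelCutoffs) -> str:
    """Place one HMM hit score relative to the two cutoffs."""
    if score >= cutoffs.trusted:
        return "member"
    if score >= cutoffs.noise:
        return "uncertain"
    return "non-member"


def classify_all(hits: dict[str, float], cutoffs: ModelCutoffs) -> dict[str, str]:
    """Classify every hit against the same model cutoffs."""
    return {name: classify_hit(score, cutoffs) for name, score in hits.items()}


# Made-up scores for three proteins against one model.
hits = {"protA": 412.0, "protB": 180.5, "protC": 95.0}

# Raising the cutoffs narrows the family (equivalog-like scope);
# lowering them widens it (subfamily- or superfamily-like scope).
narrow = ModelCutoffs(trusted=350.0, noise=200.0)
broad = ModelCutoffs(trusted=150.0, noise=100.0)

print(classify_all(hits, narrow))
print(classify_all(hits, broad))
```

With the higher cutoffs only the top-scoring protein is accepted; lowering them admits a second, more distant homolog. This mirrors the trade-off between function-specific and more general models described in the abstract, although the choice of seed alignment members, which the sketch does not model, matters as well.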
Figures
Figure 1
Neighbor-joining phylogenetic tree of aromatic amino acid hydroxylases. The nodes of a neighbor-joining tree based on aligned sequences are labeled to show assigned function. The tree is shown rooted at the left such that bacterial phenylalanine-4-hydroxylases (Phe-4) represented by TIGR01267, a monomeric form, comprise the outgroup. Three other HMMs represent tetrameric eukaryotic forms of aromatic amino acid hydroxylases (Tyr-3: tyrosine-3-monooxygenase, Trp-5: tryptophan-5-monooxygenase). The four equivalog models are children of the Pfam model PF00351. Note that the three closely related sets of eukaryotic proteins could have been represented by an additional subfamily HMM.
Figure 2
HMM hit regions for pyruvate carboxylase. The thin line represents the polypeptide sequence. Bars represent hit regions for various HMMs. Numbers in square brackets show the current size of each family. The number for each domain is larger than the number for the equivalog model because each domain is distributed more broadly than solely among pyruvate carboxylases.
Similar articles
- TIGRFAMs: a protein family resource for the functional identification of proteins.
Haft DH, Loftus BJ, Richardson DL, Yang F, Eisen JA, Paulsen IT, White O. Nucleic Acids Res. 2001 Jan 1;29(1):41-3. doi: 10.1093/nar/29.1.41. PMID: 11125044 Free PMC article.
- TIGRFAMs and Genome Properties in 2013.
Haft DH, Selengut JD, Richter RA, Harkins D, Basu MK, Beck E. Nucleic Acids Res. 2013 Jan;41(Database issue):D387-95. doi: 10.1093/nar/gks1234. Epub 2012 Nov 28. PMID: 23197656 Free PMC article.
- TIGRFAMs and Genome Properties: tools for the assignment of molecular function and biological process in prokaryotic genomes.
Selengut JD, Haft DH, Davidsen T, Ganapathy A, Gwinn-Giglio M, Nelson WC, Richter AR, White O. Nucleic Acids Res. 2007 Jan;35(Database issue):D260-4. doi: 10.1093/nar/gkl1043. Epub 2006 Dec 6. PMID: 17151080 Free PMC article.
- HMM-based databases in InterPro.
Bateman A, Haft DH. Brief Bioinform. 2002 Sep;3(3):236-45. doi: 10.1093/bib/3.3.236. PMID: 12230032
- PASS2: an automated database of protein alignments organised as structural superfamilies.
Bhaduri A, Pugalenthi G, Sowdhamini R. BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35. PMID: 15059245 Free PMC article.
Cited by
- De novo transcriptome analysis of the Indian squid Uroteuthis duvaucelii (Orbigny, 1848) from the Indian Ocean.
Krishnan N, Sukumaran S, Vysakh VG, Sebastian W, Jose A, Raj N, Gopalakrishnan A. Sci Data. 2024 Nov 16;11(1):1236. doi: 10.1038/s41597-024-04112-3. PMID: 39550368 Free PMC article.
- A step-by-step procedure for analysing the 16S rRNA-based microbiome diversity using QIIME 2 and comprehensive PICRUSt2 illustration for functional prediction.
Srivastava A, Akhter Y, Verma D. Arch Microbiol. 2024 Nov 14;206(12):467. doi: 10.1007/s00203-024-04177-z. PMID: 39540937 Review.
- Unveiling lignocellulolytic potential: a genomic exploration of bacterial lineages within the termite gut.
Salgado JFM, Hervé V, Vera MAG, Tokuda G, Brune A. Microbiome. 2024 Oct 15;12(1):201. doi: 10.1186/s40168-024-01917-7. PMID: 39407345 Free PMC article.
- Complete genome sequence of Rhodococcus qingshengii phage Perlina.
Jaryenneh J, Krishna R, Schoeniger JS, Mageeney CM. Microbiol Resour Announc. 2024 Nov 12;13(11):e0086924. doi: 10.1128/mra.00869-24. Epub 2024 Oct 8. PMID: 39377611 Free PMC article.
- High-throughput protein characterization by complementation using DNA barcoded fragment libraries.
Biggs BW, Price MN, Lai D, Escobedo J, Fortanel Y, Huang YY, Kim K, Trotter VV, Kuehl JV, Lui LM, Chakraborty R, Deutschbauer AM, Arkin AP. Mol Syst Biol. 2024 Nov;20(11):1207-1229. doi: 10.1038/s44320-024-00068-z. Epub 2024 Oct 7. PMID: 39375541 Free PMC article.