Gene Ontology annotation of sequence-specific DNA binding transcription factors: setting the stage for a large-scale curation effort - PubMed (original) (raw)
Gene Ontology annotation of sequence-specific DNA binding transcription factors: setting the stage for a large-scale curation effort
Sushil Tripathi et al. Database (Oxford). 2013.
Abstract
Transcription factors control which information in a genome becomes transcribed to produce RNAs that function in the biological systems of cells and organisms. Reliable and comprehensive information about transcription factors is invaluable for large-scale network-based studies. However, existing transcription factor knowledge bases are still lacking in well-documented functional information. Here, we provide guidelines for a curation strategy, which constitutes a robust framework for using the controlled vocabularies defined by the Gene Ontology Consortium to annotate specific DNA binding transcription factors (DbTFs) based on experimental evidence reported in literature. Our standardized protocol and workflow for annotating specific DNA binding RNA polymerase II transcription factors is designed to document high-quality and decisive evidence from valid experimental methods. Within a collaborative biocuration effort involving the user community, we are now in the process of exhaustively annotating the full repertoire of human, mouse and rat proteins that qualify as DbTFs in as much as they are experimentally documented in the biomedical literature today. The completion of this task will significantly enrich Gene Ontology-based information resources for the research community. Database URL: www.tfcheckpoint.org.
Figures
Figure 1.
Primary GO terms/subgraphs used for DbTF annotation. (A) GO subgraph used for sequence-specific DbTF. In this graph, sequence-specific DNA binding MF terms (yellow), sequence-specific DNA binding TF activity MF terms (green) and transcription regulation BP (blue) are shown along with the relationships between terms in the graph structure. (B) GO subgraph used for transcription factor binding transcription factors. In this graph, the different color coding represents the following: TF binding MF terms (yellow), transcription regulation BP (blue) and TF binding TF activity MF terms (green). I, P and H on top of the lines stand for relationships ‘is_a’, ‘part_of’ and ‘has_part’.
Figure 2.
Sequence-specific DNA binding TF (DbTF) curation workflow. This workflow represents the step-by-step procedure for curating experimentally verified mammalian DbTFs from scientific publications. Selection of scientific publication from the literature corpus is the starting point of the curation procedure. From each relevant publication, DbTF-specific GO-terms are annotated and recorded.
Figure 3.
UniProt-GOA screenshot of some of the DbTF annotations. The annotations generated using the DbTF curation guidelines discussed here can be accessed from the GO database using the QuickGO tool.
Similar articles
- Gene regulation knowledge commons: community action takes care of DNA binding transcription factors.
Tripathi S, Vercruysse S, Chawla K, Christie KR, Blake JA, Huntley RP, Orchard S, Hermjakob H, Thommesen L, Lægreid A, Kuiper M. Tripathi S, et al. Database (Oxford). 2016 Jun 5;2016:baw088. doi: 10.1093/database/baw088. Print 2016. Database (Oxford). 2016. PMID: 27270715 Free PMC article. - TFcheckpoint: a curated compendium of specific DNA-binding RNA polymerase II transcription factors.
Chawla K, Tripathi S, Thommesen L, Lægreid A, Kuiper M. Chawla K, et al. Bioinformatics. 2013 Oct 1;29(19):2519-20. doi: 10.1093/bioinformatics/btt432. Epub 2013 Aug 9. Bioinformatics. 2013. PMID: 23933972 - Finding Gene Associations by Text Mining and Annotating it with Gene Ontology.
Iyyappan OR, Manoharan S. Iyyappan OR, et al. Methods Mol Biol. 2022;2496:71-90. doi: 10.1007/978-1-0716-2305-3_4. Methods Mol Biol. 2022. PMID: 35713859 - The Renal Gene Ontology Annotation Initiative.
Alam-Faruque Y, Dimmer EC, Huntley RP, O'Donovan C, Scambler P, Apweiler R. Alam-Faruque Y, et al. Organogenesis. 2010 Apr-Jun;6(2):71-5. doi: 10.4161/org.6.2.11294. Organogenesis. 2010. PMID: 20885853 Free PMC article. Review. - Transcriptional regulation in mammalian cells by sequence-specific DNA binding proteins.
Mitchell PJ, Tjian R. Mitchell PJ, et al. Science. 1989 Jul 28;245(4916):371-8. doi: 10.1126/science.2667136. Science. 1989. PMID: 2667136 Review.
Cited by
- TFCheckpoint database update, a cross-referencing system for transcription factors from human, mouse and rat.
Acencio ML, Vazquez M, Chawla K, Lægreid A, Kuiper M. Acencio ML, et al. Nucleic Acids Res. 2024 Jan 5;52(D1):D334-D344. doi: 10.1093/nar/gkad1030. Nucleic Acids Res. 2024. PMID: 37992291 Free PMC article. - Host Transcriptome Analysis of Ferret Tissues Following Henipavirus Infection.
Zeng TS, Yang DS, Kelvin AA, Kelvin DJ. Zeng TS, et al. Methods Mol Biol. 2023;2682:281-299. doi: 10.1007/978-1-0716-3283-3_20. Methods Mol Biol. 2023. PMID: 37610589 - Accurate detection of benign and malignant renal tumor subtypes with MethylBoostER: An epigenetic marker-driven learning framework.
Rossi SH, Newsham I, Pita S, Brennan K, Park G, Smith CG, Lach RP, Mitchell T, Huang J, Babbage A, Warren AY, Leppert JT, Stewart GD, Gevaert O, Massie CE, Samarajiwa SA. Rossi SH, et al. Sci Adv. 2022 Sep 30;8(39):eabn9828. doi: 10.1126/sciadv.abn9828. Epub 2022 Sep 28. Sci Adv. 2022. PMID: 36170366 Free PMC article. - Positive Selection in Gene Regulatory Factors Suggests Adaptive Pleiotropic Changes During Human Evolution.
Jovanovic VM, Sarfert M, Reyna-Blanco CS, Indrischek H, Valdivia DI, Shelest E, Nowick K. Jovanovic VM, et al. Front Genet. 2021 May 17;12:662239. doi: 10.3389/fgene.2021.662239. eCollection 2021. Front Genet. 2021. PMID: 34079582 Free PMC article. - Novel Variance-Component TWAS method for studying complex human diseases with applications to Alzheimer's dementia.
Tang S, Buchman AS, De Jager PL, Bennett DA, Epstein MP, Yang J. Tang S, et al. PLoS Genet. 2021 Apr 2;17(4):e1009482. doi: 10.1371/journal.pgen.1009482. eCollection 2021 Apr. PLoS Genet. 2021. PMID: 33798195 Free PMC article.
References
- Weake VM, Workman JL. Inducible gene expression: diverse regulatory mechanisms. Nat. Rev. Genet. 2010;11:426–437. - PubMed
- Perissi V, Jepsen K, Glass CK, Rosenfeld MG. Deconstructing repression: evolving models of co-repressor action. Nat. Rev. Genet. 2010;11:109–123. - PubMed
- Thomas MC, Chiang CM. The general transcription machinery and general cofactors. Crit. Rev. Biochem. Mmol. Biol. 2006;41:105–178. - PubMed
- Mitchell PJ, Tjian R. Transcriptional regulation in mammalian cells by sequence-specific DNA binding proteins. Science. 1989;245:371–378. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources