Prediction of novel families of enzymes involved in oxidative and other complex modifications of bases in nucleic acids - PubMed (original) (raw)
Prediction of novel families of enzymes involved in oxidative and other complex modifications of bases in nucleic acids
Lakshminarayan M Iyer et al. Cell Cycle. 2009.
Abstract
Modified bases in nucleic acids present a layer of information that directs biological function over and beyond the coding capacity of the conventional bases. While a large number of modified bases have been identified, many of the enzymes generating them still remain to be discovered. Recently, members of the 2-oxoglutarate- and iron(II)-dependent dioxygenase super-family, which modify diverse substrates from small molecules to biopolymers, were predicted and subsequently confirmed to catalyze oxidative modification of bases in nucleic acids. Of these, two distinct families, namely the AlkB and the kinetoplastid base J binding proteins (JBP) catalyze in situ hydroxylation of bases in nucleic acids. Using sensitive computational analysis of sequences, structures and contextual information from genomic structure and protein domain architectures, we report five distinct families of 2-oxoglutarate- and iron(II)-dependent dioxygenase that we predict to be involved in nucleic acid modifications. Among the DNA-modifying families, we show that the dioxygenase domains of the kinetoplastid base J-binding proteins belong to a larger family that includes the Tet proteins, prototyped by the human oncogene Tet1, and proteins from basidiomycete fungi, chlorophyte algae, heterolobosean amoeboflagellates and bacteriophages. We present evidence that some of these proteins are likely to be involved in oxidative modification of the 5-methyl group of cytosine leading to the formation of 5-hydroxymethylcytosine. The Tet/JBP homologs from basidiomycete fungi such as Laccaria and Coprinopsis show large lineage-specific expansions and a tight linkage with genes encoding a novel and distinct family of predicted transposases, and a member of the Maelstrom-like HMG family. We propose that these fungal members are part of a mobile transposon. To the best of our knowledge, this is the first report of a eukaryotic transposable element that encodes its own DNA-modification enzyme with a potential regulatory role. Through a wider analysis of other poorly characterized DNA-modifying enzymes we also show that the phage Mu Mom-like proteins, which catalyze the N6-carbamoylmethylation of adenines, are also linked to diverse families of bacterial transposases, suggesting that DNA modification by transposable elements might have a more general presence than previously appreciated. Among the other families of 2-oxoglutarate- and iron(II)-dependent dioxygenases identified in this study, one which is found in algae, is predicted to mainly comprise of RNA-modifying enzymes and shows a striking diversity in protein domain architectures suggesting the presence of RNA modifications with possibly unique adaptive roles. The results presented here are likely to provide the means for future investigation of unexpected epigenetic modifications, such as hydroxymethyl cytosine, that could profoundly impact our understanding of gene regulation and processes such as DNA demethylation.
Figures
Figure 1
Multiple alignment of selected examples of the newly predicted families of the nucleic-acid-modifying 2-oxoglutarate- and iron(II)-dependent dioxygenase superfamily. Protein sequences are represented by their gene names, species names and GenBank index numbers (where available). Temporary gene names were assigned for predicted proteins from Naegleria, Aureococcus, Daphnia and Micromonas. The full length protein sequences from these are available in the Supplementary material. The coloring scheme and consensus abbreviations are shown in the key. Family names are shown to the right of the alignment. The distinct inserts of the TET/JBP and the AlkB families are shown within boxes. The key conserved residues defining the 2OGFeDO protein have been marked below the alignment. The consensus secondary structure derived from crystal structures of characterized members of the superfamily is shown above.
Figure 2
Representative domain architectures of the newly identified versions of nucleic-acid-modifying 2OGFeDO proteins. Architectures are arranged by their phylogenetic and family affinities, and are labeled by their gene and species names. Domain architectures within a family are boxed. Domains are typically denoted by their standard names. Non-standard domain nomenclatures are clarified in the inset key at the bottom of the figure. Operons are shown as arrows where the arrow head points from the 5′ to the 3′ direction of the coding frame of the gene. Gene neighborhoods are labeled by the gene coding for the 2OGFeDO protein.
Figure 3
Genomic organization and domain architectures of predicted transposons encoding DNA-modifying enzymes. Genes are depicted as arrows with the arrow head pointing from the 5′ to the 3′ direction of the coding sequence. Gene neighborhoods of the predicted transposase are typically labeled with the gene name of the 2OGFeDO containing protein, the species name and the gi number. In potential fragmentary elements where the 2OGFeDO is absent, the gene neighborhood is labeled with the gene name of the predicted transposase-containing gene. The key at the bottom of the figure explains non-standard domain and gene names, while other gene and domains names are as commonly used in literature.
Figure 4
Multiple alignment of the proposed catalytic domain of the transposase of the novel predicted transposon encoding 2OGFeDO proteins. Protein sequences are represented by their gene names, species names and GenBank index numbers. The predicted secondary structure is shown above the alignment. The coloring scheme and consensus abbreviations are shown in the key. Conserved residues defining the catalytic site of the predicted transposase are marked below the alignment.
Similar articles
- Computational identification of novel biochemical systems involved in oxidation, glycosylation and other complex modifications of bases in DNA.
Iyer LM, Zhang D, Burroughs AM, Aravind L. Iyer LM, et al. Nucleic Acids Res. 2013 Sep;41(16):7635-55. doi: 10.1093/nar/gkt573. Epub 2013 Jun 28. Nucleic Acids Res. 2013. PMID: 23814188 Free PMC article. - Lineage-specific expansions of TET/JBP genes and a new class of DNA transposons shape fungal genomic and epigenetic landscapes.
Iyer LM, Zhang D, de Souza RF, Pukkila PJ, Rao A, Aravind L. Iyer LM, et al. Proc Natl Acad Sci U S A. 2014 Feb 4;111(5):1676-83. doi: 10.1073/pnas.1321818111. Epub 2014 Jan 7. Proc Natl Acad Sci U S A. 2014. PMID: 24398522 Free PMC article. - Natural history of eukaryotic DNA methylation systems.
Iyer LM, Abhiman S, Aravind L. Iyer LM, et al. Prog Mol Biol Transl Sci. 2011;101:25-104. doi: 10.1016/B978-0-12-387685-0.00002-0. Prog Mol Biol Transl Sci. 2011. PMID: 21507349 - Introduction: Metals in Biology: α-Ketoglutarate/Iron-Dependent Dioxygenases.
Guengerich FP. Guengerich FP. J Biol Chem. 2015 Aug 21;290(34):20700-20701. doi: 10.1074/jbc.R115.675652. Epub 2015 Jul 7. J Biol Chem. 2015. PMID: 26152720 Free PMC article. Review. - ALKBHs-facilitated RNA modifications and de-modifications.
A Alemu E, He C, Klungland A. A Alemu E, et al. DNA Repair (Amst). 2016 Aug;44:87-91. doi: 10.1016/j.dnarep.2016.05.026. Epub 2016 May 17. DNA Repair (Amst). 2016. PMID: 27237585 Free PMC article. Review.
Cited by
- DNMT3A and TET2 in the Pre-Leukemic Phase of Hematopoietic Disorders.
Sato H, Wheat JC, Steidl U, Ito K. Sato H, et al. Front Oncol. 2016 Aug 22;6:187. doi: 10.3389/fonc.2016.00187. eCollection 2016. Front Oncol. 2016. PMID: 27597933 Free PMC article. Review. - Sensitive enzymatic quantification of 5-hydroxymethylcytosine in genomic DNA.
Szwagierczak A, Bultmann S, Schmidt CS, Spada F, Leonhardt H. Szwagierczak A, et al. Nucleic Acids Res. 2010 Oct;38(19):e181. doi: 10.1093/nar/gkq684. Epub 2010 Aug 4. Nucleic Acids Res. 2010. PMID: 20685817 Free PMC article. - Impaired hydroxylation of 5-methylcytosine in myeloid cancers with mutant TET2.
Ko M, Huang Y, Jankowska AM, Pape UJ, Tahiliani M, Bandukwala HS, An J, Lamperti ED, Koh KP, Ganetzky R, Liu XS, Aravind L, Agarwal S, Maciejewski JP, Rao A. Ko M, et al. Nature. 2010 Dec 9;468(7325):839-43. doi: 10.1038/nature09586. Nature. 2010. PMID: 21057493 Free PMC article. - Reversing DNA methylation: mechanisms, genomics, and biological functions.
Wu H, Zhang Y. Wu H, et al. Cell. 2014 Jan 16;156(1-2):45-68. doi: 10.1016/j.cell.2013.12.019. Cell. 2014. PMID: 24439369 Free PMC article. - Epigenetic dynamics in cancer stem cell dormancy.
Ferrer AI, Trinidad JR, Sandiford O, Etchegaray JP, Rameshwar P. Ferrer AI, et al. Cancer Metastasis Rev. 2020 Sep;39(3):721-738. doi: 10.1007/s10555-020-09882-x. Cancer Metastasis Rev. 2020. PMID: 32394305 Review.
References
- Bloomfield VA, Crothers DM, Tinoco I., Jr . Nucleic Acids: Structures, Properties and Functions. Sausalito, CA: University Science Books; 2000.
- Pfeifer GP. Mutagenesis at methylated CpG sequences. Curr Top Microbiol Immunol. 2006;301:259–81. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- R01 AI044432-11/AI/NIAID NIH HHS/United States
- R37 CA042471-19/CA/NCI NIH HHS/United States
- R01 AI044432/AI/NIAID NIH HHS/United States
- RC1 DA028422-01/DA/NIDA NIH HHS/United States
- R37 CA042471-20/CA/NCI NIH HHS/United States
- R37 CA042471/CA/NCI NIH HHS/United States
- RC1 DA028422/DA/NIDA NIH HHS/United States
- R37 CA042471-21/CA/NCI NIH HHS/United States
- R01 AI044432-12/AI/NIAID NIH HHS/United States
- R01 AI044432-13/AI/NIAID NIH HHS/United States
- ImNIH/Intramural NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials