CRITICA: coding region identification tool invoking comparative analysis - PubMed (original) (raw)

Comparative Study

CRITICA: coding region identification tool invoking comparative analysis

J H Badger et al. Mol Biol Evol. 1999 Apr.

Abstract

Gene recognition is essential to understanding existing and future DNA sequence data. CRITICA (Coding Region Identification Tool Invoking Comparative Analysis) is a suite of programs for identifying likely protein-coding sequences in DNA by combining comparative analysis of DNA sequences with more common noncomparative methods. In the comparative component of the analysis, regions of DNA are aligned with related sequences from the DNA databases; if the translation of the aligned sequences has greater amino acid identity than expected for the observed percentage nucleotide identity, this is interpreted as evidence for coding. CRITICA also incorporates noncomparative information derived from the relative frequencies of hexanucleotides in coding frames versus other contexts (i.e., dicodon bias). The dicodon usage information is derived by iterative analysis of the data, such that CRITICA is not dependent on the existence or accuracy of coding sequence annotations in the databases. This independence makes the method particularly well suited for the analysis of novel genomes. CRITICA was tested by analyzing the available Salmonella typhimurium DNA sequences. Its predictions were compared with the DNA sequence annotations and with the predictions of GenMark. CRITICA proved to be more accurate than GenMark, and moreover, many of its predictions that would seem to be errors instead reflect problems in the sequence databases. The source code of CRITICA is freely available by anonymous FTP (rdp.life.uiuc.edu in/pub/critica) and on the World Wide Web (http:/(/)rdpwww.life.uiuc.edu).

PubMed Disclaimer

Cited by

Long non-coding RNA-encoded micropeptides: functions, mechanisms and implications.
Xiao Y, Ren Y, Hu W, Paliouras AR, Zhang W, Zhong L, Yang K, Su L, Wang P, Li Y, Ma M, Shi L. Xiao Y, et al. Cell Death Discov. 2024 Oct 23;10(1):450. doi: 10.1038/s41420-024-02175-0. Cell Death Discov. 2024. PMID: 39443468 Free PMC article. Review.
The absence of canonical respiratory complex I subunits in male-type mitogenomes of three Donax species.
Burzyński A, Śmietanka B, Fernández-Pérez J, Lubośny M. Burzyński A, et al. Sci Rep. 2024 Jun 24;14(1):14465. doi: 10.1038/s41598-024-63764-8. Sci Rep. 2024. PMID: 38914611 Free PMC article.
Isolation, characterisation and description of the roseoflavin producer Streptomyces berlinensis sp. nov.
Liunardo JJ, Messerli S, Gregotsch AK, Lang S, Schlosser K, Rückert-Reed C, Busche T, Kalinowski J, Zischka M, Weller P, Nouioui I, Neumann-Schaal M, Risdian C, Wink J, Mack M. Liunardo JJ, et al. Environ Microbiol Rep. 2024 Apr;16(2):e13266. doi: 10.1111/1758-2229.13266. Environ Microbiol Rep. 2024. PMID: 38653477 Free PMC article.
Small Open Reading Frame-Encoded Micro-Peptides: An Emerging Protein World.
Dong X, Zhang K, Xun C, Chu T, Liang S, Zeng Y, Liu Z. Dong X, et al. Int J Mol Sci. 2023 Jun 23;24(13):10562. doi: 10.3390/ijms241310562. Int J Mol Sci. 2023. PMID: 37445739 Free PMC article. Review.
Shining a light on the dark proteome: Non-canonical open reading frames and their encoded miniproteins as a new frontier in cancer biology.
Posner Z, Yannuzzi I, Prensner JR. Posner Z, et al. Protein Sci. 2023 Aug;32(8):e4708. doi: 10.1002/pro.4708. Protein Sci. 2023. PMID: 37350227 Free PMC article. Review.

CRITICA: coding region identification tool invoking comparative analysis - PubMed (original) (raw)