Whole proteome prokaryote phylogeny without sequence alignment: a K-string composition approach - PubMed (original) (raw)

Comparative Study

Whole proteome prokaryote phylogeny without sequence alignment: a K-string composition approach

Ji Qi et al. J Mol Evol. 2004 Jan.

Abstract

A systematic way of inferring evolutionary relatedness of microbial organisms from the oligopeptide content, i.e., frequency of amino acid K-strings in their complete proteomes, is proposed. The new method circumvents the ambiguity of choosing the genes for phylogenetic reconstruction and avoids the necessity of aligning sequences of essentially different length and gene content. The only "parameter" in the method is the length K of the oligopeptides, which serves to tune the "resolution power" of the method. The topology of the trees converges with K increasing. Applied to a total of 109 organisms, including 16 Archaea, 87 Bacteria, and 6 Eukarya, it yields an unrooted tree that agrees with the biologists' "tree of life" based on SSU rRNA comparison in a majority of basic branchings, and especially, in all lower taxa.

PubMed Disclaimer

Similar articles

Cited by

References

    1. Mol Biol Evol. 1987 Jul;4(4):406-25 - PubMed
    1. Nature. 1997 Aug 7;388(6642):539-47 - PubMed
    1. J Biomol Struct Dyn. 1986 Aug;4(1):11-21 - PubMed
    1. Science. 1999 May 21;284(5418):1305-7 - PubMed
    1. Nucleic Acids Res. 1999 Nov 1;27(21):4218-22 - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources