Weighted genome trees: refinements and applications - PubMed (original) (raw)
Weighted genome trees: refinements and applications
Uri Gophna et al. J Bacteriol. 2005 Feb.
Abstract
There are many ways to group completed genome sequences in hierarchical patterns (trees) reflecting relationships between their genes. Such groupings help us organize biological information and bear crucially on underlying processes of genome and organismal evolution. Genome trees make use of all comparable genes but can variously weight the contributions of these genes according to similarity, congruent patterns of similarity, or prevalence among genomes. Here we explore such possible weighting strategies, in an analysis of 142 prokaryotic and 5 eukaryotic genomes. We demonstrate that alternate weighting strategies have different advantages, and we propose that each may have its specific uses in systematic or evolutionary biology. Comparisons of results obtained with different methods can provide further clues to major events and processes in genome evolution.
Figures
FIG.1.
Fitch-Margoliash tree based on conceptually translated complete genomic ORF sets, with equal weighting of all genes shared between a pair of genomes (see text for details). Bootstrap support (percentage of 20 replicates) was determined in a separate analysis by strict consensus. Where support is not shown, it is 100%. Where a node is marked with an empty circle, support is less than 50%. Taxa of particular interest are marked with an arrow. Whenever different strains of a single species form a single clade, they have been united to a single branch and the number of strains is given in parentheses as follows: Tropheryma whipplei includes T. whipplei Twist and TW0827; Mycobacterium tuberculosis/bovis includes M. bovis, M. tuberculosis H37Rv, and M. tuberculosis CDC1551; Agrobacterium tumefaciens includes A. tumefaciens C58 from both the Cereon and Dupont genomes; Vibrio vulnificus includes V. vulnificus strains CMCP6 and YJ016; Salmonella enterica Typhi includes S. enterica serovar Typhi strains CT18 and Ty2; Escherichia coli/Shigella flexneri includes E. coli strains O157:H7 EDL933, O157:H7, CFT073, and K-12 and S. flexneri strains 2a 2457T and 2a 301; Xylella fastidiosa includes X. fastidiosa strains 9a5c and Temecula1; Neisseria meningitidis includes strains MC58 and Z2491; Helicobacter pylori includes strains 26695 and J99; Streptococcus pneumoniae includes strains R6 and TIGR4; Streptococcus pyogenes includes strains MGAS315, SSI1, SF370, and MGAS8232; Streptococcus agalactiae includes strains 2603VR and NEM316; Staphylococcus aureus includes strains N315, Mu50, and MW2; and Chlamydophila pneumoniae includes strains AR39, TW183, J138, and CWL029. Branch end points for these species correspond to the former root of the clade of strains. For the full trees, which include all strains, see the supplemental material.
FIG. 2.
Fitch-Margoliash tree, as in Fig. 1, but excluding genes deemed phylogenetically discordant (8), at an alpha threshold of 5%.
FIG. 3.
Fitch-Margoliash tree, as in Fig. 1, but with preferential weighting of genes in the distance matrix according to their phylogenetic concordance (8) relative to the bulk of genes in the genome.
FIG. 4.
Fitch-Margoliash tree, as in Fig. 1, but with preferential weighting of genes in the distance matrix according to their phylogenetic discordance (8) relative to the bulk of genes in the genome.
FIG. 5.
Fitch-Margoliash tree, as in Fig. 1, but with preferential weighting of genes in the distance matrix according to their prevalence within the set of 147 genomes.
FIG. 6.
Fitch-Margoliash tree, as in Fig. 1, but with preferential weighting of genes in the distance matrix according to their rarity within the set of 147 genomes.
Similar articles
- Genome trees constructed using five different approaches suggest new major bacterial clades.
Wolf YI, Rogozin IB, Grishin NV, Tatusov RL, Koonin EV. Wolf YI, et al. BMC Evol Biol. 2001 Oct 20;1:8. doi: 10.1186/1471-2148-1-8. BMC Evol Biol. 2001. PMID: 11734060 Free PMC article. - The BISMiS 2011 special issue on prokaryotic systematics, a vital discipline entering a period of transition.
Sutcliffe IC, Trujillo ME, Goodfellow M. Sutcliffe IC, et al. Antonie Van Leeuwenhoek. 2012 Jan;101(1):1-2. doi: 10.1007/s10482-011-9674-y. Epub 2011 Nov 12. Antonie Van Leeuwenhoek. 2012. PMID: 22080412 No abstract available. - Bacterial Genes Outnumber Archaeal Genes in Eukaryotic Genomes.
Brueckner J, Martin WF. Brueckner J, et al. Genome Biol Evol. 2020 Apr 1;12(4):282-292. doi: 10.1093/gbe/evaa047. Genome Biol Evol. 2020. PMID: 32142116 Free PMC article. - En route to a genome-based classification of Archaea and Bacteria?
Klenk HP, Göker M. Klenk HP, et al. Syst Appl Microbiol. 2010 Jun;33(4):175-82. doi: 10.1016/j.syapm.2010.03.003. Epub 2010 Apr 20. Syst Appl Microbiol. 2010. PMID: 20409658 Review. - Composition vector approach to whole-genome-based prokaryotic phylogeny: success and foundations.
Li Q, Xu Z, Hao B. Li Q, et al. J Biotechnol. 2010 Sep 1;149(3):115-9. doi: 10.1016/j.jbiotec.2009.12.015. Epub 2009 Dec 29. J Biotechnol. 2010. PMID: 20036699 Review.
Cited by
- Comparative analyses of whole-genome protein sequences from multiple organisms.
Yokono M, Satoh S, Tanaka A. Yokono M, et al. Sci Rep. 2018 May 1;8(1):6800. doi: 10.1038/s41598-018-25090-8. Sci Rep. 2018. PMID: 29717164 Free PMC article. - Horizontal Gene Transfer Building Prokaryote Genomes: Genes Related to Exchange Between Cell and Environment are Frequently Transferred.
Paquola ACM, Asif H, Pereira CAB, Feltes BC, Bonatto D, Lima WC, Menck CFM. Paquola ACM, et al. J Mol Evol. 2018 Apr;86(3-4):190-203. doi: 10.1007/s00239-018-9836-x. Epub 2018 Mar 19. J Mol Evol. 2018. PMID: 29556740 - Ordered orthology as a tool in prokaryotic evolutionary inference.
Snir S. Snir S. Mob Genet Elements. 2015 Dec 30;6(6):e1120576. doi: 10.1080/2159256X.2015.1120576. eCollection 2016. Mob Genet Elements. 2015. PMID: 28090377 Free PMC article. Review. - No evidence of inhibition of horizontal gene transfer by CRISPR-Cas on evolutionary timescales.
Gophna U, Kristensen DM, Wolf YI, Popa O, Drevet C, Koonin EV. Gophna U, et al. ISME J. 2015 Sep;9(9):2021-7. doi: 10.1038/ismej.2015.20. Epub 2015 Feb 24. ISME J. 2015. PMID: 25710183 Free PMC article. - Phylo SI: a new genome-wide approach for prokaryotic phylogeny.
Shifman A, Ninyo N, Gophna U, Snir S. Shifman A, et al. Nucleic Acids Res. 2014 Feb;42(4):2391-404. doi: 10.1093/nar/gkt1138. Epub 2013 Nov 15. Nucleic Acids Res. 2014. PMID: 24243847 Free PMC article.
References
- Brochier, C., E. Bapteste, D. Moreira, and H. Philippe. 2002. Eubacterial phylogeny based on translational apparatus proteins. Trends Genet. 18:1-5. - PubMed
- Bruno, W. J., N. D. Socci, and A. L. Halpern. 2000. Weighted neighbor joining: a likelihood-based approach to distance-based phylogeny reconstruction. Mol. Biol. Evol. 17:189-197. - PubMed
- Cavalier-Smith, T. 2002. The neomuran origin of archaebacteria, the negibacterial root of the universal tree and bacterial megaclassification. Int. J. Syst. Evol. Microbiol. 52:7-76. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources