BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data
O Gascuel. Mol Biol Evol. 1997 Jul.
Abstract
We propose an improved version of the neighbor-joining (NJ) algorithm of Saitou and Nei. This new algorithm, BIONJ, follows the same agglomerative scheme as NJ, which consists of iteratively picking a pair of taxa, creating a new node that represents the cluster of these taxa, and reducing the distance matrix by replacing both taxa with this node. In addition, BIONJ uses a simple first-order model of the variances and covariances of evolutionary distance estimates. This model is well suited to estimates obtained from aligned sequences. At each step, it permits the selection, from the class of admissible reductions, of the reduction that minimizes the variance of the new distance matrix. This yields better estimates for choosing the pair of taxa to be agglomerated in subsequent steps. Moreover, the advantage of these estimates over NJ's grows as the algorithm proceeds. BIONJ retains the good properties of NJ, especially its low run time. Computer simulations have been performed with 12-taxon model trees to determine BIONJ's efficiency. When the substitution rates are low (maximum pairwise divergence approximately 0.1 substitutions per site) or when they are constant among lineages, BIONJ is only slightly better than NJ. When the substitution rates are higher and vary among lineages, BIONJ clearly has better topological accuracy. In the latter case, for the model trees and the conditions of evolution tested, the topological error reduction is around 20% on average. With highly-varying-rate trees and with high substitution rates (maximum pairwise divergence approximately 1.0 substitutions per site), the error reduction may even rise above 50%, while the probability of finding the correct tree may increase by as much as 15%.
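The agglomerative scheme described in the abstract can be sketched in a few lines of Python. The abstract itself gives no formulas, so the update rules below are assumptions taken from the way BIONJ is commonly stated: variances are initialized to the distances (the first-order model), the pair is chosen with the usual NJ criterion, and the reduction weight `lam` is the value that minimizes the summed variance of the new distances, clipped to [0, 1]. The function name `bionj`, the internal node labels `u0`, `u1`, ... and the label `center` are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def bionj(names, dist):
    """Sketch of BIONJ. `dist` is a symmetric matrix (list of lists), len(names) >= 3.

    Returns a list of (child, parent, branch_length) edges of an unrooted tree.
    """
    nodes = list(names)
    D = {a: {b: float(dist[i][j]) for j, b in enumerate(names)}
         for i, a in enumerate(names)}
    # Assumed first-order model: variance of an estimate is proportional to it.
    V = {a: dict(row) for a, row in D.items()}
    edges = []
    next_id = 0

    while len(nodes) > 3:
        r = len(nodes)
        S = {a: sum(D[a][b] for b in nodes if b != a) for a in nodes}
        # Standard NJ pair-selection criterion (ties go to the first pair).
        i, j = min(combinations(nodes, 2),
                   key=lambda p: (r - 2) * D[p[0]][p[1]] - S[p[0]] - S[p[1]])
        u = f"u{next_id}"  # hypothetical label for the new internal node
        next_id += 1

        # Branch lengths from i and j to the new node, as in NJ.
        d_iu = 0.5 * D[i][j] + (S[i] - S[j]) / (2 * (r - 2))
        d_ju = D[i][j] - d_iu

        others = [k for k in nodes if k not in (i, j)]
        # Weight that minimizes the variance of the reduced matrix.
        if V[i][j] > 0:
            lam = 0.5 + sum(V[j][k] - V[i][k] for k in others) / (
                2 * (r - 2) * V[i][j])
            lam = min(1.0, max(0.0, lam))
        else:
            lam = 0.5

        # Reduce both matrices: replace i and j by u.
        D[u], V[u] = {u: 0.0}, {u: 0.0}
        for k in others:
            D[u][k] = D[k][u] = (lam * (D[i][k] - d_iu)
                                 + (1 - lam) * (D[j][k] - d_ju))
            V[u][k] = V[k][u] = (lam * V[i][k] + (1 - lam) * V[j][k]
                                 - lam * (1 - lam) * V[i][j])

        edges += [(i, u, d_iu), (j, u, d_ju)]
        nodes = others + [u]

    # Join the last three nodes to a central node with the three-point formulas.
    a, b, c = nodes
    edges += [
        (a, "center", (D[a][b] + D[a][c] - D[b][c]) / 2),
        (b, "center", (D[a][b] + D[b][c] - D[a][c]) / 2),
        (c, "center", (D[a][c] + D[b][c] - D[a][b]) / 2),
    ]
    return edges


# Additive distances for the tree ((A:1,B:2):3,C:1,D:2).
names = ["A", "B", "C", "D"]
dist = [[0, 3, 5, 6],
        [3, 0, 6, 7],
        [5, 6, 0, 3],
        [6, 7, 3, 0]]
tree = bionj(names, dist)
```

With `lam` fixed at 0.5 the reduction step would reduce to plain NJ, so the only difference in this sketch is the variance-driven weight and the extra matrix `V` that tracks it. On exactly additive distances such as the toy matrix above, both variants should recover the same tree; the gains reported in the abstract concern noisy distance estimates.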
Similar articles
- Fast NJ-like algorithms to deal with incomplete distance matrices.
Criscuolo A, Gascuel O. BMC Bioinformatics. 2008 Mar 26;9:166. doi: 10.1186/1471-2105-9-166. PMID: 18366787 Free PMC article.
- Improvement of distance-based phylogenetic methods by a local maximum likelihood approach using triplets.
Ranwez V, Gascuel O. Mol Biol Evol. 2002 Nov;19(11):1952-63. doi: 10.1093/oxfordjournals.molbev.a004019. PMID: 12411604
- Getting a tree fast: Neighbor Joining, FastME, and distance-based methods.
Desper R, Gascuel O. Curr Protoc Bioinformatics. 2006 Oct;Chapter 6:Unit 6.3. doi: 10.1002/0471250953.bi0603s15. PMID: 18428768
- A stepwise algorithm for finding minimum evolution trees.
Kumar S. Mol Biol Evol. 1996 Apr;13(4):584-93. doi: 10.1093/oxfordjournals.molbev.a025618. PMID: 8882501
- Neighbor-joining revealed.
Gascuel O, Steel M. Mol Biol Evol. 2006 Nov;23(11):1997-2000. doi: 10.1093/molbev/msl072. Epub 2006 Jul 28. PMID: 16877499 Review.
Cited by
- Computational study of β-N-acetylhexosaminidase from Talaromyces flavus, a glycosidase with high substrate flexibility.
Kulik N, Slámová K, Ettrich R, Křen V. BMC Bioinformatics. 2015 Jan 28;16:28. doi: 10.1186/s12859-015-0465-8. PMID: 25627923 Free PMC article.
- Gene invasion in distant eukaryotic lineages: discovery of mutually exclusive genetic elements reveals marine biodiversity.
Monier A, Sudek S, Fast NM, Worden AZ. ISME J. 2013 Sep;7(9):1764-74. doi: 10.1038/ismej.2013.70. Epub 2013 May 2. PMID: 23635865 Free PMC article.
- Ants of French Guiana: 16S rRNA sequence dataset.
Rongier G, Sagne A, Etienne S, Petitclerc F, Jaouen G, Murienne J, Orivel J. Biodivers Data J. 2023 Feb 7;11:e91577. doi: 10.3897/BDJ.11.e91577. eCollection 2023. PMID: 38327367 Free PMC article.
- The wtf meiotic driver gene family has unexpectedly persisted for over 100 million years.
De Carvalho M, Jia GS, Nidamangala Srinivasa A, Billmyre RB, Xu YH, Lange JJ, Sabbarini IM, Du LL, Zanders SE. Elife. 2022 Oct 13;11:e81149. doi: 10.7554/eLife.81149. PMID: 36227631 Free PMC article.
- A Comprehensive Investigation of Potential Novel Marine Psychrotolerant Actinomycetes sp. Isolated from the Bay-of-Bengal.
Ghosh M, Gera M, Singh J, Prasad R, Pulicherla KK. Curr Genomics. 2020 May;21(4):271-282. doi: 10.2174/1389202921666200330150642. PMID: 33071620 Free PMC article.