Bootstrap confidence levels for phylogenetic trees - PubMed (original) (raw)
Bootstrap confidence levels for phylogenetic trees
B Efron et al. Proc Natl Acad Sci U S A. 1996.
Abstract
Evolutionary trees are often estimated from DNA or RNA sequence data. How much confidence should we have in the estimated trees? In 1985, Felsenstein [Felsenstein, J. (1985) Evolution 39, 783-791] suggested the use of the bootstrap to answer this question. Felsenstein's method, which in concept is a straightforward application of the bootstrap, is widely used, but has been criticized as biased in the genetics literature. This paper concerns the use of the bootstrap in the tree problem. We show that Felsenstein's method is not biased, but that it can be corrected to better agree with standard ideas of confidence levels and hypothesis testing. These corrections can be made by using the more elaborate bootstrap method presented here, at the expense of considerably more computation.
Figures
Figure 1
Part of the data matrix of aligned nucleotide sequences for the malaria parasite Plasmodium. Shown are the first 20 columns of the 11 × 221 matrix x of polytypic sites used in most of the analyses below. The final analysis of the last section also uses the data from 1399 monotypic sites.
Figure 2
Phylogenetic tree based on the malaria data matrix; species are numbered as in Fig. 1. The numbers at the branches are confidence values based on Felsenstein’s bootstrap method.B = 200 bootstrap replications.
Figure 3
Schematic diagram of tree estimation; triangle represents the space of all possible ˜π vectors in the multinomial probability model; regions ℛ1, ℛ2. . . correspond to the different possible trees. In the case shown ˜π and ˜π̂ lie in the same region so TREE =, but ˜π̂* lies in a region where* does not have the 9-10 clade.
Figure 4
Two cases of the simple normal model; in both we observe μ̂ = (4.5, 0) ∈ ℛ1, and wish to assign a confidence value to μ ∈ ℛ1. Case I, ℛ2 is the region {μ1 ≤ 3}. Case II, ℛ2 is the region {∥μ∥ < 3}. The dashed circles indicate bootstrap samplingμ̂* ∼_N_2(μ̂, I).
Figure 5
Confidence levels of the two cases in Fig. 4;μ̂0 = (3, 0) is the closest point toμ̂ = (4.5, 0) on the boundary separating ℛ1 from ℛ2; bootstrap vectorμ̂** ∼_N_2(μ̂0, I). The confidence level α̂ is the probability thatμ̂** is closer than μ̂ to the boundary.
Corrected and republished from
- Bootstrap confidence levels for phylogenetic trees.
Efron B, Halloran E, Holmes S. Efron B, et al. Proc Natl Acad Sci U S A. 1996 Jul 9;93(14):7085-90. doi: 10.1073/pnas.93.14.7085. Proc Natl Acad Sci U S A. 1996. PMID: 8692949 Free PMC article. Corrected and republished.
Similar articles
- Bootstrap confidence levels for phylogenetic trees.
Efron B, Halloran E, Holmes S. Efron B, et al. Proc Natl Acad Sci U S A. 1996 Jul 9;93(14):7085-90. doi: 10.1073/pnas.93.14.7085. Proc Natl Acad Sci U S A. 1996. PMID: 8692949 Free PMC article. Corrected and republished. - Felsenstein Phylogenetic Likelihood.
Posada D, Crandall KA. Posada D, et al. J Mol Evol. 2021 Apr;89(3):134-145. doi: 10.1007/s00239-020-09982-w. Epub 2021 Jan 13. J Mol Evol. 2021. PMID: 33438113 Free PMC article. - Renewing Felsenstein's phylogenetic bootstrap in the era of big data.
Lemoine F, Domelevo Entfellner JB, Wilkinson E, Correia D, Dávila Felipe M, De Oliveira T, Gascuel O. Lemoine F, et al. Nature. 2018 Apr;556(7702):452-456. doi: 10.1038/s41586-018-0043-0. Epub 2018 Apr 18. Nature. 2018. PMID: 29670290 Free PMC article. - Bayesian coestimation of phylogeny and sequence alignment.
Lunter G, Miklós I, Drummond A, Jensen JL, Hein J. Lunter G, et al. BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83. BMC Bioinformatics. 2005. PMID: 15804354 Free PMC article. - Tree disagreement: measuring and testing incongruence in phylogenies.
Planet PJ. Planet PJ. J Biomed Inform. 2006 Feb;39(1):86-102. doi: 10.1016/j.jbi.2005.08.008. Epub 2005 Sep 28. J Biomed Inform. 2006. PMID: 16243006 Review.
Cited by
- Estimating the mean in the space of ranked phylogenetic trees.
Berling L, Collienne L, Gavryushkin A. Berling L, et al. Bioinformatics. 2024 Aug 2;40(8):btae514. doi: 10.1093/bioinformatics/btae514. Bioinformatics. 2024. PMID: 39177090 Free PMC article. - Single-plant-omics reveals the cascade of transcriptional changes during the vegetative-to-reproductive transition.
Redmond EJ, Ronald J, Davis SJ, Ezer D. Redmond EJ, et al. Plant Cell. 2024 Oct 3;36(10):4594-4606. doi: 10.1093/plcell/koae226. Plant Cell. 2024. PMID: 39121073 Free PMC article. - A machine-learning-based alternative to phylogenetic bootstrap.
Ecker N, Huchon D, Mansour Y, Mayrose I, Pupko T. Ecker N, et al. Bioinformatics. 2024 Jun 28;40(Suppl 1):i208-i217. doi: 10.1093/bioinformatics/btae255. Bioinformatics. 2024. PMID: 38940166 Free PMC article. - Phylogenomic curation of Ovate Family Proteins (OFPs) in the U's Triangle of Brassica L. indicates stress-induced growth modulation.
Shahzaib M, Khan UM, Azhar MT, Atif RM, Khan SH, Zaman QU, Rana IA. Shahzaib M, et al. PLoS One. 2024 Jan 26;19(1):e0297473. doi: 10.1371/journal.pone.0297473. eCollection 2024. PLoS One. 2024. PMID: 38277374 Free PMC article. - Genetic Variation and Evolutionary Analysis of Eggplant Mottled Dwarf Virus Isolates from Spain.
Alfaro-Fernández A, Taengua R, Font-San-Ambrosio I, Sanahuja-Edo E, Peiró R, Galipienso L, Rubio L. Alfaro-Fernández A, et al. Plants (Basel). 2024 Jan 16;13(2):250. doi: 10.3390/plants13020250. Plants (Basel). 2024. PMID: 38256804 Free PMC article.
References
- Efron B, Tibshirani R. An Introduction to the Bootstrap. London: Chapman & Hall; 1993.
- Felsenstein J. Evolution. 1985;39:783–791. - PubMed
- Hillis D, Bull J. Syst Biol. 1993;42:182–192.
- Felsenstein J, Kishino H. Syst Biol. 1993;42:193–200.
- Newton M A. Biometrika. 1996;83:315–328.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources