Phylogenomics provides robust support for a two-domains tree of life - PubMed (original) (raw)
Phylogenomics provides robust support for a two-domains tree of life
Tom A Williams et al. Nat Ecol Evol. 2020 Jan.
Erratum in
- Author Correction: Phylogenomics provides robust support for a two-domains tree of life.
Williams TA, Cox CJ, Foster PG, Szöllősi GJ, Embley TM. Williams TA, et al. Nat Ecol Evol. 2020 Nov;4(11):1568. doi: 10.1038/s41559-020-01347-2. Nat Ecol Evol. 2020. PMID: 33077932
Abstract
Hypotheses about the origin of eukaryotic cells are classically framed within the context of a universal 'tree of life' based on conserved core genes. Vigorous ongoing debate about eukaryote origins is based on assertions that the topology of the tree of life depends on the taxa included and the choice and quality of genomic data analysed. Here we have reanalysed the evidence underpinning those claims and apply more data to the question by using supertree and coalescent methods to interrogate >3,000 gene families in archaea and eukaryotes. We find that eukaryotes consistently originate from within the archaea in a two-domains tree when due consideration is given to the fit between model and data. Our analyses support a close relationship between eukaryotes and Asgard archaea and identify the Heimdallarchaeota as the current best candidate for the closest archaeal relatives of the eukaryotic nuclear lineage.
Conflict of interest statement
Competing interests statement: The authors declare they have no competing interests.
Figures
Figure 1. The 35-gene matrix of Da Cunha et al. favours a two-domains tree using the best-fitting models in both maximum likelihood and Bayesian analyses.
The eukaryotes (green) group with the sampled Asgard archaea (orange) with maximum posterior support. Bacteria are in grey, TACK Archaea in yellow, Euryarchaeota in blue. This is a consensus tree inferred under the CAT+GTR+G4 model in PhyloBayes-MPI; branch lengths are proportional to the expected number of substitutions per site, as indicated by the scale bar. A 2D topology was obtained under a variety of other models in ML analyses (LG+G4+F, LG+PMSF+G4, LG+C60+G4+F; Supplementary Figure 1), and also with 4-state Susko-Roger recoding under the CAT+GTR+G4 and NDCH2 models (Supplementary Figure 2).
Figure 2. Evidence that the three-domains tree is an artifact of long branch attraction.
(a) Da Cunha et al. analysed a dataset of 35 core protein-coding genes under the LG+G4+F model and obtained a 3D tree; the better-fitting (Supplementary Table 4) CAT+GTR+G4 model recovers a 2D tree. Bootstrap support (a) and Bayesian posterior probability (b) are indicated for the key nodes defining the 3D and 2D trees. “Asgard” refers to a clade of Heimdallarchaeota and Lokiarchaeum. Plotting these trees to the same scale (in terms of substitutions per site) illustrates major differences in these analyses. The 3D/LG+G4+F analysis suggests that, on average, 30.77 changes have taken place per site; the two-domains/CAT+GTR+G4 analysis suggests that 47.4 changes per site have occurred. This difference amounts to ~128,511 additional substitutions in total inferred under the CAT+GTR+G4 model. (b) Posterior predictive tests indicate that CAT+GTR+G4 performs significantly better than LG+G4+F in capturing the site-specific evolutionary constraints reflected by lower biochemical diversity approaching that of the empirical data). This results in more realistic estimates of substitutional saturation and convergence found in the data. The longest branches on both the 3D and 2D tree are the stems leading to the bacteria and eukaryotes (in blue and green, respectively). CAT+GTR+G4 identifies many more convergent substitutions on these branches than does LG+G4+F, as can be seen by comparing the branch lengths in (a). This failure to detect convergent substitutions under LG+G4+F has the effect of drawing the bacterial and eukaryotic branches together, because convergences are mistaken for homologies (synapomorphies), resulting in a 3D tree.
Figure 3. An expanded sampling of microbial diversity supports a two-domains tree.
(a) Bayesian phylogeny of 21 concatenated proteins conserved across bacteria, archaea and eukaryotes under the CAT+GTR+G4 model, rooted on the branch separating bacteria and archaea. Eukaryotes group with Asgard archaea with maximum posterior support. (b) Bayesian phylogeny of 43 genes conserved between Archaea and eukaryotes under CAT+GTR+G4. Eukaryotes group with, or within, Heimdallarchaeota. All support values are Bayesian posterior probabilities, and branch lengths are proportional to the expected number of substitutions per site, as indicated by the scale bars. The Euryarchaeota are paraphyletic in the consensus tree in (a), consistent with some recent analyses using bacterial outgroups,, although the relevant support values are low and the analysis does not robustly exclude the alternative hypothesis of a monophyletic Euryarchaeota. The tree in (b) is formally unrooted because it does not include a bacterial outgroup. Based on (a) and published analyses,, the root may lie between the Euryarchaeota and the other taxa, or within the Euryarchaeota. Amino acid data were recoded using the 4-state scheme of Susko and Roger, which our posterior predictive simulations (Supplementary Table 7) suggest improved model fit by ameliorating substitutional saturation and compositional heterogeneity; phylogenies inferred on the original amino acid data are provided in Supplementary Figure 7.
Comment in
- Evolutionary relationships between archaea and eukaryotes.
Gribaldo S, Brochier-Armanet C. Gribaldo S, et al. Nat Ecol Evol. 2020 Jan;4(1):20-21. doi: 10.1038/s41559-019-1073-1. Nat Ecol Evol. 2020. PMID: 31836857 No abstract available. - Evolution: Two Domains of Life or Three?
Doolittle WF. Doolittle WF. Curr Biol. 2020 Feb 24;30(4):R177-R179. doi: 10.1016/j.cub.2020.01.010. Curr Biol. 2020. PMID: 32097647
Similar articles
- Inference and reconstruction of the heimdallarchaeial ancestry of eukaryotes.
Eme L, Tamarit D, Caceres EF, Stairs CW, De Anda V, Schön ME, Seitz KW, Dombrowski N, Lewis WH, Homa F, Saw JH, Lombard J, Nunoura T, Li WJ, Hua ZS, Chen LX, Banfield JF, John ES, Reysenbach AL, Stott MB, Schramm A, Kjeldsen KU, Teske AP, Baker BJ, Ettema TJG. Eme L, et al. Nature. 2023 Jun;618(7967):992-999. doi: 10.1038/s41586-023-06186-2. Epub 2023 Jun 14. Nature. 2023. PMID: 37316666 Free PMC article. - Expanding Asgard members in the domain of Archaea sheds new light on the origin of eukaryotes.
Xie R, Wang Y, Huang D, Hou J, Li L, Hu H, Zhao X, Wang F. Xie R, et al. Sci China Life Sci. 2022 Apr;65(4):818-829. doi: 10.1007/s11427-021-1969-6. Epub 2021 Aug 6. Sci China Life Sci. 2022. PMID: 34378142 - Asgard archaea illuminate the origin of eukaryotic cellular complexity.
Zaremba-Niedzwiedzka K, Caceres EF, Saw JH, Bäckström D, Juzokaite L, Vancaester E, Seitz KW, Anantharaman K, Starnawski P, Kjeldsen KU, Stott MB, Nunoura T, Banfield JF, Schramm A, Baker BJ, Spang A, Ettema TJ. Zaremba-Niedzwiedzka K, et al. Nature. 2017 Jan 19;541(7637):353-358. doi: 10.1038/nature21031. Epub 2017 Jan 11. Nature. 2017. PMID: 28077874 - Two or three domains: a new view of tree of life in the genomics era.
Zhou Z, Liu Y, Li M, Gu JD. Zhou Z, et al. Appl Microbiol Biotechnol. 2018 Apr;102(7):3049-3058. doi: 10.1007/s00253-018-8831-x. Epub 2018 Feb 27. Appl Microbiol Biotechnol. 2018. PMID: 29484479 Review. - An archaeal origin of eukaryotes supports only two primary domains of life.
Williams TA, Foster PG, Cox CJ, Embley TM. Williams TA, et al. Nature. 2013 Dec 12;504(7479):231-6. doi: 10.1038/nature12779. Nature. 2013. PMID: 24336283 Review.
Cited by
- Terraces in species tree inference from gene trees.
Habib M, Roy K, Hasan S, Rahman AH, Bayzid MS. Habib M, et al. BMC Ecol Evol. 2024 Nov 4;24(1):135. doi: 10.1186/s12862-024-02309-z. BMC Ecol Evol. 2024. PMID: 39497030 Free PMC article. - The emerging view on the origin and early evolution of eukaryotic cells.
Vosseberg J, van Hooff JJE, Köstlbacher S, Panagiotou K, Tamarit D, Ettema TJG. Vosseberg J, et al. Nature. 2024 Sep;633(8029):295-305. doi: 10.1038/s41586-024-07677-6. Epub 2024 Sep 11. Nature. 2024. PMID: 39261613 Review. - The Last Universal Common Ancestor of Ribosome-Encoding Organisms: Portrait of LUCA.
Forterre P. Forterre P. J Mol Evol. 2024 Oct;92(5):550-583. doi: 10.1007/s00239-024-10186-9. Epub 2024 Aug 19. J Mol Evol. 2024. PMID: 39158619 Review. - Phylogenetic reconciliation: making the most of genomes to understand microbial ecology and evolution.
Williams TA, Davin AA, Szánthó LL, Stamatakis A, Wahl NA, Woodcroft BJ, Soo RM, Eme L, Sheridan PO, Gubry-Rangin C, Spang A, Hugenholtz P, Szöllősi GJ. Williams TA, et al. ISME J. 2024 Jan 8;18(1):wrae129. doi: 10.1093/ismejo/wrae129. ISME J. 2024. PMID: 39001714 Free PMC article. Review. - The nature of the last universal common ancestor and its impact on the early Earth system.
Moody ERR, Álvarez-Carretero S, Mahendrarajah TA, Clark JW, Betts HC, Dombrowski N, Szánthó LL, Boyle RA, Daines S, Chen X, Lane N, Yang Z, Shields GA, Szöllősi GJ, Spang A, Pisani D, Williams TA, Lenton TM, Donoghue PCJ. Moody ERR, et al. Nat Ecol Evol. 2024 Sep;8(9):1654-1666. doi: 10.1038/s41559-024-02461-1. Epub 2024 Jul 12. Nat Ecol Evol. 2024. PMID: 38997462 Free PMC article.
References
- Embley TM, Martin W. Eukaryotic evolution, changes and challenges. Nature. 2006;440:623–630. - PubMed
- Roger AJ, Muñoz-Gómez SA, Kamikawa R. The Origin and Diversification of Mitochondria. Curr Biol. 2017;27:R1177–R1192. - PubMed
- Martijn J, Ettema TJG. From archaeon to eukaryote: the evolutionary dark ages of the eukaryotic cell. Biochem Soc Trans. 2013;41:451–457. - PubMed
- Williams Ta, Foster PG, Cox CJ, Embley TM. An archaeal origin of eukaryotes supports only two primary domains of life. Nature. 2013;504:231–236. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources