An empirical assessment of long-branch attraction artefacts in deep eukaryotic phylogenomics - PubMed (original) (raw)
Comparative Study
An empirical assessment of long-branch attraction artefacts in deep eukaryotic phylogenomics
Henner Brinkmann et al. Syst Biol. 2005 Oct.
Abstract
In the context of exponential growing molecular databases, it becomes increasingly easy to assemble large multigene data sets for phylogenomic studies. The expected increase of resolution due to the reduction of the sampling (stochastic) error is becoming a reality. However, the impact of systematic biases will also become more apparent or even dominant. We have chosen to study the case of the long-branch attraction artefact (LBA) using real instead of simulated sequences. Two fast-evolving eukaryotic lineages, whose evolutionary positions are well established, microsporidia and the nucleomorph of cryptophytes, were chosen as model species. A large data set was assembled (44 species, 133 genes, and 24,294 amino acid positions) and the resulting rooted eukaryotic phylogeny (using a distant archaeal outgroup) is positively misled by an LBA artefact despite the use of a maximum likelihood-based tree reconstruction method with a complex model of sequence evolution. When the fastest evolving proteins from the fast lineages are progressively removed (up to 90%), the bootstrap support for the apparently artefactual basal placement decreases to virtually 0%, and conversely only the expected placement, among all the possible locations of the fast-evolving species, receives increasing support that eventually converges to 100%. The percentage of removal of the fastest evolving proteins constitutes a reliable estimate of the sensitivity of phylogenetic inference to LBA. This protocol confirms that both a rich species sampling (especially the presence of a species that is closely related to the fast-evolving lineage) and a probabilistic method with a complex model are important to overcome the LBA artefact. Finally, we observed that phylogenetic inference methods perform strikingly better with simulated as opposed to real data, and suggest that testing the reliability of phylogenetic inference methods with simulated data leads to overconfidence in their performance. Although phylogenomic studies can be affected by systematic biases, the possibility of discarding a large amount of data containing most of the nonphylogenetic signal allows recovering a phylogeny that is less affected by systematic biases, while maintaining a high statistical support.
Similar articles
- Lack of resolution in the animal phylogeny: closely spaced cladogeneses or undetected systematic errors?
Baurain D, Brinkmann H, Philippe H. Baurain D, et al. Mol Biol Evol. 2007 Jan;24(1):6-9. doi: 10.1093/molbev/msl137. Epub 2006 Sep 29. Mol Biol Evol. 2007. PMID: 17012374 - Heterotachy and tree building: a case study with plastids and eubacteria.
Lockhart P, Novis P, Milligan BG, Riden J, Rambaut A, Larkum T. Lockhart P, et al. Mol Biol Evol. 2006 Jan;23(1):40-5. doi: 10.1093/molbev/msj005. Epub 2005 Sep 8. Mol Biol Evol. 2006. PMID: 16151191 - Implications of the new eukaryotic systematics for parasitologists.
Dacks JB, Walker G, Field MC. Dacks JB, et al. Parasitol Int. 2008 Jun;57(2):97-104. doi: 10.1016/j.parint.2007.11.004. Epub 2007 Dec 4. Parasitol Int. 2008. PMID: 18180199 Review. - Using models of nucleotide evolution to build phylogenetic trees.
Bos DH, Posada D. Bos DH, et al. Dev Comp Immunol. 2005;29(3):211-27. doi: 10.1016/j.dci.2004.07.007. Dev Comp Immunol. 2005. PMID: 15572070 Review.
Cited by
- GTRpmix: A Linked General Time-Reversible Model for Profile Mixture Models.
Banos H, Wong TKF, Daneau J, Susko E, Minh BQ, Lanfear R, Brown MW, Eme L, Roger AJ. Banos H, et al. Mol Biol Evol. 2024 Sep 4;41(9):msae174. doi: 10.1093/molbev/msae174. Mol Biol Evol. 2024. PMID: 39158305 Free PMC article. - Plastid isoprenoid metabolism in the oyster parasite Perkinsus marinus connects dinoflagellates and malaria pathogens--new impetus for studying alveolates.
Grauvogel C, Reece KS, Brinkmann H, Petersen J. Grauvogel C, et al. J Mol Evol. 2007 Dec;65(6):725-9. doi: 10.1007/s00239-007-9053-5. Epub 2007 Nov 27. J Mol Evol. 2007. PMID: 18040591 No abstract available. - Chromera velia, endosymbioses and the rhodoplex hypothesis--plastid evolution in cryptophytes, alveolates, stramenopiles, and haptophytes (CASH lineages).
Petersen J, Ludewig AK, Michael V, Bunk B, Jarek M, Baurain D, Brinkmann H. Petersen J, et al. Genome Biol Evol. 2014 Mar;6(3):666-84. doi: 10.1093/gbe/evu043. Genome Biol Evol. 2014. PMID: 24572015 Free PMC article. - A class frequency mixture model that adjusts for site-specific amino acid frequencies and improves inference of protein phylogeny.
Wang HC, Li K, Susko E, Roger AJ. Wang HC, et al. BMC Evol Biol. 2008 Dec 16;8:331. doi: 10.1186/1471-2148-8-331. BMC Evol Biol. 2008. PMID: 19087270 Free PMC article. - The genus Limnospira contains only two species both unable to produce microcystins: L. maxima and L. platensis comb. nov.
Pinchart PE, Marter P, Brinkmann H, Quilichini Y, Mysara M, Petersen J, Pasqualini V, Mastroleo F. Pinchart PE, et al. iScience. 2024 Aug 30;27(9):110845. doi: 10.1016/j.isci.2024.110845. eCollection 2024 Sep 20. iScience. 2024. PMID: 39290841 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources