Assessing the performance of single-copy genes for recovering robust phylogenies - PubMed (original) (raw)
Assessing the performance of single-copy genes for recovering robust phylogenies
G Aguileta et al. Syst Biol. 2008 Aug.
Abstract
Phylogenies involving nonmodel species are based on a few genes, mostly chosen following historical or practical criteria. Because gene trees are sometimes incongruent with species trees, the resulting phylogenies may not accurately reflect the evolutionary relationships among species. The increase in availability of genome sequences now provides large numbers of genes that could be used for building phylogenies. However, for practical reasons only a few genes can be sequenced for a wide range of species. Here we asked whether we can identify a few genes, among the single-copy genes common to most fungal genomes, that are sufficient for recovering accurate and well-supported phylogenies. Fungi represent a model group for phylogenomics because many complete fungal genomes are available. An automated procedure was developed to extract single-copy orthologous genes from complete fungal genomes using a Markov Clustering Algorithm (Tribe-MCL). Using 21 complete, publicly available fungal genomes with reliable protein predictions, 246 single-copy orthologous gene clusters were identified. We inferred the maximum likelihood trees using the individual orthologous sequences and constructed a reference tree from concatenated protein alignments. The topologies of the individual gene trees were compared to that of the reference tree using three different methods. The performance of individual genes in recovering the reference tree was highly variable. Gene size and the number of variable sites were highly correlated and significantly affected the performance of the genes, but the average substitution rate did not. Two genes recovered exactly the same topology as the reference tree, and when concatenated provided high bootstrap values. The genes typically used for fungal phylogenies did not perform well, which suggests that current fungal phylogenies based on these genes may not accurately reflect the evolutionary relationships among species. Analyses on subsets of species showed that the phylogenetic performance did not seem to depend strongly on the sample. We expect that the best-performing genes identified here will be very useful for phylogenetic studies of fungi, at least at a large taxonomic scale. Furthermore, we compare the method developed here for finding genes for building robust phylogenies with previous ones and we advocate that our method could be applied to other groups of organisms when more complete genomes are available.
Similar articles
- FUNYBASE: a FUNgal phYlogenomic dataBASE.
Marthey S, Aguileta G, Rodolphe F, Gendrault A, Giraud T, Fournier E, Lopez-Villavicencio M, Gautier A, Lebrun MH, Chiapello H. Marthey S, et al. BMC Bioinformatics. 2008 Oct 27;9:456. doi: 10.1186/1471-2105-9-456. BMC Bioinformatics. 2008. PMID: 18954438 Free PMC article. - Inferring species phylogenies from multiple genes: concatenated sequence tree versus consensus gene tree.
Gadagkar SR, Rosenberg MS, Kumar S. Gadagkar SR, et al. J Exp Zool B Mol Dev Evol. 2005 Jan 15;304(1):64-74. doi: 10.1002/jez.b.21026. J Exp Zool B Mol Dev Evol. 2005. PMID: 15593277 - Phylogenomics reveal a robust fungal tree of life.
Kuramae EE, Robert V, Snel B, Weiss M, Boekhout T. Kuramae EE, et al. FEMS Yeast Res. 2006 Dec;6(8):1213-20. doi: 10.1111/j.1567-1364.2006.00119.x. FEMS Yeast Res. 2006. PMID: 17156018 - The search for the fungal tree of life.
McLaughlin DJ, Hibbett DS, Lutzoni F, Spatafora JW, Vilgalys R. McLaughlin DJ, et al. Trends Microbiol. 2009 Nov;17(11):488-97. doi: 10.1016/j.tim.2009.08.001. Epub 2009 Sep 24. Trends Microbiol. 2009. PMID: 19782570 Review. - New perspectives in hemiascomycetous yeast taxonomy.
Casaregola S, Weiss S, Morel G. Casaregola S, et al. C R Biol. 2011 Aug-Sep;334(8-9):590-8. doi: 10.1016/j.crvi.2011.05.006. Epub 2011 Jun 30. C R Biol. 2011. PMID: 21819939 Review.
Cited by
- Whole-genome sequencing and evolutionary analysis of the wild edible mushroom, Morchella eohespera.
Li Y, Yang T, Qiao J, Liang J, Li Z, Sa W, Shang Q. Li Y, et al. Front Microbiol. 2024 Feb 1;14:1309703. doi: 10.3389/fmicb.2023.1309703. eCollection 2023. Front Microbiol. 2024. PMID: 38361578 Free PMC article. - Impact of Dietary Regime and Seasonality on Hindgut's Mycobiota Diversity in Dairy Cows.
Sadek A, Taminiau B, Daube G, Sapountzis P, Chaucheyras-Durand F, Castex M, Coucheney F, Drider D. Sadek A, et al. Microorganisms. 2023 Dec 31;12(1):84. doi: 10.3390/microorganisms12010084. Microorganisms. 2023. PMID: 38257911 Free PMC article. - Uncovering the Biotechnological Importance of Geotrichum candidum.
Kamilari E, Stanton C, Reen FJ, Ross RP. Kamilari E, et al. Foods. 2023 Mar 7;12(6):1124. doi: 10.3390/foods12061124. Foods. 2023. PMID: 36981051 Free PMC article. Review. - The DNA replication protein Orc1 from the yeast Torulaspora delbrueckii is required for heterochromatin formation but not as a silencer-binding protein.
Maria H, Rusche LN. Maria H, et al. Genetics. 2022 Aug 30;222(1):iyac110. doi: 10.1093/genetics/iyac110. Genetics. 2022. PMID: 35894940 Free PMC article. - Alignment-Free Analysis of Whole-Genome Sequences From Symbiodiniaceae Reveals Different Phylogenetic Signals in Distinct Regions.
Lo R, Dougan KE, Chen Y, Shah S, Bhattacharya D, Chan CX. Lo R, et al. Front Plant Sci. 2022 Apr 26;13:815714. doi: 10.3389/fpls.2022.815714. eCollection 2022. Front Plant Sci. 2022. PMID: 35557718 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources