Review
Missing data and the design of phylogenetic analyses
John J Wiens. J Biomed Inform. 2006 Feb.
Free article
Abstract
Concerns about the deleterious effects of missing data may often determine which characters and taxa are included in phylogenetic analyses. For example, researchers may exclude taxa lacking data for some genes or exclude a gene lacking data in some taxa. Yet, there may be very little evidence to support these decisions. In this paper, I review the effects of missing data on phylogenetic analyses. Recent simulations suggest that highly incomplete taxa can be accurately placed in phylogenies, as long as many characters have been sampled overall. Furthermore, adding incomplete taxa can dramatically improve results in some cases by subdividing misleading long branches. Adding characters with missing data can also improve accuracy, although there is a risk of long-branch attraction in some cases. Consideration of how missing data does (or does not) affect phylogenetic analyses may allow researchers to design studies that can reconstruct large phylogenies quickly, economically, and accurately.