Accuracy of estimated phylogenetic trees from molecular data. II. Gene frequency data - PubMed (original) (raw)
- PMID: 6571220
- DOI: 10.1007/BF02300753
Accuracy of estimated phylogenetic trees from molecular data. II. Gene frequency data
M Nei et al. J Mol Evol. 1983.
Abstract
The accuracies and efficiencies of three different methods of making phylogenetic trees from gene frequency data were examined by using computer simulation. The methods examined are UPGMA, Farris' (1972) method, and Tateno et al.'s (1982) modified Farris method. In the computer simulation eight species (or populations) were assumed to evolve according to a given model tree, and the evolutionary changes of allele frequencies were followed by using the infinite-allele model. At the end of the simulated evolution five genetic distance measures (Nei's standard and minimum distances, Rogers' distance, Cavalli-Sforza's f theta, and the modified Cavalli-Sforza distance) were computed for all pairs of species, and the distance matrix obtained for each distance measure was used for reconstructing a phylogenetic tree. The phylogenetic tree obtained was then compared with the model tree. The results obtained indicate that in all tree-making methods examined the accuracies of both the topology and branch lengths of a reconstructed tree (rooted tree) are very low when the number of loci used is less than 20 but gradually increase with increasing number of loci. When the expected number of gene substitutions (M) for the shortest branch is 0.1 or more per locus and 30 or more loci are used, the topological error as measured by the distortion index (dT) is not great, but the probability of obtaining the correct topology (P) is less than 0.5 even with 60 loci. When M is as small as 0.004, P is substantially lower. In obtaining a good topology (small dT and high P) UPGMA and the modified Farris method generally show a better performance than the Farris method. The poor performance of the Farris method is observed even when Rogers' distance which obeys the triangle inequality is used. The main reason for this seems to be that the Farris method often gives overestimates of branch lengths. For estimating the expected branch lengths of the true tree UPGMA shows the best performance. For this purpose Nei's standard distance gives a better result than the others because of its linear relationship with the number of gene substitutions. Rogers' or Cavalli-Sforza's distance gives a phylogenetic tree in which the parts near the root are condensed and the other parts are elongated. It is recommended that more than 30 loci, including both polymorphic and monomorphic loci, be used for making phylogenetic trees. The conclusions from this study seem to apply also to data on nucleotide differences obtained by the restriction enzyme techniques.
Similar articles
- Accuracy of estimated phylogenetic trees from molecular data. I. Distantly related species.
Tateno Y, Nei M, Tajima F. Tateno Y, et al. J Mol Evol. 1982;18(6):387-404. doi: 10.1007/BF01840887. J Mol Evol. 1982. PMID: 7175956 - Accuracy of phylogenetic trees estimated from DNA sequence data.
Sourdis J, Krimbas C. Sourdis J, et al. Mol Biol Evol. 1987 Mar;4(2):159-66. doi: 10.1093/oxfordjournals.molbev.a040432. Mol Biol Evol. 1987. PMID: 3447006 - Relative efficiencies of the maximum parsimony and distance-matrix methods in obtaining the correct phylogenetic tree.
Sourdis J, Nei M. Sourdis J, et al. Mol Biol Evol. 1988 May;5(3):298-311. doi: 10.1093/oxfordjournals.molbev.a040497. Mol Biol Evol. 1988. PMID: 3386530 - Comparison of phylogenetic trees defined on different but mutually overlapping sets of taxa: A review.
Li W, Koshkarov A, Tahiri N. Li W, et al. Ecol Evol. 2024 Aug 8;14(8):e70054. doi: 10.1002/ece3.70054. eCollection 2024 Aug. Ecol Evol. 2024. PMID: 39119174 Free PMC article. Review. - Evolutionary and statistical properties of three genetic distances.
Kalinowski ST. Kalinowski ST. Mol Ecol. 2002 Aug;11(8):1263-73. doi: 10.1046/j.1365-294x.2002.01520.x. Mol Ecol. 2002. PMID: 12144649 Review.
Cited by
- Assessing and broadening genetic diversity of a rapeseed germplasm collection.
Wu J, Li F, Xu K, Gao G, Chen B, Yan G, Wang N, Qiao J, Li J, Li H, Zhang T, Song W, Wu X. Wu J, et al. Breed Sci. 2014 Dec;64(4):321-30. doi: 10.1270/jsbbs.64.321. Epub 2014 Dec 1. Breed Sci. 2014. PMID: 25914586 Free PMC article. - Temperature-sensitive phenotype caused by natural mutation in Capsicum latescent in two tropical regions.
Koeda S, Hosokawa M, Saito H, Doi M. Koeda S, et al. J Plant Res. 2013 Sep;126(5):675-84. doi: 10.1007/s10265-013-0564-4. Epub 2013 Apr 30. J Plant Res. 2013. PMID: 23624987 - Asexual Evolution and Forest Conditions Drive Genetic Parallelism in Phytophthora ramorum.
Yuzon JD, Travadon R, Malar C M, Tripathy S, Rank N, Mehl HK, Rizzo DM, Cobb R, Small C, Tang T, McCown HE, Garbelotto M, Kasuga T. Yuzon JD, et al. Microorganisms. 2020 Jun 22;8(6):940. doi: 10.3390/microorganisms8060940. Microorganisms. 2020. PMID: 32580470 Free PMC article. - Genetic Vulnerability and the Relationship of Commercial Germplasms of Maize in Brazil with the Nested Association Mapping Parents.
Andrade LR, Fritsche Neto R, Granato ÍS, Sant'Ana GC, Morais PP, Borém A. Andrade LR, et al. PLoS One. 2016 Oct 25;11(10):e0163739. doi: 10.1371/journal.pone.0163739. eCollection 2016. PLoS One. 2016. PMID: 27780247 Free PMC article. - SNP genotyping in melons: genetic variation, population structure, and linkage disequilibrium.
Esteras C, Formisano G, Roig C, Díaz A, Blanca J, Garcia-Mas J, Gómez-Guillamón ML, López-Sesé AI, Lázaro A, Monforte AJ, Picó B. Esteras C, et al. Theor Appl Genet. 2013 May;126(5):1285-303. doi: 10.1007/s00122-013-2053-5. Epub 2013 Feb 5. Theor Appl Genet. 2013. PMID: 23381808
References
- Science. 1967 Jan 20;155(3760):279-84 - PubMed
- Genetics. 1964 Apr;49:725-38 - PubMed
- J Mol Evol. 1982;18(6):387-404 - PubMed
- Jinrui Idengaku Zasshi. 1978 Dec;23(4):341-69 - PubMed
- Genetics. 1978 Jul;89(3):583-90 - PubMed