Relative performance of Bayesian clustering software for inferring population substructure and individual assignment at low levels of population differentiation (original) (raw)

Abstract

Traditional methods for characterizing genetic differentiation among populations rely on a priori grouping of individuals. Bayesian clustering methods avoid this limitation by using linkage and Hardy–Weinberg disequilibrium to decompose a sample of individuals into genetically distinct groups. There are several software programs available for Bayesian clustering analyses, all of which describe a decrease in the ability to detect distinct clusters as levels of genetic differentiation among populations decrease. However, no study has yet compared the performance of such methods at low levels of population differentiation, which may be common in species where populations have experienced recent separation or high levels of gene flow. We used simulated data to evaluate the performance of three Bayesian clustering software programs, PARTITION, STRUCTURE, and BAPS, at levels of population differentiation below F ST=0.1. PARTITION was unable to correctly identify the number of subpopulations until levels of F ST reached around 0.09. Both STRUCTURE and BAPS performed very well at low levels of population differentiation, and were able to correctly identify the number of subpopulations at F ST around 0.03. The average proportion of an individual’s genome assigned to its true population of origin increased with increasing F ST for both programs, reaching over 92% at an F ST of 0.05. The average number of misassignments (assignments to the incorrect subpopulation) continued to decrease as F ST increased, and when F ST was 0.05, fewer than 3% of individuals were misassigned using either program. Both STRUCTURE and BAPS worked extremely well for inferring the number of clusters when clusters were not well-differentiated (F ST=0.02–0.03), but our results suggest that F ST must be at least 0.05 to reach an assignment accuracy of greater than 97%.

Access this article

Log in via an institution

Subscribe and save

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

References

Download references

Acknowledgements

We would like to thank Jukka Corander for providing technical advice and an advance copy of his manuscript for BAPS 3.1, and Khalid Belkhir for offering assistance regarding PARTITION. Funding was provided by Purdue University.

Author information

Authors and Affiliations

  1. Department of Forestry and Natural Resources, Purdue University, West Lafayette, Indiana, 47907, USA
    Emily K. Latch, Guha Dharmarajan & Olin E. Rhodes Jr.
  2. Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin, 53706, USA
    Jeffrey C. Glaubitz

Authors

  1. Emily K. Latch
    You can also search for this author inPubMed Google Scholar
  2. Guha Dharmarajan
    You can also search for this author inPubMed Google Scholar
  3. Jeffrey C. Glaubitz
    You can also search for this author inPubMed Google Scholar
  4. Olin E. Rhodes Jr.
    You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence toEmily K. Latch.

Rights and permissions

About this article

Cite this article

Latch, E.K., Dharmarajan, G., Glaubitz, J.C. et al. Relative performance of Bayesian clustering software for inferring population substructure and individual assignment at low levels of population differentiation.Conserv Genet 7, 295–302 (2006). https://doi.org/10.1007/s10592-005-9098-1

Download citation

Key words