On identifying the optimal number of population clusters via the deviance information criterion

PLoS One. 2011;6(6):e21014. doi: 10.1371/journal.pone.0021014. Epub 2011 Jun 28.

Abstract

Inferring population structure using bayesian clustering programs often requires a priori specification of the number of subpopulations, K, from which the sample has been drawn. Here, we explore the utility of a common bayesian model selection criterion, the Deviance Information Criterion (DIC), for estimating K. We evaluate the accuracy of DIC, as well as other popular approaches, on datasets generated by coalescent simulations under various demographic scenarios. We find that DIC outperforms competing methods in many genetic contexts, validating its application in assessing population structure.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Bayes Theorem*
  • Cluster Analysis
  • Genetics, Population / methods*
  • Humans
  • Models, Statistical