Assessing Predictive Properties of Genome-Wide Selection in Soybeans

G3 (Bethesda). 2016 Aug 9;6(8):2611-6. doi: 10.1534/g3.116.032268.

Abstract

Many economically important traits in plant breeding have low heritability or are difficult to measure. For these traits, genomic selection has attractive features and may boost genetic gains. Our goal was to evaluate alternative scenarios to implement genomic selection for yield components in soybean (Glycine max (L.) Merr.). We used a nested association panel with cross-validation to evaluate the impacts of training population size, genotyping density, and prediction model on the accuracy of genomic prediction. Our results indicate that training population size was the factor most relevant to improvement in genome-wide prediction, with the greatest improvement observed in training sets of up to 2000 individuals. We discuss assumptions that influence the choice of the prediction model. Although alternative models had minor impacts on prediction accuracy, the most robust prediction model was the combination of reproducing kernel Hilbert space regression and BayesB. Higher genotyping density marginally improved accuracy. Our study finds that breeding programs seeking efficient genomic selection in soybeans would best allocate resources by investing in a representative training set.
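The kind of evaluation described above, measuring prediction accuracy as training population size grows, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the genotypes and phenotypes are simulated rather than taken from the SoyNAM panel, the model is plain ridge regression on marker codes rather than the RKHS or BayesB models named in the abstract, validation uses repeated random train/test splits rather than the study's own cross-validation scheme, and the function names (`simulate`, `ridge_predict`, `accuracy_by_training_size`) and parameter values are hypothetical.

```python
import numpy as np


def simulate(n, m, n_qtl, h2, rng):
    """Simulate 0/1/2 marker codes and a phenotype with heritability h2 (toy data)."""
    X = rng.integers(0, 3, size=(n, m)).astype(float)
    beta = np.zeros(m)
    qtl = rng.choice(m, size=n_qtl, replace=False)
    beta[qtl] = rng.normal(size=n_qtl)
    g = X @ beta                                  # true genetic values
    e_var = g.var() * (1.0 - h2) / h2             # residual variance implied by h2
    y = g + rng.normal(scale=np.sqrt(e_var), size=n)
    return X, y


def ridge_predict(X_train, y_train, X_test, lam):
    """Ridge regression on centered markers, solved in its dual (n x n) form."""
    mu_x = X_train.mean(axis=0)
    mu_y = y_train.mean()
    Z = X_train - mu_x
    K = Z @ Z.T
    alpha = np.linalg.solve(K + lam * np.eye(len(y_train)), y_train - mu_y)
    return mu_y + (X_test - mu_x) @ Z.T @ alpha


def accuracy_by_training_size(X, y, sizes, n_test, reps, lam, rng):
    """Mean correlation between predicted and observed test phenotypes per size."""
    n = len(y)
    out = {}
    for size in sizes:
        cors = []
        for _ in range(reps):
            perm = rng.permutation(n)
            test = perm[:n_test]                  # held-out individuals
            train = perm[n_test:n_test + size]    # disjoint training subset
            pred = ridge_predict(X[train], y[train], X[test], lam)
            cors.append(np.corrcoef(pred, y[test])[0, 1])
        out[size] = float(np.mean(cors))
    return out


rng = np.random.default_rng(0)
X, y = simulate(n=600, m=300, n_qtl=30, h2=0.5, rng=rng)
results = accuracy_by_training_size(
    X, y, sizes=[50, 200, 400], n_test=150, reps=5, lam=300.0, rng=rng
)
for size, acc in results.items():
    print(f"training size {size}: mean accuracy {acc:.2f}")
```

In this toy setting, accuracy is the correlation between predicted and observed phenotypes in the held-out set. The same loop could be repeated over marker subsets or alternative models to mimic the other two factors the study compares, though the numbers it produces say nothing about the soybean results themselves.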

Keywords: SoyNAM; Bayesian methods; genomic selection; machine learning.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genetics, Population
  • Genome, Plant
  • Genomics / methods*
  • Glycine max / genetics*
  • Models, Genetic
  • Plant Breeding / methods*
  • Polymorphism, Single Nucleotide
  • Population Density
  • Quantitative Trait Loci