Genomic prediction of unordered categorical traits: an application to subpopulation assignment in German Warmblood horses

Genet Sel Evol. 2016 Feb 11:48:13. doi: 10.1186/s12711-016-0192-2.

Abstract

Background: Categorical traits without ordinal representation of classes do not qualify for threshold models. Alternatively, the multinomial problem can be assessed by a sequence of independent binary contrasts using schemes such as one-vs-all or one-vs-one. Class probabilities can be arrived at by normalization or pair-wise coupling strategies. We assessed the predictive ability of whole-genome regression models and support vector machines for the classification of horses into four German Warmblood breeds.

Results: Prediction accuracies of leave-one-out cross-validation were high and ranged from 0.75 to 0.97 depending on the binary classifier and breeds incorporated in the training. An analysis of the population structure using eigenvectors of the genomic relationship matrix revealed clustering of individuals beyond the given breed labels. Admixture between two breeds became apparent which had substantial impact on the prediction accuracies between those two breeds and also influenced the contrasts between other breeds.

Conclusions: Genomic prediction of unordered categorical traits was successfully applied to subpopulation assignment of German Warmblood horses. The applied methodology is a straightforward extension of existing binary threshold models for genomic prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Bayes Theorem
  • Breeding*
  • Genetics, Population
  • Genome
  • Genomics / methods*
  • Genotype
  • Germany
  • Horses / genetics*
  • Models, Genetic*
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Quantitative Trait, Heritable*
  • Regression Analysis
  • Support Vector Machine