Correcting for base-population differences and unknown parent groups in single-step genomic predictions of Norwegian Red cattle

J Anim Sci. 2022 Sep 1;100(9):skac227. doi: 10.1093/jas/skac227.

Abstract

Bias and inflation in genomic evaluation with the single-step methods have been reported in several studies. Incompatibility between the base-populations of the pedigree-based and the genomic relationship matrix (G) could be a reason for these biases. Inappropriate ways of accounting for missing parents could be another reason for biases in genetic evaluations with or without genomic information. To handle these problems, we fitted and evaluated a fixed covariate (J) that contains ones for genotyped animals and zeros for unrelated non-genotyped animals, or pedigree-based regression coefficients for related non-genotyped animals. We also evaluated alternative ways of fitting the J covariate together with genetic groups on biases and stability of breeding value estimates, and of including it into G as a random effect. In a whole vs. partial data set comparison, four scenarios were investigated for the partial data: genotypes missing, phenotypes missing, both genotypes and phenotypes missing, and pedigree missing. Fitting J either as fixed or random reduced level-bias and inflation and increased stability of genomic predictions as compared to the basic model where neither J nor genetic groups were fitted. In most models, genomic predictions were largely biased for scenarios with missing genotype and phenotype information. The biases were reduced for models which combined group and J effects. Models with these corrected group covariates performed better than the recently published model where genetic groups were encapsulated and fitted as random via the Quaas and Pollak transformation. In our Norwegian Red cattle data, a model which combined group and J regression coefficients was preferred because it showed least bias and highest stability of genomic predictions across the scenarios.

Keywords: J factor; Norwegian Red cattle; genetic groups; inflation; level-bias; single-step genomic BLUP.

Plain language summary

Our study dealt with strategies on how to reduce biases (inflation and level-bias) and improve a parameter related to accuracy (stability) of genomic predictions of breeding values that combine genotyped and non-genotyped animals, which are denoted as single-step genomic predictions. We tried to remedy incompatibilities between the pedigree- and the genomics-based relationships matrices by fitting a covariate (J) that corrects for base-population differences that may occur between both relationship matrices. We also evaluated alternative ways to combine the J covariate and genetic group effects to account for missing parental information, which often occurs in practical breeding schemes. We found that fitting J either as fixed or random reduced level-bias and inflation and increased stability of genomic predictions as compared to the basic model where neither J nor genetic groups were fitted. Level-biases and inflation of breeding value estimates were reduced, and stability of genomic predictions improved for models which combined group and J effects. A model which fits group regression coefficients minus the part that could be explained from pedigree was recommended because it showed least bias and highest stability across the scenarios and has theoretical justification.

MeSH terms

  • Animals
  • Cattle / genetics
  • Genome*
  • Genomics / methods
  • Models, Genetic*
  • Norway
  • Pedigree
  • Phenotype