Supervised Machine Learning Techniques for Breeding Value Prediction in Horses: An Example Using Gait Visual Scores

Fernando Bussiman; Anderson A C Alves; Jennifer Richter; Jorge Hidalgo; Renata Veroneze; Tiago Oliveira

doi:10.3390/ani14182723

Supervised Machine Learning Techniques for Breeding Value Prediction in Horses: An Example Using Gait Visual Scores

Animals (Basel). 2024 Sep 20;14(18):2723. doi: 10.3390/ani14182723.

Authors

Fernando Bussiman¹, Anderson A C Alves¹, Jennifer Richter¹, Jorge Hidalgo¹, Renata Veroneze^{1

2}, Tiago Oliveira³

Affiliations

¹ Animal and Dairy Science Department, University of Georgia, Athens, GA 30602, USA.
² Animal Science Department, Federal University of Viçosa, Viçosa 36570-900, Brazil.
³ Statistics Department, State University of Paraíba, Campina Grande 58429-500, Brazil.

Abstract

Gait scores are widely used in the genetic evaluation of horses. However, the nature of such measurement may limit genetic progress since there is subjectivity in phenotypic information. This study aimed to assess the application of machine learning techniques in the prediction of breeding values for five visual gait scores in Campolina horses: dissociation, comfort, style, regularity, and development. The dataset contained over 5000 phenotypic records with 107,951 horses (14 generations) in the pedigree. A fixed model was used to estimate least-square solutions for fixed effects and adjusted phenotypes. Variance components and breeding values (EBV) were obtained via a multiple-trait model (MTM). Adjusted phenotypes and fixed effects solutions were used to train machine learning models (using the EBV from MTM as target variable): artificial neural network (ANN), random forest regression (RFR) and support vector regression (SVR). To validate the models, the linear regression method was used. Accuracy was comparable across all models (but it was slightly higher for ANN). The highest bias was observed for ANN, followed by MTM. Dispersion varied according to the trait; it was higher for ANN and the lowest for MTM. Machine learning is a feasible alternative to EBV prediction; however, this method will be slightly biased and over-dispersed for young animals.

Keywords: gait prediction; machine learning; support vector regression; visual scores.

Grants and funding

F.B. was the recipient of an MBA scholarship from Instituto Pecege.