Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle

Zhida Zhao; Qunhao Niu; Jiayuan Wu; Tianyi Wu; Xueyuan Xie; Zezhao Wang; Lupei Zhang; Huijiang Gao; Xue Gao; Lingyang Xu; Bo Zhu; Junya Li

doi:10.1186/s13062-024-00574-y

Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle

Biol Direct. 2024 Dec 31;19(1):147. doi: 10.1186/s13062-024-00574-y.

Authors

Zhida Zhao^#¹, Qunhao Niu^#¹, Jiayuan Wu¹, Tianyi Wu¹, Xueyuan Xie¹, Zezhao Wang¹, Lupei Zhang¹, Huijiang Gao¹, Xue Gao¹, Lingyang Xu², Bo Zhu^{3

4}, Junya Li⁵

Affiliations

¹ Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
² Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China. [email protected].
³ Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China. [email protected].
⁴ Northern Agriculture and Livestock Husbandry Technology Innovation Center, Hohhot, 010010, China. [email protected].
⁵ Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China. [email protected].

^# Contributed equally.

Abstract

Background: Integrating multi-layered information can enhance the accuracy of genomic prediction for complex traits. However, the improvement and application of effective strategies for genomic prediction (GP) using multi-omics data remains challenging.

Methods: We generated 11 feature sets for sequencing variants from genomics, transcriptomics, metabolomics, and epigenetics data in beef cattle, then we assessed the contribution of functional variants using genomic restricted maximum likelihood (GREML). We next estimated and ranked variant scores for 43 economically important traits, and compared the prediction accuracy of the top and bottom sets using genomic best linear unbiased prediction (GBLUP) and BayesB model. In addition, we annotated the variants from GWAS with functional feature sets and performed enrichment analysis.

Results: We observed significant enrichments for 32 functional categories in 11 feature sets. The evolutionary related sets (conservation regions and selection signatures) contributed significantly to heritability (31.78-fold and 14.48-fold enrichment), while metabolomics and transcriptomics showed low heritability enrichments. We observed a significant increase in prediction accuracy using the top feature set variants compared to whole-genome sequencing (WGS) data. The prediction accuracy based on the top 10% variant set showed an average increase of 11.6% and 7.54% using BayesB and GBLUP across traits, respectively. Notably, the greatest increase of 31.52% was obtained for spleen weight (SW) using BayesB. Also, we found that the top 10% of variants show strong enrichment with weight related QTLs based on the Cattle QTL database.

Conclusions: Our findings suggest that integrating biological prior information from multiple layers can enhance our understanding of the genetic architecture underlying complex traits and further improve genomic prediction in beef cattle.

MeSH terms

Animals
Bayes Theorem
Cattle / genetics
Genome-Wide Association Study / methods
Genomics* / methods
Metabolomics / methods
Quantitative Trait Loci
Whole Genome Sequencing / methods

Abstract

MeSH terms

Grants and funding