Multi-trait modeling and machine learning discover new markers associated with stem traits in alfalfa

Front Plant Sci. 2024 Sep 9:15:1429976. doi: 10.3389/fpls.2024.1429976. eCollection 2024.

Abstract

Alfalfa biomass can be fractionated into leaf and stem components. Leaves comprise a protein-rich and highly digestible portion of biomass for ruminant animals, while stems constitute a high-fiber, less digestible fraction, representing 50 to 70% of the biomass. However, little attention has been paid to stem-related traits, which are key to improving the nutritional value and intake potential of alfalfa. This study aimed to identify molecular markers associated with four morphological traits in a panel of five alfalfa populations generated over two cycles of divergent selection based on 16-h and 96-h in vitro neutral detergent fiber digestibility in stems. The phenotypic traits (stem color, presence of stem pith cells, winter standability, and winter injury) were modeled using univariate and multivariate spatial mixed linear models (MLM), and the predicted values were used as response variables in genome-wide association studies (GWAS). The alfalfa panel was genotyped with a 3K DArTag SNP marker panel to evaluate genetic structure and perform GWAS. Principal component and population structure analyses revealed differentiation between populations selected for high and low digestibility. Thirteen molecular markers were significantly associated with stem traits using either univariate or multivariate MLM. Support vector machine (SVM) and random forest (RF) algorithms were also implemented to determine marker importance scores for stem traits and to validate the GWAS results. The top-ranked markers from SVM and RF aligned with GWAS findings for solid stem pith, winter standability, and winter injury. SVM also identified further markers with high variable importance for solid stem pith and winter injury. Most molecular markers were located in coding regions. These markers can facilitate marker-assisted selection to expedite breeding programs aimed at increasing winter hardiness or stem palatability.
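The idea of ranking markers by their association with a trait can be illustrated with a small sketch. This is not the study's pipeline: it replaces the spatial MLM, GWAS, SVM, and RF steps with a plain single-marker scan that scores each marker by the squared Pearson correlation (R²) between allele dosage and phenotype. The data are simulated, the marker names (`snp_0` ... `snp_49`), sample sizes, and effect size are hypothetical, and the 0 to 4 dosage coding assumes a tetraploid genotype call format.

```python
import random


def r_squared(x, y):
    """Squared Pearson correlation between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # monomorphic marker or constant phenotype
    return (sxy * sxy) / (sxx * syy)


def rank_markers(dosages, phenotype):
    """Return (marker, R^2) pairs sorted from strongest to weakest."""
    scores = {name: r_squared(d, phenotype) for name, d in dosages.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Simulated data only (hypothetical): 200 plants, 50 markers,
# allele dosage coded 0..4 under an assumed tetraploid model.
rng = random.Random(0)
n_plants, n_markers = 200, 50
dosages = {
    f"snp_{j}": [rng.randint(0, 4) for _ in range(n_plants)]
    for j in range(n_markers)
}

# One marker is given a strong additive effect on the simulated trait.
causal = "snp_7"
phenotype = [2.0 * d + rng.gauss(0.0, 0.5) for d in dosages[causal]]

ranked = rank_markers(dosages, phenotype)
print(ranked[:3])  # the simulated causal marker should lead the list
```

In a real analysis the phenotype would be the model-predicted value for each genotype, and the scan would need to account for population structure, which this sketch ignores.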

Keywords: GWAS; alfalfa; machine learning; multivariate modeling; stem traits.

Associated data

  • figshare/10.6084/m9.figshare.25686405.v1

Grants and funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This research was supported in part by the U.S. Department of Agriculture, Agricultural Research Service. Breeding Insight was funded by the U.S. Department of Agriculture under agreement numbers 8062-21000-043-004-A, 8062-21000-052-002-A, and 8062-21000-052-003-A.