Analysis of Kinship and Population Genetic Structure of 53 Apricot Resources Based on Whole Genome Resequencing

Curr Issues Mol Biol. 2024 Dec 13;46(12):14106-14118. doi: 10.3390/cimb46120844.

Abstract

Using single nucleotide polymorphism (SNP) markers developed by whole genome resequencing (WGRS), the kinship and population genetic structure of 53 common apricot (Prunus armeniaca) varieties were analyzed to provide a theoretical basis for revealing the phylogenetic relationships and classification of the common apricot. WGRS was performed on the 53 varieties, and high-quality SNP sites were obtained after alignment against the "Yinxiangbai" apricot reference genome. Phylogenetic analysis, G matrix analysis, principal component analysis (PCA), and population structure analysis were performed using Genome-wide Complex Trait Analysis (GCTA), FastTree, Admixture, and other software. The average mapping rate of the sequencing reads to the reference genome was 97.66%. After strict filtering, 88,332,238 high-quality SNP sites were obtained. Statistics on SNP variation types showed that LNLJX had the largest number of variants (3,951,322) and the lowest transition/transversion ratio (ts/tv = 1.77), suggesting that gene exchange events occurred less frequently in this variety. Kinship and genetic distance between samples were estimated from the SNPs. Pairwise kinship values ranged from 0.01 to 1.41: PLDJX and BK1 were the most closely related (1.41), and YZH and LGWSX were the most distantly related (0.01). Pairwise genetic distances ranged from 0.00367 to 0.264344, with the smallest distance between HMX and JM and the largest between WYX and YX. The phylogenetic tree, PCA, and genetic structure analyses all divided the 53 common apricot varieties into four groups, and the classification results were consistent across methods.
The SNP markers mined using WGRS are useful not only for analyzing variation in common apricots but also for effectively identifying their kinship and genetic structure, which is critical for the classification and utilization of common apricot germplasm resources.
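
The two summary statistics reported above, the ts/tv ratio and the G matrix kinship values, can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names `ts_tv_ratio` and `grm` are hypothetical, the genotype coding (0, 1, or 2 copies of the alternate allele) is an assumption, and the kinship formula is the standard off-diagonal genomic relationship estimator, A_jk = mean over SNPs of (x_ij - 2p_i)(x_ik - 2p_i) / (2p_i(1 - p_i)), applied here to every entry for simplicity. GCTA itself also handles missing genotypes and other details that are omitted here.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def ts_tv_ratio(snps):
    """Return the transition/transversion ratio for (ref, alt) base pairs.

    A transition swaps purine<->purine or pyrimidine<->pyrimidine;
    every other single-base substitution is a transversion.
    """
    ts = tv = 0
    for ref, alt in snps:
        ref, alt = ref.upper(), alt.upper()
        if ref == alt:
            continue  # not a substitution
        pair = {ref, alt}
        if pair <= PURINES or pair <= PYRIMIDINES:
            ts += 1
        else:
            tv += 1
    return ts / tv if tv else float("inf")


def grm(genotypes):
    """Return an n x n genomic relationship matrix (illustrative only).

    genotypes: list of samples, each a list of alternate-allele
    counts (0, 1, or 2) per SNP. Assumes no missing data.
    """
    n = len(genotypes)
    m = len(genotypes[0])
    # Alternate-allele frequency of each SNP across all samples.
    freqs = [sum(sample[i] for sample in genotypes) / (2 * n) for i in range(m)]
    # Monomorphic SNPs carry no information and would divide by zero.
    usable = [i for i, p in enumerate(freqs) if 0 < p < 1]
    if not usable:
        raise ValueError("no polymorphic SNPs")
    matrix = [[0.0] * n for _ in range(n)]
    for j in range(n):
        for k in range(j, n):
            total = sum(
                (genotypes[j][i] - 2 * freqs[i])
                * (genotypes[k][i] - 2 * freqs[i])
                / (2 * freqs[i] * (1 - freqs[i]))
                for i in usable
            )
            matrix[j][k] = matrix[k][j] = total / len(usable)
    return matrix


if __name__ == "__main__":
    # Two transitions (A>G, C>T) and one transversion (A>C).
    print(ts_tv_ratio([("A", "G"), ("C", "T"), ("A", "C")]))
    # Three toy samples genotyped at two SNPs.
    for row in grm([[0, 2], [2, 0], [1, 1]]):
        print(row)
```

Values from this estimator are not bounded by 1, so a pairwise value above 1, such as the 1.41 reported for PLDJX and BK1, is possible under this kind of scaling.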

Keywords: SNP; common apricot; population genetic structure; whole genome resequencing.