Comparing the performances of SSR and SNP markers for population analysis in Theobroma cacao L., as alternative approach to validate a new ddRADseq protocol for cacao genotyping

PLoS One. 2024 May 31;19(5):e0304753. doi: 10.1371/journal.pone.0304753. eCollection 2024.

Abstract

Proper cacao (Theobroma cacao L.) plant genotyping is mandatory for the conservation and use of the species genetic resources. A set of 15 international standard SSR markers was assumed as universal cacao genotyping system. Recently, different SNPs and SNP genotyping techniques have been exploited in cacao. However, a consensus on which to use has not been reached yet, driving the search for new approaches. To validate a new ddRADseq protocol for cacao genotyping, we compared the performances for population analysis of a dataset with 7,880 SNPs obtained from ddRADseq and the genotypic data from the aforementioned SSR set, using 158 cacao plants from productive farms and gene bank. Four genetic groups were identified with STRUCTURE and ADMIXTURE softwares using SSR and SNP data, respectively. Similarities of cacao ancestries among these groups allowed the identification of analogous pairs of groups of individuals, referred to as: G1SSR/G1SNP, G2SSR/G2SNP, G3SSR/G3SNP, G4SSR/G4SNP, whether SSRs or SNPs were used. Both marker systems identified Amelonado and Criollo as the most abundant cacao ancestries among all samples. Genetic distance matrices from both data types were significantly similar to each other according to Mantel test (p < 0.0001). PCoA and UPGMA clustering mostly confirmed the identified genetic groups. AMOVA and FST pairwise comparison revealed a moderate to very large genetic differentiation among identified groups from SSR and SNP data. Genetic diversity parameters from SSR (Hobs = 0.616, Hexp = 0.524 and PIC = 0.544) were higher than that from SNP data (0.288, 0.264, 0.230). In both cases, genetic groups carrying the highest Amelonado proportion (G1SSR and G1SNP) had the lowest genetic diversity parameters among the identified groups. The high congruence among population analysis results using both systems validated the ddRADseq protocol employed for cacao SNP genotyping. These results could provide new ways for developing a universal SNP-based genotyping system very much needed for cacao genetic studies.

Publication types

  • Comparative Study

MeSH terms

  • Cacao* / genetics
  • Genetic Markers
  • Genotype
  • Genotyping Techniques* / methods
  • Microsatellite Repeats* / genetics
  • Polymorphism, Single Nucleotide*

Substances

  • Genetic Markers

Grants and funding

This research work was funded by the “Projet de Recherche et Développement” of the Cooperation program from the “Académie de Recherche et d’Enseignement Supérieur” (ARES), Belgium (ARES CCD Programme PRD-PFS 2017 - Cuba – Cacao). However, the authors did not receive supports in the form of salary from the funder. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.