We designed a set of 580 simple sequence repeat markers; 506 from transcription factor-coding genes, and 74 from long non-coding RNAs and designated them as regulatory gene-derived simple sequence repeat (ReG-SSR) markers. From this set, we could anchor 559 ReG-SSR markers on 15 flax chromosomes with an average marker distance of 0.56 Mb. Thirty-one polymorphic ReG-SSR primers, amplifying SSR loci length of at least 20 bp were chosen from 134 screened primers. This primer set was used to characterize a diversity panel of 93 flax accessions. The panel included 33 accessions from India, including released varieties, dual-purpose lines and landraces, and 60 fiber flax accessions from the global core collection. Thirty-one ReG-SSR markers generated 76 alleles, with an average of 2.5 alleles per primer and a mean allele frequency of 0.77. These markers recorded 0.32 average gene diversity, 0.26 polymorphism information content and 1.35% null alleles. All the 31 ReG-SSR loci were found selectively neutral and showed no evidence of population reduction. A model-based clustering analysis separated the flax accessions into two sub-populations-Indian and global, with some accessions showing admixtures. The distinct clustering pattern of the Indian accessions compared to the global accessions, conforms to the principal coordinate analysis, genetic dissimilarity-based unweighted neighbor-joining tree and analysis of molecular variance. Fourteen flax accessions with 99.3% allelic richness were found optimum to adopt in breeding programs. In summary, the genome-wide ReG-SSR markers will serve as a functional marker resource for genetic and phenotypic relationship studies, marker-assisted selections, and provide a basis for selection of accessions from the Indian and global gene pool in fiber flax breeding programs.
Keywords: Bast fiber; Flax; Functional markers; Genetic diversity; Genic SSRs; Linum usitatissimum; Long non-coding RNA; Transcription factors.